fix: prevent session message loss in LocalManagedAgent._persist_state by cursor[bot] · Pull Request #1727 · MervinPraison/PraisonAI

cursor · 2026-05-23T09:04:15Z

Bug and impact

LocalManagedAgent._persist_state() could silently delete chat messages from the JSON session store when metadata was persisted while another writer (or another DefaultSessionStore instance) had added messages via add_message().

This affects managed/local agent runs that call _persist_state() after each turn while the gateway, TUI, or another process also writes to the same session file.

Root cause

The DefaultSessionStore path used the same unsafe pattern fixed in #1724 for other mutators:

get_session() → stale in-memory copy
Update metadata
_save_session() → full session overwrite on disk

If a concurrent add_message() ran between steps 1 and 3, those messages were lost.

Fix

Use update_session_metadata() (locked read-modify-write from disk) instead of get_session() + _save_session().

Validation

Added test_local_managed_persist_state_preserves_messages reproducing the two-instance race from session store tests.
Test passed locally.

Summary by CodeRabbit

Bug Fixes
- Improved session state persistence to prevent accidental loss of user messages added concurrently to the session store.
Tests
- Added test coverage verifying that session chat history and metadata are correctly preserved during concurrent session operations.

Use update_session_metadata (locked read-modify-write) instead of get_session + _save_session, which could overwrite messages written concurrently via add_message on another store instance. Adds regression test mirroring session store race coverage from #1724. Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-05-23T09:04:24Z

@coderabbitai review

MervinPraison · 2026-05-23T09:04:25Z

/review

qodo-code-review · 2026-05-23T09:04:28Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

coderabbitai · 2026-05-23T09:04:29Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-05-23T09:05:14Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 724e0dc6-31be-45bb-b713-7a56c6cabdf1

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR fixes session state persistence in LocalManagedAgent to prevent concurrent message overwrites. The implementation now prefers update_session_metadata() when available, otherwise loads and merges into the existing session instead of creating new records. A test validates messages added concurrently to disk are preserved after state persistence.

Changes

Session Metadata Persistence Fix

Layer / File(s)	Summary
Session metadata persistence refactor and validation `src/praisonai/integrations/managed_local.py`, `src/praisonai/tests/unit/integrations/test_managed_agents.py`	`LocalManagedAgent._persist_state()` now calls `update_session_metadata()` when available; otherwise loads the existing session and merges metadata in-place before saving, removing the prior behavior that created new `SessionData` records. New test `test_local_managed_persist_state_preserves_messages()` verifies that concurrently added user messages are retained when agent state is persisted.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Possibly related PRs

MervinPraison/PraisonAI#1709: Fixes update_session_metadata() to prevent stale-cache overwrites and message loss, which is the underlying store method that the main PR now prefers to call.
MervinPraison/PraisonAI#1724: Modifies DefaultSessionStore atomic/locked mutation mechanisms to prevent overwriting concurrent message writes, complementing the main PR's changes to how metadata is persisted.

Poem

🐰 Concurrency's dance, now in perfect stride,
Messages merge, none left behind,
Metadata flows where session state glides,
A safer persist, peace of mind—aye! 🌟

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically summarizes the main fix: preventing message loss in LocalManagedAgent._persist_state, which is the central change across both modified files.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch cursor/critical-correctness-bugs-05fc

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-05-23T09:05:22Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/4d08ce0a-1b7b-4e50-b841-6a08416f2f53 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

Copilot · 2026-05-23T09:08:25Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:...

Completed a full review and incorporated all available reviewer signals.

Action taken (fd788e9): I fixed a backward-compatibility gap in _persist_state fallback logic (managed_local.py) so stores without update_session_metadata still handle get_session(...) is None by creating SessionData (restores prior behavior while keeping the race-safe path).

Findings:

Bloat/scope: changes remain minimal and focused to session metadata persistence.
Security: no secrets/unsafe eval/exec introduced; no new input-handling surface.
Performance: no heavy module-level imports added; hot path unchanged except metadata write method choice.
Tests: regression test for message-preservation race is in place and passes locally (test_local_managed_persist_state_preserves_messages).
Backward compatibility: addressed by fd788e9 fallback restoration.
Code quality: no DRY or naming regressions found.
Reviewer feedback: Qodo had no review (paused), Coderabbit had no actionable code issue besides a style suggestion; Gemini had no posted review comments.

CI note: multiple workflow runs reported failure with zero jobs created (workflow-level failure/skip behavior), so there were no failed job logs to act on in this PR scope.

MervinPraison · 2026-05-23T09:08:31Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-05-23T09:08:52Z

qodo-code-review · 2026-05-23T09:28:55Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

greptile-apps · 2026-05-23T09:33:06Z

Greptile Summary

This PR fixes a race condition in LocalManagedAgent._persist_state() where the old get_session + _save_session pattern could silently drop messages written concurrently by another DefaultSessionStore instance. The fix replaces it with update_session_metadata(), which performs a file-locked read-modify-write.

managed_local.py: _persist_state now calls store.update_session_metadata(session_id, **state) when available, falling back to the original (unsafe) path only for stores lacking that method.
test_managed_agents.py: Adds test_local_managed_persist_state_preserves_messages, a two-instance test that verifies messages added between a stale _load_session and _persist_state are not lost.

Confidence Score: 3/5

The core race-condition fix is correct, but the chosen helper skips None-valued fields, meaning a cleared compute instance ID is not written back to disk and will be incorrectly restored on the next session resume.

The switch to update_session_metadata correctly solves the message-loss race. However, that helper's if value is None: continue guard introduces a new defect: when _compute_instance_id is set to None after a shutdown and _persist_state is called again, the old instance ID stays in metadata. _restore_state then re-hydrates the freed ID, leaving the agent pointing at a resource that no longer exists. The test added does not cover this path.

src/praisonai/praisonai/integrations/managed_local.py — specifically the interaction between None-valued state fields and the update_session_metadata helper's skip logic.

Important Files Changed

Filename	Overview
src/praisonai/praisonai/integrations/managed_local.py	Replaces racy get+save with locked update_session_metadata, but the new helper silently skips None-valued fields, introducing a regression where clearing compute_instance_id is not persisted.
src/praisonai/tests/unit/integrations/test_managed_agents.py	Adds a well-structured two-instance race-condition test that correctly validates both message preservation and metadata correctness after _persist_state.

Sequence Diagram

sequenceDiagram
    participant A as LocalManagedAgent
    participant S as DefaultSessionStore (reader)
    participant D as Disk (session.json)
    participant W as Concurrent Writer

    Note over A,W: Old (racy) path
    A->>S: get_session(id) → stale copy
    W->>D: add_message("second") [atomic write]
    A->>D: _save_session(stale copy) [overwrites "second"]

    Note over A,W: New (fixed) path
    A->>S: "update_session_metadata(id, **state)"
    S->>D: FileLock acquired
    S->>D: _load_session_from_disk → fresh copy (has "second")
    S->>D: mutate metadata only
    S->>D: atomic write (messages preserved)
    S->>D: FileLock released

_{Reviews (1): Last reviewed commit: "fix: preserve legacy _persist_state fall..." | Re-trigger Greptile}

greptile-apps · 2026-05-23T09:33:10Z

+            if hasattr(store, "update_session_metadata"):
+                store.update_session_metadata(self._session_id, **state)


Stale compute_instance_id after instance shutdown

DefaultSessionStore.update_session_metadata silently skips any field whose value is None (see if value is None: continue in store.py:541). This means if a compute instance is freed mid-session — setting self._compute_instance_id = None at line 1043 — the subsequent _persist_state() call won't overwrite the old instance ID that is already in the metadata. On the next _restore_state(), line 547 (if meta.get("compute_instance_id"): self._compute_instance_id = ...) will restore the stale ID, leaving the agent referencing a freed compute resource.

The same applies to compute_provider when self._compute is None. Previously the raw session.metadata.update(state) explicitly stored None, clearing the stale value. The new path no longer does so.

Copilot started work on behalf of MervinPraison May 23, 2026 09:05 View session

fix: preserve legacy _persist_state fallback when session is absent

fd788e9

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/4d08ce0a-1b7b-4e50-b841-6a08416f2f53 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

Copilot finished work on behalf of MervinPraison May 23, 2026 09:08

Copilot AI requested a review from MervinPraison May 23, 2026 09:08

MervinPraison marked this pull request as ready for review May 23, 2026 09:28

Copilot AI review requested due to automatic review settings May 23, 2026 09:28

MervinPraison merged commit 5ff79d3 into main May 23, 2026
3 of 4 checks passed

Copilot started reviewing on behalf of MervinPraison May 23, 2026 09:29 View session

MervinPraison mentioned this pull request May 23, 2026

docs: document session-store concurrency guarantees + LocalManagedAgent persist-state safety (PR #1727) MervinPraison/PraisonAIDocs#423

Open

6 tasks

greptile-apps Bot reviewed May 23, 2026

View reviewed changes

cursor Bot review requested due to automatic review settings May 23, 2026 09:49

		if hasattr(store, "update_session_metadata"):
		store.update_session_metadata(self._session_id, **state)

Uh oh!

Conversation

cursor Bot commented May 23, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bug and impact

Root cause

Fix

Validation

Summary by CodeRabbit

Uh oh!

MervinPraison commented May 23, 2026

Uh oh!

MervinPraison commented May 23, 2026

Uh oh!

qodo-code-review Bot commented May 23, 2026

Qodo reviews are paused for this user.

Uh oh!

coderabbitai Bot commented May 23, 2026

Uh oh!

coderabbitai Bot commented May 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

MervinPraison commented May 23, 2026

Uh oh!

Copilot AI commented May 23, 2026

Uh oh!

MervinPraison commented May 23, 2026

Uh oh!

praisonai-triage-agent Bot commented May 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Final Architecture Review ✅

Comprehensive Architecture Review

✅ Protocol-Driven Design Compliance

✅ Backward Compatibility Verified

✅ Performance Impact Assessment

✅ Multi-Agent Safety & Async Safety

✅ Previous Reviewer Findings Addressed

✅ Test Coverage Adequate

FINAL VERDICT: ✅ APPROVED

Uh oh!

qodo-code-review Bot commented May 23, 2026

Qodo reviews are paused for this user.

Uh oh!

Uh oh!

greptile-apps Bot commented May 23, 2026

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps Bot May 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cursor Bot commented May 23, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 23, 2026 •

edited

Loading

praisonai-triage-agent Bot commented May 23, 2026 •

edited

Loading