
Conversation

@yangcao77 (Contributor) commented Jan 12, 2026

Description

Include tool calls and tool results in the cache entry, which is stored and retrieved via the /v2/conversations endpoint.

Type of change

  • Refactor
  • New feature
  • Bug fix
  • CVE fix
  • Optimization
  • Documentation Update
  • Configuration Update
  • Bump-up service version
  • Bump-up dependent library
  • Bump-up library or tool used for development (does not change the final image)
  • CI configuration change
  • Konflux configuration change
  • Unit tests improvement
  • Integration tests improvement
  • End to end tests improvement

Tools used to create PR

Identify any AI code assistants used in this PR (for transparency and review context)

  • Assisted-by: (e.g., Claude, CodeRabbit, Ollama, etc., N/A if not used)
  • Generated by: (e.g., tool name and version; N/A if not used)

Related Tickets & Documents

Checklist before requesting a review

  • I have performed a self-review of my code.
  • PR has passed all pre-merge test jobs.
  • If it is a core feature, I have added thorough tests.

Testing

  • Please provide detailed steps to perform tests related to this code change.
  • How were the fix/results from this change verified? Please provide relevant screenshots or results.

Summary by CodeRabbit

  • New Features

    • Conversation cache now stores tool call and tool result metadata alongside transcripts.
  • Improvements

    • Cache retrieval restores tool interaction details for fuller conversation context and continuity.
    • End-to-end storage preserves tool execution summaries across sessions, improving replay and debugging.
  • Tests

    • Unit tests updated to cover storing, retrieving, and round-tripping of tool call/result metadata.


Signed-off-by: Stephanie <yangcao@redhat.com>
@coderabbitai bot (Contributor) commented Jan 12, 2026

Walkthrough

Tool call/result metadata fields (tool_calls, tool_results) were added and propagated through TurnSummary → CacheEntry → QueryResponse, and persisted/restored by Postgres and SQLite cache backends with JSON (de)serialization; tests updated to cover these fields.
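As a rough sketch of the data shapes involved (the two new fields match the walkthrough; the attributes of the summary models and the other CacheEntry fields are assumptions for illustration, not the PR's actual definitions):

from typing import Optional

from pydantic import BaseModel


class ToolCallSummary(BaseModel):
    """Summary of a single tool invocation (shape assumed)."""

    id: str
    name: str
    args: dict


class ToolResultSummary(BaseModel):
    """Summary of a single tool result (shape assumed)."""

    id: str
    status: str
    content: str


class CacheEntry(BaseModel):
    """One cached conversation turn; tool metadata is optional so old rows stay valid."""

    query: str
    response: str
    # New in this PR: both default to None for backward compatibility.
    tool_calls: Optional[list[ToolCallSummary]] = None
    tool_results: Optional[list[ToolResultSummary]] = None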

Changes

  • Models & Types (src/models/cache_entry.py, src/models/responses.py, src/utils/types.py):
    Added optional tool_calls: Optional[list[ToolCallSummary]] and tool_results: Optional[list[ToolResultSummary]] to the data models/types.
  • Endpoint handlers (src/app/endpoints/query.py, src/utils/endpoints.py):
    Propagate tool_calls and tool_results from the turn summary into CacheEntry and include them in QueryResponse; minor formatting cleanup.
  • Postgres cache backend (src/cache/postgres_cache.py):
    Add tool_calls/tool_results jsonb handling: SELECT/INSERT statements extended; values are serialized on write and deserialized into summary types on read, with logging for failures (see the sketch after this list).
  • SQLite cache backend (src/cache/sqlite_cache.py):
    Add tool_calls/tool_results text columns; serialize/deserialize JSON on write/read, update SQL statements, and log (de)serialization issues.
  • Tests (tests/unit/cache/test_postgres_cache.py, tests/unit/cache/test_sqlite_cache.py, tests/unit/utils/test_endpoints.py):
    Updated/added tests asserting serialization, storage, retrieval, and propagation of tool_calls and tool_results; fixtures use ToolCallSummary/ToolResultSummary.
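As a rough illustration of the (de)serialization flow both backends share (reusing the models sketched above; the actual SQL statements and column layout are in the PR diff):

import json


def serialize_tool_calls(tool_calls: list[ToolCallSummary] | None) -> str | None:
    """Serialize tool calls to a JSON string for a text/jsonb column."""
    if not tool_calls:
        return None
    return json.dumps([tc.model_dump(mode="json") for tc in tool_calls])


def deserialize_tool_calls(raw: str | None) -> list[ToolCallSummary] | None:
    """Restore tool calls from a stored JSON string; empty/missing becomes None."""
    if not raw:
        return None
    return [ToolCallSummary.model_validate(item) for item in json.loads(raw)]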

Sequence Diagram(s)

(omitted — changes are schema/propagation-focused and do not introduce a new multi-component control flow requiring a sequence diagram)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)

  • Description Check: ✅ Passed. Check skipped because CodeRabbit's high-level summary is enabled.
  • Docstring Coverage: ✅ Passed. Docstring coverage is 100.00%, which meets the required threshold of 80.00%.
  • Title Check: ✅ Passed. The pull request title accurately describes the main objective: storing tool calls and tool results in cache entries. The summary of changes confirms this is the primary focus across multiple files (cache_entry.py, postgres_cache.py, sqlite_cache.py, endpoints.py, and related tests).



Signed-off-by: Stephanie <yangcao@redhat.com>
@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 0

🧹 Nitpick comments (10)
src/cache/sqlite_cache.py (6)

13-14: Fix ungrouped imports to resolve linter warning.

The import from utils.types should be grouped with other utils imports to satisfy the ungrouped-imports linter rule.

♻️ Suggested fix

Move the utils.types import so it sits with the other utils imports (after from utils.connection_decorator import connection):

 from models.config import SQLiteDatabaseConfiguration
 from models.responses import ConversationData, ReferencedDocument
-from utils.types import ToolCallSummary, ToolResultSummary
 from log import get_logger
 from utils.connection_decorator import connection
+from utils.types import ToolCallSummary, ToolResultSummary

248-253: Fix implicit string concatenation in logger warning.

The linter flagged implicit string concatenation. Use a single string instead.

♻️ Suggested fix
                 except (json.JSONDecodeError, ValueError) as e:
                     logger.warning(
-                        "Failed to deserialize tool_calls for " "conversation %s: %s",
+                        "Failed to deserialize tool_calls for conversation %s: %s",
                         conversation_id,
                         e,
                     )

264-269: Fix implicit string concatenation in logger warning.

Same issue as above - use a single string.

♻️ Suggested fix
                 except (json.JSONDecodeError, ValueError) as e:
                     logger.warning(
-                        "Failed to deserialize tool_results for " "conversation %s: %s",
+                        "Failed to deserialize tool_results for conversation %s: %s",
                         conversation_id,
                         e,
                     )

333-338: Fix implicit string concatenation in logger warning.

♻️ Suggested fix
             except (TypeError, ValueError) as e:
                 logger.warning(
-                    "Failed to serialize tool_calls for " "conversation %s: %s",
+                    "Failed to serialize tool_calls for conversation %s: %s",
                     conversation_id,
                     e,
                 )

347-352: Fix implicit string concatenation in logger warning.

♻️ Suggested fix
             except (TypeError, ValueError) as e:
                 logger.warning(
-                    "Failed to serialize tool_results for " "conversation %s: %s",
+                    "Failed to serialize tool_results for conversation %s: %s",
                     conversation_id,
                     e,
                 )

197-284: Consider refactoring get method to reduce local variables.

The pylint error indicates too many local variables (19/15). Consider extracting the deserialization logic into helper methods to improve readability and satisfy the linter.

♻️ Suggested approach

Extract the JSON deserialization into a private helper method:

def _deserialize_json_field(
    self,
    json_str: str | None,
    model_class: type,
    field_name: str,
    conversation_id: str,
) -> list | None:
    """Deserialize a JSON string field into a list of Pydantic models."""
    if not json_str:
        return None
    try:
        data = json.loads(json_str)
        return [model_class.model_validate(item) for item in data]
    except (json.JSONDecodeError, ValueError) as e:
        logger.warning(
            "Failed to deserialize %s for conversation %s: %s",
            field_name,
            conversation_id,
            e,
        )
        return None

Then use it in get:

docs_obj = self._deserialize_json_field(
    conversation_entry[6], ReferencedDocument, "referenced_documents", conversation_id
)
tool_calls_obj = self._deserialize_json_field(
    conversation_entry[7], ToolCallSummary, "tool_calls", conversation_id
)
tool_results_obj = self._deserialize_json_field(
    conversation_entry[8], ToolResultSummary, "tool_results", conversation_id
)
src/cache/postgres_cache.py (4)

12-14: Fix ungrouped imports to resolve linter warning.

Same as SQLite cache - group utils imports together.

♻️ Suggested fix
 from models.responses import ConversationData, ReferencedDocument
-from utils.types import ToolCallSummary, ToolResultSummary
 from log import get_logger
 from utils.connection_decorator import connection
+from utils.types import ToolCallSummary, ToolResultSummary

364-369: Fix implicit string concatenation in logger warning.

♻️ Suggested fix
                 except (TypeError, ValueError) as e:
                     logger.warning(
-                        "Failed to serialize tool_calls for " "conversation %s: %s",
+                        "Failed to serialize tool_calls for conversation %s: %s",
                         conversation_id,
                         e,
                     )

378-383: Fix implicit string concatenation in logger warning.

♻️ Suggested fix
                 except (TypeError, ValueError) as e:
                     logger.warning(
-                        "Failed to serialize tool_results for " "conversation %s: %s",
+                        "Failed to serialize tool_results for conversation %s: %s",
                         conversation_id,
                         e,
                     )

230-317: Consider refactoring get method to reduce local variables.

Similar to the SQLite cache, the get method has too many local variables (16/15). Consider extracting deserialization logic into a helper method to satisfy the linter and reduce code duplication between the two cache implementations.
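One possible shape for such a shared helper (a sketch only; it assumes json is imported and a module-level logger exists, and the function name is illustrative, not from the PR):

def deserialize_summary_field(
    raw: str | list | None,
    model_class: type,
    field_name: str,
    conversation_id: str,
) -> list | None:
    """Deserialize a stored JSON field into Pydantic models, logging failures.

    Accepts either a JSON string (SQLite text column) or an already-parsed
    list (psycopg2 jsonb), so both cache backends could share it.
    """
    if not raw:
        return None
    try:
        data = json.loads(raw) if isinstance(raw, str) else raw
        return [model_class.model_validate(item) for item in data]
    except (json.JSONDecodeError, TypeError, ValueError) as e:
        logger.warning(
            "Failed to deserialize %s for conversation %s: %s",
            field_name,
            conversation_id,
            e,
        )
        return None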

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2bc4bdb and 7789ae1.

📒 Files selected for processing (7)
  • src/app/endpoints/query.py
  • src/cache/postgres_cache.py
  • src/cache/sqlite_cache.py
  • src/models/cache_entry.py
  • src/utils/endpoints.py
  • tests/unit/cache/test_postgres_cache.py
  • tests/unit/cache/test_sqlite_cache.py
🧰 Additional context used
📓 Path-based instructions (5)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use absolute imports for internal modules: from authentication import get_auth_dependency
Use FastAPI dependencies: from fastapi import APIRouter, HTTPException, Request, status, Depends
Use Llama Stack imports: from llama_stack_client import AsyncLlamaStackClient
Check constants.py for shared constants before defining new ones
All modules start with descriptive docstrings explaining purpose
Use logger = logging.getLogger(__name__) pattern for module logging
Type aliases defined at module level for clarity
All functions require docstrings with brief descriptions
Complete type annotations for function parameters and return types, using typing_extensions.Self for model validators
Use union types with modern syntax: str | int instead of Union[str, int]
Use Optional[Type] for optional parameters
Use snake_case with descriptive, action-oriented function names (get_, validate_, check_)
Avoid in-place parameter modification anti-patterns; return new data structures instead
Use async def for I/O operations and external API calls
Handle APIConnectionError from Llama Stack in error handling
Use logger.debug() for detailed diagnostic information
Use logger.info() for general information about program execution
Use logger.warning() for unexpected events or potential problems
Use logger.error() for serious problems that prevented function execution
All classes require descriptive docstrings explaining purpose
Use PascalCase for class names with descriptive names and standard suffixes: Configuration, Error/Exception, Resolver, Interface
Use ABC for abstract base classes with @abstractmethod decorators
Complete type annotations for all class attributes
Follow Google Python docstring conventions (https://google.github.io/styleguide/pyguide.html) with sections: Args, Returns, Raises, Attributes
Run uv run make format to auto-format code with black and ruff before completion
Run uv run make verify to run all linters (black, pyl...

Files:

  • tests/unit/cache/test_sqlite_cache.py
  • src/cache/sqlite_cache.py
  • src/cache/postgres_cache.py
  • src/utils/endpoints.py
  • src/models/cache_entry.py
  • src/app/endpoints/query.py
  • tests/unit/cache/test_postgres_cache.py
tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

tests/**/*.py: Use pytest for all unit and integration tests, not unittest
Use pytest-mock for AsyncMock objects in tests
Use MOCK_AUTH = ("mock_user_id", "mock_username", False, "mock_token") pattern for authentication mocks in tests

Files:

  • tests/unit/cache/test_sqlite_cache.py
  • tests/unit/cache/test_postgres_cache.py
tests/unit/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

tests/unit/**/*.py: Unit tests require 60% code coverage
Write unit tests covering new functionality before completion

Files:

  • tests/unit/cache/test_sqlite_cache.py
  • tests/unit/cache/test_postgres_cache.py
src/models/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

src/models/**/*.py: Use @field_validator and @model_validator for custom validation in Pydantic models
Extend BaseModel for data Pydantic models
Use @model_validator and @field_validator for Pydantic model validation

Files:

  • src/models/cache_entry.py
src/app/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Use FastAPI HTTPException with appropriate status codes for API endpoint error handling

Files:

  • src/app/endpoints/query.py
🧠 Learnings (1)
📚 Learning: 2026-01-12T10:58:40.230Z
Learnt from: blublinsky
Repo: lightspeed-core/lightspeed-stack PR: 972
File: src/models/config.py:459-513
Timestamp: 2026-01-12T10:58:40.230Z
Learning: In lightspeed-core/lightspeed-stack, for Python files under src/models, when a user claims a fix is done but the issue persists, verify the current code state before accepting the fix. Steps: review the diff, fetch the latest changes, run relevant tests, reproduce the issue, search the codebase for lingering references to the original problem, confirm the fix is applied and not undone by subsequent commits, and validate with local checks to ensure the issue is resolved.

Applied to files:

  • src/models/cache_entry.py
🧬 Code graph analysis (5)
tests/unit/cache/test_sqlite_cache.py (4)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/models/cache_entry.py (1)
  • CacheEntry (9-30)
src/models/responses.py (2)
  • model (1736-1749)
  • ReferencedDocument (328-342)
src/cache/sqlite_cache.py (2)
  • insert_or_append (287-379)
  • get (197-284)
src/cache/sqlite_cache.py (1)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/cache/postgres_cache.py (1)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/models/cache_entry.py (1)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
tests/unit/cache/test_postgres_cache.py (2)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/models/cache_entry.py (1)
  • CacheEntry (9-30)
🪛 GitHub Actions: Python linter
src/cache/sqlite_cache.py

[error] 197-197: Too many local variables (19/15) (too-many-locals) detected by pylint.


[warning] 250-250: Implicit string concatenation found in call (implicit-str-concat).


[warning] 266-266: Implicit string concatenation found in call (implicit-str-concat).


[warning] 335-335: Implicit string concatenation found in call (implicit-str-concat).


[warning] 349-349: Implicit string concatenation found in call (implicit-str-concat).


[warning] 15-15: Imports from package utils are not grouped (ungrouped-imports).

src/cache/postgres_cache.py

[error] 230-230: Too many local variables (16/15) (too-many-locals) detected by pylint.


[warning] 366-366: Implicit string concatenation found in call (implicit-str-concat).


[warning] 380-380: Implicit string concatenation found in call (implicit-str-concat).


[warning] 14-14: Imports from package utils are not grouped (ungrouped-imports).

🪛 GitHub Actions: Unit tests
src/utils/endpoints.py

[error] 801-801: pydantic ValidationError: tool_results must be a list. Received a Mock object in CacheEntry; ensure tool_results is provided as a list (e.g., [] or a list of results).


[error] 801-801: CacheEntry creation failed during tests: tool_results expects a list but a non-list value (Mock) was provided.


[error] 1-1: Command failed with exit code 1. uv run pytest tests/unit --cov=src --cov=runner --cov-report term-missing

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: build-pr
  • GitHub Check: Konflux kflux-prd-rh02 / lightspeed-stack-on-pull-request
  • GitHub Check: E2E: library mode / ci
  • GitHub Check: E2E: server mode / ci
🔇 Additional comments (8)
src/models/cache_entry.py (1)

6-30: LGTM!

The new tool_calls and tool_results fields are correctly defined as optional lists with proper type annotations. The implementation follows the existing pattern established by referenced_documents.

tests/unit/cache/test_sqlite_cache.py (2)

478-525: LGTM!

Comprehensive test coverage for the new tool_calls and tool_results fields. The test properly validates:

  • Storage of ToolCallSummary and ToolResultSummary objects
  • Correct round-trip serialization/deserialization through SQLite
  • Field values are preserved after retrieval

528-573: LGTM!

Good coverage of the combined case where all optional fields (referenced_documents, tool_calls, tool_results) are present. This ensures the serialization logic handles multiple JSON fields correctly.

src/app/endpoints/query.py (2)

385-387: LGTM!

Consistent handling of tool_calls and tool_results in the CacheEntry construction, following the same pattern used elsewhere in the codebase.


419-420: LGTM!

The QueryResponse now correctly exposes tool_calls and tool_results to API consumers, enabling clients to access tool usage metadata from query responses.

tests/unit/cache/test_postgres_cache.py (2)

726-839: LGTM!

Comprehensive test coverage for PostgreSQL cache handling of tool_calls and tool_results. The test validates:

  • Correct JSON serialization on insert
  • Proper parameter positioning in SQL statements
  • Correct deserialization and model validation on retrieval
  • Round-trip consistency of all field values

601-606: LGTM!

Index updates correctly reflect the new column order with tool_calls and tool_results at the end. The comment clarifies the parameter positioning.

src/utils/endpoints.py (1)

809-811: No issues found. The code logic is correct, and tests properly instantiate TurnSummary with real list objects rather than Mock objects. The truthy check correctly handles both empty and non-empty lists, converting empty lists to None as expected by the CacheEntry optional fields.

Likely an incorrect or invalid review comment.

Signed-off-by: Stephanie <yangcao@redhat.com>
@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
src/cache/postgres_cache.py (1)

1-14: Run Black formatter before merging.

The pipeline indicates Black would reformat this file. Per coding guidelines, run uv run make format to auto-format the code with Black and Ruff before completion.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7789ae1 and ba3e2a6.

📒 Files selected for processing (2)
  • src/cache/postgres_cache.py
  • src/cache/sqlite_cache.py
🚧 Files skipped from review as they are similar to previous changes (1)
  • src/cache/sqlite_cache.py
🧰 Additional context used
📓 Path-based instructions (1)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md): same **/*.py guidelines as listed in the first review.

Files:

  • src/cache/postgres_cache.py
🧬 Code graph analysis (1)
src/cache/postgres_cache.py (3)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/cache/sqlite_cache.py (1)
  • get (197-286)
src/cache/cache.py (1)
  • get (47-59)
🪛 GitHub Actions: Black
src/cache/postgres_cache.py

[error] 1-1: Black would reformat this file. Run 'uv tool run black --check .' to see details.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: build-pr
  • GitHub Check: Konflux kflux-prd-rh02 / lightspeed-stack-on-pull-request
  • GitHub Check: E2E: server mode / ci
  • GitHub Check: E2E: library mode / ci
🔇 Additional comments (6)
src/cache/postgres_cache.py (6)

13-14: LGTM!

Imports are correctly added for the new ToolCallSummary and ToolResultSummary types needed for the tool metadata serialization/deserialization.


36-38: LGTM!

Schema documentation and table definition are consistent. Using jsonb for tool metadata is appropriate for PostgreSQL as it provides efficient JSON storage with indexing capabilities.

Also applies to: 61-62


82-95: LGTM!

SQL statements are correctly updated. The SELECT column order (indices 7 and 8 for tool_calls and tool_results) aligns with the deserialization logic in get(), and the INSERT placeholder count matches the parameter list.


270-302: LGTM!

Deserialization logic correctly handles PostgreSQL's automatic jsonb parsing. Unlike the SQLite implementation that requires json.loads(), psycopg2 deserializes jsonb to Python objects directly. Catching ValueError and TypeError for model_validate failures is appropriate, and the warning-based graceful degradation maintains consistency with the referenced_documents pattern.
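To make the difference concrete, a hedged sketch of the two read paths (the column index follows the review's description; the real code may differ):

# SQLite: the text column comes back as a JSON string and must be parsed first.
raw = conversation_entry[7]  # tool_calls column
tool_calls = (
    [ToolCallSummary.model_validate(item) for item in json.loads(raw)]
    if raw
    else None
)

# PostgreSQL: psycopg2 already decodes jsonb into Python lists/dicts,
# so model validation runs directly on the fetched value.
raw = row[7]
tool_calls = [ToolCallSummary.model_validate(item) for item in raw] if raw else None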


304-315: LGTM!

CacheEntry construction properly includes the new tool_calls and tool_results fields, maintaining consistency with the SQLite implementation and the expanded model definition.


357-402: LGTM!

Serialization logic correctly converts Pydantic models to JSON strings for PostgreSQL jsonb storage. The pattern matches the existing referenced_documents handling, with appropriate error handling that logs warnings while allowing the insert to proceed with null values for the affected fields.

Signed-off-by: Stephanie <yangcao@redhat.com>
@tisnik (Contributor) commented Jan 13, 2026

@coderabbitai full review

@coderabbitai bot (Contributor) commented Jan 13, 2026

✅ Actions performed

Full review triggered.

@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
src/cache/postgres_cache.py (1)

49-65: Database migration required for existing PostgreSQL databases with new tool_calls and tool_results columns.

The CREATE TABLE IF NOT EXISTS statement will not add tool_calls and tool_results columns to existing cache tables. Production databases must execute an explicit migration (e.g., ALTER TABLE cache ADD COLUMN tool_calls jsonb; and ALTER TABLE cache ADD COLUMN tool_results jsonb;) to support the new columns referenced in the SELECT (line 84) and INSERT (lines 92-94) statements; a sketch of an idempotent variant follows.
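A hedged sketch of such a migration (PostgreSQL 9.6+ supports IF NOT EXISTS on ADD COLUMN; the function name and connection handling are illustrative):

def add_tool_metadata_columns(connection) -> None:
    """Add the new tool metadata columns to an existing cache table."""
    with connection.cursor() as cursor:
        # IF NOT EXISTS makes the migration safe to run on every startup.
        cursor.execute("ALTER TABLE cache ADD COLUMN IF NOT EXISTS tool_calls jsonb")
        cursor.execute("ALTER TABLE cache ADD COLUMN IF NOT EXISTS tool_results jsonb")
    connection.commit()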

src/cache/sqlite_cache.py (1)

48-64: Add database migration for existing SQLite deployments to support new tool_calls and tool_results columns.

The CREATE_CACHE_TABLE statement uses CREATE TABLE IF NOT EXISTS, which skips table creation if the table already exists. Existing SQLite databases will retain their old schema without the newly added tool_calls and tool_results columns. Subsequent SELECT and INSERT operations against these columns will fail at runtime.

Implement a migration strategy such as:

  • ALTER TABLE cache ADD COLUMN tool_calls text;
  • ALTER TABLE cache ADD COLUMN tool_results text;

Alternatively, use a database schema version check (via PRAGMA user_version) in initialize_cache() to detect old schemas and apply necessary alterations, ensuring backward compatibility with existing deployments.
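A minimal sketch of the PRAGMA user_version approach (the version constant and function placement are assumptions; a fresh database would also need its user_version set when the schema is first created):

import sqlite3

CACHE_SCHEMA_VERSION = 2  # hypothetical version introduced by this change


def migrate_cache_schema(connection: sqlite3.Connection) -> None:
    """Upgrade pre-existing cache databases to include the tool metadata columns."""
    (version,) = connection.execute("PRAGMA user_version").fetchone()
    if version < CACHE_SCHEMA_VERSION:
        # SQLite has no ADD COLUMN IF NOT EXISTS, so gate on the schema version.
        connection.execute("ALTER TABLE cache ADD COLUMN tool_calls text")
        connection.execute("ALTER TABLE cache ADD COLUMN tool_results text")
        # PRAGMA does not accept bound parameters; the value is a trusted constant.
        connection.execute(f"PRAGMA user_version = {CACHE_SCHEMA_VERSION}")
        connection.commit()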

🧹 Nitpick comments (1)
tests/unit/utils/test_endpoints.py (1)

1126-1128: LGTM!

Consistent update with the first test case.

Consider adding test cases that verify non-empty tool_calls and tool_results are correctly propagated to the CacheEntry in cleanup_after_streaming. This would provide coverage for the new functionality beyond just ensuring the mocks don't break existing tests.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2bc4bdb and 3899c70.

📒 Files selected for processing (8)
  • src/app/endpoints/query.py
  • src/cache/postgres_cache.py
  • src/cache/sqlite_cache.py
  • src/models/cache_entry.py
  • src/utils/endpoints.py
  • tests/unit/cache/test_postgres_cache.py
  • tests/unit/cache/test_sqlite_cache.py
  • tests/unit/utils/test_endpoints.py
🧰 Additional context used
📓 Path-based instructions (5)

Same CLAUDE.md path-based guidelines as in the first review (**/*.py, tests/**/*.py, tests/unit/**/*.py, src/models/**/*.py, src/app/**/*.py), applied to:

  • tests/unit/utils/test_endpoints.py
  • src/cache/postgres_cache.py
  • src/models/cache_entry.py
  • src/cache/sqlite_cache.py
  • tests/unit/cache/test_postgres_cache.py
  • src/utils/endpoints.py
  • src/app/endpoints/query.py
  • tests/unit/cache/test_sqlite_cache.py
🧠 Learnings (1)

Same learning from blublinsky (PR 972, 2026-01-12) as in the first review, applied to src/models/cache_entry.py.
🧬 Code graph analysis (3)
src/models/cache_entry.py (1)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
tests/unit/cache/test_postgres_cache.py (3)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/cache/postgres_cache.py (1)
  • PostgresCache (19-522)
src/models/cache_entry.py (1)
  • CacheEntry (9-30)
tests/unit/cache/test_sqlite_cache.py (4)
src/utils/types.py (2)
  • ToolCallSummary (111-119)
  • ToolResultSummary (122-133)
src/models/cache_entry.py (1)
  • CacheEntry (9-30)
src/models/responses.py (2)
  • model (1736-1749)
  • ReferencedDocument (328-342)
src/cache/sqlite_cache.py (2)
  • insert_or_append (287-379)
  • get (197-284)
🔇 Additional comments (17)
src/models/cache_entry.py (1)

1-30: LGTM!

The new tool_calls and tool_results fields are properly added with correct types, appropriate Optional typing with None defaults for backward compatibility, and the docstring is updated to document the new attributes.

src/utils/endpoints.py (1)

809-810: LGTM!

The new fields are correctly populated from the summary object, following the same pattern used for referenced_documents on line 808. This ensures consistency in how optional list fields are handled (converting empty lists to None).
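A minimal sketch of that pattern (attribute names other than the two new fields are assumed for illustration):

cache_entry = CacheEntry(
    query=summary.query,
    response=summary.response,
    referenced_documents=summary.referenced_documents or None,
    tool_calls=summary.tool_calls if summary.tool_calls else None,  # [] -> None
    tool_results=summary.tool_results if summary.tool_results else None,
)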

src/cache/sqlite_cache.py (2)

196-284: LGTM on deserialization logic.

The deserialization of tool_calls (index 7) and tool_results (index 8) follows the established pattern for referenced_documents. Error handling with warning logs is appropriate for graceful degradation when encountering malformed data.


326-368: LGTM on serialization logic.

The serialization follows the same pattern as referenced_documents, using model_dump(mode="json") for Pydantic models and proper error handling with warning logs.

src/cache/postgres_cache.py (2)

270-314: LGTM on deserialization logic.

The deserialization correctly relies on psycopg2's native jsonb handling (no json.loads needed) and follows the established pattern for referenced_documents. The error handling with warning logs ensures graceful degradation.


357-401: LGTM on serialization logic.

The serialization correctly uses json.dumps for inserting jsonb data and follows the same error handling pattern as referenced_documents.

tests/unit/utils/test_endpoints.py (1)

1072-1074: LGTM!

The mock summary is correctly updated to include both tool_calls and tool_results attributes, matching the expected TurnSummary structure.
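For context on the earlier unit-test failure (a bare Mock leaking into a validated field), a hedged sketch of the kind of fixture update that resolves it; the exact test code may differ:

def test_cleanup_after_streaming(mocker):
    """Sketch: give the mocked summary real lists so CacheEntry validation passes."""
    mock_summary = mocker.Mock()
    mock_summary.llm_response = "response"  # attribute name assumed
    # Without explicit values, Mock auto-creates child Mocks for these
    # attributes, which fail pydantic validation on CacheEntry's list fields.
    mock_summary.tool_calls = []
    mock_summary.tool_results = []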

tests/unit/cache/test_sqlite_cache.py (4)

16-16: LGTM!

The import correctly uses absolute imports as per coding guidelines, importing the new ToolCallSummary and ToolResultSummary types needed for the new test cases.


474-476: LGTM!

Good addition to verify that the new optional fields default to None when not explicitly provided, ensuring backward compatibility with existing cache entries.


478-525: LGTM!

Comprehensive test that validates the full serialization/deserialization round-trip for tool_calls and tool_results. The test properly constructs model instances, persists them, retrieves them, and verifies both equality and individual field values.


528-573: LGTM!

Excellent integration test that verifies all optional fields (referenced_documents, tool_calls, tool_results) can coexist in a single CacheEntry and survive the serialization/deserialization round-trip. This provides good confidence in the schema integrity.

src/app/endpoints/query.py (2)

385-387: LGTM!

The CacheEntry construction correctly populates tool_calls and tool_results from the summary, using conditional expressions to normalize empty lists to None for consistency with the Optional field types.


416-426: LGTM!

The QueryResponse construction correctly includes tool_calls and tool_results, maintaining consistency with the CacheEntry construction pattern. This ensures tool usage metadata is available to API consumers.

tests/unit/cache/test_postgres_cache.py (4)

17-17: LGTM!

Correct absolute import of ToolCallSummary and ToolResultSummary types needed for the new test cases.


601-628: LGTM!

Good updates to account for the new schema columns. The comment clarifying the index position (-3 for referenced_documents) improves maintainability, and the mock return value correctly includes None for the new tool_calls and tool_results columns.


655-682: LGTM!

Thorough updates that validate all optional fields (referenced_documents, tool_calls, tool_results) are properly handled as None when not provided. The inline comments improve code clarity.


726-839: LGTM!

Comprehensive test that validates the full round-trip for tool_calls and tool_results in the PostgreSQL cache. The test properly:

  • Constructs model instances with all required fields
  • Verifies JSON serialization structure matches expected format
  • Mocks the database return value correctly
  • Validates deserialization produces equivalent objects
  • Follows established mocking patterns in the test suite

@tisnik changed the title from "store tool calls and tool results in cache entry" to "LCORE-1166: store tool calls and tool results in cache entry" on Jan 13, 2026
@tisnik (Contributor) left a comment

LGTM, thank you!

@tisnik merged commit 3dfedf0 into lightspeed-core:main on Jan 13, 2026
19 of 23 checks passed