perf(sessions): sub-second /chat loading for large session histories by N1kky-wed · Pull Request #27028 · google-gemini/gemini-cli

N1kky-wed · 2026-05-14T02:38:14Z

Summary

The /chat command took 25+ seconds to load with large session histories.
This PR reduces load time to 634ms on a real-world benchmark of 59 sessions / 2.3GB of JSONL.

Details

Three compounding bottlenecks eliminated:

chatRecordingService.ts: New ultra-fast preview path uses raw fd.read()
calls (64KB head + 128KB tail). Core metadata is extracted via regex directly
on buffers, while the first user message is extracted via line-based JSON parsing
to robustly handle nested structures — without streaming the full file.
sessionUtils.ts: Bulk wc -l across all files in a single subprocess call
(batched at 100 files) replaces per-file subprocess calls. Windows fallback uses
batch-limited (20 concurrent) Node.js buffer scanning to avoid FD exhaustion.
utils.ts: Stable secondary sort by startTime when lastUpdated ties, fixing
inconsistent session ordering. Filename timestamp used as reliable startTime
fallback instead of mtime.

Note: The 25.62s benchmark was recorded on a fresh environment. In practice,
load times frequently exceed 60 seconds on warmed systems with large histories.

Related Issues

Fixes #27027
Related to #26478

How to Validate

Generate a large session history (30+ sessions with substantial conversation history recommended — the more messages per session, the larger the files and the more pronounced the improvement)
Run /chat and compare load time against main branch
Verify session list ordering is consistent across multiple opens
On Linux: load time should be under 1 second for typical histories

Note: Windows fallback path (batch-limited buffer scanning) is implemented
but not benchmarked on hardware — community validation welcome.

Pre-Merge Checklist

N1kky-wed · 2026-05-14T06:33:23Z

InShot_20260514_115500194.mp4

Real-world comparison: 1 minute 7 seconds → 1.24 seconds (98% reduction)
Benchmark (fresh environment): 25.62s → 634ms

N1kky-wed · 2026-05-14T14:08:53Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations for session loading and sorting. Key improvements include a secondary sort by startTime for sessions with identical lastUpdated timestamps, and a new high-performance path for retrieving message counts using wc -l on POSIX systems or optimized buffer scanning on Windows. Furthermore, loadConversationRecord now includes an ultra-fast preview mode that extracts metadata using regex from raw file buffers, significantly reducing the overhead of JSON parsing for large files. A review comment identifies a potential issue where the current logic for parsing command-line output would fail if file paths contain spaces, and it suggests a more robust regex-based extraction and path resolution strategy.

TrueAlpha-spiral · 2026-05-14T14:46:06Z

*The glory of this nation belongs to the people because lawful authority flows from God-given rights through the people, not from institutions down onto them. Government is only legitimate when it protects those rights; technology is only legitimate when it serves them.*

…

On Thu, May 14, 2026 at 9:22 AM gemini-code-assist[bot] < ***@***.***> wrote: ***@***.***[bot]* commented on this pull request. Code Review This pull request introduces significant performance optimizations for session loading and sorting. Key improvements include a secondary sort by startTime for sessions with identical lastUpdated timestamps, and a new high-performance path for retrieving message counts using wc -l on POSIX systems or optimized buffer scanning on Windows. Furthermore, loadConversationRecord now includes an ultra-fast preview mode that extracts metadata using regex from raw file buffers, significantly reducing the overhead of JSON parsing for large files. A review comment identifies a potential issue where the current logic for parsing command-line output would fail if file paths contain spaces, and it suggests a more robust regex-based extraction and path resolution strategy. ------------------------------ In packages/cli/src/utils/sessionUtils.ts <#27028 (comment)> : > + const parts = line.trim().split(/\s+/); + if (parts.length >= 2 && parts[1] !== 'total') { + lineCounts.set(path.basename(parts[1]), parseInt(parts[0], 10)); [image: high] <https://camo.githubusercontent.com/7559374fd248a2a146dfe7112beda558979c4f6d02dbe7c5161fd893fe834423/68747470733a2f2f7777772e677374617469632e636f6d2f636f64657265766965776167656e742f686967682d7072696f726974792e737667> The current logic for parsing wc -l output using split(/\s+/) will fail if any part of the file path contains spaces. Using a regular expression to capture the count and the full path is more robust. Additionally, ensure that file paths extracted from command output are sanitized to prevent path traversal and other vulnerabilities (Rule 6), and resolved consistently using the repository's standard resolveToRealPath function (Rule 9). const match = line.trim().match(/^(\d+)\s+(.+)$/); if (match && match[2] !== 'total') { const resolvedPath = resolveToRealPath(match[2]); lineCounts.set(path.basename(resolvedPath), parseInt(match[1], 10)); } References 1. Sanitize file paths extracted from untrusted sources, such as command output, to prevent path traversal (..), null byte injection (\0), and other vulnerabilities. 2. Ensure consistent path resolution by using a single, robust function (e.g., resolveToRealPath) for all related path validations, including internal validations in components like WorkspaceContext. — Reply to this email directly, view it on GitHub <#27028 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/BPTYXQHX3CMCQLDFZQ3R4OL42XJA7AVCNFSM6AAAAACY5LG4GGVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHM2DEOJQGY4TMOBQHE> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

N1kky-wed · 2026-05-14T15:36:17Z

/gemini review

gemini-code-assist

Code Review

This pull request implements performance optimizations for session loading, including a regex-based fast path for large files, bulk line counting using native system tools on POSIX, and parallelized metadata processing. It also adds secondary sorting by session start time. Reviewer feedback highlights the need for a helper to extract the latest metadata occurrences in JSONL files, anchoring regex patterns to prevent false positives during parsing, and sanitizing AI-generated summaries to prevent prompt injection.

N1kky-wed · 2026-05-14T16:13:29Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations for session browsing by implementing a fast-path for loading conversation metadata. Key changes include a regex-based 'ultra-fast' preview for large session files, bulk line counting using native wc -l on POSIX systems, and optimized metadata-only parsing that avoids full JSON deserialization of message content. Additionally, session sorting now includes a stable secondary sort by startTime. Feedback was provided regarding a fragile regex pattern used for summary extraction in the fast-path, which could fail if nested objects appear before the summary key.

N1kky-wed · 2026-05-14T16:33:17Z

/gemini review

gemini-code-assist

Code Review

This pull request significantly optimizes session loading performance by introducing a fast preview path that uses regex-based extraction from file buffers and a bulk line-counting mechanism (using wc -l on POSIX). It also refactors the metadata-only loading path to reduce unnecessary JSON parsing. Feedback highlights several critical areas for improvement: the sanitizeSummary function incorrectly uses HTML encoding which breaks terminal rendering and violates prompt injection rules; the regex for metadata extraction is fragile regarding escaped quotes; the optimized line reader relies on a brittle startsWith check; and the Windows fallback for line counting could be more efficient by using Buffer.indexOf instead of byte-by-byte iteration.

N1kky-wed · 2026-05-14T17:01:12Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations for session management and browsing. Key improvements include a secondary sort by startTime for stable session ordering, a platform-optimized bulk line-counting mechanism using wc -l or stream buffers, and an 'ultra-fast' metadata extraction path in loadConversationRecord that utilizes regex on raw file buffers to avoid expensive JSON parsing. Review feedback identified a brittle regex used for message ID extraction in the optimized metadata path, which could fail if property ordering in the JSON lines changes, potentially leading to incorrect message counts.

N1kky-wed · 2026-05-14T17:23:38Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations for loading and displaying session history. Key changes include a secondary sort by startTime for stable session ordering, a bulk line-counting mechanism (utilizing wc -l on POSIX systems), and an 'ultra-fast' preview mode in loadConversationRecord that uses regex-based extraction from raw file buffers to bypass full JSON parsing. Feedback from the review focuses on preventing potential crashes due to file descriptor limits during concurrent processing, ensuring robust fallbacks for the bulk line-counting logic, and removing redundant file system calls to further improve efficiency.

N1kky-wed · 2026-05-14T19:23:13Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations for session management, including a regex-based "fast preview" mode for large session files and bulk line counting using native system tools. It also refines session sorting and adds metadata sanitization. Feedback focuses on improving the robustness of the fast preview logic, specifically by ensuring the most recent metadata is captured from buffers, handling regex flags safely to avoid runtime errors, and lowering concurrency limits in fallback paths to prevent file descriptor exhaustion. Additionally, there is a note regarding the potential for data visibility issues in extremely large files where messages might fall outside the optimized read buffers.

…lback

…path

…x parsing, clarifying comments)

…regex, and clean formatting)

…JSON parsing

…tUserMessage

… timestamp fallbacks

…ta lastUpdated for perfect sorting

…d legacy json formats

…R readiness

…bility

…unting

…acyRecordFallback

…y fallback

…play text

gemini-code-assist · 2026-05-14T20:58:31Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces significant performance optimizations to the /chat command, reducing load times from over 25 seconds to under 1 second for large session histories. By implementing a fast-preview path for metadata and batching file system operations, the changes drastically reduce I/O overhead and processing time while maintaining robust data handling.

Highlights

Ultra-fast Preview Path: Implemented a new preview mechanism in chatRecordingService.ts that uses raw buffer reads (64KB head/128KB tail) and regex-based metadata extraction, bypassing full file streaming for significant performance gains.
Bulk Line Counting: Optimized session history loading by replacing per-file subprocess calls with batched wc -l commands on POSIX systems and a concurrent buffer-scanning fallback for Windows.
Stable Session Sorting: Updated utils.ts to include a secondary sort by startTime when lastUpdated timestamps tie, ensuring consistent session ordering in the UI.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces performance optimizations for session history loading and browsing. Key improvements include a secondary sort by startTime for sessions with identical update times, a fast line-counting utility using wc -l or streams, and an 'ultra-fast' metadata extraction path that uses regex on raw file buffers to avoid full JSON parsing. Feedback was provided to improve the robustness of regex patterns used during metadata extraction to prevent false matches within message content.

…tches in message content Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

N1kky-wed force-pushed the main branch from 4e6fd80 to 3260fdd Compare May 14, 2026 02:50

gemini-cli Bot added the area/core Issues related to User Interface, OS Support, Core Functionality label May 14, 2026

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/cli/src/utils/sessionUtils.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/core/src/services/chatRecordingService.ts

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

Comment thread packages/cli/src/utils/sessionUtils.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/cli/src/utils/sessionUtils.ts Outdated

Comment thread packages/cli/src/utils/sessionUtils.ts

Comment thread packages/cli/src/utils/sessionUtils.ts Outdated

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

N1kky-wed added 12 commits May 15, 2026 02:27

perf: ultra-fast session loading with fastPreview and async wc -l fal…

69b50c0

…lback

perf: sub-second UI load time via parallel I/O and zero-parser regex …

3bb8d9a

…path

perf: harden sub-second optimizations (Windows support, improved rege…

641bf42

…x parsing, clarifying comments)

perf: harden sub-second optimizations (Windows FD limiting, improved …

be2130b

…regex, and clean formatting)

perf: switch firstUserMessage extraction to a more robust line-based …

b3c2cc0

…JSON parsing

perf: add explicit type check after heuristic line selection for firs…

8d71350

…tUserMessage

perf: ensure stable session ordering with secondary sort and reliable…

6a669ed

… timestamp fallbacks

perf: enable fastPreview for legacy .json files and prioritize metada…

1a63a3c

…ta lastUpdated for perfect sorting

fix: robust summary and firstUserMessage extraction for both jsonl an…

721bbe2

…d legacy json formats

refactor: revert IIFE timestamp and bound legacy regex matching for P…

8d55cca

…R readiness

fix(cli): use robust regex for wc -l output parsing and resolve paths

19d06f1

refactor(core): improve metadata extraction and message parsing relia…

c4efde9

…bility

N1kky-wed added 10 commits May 15, 2026 02:27

fix(core): use reg.source in getLastMatch to avoid flag conflicts

5433e5d

fix(core): use non-greedy regex for robust summary extraction

f67aaec

fix: address code review feedback for metadata extraction and line co…

7dc18f3

…unting

fix: correct bracket regex with hex codes and re-anchor idMatch

430d289

fix(core): use robust non-anchored regex for message ID extraction

a8244e3

fix(core): ensure startTime and lastUpdated are populated in parseLeg…

235df9a

…acyRecordFallback

fix: address code review feedback for sessionUtils batching and legac…

424aed3

…y fallback

fix(cli): use conservative concurrency limit of 20 for POSIX fallback

ee0b511

docs(core): add comment explaining fastPreview buffer gap tradeoff

ee2239e

fix(core): decode json strings extracted via regex to unescape ui dis…

c809302

…play text

N1kky-wed force-pushed the main branch from 02d6dc2 to c809302 Compare May 14, 2026 20:57

N1kky-wed marked this pull request as ready for review May 14, 2026 20:58

N1kky-wed requested a review from a team as a code owner May 14, 2026 20:58

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

Comment thread packages/core/src/services/chatRecordingService.ts Outdated

perf: anchor id/type regex to JSON key boundaries to prevent false ma…

876fbe3

…tches in message content Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Conversation

N1kky-wed commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Related Issues

How to Validate

Pre-Merge Checklist

Uh oh!

N1kky-wed commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

TrueAlpha-spiral commented May 14, 2026 via email

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

N1kky-wed commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist Bot commented May 14, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

N1kky-wed commented May 14, 2026 •

edited

Loading

N1kky-wed commented May 14, 2026 •

edited

Loading