feat: Expose configurable indexing parameters for codebase indexing #10397

roomote · 2025-12-30T04:52:25Z

Summary

This PR attempts to address Issue #10396 by exposing configurable chunking and batch processing parameters for codebase indexing.

Changes

Adds three new configurable parameters for codebase indexing:

Embedding Batch Size (10-200, default 60): Number of code segments batched together for embeddings. Higher values can speed up indexing on powerful hardware. Lower values reduce memory usage.
Max Chunk Size (200-5000 chars, default 1000): Maximum characters per code chunk. Larger chunks provide more context but may reduce search precision. Smaller chunks enable finer-grained search results.
Parsing Concurrency (1-50, default 10): Number of files to parse concurrently during indexing. Higher values speed up indexing but use more CPU and memory.

Implementation Details

Added new config fields in packages/types/src/codebase-index.ts with Zod validation
Updated constants in src/services/code-index/constants/index.ts to use defaults from the types package
Updated config-manager.ts to load and expose new settings with getters
Updated DirectoryScanner to use configurable parsing concurrency
Added UI sliders in CodeIndexPopover.tsx for the new settings in the Advanced Settings section
Added i18n translations for new settings labels

Testing

All existing tests pass
Linting passes with no warnings

Feedback and guidance are welcome!

…r codebase indexing Adds three new configurable parameters for codebase indexing: - Embedding Batch Size (10-200, default 60): Number of code segments batched for embeddings - Max Chunk Size (200-5000 chars, default 1000): Maximum characters per code chunk - Parsing Concurrency (1-50, default 10): Number of concurrent file parsing operations Changes: - Added new config fields in packages/types/src/codebase-index.ts - Updated constants in src/services/code-index/constants/index.ts - Updated config-manager.ts to load and expose new settings - Updated DirectoryScanner to use configurable parsing concurrency - Added UI sliders in CodeIndexPopover.tsx for the new settings - Added i18n translations for new settings labels Closes #10396

roomote · 2025-12-30T04:52:53Z

Rooviewer See task on Roo Cloud

Reviewed the latest indentation fix. All previous issues remain resolved and no new issues found.

maxChunkSize configuration not applied to parser - Fixed. The CodeParser class now accepts a maxBlockChars parameter via constructor, the factory creates instances with config.maxChunkSize, and FileWatcher uses the injected parser instead of the singleton.

Previous reviews

d96f9f4: Review #1

e60b32f: Review #2

19f24da: Review #3

_{Mention @roomote in a comment to request specific changes to this pull request or fix all unresolved issues.}

roomote · 2025-12-30T04:56:16Z

src/services/code-index/service-factory.ts

-		}
-		return new DirectoryScanner(embedder, vectorStore, parser, this.cacheManager, ignoreInstance, batchSize)
+		// Get the configurable settings from config manager
+		const config = this.configManager.getConfig()


The maxChunkSize configuration is retrieved via config.maxChunkSize but never actually used. The CodeParser class in processors/parser.ts is a singleton that still uses the hardcoded MAX_BLOCK_CHARS constant directly. Unlike embeddingBatchSize and parsingConcurrency which are correctly passed to DirectoryScanner, the chunk size setting will have no effect on parsing behavior because the parser doesn't receive this configuration.

To fix this, the parser would need to either accept a maxChunkSize parameter (similar to how DirectoryScanner accepts batchSegmentThreshold), or read from the config at runtime.

_{Fix it with Roo Code or mention @roomote and request a fix.}

@roomote request fix

Fixaroo See task on Roo Cloud

Fixed the reported issue. All local checks passed.

_{View commit | Revert commit}

Fixaroo See task on Roo Cloud

Fixed the reported issues. Added 9 missing translation keys to all 17 locale files. All local checks passed.

_{View commit | Revert commit}

@roomote The translation file is currently missing, please fix it

Fixaroo See task on Roo Cloud

The translation files were already added in commit d96f9f4. Verified that all 9 translation keys are present in all 18 locales and all tests pass.

_{View commit | Revert commit}

@roomote Fix it to ensure it passes Code QA Roo Code / platform-unit-test

Fixaroo See task on Roo Cloud

Fixed the indentation issue in parser.ts. All local checks passed.

_{View commit | Revert commit}

roomote · 2025-12-30T05:14:58Z

Fixaroo See task on Roo Cloud

Fixed the reported issue. The maxChunkSize configuration is now properly applied to CodeParser. All local checks passed.

_{View commit | Revert commit}

- Modified CodeParser to accept maxBlockChars as constructor parameter - Updated FileWatcher to receive ICodeParser via constructor injection - Updated service-factory to create CodeParser with configured maxChunkSize - This ensures the codebaseIndexMaxChunkSize setting actually affects parsing

Add 9 missing translation keys to settings.json for all 17 locales: - embeddingBatchSizeLabel, embeddingBatchSizeDescription, embeddingBatchSizeResetTooltip - maxChunkSizeLabel, maxChunkSizeDescription, maxChunkSizeResetTooltip - parsingConcurrencyLabel, parsingConcurrencyDescription, parsingConcurrencyResetTooltip

github-project-automation bot added this to Roo Code Roadmap Dec 30, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Dec 30, 2025

github-project-automation bot added this to Roo Code Roadmap Dec 30, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Dec 30, 2025

roomote bot mentioned this pull request Dec 30, 2025

Feature Request: Expose Configurable Chunking and Batch Processing Parameters for Codebase Indexing #10396

Open

2 tasks

roomote bot commented Dec 30, 2025

View reviewed changes

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Dec 30, 2025

roomote added 3 commits December 30, 2025 05:19

fix: correct indentation in CodeParser

6d30caa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Expose configurable indexing parameters for codebase indexing #10397

feat: Expose configurable indexing parameters for codebase indexing #10397

Uh oh!

roomote bot commented Dec 30, 2025

Uh oh!

roomote bot commented Dec 30, 2025 •

edited

Loading

Uh oh!

roomote bot Dec 30, 2025

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Uh oh!

roomote bot Dec 30, 2025 •

edited

Loading

Uh oh!

roomote bot Dec 30, 2025 •

edited

Loading

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Uh oh!

roomote bot Dec 30, 2025 •

edited

Loading

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Uh oh!

roomote bot Dec 30, 2025 •

edited

Loading

Uh oh!

roomote bot commented Dec 30, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: Expose configurable indexing parameters for codebase indexing #10397

Are you sure you want to change the base?

feat: Expose configurable indexing parameters for codebase indexing #10397

Uh oh!

Conversation

roomote bot commented Dec 30, 2025

Summary

Changes

Implementation Details

Testing

Uh oh!

roomote bot commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

roomote bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

roomote bot Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SKIPPINGpetticoatconvent Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

roomote bot commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

roomote bot commented Dec 30, 2025 •

edited

Loading

roomote bot Dec 30, 2025 •

edited

Loading

roomote bot Dec 30, 2025 •

edited

Loading

roomote bot Dec 30, 2025 •

edited

Loading

roomote bot Dec 30, 2025 •

edited

Loading

roomote bot commented Dec 30, 2025 •

edited

Loading