Add token limit handling to prevent 8k token overflow #8

YosefHayim · 2026-01-18T08:41:47Z

This PR adds token limit handling to prevent API failures when large git diffs exceed the 8k token limit.

Problem

The GitHub Models API enforces an 8k token limit on the entire request
Large git diffs can exceed this limit, causing the API call to fail
Users hit these failures when staging large changes

Solution

Added token estimation using a character-based heuristic (1 token ≈ 4 chars)
Implemented truncation logic that cuts only on UTF-8 character boundaries
Added content prioritization when over the limit: examples get a fixed share of the budget and changes are truncated to fit the rest
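
The estimation and truncation steps above could look roughly like the following. This is a minimal sketch, not the code from this PR: Go is assumed from the `GenerateCommitMessage()` naming, the function signatures are guesses, "chars" is counted as bytes, and the `"..."` marker is a placeholder for whatever ellipsis indicator the PR actually uses.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// charsPerToken is the heuristic from the PR description: 1 token ≈ 4 chars.
const charsPerToken = 4

// ellipsis marks truncated output. Assumed marker; the PR does not show it.
const ellipsis = "..."

// estimateTokens approximates the token count, rounding up.
// Assumption: "chars" means bytes here; the PR may count runes instead.
func estimateTokens(text string) int {
	return (len(text) + charsPerToken - 1) / charsPerToken
}

// truncateToTokenLimit shortens text so that its estimate, including the
// ellipsis, fits within maxTokens. It never cuts inside a multi-byte rune.
// The second return value reports whether truncation happened.
func truncateToTokenLimit(text string, maxTokens int) (string, bool) {
	if maxTokens <= 0 {
		return "", len(text) > 0
	}
	if estimateTokens(text) <= maxTokens {
		return text, false
	}
	// Byte budget left for content once the ellipsis is accounted for.
	cut := maxTokens*charsPerToken - len(ellipsis)
	// Back up until the cut lands on the first byte of a rune.
	for cut > 0 && !utf8.RuneStart(text[cut]) {
		cut--
	}
	return text[:cut] + ellipsis, true
}

func main() {
	diff := "diff --git a/main.go b/main.go\n+// héllo wörld\n"
	out, truncated := truncateToTokenLimit(diff, 5)
	fmt.Printf("estimated tokens: %d\n", estimateTokens(diff))
	fmt.Printf("truncated=%v result=%q\n", truncated, out)
}
```

Rounding the estimate up errs on the side of truncating slightly early, which is the safer direction when the real tokenizer is unknown.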

Implementation Details

estimateTokens(): Approximates the token count of any text content
truncateToTokenLimit(): Safely truncates text and appends an ellipsis indicator
Modified GenerateCommitMessage() to:
- Estimate tokens for prompt templates + changes + examples
- Reserve tokens for templates (with buffer)
- Prioritize examples (20% of remaining tokens) when present
- Truncate changes to fit remaining budget
- Display warning when truncation occurs
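
The budgeting steps listed above can be sketched as follows. This is an illustration under stated assumptions rather than the PR's implementation: `planBudget`, the `budget` type, the 8000-token ceiling, and the 200-token buffer are hypothetical, and examples are assumed to hand any unused part of their 20% share back to the changes.

```go
package main

import (
	"fmt"
	"os"
)

const (
	maxRequestTokens = 8000 // "8k" from the PR description; exact value assumed
	templateBuffer   = 200  // hypothetical safety margin; the PR gives no size
	examplesPercent  = 20   // share of the remaining budget reserved for examples
	charsPerToken    = 4    // 1 token ≈ 4 chars
)

// budget holds the token allowance for each truncatable part of the prompt.
type budget struct {
	Examples int
	Changes  int
}

// estimateTokens approximates the token count, rounding up.
func estimateTokens(text string) int {
	return (len(text) + charsPerToken - 1) / charsPerToken
}

// planBudget reserves tokens for the templates plus a buffer, gives examples
// at most examplesPercent of what remains, and leaves the rest for changes.
func planBudget(templateTokens, exampleTokens int) budget {
	remaining := maxRequestTokens - templateTokens - templateBuffer
	if remaining < 0 {
		remaining = 0
	}
	examples := 0
	if exampleTokens > 0 {
		examples = remaining * examplesPercent / 100
		if exampleTokens < examples {
			examples = exampleTokens // unused share goes back to changes
		}
	}
	return budget{Examples: examples, Changes: remaining - examples}
}

func main() {
	template := "Write a commit message for the following changes:\n"
	examples := "feat: add login\nfix: handle nil pointer\n"
	changes := "diff --git a/main.go b/main.go\n..."

	b := planBudget(estimateTokens(template), estimateTokens(examples))
	fmt.Printf("examples budget: %d tokens, changes budget: %d tokens\n", b.Examples, b.Changes)

	// Warn the user when the changes will not fit their share.
	if estimateTokens(changes) > b.Changes {
		fmt.Fprintln(os.Stderr, "warning: changes were truncated to fit the token limit")
	}
}
```

With no examples present, the whole remaining budget goes to the changes, which covers the changes-only scenario.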

Benefits

Prevents API failures from token overflow
Keeps as much of the changes and examples as the token budget allows
Warns the user clearly whenever content is truncated
No external dependencies, follows existing code style
Gracefully handles both changes-only and changes+examples scenarios

Testing

Code compiles successfully
Follows existing project patterns and style
Token estimation uses the common 4-characters-per-token heuristic


YosefHayim force-pushed the main branch from dac7a36 to 81c62aa · January 18, 2026 08:42

YosefHayim force-pushed the main branch from 81c62aa to f76b2b0 · January 18, 2026 08:42
