Truncate GenAI prompt if this exceeds 128k tokens #5774

mvilanova · 2025-02-13T20:34:57Z

Requirements Changes

Added tiktoken package to requirements
Updated several package versions:
- aiohappyeyeballs from 2.4.4 to 2.4.6
- aiohttp from 3.11.11 to 3.11.12
- boto3 from 1.36.13 to 1.36.19
- botocore from 1.36.13 to 1.36.19
- google-api-python-client from 2.160.0 to 2.161.0
- numpy from 2.2.2 to 2.2.3
- openai from 1.61.0 to 1.62.0

Functional Changes in `src/dispatch/ai/service.py`

New Constants

Added MAX_TOKENS = 128000 constant

New Functions

Added num_tokens_from_string(message: str, model: str) -> tuple[list[int], int, tiktoken.Encoding]
- Calculates token count for a given string using specified model
- Returns tokenized message, token count, and encoding object
Added truncate_prompt(tokenized_prompt: list[int], num_tokens: int, encoding: tiktoken.Encoding) -> str
- Truncates prompts that exceed the maximum token limit
- Returns truncated prompt as string

Modified Functions

Updated generate_case_signal_summary
- Added token counting and truncation logic
- Separated prompt construction from API call
Updated generate_incident_summary
- Added token counting and truncation logic before making API calls

Key Improvements

Token Management: Added functionality to handle large prompts by checking and truncating them if they exceed token limits
Better Error Handling: Improved logging for tokenization issues
Code Organization: Separated prompt construction from API calls for better maintainability

The changes primarily focus on managing token limits in AI prompts and updating dependencies to their latest versions.

Trucate GenAI prompt if this exceeds 128k tokens

25cfd3f

mvilanova added the enhancement New feature or request label Feb 13, 2025

mvilanova requested review from whitdog47 and wssheldon February 13, 2025 20:34

removes commented code

ae5c5b0

wssheldon approved these changes Feb 14, 2025

View reviewed changes

mvilanova merged commit 6e22f32 into main Feb 14, 2025
9 checks passed

mvilanova deleted the feature/llm-context-length-limit branch February 14, 2025 19:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Truncate GenAI prompt if this exceeds 128k tokens #5774

Truncate GenAI prompt if this exceeds 128k tokens #5774

Uh oh!

mvilanova commented Feb 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Truncate GenAI prompt if this exceeds 128k tokens #5774

Truncate GenAI prompt if this exceeds 128k tokens #5774

Uh oh!

Conversation

mvilanova commented Feb 13, 2025

Requirements Changes

Functional Changes in src/dispatch/ai/service.py

Key Improvements

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Functional Changes in `src/dispatch/ai/service.py`