Open
Description
Describe the bug
When the context limit is exceeded, the system does not recover gracefully.
After a context overflow error, the LLM fails to restart and the Unity Editor becomes unstable.
Steps to reproduce
- ChatBot Demo Scene
- Model: Llama 3.1 8B
- Set Context Size = 512
- Num Predict / Batch Size = 512
- Play and enter something like "Tell an absurd sci-fi story", then "continue" until the limit is hit
- LlamaLib error 400: request (523 tokens) exceeds the available context size (512 tokens), try increasing it
- Stopping and starting Play mode does not recover the model
- Editor restart required and Editor may crash on exit
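
For reference, here is a minimal sketch of the arithmetic behind the error, plus one possible way a caller could stay under the limit by dropping the oldest chat turns. This is not LLMUnity or LlamaLib code: the names `fits_context` and `trim_history`, the `reserve` parameter, and the per-turn token counts are all hypothetical, and the sketch assumes a request fails only when its token count is strictly greater than the context size, as the wording of the error message suggests.

```python
# Hypothetical sketch, not LLMUnity or LlamaLib code.

def fits_context(request_tokens: int, context_size: int) -> bool:
    """Return True when the request does not exceed the context size.

    Assumption: the request is rejected only when request_tokens is
    strictly greater than context_size.
    """
    return request_tokens <= context_size


def trim_history(turn_tokens: list[int], context_size: int, reserve: int = 0) -> list[int]:
    """Keep the most recent turns whose total token count fits the budget.

    turn_tokens holds one token count per chat turn, oldest first.
    reserve is the number of tokens set aside for the model's reply.
    Returns the token counts of the turns that are kept, oldest first.
    """
    budget = context_size - reserve
    kept: list[int] = []
    total = 0
    # Walk from the newest turn backwards and stop at the first turn
    # that would push the total over the budget.
    for count in reversed(turn_tokens):
        if total + count > budget:
            break
        kept.append(count)
        total += count
    kept.reverse()
    return kept


# The case from this report: 523 request tokens against a 512-token context.
print(fits_context(523, 512))
# Three made-up turns totalling 523 tokens: the oldest one is dropped.
print(trim_history([200, 200, 123], 512))
```

Trimming like this would only avoid hitting the limit. It does not address the actual bug reported here, which is that the model and the Editor do not recover once the error has occurred.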
LLMUnity version
3.0.1
Operating System
macOS