Description
Hello 👋 ,
I am writing to propose a new feature for OpenWeb UI Monitor that would let users account for the caching costs of requests made to large language model (LLM) APIs such as OpenAI and Anthropic.
Many users rely on these APIs for their projects, so it is essential to consider the associated costs, especially in long conversations, which can quickly become expensive without proper caching. Currently, OpenWeb UI Monitor does not provide an option to automatically structure prompts to use cache_control, which can lead to higher costs for maintainers.
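To make the request concrete, here is a minimal sketch of what "structuring prompts to use cache_control" could look like for Anthropic-style requests. It assumes the Anthropic Messages API convention of attaching `"cache_control": {"type": "ephemeral"}` to content blocks. The function name `add_cache_breakpoints` and the choice of breakpoints (the system prompt and the last message) are my own illustration, not anything OpenWeb UI Monitor currently implements.

```python
from copy import deepcopy

# Anthropic-style cache marker; attached to a content block to end a cacheable prefix.
EPHEMERAL = {"type": "ephemeral"}


def to_blocks(content):
    """Normalize message content to a list of content blocks."""
    if isinstance(content, str):
        return [{"type": "text", "text": content}]
    return deepcopy(content)  # avoid mutating the caller's blocks


def add_cache_breakpoints(system_prompt, messages):
    """Hypothetical helper: return (system, messages) with cache_control set.

    Marks the system prompt and the final block of the last message, so the
    conversation prefix up to that point can be reused on the next turn.
    """
    system = [
        {"type": "text", "text": system_prompt, "cache_control": dict(EPHEMERAL)}
    ]
    out = [
        {"role": m["role"], "content": to_blocks(m["content"])} for m in messages
    ]
    if out and out[-1]["content"]:
        out[-1]["content"][-1]["cache_control"] = dict(EPHEMERAL)
    return system, out
```

The returned `system` and `messages` values would then be passed as the corresponding fields of the request body. Only two breakpoints are set here because the API limits how many a single request may carry. A real implementation would likely make the placement configurable.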