Skip to content

Commit 93a5e5b

Browse files
authored
Merge pull request #348 from azais-corentin/patch-1
docs: add uniform token pricing providers to best use cases
2 parents 418ad56 + 1e211d4 commit 93a5e5b

1 file changed

Lines changed: 5 additions & 0 deletions

File tree

README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,11 @@ LLM providers like Anthropic and OpenAI cache prompts based on exact prefix matc
5454
5555
**Best use case:** Providers that count usage in requests, such as Github Copilot and Google Antigravity, have no negative price impact.
5656

57+
**Best use cases:**
58+
59+
- **Request-based billing** — Providers that count usage in requests, such as Github Copilot and Google Antigravity, have no negative price impact.
60+
- **Uniform token pricing** — Providers that bill cached tokens at the same rate as regular input tokens, such as Cerebras, see pure savings with no cache-miss penalty.
61+
5762
**Claude Subscriptions:** Anthropic subscription users (who receive "free" caching) may experience faster limit depletion than hit-rate ratios suggest due to the higher relative cost of cache misses. See [Claude Cache Limits](https://she-llac.com/claude-limits) for details.
5863

5964
## Configuration

0 commit comments

Comments
 (0)