Feature Request: Core support for Semantic Caching (Redis HNSW / Vector Search) #570

AnkitNeupane007 · 2026-05-09T12:45:41Z

With the growing adoption of LLM/RAG applications built on FastAPI, I’ve noticed that traditional exact-key caching becomes less effective for prompt-driven workloads.

For example:

“How do I reset my password?”
“How can I change my password?”

These prompts are semantically equivalent and should receive the same response, but exact-match caching stores them as separate entries, so the second one is a cache miss.
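To make the miss concrete, here is a minimal sketch of an exact-match key. `exact_cache_key` is a hypothetical helper for illustration only, not the key builder that fastapi-cache actually uses; it simply hashes the raw prompt text.

```python
import hashlib


def exact_cache_key(prompt: str) -> str:
    """Hypothetical exact-match key: a SHA-256 hash of the raw prompt text."""
    return hashlib.sha256(prompt.encode("utf-8")).hexdigest()


key_a = exact_cache_key("How do I reset my password?")
key_b = exact_cache_key("How can I change my password?")

# Any difference in wording yields a different key, so the second prompt
# cannot reuse the entry stored for the first one.
keys_differ = key_a != key_b
```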

I’ve been experimenting with a semantic caching layer built on top of Redis vector search (HNSW + cosine similarity), and I think this capability could fit nicely as an optional extension to fastapi-cache.

Conceptually, the flow looks like:

1. Intercept the request
2. Generate an embedding from the request payload/prompt (user-provided embedding function)
3. Perform a vector similarity lookup against the cache entries
4. Return the cached response if the similarity exceeds a threshold
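The flow above can be sketched without Redis at all. The following is an in-memory illustration, not the prototype: it replaces the HNSW index with a brute-force linear scan, and `InMemorySemanticCache`, `toy_embed`, `VOCAB`, and the threshold value are all assumptions made up for the example. A real embedding function would come from the user.

```python
import math
from typing import Callable, List, Optional, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class InMemorySemanticCache:
    """Hypothetical cache: linear scan over stored vectors (no HNSW, no Redis)."""

    def __init__(self, embed: Callable[[str], Vector], threshold: float) -> None:
        self._embed = embed
        self._threshold = threshold
        self._entries: List[Tuple[Vector, str]] = []

    def set(self, prompt: str, response: str) -> None:
        self._entries.append((self._embed(prompt), response))

    def get(self, prompt: str) -> Optional[str]:
        query = self._embed(prompt)
        best_score, best_response = -1.0, None
        for vector, response in self._entries:
            score = cosine_similarity(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        # Only the closest entry is considered, and only if it clears the threshold.
        return best_response if best_score >= self._threshold else None


# Toy stand-in for a real embedding model: word counts over a tiny vocabulary.
VOCAB = ("password", "reset", "change", "invoice", "download")


def toy_embed(text: str) -> List[float]:
    words = text.lower().replace("?", "").split()
    return [float(words.count(word)) for word in VOCAB]


cache = InMemorySemanticCache(embed=toy_embed, threshold=0.4)
cache.set("How do I reset my password?", "Use the reset link.")

hit = cache.get("How can I change my password?")      # shares "password" with the stored prompt
miss = cache.get("Where can I download my invoice?")  # no overlap with the stored prompt
```

With this toy embedding the two password prompts score roughly 0.5, which is why the example threshold is set so low. With a real embedding model the threshold would need tuning per workload, since a value that is too permissive returns wrong answers rather than just slow ones.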

I currently have a working prototype using redis.commands.search and would be interested in adapting it into a PR if the maintainers think this aligns with the project direction.

A few architectural questions before proceeding:

1. Scope

Would semantic caching be considered within the scope of fastapi-cache, or would maintainers prefer it live as a separate extension package?

2. Backend API

The current Backend abstraction is key-oriented (get_with_ttl(key: str)).

Since semantic caching requires vector queries + similarity scores, would a separate interface (e.g. SemanticBackend) make more sense to avoid complicating existing backends?
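As a starting point for discussion, a separate interface might look roughly like the sketch below. Everything here is hypothetical: `SemanticBackend`, `SemanticHit`, the method names `query` and `store`, and their parameters are placeholders, not existing fastapi-cache APIs. The methods are async on the assumption that a semantic backend would follow the same async style as the key-oriented one.

```python
from dataclasses import dataclass
from typing import Optional, Protocol, Sequence, runtime_checkable


@dataclass(frozen=True)
class SemanticHit:
    """Hypothetical lookup result: the cached payload plus its similarity score."""

    value: bytes
    score: float


@runtime_checkable
class SemanticBackend(Protocol):
    """Hypothetical vector-oriented counterpart to a key-oriented backend."""

    async def query(
        self, vector: Sequence[float], threshold: float
    ) -> Optional[SemanticHit]:
        """Return the closest entry whose score meets the threshold, else None."""
        ...

    async def store(
        self, vector: Sequence[float], value: bytes, expire: Optional[int] = None
    ) -> None:
        """Store a payload under its embedding, with an optional TTL in seconds."""
        ...


class NullSemanticBackend:
    """Trivial implementation that never hits; useful as a test double."""

    async def query(
        self, vector: Sequence[float], threshold: float
    ) -> Optional[SemanticHit]:
        return None

    async def store(
        self, vector: Sequence[float], value: bytes, expire: Optional[int] = None
    ) -> None:
        return None
```

Keeping this apart from the existing abstraction means key-oriented backends would not need to grow vector methods they cannot implement.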

3. Decorator/API Surface

Since embeddings must be generated before lookup, would a dedicated decorator such as @semantic_cache(...) be preferable over extending @cache?
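For a sense of the API surface, here is a minimal sketch of what a dedicated decorator could look like. It is an assumption-laden illustration: the `semantic_cache` signature, the `embed` and `threshold` parameters, and the per-function in-process list are all invented for this example, and it only handles async functions that take a single `prompt` string.

```python
import asyncio
import functools
import math
from typing import Awaitable, Callable, List, Sequence, Tuple

Vector = Sequence[float]
Handler = Callable[[str], Awaitable[str]]


def _cosine(a: Vector, b: Vector) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def semantic_cache(
    embed: Callable[[str], Vector], threshold: float = 0.9
) -> Callable[[Handler], Handler]:
    """Hypothetical decorator: reuse a stored result for sufficiently similar prompts."""

    def decorator(func: Handler) -> Handler:
        entries: List[Tuple[Vector, str]] = []  # in-process store, one per function

        @functools.wraps(func)
        async def wrapper(prompt: str) -> str:
            query = embed(prompt)  # the embedding is computed before any lookup
            best_score, best_value = -1.0, None
            for vector, value in entries:
                score = _cosine(query, vector)
                if score > best_score:
                    best_score, best_value = score, value
            if best_value is not None and best_score >= threshold:
                return best_value
            result = await func(prompt)
            entries.append((query, result))
            return result

        return wrapper

    return decorator


def topic_embed(text: str) -> List[float]:
    """Toy embedding: one axis for password questions, one for everything else."""
    return [1.0, 0.0] if "password" in text.lower() else [0.0, 1.0]


calls: List[str] = []


@semantic_cache(embed=topic_embed, threshold=0.9)
async def answer(prompt: str) -> str:
    calls.append(prompt)  # records how often the wrapped function really runs
    return "answer to: " + prompt


first = asyncio.run(answer("How do I reset my password?"))
second = asyncio.run(answer("How can I change my password?"))  # served from cache
third = asyncio.run(answer("Where is my invoice?"))            # different topic, runs
```

A separate decorator keeps the embedding step and the threshold out of the exact-match path, so existing `@cache` users would see no behavior change.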

4. Dependencies

My assumption is that embedding generation should remain entirely user-supplied (callable injection/configuration) so the library itself avoids heavyweight ML dependencies.
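One way to keep the library dependency-free is to accept any callable that maps text to a vector, sync or async, and normalize the result internally. The sketch below assumes hypothetical names (`Embedder`, `resolve_embedding`) and uses only the standard library; the two example embedders are trivial stand-ins for a real model or API client.

```python
import asyncio
import inspect
from typing import Awaitable, Callable, List, Sequence, Union

Vector = Sequence[float]
# Hypothetical type: a user-supplied callable, either sync or async.
Embedder = Callable[[str], Union[Vector, Awaitable[Vector]]]


async def resolve_embedding(embed: Embedder, text: str) -> List[float]:
    """Call the user's embedder, awaiting the result if it is awaitable."""
    result = embed(text)
    if inspect.isawaitable(result):
        result = await result
    return [float(x) for x in result]


def sync_embed(text: str) -> List[float]:
    """Stand-in for a local model call."""
    return [float(len(text)), 1.0]


async def async_embed(text: str) -> List[float]:
    """Stand-in for a remote embedding API call."""
    return [float(len(text)), 1.0]


from_sync = asyncio.run(resolve_embedding(sync_embed, "abc"))
from_async = asyncio.run(resolve_embedding(async_embed, "abc"))
```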

Happy to share implementation details or open a draft PR if there’s interest.

Replies: 0 comments
