Add interface to do prefetch on KnnVectorValues and an example implementation to use prefetch in Scorer #15722

Open

navneet1v wants to merge 1 commit into apache:main from navneet1v:main

Conversation

@navneet1v
Contributor

@navneet1v navneet1v commented Feb 18, 2026

Description

This change adds the ability to prefetch on KnnVectorValues. Along with the change, there is a sample implementation showing how to create a PrefetchableFlatVectorScorer that can wrap any Scorer and prefetch before doing any bulk scoring. Places where this prefetch can be useful:

  1. When there is memory contention.
  2. When the underlying storage is slow (say, a remote store): calling prefetch can start pulling vectors into the cache and then into RAM.
  3. The current HNSW-based search is not impacted, since the prefetch interfaces are not called there.

The changes include:

  1. New prefetch interface on KNNVectorValues.
  2. Only prefetch if numOfOrds > 1
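
For illustration, the new hook could look roughly like the sketch below. This is a self-contained mock: `PrefetchableVectorValues` and `RecordingVectorValues` are hypothetical stand-ins, not the actual Lucene KnnVectorValues API. The point is only that prefetch defaults to a no-op, so existing implementations are unaffected:

```java
// Hypothetical sketch: prefetch() as a no-op default on the vector values,
// so only implementations that care (e.g. backed by slow storage) override it.
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

abstract class PrefetchableVectorValues {
  /** Hint that the vectors for the given ordinals will be read soon. */
  public void prefetch(int... ords) throws IOException {
    // no-op by default: existing implementations are unaffected
  }
}

/** Example override that just records which ordinals were requested. */
class RecordingVectorValues extends PrefetchableVectorValues {
  final List<Integer> requested = new ArrayList<>();

  @Override
  public void prefetch(int... ords) {
    for (int ord : ords) {
      requested.add(ord); // a real impl would issue an async read hint here
    }
  }
}
```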

Issue

#15286

@navneet1v
Contributor Author

@mikemccand, @vigyasharma can you please take a look and let me know what you think of the interfaces? Another approach I had in mind was to create a separate type of scorer, a PrefetchableScorer, that different systems can wrap with if they want to do prefetch.

@navneet1v navneet1v force-pushed the main branch 4 times, most recently from 1300834 to f27d3a3 on February 19, 2026 09:03
@navneet1v navneet1v requested a review from mccullocht February 19, 2026 09:03
@benwtrent
Member

I don't mind the interface. But it does seem that the "scorer.bulkScore" should "just do the right thing" and prefetch if necessary.

I think this will harm the "preferred path", which is the one where all vectors are in MMAP'd space. We need to benchmark with quantized vectors, where the vector ops during traversal are magnitudes cheaper, not just floating point.

In the aggressively nasty path, I am not sure prefetching 16-32 vectors at a time will give us much; against modern IO throughput numbers, that is barely scratching the surface (4kb * 16 if using raw vectors, and nobody should be using raw vectors anymore; way lower memory numbers if quantized...). If we are prefetching during graph traversal due to not having enough memory, I think we need to prefetch WAY more aggressively: prefetching multiple candidate neighbors into the future, even ones we won't end up scoring :/.

All in all, I think if we want HNSW to act nice in low memory, we need to:

  • invest more in the Bipartite ordering of vectors so that neighbors are near each other on disk
  • Prefetch larger blocks hoping to get many vectors we care about besides the immediate ordinals.

@navneet1v
Contributor Author

We need to benchmark with quantized vectors, where the vector ops during traversal are magnitudes cheaper, not just floating point.

I am working on the benchmarks.

Prefetch larger blocks hoping to get many vectors we care about besides the immediate ordinals.

The basic implementation I am thinking of combines a few ordinals into a block, say 128KB (configurable), to do the prefetch.

Would it be better if I removed prefetch from the scorer and graph searcher and moved it only to KNNVectorValues? That way, at least for rescoring, or in their own scorers, applications can use prefetch.
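
The block-coalescing idea could be sketched like this; `BlockCoalescer` is a hypothetical helper (not part of this PR), and the 128KB figure is just the configurable default mentioned above:

```java
import java.util.LinkedHashSet;
import java.util.Set;

/** Hypothetical helper: map each ordinal to its enclosing fixed-size block so
 *  that one prefetch call per block replaces one call per ordinal. */
class BlockCoalescer {
  static Set<Long> blocksFor(int[] ords, int bytesPerVector, long blockSize) {
    Set<Long> blocks = new LinkedHashSet<>();
    for (int ord : ords) {
      long offset = (long) ord * bytesPerVector; // byte offset of this vector
      blocks.add(offset / blockSize);            // index of the enclosing block
    }
    return blocks;
  }
}
```

With 4KB vectors and 128KB blocks, nearby ordinals collapse into the same block, so a batch of candidate neighbors triggers far fewer prefetch calls.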

@mccullocht
Contributor

I don't mind the interface. But it does seem that the "scorer.bulkScore" should "just do the right thing" and prefetch if necessary.

Agreed, I think we'd like prefetching to be internal to the scorer.

I think this will harm the "preferred path", which is the one where all vectors are in MMAP'd space. We need to benchmark with quantized vectors, where the vector ops during traversal are magnitudes cheaper, not just floating point.

Is this ultimately a result of the behavior of mmap input prefetch()? The current implementation does a synchronous madvise() syscall, which I would expect to harm performance when everything is in memory, and may be worse than doing a read when the data has spilled to storage.

If we are prefetching during graph traversal due to not having enough memory, I think we need to prefetch WAY more aggressively. Like, prefetching multiple candidate neighbors in the future (even ones we won't end up scoring :/).

Popping multiple candidates in the graph searcher like this is the suggested approach for parallelizing IO in the DiskANN paper. This would be another query parameter, unless we can reliably detect low-memory conditions, which is challenging if you let mmap manage caching :/.

@navneet1v
Contributor Author

navneet1v commented Feb 19, 2026

I don't mind the interface. But it does seem that the "scorer.bulkScore" should "just do the right thing" and prefetch if necessary.

Agreed, I think we'd like prefetching to be internal to the scorer.

Thanks @benwtrent and @mccullocht for your input. I have no concerns with doing prefetch inside the scorer.

How does this plan sound?

  1. Raise a PR where we open up the interfaces on KNNVectorValues for Prefetch.
  2. Another PR: Add a new scorer that can do prefetch in the bulkScore function.

Can we consider this approach?

The current implementation does a synchronous madvise() syscall which I would expect to harm performance when everything is in memory and may be worse than doing a read when the data has spilled to storage.

@mccullocht it is async. We make the call and then immediately return. Is your concern mainly that the call happens on the synchronous path rather than on a separate thread?

This would be a another query parameter unless we can reliably detect low memory conditions which is challenging if you let mmap manage caching :/.

I think we can leave that to the application: if we provide an implementation like a PrefetchableScorer wrapping a RandomScorer, then during search an application can choose a scorer as per its needs. Is that something we can consider?

@mccullocht
Contributor

The current implementation does a synchronous madvise() syscall which I would expect to harm performance when everything is in memory and may be worse than doing a read when the data has spilled to storage.

@mccullocht it is async. We make the call and then immediately return. Is your comment mainly that the call is in sync path rather than in a separate thread?

Syscalls are relatively expensive. If the data is already in memory, adding a syscall for every vector is going to be noticeable. I believe this is what Ben means when he says it will harm the "preferred path". I'm not sure if it would make a difference to run the syscall in the background. In any case I'm guessing you'll see this in benchmarks when the data set is small enough to fit in RAM.

I think we can leave that to the application: if we provide an implementation like a PrefetchableScorer wrapping a RandomScorer, then during search an application can choose a scorer as per its needs. Is that something we can consider?

Indicating this preference has to fit through the interface in KnnVectorsReader.search() to override or wrap the vector scorer. I don't have any ideas that are really clean or obvious. Maybe the collector could contain a function that wraps the scorer?
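
The "collector carries a wrapping function" idea might look like the sketch below; `KnnCollectorSketch` and `VectorScorerSketch` are illustrative stand-ins, not the real Lucene KnnCollector API:

```java
import java.util.function.UnaryOperator;

/** Illustrative stand-in, not the real Lucene scorer type. */
interface VectorScorerSketch {
  float score(int ord);
}

/** The collector carries a function that gets to wrap whatever scorer the
 *  reader builds, e.g. with a prefetching decorator chosen by the app. */
class KnnCollectorSketch {
  private final UnaryOperator<VectorScorerSketch> scorerWrapper;

  KnnCollectorSketch(UnaryOperator<VectorScorerSketch> scorerWrapper) {
    this.scorerWrapper = scorerWrapper;
  }

  /** Called by the reader on the scorer it constructed. */
  VectorScorerSketch wrap(VectorScorerSketch in) {
    return scorerWrapper.apply(in);
  }
}
```

The default wrapper would be the identity function, so readers that ignore prefetch see no behavior change.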

@navneet1v
Contributor Author

Hi @mccullocht and @benwtrent

I took another pass at the code: I cleaned the prefetch logic out of HNSWGraphSearcher and kept only the prefetch-related interfaces on KNNVectorValues.

I have also added an example implementation of how to do prefetch (PrefetchableFlatVectorsScorer). This should keep things clean and separated. Please let me know your thoughts on this.

@navneet1v navneet1v changed the title from "Add the ability to do prefetch and do prefetch during vector search" to "Add interface to do prefetch on KnnVectorValues and an example implementation to use prefetch in Scorer" on Feb 20, 2026
@navneet1v
Contributor Author

@mccullocht, @benwtrent can I get a review on this PR? Since I have moved the prefetching logic out, there should be no impact on the current HNSW algorithm.

Comment on lines +109 to +112
static class PrefetchableRandomVectorScorer
extends RandomVectorScorer.AbstractRandomVectorScorer {

private final RandomVectorScorer.AbstractRandomVectorScorer randomVectorScorer;
Contributor

There were some earlier issues discussing the correct behavior for wrapper classes when the underlying abstract class might change. I think there might be some value in adding a unit test for this class similar to TestFilterWeight that uses reflection to make sure that all methods are delegated.
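
In the spirit of that TestFilterWeight approach, such a reflection check could be sketched as below; the Base/Wrapping classes are toy stand-ins for the real scorer classes:

```java
import java.lang.reflect.Method;
import java.lang.reflect.Modifier;

/** Toy stand-ins for the scorer classes under discussion. */
abstract class BaseVectorScorer {
  public abstract float score(int ord);
  public void prefetch(int ord) {} // base class might gain methods later
}

class WrappingVectorScorer extends BaseVectorScorer {
  private final BaseVectorScorer in;
  WrappingVectorScorer(BaseVectorScorer in) { this.in = in; }
  @Override public float score(int ord) { return in.score(ord); }
  @Override public void prefetch(int ord) { in.prefetch(ord); }
}

/** Reflection check: every public instance method of the base class must be
 *  re-declared by the wrapper, so new base methods can't silently bypass it. */
class DelegationCheck {
  static void assertAllMethodsOverridden(Class<?> base, Class<?> wrapper) {
    for (Method m : base.getDeclaredMethods()) {
      int mods = m.getModifiers();
      if (!Modifier.isPublic(mods) || Modifier.isStatic(mods) || Modifier.isFinal(mods)) {
        continue;
      }
      try {
        wrapper.getDeclaredMethod(m.getName(), m.getParameterTypes());
      } catch (NoSuchMethodException e) {
        throw new AssertionError(
            wrapper.getSimpleName() + " does not delegate " + m.getName());
      }
    }
  }
}
```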

Contributor Author

This is a good point; let me try to add a test for this.

Comment on lines +92 to +100
@Override
public UpdateableRandomVectorScorer scorer() throws IOException {
return this.randomVectorScorerSupplier.scorer();
}

@Override
public RandomVectorScorerSupplier copy() throws IOException {
return new PrefetchableRandomVectorScorerSupplier(randomVectorScorerSupplier.copy());
}
Contributor

Does this PrefetchableRandomVectorScorerSupplier actually do anything? I don't see it producing a PrefetchableRandomVectorScorer.

Was the scorer() method supposed to do that? (But that means that PrefetchableRandomVectorScorer needs to implement UpdateableRandomVectorScorer -- maybe that's not a problem?)

Contributor Author

@msfroh I think you pointed out the right thing; I missed it. When I looked at it in a bit more detail, I found that my intention while making the change was to prefetch for search, not specifically for indexing. As written,

public UpdateableRandomVectorScorer scorer() throws IOException {
  return this.randomVectorScorerSupplier.scorer();
}

does not return a Prefetchable implementation of UpdateableRandomVectorScorer, since I didn't want prefetching during indexing as of now. But let me add it; I don't see any harm here. :) Thank you for this. :)
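
A sketch of that fix, with simplified stand-in interfaces (not the real UpdateableRandomVectorScorer types): the supplier wraps the delegate's scorer instead of returning it unwrapped:

```java
import java.io.IOException;

/** Simplified stand-ins for the real scorer/supplier interfaces. */
interface ScorerSketch {
  float score(int ord) throws IOException;
}

interface ScorerSupplierSketch {
  ScorerSketch scorer() throws IOException;
}

/** Decorator that issues a prefetch hint before delegating the score call. */
class PrefetchingScorer implements ScorerSketch {
  private final ScorerSketch in;
  PrefetchingScorer(ScorerSketch in) { this.in = in; }

  void prefetch(int ord) {
    // a real implementation would issue the async read hint here
  }

  @Override
  public float score(int ord) throws IOException {
    prefetch(ord);
    return in.score(ord);
  }
}

/** The fix under discussion: scorer() wraps the delegate's scorer rather than
 *  returning it unwrapped, so indexing-time scoring also gets prefetch. */
class PrefetchingSupplier implements ScorerSupplierSketch {
  private final ScorerSupplierSketch in;
  PrefetchingSupplier(ScorerSupplierSketch in) { this.in = in; }

  @Override
  public ScorerSketch scorer() throws IOException {
    return new PrefetchingScorer(in.scorer());
  }
}
```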
