[WIP] IP Vector Normalization to avoid all vectors clumped into single cluster in IVF-PQ by HowardHuang1 · Pull Request #1892 · rapidsai/cuvs

HowardHuang1 · 2026-03-07T06:42:47Z

Addresses #1875.

… cases due to reduced precision 8-bit values stored in LUT

copy-pr-bot · 2026-03-07T06:42:50Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

aamijar · 2026-03-09T22:59:53Z

/ok to test 78e47f1

aamijar · 2026-03-09T23:05:27Z

Hi @HowardHuang1, are we able to document an example in the description of the PR where vectors were clumped in a single cluster, and then verify that the fix (this PR) solves that case?

Also you can run pre-commit to fix the CI style check.

HowardHuang1 · 2026-03-09T23:15:11Z

Yes will do. I'll upload a before and after example that illustrates the fixed cluster distributions and timing improvements for search.

…d IVF-PQ IP

aamijar · 2026-03-10T22:15:48Z

/ok to test e822460

…reviously for debug only

aamijar · 2026-03-14T00:23:36Z

/ok to test da32073

…ange to only apply normalization to kmeans step

….s. raw doesn't affect ordering of clusters for nearest cluster assignment

aamijar · 2026-03-18T22:33:19Z

Hi @HowardHuang1, is this targeting release/26.04? If so, please retarget from main to release/26.04

jinsolp · 2026-03-19T02:51:14Z

Hi @HowardHuang1 you'll have to rebase on release/26.04 if you want to target that branch. 🙂

…put command parsing

jinsolp

Thanks @HowardHuang1 ! I have a few questions as I try to understand the fix in this code.

So we normalize the data, and train kmeans on this normalized data. However, instead of saving the resulting centroids (which come from the normalized space), we re-compute the proper centroids on the raw vectors (what calc_centers_and_sizes do)?
what are the changes in the cuvs bench for? I'm not so familiar with this part of the code so this is out of curiosity.
If I'm reading things right in the screenshot, we get a lower recall with the normalized vectors compared to using the raw vectors? And the search also takes longer?

jinsolp · 2026-03-20T22:25:16Z

cpp/src/neighbors/ivf_pq/ivf_pq_build.cuh

+        handle, kmeans_params, trainset_kmeans_const_view, centers_const_view, labels_view);
+      // Recompute centers in original space (mean of unnormalized trainset per cluster), overwrites centers_view
+      rmm::device_uvector<uint32_t> cluster_sizes(impl->n_lists(), stream, device_memory);
+      cuvs::cluster::kmeans::detail::calc_centers_and_sizes<float, float, internal_extents_t, uint32_t, uint32_t>(


can we use the calc_centers_and_sizes in the public namespace if possible?

Hey @jinsolp !

Yes exactly. If we were to keep the normalized centroids that would cause the computation to no longer be Inner Product and instead it will degenerate to Cosine which is not what we want. To prevent this, we use calc_centers_and_sizes to re-compute the proper centroids on the raw vectors. In essence, we only want normalization to happen in the kmeans cluster assignment step rather than the whole pipeline (normalization of the whole pipeline changes metric to Cosine).

The changes in cuvs bench are to resolve a linker issue. Will remove these when merging. They can be ignored for now.

The screenshot is a bit outdated, will replace soon with a new one. Recall should be same search should be faster.

Fixed vector normalization for Inner Product -- still failing 15 test…

ee94971

… cases due to reduced precision 8-bit values stored in LUT

HowardHuang1 requested review from a team as code owners March 7, 2026 06:42

github-project-automation bot added this to Vector Search, ML, & Data Mining Release Board Mar 7, 2026

aamijar assigned HowardHuang1 Mar 9, 2026

aamijar moved this to In Progress in Vector Search, ML, & Data Mining Release Board Mar 9, 2026

aamijar added non-breaking Introduces a non-breaking change bug Something isn't working labels Mar 9, 2026

Merge branch 'main' into HH-Vector-Normalization

78e47f1

add logging to compare recall and search speed for raw v.s. normalize…

e822460

…d IVF-PQ IP

revert change comparing normalize and raw in single run -- that was p…

da32073

…reviously for debug only

HowardHuang1 added 5 commits March 16, 2026 15:37

previously normalization was applied to entire IVF-PQ pipeline --> ch…

5375bb5

…ange to only apply normalization to kmeans step

revert to raw vectors. No need to normalize here because normalized v…

c320640

….s. raw doesn't affect ordering of clusters for nearest cluster assignment

clean up code

0486bf5

clean up code

107e2b3

upload code that resolves linker issue + live csv updates

5442b89

HowardHuang1 requested a review from a team as a code owner March 18, 2026 16:40

remove live_csv

dc9b6df

aamijar removed the request for review from a team March 18, 2026 22:32

Merge branch 'main' into HH-Vector-Normalization

cf3666c

HowardHuang1 changed the base branch from main to release/26.04 March 18, 2026 22:41

HowardHuang1 requested review from a team as code owners March 18, 2026 22:41

HowardHuang1 requested a review from AyodeAwe March 18, 2026 22:41

HowardHuang1 changed the base branch from release/26.04 to main March 18, 2026 23:41

jinsolp removed request for a team and AyodeAwe March 19, 2026 02:51

HowardHuang1 added 6 commits March 18, 2026 21:07

hardcode file path instead of searching multiple directories + fix in…

bdda881

…put command parsing

clean up unnecessary checks in data_export.py

c237db3

bring back comma parsing instead of underscore parsing

3c20377

bring back parts of plot/__main__.py for clarity

728c964

get rid of incremental JSON->CSV write for clarity

6c9bc36

bring back original plot/__main__.py for clarity

eb6bb88

jinsolp reviewed Mar 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] IP Vector Normalization to avoid all vectors clumped into single cluster in IVF-PQ #1892

[WIP] IP Vector Normalization to avoid all vectors clumped into single cluster in IVF-PQ #1892
HowardHuang1 wants to merge 17 commits intorapidsai:mainfrom
HowardHuang1:HH-Vector-Normalization

HowardHuang1 commented Mar 7, 2026 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Mar 7, 2026

Uh oh!

aamijar commented Mar 9, 2026

Uh oh!

aamijar commented Mar 9, 2026 •

edited

Loading

Uh oh!

HowardHuang1 commented Mar 9, 2026

Uh oh!

aamijar commented Mar 10, 2026

Uh oh!

aamijar commented Mar 14, 2026

Uh oh!

aamijar commented Mar 18, 2026

Uh oh!

jinsolp commented Mar 19, 2026

Uh oh!

jinsolp left a comment

Uh oh!

jinsolp Mar 20, 2026

Uh oh!

HowardHuang1 Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

HowardHuang1 commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

copy-pr-bot bot commented Mar 7, 2026

Uh oh!

aamijar commented Mar 9, 2026

Uh oh!

aamijar commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HowardHuang1 commented Mar 9, 2026

Uh oh!

aamijar commented Mar 10, 2026

Uh oh!

aamijar commented Mar 14, 2026

Uh oh!

aamijar commented Mar 18, 2026

Uh oh!

jinsolp commented Mar 19, 2026

Uh oh!

jinsolp left a comment

Choose a reason for hiding this comment

Uh oh!

jinsolp Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

HowardHuang1 Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HowardHuang1 commented Mar 7, 2026 •

edited

Loading

aamijar commented Mar 9, 2026 •

edited

Loading