Prevent nested parallelism in HNSW bench#1895

Open
julianmi wants to merge 4 commits into rapidsai:main from julianmi:hnswlib-bench-threading

Conversation

@julianmi
Contributor

@julianmi julianmi commented Mar 9, 2026

Setting both the gbench number of threads and the HNSWlib config number of threads can lead to nested parallelism. This patch forces one of two configurations: throughput mode, which uses multiple gbench threads, or latency mode, which uses batch parallelism. Additionally, there is significant overhead in going through the thread pool, so the `search` method now skips it in order to handle a single-query batch efficiently.

- Setting both the gbench number of threads and the HNSWlib config number of threads can lead to nested parallelism. Force either throughput mode using multiple gbench threads or latency mode using batch parallelism.
- Added a check in the `search` method to handle a single-query batch size efficiently. There is significant overhead in going through the thread pool.
@julianmi julianmi requested a review from a team as a code owner March 9, 2026 14:18
@aamijar aamijar added non-breaking Introduces a non-breaking change improvement Improves an existing functionality labels Mar 9, 2026
Member

@aamijar aamijar left a comment


Hi @julianmi, what is the UX for using multiple threads in the HNSW bench? Does the user set the gbench threads parameter, or the num_threads_ parameter?

@achirkin
Contributor

To answer @aamijar

In latency mode, gbench measures how long it takes to execute a single search call for the given algorithm and batch size. In this mode, gbench is always single-threaded. To make use of the whole CPU, HNSW has its own threading logic. This makes the HNSW measurements more realistic and a fairer comparison against GPU algorithms.

In throughput mode, gbench measures how many requests the given algorithm can serve per second, so gbench provides independent threads to make the search calls. This clashes with the internal HNSW threading. Because gbench creates its threads and manages batching outside the measured benchmark loop, HNSW performance generally looks better with gbench threads than with the internal threads. Hence we simply disable internal batching completely in throughput mode.

