Concurrent graph modifications by razdoburdin · Pull Request #321 · intel/ScalableVectorSearch

razdoburdin · 2026-04-27T15:58:29Z

This PR introduces concurrent graph modifications with seqlock pattern.

rfsaliev

IMHO it would be better to keep both synchronized and non-synchronized graphs.
For example, synchronization is not needed for static Vamana index, but gives overhead.

rfsaliev · 2026-04-29T08:20:20Z

    using index_type = Idx;
    using value_type = std::span<Idx>;
-    using const_value_type = std::span<const Idx>;
+    using const_value_type = AtomicSpan<const Idx>;


For better flexibility and allow user to select syncronized/non-syncronized index kind, I would define a dedicated SyncronizedGraphBase class.

I am not sure this duplication is really required. Performance penalty for synchronized vs non-synchronized search is only few precents.

Hi @razdoburdin, is duplication too bad in this case? Otherwise, @rfsaliev may have a point on flexibility and it's always better if performance is not affected. But agree with you that pros and cons should be discussed if duplication is a large overhead.

I plan to investigate the trade-offs, after the finalization of design of concurrent path. Let's make sure it works well first.

To get valuable results, I would recommend to benchmark 'static' VamanaIndex with and without synchronized graph on different platforms - especially on multi-socket systems.

rfsaliev · 2026-04-29T08:26:36Z

            if (is_deleted(dst)) {
-                const auto& others = graph_.get_node(dst);
-                all_candidates.insert(others.begin(), others.end());
+                // SeqLock retry: a concurrent consolidate may be writing dst's


GraphConsolidator class is parametrized by a graph type, so we can keep both syncronized/non-synchronized behavior by detecting graph type.

mihaic · 2026-05-02T01:40:46Z

While testing this PR in Redis, I initially removed all locking; that made a test invoking compact and delete_entries fail (runGCParallel). Claude says:

It was a specific interaction between delete_entries (which marks nodes as Deleted via atomic CAS) and compact() (which remaps adjacency lists). When compact() builds its old_to_new_id_map from only Valid nodes, any node that gets marked Deleted concurrently won't be in that map. If a Valid node's adjacency list references the now-Deleted node, old_to_new_id_map.at() throws "Couldn't find key."

Tests do pass when search locks are removed.

mihaic · 2026-05-28T18:09:38Z

@razdoburdin, I encountered another crash in Redis. What do you think about the following analysis from Claude?

Root Cause

Race condition between compact() and concurrent delete_entries() during
GC.

The crash path (in dynamic_index.h:977):

runGC() calls consolidate() — prunes currently-Deleted nodes from adjacen
cy lists
Concurrently, deleteVector() or updateSVSIndex() phase 4 calls delet e_entries() → atomically marks node X as Deleted (was Valid during consolidate,
so X is still referenced in other nodes' adjacency lists)
runGC() calls compact():
- nonmissing_indices() collects only Valid slots → node X is excluded (it's now Deleted)
- old_to_new_id_map doesn't contain X
- When remapping adjacency lists, valid node Y still references X → old_to_new_id_map.at(X) throws "Couldn't find key"

The translator_mutex_ doesn't help because:

delete_entries() holds only a shared lock (line 842)
compact() holds no lock during the parallel_for graph remapping (lines 959-984)
Only the finishing steps (line 1001) take an exclusive lock

Fix

The minimal fix is making compact() resilient to concurrently-deleted neighbors. A neighbor missing from old_to_new_id_map was concurrently deleted — dropping it from the adjacency list is semantically correct (the next GC cycle would do the same via consolidate).

Now let me also check if the entry_point_ remap in the finishing steps has the same issue (line 1025).

The entry_point remap at line 1025 is inside the exclusive lock so it's safe from new concurrent deletes at that point. However, the same race could affect it if the entry_point was deleted between consolidate and nonmissing_indices.

Summary

The crash: svs::threads::ThreadingException — "Couldn't find key" thrown from compact() in dynamic_index.h:977 during SVS GC.

Root cause: A race condition between compact() (GC) and concurrent delete_entries() (from deleteVector or updateSVSIndex phase 4). After mainIndexGuard was removed:

consolidate() prunes adjacency lists of currently-Deleted nodes
A concurrent delete marks node X as Deleted (was Valid during consolidate, still in valid nodes' adjacency lists)
compact() builds old_to_new_id_map from only Valid nodes (X excluded)
When remapping adjacency lists: old_to_new_id_map.at(X) throws because X isn't in the map

Fix: Changed the std::transform + .at() in compact() to use .find() and skip neighbors not in the map. A missing neighbor is a concurrently-deleted node — dropping it is correct (equivalent to what the next consolidate() would do). The graph remains structurally sound; it simply has slightly fewer edges on the affected nodes until the next full GC cycle.

Proposed change:

diff --git a/include/svs/index/vamana/dynamic_index.h b/include/svs/index/vamana/dynamic_index.h
index 5cfcc7d..7e185ef 100644
--- a/include/svs/index/vamana/dynamic_index.h
+++ b/include/svs/index/vamana/dynamic_index.h
@@ -966,17 +966,17 @@ class MutableVamanaIndex {
                         auto old_id = new_to_old_id_map[new_id];
 
                         const auto& list = graph_.get_node(old_id);
-                        buffer.resize(list.size());
-
-                        // Transform the adjacency list from old to new.
-                        std::transform(
-                            list.begin(),
-                            list.end(),
-                            buffer.begin(),
-                            [&old_to_new_id_map](Idx old_id) {
-                                return old_to_new_id_map.at(old_id);
+                        buffer.clear();
+                        buffer.reserve(list.size());
+
+                        // Remap adjacency list, dropping neighbors that were
+                        // concurrently deleted after consolidate ran.
+                        for (auto neighbor : list) {
+                            auto it = old_to_new_id_map.find(neighbor);
+                            if (it != old_to_new_id_map.end()) {
+                                buffer.push_back(it->second);
                             }
-                        );
+                        }
 
                         temp_graph.replace_node(batch_id, buffer);
                     }

razdoburdin · 2026-05-29T13:45:43Z

@razdoburdin, I encountered another crash in Redis. What do you think about the following analysis from Claude?

The problem was deeper. I have made some changes, search isn't affected, but calls of compact() is now serialized vs additions, deletions and consolidations.

mihaic · 2026-05-30T02:28:05Z

@razdoburdin, another crash with VecSim update threshold 1 (not with the default of 1024). Here is what I got from Claude:

Root Cause: Race condition in SVS `data_.resize()` during concurrent search

The crash is a use-after-free in blocks_ (a std::vector<DenseArray>).

The Data Structure

include/svs/core/data/simple.h:955:

std::vector<array_type> blocks_;   // line 955

Data access (get_datum, line 837-839):

auto [block_id, data_id] = resolve(i);
return getindex(blocks_, block_id).slice(data_id);  // dereferences blocks_[block_id]

The Race

Thread A (search) — no lock held:

Calls get_datum(i) → reads blocks_.data() to index into the vector
Or already holds a reference to blocks_[block_id]

Thread B (add_points, Phase 1) — holds translator_mutex_ exclusive, at line 745 of dynamic_index.h:

data_.resize(new_size) → calls add_block() → blocks_.emplace_back(...)
If blocks_ capacity is exceeded, std::vector reallocates: old buffer is freed

Thread A (continues):
3. Indexes into the freed old blocks_ buffer → reads garbage DenseArray → dereferences garbage pointer_ → SIGSEGV at 0xada00

Why `translator_mutex_` Doesn't Help

translator_mutex_ is only acquired by:

add_points Phase 1 (exclusive) — to protect the ID translator maps
search's translate_to_external() (shared) — AFTER the search completes

Search's data_ access during graph traversal/distance computation holds NO lock at all. The translator_mutex_ was never designed to protect blocks_ from concurrent read.

The Stack Trace Confirms This

At crash time:

Thread 3048847 (update): in IDTranslator::insert_translation — this is AFTER data_.resize() already ran (line 745 precedes line 760 in Phase 1). The blocks_ reallocation has already happened.
Thread 3048835 (search): in IPImpl::compute — accessing vector data via a stale pointer into the old blocks_ buffer.

Why threshold=1 Crashes but threshold=1024 Doesn't

With threshold=1: add_points is called for every single insert (1M calls during upload + continuous during mixed workload). blocks_ reallocates whenever a block fills up (~every 1024 vectors), but the back-to-back add_points calls mean:

The system spends almost all its time cycling through add_points
Searches overlap with add_points almost continuously
The probability of a search being mid-get_datum() at the exact moment of blocks_ reallocation is high

With threshold=1024: add_points is called only every 1024 vectors (~977 times for 1M vectors). Between calls, there's a long stable window where searches run against static blocks_. The temporal overlap between search and reallocation is much smaller.

The Fix

SVS's internal locking is insufficient for concurrent search + mutation. The comment in VectorSimilarity ("SVS handles its own internal locking for concurrent search + modification") is incorrect for this code path.

Options:

Re-introduce mainIndexGuard — shared during search, exclusive during addVectors. This was the previous working approach.
Add a data-access reader-writer lock inside SVS — search holds shared, data_.resize() in Phase 1 holds exclusive. This would protect only the resize window, not all of Phase 1.
Make blocks_ reallocation-safe — e.g., use a fixed-capacity container, or an indirection layer (pointer-to-blocks) that's atomically swapped (RCU-style).

Dmitry Razdoburdin added 2 commits April 20, 2026 04:51

initial

af0f8e2

use shared_mutex for translator protection

339d156

razdoburdin requested review from ahuber21, ethanglaser and ibhati as code owners April 27, 2026 15:58

razdoburdin marked this pull request as draft April 27, 2026 15:58

razdoburdin and others added 4 commits April 28, 2026 09:18

Merge branch 'main' into seqlock

0683f0c

fix translator size calculation

0c76ef3

fix silent edge drop during concurent addition

d336a1f

fix insertion into deleted translarot entry

7c510e4

rfsaliev requested changes Apr 29, 2026

View reviewed changes

Dmitry Razdoburdin added 4 commits April 29, 2026 03:26

bump clang version to clang19 for macos ci

1f3f1b2

bump clang version to clang20 for macos ci

ac6113c

fix macos build

ffebf9c

fix silent vector loosing in concurent additions

46800ff

Dmitry Razdoburdin and others added 4 commits May 4, 2026 08:05

fix consolidate

7bf994f

Merge branch 'main' into seqlock

5d8ecdf

reduce critical section in add_points()

25b5b53

Merge branch 'main' into seqlock

826421b

Dmitry Razdoburdin and others added 2 commits May 29, 2026 03:45

fix race for compact()

7caf032

Merge branch 'main' into seqlock

1cfe979

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Concurrent graph modifications#321

Concurrent graph modifications#321
razdoburdin wants to merge 16 commits into
intel:mainfrom
razdoburdin:seqlock

razdoburdin commented Apr 27, 2026

Uh oh!

rfsaliev left a comment •

edited

Loading

Uh oh!

rfsaliev Apr 29, 2026

Uh oh!

razdoburdin Apr 29, 2026

Uh oh!

aguerreb Apr 29, 2026

Uh oh!

razdoburdin Apr 30, 2026

Uh oh!

rfsaliev Apr 30, 2026

Uh oh!

rfsaliev Apr 29, 2026

Uh oh!

mihaic commented May 2, 2026

Uh oh!

mihaic commented May 28, 2026

Uh oh!

razdoburdin commented May 29, 2026

Uh oh!

mihaic commented May 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

razdoburdin commented Apr 27, 2026

Uh oh!

rfsaliev left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rfsaliev Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

razdoburdin Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

aguerreb Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

razdoburdin Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

rfsaliev Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

rfsaliev Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

mihaic commented May 2, 2026

Uh oh!

mihaic commented May 28, 2026

Root Cause

Fix

Summary

Uh oh!

razdoburdin commented May 29, 2026

Uh oh!

mihaic commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Root Cause: Race condition in SVS data_.resize() during concurrent search

The Data Structure

The Race

Why translator_mutex_ Doesn't Help

The Stack Trace Confirms This

Why threshold=1 Crashes but threshold=1024 Doesn't

The Fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rfsaliev left a comment •

edited

Loading

mihaic commented May 30, 2026 •

edited

Loading

Root Cause: Race condition in SVS `data_.resize()` during concurrent search

Why `translator_mutex_` Doesn't Help