feat(migrator): [3/7] Async migration with non-blocking planner, executor, validator, and readiness utilities by nkanu17 · Pull Request #562 · redis/redis-vl-python

nkanu17 · 2026-04-01T22:38:44Z

Summary

Async versions of planner, executor, and validator for non-blocking migration workflows. Async executor mirrors the sync drop/recreate flow with async key enumeration, prefix/field renames, vector re-encoding with checkpoint resume, and readiness polling.

Includes async utilities for index listing, readiness polling, and source snapshot validation.

Files

redisvl/migration/async_executor.py, async_planner.py, async_validation.py, async_utils.py
Async unit and integration tests

Stack

[1/7] Migration foundation > feat(migrator): [1/7] Migration foundation with models, schema-aware planner, validation, and shared utilities #560
[2/7] Sync executor with reliability and quantization > feat(migrator): [2/7] Sync executor with reliability checkpointing, crash-safe resume, and quantization support #561
[3/7] Async migration (this PR)
[4/7] Batch migration
[5/7] Interactive wizard
[6/7] CLI and documentation
[7/7] Benchmarks

Note

Medium Risk
Adds a new async migration execution path that performs destructive operations (drop/recreate, key/field renames, in-place vector re-encoding) and introduces checkpointed resume logic, which can impact data integrity if edge cases are missed. Changes are mostly additive and covered by new unit/integration tests, but they touch migration reliability concerns and Redis command sequencing.

Overview
Introduces async equivalents of the migration workflow: AsyncMigrationPlanner, AsyncMigrationExecutor, and AsyncMigrationValidator, plus async helpers for listing indexes, readiness polling, and source-snapshot validation.

The async executor mirrors the sync drop/recreate flow but uses non-blocking Redis operations, including FT.AGGREGATE-based key enumeration with SCAN fallback, optional BGSAVE safety snapshot, hash/JSON field renames, prefix key renames with collision fail-fast, vector re-encoding with idempotent skip + per-batch rollback, and optional checkpoint-based resume after crashes.

Exports these async APIs from redisvl.migration and adds substantial unit + integration test coverage for planning, apply/validate flow, enumeration, quantization checkpointing, and readiness utilities.

^{Written by Cursor Bugbot for commit 7a1ef9a. This will update automatically on new commits. Configure here.}

jit-ci · 2026-04-01T22:41:25Z

🛡️ Jit Security Scan Results

✅ No security findings were detected in this PR

^{Security scan by Jit}

redisvl/migration/async_utils.py

redisvl/migration/async_executor.py

Copilot

Pull request overview

Adds an async migration surface (AsyncMigrationPlanner, AsyncMigrationExecutor, AsyncMigrationValidator) to enable non-blocking drop/recreate migrations, plus async readiness/index utilities and corresponding unit/integration tests.

Changes:

Introduces async planner/executor/validator implementations mirroring the existing sync migration flow (including renames, vector re-encoding, and readiness polling).
Adds async helper utilities for listing indexes, readiness polling, and snapshot comparison.
Adds new async unit and integration tests for planning, execution, disk space estimation, and reliability helpers.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
`redisvl/migration/async_planner.py`	Async migration planning built on `AsyncSearchIndex` with sync planner delegation.
`redisvl/migration/async_executor.py`	Async migration apply flow incl. key/field renames, optional vector re-encoding, readiness wait, and validation.
`redisvl/migration/async_validation.py`	Async post-migration validation and query checks.
`redisvl/migration/async_utils.py`	Async index listing, readiness polling, and snapshot match helper.
`redisvl/migration/__init__.py`	Exposes new async migration APIs/utilities from the package.
`tests/unit/test_async_migration_planner.py`	Unit coverage for async planner parity with sync behavior.
`tests/unit/test_async_migration_executor.py`	Unit coverage for async executor + disk estimator + reliability helpers.
`tests/integration/test_async_migration_v1.py`	End-to-end integration coverage for async plan/apply/validate against Redis.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/migration/async_planner.py

redisvl/migration/async_utils.py

redisvl/migration/async_executor.py

tests/unit/test_async_migration_executor.py

redisvl/migration/async_executor.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0087dcf33b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

redisvl/migration/async_executor.py

nkanu17 · 2026-04-01T22:43:53Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0087dcf33b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

redisvl/migration/async_executor.py

Copilot

Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 5 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/migration/async_executor.py

redisvl/migration/async_validation.py

redisvl/migration/async_planner.py

redisvl/migration/async_executor.py

nkanu17 · 2026-04-01T22:58:14Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0087dcf33b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

redisvl/migration/async_executor.py

- Fix unbound 'ready' variable in async_utils.py and async_executor.py - Fix completed checkpoint: resume from post-drop state - Pass rename_operations to get_vector_datatype_changes - Fix has_prefix_change falsy check for empty string prefixes - Fix partial key renames: fail fast on collision - Warn when field rename overwrites existing destination field - Fix async_validation prefix handling and indexing failure delta

- Fix _quantize_vectors docstring: 'documents quantized' not 'processed' - Close internally-created Redis client in async_list_indexes

nkanu17 · 2026-04-02T00:32:22Z

@codex review

redisvl/migration/async_planner.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8642ec7435

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

redisvl/migration/async_executor.py

redisvl/migration/async_planner.py

- Remap datatype_changes keys to post-rename field names before quantization - Only resume from completed checkpoint when source index is actually gone

…xecutor, validator, and readiness utilities Async versions of planner, executor, and validator for non-blocking migration workflows. Async executor mirrors the sync drop/recreate flow with async key enumeration, prefix/field renames, vector re-encoding with checkpoint resume, and readiness polling. Includes async utilities for index listing, readiness polling, and source snapshot validation. Adds async unit and integration tests.

- Fix unbound 'ready' variable in async_utils.py and async_executor.py - Fix completed checkpoint: resume from post-drop state - Pass rename_operations to get_vector_datatype_changes - Fix has_prefix_change falsy check for empty string prefixes - Fix partial key renames: fail fast on collision - Warn when field rename overwrites existing destination field - Fix async_validation prefix handling and indexing failure delta

- Fix _quantize_vectors docstring: 'documents quantized' not 'processed' - Close internally-created Redis client in async_list_indexes

… formatting

- Pass existing snapshot to create_plan_from_patch to avoid double Redis round-trip - Use _get_client() instead of _redis_client for lazy async client initialization - Remap datatype_changes keys to post-rename field names before quantization - Only resume from completed checkpoint when source index is actually gone

nkanu17 · 2026-04-02T03:59:39Z

@codex review

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

cursor · 2026-04-02T04:01:14Z

redisvl/migration/async_executor.py

+            percent_indexed = latest_info.get("percent_indexed")
+
+            if percent_indexed is not None or indexing is not None:
+                ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)


Async readiness check logic diverges from sync version

High Severity

The async readiness check uses float(percent_indexed or 0) >= 1.0 and not bool(indexing), which differs from the sync version's logic. When percent_indexed is None but indexing is present and falsy (e.g., 0), the sync version correctly falls through to ready = not is_indexing (returning True). The async version evaluates float(None or 0) >= 1.0 → False, so ready stays False, potentially causing a 30-minute timeout instead of detecting the index is ready.

Additional Locations (1)

redisvl/migration/async_utils.py#L61-L63

cursor · 2026-04-02T04:01:14Z

redisvl/migration/async_planner.py

+            warnings.append(
+                "SVS-VAMANA requires Redis >= 8.2.0 and Redis Search >= 2.8.10. "
+                "Verify your Redis instance supports this algorithm before applying."
+            )


Async SVS check leaks Redis client connection

Medium Severity

When redis_url is provided, _check_svs_vamana_requirements creates a new Redis.from_url() client but never closes it. The sync counterpart in planner.py tracks the created client and calls created_client.close() in a finally block. The async version is missing both the created_client tracking and the finally cleanup with await client.aclose().

jit-ci · 2026-04-02T04:01:46Z

❌ Jit Scanner failed - Our team is investigating

Jit Scanner failed - Our team has been notified and is working to resolve the issue. Please contact support if you have any questions.

💡 Need to bypass this check? Comment @sera bypass to override.

Copilot

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-02T04:03:54Z

redisvl/migration/async_utils.py

+            ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)
+            if progress_callback:
+                total_docs = int(latest_info.get("num_docs", 0))
+                pct = float(percent_indexed or 0)


async_wait_for_index_ready() treats a missing percent_indexed value as 0.0 even when the indexing flag is present, which can make the loop wait until timeout on Redis/Search versions that omit percent_indexed. Mirror the sync wait_for_index_ready() logic: if percent_indexed is None but indexing is present, consider the index ready when indexing is falsy, and compute progress accordingly.

Suggested change

ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)

if progress_callback:

total_docs = int(latest_info.get("num_docs", 0))

pct = float(percent_indexed or 0)

# Mirror sync wait_for_index_ready behavior:

# - If percent_indexed is provided, use it directly.

# - If percent_indexed is None but indexing is present, treat the

# index as fully indexed (pct = 1.0) when indexing is falsy.

if percent_indexed is not None:

pct = float(percent_indexed)

else:

pct = 1.0 if not bool(indexing) else 0.0

ready = pct >= 1.0 and not bool(indexing)

if progress_callback:

total_docs = int(latest_info.get("num_docs", 0))

Copilot · 2026-04-02T04:03:54Z

redisvl/migration/async_executor.py

+            if percent_indexed is not None or indexing is not None:
+                ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)
+                if progress_callback:
+                    total_docs = int(latest_info.get("num_docs", 0))
+                    pct = float(percent_indexed or 0)
+                    indexed_docs = int(total_docs * pct)
+                    progress_callback(indexed_docs, total_docs, pct * 100)


_async_wait_for_index_ready() can fail to ever report readiness when percent_indexed is missing but the indexing flag exists (it coerces missing percent to 0.0 and requires >= 1.0). Align with the sync readiness utility: if percent_indexed is absent, use the indexing flag alone to decide readiness (ready when indexing is falsy).

Suggested change

if percent_indexed is not None or indexing is not None:

ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)

if progress_callback:

total_docs = int(latest_info.get("num_docs", 0))

pct = float(percent_indexed or 0)

indexed_docs = int(total_docs * pct)

progress_callback(indexed_docs, total_docs, pct * 100)

if percent_indexed is not None:

pct = float(percent_indexed or 0)

ready = pct >= 1.0 and not bool(indexing)

if progress_callback:

total_docs = int(latest_info.get("num_docs", 0))

indexed_docs = int(total_docs * pct)

progress_callback(indexed_docs, total_docs, pct * 100)

elif indexing is not None:

# When percent_indexed is absent but indexing is present,

# fall back to the indexing flag alone (ready when falsy),

# matching the synchronous readiness utility behavior.

ready = not bool(indexing)

if progress_callback:

total_docs = int(latest_info.get("num_docs", 0))

if ready:

indexed_docs = total_docs

pct_value = 100.0

else:

indexed_docs = 0

pct_value = 0.0

progress_callback(indexed_docs, total_docs, pct_value)

Copilot · 2026-04-02T04:03:55Z

redisvl/migration/async_executor.py

+                    source_still_exists = (
+                        await self._async_current_source_matches_snapshot(
+                            plan.source.index_name,
+                            plan.source.schema_snapshot,
+                            redis_url=redis_url,
+                            redis_client=redis_client,
+                        )
+                    )


When checkpoint.status == "completed", this calls _async_current_source_matches_snapshot() to decide whether the source index still exists. If the source index was dropped (a likely post-crash resume scenario), AsyncSearchIndex.from_existing() will raise and apply() will bubble the exception (this code runs before the main try/except), preventing resume. Catch the "index not found" path here and treat it as source_still_exists = False.

Suggested change

source_still_exists = (

await self._async_current_source_matches_snapshot(

plan.source.index_name,

plan.source.schema_snapshot,

redis_url=redis_url,

redis_client=redis_client,

)

)

try:

source_still_exists = await self._async_current_source_matches_snapshot(

plan.source.index_name,

plan.source.schema_snapshot,

redis_url=redis_url,

redis_client=redis_client,

)

except ResponseError as exc:

# If the source index no longer exists, treat this as

# "source_still_exists = False" so we can resume from

# the completed checkpoint in a post-drop crash scenario.

if "Unknown Index name" in str(exc):

logger.info(

"Source index '%s' not found when checking completed "

"checkpoint at %s; treating as dropped",

plan.source.index_name,

checkpoint_path,

)

source_still_exists = False

else:

raise

nkanu17 · 2026-04-02T04:04:01Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7a1ef9a651

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-02T04:07:51Z

redisvl/migration/async_executor.py

+        current_index = await AsyncSearchIndex.from_existing(
+            index_name,
+            redis_url=redis_url,
+            redis_client=redis_client,
+        )


Return false when async source index is already dropped

This helper is used by apply() to decide whether a completed checkpoint should resume from a post-drop state, but AsyncSearchIndex.from_existing(...) raises when the source index is gone (which is exactly that resume scenario). Because this call happens before the main try/except in apply(), a valid completed-checkpoint resume can crash with an exception instead of continuing or returning a failed MigrationReport. Catch the missing-index error here and return False so resume logic can proceed.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-02T04:07:51Z

redisvl/migration/async_planner.py

+                from redis.asyncio import Redis
+
+                client = Redis.from_url(redis_url)


Close temporary async Redis client after SVS check

When redis_url is provided (without redis_client), this function creates an async Redis client via Redis.from_url(...) but never closes it. Planning runs can be invoked repeatedly, so these unclosed clients accumulate sockets/connections over time and degrade process stability. The sync planner closes its temporary client in finally; the async path should similarly await aclose() for clients it creates.

Useful? React with 👍 / 👎.

Copilot AI review requested due to automatic review settings April 1, 2026 22:38

Copilot started reviewing on behalf of nkanu17 April 1, 2026 22:39 View session

cursor bot reviewed Apr 1, 2026

View reviewed changes

redisvl/migration/async_utils.py Show resolved Hide resolved

redisvl/migration/async_executor.py Show resolved Hide resolved

Copilot AI reviewed Apr 1, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Apr 1, 2026

View reviewed changes

redisvl/migration/async_executor.py Show resolved Hide resolved

redisvl/migration/async_executor.py Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Apr 1, 2026

View reviewed changes

redisvl/migration/async_executor.py Outdated Show resolved Hide resolved

redisvl/migration/async_executor.py Show resolved Hide resolved

nkanu17 requested a review from Copilot April 1, 2026 22:50

Copilot started reviewing on behalf of nkanu17 April 1, 2026 22:50 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Apr 1, 2026

View reviewed changes

redisvl/migration/async_executor.py Outdated Show resolved Hide resolved

nkanu17 added a commit that referenced this pull request Apr 2, 2026

fix: async minor cleanups (#562)

3f1f185

- Fix _quantize_vectors docstring: 'documents quantized' not 'processed' - Close internally-created Redis client in async_list_indexes

nkanu17 force-pushed the feat/migrate-executor branch from 50cff88 to 33f2a40 Compare April 2, 2026 00:30

nkanu17 force-pushed the feat/migrate-async branch from 0087dcf to 8642ec7 Compare April 2, 2026 00:30

cursor bot reviewed Apr 2, 2026

View reviewed changes

redisvl/migration/async_planner.py Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Apr 2, 2026

View reviewed changes

redisvl/migration/async_executor.py Outdated Show resolved Hide resolved

redisvl/migration/async_planner.py Outdated Show resolved Hide resolved

nkanu17 added 6 commits April 1, 2026 23:55

fix: address review round 3 for migrate-executor (#561)

19f8451

- Remap datatype_changes keys to post-rename field names before quantization - Only resume from completed checkpoint when source index is actually gone

fix: async minor cleanups (#562)

7d23c7c

- Fix _quantize_vectors docstring: 'documents quantized' not 'processed' - Close internally-created Redis client in async_list_indexes

fix(validation): fix double-colon bug in async prefix key transform +…

324e0dc

… formatting

nkanu17 force-pushed the feat/migrate-executor branch from 33f2a40 to 42aa7bb Compare April 2, 2026 03:58

Copilot AI review requested due to automatic review settings April 2, 2026 03:58

nkanu17 force-pushed the feat/migrate-async branch from 8642ec7 to 7a1ef9a Compare April 2, 2026 03:58

Copilot started reviewing on behalf of nkanu17 April 2, 2026 03:59 View session

cursor bot reviewed Apr 2, 2026

View reviewed changes

Copilot AI reviewed Apr 2, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Apr 2, 2026

View reviewed changes

-            ready = float(percent_indexed or 0) >= 1.0 and not bool(indexing)
-            if progress_callback:
-                total_docs = int(latest_info.get("num_docs", 0))
-                pct = float(percent_indexed or 0)
+            # Mirror sync wait_for_index_ready behavior:
+            # - If percent_indexed is provided, use it directly.
+            # - If percent_indexed is None but indexing is present, treat the
+            #   index as fully indexed (pct = 1.0) when indexing is falsy.
+            if percent_indexed is not None:
+                pct = float(percent_indexed)
+            else:
+                pct = 1.0 if not bool(indexing) else 0.0
+            ready = pct >= 1.0 and not bool(indexing)
+            if progress_callback:
+                total_docs = int(latest_info.get("num_docs", 0))

		from redis.asyncio import Redis

		client = Redis.from_url(redis_url)

Conversation

nkanu17 commented Apr 1, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Files

Stack

Uh oh!

jit-ci bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🛡️ Jit Security Scan Results

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

nkanu17 commented Apr 1, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nkanu17 commented Apr 1, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

nkanu17 commented Apr 2, 2026

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

nkanu17 commented Apr 2, 2026

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Apr 2, 2026

Choose a reason for hiding this comment

Async readiness check logic diverges from sync version

Uh oh!

cursor bot Apr 2, 2026

Choose a reason for hiding this comment

Async SVS check leaks Redis client connection

Uh oh!

jit-ci bot commented Apr 2, 2026

❌ Jit Scanner failed - Our team is investigating

Uh oh!

Copilot AI left a comment

nkanu17 commented Apr 1, 2026 •

edited by cursor bot

Loading

jit-ci bot commented Apr 1, 2026 •

edited

Loading