[Feature] Add cloud leaderboard benchmark cases and client support by jamesgao-jpg · Pull Request #775 · zilliztech/VectorDBBench

jamesgao-jpg · 2026-05-11T09:34:35Z

Summary

This PR adds the CloudLeadboard v2 benchmark surface to VectorDBBench. The goal is to complement the existing raw-performance leaderboard with cloud-oriented cases that capture production behaviors managed vector database users care about: ingest readiness, payload-aware search, tenant-shaped traffic, cold latency, and cost-aware interpretation.

What is added

New cloud leaderboard cases

CloudPayloadSearchCase: measures search performance with explicit response payload profiles: ids_only, scalar_label, and vector. It supports unfiltered search, integer-filter search, and scalar-label filter search.
CloudInsertCase: measures insert throughput and separates client insert completion from downstream readiness signals such as fully searchable and fully indexed.
CloudColdLatencyCase: measures cold and warm serial latency so first-query and cache-sensitive serving behavior are visible instead of hidden by warm concurrent loops.
CloudMultiTenantSearchCase: models SaaS-style tenant-routed workloads with deterministic tenant assignment and tenant-aware query routing.

Runtime and metric plumbing

Threads payload profile configuration through case config, runners, client search calls, metrics, and JSON result output.
Adds payload byte estimation to result metrics so payload-heavy searches can be compared more explicitly.
Adds a cold/warm search runner and task-runner integration for cold latency measurement.
Adds concurrent insert readiness polling for cloud insert runs.
Adds first-class result fields for cloud insert, cold latency, payload profile, and related cloud case metadata.

Client support

Milvus/Zilliz Cloud: adds payload output handling, scalar-label support, multitenant partition-key validation, and related schema checks.
Pinecone: adds payload-profile search behavior, metadata/vector return handling, insert readiness polling through write/index LSNs where available, and namespace-based multitenant routing.
turbopuffer: adds payload-profile search behavior, scalar payload label configuration, multitenant namespace support, write backpressure control, and namespace pin/unpin CLI support for pinned benchmark runs.

CLI, frontend, and docs

Adds CLI options for payload profile, cloud filters, cold query count, insert batch size/duration, tenant settings, and turbopuffer pinning.
Adds frontend case config entries for the cloud payload search cases.
Adds a May 2026 CloudLeadboard v2 release note under docs/release/2026-05-cloud-leadboard.md.
Updates README.md to mention the new CloudLeadboard v2 benchmark cases and link to the release note.

Notes

The raw cloud leaderboard result dump was intentionally removed from this PR. Result artifacts can be added later in a smaller, dedicated update or kept outside the source tree.
Cloud cost and Pareto interpretation are described in the release note, but this PR focuses on benchmark case support and documentation rather than publishing a full result dataset.

Test Plan

python -m pytest tests/test_cloud_payload_case.py tests/test_cloud_payload_search.py tests/test_cloud_insert_case.py tests/test_cloud_cold_latency_case.py tests/test_multitenant_case.py tests/test_turbopuffer_cli.py tests/test_pinecone_multitenant.py -q

sre-ci-robot · 2026-05-11T09:34:42Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: jamesgao-jpg
To complete the pull request process, please assign xuanyang-cn after the PR has been reviewed.
You can assign the PR to them by writing /assign @xuanyang-cn in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

…h-case # Conflicts: # vectordb_bench/backend/clients/milvus/milvus.py # vectordb_bench/backend/clients/turbopuffer/turbopuffer.py # vectordb_bench/backend/dataset.py # vectordb_bench/backend/runner/serial_runner.py

Ported onto cloud-payload-search-case while preserving TurboPuffer payload, backpressure, and multitenant support. (cherry picked from commit adb1ae4)

(cherry picked from commit d9e2f5d)

…lution # Conflicts: # tests/test_milvus.py

This reverts commit 9648add.

jamesgao-jpg added 3 commits May 8, 2026 10:12

Add cloud payload search benchmark case

8d83582

Add cloud payload scalar label support

1637484

Add cloud insert readiness case

829d7f4

jamesgao-jpg changed the title ~~Add cloud leaderboard payload and insert readiness cases~~ (feat) Add cloud leaderboard payload and insert readiness cases May 11, 2026

jamesgao-jpg force-pushed the cloudLeadboard branch 2 times, most recently from e258154 to f909086 Compare May 11, 2026 09:42

jamesgao-jpg force-pushed the cloudLeadboard branch from f909086 to c943513 Compare May 11, 2026 09:51

Fix cloud insert CI lint failures

519f01c

jamesgao-jpg changed the title ~~(feat) Add cloud leaderboard payload and insert readiness cases~~ [WIP] (feat) Add cloud leaderboard payload and insert readiness cases May 12, 2026

jamesgao-jpg added 19 commits May 12, 2026 13:17

Add multitenant VDBBench design spec

4df4b8f

Add multitenant implementation plan

20befac

Add cloud insert concurrency and Pinecone readiness support

76ad386

Add cloud cold latency case design

5fb9930

Add cloud cold latency implementation plan

e611187

Add cloud cold latency case model

2a99d70

Add cold warm search runner

f61b447

Fix cold warm runner lint issues

46c5c94

Wire cloud cold latency runner into tasks

a9f7668

Fix cloud cold latency task integration lint

33361d2

Fix cloud cold latency CLI defaults

52b7158

Add cloud multitenant search case

fc66bc1

Emit first-class cloud case result metrics

dd2485d

Pretty-print cloud result JSON output

2f92bbf

fix: validate multitenant search schema

77dcb72

fix: support turbopuffer multitenant payload runs

6cede04

fix: configure turbopuffer scalar payload field

df15217

Add Turbopuffer namespace pinning CLI

66475be

Ported onto cloud-payload-search-case while preserving TurboPuffer payload, backpressure, and multitenant support. (cherry picked from commit adb1ae4)

Document TurboPuffer unpin command

1994211

(cherry picked from commit d9e2f5d)

jamesgao-jpg added 3 commits May 14, 2026 06:46

Record turbopuffer pinning request metadata

9ecd79b

Add cloud insert benchmark raw results

0bf0fef

Remove internal docs and raw cloud results from PR

c4620d1

jamesgao-jpg changed the title ~~[WIP] (feat) Add cloud leaderboard payload and insert readiness cases~~ Add cloud benchmark cases for payload, insert, cold latency, and multitenancy May 15, 2026

jamesgao-jpg changed the title ~~Add cloud benchmark cases for payload, insert, cold latency, and multitenancy~~ Add cloud leaderboard benchmark cases and client support May 18, 2026

jamesgao-jpg added 2 commits May 18, 2026 06:39

Merge remote-tracking branch 'upstream/main' into pr775-conflict-reso…

c623e17

…lution # Conflicts: # tests/test_milvus.py

Format cloud leaderboard changes

b7ec21e

jamesgao-jpg changed the title ~~Add cloud leaderboard benchmark cases and client support~~ [Feature] Add cloud leaderboard benchmark cases and client support May 18, 2026

jamesgao-jpg added 4 commits May 19, 2026 07:14

Add cloud leaderboard test results

9648add

Revert "Add cloud leaderboard test results"

379f81e

This reverts commit 9648add.

Add CloudLeadboard release note

7bfa991

Document CloudLeadboard in README

b353ac0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Add cloud leaderboard benchmark cases and client support#775

[Feature] Add cloud leaderboard benchmark cases and client support#775
jamesgao-jpg wants to merge 33 commits into
zilliztech:mainfrom
jamesgao-jpg:cloudLeadboard

jamesgao-jpg commented May 11, 2026 •

edited

Loading

Uh oh!

sre-ci-robot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jamesgao-jpg commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What is added

New cloud leaderboard cases

Runtime and metric plumbing

Client support

CLI, frontend, and docs

Notes

Test Plan

Uh oh!

sre-ci-robot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jamesgao-jpg commented May 11, 2026 •

edited

Loading