
[QDP] feat: add credit card fraud benchmark + amplitude encoding optimizations#1106

Open
rich7420 wants to merge 4 commits into apache:main from rich7420:credit-card

Conversation

rich7420 (Contributor) commented Mar 2, 2026

Changes

  • New benchmark: encoding_benchmarks/qdp_pipeline/creditcardfraud_amplitude.py — 5-qubit
    amplitude VQC on Credit Card Fraud data, aligned with PennyLane baseline (same circuit, loss,
    optimizer). Closes the QDP vs baseline training time gap from ~22% slower to <1% gap.
  • New baseline: encoding_benchmarks/pennylane_baseline/creditcardfraud_amplitude.py
    PennyLane reference implementation with AUPRC/F1 metrics for imbalanced data.
  • QuantumDataLoader API: added source_array(X) (in-memory, no temp file),
    as_torch(device), and as_numpy() for ergonomic batch output format.
  • Rust PipelineIterator: added new_from_array() constructor; InMemory next_batch now
    passes &data[start..end] slice directly (no per-batch to_vec()).
  • amplitude.rs: moved D2H norm validation to after encode kernel + device.synchronize(),
    eliminating a mid-pipeline GPU→CPU roundtrip in encode_batch.
  • Bug fixes (iris + creditcard benchmarks): requires_grad=False on all data arrays to
    prevent AdamOptimizer from computing unnecessary gradients through state vectors;
    AmplitudeEmbedding(normalize=False) in place of StatePrep; .real extraction after
    torch.from_dlpack() to convert complex128 DLPack output to float64.

Motivation

The existing QDP benchmark suite only covers the Iris dataset (100 samples, 2 qubits), which is too small to surface real-world data-loading and encoding bottlenecks. Credit Card Fraud (284,807 transactions, 5 qubits) is a standard imbalanced-classification benchmark from Kaggle/OpenML that stresses the full QDP pipeline — batch iteration, GPU encoding, and training — at realistic scale.

Adding this benchmark also serves a second purpose:

  • Expands loader API coverage: source_array(X), as_torch(), and as_numpy() are exercised end-to-end by the new benchmark and tests, catching integration issues across the Python → PyO3 → Rust → CUDA boundary.
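For context on why the baseline reports AUPRC/F1 rather than accuracy: the public Kaggle dataset contains 492 frauds among 284,807 transactions (about 0.17% positives), so accuracy is nearly meaningless. A back-of-the-envelope check, in plain Python:

```python
# Class counts from the public Kaggle Credit Card Fraud dataset description.
total, frauds = 284_807, 492
negatives = total - frauds

# A useless classifier that predicts "not fraud" for everything:
accuracy = negatives / total
print(f"accuracy = {accuracy:.4%}")   # ~99.83% while catching zero fraud

# Its recall on the fraud class is 0, so F1 is 0, and a random-ranking
# baseline AUPRC is only about the positive rate -- these metrics expose
# the failure that accuracy hides.
positive_rate = frauds / total
print(f"baseline AUPRC is roughly {positive_rate:.4%}")
```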

Checklist

  • Bug fix
  • New feature
  • Refactoring
  • Documentation
  • Test
  • CI/CD pipeline
  • Other
  • Added or updated unit tests for all changes
  • Added or updated documentation for all changes

rich7420 commented Mar 2, 2026

This PR ended up a bit larger than intended. Sorry about that.

viiccwen (Contributor) left a comment

Thanks for contributing! 🙌
I'll look deeper tomorrow. I think we should add tests covering the new loader APIs, especially since the new behavior crosses the Python, PyO3, Rust, and CUDA boundaries.

Comment on lines +356 to +358
elif kind == "numpy":
    for qt in raw_iter:
        yield _torch.from_dlpack(qt).cpu().numpy()

as_torch() validates that torch is installed, but as_numpy() does not. _wrap_iterator() then calls _torch.from_dlpack(...) on the "numpy" path.

Doesn't that mean as_numpy() can succeed at configuration time and then fail during iteration with an unclear runtime error if PyTorch is not installed? 🤔

rich7420 (Contributor, author) replied:

Oh, nice catch! You're right.

ryankert01 (Member) commented:

amplitude.rs: moved D2H norm validation to after encode kernel + device.synchronize(),
eliminating a mid-pipeline GPU→CPU roundtrip in encode_batch.

nice


400Ping commented Mar 3, 2026

Please resolve the merge conflicts.


rich7420 commented Mar 6, 2026

Please take a look and test when you have time; no hurry.

400Ping self-assigned this Mar 8, 2026
ryankert01 (Member) left a comment

The PennyLane baseline trains on CPU, whereas the QDP pipeline supports both CPU and GPU. We could simplify both to always train on the GPU.

rich7420 (Contributor, author) replied:

No problem!


400Ping commented Mar 19, 2026

Please resolve the merge conflicts.


400Ping commented Mar 19, 2026

Can you also add some more context to the PR description about why this is needed?

400Ping (Member) left a comment

The array-loader optimization claim does not match the implementation in this PR.
create_array_loader() says batching uses slices without per-batch to_vec(), but
PipelineIterator::take_batch_from_source() still clones each in-memory batch with
data[start..end].to_vec().

next_batch() already handles InMemory via zero-copy &data[start..end].
Remove the dead-code .to_vec() clone path that contradicted the
documented optimization claim.

Addresses review comment from 400Ping.
rich7420 (Contributor, author) replied:

The array-loader optimization claim does not match the implementation in this PR. create_array_loader() says batching uses slices without per-batch to_vec(), but PipelineIterator::take_batch_from_source() still clones each in-memory batch with data[start..end].to_vec().

Great catch! You are right that the code in take_batch_from_source() was misleading.
Actually, next_batch() already handles the InMemory variant directly using the zero-copy data[start..end] slice logic and never calls take_batch_from_source() for it. To avoid confusion and make the implementation match the documentation claims, I've replaced the dead-code InMemory arm in take_batch_from_source() with unreachable!().
I've also rebased the branch onto main and resolved all the merge conflicts. Thanks for the review!
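The zero-copy distinction at the heart of this thread has a simple Python-side analogue. This sketch is illustrative, not the Rust code: a NumPy basic slice is a borrowed view into the source buffer (like `&data[start..end]`), while an explicit copy allocates a fresh buffer per batch (like `data[start..end].to_vec()`).

```python
import numpy as np

data = np.arange(10, dtype=np.float64)
start, end = 2, 6

view = data[start:end]          # like &data[start..end]: a borrow, no allocation
clone = data[start:end].copy()  # like data[start..end].to_vec(): per-batch copy

print(np.shares_memory(data, view))   # True  -- zero-copy view
print(np.shares_memory(data, clone))  # False -- duplicated buffer
```

Iterating a large in-memory dataset with views instead of copies avoids one allocation and memcpy per batch, which is the same win the Rust `next_batch()` slice path delivers.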
