Add _vectorize_node dispatchers for sparse ops by jaanerik · Pull Request #2190 · pymc-devs/pytensor

jaanerik · 2026-05-29T17:53:34Z

Description

Related Issue

Closes BUG: Sparse vectorize dispatchers #2189

Checklist

Checked that the pre-commit linting/style checks pass
Included tests that prove the fix is effective or that the new feature works

Type of change

New feature / enhancement
Bug fix

ricardoV94 · 2026-05-29T21:11:41Z

 usmm = Usmm()
+
+
+# ---------------------------------------------------------------------------


please remove these global comments/separators

ricardoV94 · 2026-05-29T21:11:55Z

+# That contradicts the sparse-input contract enforced by as_sparse_variable,
+# so every sparse op needs a custom dispatcher (or a clear NotImplementedError).
+@_vectorize_node.register(StructuredDot)
+def _vectorize_structured_dot(op, node, batch_a, batch_b):


ricardoV94 · 2026-05-29T21:13:00Z

looks good, just need style cleanup

The default Blockwise-based fallback in pytensor/graph/replace.py wraps ops in Blockwise and rebuilds their make_node with dense dummy core inputs, which contradicts the sparse-input contract enforced by as_sparse_variable. As a result, vectorize_graph crashes with "Variable type field must be a SparseTensorType" the moment it encounters any sparse op. pmx-extras pathfinder hits this whenever a PyMC model uses a sparse projection (e.g. a sum-to-zero constraint encoded as pt.dot(flat, as_sparse_variable(csr))).

This patch:

- Registers an explicit dispatcher for StructuredDot that batches the dense (right) input via a moveaxis+reshape trick while keeping the sparse (left) input unbatched (scipy has no batched-sparse type). Raises NotImplementedError with a clear message if the caller tries to batch the sparse input.
- Registers NotImplementedError stubs for the other sparse ops likely to appear in user graphs (TrueDot, AddSS, AddSSData, AddSD, SparseSparseMultiply, SparseDenseMultiply) so callers see a descriptive error instead of the cryptic as_sparse_variable TypeError from the Blockwise fallback.
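For readers following the moveaxis+reshape trick described in the commit message, here is a minimal NumPy/SciPy sketch of the same idea. It is not the PR's PyTensor code: the helper name `batched_sparse_dot` and the test shapes are made up for illustration, and it assumes the sparse operand is a 2D `scipy.sparse` matrix on the left with a dense, possibly batched operand on the right.

```python
import numpy as np
import scipy.sparse as sp


def batched_sparse_dot(a_sparse, batch_b):
    """Hypothetical helper: a_sparse (m, k) times batch_b (..., k, n).

    All batch dimensions are folded into the column dimension so that a
    single 2D sparse-dense matmul does the work.
    """
    *batch_shape, k, n = batch_b.shape
    moved = np.moveaxis(batch_b, -2, 0)        # (k, B1, ..., BN, n)
    flat = moved.reshape(k, -1)                # (k, B1*...*BN*n)
    flat_out = np.asarray(a_sparse @ flat)     # (m, B1*...*BN*n)
    m = flat_out.shape[0]
    unflat = flat_out.reshape(m, *batch_shape, n)  # (m, B1, ..., BN, n)
    return np.moveaxis(unflat, 0, -2)          # (B1, ..., BN, m, n)


rng = np.random.default_rng(0)
a = sp.random(4, 3, density=0.5, format="csr", random_state=0)
b = rng.normal(size=(2, 5, 3, 6))

out = batched_sparse_dot(a, b)
expected = a.toarray() @ b  # dense reference via matmul broadcasting
np.testing.assert_allclose(out, expected)
```

With no batch dimensions the two `moveaxis` calls are identity moves and the reshapes keep the 2D shape, so the helper reduces to a plain `a_sparse @ batch_b`.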

Add TestVectorizeSparse covering the StructuredDot dispatcher (batched dense input, no-batch no-op, batched-sparse error) and the AddSD / SparseDenseMultiply NotImplementedError stubs. The structured_dot test reproduces the original "Variable type field must be a SparseTensorType" crash without the dispatcher (issue pymc-devs#2189).

Drop the NotImplementedError stubs for the all-sparse-input ops (TrueDot, AddSS, AddSSData, SparseSparseMultiply): a sparse input can never become batched, so vectorize_graph never dispatches to them. Keep only the reachable AddSD / SparseDenseMultiply, and reword the error since AddSD's output is dense, not sparse.

Move the _vectorize_node import to the top of the module (no circular import) to satisfy ruff E402.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

ricardoV94 · 2026-06-03T13:04:35Z

+    existing StructuredDot 2D matmul.
+    """
+    a, b = node.inputs
+    if batch_a is not a:


a need not be batch_a, just check batch_a.ndim==2?

ricardoV94 · 2026-06-03T13:06:02Z

+    k_axis = -2
+    moved = ptb.moveaxis(batch_b, k_axis, 0)  # (k, B1,...,BN, n)
+
+    # Compose the trailing shape (B1*...*BN*n) symbolically.


pt.join_dims?

ricardoV94 · 2026-06-03T13:06:55Z

+    # Reshape back to (m, B1,...,BN, n).
+    m = flat_out.shape[0]
+    target_shape = ptb.concatenate([ptb.stack([m]), ptb.stack(list(trailing))])
+    unflat = flat_out.reshape(target_shape, ndim=batch_b.type.ndim)


pt.split_dims?

ricardoV94 · 2026-06-03T13:07:47Z

+    unflat = flat_out.reshape(target_shape, ndim=batch_b.type.ndim)
+    # Move m back into the (-2) slot: (B1,...,BN, m, n).
+    out = ptb.moveaxis(unflat, 0, -2)
+    return out.owner


you no longer need to artificially return an apply, that will actually be deprecated. Just return the variables (maybe in a list)

ricardoV94 · 2026-06-03T13:10:54Z

+
+def _vectorize_sparse_unsupported(op, node, *batched_inputs):
+    raise NotImplementedError(
+        f"Cannot vectorize {type(op).__name__}: scipy has no batched-sparse "


I'm not sure this advice is actionable. Blockwise will still fail even if sparse input has no batch dims no? At the very least it tries to add dummy expand dims. There's an issue open about this IIRC

ricardoV94 · 2026-06-03T13:13:20Z

+        xb_val = rng.normal(size=(*batch_shape, 3, 5)).astype("float64")
+        out = pytensor.function([xb], yb)(xb_val)
+
+        S_dense = np.eye(3)


isn't this just expected = np.eye(3) @ xb_val?
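The simplification suggested here relies on NumPy's matmul broadcasting: a 2D left operand is applied to every trailing matrix of a stacked right operand. A small sketch, assuming the test fills `expected` with a per-index loop as the surrounding diff suggests (the `batch_shape` value is chosen arbitrarily for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
batch_shape = (2, 4)  # arbitrary illustrative batch shape
xb_val = rng.normal(size=(*batch_shape, 3, 5))
S_dense = np.eye(3)

# Loop form: one 2D matmul per batch index.
looped = np.empty_like(xb_val)
for idx in np.ndindex(*batch_shape):
    looped[idx] = S_dense @ xb_val[idx]

# Broadcast form: (3, 3) @ (..., 3, 5) applies S_dense to each trailing matrix.
broadcast = S_dense @ xb_val

np.testing.assert_allclose(looped, broadcast)
```

Both forms produce an array of shape `(*batch_shape, 3, 5)`, so the loop can be replaced by the single broadcast expression.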

ricardoV94 · 2026-06-03T13:14:33Z

+            expected[idx] = S_dense @ xb_val[idx]
+        np.testing.assert_allclose(out, expected)
+
+    def test_structured_dot_no_batch_is_noop(self):


Remove in favor of batch_dims=0 parametrization above?

jaanerik force-pushed the sparse-vectorize-dispatchers branch from 1851ec5 to 0659b8f Compare May 29, 2026 18:31

ricardoV94 reviewed May 29, 2026


ricardoV94 added the enhancement New feature or request label May 29, 2026

jaanerik and others added 2 commits June 2, 2026 12:58

jaanerik force-pushed the sparse-vectorize-dispatchers branch from 0659b8f to be177e0 Compare June 2, 2026 10:02

Clean comment separators

ddd2049

jaanerik force-pushed the sparse-vectorize-dispatchers branch from be177e0 to ddd2049 Compare June 3, 2026 08:21

ricardoV94 reviewed Jun 3, 2026


jaanerik wants to merge 3 commits into pymc-devs:main from jaanerik:sparse-vectorize-dispatchers

2 participants
