Fix cumprod gradient returning NaN when input contains zeros #1911

WHOIM1205 wants to merge 2 commits into pymc-devs:main
Conversation
hey @jessegrabowski and @ricardoV94, the implementation has been rewritten using a division-free formulation, and regression tests covering zero cases (including multi-dimensional inputs) have been added. All existing tests pass.
ricardoV94
left a comment
Yeah, I don't think we're going with a scan for the gradient. Cumprod is pretty rare, and I've never heard of people having issues with it or its gradient.
Sometimes we need convenience at the expense of edge cases.
Signed-off-by: WHOIM1205 <rathourprateek8@gmail.com>
3561c36 to e13ffa0
Thanks for the earlier feedback about avoiding scan; that makes sense. I've reworked the implementation to keep it lightweight and removed scan entirely. The gradient now uses only cumprod, cumsum, and simple masking logic to handle zeros safely. It avoids division-by-zero while still matching the correct mathematical behavior for single and multiple zero cases. The graph stays simple and NUMBA-compatible, and the sparse / typed_list tests pass as well. Let me know if you'd like it simplified further or adjusted in any way.
```python
# Zero at the beginning
result = f(np.array([0.0, 2.0, 3.0]))
expected = np.array([9.0, 0.0, 0.0])
```
Shouldn't the gradient wrt x[0] be 1.0?
```diff
- expected = np.array([9.0, 0.0, 0.0])
+ expected = np.array([1.0, 0.0, 0.0])
```
Ah it affects the subsequent outputs too. Can you also include the case of all zeros?
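For the record, the disputed `[9.0, 0.0, 0.0]` expectation (and the all-zeros case requested here) can be checked independently of PyTensor with central finite differences on `sum(cumprod(x))`; `fd_grad` below is just an illustrative helper, not part of the PR:

```python
import numpy as np

def fd_grad(x, eps=1e-6):
    """Central-difference gradient of sum(cumprod(x))."""
    out = np.empty_like(x)
    for i in range(len(x)):
        xp, xm = x.copy(), x.copy()
        xp[i] += eps
        xm[i] -= eps
        out[i] = (np.cumprod(xp).sum() - np.cumprod(xm).sum()) / (2 * eps)
    return out

# x[0] multiplies every cumulative product, so its gradient is
# 1 + x[1] + x[1]*x[2] = 1 + 2 + 6 = 9, not 1.
print(fd_grad(np.array([0.0, 2.0, 3.0])))  # approx [9., 0., 0.]
print(fd_grad(np.zeros(3)))                # approx [1., 0., 0.]
```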
```python
f = pytensor.function([x], g)

# Single zero in the middle
result = f(np.array([1.0, 0.0, 2.0]))
```
Can you avoid 1's and 2's in the tests? I think it's more robust as it avoids the mul identity or the equality between * and +
```diff
- result = f(np.array([1.0, 0.0, 2.0]))
+ result = f(np.array([3.0, 0.0, 3.0]))
```
Problem

`CumOp.L_op` computed the gradient of `cumprod` using a division-based formula. When `x[i] == 0`, this produced `0 / 0 = NaN`, silently corrupting the gradient. The NaN propagates through the computation graph and can break optimization or MCMC without any clear indication that `cumprod` is the source.

This is a real-world issue, since zeros commonly appear in probability masks, indicator variables, ReLU outputs, and sparse data.
The existing tests did not catch this because they only used random inputs in `(0, 1)`, which never include zeros.

Root Cause
The gradient formula relied on dividing by `x`, which is only valid when all elements are nonzero. Unlike `Prod.L_op`, which implements explicit zero-handling logic, `CumOp.L_op` did not account for zero values.

Fix
Replaced the division-based implementation with a mathematically equivalent division-free formulation.

For each position `i`, the gradient is `grad[i] = L[i] * R[i]`, where:

- `L[i]` = exclusive prefix product (`prod(x[0:i])`)
- `R[i]` = reverse linear recurrence `R[i] = g[i] + x[i+1] * R[i+1]`

This approach:

- avoids any division by `x`, so zeros never produce NaN
- uses only `cumprod`, `cumsum`, and simple masking, keeping the graph simple, scan-free, and NUMBA-compatible
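The recurrence can be sketched as a NumPy reference implementation (an explicit loop for clarity; the PR itself stays vectorized with `cumprod`/`cumsum` and masking, and `cumprod_grad_safe` is a hypothetical name):

```python
import numpy as np

def cumprod_grad_safe(x, g):
    """Division-free gradient of cumprod via grad[i] = L[i] * R[i]."""
    # L[i]: exclusive prefix product prod(x[0:i]), with L[0] = 1
    L = np.concatenate(([1.0], np.cumprod(x)[:-1]))
    # R[i]: reverse recurrence R[i] = g[i] + x[i+1] * R[i+1]
    R = np.empty_like(x)
    R[-1] = g[-1]
    for i in range(len(x) - 2, -1, -1):
        R[i] = g[i] + x[i + 1] * R[i + 1]
    return L * R

x = np.array([1.0, 0.0, 2.0])
g = np.ones_like(x)  # gradient of sum(cumprod(x)) w.r.t. each output
print(cumprod_grad_safe(x, g))  # [1., 3., 0.] -- finite at the zero
```

The all-zeros case the reviewer asked about also comes out finite here, since no division ever occurs.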
Changes

- Updated `CumOp.L_op` in `pytensor/tensor/extra_ops.py`
- Added regression tests in `tests/tensor/test_extra_ops.py`
Tests Added

New tests cover zeros at the beginning and in the middle of the input, multiple zeros, and multi-dimensional inputs. All existing tests pass, and gradients are now correct for inputs containing zeros.
Impact

This PR fixes a clear correctness bug in `cumprod` gradient computation, restoring correct `cumprod` gradients for inputs containing zeros.