Lazy allocation for the coefficients in a readonly struct with a static constructor. Allocation and computation are done only once, the first time we invoke Erfinv. On the other hand, the 8 KB array will never be collected.
Codecov Report❌ Patch coverage is
Additional details and impacted files

```
@@            Coverage Diff             @@
##             main    #7569      +/-   ##
==========================================
- Coverage   69.02%   69.02%   -0.01%
==========================================
  Files        1482     1482
  Lines      274096   274099       +3
  Branches    28266    28266
==========================================
- Hits       189191   189187       -4
- Misses      77518    77526       +8
+ Partials     7387     7386       -1
```
Flags with carried forward coverage won't be shown.
There were no tests before. Should I add some?
rokonec left a comment
Thank you for identifying this issue and submitting a fix! The observation that coefficients are recomputed on every call is valid. However, I'm requesting changes for two reasons:
1. Missing use case justification
Erfinv() currently has zero callers in the codebase. Before optimizing it, we need a concrete use case demonstrating that frequent Erfinv calls are needed and that the current per-call cost is actually a bottleneck. Could you provide the scenario where you're hitting this?
2. If optimization is warranted, prefer the Probit-based approach
The existing Probit() function in the same class (ProbabilityFunctions.cs) already implements a high-quality rational polynomial approximation (Beasley-Springer-Moro). Since erfinv(x) = Probit((1+x)/2) / sqrt(2), the entire Taylor series can be eliminated:
```csharp
public static double Erfinv(double x)
{
    if (x > 1 || x < -1)
        return Double.NaN;
    if (x == 1)
        return Double.PositiveInfinity;
    if (x == -1.0)
        return Double.NegativeInfinity;

    return Probit((1.0 + x) / 2.0) / Math.Sqrt(2.0);
}
```

This approach is superior to caching the Taylor series coefficients because:
Performance (benchmarked with 100,000 values x 100 iterations)
| Approach | Time | Speedup vs Original |
|---|---|---|
| Original (per-call alloc) | ~128,512 ms (1 pass) | 1x |
| Cached coefficients (this PR) | 9,973 ms | ~1,290x |
| Probit-based | 96 ms | ~134,000x |
The Probit approach is ~104x faster than caching because it evaluates a degree-7 rational polynomial (~30 FP ops) instead of summing 1,000 series terms (~5,000 FP ops + 8 KB array traversal).
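The identity behind the rewrite can be sanity-checked without the C# codebase. Below is a hypothetical, self-contained Python sketch: it uses the standard library's normal quantile (`statistics.NormalDist().inv_cdf`) in place of `Probit`, so the numbers illustrate the identity, not the exact behavior of `ProbabilityFunctions.Probit`:

```python
# Sketch: erfinv via the normal quantile ("probit"), using
#   erfinv(x) = probit((1 + x) / 2) / sqrt(2)
# NormalDist().inv_cdf plays the role of Probit here (an assumption for
# illustration; the real PR would call the existing C# Probit).
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, stdev 1

def erfinv(x: float) -> float:
    if x > 1.0 or x < -1.0:
        return math.nan
    if x == 1.0:
        return math.inf
    if x == -1.0:
        return -math.inf
    return _STD_NORMAL.inv_cdf((1.0 + x) / 2.0) / math.sqrt(2.0)

# Round-trip check: erf(erfinv(x)) should recover x.
for x in (-0.9, -0.5, 0.0, 0.3, 0.75, 0.99):
    assert abs(math.erf(erfinv(x)) - x) < 1e-9
```

Because `inv_cdf` is an existing, heavily exercised quantile implementation, the sketch also mirrors the review's simplicity argument: the whole function is the boundary checks plus one delegation.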
Accuracy - the Taylor series diverges near +/-1
At 100K test values spanning (-1, 1), the 1000-term Taylor series produces incorrect results near the boundaries:
| Metric | Taylor series (Original & Cached) | Probit-based |
|---|---|---|
| Max round-trip error abs(Erf(Erfinv(x)) - x) | 2.57e-4 | 1.39e-7 |
| Max divergence from Probit at x ~ 0.99998 | 0.44 (series fails to converge) | -- |
The series approach gives ~1,850x worse accuracy at the tails.
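The tail behavior is easy to reproduce. The following hypothetical Python sketch builds the standard Maclaurin series for erfinv (the term count of 1000 matches the review's setup; the function names are illustrative, not taken from the C# source) and compares its round-trip error against a probit-based version built on `statistics.NormalDist().inv_cdf`:

```python
# Sketch: 1000-term Maclaurin series for erfinv vs a probit-based erfinv.
# The series uses the standard coefficient recurrence
#   c_0 = 1,  c_k = sum_{m=0}^{k-1} c_m * c_{k-1-m} / ((m + 1) * (2m + 1))
# with erfinv(z) = sum_k c_k / (2k + 1) * (sqrt(pi)/2 * z)^(2k + 1).
import math
from statistics import NormalDist

TERMS = 1000

def series_coefficients(n: int) -> list:
    c = [1.0]
    for k in range(1, n):
        c.append(sum(c[m] * c[k - 1 - m] / ((m + 1) * (2 * m + 1))
                     for m in range(k)))
    return c

_C = series_coefficients(TERMS)

def erfinv_series(z: float) -> float:
    w = math.sqrt(math.pi) / 2.0 * z
    return sum(_C[k] / (2 * k + 1) * w ** (2 * k + 1) for k in range(TERMS))

def erfinv_probit(z: float) -> float:
    return NormalDist().inv_cdf((1.0 + z) / 2.0) / math.sqrt(2.0)

def round_trip_error(f, z: float) -> float:
    return abs(math.erf(f(z)) - z)

# Near zero both approaches agree to high precision; at z = 0.99998 the
# truncated series falls visibly short while the probit route stays accurate.
assert round_trip_error(erfinv_series, 0.5) < 1e-9
assert round_trip_error(erfinv_probit, 0.99998) < 1e-8
assert round_trip_error(erfinv_series, 0.99998) > 1e-6
```

The gap arises because erfinv has a singularity at z = 1, so the series' convergence slows to a crawl near the boundary: any fixed truncation, no matter how many coefficients are cached, undershoots there.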
Simplicity
- No new types, no readonly struct, no #pragma suppressions, no 8 KB static array
- 5 lines of code delegating to an existing, well-tested function
- Zero additional memory
Summary
If a use case for frequent Erfinv calls is provided, the right fix is the one-liner delegating to Probit -- it's faster, more accurate, and simpler. The caching approach preserves the flawed Taylor series and adds structural complexity for a 104x slower result.
Happy to help with the implementation if you'd like to go this route!
Fixes: #7568
Before:
an array of 1000 doubles is allocated and computed on every Erfinv call
Now:
lazy allocation for the coefficients in a readonly struct with a static constructor;
allocation and computation are done only once, the first time we invoke Erfinv
We are excited to review your PR.
So we can do the best job, please check:
- "Fixes #nnnn" is in your description, to cause GitHub to automatically close the issue(s) when your PR is merged.