Conversation

@ApoorvaKalyani
Contributor

Proposed changes

Please describe the motivation behind the pull request, whether it enables a new feature or fixes a bug. If there are associated pull requests or issues, please link them to the pull request.

Checklist

Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.

  • I have added tests relevant to the introduced functionality, and the unit tests are passing locally
  • I have added the test to the REGRESSION_TESTS list defined at the top of tests/CMakeLists.txt, IF the test takes more than 30 seconds to run.
  • I have added inline documentation which helps the maintainers understand the motivation
  • I have removed the stale documentation which is no longer relevant after this pull request
  • (If this change is user-facing) I have added release notes which provide the end users with a brief summary of the improvement from this pull request
  • I have run clang-format on all changed files
  • Any dependent changes have been merged

Discussion

If this is a relatively large or complex change, feel free to start a discussion by explaining why you chose the solution you did and what alternatives you considered

@krithalith
Contributor

I had a look at the test failures myself and found the following tolerances to be sufficient for BF16 and F16, with both integer initialization and float initialization (a sketch of how these values might be wired into the tests follows the list):

INTEGER INITIALIZATION:

BF16
double rtol = 1e-1;
double atol = 5e-3;

F16
double rtol = 1.5e-3;
double atol = 7e-3;

FLOAT INITIALIZATION:

BF16
double rtol = 1e-1;
double atol = 7e-3;

F16
double rtol = 2e-3;
double atol = 1e-2;
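
As a point of reference, here is a minimal, self-contained sketch of how these (rtol, atol) pairs could be selected per data type and initialization mode. The `DataType`/`InitMode` enums and the `select_tolerances` helper are hypothetical names for illustration; the actual tests would forward the chosen values to the repository's check_err utility rather than use this helper.

```cpp
#include <cassert>
#include <utility>

// Hypothetical enums for illustration only; the real tests pick the data
// type and initialization mode through their own mechanisms.
enum class DataType { BF16, F16 };
enum class InitMode { Integer, Float };

// Return the (rtol, atol) pair proposed above for each combination.
std::pair<double, double> select_tolerances(DataType dt, InitMode init)
{
    if(init == InitMode::Integer)
        return dt == DataType::BF16 ? std::pair{1e-1, 5e-3} : std::pair{1.5e-3, 7e-3};
    return dt == DataType::BF16 ? std::pair{1e-1, 7e-3} : std::pair{2e-3, 1e-2};
}

int main()
{
    auto [rtol, atol] = select_tolerances(DataType::BF16, InitMode::Integer);
    assert(rtol == 1e-1 && atol == 5e-3);
    // In the actual test these values would be passed on to check_err(...).
}
```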

I think the bias + bnorm + clamp operation at the end just fundamentally magnifies small errors in the GEMM, so the tolerances simply need to be higher for this op.

Also, a 1e-3 relative tolerance for F16 (the default value in check_err) is very tight, since it is almost exactly a single F16 epsilon (2^-10 ≈ 9.8e-4). For BF16 the default of 1e-1 is suddenly a lot more lenient, even though the BF16 epsilon (2^-7 ≈ 7.8e-3) is only 8 times as large. Finally, check_err() adds up the relative and absolute tolerance budgets before comparing, which is dubious.
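
To make that last point concrete, here is a small sketch (not the library's actual implementation) contrasting a summed criterion of the kind described above with a stricter "either bound must hold on its own" check. With the summed form, an element can exceed both the absolute and the relative budget individually and still pass.

```cpp
#include <cmath>
#include <cstdio>

// Summed criterion as described above: the absolute and relative budgets
// are added before comparing. (Illustrative only; not check_err itself.)
bool pass_summed(double out, double ref, double rtol, double atol)
{
    return std::abs(out - ref) <= atol + rtol * std::abs(ref);
}

// Stricter alternative: pass only if at least one budget is met on its own.
bool pass_either(double out, double ref, double rtol, double atol)
{
    const double err = std::abs(out - ref);
    return err <= atol || err <= rtol * std::abs(ref);
}

int main()
{
    const double rtol = 1e-3, atol = 1e-2;
    // The error of ~1.05e-2 exceeds both atol (1e-2) and rtol*|ref| (1e-3)...
    const double ref = 1.0, out = 1.0 + 1.05e-2;
    std::printf("summed: %d  either: %d\n",
                pass_summed(out, ref, rtol, atol),   // ...yet passes the summed check
                pass_either(out, ref, rtol, atol));  // while failing the stricter one
}
```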
