Fix literal promotion #4826
Conversation
CharlieL7 left a comment:
This could affect accuracy. Another way to handle this that could preserve accuracy would be to promote the parameter to the wider format and then convert down the output. It's not clear which way would be more reasonable. Feels like a bug in the input model.
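For illustration, a rough sketch of that alternative: widen the fp16 side, compute at fp32, then convert the result back down. The helper name and argument variables below are assumptions, not code from this PR:

```cpp
#include <migraphx/instruction.hpp>
#include <migraphx/make_op.hpp>
#include <migraphx/module.hpp>
#include <migraphx/shape.hpp>

using namespace migraphx;

// Hypothetical helper: promote the half input to float, run the op at
// the wider precision, then narrow the result back to half.
instruction_ref widen_compute_narrow(module& m,
                                     instruction_ref ins,
                                     instruction_ref half_in,
                                     instruction_ref float_in)
{
    auto widened = m.insert_instruction(
        ins, make_op("convert", {{"target_type", shape::float_type}}), half_in);
    auto out = m.insert_instruction(ins, make_op("dot"), widened, float_in);
    return m.insert_instruction(
        ins, make_op("convert", {{"target_type", shape::half_type}}), out);
}
```

This keeps the intermediate computation at fp32 precision, at the cost of an extra convert on the output.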
```cpp
if(not args[5]->is_undefined() and
   args[5]->get_shape().type() != args[1]->get_shape().type())
{
    args[5] = info.add_instruction(
        make_op("convert", {{"target_type", args[1]->get_shape().type()}}), args[5]);
}
```
Why is there a specific change for GRU?
So, for this specific model, the literal change alone only fixed the `migraphx-driver read` command; compile was still failing with a "types do not match" error. The fix is in parse_gru.cpp because the type mismatch is introduced by the rewrite_rnn pass, which decomposes the GRU op into dot operations using `m.insert_instruction(make_op("dot"), ...)` directly, bypassing `add_common_op` and its type reconciliation. I agree this is odd behaviour; I will send you the model via Teams so you can get a better understanding of it.
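For illustration, a minimal sketch of that contrast (variable names and the exact decomposition step are assumptions, not the actual rewrite_rnn code; see common.hpp for the reconciliation helpers):

```cpp
#include <migraphx/common.hpp>
#include <migraphx/instruction.hpp>
#include <migraphx/make_op.hpp>
#include <migraphx/module.hpp>

using namespace migraphx;

// xt is assumed half-typed, w float-typed.
instruction_ref decompose_dot(module& m,
                              instruction_ref ins,
                              instruction_ref xt,
                              instruction_ref w)
{
    // Direct insertion, as in the rewrite_rnn decomposition, performs no
    // type reconciliation, so the half/float mix later fails shape
    // checking with "types do not match":
    // return m.insert_instruction(ins, make_op("dot"), xt, w);

    // The common-op helper inserts converts to the common type first:
    return insert_common_op(m, ins, make_op("dot"), {xt, w});
}
```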
```cpp
else
{
    auto common = common_shape(to_shapes(inputs));
    if(options.common_type)
```
Add a comment explaining the reasoning for this added code block
Agree. But it should only affect large scalar values for accuracy; all other cases should be fine? On the model bug I also agree, it's a little odd, especially since DML also has a problem with the model. But the ONNX format says it's fine. I can share the model with you via Teams. There are a few models failing with similar errors (type mismatches).
```cpp
if(options.common_type)
{
    auto c_type = compute_common_types(input_shapes);
    std::transform(inputs.begin(), inputs.end(), inputs.begin(), [&](auto input) {
        if(input->get_shape().type() != c_type)
        {
            input = m.insert_instruction(
                ins, make_op("convert", {{"target_type", c_type}}), input);
        }
        return input;
```
Dynamic branch should have the same behavior when handling common_type
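Presumably something along these lines in the dynamic-shape branch, mirroring the static branch quoted above (a sketch only; the surrounding variables are taken from the quoted diff and may differ in the actual code):

```cpp
auto common = common_shape(to_shapes(inputs));
if(options.common_type)
{
    auto c_type = compute_common_types(to_shapes(inputs));
    std::transform(inputs.begin(), inputs.end(), inputs.begin(), [&](auto input) {
        if(input->get_shape().type() != c_type)
        {
            // Insert the same converts as the static-shape branch so both
            // paths agree on the promoted type.
            input = m.insert_instruction(
                ins, make_op("convert", {{"target_type", c_type}}), input);
        }
        return input;
    });
}
```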
I'm also curious about these models because the GRU spec for ONNX doesn't seem to allow mixed precision like this (i.e. I don't see how this could come from a valid ONNX model). It might be worth investigating the source of these mixed inputs before applying a fix like this.
```cpp
        if(tensor_type != common.type())
            common = shape{tensor_type, common.lens()};
    }
}
```
This logic should not be used. The common type should match the logic used in C++. There shouldn't be exceptions for literals, which make it more difficult to reason about the logic.
If the model mixes an fp32 with an fp16 and the result should be fp16 according to the ONNX spec, then common type promotion shouldn't be used there.
Motivation
When parsing ONNX models with half-precision (float16) weights, scalar float32 constants (e.g. epsilon in LayerNorm) were causing unintended type promotion of entire computation paths from float16 to float32. This resulted in a type mismatch error when the promoted float32 tensor reached a downstream operation (e.g. convolution or dot) whose weights remained float16:

```
same_type: convolution: Types do not match
```
Technical Details
Before applying the common type, inputs that are scalar literals (single-element and evaluable) are excluded from the type decision. If the remaining tensor inputs share a type, that type takes precedence over the scalar's type, and the scalar is cast down to match.
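A minimal sketch of that rule, assuming MIGraphX's `shape::elements()` and `instruction::can_eval()` are what detect scalar literals; the helper names below are hypothetical and the surrounding selection logic is simplified:

```cpp
#include <algorithm>
#include <vector>
#include <migraphx/instruction.hpp>
#include <migraphx/shape.hpp>

using namespace migraphx;

// A scalar literal: a single-element input whose value can be evaluated
// at compile time (e.g. the fp32 epsilon constant in LayerNorm).
bool is_scalar_literal(instruction_ref i)
{
    return i->get_shape().elements() == 1 and i->can_eval();
}

// Hypothetical type decision: scalar literals do not vote. If the
// remaining tensor inputs agree on one type, that type wins and the
// scalars are later converted down to it.
shape::type_t pick_common_type(const std::vector<instruction_ref>& inputs,
                               shape::type_t fallback)
{
    std::vector<shape::type_t> tensor_types;
    for(const auto& input : inputs)
        if(not is_scalar_literal(input))
            tensor_types.push_back(input->get_shape().type());
    if(not tensor_types.empty() and
       std::all_of(tensor_types.begin(), tensor_types.end(), [&](auto t) {
           return t == tensor_types.front();
       }))
        return tensor_types.front();
    return fallback; // otherwise use the usual common-type computation
}
```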
Changelog Category
Add a `CHANGELOG.md` entry for any option other than `Not Applicable`.