Arm backend: Add Ethos-U FVP tests for MLPerf Tiny models by tirwu01 · Pull Request #18225 · pytorch/executorch

tirwu01 · 2026-03-17T10:18:41Z

Add model definitions and Arm backend tests for four MLPerf Tiny benchmark models: ResNet8, DS-CNN, Deep AutoEncoder, and MobileNetV1-0.25.

Model definitions are placed under examples/models/mlperf_tiny/. Each model has tests for tosa_FP, tosa_INT, u55_INT and u85_INT pipelines in backends/arm/test/models/.

Notable model adaptations for Arm delegation:

Deep AutoEncoder: Fuse Linear + BatchNorm1d pairs before export since the TOSA quantizer only annotates conv + batch_norm patterns.
DS-CNN: Replace AvgPool2d(24, 5) with AdaptiveAvgPool2d(1) to satisfy the Ethos-U55 stride <= 3 constraint; the DecomposeAdaptiveAvgPool2dPass decomposes it into stride-1 pools.

Change-Id: I8dbf5e8a4b80996faab9f850c21740899f6b36fd

Summary

[PLEASE REMOVE] See CONTRIBUTING.md's Pull Requests for ExecuTorch PR guidelines.

[PLEASE REMOVE] If this PR closes an issue, please add a Fixes #<issue-id> line.

[PLEASE REMOVE] If this PR introduces a fix or feature that should be the upcoming release notes, please add a "Release notes: " label. For a list of available release notes labels, check out CONTRIBUTING.md's Pull Requests.

Test plan

[PLEASE REMOVE] How did you test this PR? Please write down any manual commands you used and note down tests that you have written if applicable.

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

Add model definitions and Arm backend tests for four MLPerf Tiny benchmark models: ResNet8, DS-CNN, Deep AutoEncoder, and MobileNetV1-0.25. Model definitions are placed under examples/models/mlperf_tiny/. Each model has tests for tosa_FP, tosa_INT, u55_INT and u85_INT pipelines in backends/arm/test/models/. Notable model adaptations for Arm delegation: - Deep AutoEncoder: Fuse Linear + BatchNorm1d pairs before export since the TOSA quantizer only annotates conv + batch_norm patterns. - DS-CNN: Replace AvgPool2d(24, 5) with AdaptiveAvgPool2d(1) to satisfy the Ethos-U55 stride <= 3 constraint; the DecomposeAdaptiveAvgPool2dPass decomposes it into stride-1 pools. Change-Id: I8dbf5e8a4b80996faab9f850c21740899f6b36fd Signed-off-by: Tirui Wu <tirui.wu@arm.com>

pytorch-bot · 2026-03-17T10:18:45Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18225

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 1 Unrelated Failure

As of commit 03c86e3 with merge base c81126e ():

NEW FAILURES - The following jobs have failed:

Apple / build-demo-ios / macos-job (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 65
Test CUDA Builds / check-all-cuda-builds (gh)
Process completed with exit code 1.
trunk / unittest-release / windows / windows-job (gh)
Process completed with exit code 1.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

Test CUDA Builds / test-executorch-cuda-build-12.8 / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

tirwu01 · 2026-03-17T10:19:00Z

@pytorchbot label ciflow/trunk

zingo · 2026-03-17T10:53:25Z

Hi @digantdesai @rascani and @psiddh this adds a few new mpu nice models to the example folder and need a Meta review 🙏 🙂

digantdesai · 2026-03-17T17:34:50Z

backends/arm/test/models/test_mobilenet_v1_025.py

+}
+
+
+def test_mobilenet_v1_025_tosa_FP():


Do they add any more coverage besides what we have for Mobilenet? Let's just add them only in examples if they aren't too different from what we already have, rationale is the CI job freq

Hi, MobileNetV1-0.25 is a distinct model from MobileNetV2/V3 — it's the specific architecture used in the MLPerf Tiny.These four models (ResNet8, DS-CNN, Deep AutoEncoder, MobileNetV1-0.25) are the standard MLPerf Tiny benchmark suite and are tested together as a set.

digantdesai · 2026-03-17T17:35:05Z

backends/arm/test/models/test_ds_cnn.py

+}
+
+
+def test_ds_cnn_tosa_FP():


same here as MV1

digantdesai · 2026-03-17T17:35:12Z

backends/arm/test/models/test_deep_autoencoder.py

+}
+
+
+def test_deep_autoencoder_tosa_FP():


same here as MV1

digantdesai · 2026-03-17T17:35:18Z

backends/arm/test/models/test_resnet8.py

+}
+
+
+def test_resnet8_tosa_FP():


same here as MV1

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 17, 2026

pytorch-bot bot added the ciflow/trunk label Mar 17, 2026

tirwu01 added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Mar 17, 2026

tirwu01 requested review from SaoirseARM and mansnils March 17, 2026 10:20

tirwu01 added the release notes: none Do not include this in the release notes label Mar 17, 2026

rascani approved these changes Mar 17, 2026

View reviewed changes

digantdesai reviewed Mar 17, 2026

View reviewed changes

backends/arm/test/models/test_ds_cnn.py

}

def test_ds_cnn_tosa_FP():

Copy link

Contributor

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here as MV1

digantdesai reviewed Mar 17, 2026

View reviewed changes

backends/arm/test/models/test_deep_autoencoder.py

}

def test_deep_autoencoder_tosa_FP():

Copy link

Contributor

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here as MV1

digantdesai reviewed Mar 17, 2026

View reviewed changes

backends/arm/test/models/test_resnet8.py

}

def test_resnet8_tosa_FP():

Copy link

Contributor

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here as MV1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Arm backend: Add Ethos-U FVP tests for MLPerf Tiny models#18225

Arm backend: Add Ethos-U FVP tests for MLPerf Tiny models#18225
tirwu01 wants to merge 1 commit intopytorch:mainfrom
tirwu01:mlperf-tiny-models

tirwu01 commented Mar 17, 2026 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Mar 17, 2026 •

edited

Loading

Uh oh!

tirwu01 commented Mar 17, 2026

Uh oh!

zingo commented Mar 17, 2026

Uh oh!

digantdesai Mar 17, 2026

Uh oh!

tirwu01 Mar 18, 2026 •

edited

Loading

Uh oh!

digantdesai Mar 17, 2026

Uh oh!

digantdesai Mar 17, 2026

Uh oh!

digantdesai Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

tirwu01 commented Mar 17, 2026 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot bot commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18225

❌ 3 New Failures, 1 Unrelated Failure

Uh oh!

tirwu01 commented Mar 17, 2026

Uh oh!

zingo commented Mar 17, 2026

Uh oh!

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

tirwu01 Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

digantdesai Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

tirwu01 commented Mar 17, 2026 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Mar 17, 2026 •

edited

Loading

tirwu01 Mar 18, 2026 •

edited

Loading