Skip to content

Enable default MoE expert replay for Megatron train/inference parity#701

Draft
FurtherAI wants to merge 274 commits into
mainfrom
austin/train_inf_mismatch
Draft

Enable default MoE expert replay for Megatron train/inference parity#701
FurtherAI wants to merge 274 commits into
mainfrom
austin/train_inf_mismatch

Commits

This pull request is big! We're only showing the most recent 250 commits

Commits on Apr 13, 2026

Commits on Apr 14, 2026

Commits on Apr 22, 2026

Commits on Apr 24, 2026

Commits on Apr 27, 2026

Commits on Apr 28, 2026

Commits on Apr 30, 2026

Commits on May 1, 2026

Commits on May 3, 2026

Commits on May 4, 2026

Commits on May 6, 2026

Commits on May 7, 2026

Commits on May 9, 2026

Commits on May 10, 2026

Commits on May 11, 2026

Commits on May 12, 2026

Commits on May 13, 2026

Commits on May 18, 2026

Commits on May 20, 2026

Commits on May 21, 2026

Commits on May 24, 2026

Commits on May 25, 2026

Commits on May 27, 2026