[Feature] Layer-wise MoE auxiliary loss (split finalize) and optional async router D2H offload #1528

Open
tina-wen wants to merge 1 commit into InternLM:main from tina-wen:split_bal_loss

Commits

Commits on Apr 27, 2026