[Feature] Layer-wise MoE auxiliary loss (split finalize) and optional async router D2H offload#1528
Open
tina-wen wants to merge 1 commit intoInternLM:mainfrom
Open
[Feature] Layer-wise MoE auxiliary loss (split finalize) and optional async router D2H offload#1528tina-wen wants to merge 1 commit intoInternLM:mainfrom
tina-wen wants to merge 1 commit intoInternLM:mainfrom
Commits
Commits on Apr 27, 2026
- committed
wentiange