Skip to content

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[bugfix] fix mllm megatron gkd sft_alpha
#8394 opened Mar 21, 2026 by hjh0119 Loading…
[megatron] support muon
#8392 opened Mar 20, 2026 by Jintao-Huang Loading…
[megatron] support multimodal MTP
#8390 opened Mar 20, 2026 by Jintao-Huang Loading…
fix(megatron): destroy NCCL process groups on training exit
#8385 opened Mar 20, 2026 by inzamam-iqbal Loading…
2 tasks done
[megatron]add megatron log
#8348 opened Mar 16, 2026 by yangbofun Loading…
1 of 4 tasks
[megatron] support the fake distributed process group
#8347 opened Mar 16, 2026 by yangbofun Loading…
1 of 4 tasks
MTP Multimodal compatible
#8303 opened Mar 12, 2026 by jhvmhg Loading…
1 of 4 tasks
SymPO
#8245 opened Mar 9, 2026 by JiangWu0826 Loading…
1 of 4 tasks
Feature/ms swift custom
#8222 opened Mar 6, 2026 by LEWISZZZcc Loading…
4 tasks
[WIP] Moe kernel for qwen3 omni in ascend
#8214 opened Mar 5, 2026 by jiaqiw09 Loading…
1 of 4 tasks
feat: log grpo input images to wandb
#8157 opened Mar 2, 2026 by shunk031 Loading…
1 of 4 tasks
[feat] support frames packing for minicpmv4_5 video processing
#8046 opened Feb 13, 2026 by fanqiNO1 Loading…
2 of 4 tasks
Add QAT (Quantization-Aware Training) Support Callback
#8042 opened Feb 12, 2026 by y2logic Loading…
1 task done
ProTip! Adding no:label will show everything without a label.