Skip to content

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[bugfix] fix qwen_vl_utils torchvision base64
#7004 opened Dec 11, 2025 by Jintao-Huang Loading…
[train] support embeding/reranker packing
#6987 opened Dec 10, 2025 by Jintao-Huang Loading…
collect npu profiling data
#6977 opened Dec 10, 2025 by OneMondy Loading…
1 of 4 tasks
[feat] Add Support Cut-Cross-Entropy (CCE)
#6971 opened Dec 9, 2025 by w1ida Loading…
[megatron] Update megatron shells
#6967 opened Dec 9, 2025 by Jintao-Huang Loading…
support deepspeed elastic
#6955 opened Dec 8, 2025 by meichangsu1 Loading…
2 of 4 tasks
[WIP] [v4] refactor model_type & template
#6944 opened Dec 8, 2025 by Jintao-Huang Loading…
add muon clip optimizer
#6662 opened Nov 19, 2025 by vx120 Loading…
1 task
Add conditional distillation support for GKD trainer
#6542 opened Nov 11, 2025 by woshixiaobai2019 Loading…
3 tasks
[WIP][Exp]Support ray dpo
#6395 opened Nov 1, 2025 by tastelikefeet Loading…
1 of 4 tasks
[megatron] update megatron_args default_val
#6252 opened Oct 22, 2025 by Jintao-Huang Loading…
feat: Enable for exporting unmerged HF Lora Adapter
#6225 opened Oct 20, 2025 by jason9693 Loading…
1 of 4 tasks
[WIP] refactor template
#6085 opened Oct 11, 2025 by Jintao-Huang Loading…
update docs
#5691 opened Sep 6, 2025 by Jintao-Huang Loading…
[model] update minicpmv-4.5 video processor stale
#5679 opened Sep 5, 2025 by hjh0119 Loading…
Bug fix: eval OOM due to deepcopy of torch model stale
#5607 opened Aug 29, 2025 by hellopahe Loading…
1 task done
[init]support gptq grpo in colocate mode stale
#5569 opened Aug 27, 2025 by ItGirls Loading…
1 of 4 tasks
Update dataset_info.json stale
#3723 opened Mar 31, 2025 by sandeep-sm Loading…
3 tasks
[WIP] support reasoning_content
#3159 opened Feb 18, 2025 by Jintao-Huang Loading…
loss_scale bug when meeting <image>
#3036 opened Feb 8, 2025 by mangoyuan Draft
1 of 4 tasks
add example OCRBench dataset
#2677 opened Dec 17, 2024 by ex-yanminmin001 Loading…
3 tasks
ProTip! Adding no:label will show everything without a label.