Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

geo3k VLM multi-turn megatron update
#1378 opened Jan 11, 2026 by gxlvera Loading…
[FSDP] Add argument validation for FA3 with cp
#1367 opened Jan 9, 2026 by Beichen-Ma Loading…
Add OSWorld VLM training cookbook and integration
#1364 opened Jan 8, 2026 by jbarnes850 Loading…
fix: fix sglang regression
#1363 opened Jan 8, 2026 by nanjiangwill Loading…
Feat: multi-threads data fetching for sft data
#1355 opened Jan 7, 2026 by UbeCc Loading…
[FSDP][Fix] Fix redundant import
#1354 opened Jan 7, 2026 by Hecate0821 Loading…
[WIP] add fault torlance
#1311 opened Jan 3, 2026 by lilei199908 Loading…
[data][feat] add large dataset support
#1298 opened Dec 31, 2025 by SwordFaith Loading…
Handle deepscaler answers without markers
#1226 opened Dec 26, 2025 by cklxx Loading…
Add Qwen3-Coder-30B-A3B-Instruct model script
#1213 opened Dec 25, 2025 by maoquan-ms Loading…
Megatron VLM Support (Qwen2.5-VL series) (3/N)
#1210 opened Dec 25, 2025 by Zhuohao-Li Loading…
Fix ruff hook and update pre-commit hooks
#1206 opened Dec 24, 2025 by ParagEkbote Loading…
Integrate Sonic-Moe in FSDP
#1176 opened Dec 22, 2025 by ChangyiYang Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.