Pull requests: THUDM/slime
Pull requests list
Fix: fix double-trim bug in entropy computation for last CP rank
#1377
opened Jan 10, 2026 by
Beichen-Ma
[1/X] Refactor: unify training backends, use general utils for Megatron and FSDP
run-ci-precision
run-ci-short
#1373
opened Jan 10, 2026 by
yueming-yuan
Draft
2 of 4 tasks
Fix: Apply loss mask to KL in REINFORCE++ returns calculation
#1372
opened Jan 9, 2026 by
kaysonyu
feat: add int4 reinforcement learning training support (Part3)
#1368
opened Jan 9, 2026 by
Gao016
[FSDP] Fix CP gradient sync in FSDP and mark as experimental
#1366
opened Jan 9, 2026 by
Hecate0821
feat(examples): add strands-sglang integration for agentic RL with TITO support
#1359
opened Jan 8, 2026 by
Lawhy
1 task done
[release] bump to v0.2.2
release
run-ci-ckpt
run-ci-fsdp
run-ci-megatron
#1345
opened Jan 6, 2026 by
zhuzilin
[Fix] Update deprecated sglang ep args in docs and scripts
#1344
opened Jan 6, 2026 by
coding-famer
[Feature] Add rollout concurrency argument for full async training
#1310
opened Jan 3, 2026 by
yitianlian
Feat(router): add oai interface support for router
#1203
opened Dec 24, 2025 by
ChangyiYang