Pull requests: THUDM/slime

Pull requests list

Add Trackio rollout trace logging
#1935 opened May 21, 2026 by abidlabs
[2/N] Support training with variable global batch size (label: run-ci-megatron)
#1933 opened May 21, 2026 by zhuzilin (Contributor)
Feat/minimax m2.5 support
#1929 opened May 21, 2026 by xs1997zju
fix: avoid applying rollout temperature to critic values
#1928 opened May 21, 2026 by Baiyu-Su
feat: add SFT entropy logging and validation loss monitoring
#1925 opened May 19, 2026 by none0663 (Contributor)
[examples] add coding_agent_rl: agent-in-sandbox RL minimal demo
#1923 opened May 19, 2026 by jingshenghang (Collaborator)
fix: make OPSM reject whole off-policy sequences
#1917 opened May 18, 2026 by haoyang9804
fix: use getattr for sglang_speculative_algorithm to avoid AttributeError
#1913 opened May 15, 2026 by none0663 (Contributor)
Support custom rollout-proxy TIS hooks in bypass mode
#1912 opened May 15, 2026 by sjtushenhai
[docs] fix reverse KL formula
#1911 opened May 14, 2026 by underspirit
fix: add eval-before-train to train_async.py (parity with train.py)
#1906 opened May 13, 2026 by Taosheng-ty
4 tasks done
feat: filter logits by loss_mask before log_probs/entropy computation
#1905 opened May 13, 2026 by Taosheng-ty
5 of 6 tasks
Neutralize zero-advantage samples to skip wasted forward compute
#1901 opened May 11, 2026 by nanjiangwill (Collaborator)
Add SwanLab tracking support
#1898 opened May 9, 2026 by asckaya
[docker] upgrade to v0.5.11 (label: run-ci-image)
#1892 opened May 6, 2026 by zhuzilin (Contributor)
fix: add fallback for --save-hf when Megatron-Bridge lacks model support
#1881 opened Apr 30, 2026 by WangHong-yang (Contributor)
3 tasks done
feat(profile): safer torch.profiler defaults + per-grad-step capture
#1879 opened Apr 29, 2026 by leofan-lab (Contributor)