-
Notifications
You must be signed in to change notification settings - Fork 811
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add GPU placement validation before starting rollout engines
#1934
opened May 21, 2026 by
fmh66
Loading…
[2/N] Support training with variable global batch size
run-ci-megatron
#1933
opened May 21, 2026 by
zhuzilin
Contributor
Loading…
fix: avoid applying rollout temperature to critic values
#1928
opened May 21, 2026 by
Baiyu-Su
Loading…
fix: quote nvidia-modelopt requirement in build_conda.sh
#1927
opened May 20, 2026 by
zhiminwei551
Loading…
feat: add SFT entropy logging and validation loss monitoring
#1925
opened May 19, 2026 by
none0663
Contributor
Loading…
[examples] add coding_agent_rl: agent-in-sandbox RL minimal demo
#1923
opened May 19, 2026 by
jingshenghang
Collaborator
Loading…
fix(debug): auto-append rollout_id/rank in save_debug_train_data path template
#1922
opened May 19, 2026 by
wlf-darkmatter
Loading…
Fix RolloutManager reward normalization for uneven rollout groups
#1918
opened May 18, 2026 by
haoyang9804
Loading…
feat: add --max-checkpoint-count to limit saved checkpoints
#1914
opened May 16, 2026 by
JIANG54864
Loading…
fix: use getattr for sglang_speculative_algorithm to avoid AttributeError
#1913
opened May 15, 2026 by
none0663
Contributor
Loading…
Support custom rollout-proxy TIS hooks in bypass mode
#1912
opened May 15, 2026 by
sjtushenhai
Loading…
fix: add eval-before-train to train_async.py (parity with train.py)
#1906
opened May 13, 2026 by
Taosheng-ty
Loading…
4 tasks done
feat: filter logits by loss_mask before log_probs/entropy computation
#1905
opened May 13, 2026 by
Taosheng-ty
Loading…
5 of 6 tasks
fix: preserve fused 3D expert tensors for Qwen3.5 MoE in torch_dist→H…
#1904
opened May 12, 2026 by
rouchenzi
Loading…
fix: restore actor weights after loading OPD teacher checkpoint
#1903
opened May 12, 2026 by
canlin03
Loading…
Neutralize zero-advantage samples to skip wasted forward compute
#1901
opened May 11, 2026 by
nanjiangwill
Collaborator
Loading…
fix: add fallback for --save-hf when Megatron-Bridge lacks model support
#1881
opened Apr 30, 2026 by
WangHong-yang
Contributor
Loading…
3 tasks done
feat(profile): safer torch.profiler defaults + per-grad-step capture
#1879
opened Apr 29, 2026 by
leofan-lab
Contributor
Loading…
Add Megatron-Bridge LoRA support for GRPO actor training
#1865
opened Apr 26, 2026 by
taivu1998
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.