-
Notifications
You must be signed in to change notification settings - Fork 406
Pull requests: InternLM/xtuner
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Reduce JsonlDataset memory by using mmap array and use npy to store jsonl meta
#1604
opened Mar 19, 2026 by
jayhenry
Loading…
[Refactor] Reduce memory usage in HardPackDataset via shared memory
#1602
opened Mar 19, 2026 by
HAOCHENYE
Loading…
[Improve] Free routed experts ray obj ref to avoid memory leak
#1595
opened Mar 18, 2026 by
RangiLyu
Loading…
support replay buffer save and resume, save_hf in trainer
#1592
opened Mar 18, 2026 by
YanhuiDua
Loading…
[Improve] Optimize ReplayBuffer by using direct method calls
#1591
opened Mar 18, 2026 by
RangiLyu
Loading…
[Fix] Muon optimizer per-expert orthogonalization for MoE models
#1582
opened Mar 13, 2026 by
CyCle1024
Loading…
[Feature] Add Multi-Token Prediction (MTP) module implementation
#1572
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Refactor] Rename CELossContext to LMHeadLossContext and refactor loss context base class
#1571
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Feature] Add Multi-Token Prediction (MTP) module implementation
#1570
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Refactor] Refactor loss context API to support multiple loss types
#1569
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Feature] chunk actor logprob computation for memory saving
npu
#1555
opened Mar 10, 2026 by
tina-wen
Loading…
[Feature] Domino EP support and training optimizations for InternS1 Pro VL
blocked
#1528
opened Mar 3, 2026 by
tina-wen
Loading…
[Optimization] Incremental checkpoint save for dcp on torch 2.7.x (ARM CPU optimization)
npu
#1525
opened Mar 3, 2026 by
tina-wen
Loading…
[Feature] Offload optimizer states to CPU to reduce memory
#1524
opened Mar 3, 2026 by
tina-wen
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.