Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add common pile scripts complexity: high
#3902 opened Mar 17, 2026 by Phlip79 Loading…
5 tasks
Core 0.16
Sbak/ckpt migrate
#3899 opened Mar 17, 2026 by dimapihtar Draft
5 tasks
Split merge-main-into-dev: docs and CI changes
#3895 opened Mar 16, 2026 by ilml Draft
1 of 2 tasks
Split merge-main-into-dev: test updates
#3894 opened Mar 16, 2026 by ilml Draft
1 of 2 tasks
Split merge-main-into-dev: essential runtime changes
#3893 opened Mar 16, 2026 by ilml Draft
1 of 2 tasks
Support GEMM + Swiglu fused MLP mirror-to-main
#3890 opened Mar 16, 2026 by ksivaman Loading…
5 tasks
Core 0.16
Tokenizers simplification Run functional tests Run MBridge tests Attach this for testing this PR against MBridge main Run tests
#3889 opened Mar 16, 2026 by asolergi-nv Draft
5 tasks done
Core 0.16
Rename RL timers to be consistent complexity: medium
#3878 opened Mar 15, 2026 by tdene Loading…
5 tasks
Core 0.16
Parity with VLLM over the reasoning field Approved All necessary approvals have been made complexity: low
#3873 opened Mar 15, 2026 by tdene Loading…
5 tasks
Core 0.16
Patch EOD out of inference results Approved All necessary approvals have been made complexity: low Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review.
#3866 opened Mar 13, 2026 by tdene Loading…
5 tasks
Core 0.16
Merge main into dev complexity: high
#3865 opened Mar 13, 2026 by ilml Loading…
5 tasks
Allow untokenized messages when using the prevent tokenization mode. complexity: low Final Review PR is in the "final review" stage
#3864 opened Mar 13, 2026 by ArEsKay3 Loading…
5 tasks
Pass gracefully if token_id not found in message Approved All necessary approvals have been made
#3862 opened Mar 13, 2026 by i-riyad Loading…
5 tasks
Remove packed_attention_mask unused parameter complexity: low Final Review PR is in the "final review" stage
#3859 opened Mar 13, 2026 by tdene Loading…
5 tasks
Core 0.16
MFU tracking for inference complexity: medium
#3856 opened Mar 13, 2026 by tdene Loading…
5 tasks
Core 0.16
Separate save_checkpoint into per-type execution paths community-request needs-follow-up Issue needs follow-up
#3852 opened Mar 13, 2026 by Anmol202005 Loading…
5 tasks done
fix: Handle quantized CUDA tensors in async checkpoint writer complexity: low Final Review PR is in the "final review" stage
#3845 opened Mar 12, 2026 by sbak5 Loading…
5 tasks
Core 0.16
ProTip! Adding no:label will show everything without a label.