Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[None][fix] Run DeepSeek V4 gate test on CUDA
#13932 opened May 9, 2026 by lfr-0531 Collaborator
1 task done
[TRTLLM-35237][feat] Add cute dsl FP4 paged MQA logits decode kernel
#13929 opened May 9, 2026 by limin2021 Collaborator
1 task
[None][chore] Add long seq test for DSV4.
#13928 opened May 9, 2026 by Tracin Collaborator
1 task done
[None][test] Waive 2 failed cases for main in QA CI
#13927 opened May 9, 2026 by xinhe-nv Collaborator Draft
[TRTLLM-12440][feat] Add GMS-only weight sharing support
#13926 opened May 9, 2026 by chienchunhung Collaborator Draft
1 task done
[None][fix] Fix and unwaive AutoDeploy accuracy tests
#13925 opened May 8, 2026 by bmarimuthu-nv Collaborator
1 task done
[None][fix] Fix accuracy regression in DeepSeek models
#13924 opened May 8, 2026 by taylor-yb-lee Collaborator
1 task done
[None][infra] Check license with both isPermissive and isProprietary flags
#13921 opened May 8, 2026 by yuanjingx87 Collaborator
1 task
[TRTLLM-12339][feat] Support T5 encoder-decoder models in the PyTorch backend
#13919 opened May 8, 2026 by cascade812 Collaborator
1 task done
[None][doc] Add guide for integrating custom kernels in PyTorch backend
#13917 opened May 8, 2026 by chang-l Collaborator
5 tasks done
[https://nvbugs/6157892] [fix] MistralCommonImageProcessor text-only path
#13916 opened May 8, 2026 by evezhier Collaborator
1 task
[None][infra] Drop jupyter-server from dockerfile.
#13914 opened May 8, 2026 by tfogal
1 task done
[TRTLLM-12527][feat] Parallel LTX-2 LoRA weight loading
#13911 opened May 8, 2026 by yibinl-nvidia Collaborator
1 task done
[None][fix] only configure gc thresholds once
#13910 opened May 8, 2026 by ixlmar Collaborator
1 task done
[None][refactor] MoEScheduler split + MegaMoE EPLB / multi-chunk / CI integration
#13908 opened May 8, 2026 by xxi-nv Collaborator
2 tasks done
[None][chore] Remove glm_moe_dsa tokenizer WAR after Transformers 5.x upgrade
#13901 opened May 8, 2026 by longlee0622 Collaborator
1 task done