Pull requests: NVIDIA/TensorRT-LLM
[None][fix] Run DeepSeek V4 gate test on CUDA
#13932
opened May 9, 2026 by
lfr-0531
Collaborator
1 task done
[TRTLLM-35237][feat] Add cute dsl FP4 paged MQA logits decode kernel
#13929
opened May 9, 2026 by
limin2021
Collaborator
1 task
[None][chore] Add long seq test for DSV4.
#13928
opened May 9, 2026 by
Tracin
Collaborator
1 task done
[TRTLLM-12440][feat] Add GMS-only weight sharing support
#13926
opened May 9, 2026 by
chienchunhung
Collaborator
Draft
1 task done
[None][fix] Fix and unwaive AutoDeploy accuracy tests
#13925
opened May 8, 2026 by
bmarimuthu-nv
Collaborator
1 task done
[None][fix] Fix accuracy regression in DeepSeek models
#13924
opened May 8, 2026 by
taylor-yb-lee
Collaborator
1 task done
[https://nvbugs/6159129][fix] Added an FP8_BLOCK_SCALES + extra_acc_spec=tp_attn reference entry (accuracy 92.
#13923
opened May 8, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[https://nvbugs/6159132][fix] Differentiate the two paths via extra_acc_spec="tp_attn" when attention_dp=False
#13922
opened May 8, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][infra] Check license with both isPermissive and isProprietary flags
#13921
opened May 8, 2026 by
yuanjingx87
Collaborator
1 task
[#13909][fix] Reuse hidden_states buffer across CUDA graph captures in Eagle3
Community want to contribute
PRs initiated from Community
#13920
opened May 8, 2026 by
ml-inference
[TRTLLM-12339][feat] Support T5 encoder-decoder models in the PyTorch backend
#13919
opened May 8, 2026 by
cascade812
Collaborator
1 task done
[None][fix] Make SleepConfig picklable by replacing closure lambda in defaultdict
Community want to contribute
PRs initiated from Community
#13918
opened May 8, 2026 by
hhzhang16
1 task
[None][doc] Add guide for integrating custom kernels in PyTorch backend
#13917
opened May 8, 2026 by
chang-l
Collaborator
5 tasks done
[https://nvbugs/6157892] [fix] MistralCommonImageProcessor text-only path
#13916
opened May 8, 2026 by
evezhier
Collaborator
1 task
[https://nvbugs/6143599][fix] Re-apply proven fix from commit 295615d8bf (not present in HEAD): subtract 2× pr
#13915
opened May 8, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][infra] Drop jupyter-server from dockerfile.
#13914
opened May 8, 2026 by
tfogal
1 task done
[https://nvbugs/6104831][draft] PR 13713 rebased onto v1.3.0rc13 — v7
Community want to contribute
PRs initiated from Community
[TRTLLM-12527][feat] Parallel LTX-2 LoRA weight loading
#13911
opened May 8, 2026 by
yibinl-nvidia
Collaborator
1 task done
[None][fix] only configure gc thresholds once
#13910
opened May 8, 2026 by
ixlmar
Collaborator
1 task done
[None][refactor] MoEScheduler split + MegaMoE EPLB / multi-chunk / CI integration
#13908
opened May 8, 2026 by
xxi-nv
Collaborator
2 tasks done
[https://nvbugs/6157892][fix] Restore the pre-#12743 AutoProcessor.from_pretrained(...) assignment for `text
#13905
opened May 8, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][test] Add MLA chunked-prefill SM dispatch regression coverage
Community want to contribute
PRs initiated from Community
#13904
opened May 8, 2026 by
DhineshPonnarasan
[None][chore] Remove glm_moe_dsa tokenizer WAR after Transformers 5.x upgrade
#13901
opened May 8, 2026 by
longlee0622
Collaborator
1 task done