-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[https://nvbugs/5948878][fix] fix lost requests
#12348
opened Mar 19, 2026 by
bo-nv
Loading…
1 task done
[None][fix] Fix B200 Aggregated CI Perf Test MPI Issue
#12347
opened Mar 19, 2026 by
chenfeiz0326
Loading…
1 task done
[https://nvbugs/5725811][test] Remove outdated llama-v4 and ministral-8b models out of QA scope
#12344
opened Mar 19, 2026 by
yufeiwu-nv
Loading…
1 task done
[None][chore] Organize the upper-layer transceiver logic
#12343
opened Mar 19, 2026 by
Shixiaowei02
•
Draft
1 task done
[TRTLLM-9523][chore] PyTransceiver code consolidation
#12342
opened Mar 19, 2026 by
Shixiaowei02
•
Draft
1 task done
[TRTLLM-11508][refactor] decouple MTP num_nextn_predict_layers from max_draft_len
#12341
opened Mar 19, 2026 by
zhaoyangwang-nvidia
•
Draft
1 task done
[None][docs] Update supported models matrix with AD-onboarded architectures
#12340
opened Mar 19, 2026 by
bmarimuthu-nv
Loading…
2 tasks
[TRTLLM-10407][perf] Add cute dsl single pass multi cta cluster topk
#12339
opened Mar 19, 2026 by
limin2021
Loading…
1 task
[None][doc] fix outdated code references in tech blogs 2, 3, 4, 8, 9, 11
#12338
opened Mar 19, 2026 by
schetlur-nv
Loading…
1 task
[None][fix] fix disagg kvcache router for chat API; add disagg benchmark for ai_perf
#12337
opened Mar 19, 2026 by
reasonsolo
Loading…
1 task done
[None][fix] Relax W8A16 MoE test tolerance for DTP mode
#12335
opened Mar 19, 2026 by
xxi-nv
Loading…
2 tasks done
[None][fix] Properly raise errors from multimodal loading
#12331
opened Mar 18, 2026 by
2ez4bz
Loading…
1 task done
[#11992][fix] Handle GUIDE_TYPE_STRUCTURAL_TAG in gRPC request manager
Community want to contribute
PRs initiated from Community
#12330
opened Mar 18, 2026 by
CatherineSue
Loading…
1 task done
refactor(tests): modernise kvCacheManagerTest readability
#12329
opened Mar 18, 2026 by
thorjohnsen
Loading…
4 tasks
[https://nvbugs/5800591][chore] Unwaive a deepseek MTP test
#12327
opened Mar 18, 2026 by
mikeiovine
Loading…
1 task done
[#12332][feat] AutoDeploy: SuperV3 MTP Support
#12326
opened Mar 18, 2026 by
govind-ramnarayan
Loading…
1 task done
Fix pinBlocks/unpinBlocksById to correctly handle multiple window sizes
#12325
opened Mar 18, 2026 by
thorjohnsen
Loading…
4 tasks
[None][perf] Kernel fusions in _gather_k_cache_for_chunk of Indexer in DSA
#12322
opened Mar 18, 2026 by
hyukn
Loading…
1 task done
[None][feat] Support update weight for nvfp4
#12320
opened Mar 18, 2026 by
shuyixiong
•
Draft
1 task
[batch_manager] Remove redundant mManagedSequences from BlockManager
#12319
opened Mar 18, 2026 by
thorjohnsen
Loading…
3 tasks
[https://nvbugs/5808603][fix] Add bias support to WeightOnlyQuantLinearMethod
#12317
opened Mar 18, 2026 by
stnie
Loading…
1 task done
[https://nvbugs/5949524][fix] Fix hang issue on DGX-Spark multinode
#12316
opened Mar 18, 2026 by
JennyLiu-nv
Loading…
1 task done
[None][feat] KV cache-aware ADP router for prefix-affinity request routing
#12315
opened Mar 18, 2026 by
lancelly
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.