-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][infra] Waive 1 failed cases for main in pre-merge 37652
#14045
opened May 12, 2026 by
ZhanruiSunCh
Collaborator
Loading…
[None][feat] LTX-2 Ulysses cross-attention for v2a with audio padding
#14044
opened May 12, 2026 by
luyiyun1021
Collaborator
•
Draft
5 of 6 tasks
[None][perf] Skip transceiver tp_allgather when no sessions ever opened
deepseek-v4
#14042
opened May 12, 2026 by
Shixiaowei02
Collaborator
Loading…
1 task done
[None][fix] fix warm up number in disagg benchmark
#14041
opened May 12, 2026 by
chuangz0
Collaborator
Loading…
1 task done
[#8542][feat] AutoDeploy: add Llama-3.1-8B FP8 perf-sanity test on H100
#14039
opened May 12, 2026 by
MrGeva
Collaborator
Loading…
1 task done
[https://nvbugs/6160248][fix] AutoDeploy: fixed broken pattern matching of fuse_rope_into_trtllm_attention transform
#14038
opened May 12, 2026 by
MrGeva
Collaborator
Loading…
1 task done
[None][test] Split verl tests into 19 fine-grained per-case wrappers
#14037
opened May 12, 2026 by
Superjomn
Collaborator
Loading…
3 of 4 tasks
[https://nvbugs/6162128] Remove nano v3 E2E test
#14036
opened May 12, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[TRTLLM-12631][infra] Split some long stages
#14035
opened May 12, 2026 by
EmmaQiaoCh
Collaborator
Loading…
1 task done
[https://nvbugs/6163033][fix] Guard
q_a_proj.weight dict access behind nvfp4_fused_a; update test to `chec
#14033
opened May 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-11950][perf] Audio feature extractor optimizations
#14031
opened May 12, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[None][fix] bypass tokenizer in kvcache router when there is only one server
#14030
opened May 12, 2026 by
reasonsolo
Collaborator
Loading…
1 task done
[None][perf] DSV4 multistream improvement for attention
#14029
opened May 12, 2026 by
liji-nv
Collaborator
Loading…
1 task done
[None][fix] Raise server_waiting_timeout to 3600s for DSv4 disagg tests
deepseek-v4
#14028
opened May 12, 2026 by
Shixiaowei02
Collaborator
Loading…
1 task done
[None][feat] Support NVFP4 dsv4
#14026
opened May 12, 2026 by
Tracin
Collaborator
Loading…
1 task done
[None][infra] Source code and container vulnerability fix
#14025
opened May 12, 2026 by
yuanjingx87
Collaborator
Loading…
1 task
[https://nvbugs/6163030][fix] Replace
moe_ep_size * max_tokens with max(moe_ep_size, dp_size) * max_tokens
#14023
opened May 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-12627][ci] Narrow tensorrt_llm/serve/ MGPU trigger to disagg-only files
#14022
opened May 12, 2026 by
QiJune
Collaborator
Loading…
1 task done
[TRTLLM-12527][feat] Parallelize multi-shard visual-gen checkpoint loading
#14021
opened May 12, 2026 by
yibinl-nvidia
Collaborator
Loading…
1 task done
[None][fix] PyExecutor Hang in Disagg TP Prefill
#14020
opened May 12, 2026 by
jthomson04
Collaborator
Loading…
[TRTLLM-11410][feat] MoT World Model Support
#14012
opened May 12, 2026 by
NVShreyas
Collaborator
Loading…
1 task done
[None][fix] Unwaive standalone llm-c package generation test
#14011
opened May 11, 2026 by
bmarimuthu-nv
Collaborator
Loading…
1 task done
[TRTLLM-12533][refactor] Move Media IO modality loading into MediaIO Interfaces
#14010
opened May 11, 2026 by
aswinvisva
Collaborator
Loading…
1 task done
Previous Next
ProTip!
no:milestone will show everything without a milestone.