Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][infra] Waive 1 failed cases for main in pre-merge 37652
#14045 opened May 12, 2026 by ZhanruiSunCh Collaborator Loading…
[None][feat] LTX-2 Ulysses cross-attention for v2a with audio padding
#14044 opened May 12, 2026 by luyiyun1021 Collaborator Draft
5 of 6 tasks
[None][perf] Skip transceiver tp_allgather when no sessions ever opened deepseek-v4
#14042 opened May 12, 2026 by Shixiaowei02 Collaborator Loading…
1 task done
[None][fix] fix warm up number in disagg benchmark
#14041 opened May 12, 2026 by chuangz0 Collaborator Loading…
1 task done
[#8542][feat] AutoDeploy: add Llama-3.1-8B FP8 perf-sanity test on H100
#14039 opened May 12, 2026 by MrGeva Collaborator Loading…
1 task done
[None][test] Split verl tests into 19 fine-grained per-case wrappers
#14037 opened May 12, 2026 by Superjomn Collaborator Loading…
3 of 4 tasks
[https://nvbugs/6162128] Remove nano v3 E2E test
#14036 opened May 12, 2026 by 2ez4bz Collaborator Loading…
1 task done
[TRTLLM-12631][infra] Split some long stages
#14035 opened May 12, 2026 by EmmaQiaoCh Collaborator Loading…
1 task done
[TRTLLM-11950][perf] Audio feature extractor optimizations
#14031 opened May 12, 2026 by 2ez4bz Collaborator Loading…
1 task done
[None][fix] bypass tokenizer in kvcache router when there is only one server
#14030 opened May 12, 2026 by reasonsolo Collaborator Loading…
1 task done
[None][perf] DSV4 multistream improvement for attention
#14029 opened May 12, 2026 by liji-nv Collaborator Loading…
1 task done
[None][feat] Support NVFP4 dsv4
#14026 opened May 12, 2026 by Tracin Collaborator Loading…
1 task done
[None][infra] Source code and container vulnerability fix
#14025 opened May 12, 2026 by yuanjingx87 Collaborator Loading…
1 task
[TRTLLM-12627][ci] Narrow tensorrt_llm/serve/ MGPU trigger to disagg-only files
#14022 opened May 12, 2026 by QiJune Collaborator Loading…
1 task done
[TRTLLM-12527][feat] Parallelize multi-shard visual-gen checkpoint loading
#14021 opened May 12, 2026 by yibinl-nvidia Collaborator Loading…
1 task done
[None][fix] PyExecutor Hang in Disagg TP Prefill
#14020 opened May 12, 2026 by jthomson04 Collaborator Loading…
[TRTLLM-11410][feat] MoT World Model Support
#14012 opened May 12, 2026 by NVShreyas Collaborator Loading…
1 task done
[None][fix] Unwaive standalone llm-c package generation test
#14011 opened May 11, 2026 by bmarimuthu-nv Collaborator Loading…
1 task done
[TRTLLM-12533][refactor] Move Media IO modality loading into MediaIO Interfaces
#14010 opened May 11, 2026 by aswinvisva Collaborator Loading…
1 task done
ProTip! no:milestone will show everything without a milestone.