Pull requests: NVIDIA/TensorRT-LLM
[None][chore] AutoDeploy: clean up accuracy test configs
#8134
opened Oct 3, 2025 by
lucaslie
1 task done
[None][feat] AutoDeploy: Nemotron-H accuracy test
#8133
opened Oct 3, 2025 by
lucaslie
1 task done
[https://nvbugs/5516666][fix] unwaive some Qwen3 CI tests
#8130
opened Oct 3, 2025 by
byshiue
[None][feat] Support ignored prompt length for penalties via new sampling config parameter
#8127
opened Oct 3, 2025 by
nvxuanyuc
[TRTLLM-6342][feat] Factory TP sharding of quantized models
AutoDeploy
<NV> AutoDeploy Backend
#8123
opened Oct 2, 2025 by
greg-kwasniewski1
1 task done
[TRTLLM-8413][chore] resolve sampling defaults in OpenAI API backend
#8121
opened Oct 2, 2025 by
ixlmar
1 task done
[None][test] Add accuracy test for Qwen3Next model
#8111
opened Oct 1, 2025 by
Funatiq
1 task
[TRTLLM-5966][feat] Helix: add full MLA support for Helix
#8104
opened Sep 30, 2025 by
MatthiasKohl
[doc] Add Qwen3 Next Guide to Core README
Community want to contribute
PRs initiated from Community
#8101
opened Sep 30, 2025 by
faradawn
1 task
[https://nvbugs/5521949][fix] Fix head_size handling in ModelConfig.get_bindings_model_config
#8100
opened Sep 30, 2025 by
amitz-nv
1 task
[https://nvbugs/5541494][fix] Fix missing sm100f/103a kernels and add tests
#8098
opened Sep 30, 2025 by
VALLIS-NERIA
1 task
[None][feat] reuse cudagraph memory pool in normal forward flow
#8095
opened Sep 30, 2025 by
HuiGao-NV
1 task
[None][fix] Avoid unnecessary concat in attn_output_gate case.
#8094
opened Sep 30, 2025 by
yuxianq
1 task done
[TRTLLM-8246][test] add multimodal kvcache+chunked_prefil cases in to QA test list
#8091
opened Sep 30, 2025 by
crazydemo
1 task done
[None][fix] Disable DeepGEMM for Qwen3 MoE Attention layers
#8087
opened Sep 30, 2025 by
achartier
1 task done
[None][feat] add RocketKV support (experimental)
#8086
opened Sep 30, 2025 by
lfr-0531
1 task done
[None][fix] Add Lock to protect mReqeustToSession
#8085
opened Sep 30, 2025 by
chuangz0
1 task done