Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[https://nvbugs/5521253][fix] Enable Gemma3 12B & 27B on SM100
#8666 opened Oct 26, 2025 by brb-nv
1 task done
[None][chore] Use a cached model path for Ray integration test
#8660 opened Oct 24, 2025 by achartier
1 task done
[None][autodeploy] minor refactor to rmsnorm transforms
#8657 opened Oct 24, 2025 by Fridah-nv
1 task done
[None][feat] add flag for EPLB to force using GDRCopy
#8650 opened Oct 24, 2025 by dongxuy04
1 task done
[None][fix] fix EPLB init hang
#8649 opened Oct 24, 2025 by dongxuy04
1 task done
[https://nvbugs/5607238][test] fix working dir in disagg worker test
#8648 opened Oct 24, 2025 by zhengd-nv
1 task done
[None][fix] Fix ModelConfig.from_pretrained get quant config file
#8647 opened Oct 24, 2025 by yuantailing
1 task done
[None][ci] move some time-consuming benchmark test cases to post merge
#8641 opened Oct 24, 2025 by QiJune
1 task done
[None] [chore] Update to cutlass 4.3
#8637 opened Oct 24, 2025 by kaiyux Draft
1 task
[None][fix] Change Ray submit() to use async RPC
#8636 opened Oct 24, 2025 by hchings
1 task
[DO NOT MERGE] Trigger CI
#8635 opened Oct 23, 2025 by jthomson04 Draft
1 task