-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][feat] Autodeploy: Update the ssm to use slice
#8667
opened Oct 26, 2025 by
nvchenghaoz
Loading…
[https://nvbugs/5521253][fix] Enable Gemma3 12B & 27B on SM100
#8666
opened Oct 26, 2025 by
brb-nv
Loading…
1 task done
[None][doc] Clarify the perf best practice and supported hardware for gptoss
#8665
opened Oct 25, 2025 by
dongfengy
Loading…
1 task done
[https://nvbugs/5575913][fix] Use separate thresholds for 120b/20b gptoss
#8664
opened Oct 25, 2025 by
dongfengy
Loading…
1 task done
[None][feat] Autotuner can iterate through all tactics for test purposes
#8663
opened Oct 25, 2025 by
rosenrodt
Loading…
1 task done
[None] [feat] Add C++ dependency scanning system
#8662
opened Oct 25, 2025 by
venkywonka
•
Draft
1 task
[None][test] Enhance GPT-OSS CI with GPQA Diamond and additional Spec Decoding Test
#8661
opened Oct 24, 2025 by
dongfengy
Loading…
1 task done
[None][chore] Use a cached model path for Ray integration test
#8660
opened Oct 24, 2025 by
achartier
Loading…
1 task done
[None][infra] User/yuanjingx/choose repo when generate lock files
#8659
opened Oct 24, 2025 by
yuanjingx87
Loading…
1 task
[https://nvbugs/5606166][fix] AutoDeploy: use tuples for cudagraph shape lookup
#8658
opened Oct 24, 2025 by
lucaslie
Loading…
1 task done
[None][autodeploy] minor refactor to rmsnorm transforms
#8657
opened Oct 24, 2025 by
Fridah-nv
Loading…
1 task done
[TRTLLM-8825][feat] Support Pytest Perf Results uploading to Database
#8653
opened Oct 24, 2025 by
chenfeiz0326
Loading…
[None][feat] add flag for EPLB to force using GDRCopy
#8650
opened Oct 24, 2025 by
dongxuy04
Loading…
1 task done
[https://nvbugs/5607238][test] fix working dir in disagg worker test
#8648
opened Oct 24, 2025 by
zhengd-nv
Loading…
1 task done
[None][fix] Fix ModelConfig.from_pretrained get quant config file
#8647
opened Oct 24, 2025 by
yuantailing
Loading…
1 task done
[https://nvbugs/5556020][fix] cherry-pick fix test_disaggregated_serving.py::TestLlama3_1_8BInstruct::test_eagle3 dimension mismatch
#8644
opened Oct 24, 2025 by
sunnyqgg
Loading…
1 task done
[None][ci] move some time-consuming benchmark test cases to post merge
#8641
opened Oct 24, 2025 by
QiJune
Loading…
1 task done
[https://nvbugs/5552836][fix] Add flag
TLLM_SPAWN_EXTRA_MAIN_PROCESS to disable spawning main process
#8640
opened Oct 24, 2025 by
Superjomn
Loading…
1 task done
[#8389][fix] Update group attention matching to first map to custom torch attention
#8638
opened Oct 24, 2025 by
Fridah-nv
Loading…
1 task done
[None][fix] Change Ray submit() to use async RPC
#8636
opened Oct 24, 2025 by
hchings
Loading…
1 task
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.