Pull requests: NVIDIA/TensorRT-LLM
[None][chore] Upgrade starlette and FastAPI
#9319
opened Nov 19, 2025 by
tburt-nv
1 task done
[#9316][feat] AutoDeploy: Add the accuracy test for Nemotron MOE models
#9317
opened Nov 19, 2025 by
nvchenghaoz
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce
#9314
opened Nov 19, 2025 by
nv-lschneider
1 task done
[TRTINFRA-7326][infra] - Consume SlurmCluster sshPort for clusters with custom SSH port
#9313
opened Nov 19, 2025 by
mlefeb01
1 task done
[None][test] Enhance Eagle Tests for GPTOSS
#9312
opened Nov 19, 2025 by
dongfengy
1 task done
[https://nvbugs/5601682][fix] Fix cacheTransceiver hang
#9311
opened Nov 19, 2025 by
Tabrizian
1 task
feat: Replace GenAI-Perf with AIPerf
Community want to contribute
#9310
opened Nov 19, 2025 by
lkomali
1 task
fix mtp.py typo
Community want to contribute
#9307
opened Nov 19, 2025 by
attack204
1 task
[TRTLLM-9295][fix] use greedy decoding in test_openai_compatible_json_schema
#9305
opened Nov 19, 2025 by
ixlmar
1 task done
[None][infra] Enable single-gpu CI on spark
#9304
opened Nov 19, 2025 by
EmmaQiaoCh
Draft
1 task done
[TRTLLM-9160][doc] add doc to llm_runtime.py
#9303
opened Nov 19, 2025 by
Superjomn
1 task done
[None][feat] Add processed logprobs
#9302
opened Nov 19, 2025 by
dominicshanshan
Draft
1 task done
[#9198][feat] Refactor dist ops in AutoDeploy
#9301
opened Nov 19, 2025 by
MrGeva
1 task done
[https://nvbugs/5667687][fix] Set correct lm_head_tp_size_upper_bound
#9300
opened Nov 19, 2025 by
lancelly
[None][feat] Support custom chat template for tool calling
#9297
opened Nov 19, 2025 by
LinPoly
1 task done
[https://nvbugs/5629833][fix] Don't fill tensors
#9296
opened Nov 19, 2025 by
HuiGao-NV
1 task
[https://nvbugs/5625990][fix] Fix block copy from GPU to GPU for partial reuse in the KV cache manager
KV-Cache Management
[None][fix] Replace PYTORCH_CUDA_ALLOC_CONF with PYTORCH_ALLOC_CONF to fix deprecation warning
#9294
opened Nov 19, 2025 by
jiaganc
1 task
[None][infra] Modify SBSA build thread from 4 to 8
#9293
opened Nov 19, 2025 by
ZhanruiSunCh
1 task done
[TRTLLM-9086][doc] Clean up TODOs in documentation
#9292
opened Nov 19, 2025 by
QiJune
1 task done
[None][infra] Add fallback when get wheel from build stage is fail
#9290
opened Nov 19, 2025 by
ZhanruiSunCh
1 task
[TRTLLM-9370][feat] Integration of CuteDSL NVFP4 grouped GEMM (Part 2: SwiGLU Fusion and Finalize Fusion)
#9288
opened Nov 19, 2025 by
syuoni
1 task done