-
Couldn't load subscription status.
- Fork 183
Pull requests: NVIDIA/TensorRT-Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[OMNIML-2917] export layer config using actual prefix instead of hard…
#470
opened Oct 28, 2025 by
shengliangxu
•
Draft
Add convert_llama3_config_to_decilm_config + unit test
#465
opened Oct 27, 2025 by
danielkorzekwa
Loading…
E2E test for the experimental compress algorithm based on https://arxiv.org/abs/2411.19146
#464
opened Oct 27, 2025 by
danielkorzekwa
Loading…
[5590225] Fixed regression introduced by PR #364 (FP64-to-FP32 conversion)
#462
opened Oct 24, 2025 by
gcunhase
Loading…
Added exception for warning caused while creating int4 tensor
#461
opened Oct 24, 2025 by
hthadicherla
Loading…
Add functional test cases for published checkpoints on HF
#455
opened Oct 21, 2025 by
noeyy-mino
Loading…
Preserve original rope scaling type in export due to transformers library AutoConfig issue
#452
opened Oct 17, 2025 by
Edwardf0t1
Loading…
[1/2] Registry interface for custom quantization functional backend
#449
opened Oct 17, 2025 by
realAsma
Loading…
[5271050, 5274346][ONNX] Add support for Conv-Act-Pool fusion
#448
opened Oct 17, 2025 by
gcunhase
Loading…
[4975376][5541172]perplexity and kl-divergence benchmark metrics
#411
opened Oct 8, 2025 by
ynankani
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.