Commit db3c373

[https://nvbugs/5572320][fix] Ported test_ad_trtllm_bench.py from main (#8671)
Signed-off-by: Eran Geva <[email protected]>
1 parent e04354b commit db3c373

5 files changed: 44 additions and 782 deletions

tests/integration/test_lists/test-db/l0_a30.yml

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ l0_a30:
   - unittest/_torch/modeling -k "modeling_qwen"
   - unittest/_torch/modeling -k "modeling_qwen_moe"
   - unittest/_torch/modeling -k "modeling_out_of_tree"
-  - unittest/_torch/auto_deploy/unit/singlegpu -k "not test_trtllm_bench_backend_comparison"
+  - unittest/_torch/auto_deploy/unit/singlegpu
   - unittest/_torch/sampler/test_beam_search.py
   - test_e2e.py::test_openai_completions_with_logit_bias[torch_sampler]
   - test_e2e.py::test_openai_chat_with_logit_bias[torch_sampler]

tests/integration/test_lists/test-db/l0_b200.yml

Lines changed: 1 addition & 1 deletion
@@ -74,7 +74,7 @@ l0_b200:
   - unittest/_torch/modeling -k "modeling_mixtral"
   - unittest/_torch/modeling -k "modeling_gpt_oss"
   - unittest/_torch/modeling/test_modeling_exaone4.py::TestEXAONE4::test_llm_load_1_FP8
-  - unittest/_torch/auto_deploy/unit/singlegpu -k "not test_trtllm_bench_backend_comparison"
+  - unittest/_torch/auto_deploy/unit/singlegpu
 - condition:
     ranges:
       system_gpu_count:

tests/integration/test_lists/test-db/l0_h100.yml

Lines changed: 0 additions & 1 deletion
@@ -31,7 +31,6 @@ l0_h100:
   - unittest/_torch/modeling -k "modeling_nemotron"
   - unittest/_torch/modeling -k "modeling_gemma3"
   - unittest/_torch/modeling -k "modeling_gpt_oss"
-  - unittest/_torch/auto_deploy/unit/singlegpu/test_ad_trtllm_bench.py::test_trtllm_bench_backend_comparison
   - unittest/disaggregated/test_disagg_utils.py
   - unittest/disaggregated/test_router.py
   - unittest/disaggregated/test_remoteDictionary.py
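
The test-list changes above drop a pytest `-k "not test_trtllm_bench_backend_comparison"` filter from the l0_a30 and l0_b200 lists. pytest's `-k` option selects tests by case-insensitive substring match against test names, and a leading `not` inverts the match. The sketch below models only that single-term case to show what removing the filter changes; the `selected` helper and the second test id are hypothetical, and real pytest supports full boolean expressions.

```python
from typing import Optional

# Simplified stand-in for pytest's -k matching, for illustration only.
# It handles a single term with an optional leading "not"; real pytest
# also supports "and", "or", and parentheses.
def selected(test_id: str, k_expr: Optional[str]) -> bool:
    """Return True if test_id is kept under this simplified -k model."""
    if k_expr is None:
        return True  # no -k given: every collected test runs
    negate = k_expr.startswith("not ")
    needle = k_expr[4:] if negate else k_expr
    hit = needle.lower() in test_id.lower()
    return (not hit) if negate else hit

TEST_IDS = [
    "test_ad_trtllm_bench.py::test_trtllm_bench_backend_comparison",
    "test_example.py::test_something_else",  # hypothetical test id
]

# Old list entry: the -k filter deselects the named test.
before = [t for t in TEST_IDS if selected(t, "not test_trtllm_bench_backend_comparison")]
# New list entry: no filter, so the whole directory is selected.
after = [t for t in TEST_IDS if selected(t, None)]
```

Under this model, the old entry skips any test whose id contains `test_trtllm_bench_backend_comparison`, while the new entry runs everything collected under `unittest/_torch/auto_deploy/unit/singlegpu`.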

tests/unittest/_torch/auto_deploy/_utils_test/_model_test_utils.py

Lines changed: 6 additions & 0 deletions
@@ -446,6 +446,12 @@ def apply_rotary_pos_emb_ds(q, k, cos, sin, position_ids, unsqueeze_dim=1):
             "vision_config": {"num_hidden_layers": 2},
         },
     },
+    "TinyLlama/TinyLlama-1.1B-Chat-v1.0": {
+        "llm_models_subdir": "llama-models-v2/TinyLlama-1.1B-Chat-v1.0",
+        "model_kwargs": {
+            "num_hidden_layers": 2,
+        },
+    },
 }
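
The new entry maps a Hugging Face hub id to a local model subdirectory plus keyword overrides. The diff does not show how the test utilities read this entry, so the following is only a sketch of one plausible lookup: the dict name `SMALL_MODEL_CONFIGS`, the helper `resolve_small_model`, and the `/models` root are all assumptions made for illustration. The `num_hidden_layers: 2` override presumably shrinks the model to two layers so the test runs quickly.

```python
import os

# Entry copied from the diff above. The dict name is an assumption;
# the diff shows only the entry, not the variable that holds it.
SMALL_MODEL_CONFIGS = {
    "TinyLlama/TinyLlama-1.1B-Chat-v1.0": {
        "llm_models_subdir": "llama-models-v2/TinyLlama-1.1B-Chat-v1.0",
        "model_kwargs": {
            "num_hidden_layers": 2,
        },
    },
}

def resolve_small_model(hub_id: str, models_root: str):
    """Hypothetical helper: return (local_path, model_kwargs) for a hub id."""
    entry = SMALL_MODEL_CONFIGS[hub_id]
    local_path = os.path.join(models_root, entry["llm_models_subdir"])
    # Copy the kwargs so callers cannot mutate the shared registry entry.
    return local_path, dict(entry["model_kwargs"])

# "/models" is a placeholder root, not a path taken from the commit.
path, kwargs = resolve_small_model("TinyLlama/TinyLlama-1.1B-Chat-v1.0", "/models")
```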
