[ci] refactor longtext benchmark #4087

zhulinJulia24 · 2025-10-30T10:12:20Z

Add test configuration for H cluster
Add benchmark for longtext processing
Remove unused workspace test cases
Add workflow start time and final status tracking
Adjust volume settings for the new machine

lvhan028 · 2025-11-02T06:23:11Z

autotest/config-ascend.yaml

+turbomind_chat_model:
+    - Qwen/Qwen3-30B-A3B
+    - Qwen/Qwen3-235B-A22B
+    - Qwen/Qwen3-32B
+    - Qwen/Qwen3-8B
+    - Qwen/Qwen3-0.6B


turbomind doesn't support ascend

lvhan028 · 2025-11-02T06:23:31Z

autotest/config-ascend.yaml

+turbomind_vl_model:
+    - internlm/Intern-S1
+    - internlm/Intern-S1-mini
+    - OpenGVLab/InternVL3_5-2B
+    - OpenGVLab/InternVL3_5-8B
+    - OpenGVLab/InternVL3_5-38B
+


turbomind doesn't support ascend

lvhan028 · 2025-11-02T06:24:05Z

autotest/config-ascend.yaml

+turbomind_base_model:
+    - Qwen/Qwen3-0.6B
+


turbomind doesn't support ascend

lvhan028 · 2025-11-02T06:24:18Z

autotest/config-ascend.yaml

+turbomind_quatization:
+    no_awq:
+        - Qwen/Qwen3-30B-A3B
+        - Qwen/Qwen3-235B-A22B
+        - Qwen/Qwen3-32B
+        - Qwen/Qwen3-8B
+        - Qwen/Qwen3-0.6B
+        - internlm/Intern-S1
+        - internlm/Intern-S1-mini
+        - OpenGVLab/InternVL3_5-2B
+        - OpenGVLab/InternVL3_5-8B
+        - OpenGVLab/InternVL3_5-38B
+
+    gptq:
+        - Empty
+    no_kvint4:
+        - Qwen/Qwen3-30B-A3B
+        - Qwen/Qwen3-235B-A22B
+        - Qwen/Qwen3-32B
+        - Qwen/Qwen3-8B
+        - Qwen/Qwen3-0.6B
+        - internlm/Intern-S1
+        - internlm/Intern-S1-mini
+        - OpenGVLab/InternVL3_5-2B
+        - OpenGVLab/InternVL3_5-8B
+        - OpenGVLab/InternVL3_5-38B
+    no_kvint8:
+        - Qwen/Qwen3-30B-A3B
+        - Qwen/Qwen3-235B-A22B
+        - Qwen/Qwen3-32B
+        - Qwen/Qwen3-8B
+        - Qwen/Qwen3-0.6B
+        - internlm/Intern-S1
+        - internlm/Intern-S1-mini
+        - OpenGVLab/InternVL3_5-2B
+        - OpenGVLab/InternVL3_5-8B
+        - OpenGVLab/InternVL3_5-38B
+


turbomind doesn't support ascend

lvhan028 · 2025-11-02T06:25:20Z

autotest/config-h.yaml

+turbomind_base_model:
+    - internlm/Intern-S1-mini
+    - Qwen/Qwen3-4B-FP8
+    - openai/gpt-oss-20b
+


Those models are not base model

lvhan028 · 2025-11-02T06:27:42Z

autotest/utils/run_client_chat.py

-def command_line_test(config,
-                      case,
-                      case_info,
-                      model_case,
-                      type,
-                      extra: str = None,
-                      cuda_prefix: str = None,
-                      worker_id: str = ''):
-    dst_path = config.get('dst_path')
-
-    cmd = get_command_with_extra('lmdeploy chat ' + dst_path + '/workspace_' + model_case,
-                                 config,
-                                 model_case,
-                                 cuda_prefix=cuda_prefix)
-    if type == 'turbomind':
-        if ('w4' in model_case or ('4bits' in model_case or 'awq' in model_case.lower())):
-            cmd += ' --model-format awq'
-        elif 'gptq' in model_case.lower():
-            cmd += ' --model-format gptq'
-    if case == 'base_testcase':
-        cmd += ' --chat-template ' + TEMPLATE
-
-    # Add device option if specified in environment
-    device = os.environ.get('DEVICE', '')
-    if device:
-        cmd += f' --device {device} '
-        if device == 'ascend':
-            cmd += '--eager-mode '
-
-    return command_test(config, [cmd], model_case, case, case_info, type == 'turbomind', worker_id=worker_id)
-
-


No longer test the chat CLI?

…deploy into update_failcase

zhulinJulia24 and others added 27 commits October 22, 2025 14:46

update

bb65d89

update

60c9455

update

5c1a856

Update Docker tag from cuda12.4 to cuda12.8

bf52f0c

update

a777dd5

merge main

ba4a2c0

update

084c554

update

88acab4

update

d699c3a

update

537292d

Merge branch 'InternLM:main' into update_failcase

6738ad6

add longtext benchmark into workflow

488be8a

update

064960c

update

b1c049e

update

06bfd12

update

013d3ce

update

32bd0e7

fix

a1678c4

update

f275fae

update

4dbc839

update

a8e69e7

update

aaeda1f

update

2848f0c

update

5bf0d03

add ascend config

13e7a69

update

59a9830

update

ab4da9d

lvhan028 reviewed Nov 2, 2025

View reviewed changes

zhulinJulia24 and others added 3 commits November 3, 2025 10:12

Merge branch 'InternLM:main' into update_failcase

4eec89d

update

e056eba

Merge branch 'update_failcase' of https://github.com/zhulinJulia24/lm…

06b4a5e

…deploy into update_failcase

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ci] refactor longtext benchmark #4087

[ci] refactor longtext benchmark #4087

Uh oh!

zhulinJulia24 commented Oct 30, 2025 •

edited by lvhan028

Loading

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

lvhan028 Nov 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		turbomind_base_model:
		- Qwen/Qwen3-0.6B

[ci] refactor longtext benchmark #4087

Are you sure you want to change the base?

[ci] refactor longtext benchmark #4087

Uh oh!

Conversation

zhulinJulia24 commented Oct 30, 2025 • edited by lvhan028 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

lvhan028 Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zhulinJulia24 commented Oct 30, 2025 •

edited by lvhan028

Loading