[Test]Add accuracy test for multiple models(2) #4251
Conversation
Signed-off-by: MrZ20 <[email protected]>
Code Review
This pull request adds accuracy test configurations for four new models: ERNIE-4.5-21B-A3B-PT, MiniCPM3-4B, Mistral-7B-Instruct-v0.1, and Phi-4-mini-instruct. The changes include new YAML configuration files for each model and an update to the list of accuracy tests. The new configuration files are well-formed. I have one suggestion to improve the maintainability of the test list file by sorting it alphabetically, which will make it easier to manage in the future.
InternVL3_5-8B.yaml
ERNIE-4.5-21B-A3B-PT.yaml
MiniCPM3-4B.yaml
Mistral-7B-Instruct-v0.1.yaml
Phi-4-mini-instruct.yaml
For improved maintainability and to prevent potential duplicates, it's a good practice to keep this list of configuration files sorted alphabetically. While the existing file isn't sorted, applying this convention now would be beneficial for future updates.
Could you please sort the entire file? For your convenience, here is the alphabetically sorted list:
DeepSeek-V2-Lite.yaml
ERNIE-4.5-21B-A3B-PT.yaml
InternVL3_5-8B.yaml
Meta-Llama-3.1-8B-Instruct.yaml
MiniCPM3-4B.yaml
Mistral-7B-Instruct-v0.1.yaml
Phi-4-mini-instruct.yaml
Qwen2.5-Omni-7B.yaml
Qwen2.5-VL-7B-Instruct.yaml
Qwen2-7B.yaml
Qwen2-Audio-7B-Instruct.yaml
Qwen2-VL-7B-Instruct.yaml
Qwen3-30B-A3B.yaml
Qwen3-8B.yaml
Qwen3-VL-30B-A3B-Instruct.yaml
Qwen3-VL-8B-Instruct.yaml
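The reviewer's suggestion to keep the file sorted can also be automated. A minimal sketch using coreutils `sort` (the file name `accuracy_tests.txt` is a placeholder, not the repository's actual file name):

```shell
# Build a small sample list file (hypothetical name; the real file in the repo differs).
printf 'MiniCPM3-4B.yaml\nERNIE-4.5-21B-A3B-PT.yaml\n' > accuracy_tests.txt

# Sort the list in place so new entries land in order.
sort accuracy_tests.txt -o accuracy_tests.txt

# Or, as a CI lint step, verify order without modifying the file:
sort -c accuracy_tests.txt && echo "list is sorted"
```

Running the check with `sort -c` in CI would catch unsorted additions (and, as a side effect, exact duplicates sitting next to each other) before merge.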
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
- name: "exact_match,strict-match"
  value: 0.35
- name: "exact_match,flexible-extract"
  value: 0.38
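For context, threshold entries like these are typically compared against the measured metric values after an evaluation run. A minimal sketch of that comparison (the function name and data shapes below are illustrative, not the project's actual test code):

```python
def meets_thresholds(measured: dict, thresholds: list) -> bool:
    """Return True if every measured metric is at or above its threshold."""
    return all(measured[t["name"]] >= t["value"] for t in thresholds)

# Threshold entries mirroring the YAML fragment above.
thresholds = [
    {"name": "exact_match,strict-match", "value": 0.35},
    {"name": "exact_match,flexible-extract", "value": 0.38},
]

# Hypothetical measured results from an evaluation run.
measured = {"exact_match,strict-match": 0.36, "exact_match,flexible-extract": 0.40}

print(meets_thresholds(measured, thresholds))  # True for these sample numbers
```

Under this reading, the reviewer's question is whether thresholds as low as 0.35/0.38 were chosen because the model genuinely scores that low, which would indicate an accuracy problem worth flagging.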
Is there an accuracy issue with this model?
What this PR does / why we need it?
Add accuracy tests for multiple models: ERNIE-4.5-21B-A3B-PT, MiniCPM3-4B, Mistral-7B-Instruct-v0.1, and Phi-4-mini-instruct.
Does this PR introduce any user-facing change?
How was this patch tested?