Skip to content

Conversation

@PerryZhang01
Copy link
Contributor

@PerryZhang01 PerryZhang01 commented Dec 24, 2025

Motivation

This PR is for unified benchmark test. We provided unified run_server.sh and run_bench.sh scripts for deepseekv3.2, gptoss and llama to test benchmark, the results will be printed on the screen and saved in an excel file. Users only need to modified the configs such as MODEL_PATH in run_bench.sh.
screen:
image
excel :
image

Note: we only test deepseek3.2. Gptoss and llama are not test, anyone can modifiy their params in run_server.sh and run_bench.sh after testing.

@valarLip
Copy link
Collaborator

a little bit toooo much if we have scripts for each model..

@PerryZhang01
Copy link
Contributor Author

every model has different config on server and client, scripts or recipes are necessary for them. users can directly reproduce the public performance using the scripts without worrying about special parameters and environment variables settings. everyone using the same scripts can reduce the performance gap, and can also used for CI.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants