feat(bench): add --accuracy mode that prints generated text by RemiliaForever · Pull Request #1092 · qualcomm/nexa-sdk

RemiliaForever (RemiliaForever) · 2026-06-25T07:15:18Z

What

Adds an --accuracy mode to geniex-bench for eyeballing model output quality rather than measuring speed.

- Pins a single measured run (--warmup 0 / --repetitions 1), overriding --warmup / -r.
- Prints the generated text to stdout, with a [gen ] prefix on every line so multi-line output stays greppable and visually attributed.
- Works for both the LLM and VLM run loops; applies in single-cell and matrix mode.
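
The override described in the first point can be sketched roughly as follows. This is a minimal illustration, not the code from benchmark.c: the `bench_opts` struct, its field names, and `apply_accuracy_mode` are hypothetical. The only behavior taken from the PR is that accuracy mode forces warmup to 0 and repetitions to 1.

```c
#include <stdbool.h>

/* Hypothetical option struct; names are illustrative, not from benchmark.c. */
struct bench_opts {
    int  warmup;       /* number of unmeasured warmup runs */
    int  repetitions;  /* number of measured runs */
    bool accuracy;     /* true when --accuracy was passed */
};

/* Assumed to be called once after all flags are parsed, so the override
 * wins no matter where --warmup / -r appear on the command line. */
static void apply_accuracy_mode(struct bench_opts *o)
{
    if (!o->accuracy)
        return;              /* normal benchmarking: keep user values */
    o->warmup = 0;           /* no warmup runs */
    o->repetitions = 1;      /* exactly one measured run */
}
```

Applying the override after parsing, rather than inside the flag handler, is one way to keep the result independent of flag order; whether the PR does it this way is an assumption.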

Pair with --prompt-file for a real prompt — the default random-ids prefill yields meaningless text, which the --help text and README now call out.

Notes

SDK logs go to stderr while the bench output goes to stdout, so 2>/dev/null gives a clean view of the generated text.

Test

Built Release locally (cmake --build build) and ran against a cached Qwen3-0.6B-Q4_0.gguf:

- --accuracy --prompt-file → single run, full text printed line-by-line with [gen ] prefix.
- --accuracy -p 8 (default random-ids) → single run, runs without crashing.

Copilot

Pull request overview

Adds an --accuracy mode to geniex-bench to support quick qualitative inspection of model output (generated text printed to stdout), rather than benchmarking throughput/latency across multiple runs.

Changes:

- Adds a new --accuracy flag that forces --warmup 0 and --repetitions 1.
- Prints the generated text to stdout with a [gen ] prefix on each line for both LLM and VLM generation loops.
- Updates the benchmark README and --help text to document accuracy mode and recommend pairing with --prompt-file.
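
The per-line prefixing could look something like the sketch below. `print_prefixed` is a hypothetical helper written for illustration, not the function added in this PR. The prefix is passed in by the caller because the exact string used by geniex-bench (including any trailing space) is not shown here.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not the PR's actual code): writes `text` to `out`,
 * starting every line with `prefix`. Returns the number of lines written. */
static int print_prefixed(FILE *out, const char *prefix, const char *text)
{
    int lines = 0;
    const char *p = text;

    while (*p != '\0') {
        size_t len = strcspn(p, "\n");   /* length of the current line */
        fprintf(out, "%s%.*s\n", prefix, (int)len, p);
        lines++;
        p += len;
        if (*p == '\n')
            p++;                         /* step over the newline */
    }
    return lines;
}
```

For example, `print_prefixed(stdout, "[gen ] ", "hello\nworld")` writes two lines, each beginning with the prefix, so the generated text can be filtered out of mixed output with a line-based tool such as grep.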

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| sdk/benchmark/README.md | Documents the new `--accuracy` mode and provides an example invocation and guidance about `--prompt-file`. |
| sdk/benchmark/benchmark.c | Implements `--accuracy` flag parsing/behavior and adds generated-text printing in both LLM and VLM run loops. |

Single-run mode for eyeballing output quality rather than speed: pins --warmup 0 / --repetitions 1 and prints the generated text to stdout with a [gen ] prefix on every line. Pair with --prompt-file for a real prompt, since the default random-ids prefill yields meaningless text.

Copilot AI review requested due to automatic review settings June 25, 2026 07:15

Copilot started reviewing on behalf of RemiliaForever (RemiliaForever) June 25, 2026 07:15

Copilot AI reviewed Jun 25, 2026

RemiliaForever (RemiliaForever) force-pushed the chore/migrate-repo-url branch from 4ed3a01 to c495fa9 June 25, 2026 17:02

RemiliaForever (RemiliaForever) force-pushed the chore/geniex-bench branch from 12470ce to 3ab86c7 June 25, 2026 17:02

RemiliaForever (RemiliaForever) force-pushed the chore/migrate-repo-url branch from c495fa9 to 81991ca June 26, 2026 02:46

RemiliaForever (RemiliaForever) requested review from David Qian (Davidqian123), AlexCHEN (alexchen4ai), Mengsheng Wu (mengshengwu), Paul Zhu (vinovo), Zack Li (zhiyuan8) and Perry Cheng (zhycheng614) as code owners June 26, 2026 02:46

RemiliaForever (RemiliaForever) force-pushed the chore/geniex-bench branch from 3ab86c7 to 589abb1 June 26, 2026 02:46

RemiliaForever (RemiliaForever) force-pushed the chore/migrate-repo-url branch from 81991ca to f2aa7e9 June 26, 2026 02:49

RemiliaForever (RemiliaForever) force-pushed the chore/geniex-bench branch from 589abb1 to 5a3483b June 26, 2026 02:49

Base automatically changed from chore/migrate-repo-url to main June 26, 2026 02:55

RemiliaForever (RemiliaForever) merged commit edd0183 into main Jun 26, 2026
9 of 11 checks passed

RemiliaForever (RemiliaForever) deleted the chore/geniex-bench branch June 26, 2026 02:56

RemiliaForever (RemiliaForever) merged 1 commit into main from chore/geniex-bench
