Skip to content

[Bench][Linear-Attn] Add Qwen GDN prefill benchmark surface#5

Draft
superAngGao wants to merge 1 commit into
impl/gdn-prefill-opfrom
issue-1597-gdn-prefill-bench
Draft

[Bench][Linear-Attn] Add Qwen GDN prefill benchmark surface#5
superAngGao wants to merge 1 commit into
impl/gdn-prefill-opfrom
issue-1597-gdn-prefill-bench

Conversation

@superAngGao

Copy link
Copy Markdown
Owner

Stacked on tile-ai#1596. Addresses tile-ai#1597.

Summary

  • Switch the Gated DeltaNet prefill benchmark surface to the FLA/Qwen BTHD contract.
  • Add Qwen3.5-aligned long-context rows for S=32K/64K/128K and H=16/32/48/64 across fp16/bf16.
  • Compare TileOps against FLA with output_final_state=True and surface speedup in profile_run.log.
  • Keep nightly JUnit properties compatible while allowing profile_run.log to show extra metrics and GPU clocks.

Validation

  • python -m py_compile benchmarks/ops/bench_gated_deltanet_prefill.py workloads/gated_deltanet.py benchmarks/benchmark_base.py tileops/perf/formulas.py
  • python scripts/validate_manifest.py
  • python -m pytest --collect-only benchmarks/ops/bench_gated_deltanet_prefill.py -q
  • TMPDIR=/home/ga/tmp python -m pytest tests/ops/test_gated_deltanet_prefill.py -m smoke -q

Notes

  • The local environment does not have FLA installed, so full benchmark execution was only checked for collection/skip behavior here.

@github-actions github-actions Bot added the bench Auto-created by labeler label Jun 23, 2026
@superAngGao superAngGao force-pushed the impl/gdn-prefill-op branch 3 times, most recently from 9d21d88 to 9282b2b Compare June 24, 2026 10:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bench Auto-created by labeler

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant