Implement performance checking script: i8 vs f16 non-fused conv kernels #1727

dorde-antic · 2025-01-28T16:57:42Z

Implement shell script that captures the performance difference between data types to validate expected kernel performance.
Resolves ROCm/rocMLIR-internal#1674

codecov · 2025-02-04T22:43:49Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1727      +/-   ##
===========================================
- Coverage    78.52%   77.99%   -0.52%     
===========================================
  Files          100      100              
  Lines        29907    30057     +150     
  Branches      4452     4656     +204     
===========================================
- Hits         23482    23442      -40     
- Misses        4590     4601      +11     
- Partials      1835     2014     +179

Flag	Coverage Δ
mfma	`77.98% <ø> (-0.54%)`	⬇️
navi3x	`77.98% <ø> (?)`
navi4x	`77.99% <ø> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

see 34 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

djramic

LGTM

mlir/utils/performance/performance-checking.sh

Signed-off-by: Djordje Antic <[email protected]>

Copilot

Pull Request Overview

This PR introduces a shell script to benchmark and compare FP16 vs INT8 performance for non-fused convolution kernels.

Adds perf-test-i8-f16-conv.sh to compile, time, and record average runtimes for both data types.
Supports configurable model name, ONNX path, and iteration count via flags.
Outputs results into a directory named after the model with a times summary file.

Comments suppressed due to low confidence (1)

mlir/utils/performance/perf-test-i8-f16-conv.sh:5

The usage comment references /performance-checking instead of this script's actual name (perf-test-i8-f16-conv.sh); updating it will avoid confusion.

# Usage: /performance-checking --d <model> --p <model_path> [--r <number_of_iterations>]"

mlir/utils/performance/perf-test-i8-f16-conv.sh

Signed-off-by: Djordje Antic <[email protected]>

umangyadav · 2025-06-24T13:39:32Z

mlir/utils/performance/perf-test-i8-f16-conv.sh

+        total_time=0
+
+        compiled="$testcase.mxdb"
+        migraphx-driver compile "$testcase" --mlir -o "$compiled" > /dev/null


Why this script is in rocMLIR ? Looks like better place is inside migraphx-benchmark-utils
https://github.com/ROCm/migraphx-benchmark-utils

On MLIR Standup (in january) we talked to put it in /mlir/utils/performance. But I agree with what you have said. Should I open PR there and put this script then? @umangyadav

And would it be in this directory: https://github.com/ROCm/migraphx-benchmark-utils/tree/main/scripts and to add it to readme table? @umangyadav

You can close this one

dorde-antic · 2025-06-24T14:52:34Z

Merged on migraphx-benchmark-utils repo - https://github.com/ROCm/migraphx-benchmark-utils/pull/70

dorde-antic marked this pull request as ready for review January 28, 2025 16:57

dorde-antic requested a review from causten as a code owner January 28, 2025 16:57

dorde-antic marked this pull request as draft January 28, 2025 16:59

dorde-antic marked this pull request as ready for review January 28, 2025 17:38

causten requested a review from djramic February 24, 2025 16:10

djramic approved these changes Feb 25, 2025

View reviewed changes

dhernandez0 requested changes Feb 26, 2025

View reviewed changes

mlir/utils/performance/performance-checking.sh Outdated Show resolved Hide resolved

mlir/utils/performance/performance-checking.sh Outdated Show resolved Hide resolved

mlir/utils/performance/performance-checking.sh Outdated Show resolved Hide resolved

dorde-antic force-pushed the performance-checking branch from 6fe114d to 896b225 Compare April 7, 2025 09:55

dorde-antic force-pushed the performance-checking branch from 896b225 to dad50ef Compare May 14, 2025 13:58

dorde-antic added 3 commits May 19, 2025 09:11

Implement performance checking script

bc89648

Signed-off-by: Djordje Antic <[email protected]>

Implement performance checking script

9583a30

Signed-off-by: Djordje Antic <[email protected]>

Fix to run both f16 and int8 kernels

520da0c

dorde-antic force-pushed the performance-checking branch from 45fd26f to 520da0c Compare May 19, 2025 14:11

dorde-antic added 2 commits June 18, 2025 12:40

Merge branch 'develop' into performance-checking

fcf2897

Address comments

9939e62

Signed-off-by: Djordje Antic <[email protected]>

dorde-antic requested a review from dhernandez0 June 19, 2025 10:09

Merge branch 'develop' into performance-checking

5890522

dorde-antic changed the title ~~Implement performance checking script~~ Implement performance checking script: i8 vs f16 non-fused conv kernels Jun 19, 2025

dorde-antic added 2 commits June 20, 2025 20:52

Merge branch 'develop' into performance-checking

3f8e384

Merge branch 'develop' into performance-checking

ff055d8

causten requested a review from Copilot June 23, 2025 14:21

Copilot AI reviewed Jun 23, 2025

View reviewed changes

mlir/utils/performance/perf-test-i8-f16-conv.sh Outdated Show resolved Hide resolved

mlir/utils/performance/perf-test-i8-f16-conv.sh Outdated Show resolved Hide resolved

mlir/utils/performance/perf-test-i8-f16-conv.sh Outdated Show resolved Hide resolved

Address comments

617b23a

Signed-off-by: Djordje Antic <[email protected]>

dorde-antic requested review from umangyadav, mirza-halilcevic and stefankoncarevic June 23, 2025 17:24

Merge branch 'develop' into performance-checking

09c1730

umangyadav reviewed Jun 24, 2025

View reviewed changes

dorde-antic closed this Jun 24, 2025

dorde-antic deleted the performance-checking branch June 24, 2025 14:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement performance checking script: i8 vs f16 non-fused conv kernels #1727

Implement performance checking script: i8 vs f16 non-fused conv kernels #1727

Uh oh!

dorde-antic commented Jan 28, 2025

Uh oh!

codecov bot commented Feb 4, 2025 •

edited

Loading

Uh oh!

djramic left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

umangyadav Jun 24, 2025 •

edited

Loading

Uh oh!

dorde-antic Jun 24, 2025 •

edited

Loading

Uh oh!

dorde-antic Jun 24, 2025

Uh oh!

umangyadav Jun 24, 2025

Uh oh!

dorde-antic commented Jun 24, 2025

Uh oh!

Uh oh!

Implement performance checking script: i8 vs f16 non-fused conv kernels #1727

Implement performance checking script: i8 vs f16 non-fused conv kernels #1727

Uh oh!

Conversation

dorde-antic commented Jan 28, 2025

Uh oh!

codecov bot commented Feb 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

djramic left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

umangyadav Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dorde-antic Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dorde-antic Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

umangyadav Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

dorde-antic commented Jun 24, 2025

Uh oh!

Uh oh!

codecov bot commented Feb 4, 2025 •

edited

Loading

umangyadav Jun 24, 2025 •

edited

Loading

dorde-antic Jun 24, 2025 •

edited

Loading