UPSTREAM PR #19433: Add a build target to generate ROCm artifacts using ROCm 7.2 by loci-dev · Pull Request #1160 · auroralabs-loci/llama.cpp

loci-dev · 2026-02-09T02:18:11Z

Note

Source pull request: ggml-org/llama.cpp#19433

This builds the following targets:

gfx1151
gfx1150
gfx1200
gfx1201
gfx1100
gfx1101
gfx1102
gfx1030
gfx1031
gfx1032

Notably; it doesn't include any CDNA targets, and thus avoids ggml-org/llama.cpp#19269

loci-review · 2026-02-09T03:09:58Z

No meaningful performance changes were detected across 115630 analyzed functions in the following binaries: build.bin.llama-cvector-generator, build.bin.llama-tts, build.bin.libllama.so, build.bin.libmtmd.so, build.bin.libggml-cpu.so, build.bin.libggml.so, build.bin.libggml-base.so, build.bin.llama-bench, build.bin.llama-gemma3-cli, build.bin.llama-gguf-split, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli, build.bin.llama-quantize, build.bin.llama-tokenize, build.bin.llama-qwen2vl-cli.

🔎 Full breakdown: Loci Inspector.
💬 Questions? Tag @loci-dev.

noamshiovits-Aurora · 2026-02-09T19:15:11Z

@loci-dev how many functions were modified in this pr and how many files were edited or added ?

loci-review · 2026-02-09T19:15:47Z

Based on the repository analysis:

PR Statistics

Files Changed

110 files modified
6 files added
2 files deleted
Total: 118 files changed
43 commits

Functions Analyzed

According to the performance review, 115,630 functions were analyzed across all binaries, with no meaningful performance changes detected in any of them.

This is a substantial PR focused on:

GPU backend optimizations (Metal, Vulkan, CUDA, SYCL, WebGPU)
New model architecture support (Step3.5-Flash, Kimi-Linear, Qwen3next)
Core infrastructure refactoring (sampling, batch processing)
Testing improvements and dependency updates

The fact that 115K+ functions were analyzed but showed no meaningful performance regressions is actually a positive signal - it indicates that despite the extensive changes across 118 files, the modifications maintained performance stability. The changes appear to be primarily focused on correctness, feature additions, and code quality improvements rather than introducing performance-impacting modifications.

loci-review · 2026-02-14T03:08:47Z

No meaningful performance changes were detected across 115001 analyzed functions in the following binaries: build.bin.llama-cvector-generator, build.bin.llama-tts, build.bin.libggml-cpu.so, build.bin.libmtmd.so, build.bin.libllama.so, build.bin.libggml-base.so, build.bin.llama-bench, build.bin.llama-quantize, build.bin.llama-tokenize, build.bin.llama-gguf-split, build.bin.libggml.so, build.bin.llama-qwen2vl-cli, build.bin.llama-gemma3-cli, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli.

🔎 Full breakdown: Loci Inspector.
💬 Questions? Tag @loci-dev.

This builds the following targets: * gfx1151 * gfx1150 * gfx1200 * gfx1201 * gfx1100 * gfx1101 * gfx908 * gfx90a * gfx942

loci-review · 2026-02-19T03:58:40Z

No meaningful performance changes were detected across 111508 analyzed functions in the following binaries: build.bin.llama-cvector-generator, build.bin.libllama.so, build.bin.libmtmd.so, build.bin.llama-tts, build.bin.llama-tokenize, build.bin.llama-gemma3-cli, build.bin.llama-gguf-split, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli, build.bin.llama-quantize, build.bin.llama-qwen2vl-cli, build.bin.llama-bench, build.bin.libggml-base.so, build.bin.libggml-cpu.so, build.bin.libggml.so.

🔎 Full breakdown: Loci Inspector.
💬 Questions? Tag @loci-dev.

loci-dev temporarily deployed to PROD__AL_DEMO February 9, 2026 02:18 — with GitHub Actions Inactive

loci-dev had a problem deploying to PROD__AL_DEMO February 9, 2026 19:35 — with GitHub Actions Failure

loci-dev force-pushed the main branch 3 times, most recently from ef7afbe to d4c3480 Compare February 14, 2026 02:16

loci-dev force-pushed the loci/pr-19433-superm1-rocm-github-action branch from 8aeb553 to d961293 Compare February 14, 2026 02:16

loci-dev temporarily deployed to PROD__AL_DEMO February 14, 2026 02:17 — with GitHub Actions Inactive

loci-dev force-pushed the main branch 6 times, most recently from 073bd79 to 823244c Compare February 18, 2026 02:17

Add a build target to generate ROCm artifacts using ROCm 7.2

4a1f236

This builds the following targets: * gfx1151 * gfx1150 * gfx1200 * gfx1201 * gfx1100 * gfx1101 * gfx908 * gfx90a * gfx942

loci-dev force-pushed the main branch from 823244c to bab7d39 Compare February 19, 2026 02:17

loci-dev force-pushed the loci/pr-19433-superm1-rocm-github-action branch from d961293 to 4a1f236 Compare February 19, 2026 03:06

loci-dev temporarily deployed to PROD__AL_DEMO February 19, 2026 03:06 — with GitHub Actions Inactive

loci-dev force-pushed the main branch 2 times, most recently from 10f8f26 to a6ecec6 Compare February 20, 2026 02:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UPSTREAM PR #19433: Add a build target to generate ROCm artifacts using ROCm 7.2#1160

UPSTREAM PR #19433: Add a build target to generate ROCm artifacts using ROCm 7.2#1160
loci-dev wants to merge 1 commit intomainfrom
loci/pr-19433-superm1-rocm-github-action

loci-dev commented Feb 9, 2026

Uh oh!

loci-review bot commented Feb 9, 2026

Uh oh!

noamshiovits-Aurora commented Feb 9, 2026

Uh oh!

loci-review bot commented Feb 9, 2026

Uh oh!

loci-review bot commented Feb 14, 2026

Uh oh!

loci-review bot commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

loci-dev commented Feb 9, 2026

Uh oh!

loci-review bot commented Feb 9, 2026

Uh oh!

noamshiovits-Aurora commented Feb 9, 2026

Uh oh!

loci-review bot commented Feb 9, 2026

PR Statistics

Files Changed

Functions Analyzed

Uh oh!

loci-review bot commented Feb 14, 2026

Uh oh!

loci-review bot commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments