Compare→bitmask SIMD lowering: where custom SIMD actually helps #8280
CodSpeed HQ / CodSpeed Performance Analysis
failed
Jun 7, 2026 in 0s
Performance Regression: -16.61%
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
❌ 2 regressed benchmarks
✅ 1511 untouched benchmarks
🆕 62 new benchmarks
Warning
Please fix the performance issues or acknowledge them on CodSpeed.
Performance Changes
| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| ❌ | Simulation | varbinview_zip_block_mask | 2.9 ms | 3.7 ms | -21.59% |
| ❌ | Simulation | varbinview_zip_fragmented_mask | 6.1 ms | 6.9 ms | -11.31% |
| 🆕 | Simulation | between_scalar_pack[1048576] | N/A | 2.6 ms | N/A |
| 🆕 | Simulation | between_scalar_pack[16384] | N/A | 41.2 µs | N/A |
| 🆕 | Simulation | between_simd_bench[1048576] | N/A | 2.3 ms | N/A |
| 🆕 | Simulation | between_simd_bench[16384] | N/A | 36.3 µs | N/A |
| 🆕 | Simulation | gt_scalar_pack[1048576] | N/A | 2.5 ms | N/A |
| 🆕 | Simulation | gt_scalar_pack[16384] | N/A | 39.7 µs | N/A |
| 🆕 | Simulation | gt_simd_bench[1048576] | N/A | 2.2 ms | N/A |
| 🆕 | Simulation | gt_simd_bench[16384] | N/A | 34.9 µs | N/A |
| 🆕 | Simulation | u8_scalar_pack[1048576] | N/A | 979.9 µs | N/A |
| 🆕 | Simulation | u8_scalar_pack[16384] | N/A | 16.1 µs | N/A |
| 🆕 | Simulation | u8_scalar_swar[1048576] | N/A | 934.2 µs | N/A |
| 🆕 | Simulation | u8_scalar_swar[16384] | N/A | 15.2 µs | N/A |
| 🆕 | Simulation | u8_simd[1048576] | N/A | 610.9 µs | N/A |
| 🆕 | Simulation | u8_simd[16384] | N/A | 10 µs | N/A |
| 🆕 | Simulation | between_bitbuffer_new[1024] | N/A | 5.7 µs | N/A |
| 🆕 | Simulation | between_bitbuffer_new[1048576] | N/A | 2.8 ms | N/A |
| 🆕 | Simulation | between_bitbuffer_new[16384] | N/A | 47 µs | N/A |
| 🆕 | Simulation | between_bitbuffer_new[262144] | N/A | 692.4 µs | N/A |
| ... | ... | ... | ... | ... | ... |
ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
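The headline -16.61% is not the plain average of the two regressed rows (which would be -16.45%). The figures shown are consistent with each row's Efficiency being `BASE / HEAD - 1` and the headline being the geometric mean of the two per-benchmark ratios. This is an inference from the numbers in this report, not a formula the report itself states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
import math

def efficiency_change(base: float, head: float) -> float:
    """Percent change, ASSUMING the report uses base / head - 1."""
    return (base / head - 1.0) * 100.0

def combined_change(changes_pct: list[float]) -> float:
    """Geometric mean of per-benchmark ratios, as a percent change (assumed aggregation)."""
    ratios = [1.0 + c / 100.0 for c in changes_pct]
    geo_mean = math.prod(ratios) ** (1.0 / len(ratios))
    return (geo_mean - 1.0) * 100.0

# Efficiency values of the two regressed benchmarks, as reported above.
reported = [-21.59, -11.31]
headline = combined_change(reported)
print(f"{headline:.2f}%")  # -16.61%

# The displayed times are rounded, so recomputing from them is only approximate.
approx_block = efficiency_change(2.9, 3.7)  # roughly -21.6, vs. -21.59 reported
```

Recomputing a row from its displayed BASE and HEAD only lands near the reported value, because the times in the table are rounded to one decimal place.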
Tip
Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.
Comparing claude/vector-bitpack-lowering-yKOkC (5d476ef) with develop (e06d80b)