
[VectorCombine] Generalize foldBitOpOfBitcasts to support more cast operations #148350

Open
wants to merge 1 commit into main

Conversation


@rhyadav rhyadav commented Jul 12, 2025

This patch generalizes the existing foldBitOpOfBitcasts optimization in the VectorCombine pass to handle
additional cast operations beyond just bitcast.

Fixes: #146037

Summary

The optimization now supports folding bitwise operations (AND/OR/XOR) with the following cast operations:

  • bitcast (original functionality)
  • trunc (truncate)
  • sext (sign extend)
  • zext (zero extend)

The transformation pattern is:
bitop(castop(x), castop(y)) -> castop(bitop(x, y))

This reduces the number of cast instructions from 2 to 1, which can improve performance on targets where cast
operations are expensive or where performing bitwise operations on narrower types is beneficial.
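
For orientation, the core of the rewrite boils down to the shape below. This is a condensed, illustrative sketch only - the helper name is invented for this description, and the actual implementation in the diff additionally performs the per-cast validation, the TTI cost comparison, multi-use accounting, flag propagation, and worklist updates.

#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/Instructions.h"

using namespace llvm;

// Sketch of bitop(castop(x), castop(y)) -> castop(bitop(x, y)).
// Returns the replacement value, or nullptr if the pattern does not apply.
static Value *foldBitOpOfCastopsSketch(BinaryOperator &BinOp,
                                       IRBuilder<> &Builder) {
  if (!BinOp.isBitwiseLogicOp())
    return nullptr;
  auto *LHSCast = dyn_cast<CastInst>(BinOp.getOperand(0));
  auto *RHSCast = dyn_cast<CastInst>(BinOp.getOperand(1));
  // Both operands must be the same kind of cast from the same source type.
  if (!LHSCast || !RHSCast || LHSCast->getOpcode() != RHSCast->getOpcode() ||
      LHSCast->getSrcTy() != RHSCast->getSrcTy())
    return nullptr;
  // Perform the bitwise op on the source type (e.g. the narrower type when
  // the inputs are zext/sext), then emit a single cast of the result.
  Value *NewOp = Builder.CreateBinOp(BinOp.getOpcode(), LHSCast->getOperand(0),
                                     RHSCast->getOperand(0), BinOp.getName());
  return Builder.CreateCast(LHSCast->getOpcode(), NewOp, BinOp.getType());
}

The net effect is that exactly one cast remains on the result, regardless of which of the four supported cast opcodes was matched.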

Implementation Details

  • Renamed foldBitOpOfBitcasts to foldBitOpOfCastops to reflect broader functionality
  • Extended pattern matching to handle any CastInst operation
  • Added validation for each cast type's constraints (e.g., trunc requires source elements wider than destination elements; see the sketch after this list)
  • Updated cost model to use the actual cast opcode
  • Preserves IR flags from original instructions
  • Handles multi-use scenarios appropriately
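
To make the validation bullet concrete, the constraints reduce to the predicate below (a condensed sketch of the switch in the patch; the helper name is invented here).

#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Instruction.h"

using namespace llvm;

// Per-cast legality rules used by the fold:
//  - bitcast: total vector bit width must be preserved
//  - trunc:   source elements must be wider than destination elements
//  - sext/zext: source elements must be narrower than destination elements
// Both vectors must have integer elements.
static bool isSupportedCastPair(Instruction::CastOps Opc,
                                FixedVectorType *SrcVecTy,
                                FixedVectorType *DstVecTy) {
  if (!SrcVecTy->getScalarType()->isIntegerTy() ||
      !DstVecTy->getScalarType()->isIntegerTy())
    return false;
  switch (Opc) {
  case Instruction::BitCast:
    return SrcVecTy->getPrimitiveSizeInBits() ==
           DstVecTy->getPrimitiveSizeInBits();
  case Instruction::Trunc:
    return SrcVecTy->getScalarSizeInBits() > DstVecTy->getScalarSizeInBits();
  case Instruction::SExt:
  case Instruction::ZExt:
    return SrcVecTy->getScalarSizeInBits() < DstVecTy->getScalarSizeInBits();
  default:
    return false;
  }
}

One consequence of the integer-element requirement (carried over from the original bitcast-only fold) is that the float-source bitcast tests in the new test file are left untransformed; their CHECK lines keep both bitcasts.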

Testing

  • Added comprehensive tests in test/Transforms/VectorCombine/bitop-of-castops.ll
  • Tests cover all supported cast types with all bitwise operations
  • Includes negative tests for unsupported patterns
  • All existing VectorCombine tests pass

…perations

This patch generalizes the foldBitOpOfBitcasts function (renamed to
foldBitOpOfCastops) to handle additional cast operations beyond just
bitcast. The optimization now supports:
- trunc (truncate)
- sext (sign extend)
- zext (zero extend)
- bitcast (original functionality)

The optimization transforms:
  bitop(cast(x), cast(y)) -> cast(bitop(x, y))

This reduces the number of cast instructions from 2 to 1, which can
improve performance on targets where cast operations are expensive or
where performing bitwise operations on narrower types is beneficial.

Changes:
- Renamed foldBitOpOfBitcasts to foldBitOpOfCastops
- Extended pattern matching to handle any CastInst
- Added validation for each cast type's constraints
- Updated cost model to use actual cast opcode
- Added comprehensive tests for all supported cast types

Fixes: llvm#146037
rhyadav marked this pull request as draft July 12, 2025 09:17

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository; in that case, you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "pinging" the PR with a comment containing "Ping". The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

Member

llvmbot commented Jul 12, 2025

@llvm/pr-subscribers-vectorizers

@llvm/pr-subscribers-llvm-transforms

Author: Rahul Yadav (rhyadav)

Full diff: https://github.com/llvm/llvm-project/pull/148350.diff

2 Files Affected:

  • (modified) llvm/lib/Transforms/Vectorize/VectorCombine.cpp (+100-34)
  • (added) llvm/test/Transforms/VectorCombine/bitop-of-castops.ll (+263)
diff --git a/llvm/lib/Transforms/Vectorize/VectorCombine.cpp b/llvm/lib/Transforms/Vectorize/VectorCombine.cpp
index fe8d74c43dfdc..58aa53694b22e 100644
--- a/llvm/lib/Transforms/Vectorize/VectorCombine.cpp
+++ b/llvm/lib/Transforms/Vectorize/VectorCombine.cpp
@@ -52,9 +52,9 @@ STATISTIC(NumScalarOps, "Number of scalar unary + binary ops formed");
 STATISTIC(NumScalarCmp, "Number of scalar compares formed");
 STATISTIC(NumScalarIntrinsic, "Number of scalar intrinsic calls formed");
 
-static cl::opt<bool> DisableVectorCombine(
-    "disable-vector-combine", cl::init(false), cl::Hidden,
-    cl::desc("Disable all vector combine transforms"));
+static cl::opt<bool>
+    DisableVectorCombine("disable-vector-combine", cl::init(false), cl::Hidden,
+                         cl::desc("Disable all vector combine transforms"));
 
 static cl::opt<bool> DisableBinopExtractShuffle(
     "disable-binop-extract-shuffle", cl::init(false), cl::Hidden,
@@ -115,7 +115,7 @@ class VectorCombine {
   bool foldInsExtFNeg(Instruction &I);
   bool foldInsExtBinop(Instruction &I);
   bool foldInsExtVectorToShuffle(Instruction &I);
-  bool foldBitOpOfBitcasts(Instruction &I);
+  bool foldBitOpOfCastops(Instruction &I);
   bool foldBitcastShuffle(Instruction &I);
   bool scalarizeOpOrCmp(Instruction &I);
   bool scalarizeVPIntrinsic(Instruction &I);
@@ -808,46 +808,105 @@ bool VectorCombine::foldInsExtBinop(Instruction &I) {
   return true;
 }
 
-bool VectorCombine::foldBitOpOfBitcasts(Instruction &I) {
-  // Match: bitop(bitcast(x), bitcast(y)) -> bitcast(bitop(x, y))
-  Value *LHSSrc, *RHSSrc;
-  if (!match(&I, m_BitwiseLogic(m_BitCast(m_Value(LHSSrc)),
-                                m_BitCast(m_Value(RHSSrc)))))
+bool VectorCombine::foldBitOpOfCastops(Instruction &I) {
+  // Match: bitop(castop(x), castop(y)) -> castop(bitop(x, y))
+  // Supports: bitcast, trunc, sext, zext
+
+  // Check if this is a bitwise logic operation
+  auto *BinOp = dyn_cast<BinaryOperator>(&I);
+  if (!BinOp || !BinOp->isBitwiseLogicOp())
+    return false;
+
+  LLVM_DEBUG(dbgs() << "Found bitwise logic op: " << I << "\n");
+
+  // Get the cast instructions
+  auto *LHSCast = dyn_cast<CastInst>(BinOp->getOperand(0));
+  auto *RHSCast = dyn_cast<CastInst>(BinOp->getOperand(1));
+  if (!LHSCast || !RHSCast) {
+    LLVM_DEBUG(dbgs() << "  One or both operands are not cast instructions\n");
+    return false;
+  }
+
+  LLVM_DEBUG(dbgs() << "  LHS cast: " << *LHSCast << "\n");
+  LLVM_DEBUG(dbgs() << "  RHS cast: " << *RHSCast << "\n");
+
+  // Both casts must be the same type
+  Instruction::CastOps CastOpcode = LHSCast->getOpcode();
+  if (CastOpcode != RHSCast->getOpcode())
     return false;
 
+  // Only handle supported cast operations
+  switch (CastOpcode) {
+  case Instruction::BitCast:
+  case Instruction::Trunc:
+  case Instruction::SExt:
+  case Instruction::ZExt:
+    break;
+  default:
+    return false;
+  }
+
+  Value *LHSSrc = LHSCast->getOperand(0);
+  Value *RHSSrc = RHSCast->getOperand(0);
+
   // Source types must match
   if (LHSSrc->getType() != RHSSrc->getType())
     return false;
-  if (!LHSSrc->getType()->getScalarType()->isIntegerTy())
-    return false;
 
-  // Only handle vector types
+  // Only handle vector types with integer elements
   auto *SrcVecTy = dyn_cast<FixedVectorType>(LHSSrc->getType());
   auto *DstVecTy = dyn_cast<FixedVectorType>(I.getType());
   if (!SrcVecTy || !DstVecTy)
     return false;
 
-  // Same total bit width
-  assert(SrcVecTy->getPrimitiveSizeInBits() ==
-             DstVecTy->getPrimitiveSizeInBits() &&
-         "Bitcast should preserve total bit width");
+  if (!SrcVecTy->getScalarType()->isIntegerTy() ||
+      !DstVecTy->getScalarType()->isIntegerTy())
+    return false;
+
+  // Validate cast operation constraints
+  switch (CastOpcode) {
+  case Instruction::BitCast:
+    // Total bit width must be preserved
+    if (SrcVecTy->getPrimitiveSizeInBits() !=
+        DstVecTy->getPrimitiveSizeInBits())
+      return false;
+    break;
+  case Instruction::Trunc:
+    // Source elements must be wider
+    if (SrcVecTy->getScalarSizeInBits() <= DstVecTy->getScalarSizeInBits())
+      return false;
+    break;
+  case Instruction::SExt:
+  case Instruction::ZExt:
+    // Source elements must be narrower
+    if (SrcVecTy->getScalarSizeInBits() >= DstVecTy->getScalarSizeInBits())
+      return false;
+    break;
+  }
 
   // Cost Check :
-  // OldCost = bitlogic + 2*bitcasts
-  // NewCost = bitlogic + bitcast
-  auto *BinOp = cast<BinaryOperator>(&I);
+  // OldCost = bitlogic + 2*casts
+  // NewCost = bitlogic + cast
   InstructionCost OldCost =
       TTI.getArithmeticInstrCost(BinOp->getOpcode(), DstVecTy) +
-      TTI.getCastInstrCost(Instruction::BitCast, DstVecTy, LHSSrc->getType(),
-                           TTI::CastContextHint::None) +
-      TTI.getCastInstrCost(Instruction::BitCast, DstVecTy, RHSSrc->getType(),
-                           TTI::CastContextHint::None);
+      TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
+                           TTI::CastContextHint::None) *
+          2;
+
   InstructionCost NewCost =
       TTI.getArithmeticInstrCost(BinOp->getOpcode(), SrcVecTy) +
-      TTI.getCastInstrCost(Instruction::BitCast, DstVecTy, SrcVecTy,
+      TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
                            TTI::CastContextHint::None);
 
-  LLVM_DEBUG(dbgs() << "Found a bitwise logic op of bitcasted values: " << I
+  // Account for multi-use casts
+  if (!LHSCast->hasOneUse())
+    NewCost += TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
+                                    TTI::CastContextHint::None);
+  if (!RHSCast->hasOneUse())
+    NewCost += TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
+                                    TTI::CastContextHint::None);
+
+  LLVM_DEBUG(dbgs() << "Found bitwise logic op of cast ops: " << I
                     << "\n  OldCost: " << OldCost << " vs NewCost: " << NewCost
                     << "\n");
 
@@ -862,8 +921,15 @@ bool VectorCombine::foldBitOpOfBitcasts(Instruction &I) {
 
   Worklist.pushValue(NewOp);
 
-  // Bitcast the result back
-  Value *Result = Builder.CreateBitCast(NewOp, I.getType());
+  // Create the cast operation
+  Value *Result = Builder.CreateCast(CastOpcode, NewOp, I.getType());
+
+  // Preserve cast instruction flags
+  if (auto *NewCast = dyn_cast<CastInst>(Result)) {
+    NewCast->copyIRFlags(LHSCast);
+    NewCast->andIRFlags(RHSCast);
+  }
+
   replaceValue(I, *Result);
   return true;
 }
@@ -1020,8 +1086,7 @@ bool VectorCombine::scalarizeVPIntrinsic(Instruction &I) {
   InstructionCost OldCost = 2 * SplatCost + VectorOpCost;
 
   // Determine scalar opcode
-  std::optional<unsigned> FunctionalOpcode =
-      VPI.getFunctionalOpcode();
+  std::optional<unsigned> FunctionalOpcode = VPI.getFunctionalOpcode();
   std::optional<Intrinsic::ID> ScalarIntrID = std::nullopt;
   if (!FunctionalOpcode) {
     ScalarIntrID = VPI.getFunctionalIntrinsicID();
@@ -1044,8 +1109,7 @@ bool VectorCombine::scalarizeVPIntrinsic(Instruction &I) {
       (SplatCost * !Op0->hasOneUse()) + (SplatCost * !Op1->hasOneUse());
   InstructionCost NewCost = ScalarOpCost + SplatCost + CostToKeepSplats;
 
-  LLVM_DEBUG(dbgs() << "Found a VP Intrinsic to scalarize: " << VPI
-                    << "\n");
+  LLVM_DEBUG(dbgs() << "Found a VP Intrinsic to scalarize: " << VPI << "\n");
   LLVM_DEBUG(dbgs() << "Cost of Intrinsic: " << OldCost
                     << ", Cost of scalarizing:" << NewCost << "\n");
 
@@ -2015,10 +2079,12 @@ bool VectorCombine::foldPermuteOfBinops(Instruction &I) {
   }
 
   unsigned NumOpElts = Op0Ty->getNumElements();
-  bool IsIdentity0 = ShuffleDstTy == Op0Ty &&
+  bool IsIdentity0 =
+      ShuffleDstTy == Op0Ty &&
       all_of(NewMask0, [NumOpElts](int M) { return M < (int)NumOpElts; }) &&
       ShuffleVectorInst::isIdentityMask(NewMask0, NumOpElts);
-  bool IsIdentity1 = ShuffleDstTy == Op1Ty &&
+  bool IsIdentity1 =
+      ShuffleDstTy == Op1Ty &&
       all_of(NewMask1, [NumOpElts](int M) { return M < (int)NumOpElts; }) &&
       ShuffleVectorInst::isIdentityMask(NewMask1, NumOpElts);
 
@@ -3773,7 +3839,7 @@ bool VectorCombine::run() {
       case Instruction::And:
       case Instruction::Or:
       case Instruction::Xor:
-        MadeChange |= foldBitOpOfBitcasts(I);
+        MadeChange |= foldBitOpOfCastops(I);
         break;
       default:
         MadeChange |= shrinkType(I);
diff --git a/llvm/test/Transforms/VectorCombine/bitop-of-castops.ll b/llvm/test/Transforms/VectorCombine/bitop-of-castops.ll
new file mode 100644
index 0000000000000..003e14bebd169
--- /dev/null
+++ b/llvm/test/Transforms/VectorCombine/bitop-of-castops.ll
@@ -0,0 +1,263 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
+; RUN: opt < %s -passes=vector-combine -S -mtriple=x86_64-- | FileCheck %s
+
+; Test bitwise operations with bitcast
+define <4 x i32> @and_bitcast_v4f32_to_v4i32(<4 x float> %a, <4 x float> %b) {
+; CHECK-LABEL: @and_bitcast_v4f32_to_v4i32(
+; CHECK-NEXT:    [[BC1:%.*]] = bitcast <4 x float> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[BC2:%.*]] = bitcast <4 x float> [[B:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[AND:%.*]] = and <4 x i32> [[BC1]], [[BC2]]
+; CHECK-NEXT:    ret <4 x i32> [[AND]]
+;
+  %bc1 = bitcast <4 x float> %a to <4 x i32>
+  %bc2 = bitcast <4 x float> %b to <4 x i32>
+  %and = and <4 x i32> %bc1, %bc2
+  ret <4 x i32> %and
+}
+
+define <4 x i32> @or_bitcast_v4f32_to_v4i32(<4 x float> %a, <4 x float> %b) {
+; CHECK-LABEL: @or_bitcast_v4f32_to_v4i32(
+; CHECK-NEXT:    [[BC1:%.*]] = bitcast <4 x float> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[BC2:%.*]] = bitcast <4 x float> [[B:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[OR:%.*]] = or <4 x i32> [[BC1]], [[BC2]]
+; CHECK-NEXT:    ret <4 x i32> [[OR]]
+;
+  %bc1 = bitcast <4 x float> %a to <4 x i32>
+  %bc2 = bitcast <4 x float> %b to <4 x i32>
+  %or = or <4 x i32> %bc1, %bc2
+  ret <4 x i32> %or
+}
+
+define <4 x i32> @xor_bitcast_v4f32_to_v4i32(<4 x float> %a, <4 x float> %b) {
+; CHECK-LABEL: @xor_bitcast_v4f32_to_v4i32(
+; CHECK-NEXT:    [[BC1:%.*]] = bitcast <4 x float> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[BC2:%.*]] = bitcast <4 x float> [[B:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[XOR:%.*]] = xor <4 x i32> [[BC1]], [[BC2]]
+; CHECK-NEXT:    ret <4 x i32> [[XOR]]
+;
+  %bc1 = bitcast <4 x float> %a to <4 x i32>
+  %bc2 = bitcast <4 x float> %b to <4 x i32>
+  %xor = xor <4 x i32> %bc1, %bc2
+  ret <4 x i32> %xor
+}
+
+; Test bitwise operations with truncate
+define <4 x i16> @and_trunc_v4i32_to_v4i16(<4 x i32> %a, <4 x i32> %b) {
+; CHECK-LABEL: @and_trunc_v4i32_to_v4i16(
+; CHECK-NEXT:    [[AND_INNER:%.*]] = and <4 x i32> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[AND:%.*]] = trunc <4 x i32> [[AND_INNER]] to <4 x i16>
+; CHECK-NEXT:    ret <4 x i16> [[AND]]
+;
+  %t1 = trunc <4 x i32> %a to <4 x i16>
+  %t2 = trunc <4 x i32> %b to <4 x i16>
+  %and = and <4 x i16> %t1, %t2
+  ret <4 x i16> %and
+}
+
+define <8 x i8> @or_trunc_v8i16_to_v8i8(<8 x i16> %a, <8 x i16> %b) {
+; CHECK-LABEL: @or_trunc_v8i16_to_v8i8(
+; CHECK-NEXT:    [[OR_INNER:%.*]] = or <8 x i16> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[OR:%.*]] = trunc <8 x i16> [[OR_INNER]] to <8 x i8>
+; CHECK-NEXT:    ret <8 x i8> [[OR]]
+;
+  %t1 = trunc <8 x i16> %a to <8 x i8>
+  %t2 = trunc <8 x i16> %b to <8 x i8>
+  %or = or <8 x i8> %t1, %t2
+  ret <8 x i8> %or
+}
+
+define <2 x i32> @xor_trunc_v2i64_to_v2i32(<2 x i64> %a, <2 x i64> %b) {
+; CHECK-LABEL: @xor_trunc_v2i64_to_v2i32(
+; CHECK-NEXT:    [[XOR_INNER:%.*]] = xor <2 x i64> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[XOR:%.*]] = trunc <2 x i64> [[XOR_INNER]] to <2 x i32>
+; CHECK-NEXT:    ret <2 x i32> [[XOR]]
+;
+  %t1 = trunc <2 x i64> %a to <2 x i32>
+  %t2 = trunc <2 x i64> %b to <2 x i32>
+  %xor = xor <2 x i32> %t1, %t2
+  ret <2 x i32> %xor
+}
+
+; Test bitwise operations with zero extend
+define <4 x i32> @and_zext_v4i16_to_v4i32(<4 x i16> %a, <4 x i16> %b) {
+; CHECK-LABEL: @and_zext_v4i16_to_v4i32(
+; CHECK-NEXT:    [[AND_INNER:%.*]] = and <4 x i16> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[AND:%.*]] = zext <4 x i16> [[AND_INNER]] to <4 x i32>
+; CHECK-NEXT:    ret <4 x i32> [[AND]]
+;
+  %z1 = zext <4 x i16> %a to <4 x i32>
+  %z2 = zext <4 x i16> %b to <4 x i32>
+  %and = and <4 x i32> %z1, %z2
+  ret <4 x i32> %and
+}
+
+define <8 x i16> @or_zext_v8i8_to_v8i16(<8 x i8> %a, <8 x i8> %b) {
+; CHECK-LABEL: @or_zext_v8i8_to_v8i16(
+; CHECK-NEXT:    [[OR_INNER:%.*]] = or <8 x i8> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[OR:%.*]] = zext <8 x i8> [[OR_INNER]] to <8 x i16>
+; CHECK-NEXT:    ret <8 x i16> [[OR]]
+;
+  %z1 = zext <8 x i8> %a to <8 x i16>
+  %z2 = zext <8 x i8> %b to <8 x i16>
+  %or = or <8 x i16> %z1, %z2
+  ret <8 x i16> %or
+}
+
+define <2 x i64> @xor_zext_v2i32_to_v2i64(<2 x i32> %a, <2 x i32> %b) {
+; CHECK-LABEL: @xor_zext_v2i32_to_v2i64(
+; CHECK-NEXT:    [[XOR_INNER:%.*]] = xor <2 x i32> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[XOR:%.*]] = zext <2 x i32> [[XOR_INNER]] to <2 x i64>
+; CHECK-NEXT:    ret <2 x i64> [[XOR]]
+;
+  %z1 = zext <2 x i32> %a to <2 x i64>
+  %z2 = zext <2 x i32> %b to <2 x i64>
+  %xor = xor <2 x i64> %z1, %z2
+  ret <2 x i64> %xor
+}
+
+; Test bitwise operations with sign extend
+define <4 x i32> @and_sext_v4i16_to_v4i32(<4 x i16> %a, <4 x i16> %b) {
+; CHECK-LABEL: @and_sext_v4i16_to_v4i32(
+; CHECK-NEXT:    [[AND_INNER:%.*]] = and <4 x i16> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[AND:%.*]] = sext <4 x i16> [[AND_INNER]] to <4 x i32>
+; CHECK-NEXT:    ret <4 x i32> [[AND]]
+;
+  %s1 = sext <4 x i16> %a to <4 x i32>
+  %s2 = sext <4 x i16> %b to <4 x i32>
+  %and = and <4 x i32> %s1, %s2
+  ret <4 x i32> %and
+}
+
+define <8 x i16> @or_sext_v8i8_to_v8i16(<8 x i8> %a, <8 x i8> %b) {
+; CHECK-LABEL: @or_sext_v8i8_to_v8i16(
+; CHECK-NEXT:    [[OR_INNER:%.*]] = or <8 x i8> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[OR:%.*]] = sext <8 x i8> [[OR_INNER]] to <8 x i16>
+; CHECK-NEXT:    ret <8 x i16> [[OR]]
+;
+  %s1 = sext <8 x i8> %a to <8 x i16>
+  %s2 = sext <8 x i8> %b to <8 x i16>
+  %or = or <8 x i16> %s1, %s2
+  ret <8 x i16> %or
+}
+
+define <2 x i64> @xor_sext_v2i32_to_v2i64(<2 x i32> %a, <2 x i32> %b) {
+; CHECK-LABEL: @xor_sext_v2i32_to_v2i64(
+; CHECK-NEXT:    [[XOR_INNER:%.*]] = xor <2 x i32> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[XOR:%.*]] = sext <2 x i32> [[XOR_INNER]] to <2 x i64>
+; CHECK-NEXT:    ret <2 x i64> [[XOR]]
+;
+  %s1 = sext <2 x i32> %a to <2 x i64>
+  %s2 = sext <2 x i32> %b to <2 x i64>
+  %xor = xor <2 x i64> %s1, %s2
+  ret <2 x i64> %xor
+}
+
+; Negative test: mismatched cast types (zext and sext)
+define <4 x i32> @and_zext_sext_mismatch(<4 x i16> %a, <4 x i16> %b) {
+; CHECK-LABEL: @and_zext_sext_mismatch(
+; CHECK-NEXT:    [[Z1:%.*]] = zext <4 x i16> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[S2:%.*]] = sext <4 x i16> [[B:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[AND:%.*]] = and <4 x i32> [[Z1]], [[S2]]
+; CHECK-NEXT:    ret <4 x i32> [[AND]]
+;
+  %z1 = zext <4 x i16> %a to <4 x i32>
+  %s2 = sext <4 x i16> %b to <4 x i32>
+  %and = and <4 x i32> %z1, %s2
+  ret <4 x i32> %and
+}
+
+; Negative test: mismatched source types
+define <4 x i32> @or_zext_different_src_types(<4 x i16> %a, <4 x i8> %b) {
+; CHECK-LABEL: @or_zext_different_src_types(
+; CHECK-NEXT:    [[Z1:%.*]] = zext <4 x i16> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[Z2:%.*]] = zext <4 x i8> [[B:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[OR:%.*]] = or <4 x i32> [[Z1]], [[Z2]]
+; CHECK-NEXT:    ret <4 x i32> [[OR]]
+;
+  %z1 = zext <4 x i16> %a to <4 x i32>
+  %z2 = zext <4 x i8> %b to <4 x i32>
+  %or = or <4 x i32> %z1, %z2
+  ret <4 x i32> %or
+}
+
+; Negative test: scalar types (not vectors)
+define i32 @xor_zext_scalar(i16 %a, i16 %b) {
+; CHECK-LABEL: @xor_zext_scalar(
+; CHECK-NEXT:    [[Z1:%.*]] = zext i16 [[A:%.*]] to i32
+; CHECK-NEXT:    [[Z2:%.*]] = zext i16 [[B:%.*]] to i32
+; CHECK-NEXT:    [[XOR:%.*]] = xor i32 [[Z1]], [[Z2]]
+; CHECK-NEXT:    ret i32 [[XOR]]
+;
+  %z1 = zext i16 %a to i32
+  %z2 = zext i16 %b to i32
+  %xor = xor i32 %z1, %z2
+  ret i32 %xor
+}
+
+; Test multi-use: one cast has multiple uses
+define <4 x i32> @and_zext_multiuse(<4 x i16> %a, <4 x i16> %b) {
+; CHECK-LABEL: @and_zext_multiuse(
+; CHECK-NEXT:    [[Z1:%.*]] = zext <4 x i16> [[A:%.*]] to <4 x i32>
+; CHECK-NEXT:    [[AND_INNER:%.*]] = and <4 x i16> [[A]], [[B:%.*]]
+; CHECK-NEXT:    [[AND:%.*]] = zext <4 x i16> [[AND_INNER]] to <4 x i32>
+; CHECK-NEXT:    [[ADD:%.*]] = add <4 x i32> [[Z1]], [[AND]]
+; CHECK-NEXT:    ret <4 x i32> [[ADD]]
+;
+  %z1 = zext <4 x i16> %a to <4 x i32>
+  %z2 = zext <4 x i16> %b to <4 x i32>
+  %and = and <4 x i32> %z1, %z2
+  %add = add <4 x i32> %z1, %and  ; z1 has multiple uses
+  ret <4 x i32> %add
+}
+
+; Test with different vector sizes
+define <16 x i16> @or_zext_v16i8_to_v16i16(<16 x i8> %a, <16 x i8> %b) {
+; CHECK-LABEL: @or_zext_v16i8_to_v16i16(
+; CHECK-NEXT:    [[OR_INNER:%.*]] = or <16 x i8> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[OR:%.*]] = zext <16 x i8> [[OR_INNER]] to <16 x i16>
+; CHECK-NEXT:    ret <16 x i16> [[OR]]
+;
+  %z1 = zext <16 x i8> %a to <16 x i16>
+  %z2 = zext <16 x i8> %b to <16 x i16>
+  %or = or <16 x i16> %z1, %z2
+  ret <16 x i16> %or
+}
+
+; Test bitcast with different element counts
+define <8 x i16> @xor_bitcast_v4i32_to_v8i16(<4 x i32> %a, <4 x i32> %b) {
+; CHECK-LABEL: @xor_bitcast_v4i32_to_v8i16(
+; CHECK-NEXT:    [[XOR_INNER:%.*]] = xor <4 x i32> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[XOR:%.*]] = bitcast <4 x i32> [[XOR_INNER]] to <8 x i16>
+; CHECK-NEXT:    ret <8 x i16> [[XOR]]
+;
+  %bc1 = bitcast <4 x i32> %a to <8 x i16>
+  %bc2 = bitcast <4 x i32> %b to <8 x i16>
+  %xor = xor <8 x i16> %bc1, %bc2
+  ret <8 x i16> %xor
+}
+
+; Test truncate with flag preservation
+define <4 x i16> @and_trunc_nuw_nsw(<4 x i32> %a, <4 x i32> %b) {
+; CHECK-LABEL: @and_trunc_nuw_nsw(
+; CHECK-NEXT:    [[AND_INNER:%.*]] = and <4 x i32> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[AND:%.*]] = trunc nuw nsw <4 x i32> [[AND_INNER]] to <4 x i16>
+; CHECK-NEXT:    ret <4 x i16> [[AND]]
+;
+  %t1 = trunc nuw nsw <4 x i32> %a to <4 x i16>
+  %t2 = trunc nuw nsw <4 x i32> %b to <4 x i16>
+  %and = and <4 x i16> %t1, %t2
+  ret <4 x i16> %and
+}
+
+; Test zero extend with nneg flag
+define <4 x i32> @or_zext_nneg(<4 x i16> %a, <4 x i16> %b) {
+; CHECK-LABEL: @or_zext_nneg(
+; CHECK-NEXT:    [[OR_INNER:%.*]] = or <4 x i16> [[A:%.*]], [[B:%.*]]
+; CHECK-NEXT:    [[OR:%.*]] = zext nneg <4 x i16> [[OR_INNER]] to <4 x i32>
+; CHECK-NEXT:    ret <4 x i32> [[OR]]
+;
+  %z1 = zext nneg <4 x i16> %a to <4 x i32>
+  %z2 = zext nneg <4 x i16> %b to <4 x i32>
+  %or = or <4 x i32> %z1, %z2
+  ret <4 x i32> %or
+}

Author

rhyadav commented Jul 12, 2025

@RKSimon request your review

rhyadav marked this pull request as ready for review July 12, 2025 09:19
dtcxzyw requested a review from RKSimon July 12, 2025 13:31
cl::desc("Disable all vector combine transforms"));
static cl::opt<bool>
DisableVectorCombine("disable-vector-combine", cl::init(false), cl::Hidden,
cl::desc("Disable all vector combine transforms"));
Collaborator

(style) don't clang-format lines unrelated to a patch

m_BitCast(m_Value(RHSSrc)))))
bool VectorCombine::foldBitOpOfCastops(Instruction &I) {
// Match: bitop(castop(x), castop(y)) -> castop(bitop(x, y))
// Supports: bitcast, trunc, sext, zext
Collaborator

(style) Move the description comment up a few lines to outside the function def

if (!BinOp || !BinOp->isBitwiseLogicOp())
return false;

LLVM_DEBUG(dbgs() << "Found bitwise logic op: " << I << "\n");
Collaborator

probably drop this?

}

LLVM_DEBUG(dbgs() << " LHS cast: " << *LHSCast << "\n");
LLVM_DEBUG(dbgs() << " RHS cast: " << *RHSCast << "\n");
Collaborator

probably drop this?

if (SrcVecTy->getScalarSizeInBits() >= DstVecTy->getScalarSizeInBits())
return false;
break;
}
Collaborator

not sure if any of these checks are necessary - even the old assertion didn't contribute much, as Builder.CreateCast should assert the cast is valid for us if we get to that stage.
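
For reference, cast validity can also be queried directly, without re-deriving the width rules per opcode - for example via CastInst::castIsValid (an illustrative stand-alone snippet, not part of the patch), here checking the v4i32 -> v4i16 trunc case from the tests:

#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/LLVMContext.h"

using namespace llvm;

// Returns true: a trunc from <4 x i32> to <4 x i16> is a valid cast, so the
// rebuilt cast in the fold is valid by construction and arguably needs at
// most an assert rather than an early-out.
static bool truncV4I32ToV4I16IsValid(LLVMContext &Ctx) {
  auto *Src = FixedVectorType::get(Type::getInt32Ty(Ctx), 4);
  auto *Dst = FixedVectorType::get(Type::getInt16Ty(Ctx), 4);
  return CastInst::castIsValid(Instruction::Trunc, Src, Dst);
}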

TTI::CastContextHint::None);
TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
TTI::CastContextHint::None) *
2;
Collaborator

Can we hoist the separate getCastInstrCost calls here to avoid calling them again for the !hasOneUse cases below? Add the Instruction* args as well to help improve costs - we can't do that for the new cost calc, but it's still useful for the old costs. We're missing the CostKind as well.
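
A rough sketch of what the suggested hoisting could look like (illustrative only - the helper name and plumbing are invented here, CostKind stands in for whatever cost kind the pass uses, and the Instruction* context can only be supplied for the pre-existing casts, i.e. on the old-cost side):

#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/InstrTypes.h"
#include <utility>

using namespace llvm;

// Compute each cast's cost once and reuse it, instead of re-querying TTI for
// the !hasOneUse adjustments. Returns {OldCost, NewCost}.
static std::pair<InstructionCost, InstructionCost>
castFoldCosts(const TargetTransformInfo &TTI, TTI::TargetCostKind CostKind,
              BinaryOperator &BinOp, CastInst &LHSCast, CastInst &RHSCast,
              FixedVectorType *SrcVecTy, FixedVectorType *DstVecTy) {
  Instruction::CastOps CastOpcode = LHSCast.getOpcode();
  // Pass the existing cast instructions so TTI can refine the old cost.
  InstructionCost LHSCastCost = TTI.getCastInstrCost(
      CastOpcode, DstVecTy, SrcVecTy, TTI::CastContextHint::None, CostKind,
      &LHSCast);
  InstructionCost RHSCastCost = TTI.getCastInstrCost(
      CastOpcode, DstVecTy, SrcVecTy, TTI::CastContextHint::None, CostKind,
      &RHSCast);

  InstructionCost OldCost =
      TTI.getArithmeticInstrCost(BinOp.getOpcode(), DstVecTy, CostKind) +
      LHSCastCost + RHSCastCost;

  // The new cast does not exist yet, so no Instruction* context here.
  InstructionCost NewCost =
      TTI.getArithmeticInstrCost(BinOp.getOpcode(), SrcVecTy, CostKind) +
      TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
                           TTI::CastContextHint::None, CostKind);
  // Casts with other users must be kept, so charge them to the new cost.
  if (!LHSCast.hasOneUse())
    NewCost += LHSCastCost;
  if (!RHSCast.hasOneUse())
    NewCost += RHSCastCost;
  return {OldCost, NewCost};
}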

InstructionCost NewCost =
TTI.getArithmeticInstrCost(BinOp->getOpcode(), SrcVecTy) +
TTI.getCastInstrCost(Instruction::BitCast, DstVecTy, SrcVecTy,
TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
TTI::CastContextHint::None);
Collaborator

Missing CostKind.

TTI.getCastInstrCost(CastOpcode, DstVecTy, SrcVecTy,
TTI::CastContextHint::None) *
2;

InstructionCost NewCost =
TTI.getArithmeticInstrCost(BinOp->getOpcode(), SrcVecTy) +
Collaborator

Missing CostKind.

@@ -1020,8 +1086,7 @@ bool VectorCombine::scalarizeVPIntrinsic(Instruction &I) {
InstructionCost OldCost = 2 * SplatCost + VectorOpCost;

// Determine scalar opcode
std::optional<unsigned> FunctionalOpcode =
VPI.getFunctionalOpcode();
std::optional<unsigned> FunctionalOpcode = VPI.getFunctionalOpcode();
Collaborator

(style) don't clang-format lines unrelated to a patch

RKSimon requested a review from davemgreen July 14, 2025 08:06
Development

Successfully merging this pull request may close these issues.

[VectorCombine] Generalise foldBitOpOfBitcasts into foldBitOpOfCastops
3 participants