Skip to content

[RISCV] Handle more cases when combining (vfmv.s.f (extract_subvector X, 0)) #154175

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
20 changes: 15 additions & 5 deletions llvm/lib/Target/RISCV/RISCVISelLowering.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -20738,12 +20738,22 @@ SDValue RISCVTargetLowering::PerformDAGCombine(SDNode *N,
isNullConstant(Src.getOperand(1)) &&
Src.getOperand(0).getValueType().isScalableVector()) {
EVT VT = N->getValueType(0);
EVT SrcVT = Src.getOperand(0).getValueType();
assert(SrcVT.getVectorElementType() == VT.getVectorElementType());
SDValue EVSrc = Src.getOperand(0);
EVT EVSrcVT = EVSrc.getValueType();
assert(EVSrcVT.getVectorElementType() == VT.getVectorElementType());
// Widths match, just return the original vector.
if (SrcVT == VT)
return Src.getOperand(0);
// TODO: Use insert_subvector/extract_subvector to change widen/narrow?
if (EVSrcVT == VT)
return EVSrc;
SDLoc DL(N);
// Width is narrower, using insert_subvector.
if (EVSrcVT.getVectorMinNumElements() < VT.getVectorMinNumElements()) {
return DAG.getNode(ISD::INSERT_SUBVECTOR, DL, VT, DAG.getUNDEF(VT),
EVSrc,
DAG.getConstant(0, DL, Subtarget.getXLenVT()));
}
// Width is wider, using extract_subvector.
return DAG.getNode(ISD::EXTRACT_SUBVECTOR, DL, VT, EVSrc,
DAG.getConstant(0, DL, Subtarget.getXLenVT()));
}
[[fallthrough]];
}
Expand Down
26 changes: 26 additions & 0 deletions llvm/test/CodeGen/RISCV/rvv/redundant-vfmvsf.ll
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
; RUN: llc -mtriple=riscv64 -mattr='+v,+zvl512b' < %s | FileCheck %s

define <2 x float> @redundant_vfmv(<2 x float> %arg0, <64 x float> %arg1, <64 x float> %arg2) {
; CHECK-LABEL: redundant_vfmv:
; CHECK: # %bb.0:
; CHECK-NEXT: li a0, 64
; CHECK-NEXT: vsetvli zero, a0, e32, m4, ta, ma
; CHECK-NEXT: vfredusum.vs v9, v12, v8
; CHECK-NEXT: vsetivli zero, 1, e32, mf2, ta, ma
; CHECK-NEXT: vslidedown.vi v8, v8, 1
; CHECK-NEXT: vsetvli zero, a0, e32, m4, ta, ma
; CHECK-NEXT: vfredusum.vs v8, v16, v8
; CHECK-NEXT: vfmv.f.s fa5, v8
; CHECK-NEXT: vsetivli zero, 2, e32, mf2, ta, ma
; CHECK-NEXT: vrgather.vi v8, v9, 0
; CHECK-NEXT: vfslide1down.vf v8, v8, fa5
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As a follow on, this could be a slideup from v8 instead.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

candidate PR: #154450

; CHECK-NEXT: ret
%s0 = extractelement <2 x float> %arg0, i64 0
%r0 = tail call reassoc float @llvm.vector.reduce.fadd.v64f32(float %s0, <64 x float> %arg1)
%a0 = insertelement <2 x float> poison, float %r0, i64 0
%s1 = extractelement <2 x float> %arg0, i64 1
%r1 = tail call reassoc float @llvm.vector.reduce.fadd.v64f32(float %s1, <64 x float> %arg2)
%a1 = insertelement <2 x float> %a0, float %r1, i64 1
ret <2 x float> %a1
}
Loading