This is an archive of the discontinued LLVM Phabricator instance.

Differential D125392

[riscv] Canonicalize vsetvli (vsetvli avl, vtype1) vtype2 transitions
ClosedPublic

Authored by reames on May 11 2022, 8:19 AM.

Download Raw Diff

Details

Reviewers

frasercrmck
craig.topper

Commits

rG72925d98bf92: [riscv] Canonicalize vsetvli (vsetvli avl, vtype1) vtype2 transitionsas reviewed

Summary

This patch is an alternative to a piece of D125270. If we have one vsetvli which is using as AVL the output of another, and the prior AVL can be proven to produce the same VL value as that defining one, we can use the AVL from the prior instruction. This has the effect of removing a state transition on AVL, and will let us use the cheaper 'vsetvli x0, x0, vtype1' form or possibly even skip emitting it entirely.
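
As a rough illustration of why the rewrite described above is sound, here is a minimal C++ sketch. It is a toy model, not code from the patch: `ToyVType`, `toyVLMAX`, `toyVSETVLI`, `nestedVL` and `canonicalVL` are hypothetical names, and it assumes the simplified rule VL = min(AVL, VLMAX) with integer LMUL only (real hardware has more latitude when AVL exceeds VLMAX). Under that assumption, when both vtypes give the same VLMAX, feeding the first vsetvli's result into the second yields the same VL as using the original AVL directly.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Toy model only. None of these names exist in LLVM.
struct ToyVType {
  unsigned SEW;  // element width in bits
  unsigned LMUL; // integer register-group multiplier (fractional LMUL omitted)
};

// VLMAX = VLEN * LMUL / SEW.
constexpr uint64_t toyVLMAX(uint64_t VLEN, ToyVType T) {
  return VLEN * T.LMUL / T.SEW;
}

// Simplifying assumption: the resulting VL is min(AVL, VLMAX).
constexpr uint64_t toyVSETVLI(uint64_t AVL, uint64_t VLEN, ToyVType T) {
  return std::min(AVL, toyVLMAX(VLEN, T));
}

// vsetvli (vsetvli avl, vtype1), vtype2: the second AVL is the first result.
constexpr uint64_t nestedVL(uint64_t AVL, uint64_t VLEN, ToyVType T1,
                            ToyVType T2) {
  return toyVSETVLI(toyVSETVLI(AVL, VLEN, T1), VLEN, T2);
}

// Canonical form: reuse the original AVL directly with vtype2.
constexpr uint64_t canonicalVL(uint64_t AVL, uint64_t VLEN, ToyVType T2) {
  return toyVSETVLI(AVL, VLEN, T2);
}

// Same VLMAX (4 in both cases): the two forms agree.
static_assert(nestedVL(10, 128, {64, 2}, {32, 1}) ==
                  canonicalVL(10, 128, {32, 1}),
              "same VLMAX should give the same VL");
```

With VLEN = 128, the vtypes {64, 2} and {32, 1} both have VLMAX = 4, so the nested and canonical forms agree for any AVL. With {64, 1} and {8, 1} the VLMAX values differ (2 and 16), and the two forms no longer agree for AVL = 10, which is consistent with the patch requiring `hasSameVLMAX` before reusing the AVL.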

This builds on the same infrastructure as D125337, and does the analogous extension to working on abstract states instead of only prior explicit vsetvli instructions. This is where the (relatively minor) code improvements come from.

More importantly, this fixes the last case where the state computed in phase 1 and 2 of the algorithm differs from the state computed during phase 3. Note that such differences can cause miscompiles by creating disagreements about contents of the VL and VTYPE registers at block boundaries.

Doing this transform inside the dataflow can cause the compatibility of a later store to change with regard to the current state. test15 in the diff illustrates this case well. What we have is a vsetvli which is mutated by one following vector op, but whose GPR result is used by another. The compatibility logic walks back to the def in this case, and checks to see if it matches the immediate prior state. In phases 1 and 2, it doesn't, and in phase 3 (after mutation) it does because we remove a transition which caused it to differ.
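
The disagreement between phases described above can be sketched with a small toy model. This is only an illustration of the shape of the problem, not the pass's real data structures: `ToyState`, `ToyVSETVLI`, `defMatchesState`, `analysisView` and `emissionViewAfterMutation` are hypothetical names, and the vtype strings are placeholders. The point is that a check which reads the defining vsetvli gives one answer before that instruction is mutated and another answer afterwards.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the abstract VL/VTYPE state.
struct ToyState {
  std::string AVL;   // symbolic AVL, e.g. "%avl"
  std::string VType; // symbolic vtype, e.g. "e64,m1"
  bool operator==(const ToyState &O) const {
    return AVL == O.AVL && VType == O.VType;
  }
};

// Hypothetical stand-in for a vsetvli instruction.
struct ToyVSETVLI {
  ToyState Info; // what the instruction currently encodes
};

// Toy "walk back to the def" check: a use whose AVL is the GPR result of
// Def is treated as compatible if Def's encoded state equals the state
// immediately before the use.
bool defMatchesState(const ToyVSETVLI &Def, const ToyState &Cur) {
  return Def.Info == Cur;
}

// Analysis view (phases 1 and 2): Def is read exactly as written.
bool analysisView(const ToyVSETVLI &Def, const ToyState &Cur) {
  return defMatchesState(Def, Cur);
}

// Emission view (phase 3): an intervening vector op has already rewritten
// Def's VTYPE in place, so the same check now sees the mutated encoding.
bool emissionViewAfterMutation(ToyVSETVLI Def, const ToyState &Cur) {
  Def.Info.VType = Cur.VType; // the in-place mutation
  return defMatchesState(Def, Cur);
}
```

For a def encoding `{"%avl", "e32,mf2"}` and a current state `{"%avl", "e64,m1"}`, `analysisView` returns false while `emissionViewAfterMutation` returns true. That mismatch is the kind of divergence the summary says this patch removes by making the dataflow itself canonicalize the AVL.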

Diff Detail

Unit Tests: Failed

Time	Test
40 ms	x64 debian > Polly.ScopDetect::dot-scops-npm.ll

Event Timeline

reames created this revision. May 11 2022, 8:19 AM

Herald added a project: Restricted Project. May 11 2022, 8:19 AM

Herald added subscribers: sunshaoce, VincentWu, luke957 and 31 others.

reames requested review of this revision. May 11 2022, 8:19 AM

Herald added a project: Restricted Project. May 11 2022, 8:19 AM

Herald added subscribers: pcwang-thead, eopXD, MaskRay.

reames added a child revision: D125270: [riscv] Remove mutation of prior vsetvli from insertion dataflow. May 11 2022, 8:56 AM

Harbormaster completed remote builds in B163909: Diff 428669. May 11 2022, 9:08 AM

frasercrmck added inline comments. May 11 2022, 10:05 AM

llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
1165	Makes sense to me to do a post-pass. Though not for this patch.
1170	I think I'm getting confused by the comment here - what's "this instruction"? The current MI or the previous VSETVLI?
1258	`possibly`

reames added inline comments. May 11 2022, 10:18 AM

llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
1170	The current one. Essentially, if we skip emitting the state change, the mutated state on the prior MI (which we know is compatible with all of its own uses since we didn't change VL), must match what this state change would have produced. That is, the VL/VTYPE state after the vector op must be the same, even if that vector op doesn't care. (e.g. we can't reintroduce the scalar move bug)

LGTM

llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
1170	Makes sense, thanks. I think it's just that the use of `this` is somewhat confusing, since the comment is directly before a mutation of an instruction which can reasonably be thought of as "this" - and the comment doesn't make sense since it's obviously changing VTYPE. I think it might be better to explicitly name `MI` rather than relying on demonstrative pronouns.

This revision is now accepted and ready to land. May 11 2022, 10:30 AM

This revision was landed with ongoing or failed builds. May 11 2022, 10:45 AM

Closed by commit rG72925d98bf92: [riscv] Canonicalize vsetvli (vsetvli avl, vtype1) vtype2 transitionsas reviewed (authored by reames).

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG72925d98bf92: [riscv] Canonicalize vsetvli (vsetvli avl, vtype1) vtype2 transitionsas reviewed.

reames added inline comments. May 11 2022, 10:53 AM

llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp

1170

Thinking about this, I think I can make this a bit more explicit in code. What would you think of the following as a follow on tweak?

diff --git a/llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp b/llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
index b0b911572506..df79dfbfa5b2 100644
--- a/llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
+++ b/llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
@@ -1170,6 +1170,10 @@ void RISCVInsertVSETVLI::emitVSETVLIs(MachineBasicBlock &MBB) {
               // and VTYPE stay the same after MI.  This greatly limits the
               // mutation we can legally do here.
               PrevVSETVLIMI->getOperand(2).setImm(NewInfo.encodeVTYPE());
+              // Keep the abstract state in sync with the register values
+              // (At the moment, this is only used for the following assert.)
+              CurInfo.setVTYPE(NewInfo.encodeVTYPE());
+              assert(NewInfo == CurInfo && "states out of sync!");
               NeedInsertVSETVLI = false;
             }
           }

Revision Contents

Path	Size
llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp	38 lines
llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll	9 lines

Diff 428669

llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp

@@ 1,154 lines hidden @@ if (RISCVII::hasSEWOp(TSFlags)) {
       // treat it the same as the first phase so that we produce the correct
       // vl/vtype for succesor blocks.
       if (!canSkipVSETVLIForLoadStore(MI, NewInfo, CurInfo) &&
           needVSETVLI(NewInfo, CurInfo)) {
         // If the previous VL/VTYPE is set by VSETVLI and do not use, Merge it
         // with current VL/VTYPE.
         bool NeedInsertVSETVLI = true;
         if (PrevVSETVLIMI) {
-          bool HasSameAVL =
-              CurInfo.hasSameAVL(NewInfo) ||
-              (NewInfo.hasAVLReg() && NewInfo.getAVLReg().isVirtual() &&
-               NewInfo.getAVLReg() == PrevVSETVLIMI->getOperand(0).getReg());
           // If these two VSETVLI have the same AVL and the same VLMAX,
           // we could merge these two VSETVLI.
-          if (HasSameAVL && CurInfo.hasSameVLMAX(NewInfo)) {
+          // TODO: If we remove this, we get a `vsetvli x0, x0, vtype'
+          // here. We could simply let this be emitted, then remove
+          // the unused vsetvlis in a post-pass.
+          if (CurInfo.hasSameAVL(NewInfo) && CurInfo.hasSameVLMAX(NewInfo)) {
+            // WARNING: For correctness, it is essential the contents of VL
+            // and VTYPE stay the same after this instruction. This
+            // greatly limits the mutation we can legally do here.
             PrevVSETVLIMI->getOperand(2).setImm(NewInfo.encodeVTYPE());
             NeedInsertVSETVLI = false;
           }
         }
         if (NeedInsertVSETVLI)
           insertVSETVLI(MBB, MI, NewInfo, CurInfo);
         CurInfo = NewInfo;
       }
@@ 65 lines hidden @@ if (isScalarMoveInstr(MI)) {
           else
             VLOp.ChangeToRegister(CurInfo.getAVLReg(), /*IsDef*/ false);
           CurInfo = computeInfoForInstr(MI, TSFlags, MRI);
           continue;
         }
       }

     if (RISCVII::hasSEWOp(TSFlags)) {
+      if (RISCVII::hasVLOp(TSFlags)) {
+        const auto Require = computeInfoForInstr(MI, TSFlags, MRI);
+        // If the AVL is the result of a previous vsetvli which has the
+        // same AVL and VLMAX as our current state, we can reuse the AVL
+        // from the current state for the new one. This allows us to
+        // generate 'vsetvli x0, x0, vtype" or possible skip the transition
+        // entirely.
+        if (!CurInfo.isUnknown() && Require.hasAVLReg() &&
+            Require.getAVLReg().isVirtual()) {
+          if (MachineInstr *DefMI = MRI->getVRegDef(Require.getAVLReg())) {
+            if (isVectorConfigInstr(*DefMI)) {
+              VSETVLIInfo DefInfo = getInfoForVSETVLI(*DefMI);
+              if (DefInfo.hasSameAVL(CurInfo) &&
+                  DefInfo.hasSameVLMAX(CurInfo)) {
+                MachineOperand &VLOp = MI.getOperand(getVLOpNum(MI));
+                if (CurInfo.hasAVLImm())
+                  VLOp.ChangeToImmediate(CurInfo.getAVLImm());
+                else
+                  VLOp.ChangeToRegister(CurInfo.getAVLReg(), /*IsDef*/ false);
+                CurInfo = computeInfoForInstr(MI, TSFlags, MRI);
+                continue;
+              }
+            }
+          }
+        }
+      }
       CurInfo = computeInfoForInstr(MI, TSFlags, MRI);
       continue;
     }

     // If this is something that updates VL/VTYPE that we don't know about,
     // set the state to unknown.
     if (MI.isCall() || MI.isInlineAsm() || MI.modifiesRegister(RISCV::VL) ||
         MI.modifiesRegister(RISCV::VTYPE))
@@ 82 lines hidden @@

llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll

@@ 275 lines hidden @@ %f2 = tail call <vscale x 1 x double> @llvm.riscv.vfadd.nxv1f64.nxv1f64(
     <vscale x 1 x double> %b,
     i64 %vsetvli)
   ret <vscale x 1 x double> %f2
 }

 define <vscale x 1 x double> @test15(i64 %avl, <vscale x 1 x double> %a, <vscale x 1 x double> %b) nounwind {
 ; CHECK-LABEL: test15:
 ; CHECK: # %bb.0: # %entry
-; CHECK-NEXT: vsetvli a0, a0, e64, m1, ta, mu
+; CHECK-NEXT: vsetvli zero, a0, e64, m1, ta, mu
 ; CHECK-NEXT: vfadd.vv v8, v8, v9
 ; CHECK-NEXT: vfadd.vv v8, v8, v9
-; CHECK-NEXT: vsetvli zero, a0, e64, m1, ta, mu
 ; CHECK-NEXT: ret
 entry:
   %vsetvli = tail call i64 @llvm.riscv.vsetvli(i64 %avl, i64 2, i64 7)
   %f1 = tail call <vscale x 1 x double> @llvm.riscv.vfadd.nxv1f64.nxv1f64(
     <vscale x 1 x double> undef,
     <vscale x 1 x double> %a,
     <vscale x 1 x double> %b,
     i64 %avl)
@@ 53 lines hidden @@ entry:
   %c3 = fadd double %c1, %c2
   ret double %c3
 }


 define <vscale x 1 x double> @test18(<vscale x 1 x double> %a, double %b) nounwind {
 ; CHECK-LABEL: test18:
 ; CHECK: # %bb.0: # %entry
-; CHECK-NEXT: vsetivli a0, 6, e64, m1, tu, mu
+; CHECK-NEXT: vsetivli zero, 6, e64, m1, tu, mu
 ; CHECK-NEXT: vmv1r.v v9, v8
 ; CHECK-NEXT: vfmv.s.f v9, fa0
-; CHECK-NEXT: vsetvli zero, a0, e64, m1, ta, mu
+; CHECK-NEXT: vsetvli zero, zero, e64, m1, ta, mu
 ; CHECK-NEXT: vfadd.vv v8, v8, v8
-; CHECK-NEXT: vsetivli zero, 1, e64, m1, tu, mu
+; CHECK-NEXT: vsetvli zero, zero, e64, m1, tu, mu
 ; CHECK-NEXT: vfmv.s.f v8, fa0
 ; CHECK-NEXT: vsetvli a0, zero, e64, m1, ta, mu
 ; CHECK-NEXT: vfadd.vv v8, v9, v8
 ; CHECK-NEXT: ret
 entry:
   %x = tail call i64 @llvm.riscv.vsetvli(i64 6, i64 3, i64 0)
   %y = call <vscale x 1 x double> @llvm.riscv.vfmv.s.f.nxv1f64(
     <vscale x 1 x double> %a, double %b, i64 2)
@@ 61 lines hidden @@
