This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/RISCV/
-
Target/
-
RISCV/
1/2
RISCVISelLowering.cpp
-
test/CodeGen/RISCV/
-
CodeGen/
-
RISCV/
-
alu64.ll
-
bittest.ll
-
compress-opt-select.ll
1
double-convert.ll
-
double-round-conv-sat.ll
-
float-convert.ll
-
float-round-conv-sat.ll
-
forced-atomics.ll
-
fpclamptosat.ll
-
fpclamptosat_vec.ll
-
half-convert.ll
-
half-round-conv-sat.ll
-
rotl-rotr.ll
-
rv32zbb-zbkb.ll
-
rv32zbs.ll
-
rv64zbb.ll
-
rvv/
-
ceil-vp.ll
-
fixed-vector-fpext-vp.ll
-
fixed-vector-fptrunc-vp.ll
-
fixed-vector-trunc-vp.ll
-
fixed-vectors-ceil-vp.ll
-
fixed-vectors-floor-vp.ll
-
fixed-vectors-fp2i-sat.ll
-
fixed-vectors-fptosi-vp.ll
-
fixed-vectors-fptoui-vp.ll
-
fixed-vectors-reduction-fp-vp.ll
-
fixed-vectors-reduction-int-vp.ll
-
fixed-vectors-reduction-mask-vp.ll
-
fixed-vectors-round-vp.ll
-
fixed-vectors-roundeven-vp.ll
-
fixed-vectors-roundtozero-vp.ll
-
fixed-vectors-setcc-fp-vp.ll
-
fixed-vectors-setcc-int-vp.ll
-
fixed-vectors-sext-vp.ll
-
fixed-vectors-sitofp-vp.ll
-
fixed-vectors-strided-vpload.ll
-
fixed-vectors-strided-vpstore.ll
-
fixed-vectors-uitofp-vp.ll
-
fixed-vectors-vadd-vp.ll
-
fixed-vectors-vcopysign-vp.ll
-
fixed-vectors-vfabs-vp.ll
-
fixed-vectors-vfma-vp.ll
-
fixed-vectors-vfmax-vp.ll
-
fixed-vectors-vfmin-vp.ll
-
fixed-vectors-vfmuladd-vp.ll
-
fixed-vectors-vfneg-vp.ll
-
fixed-vectors-vfsqrt-vp.ll
-
fixed-vectors-vmax-vp.ll
-
fixed-vectors-vmaxu-vp.ll
-
fixed-vectors-vmin-vp.ll
-
fixed-vectors-vminu-vp.ll
-
fixed-vectors-vpgather.ll
-
fixed-vectors-vpload.ll
-
fixed-vectors-vpmerge.ll
-
fixed-vectors-vpscatter.ll
-
fixed-vectors-vpstore.ll
-
fixed-vectors-vselect-vp.ll
-
fixed-vectors-zext-vp.ll
-
floor-vp.ll
-
round-vp.ll
-
roundeven-vp.ll
-
roundtozero-vp.ll
-
setcc-fp-vp.ll
-
setcc-int-vp.ll
-
strided-vpload.ll
-
strided-vpstore.ll
-
vadd-vp.ll
-
vfabs-vp.ll
-
vfma-vp.ll
-
vfmuladd-vp.ll
-
vfneg-vp.ll
-
vfpext-vp.ll
-
vfptosi-vp.ll
-
vfptoui-vp.ll
-
vfptrunc-vp.ll
-
vfsqrt-vp.ll
-
vmax-vp.ll
-
vmaxu-vp.ll
-
vmin-vp.ll
-
vminu-vp.ll
-
vpgather-sdnode.ll
-
vpload.ll
-
vpmerge-sdnode.ll
-
vpscatter-sdnode.ll
-
vpstore.ll
-
vreductions-fp-vp.ll
-
vreductions-int-vp.ll
-
vreductions-mask-vp.ll
-
vselect-vp.ll
-
vsext-vp.ll
-
vsitofp-vp.ll
-
vtrunc-vp.ll
-
vuitofp-vp.ll
-
vzext-vp.ll
-
selectcc-to-shiftand.ll
-
shift-masked-shamt.ll
-
shifts.ll
-
usub_sat.ll
-
usub_sat_plus.ll
-
vec3-setcc-crash.ll

Differential D135600

[RISCV] Use branchless form for selects with 0 in either arm
ClosedPublic

Authored by reames on Oct 10 2022, 9:18 AM.

Download Raw Diff

Details

Reviewers

craig.topper
asb
frasercrmck

Commits

rG1c41d0cb62c2: [RISCV] Use branchless form for selects with 0 in either arm

Summary

Continuing the theme of adding branchless lowerings for simple selects, this time handle the 0 arm case. This is very common for various umin idioms, etc..

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

reames created this revision.Oct 10 2022, 9:18 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 10 2022, 9:18 AM

Herald added subscribers: sunshaoce, VincentWu, armkevincheng and 33 others. · View Herald Transcript

reames requested review of this revision.Oct 10 2022, 9:18 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 10 2022, 9:18 AM

Herald added subscribers: • pcwang-thead, eopXD, MaskRay. · View Herald Transcript

craig.topper added inline comments.Oct 10 2022, 9:55 AM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
9505	Too much copy/paste. :) This should be `isNullConstant`

Harbormaster completed remote builds in B191302: Diff 466532.Oct 10 2022, 10:20 AM

reames added inline comments.Oct 10 2022, 11:17 AM

llvm/lib/Target/RISCV/RISCVISelLowering.cpp
9505	Eek, how did I miss that? I swear I looked at the test diffs too!

Address bug caught by @craig.topper. Due to copy paste mistake, second case was dead code. With fix, impact is even broader, including making many floating point conversion idioms branchless.

Is the branchless form better though? Branch+move can be fused but these forms can't and have more instructions to execute.

In D135600#3847548, @jrtc27 wrote:

Is the branchless form better though? Branch+move can be fused but these forms can't and have more instructions to execute.

We need an mtune flag for CPUs that can fuse these like sifive-7-series. But we also need to implement a fusion guarantee too so that code motion doesn't sinks things into the basic block and break fusion. I assume rocket doesn't fuse these?

Harbormaster completed remote builds in B191338: Diff 466572.Oct 10 2022, 12:31 PM

liaolucy added a subscriber: liaolucy.Oct 10 2022, 5:45 PM

craig.topper added inline comments.Oct 12 2022, 11:06 AM

llvm/test/CodeGen/RISCV/double-convert.ll
97	This is interesting. The seqz isn't necessary. I'll take a look at this.

LGTM. I think this is a good starting point. I have a followup I'm going to try to fix the issue I noticed, but that shouldn't block this.

This revision is now accepted and ready to land.Oct 12 2022, 12:15 PM

This revision was landed with ongoing or failed builds.Oct 12 2022, 1:52 PM

Closed by commit rG1c41d0cb62c2: [RISCV] Use branchless form for selects with 0 in either arm (authored by reames). · Explain Why

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG1c41d0cb62c2: [RISCV] Use branchless form for selects with 0 in either arm.

Large Diff

This large diff affects 101 files. Files without inline comments have been collapsed. Expand All Files

Revision Contents

Path

Size

llvm/

lib/

Target/

RISCV/

RISCVISelLowering.cpp

15 lines

test/

CodeGen/

RISCV/

alu64.ll

39 lines

bittest.ll

24 lines

compress-opt-select.ll

24 lines

double-convert.ll

598 lines

double-round-conv-sat.ll

285 lines

float-convert.ll

443 lines

float-round-conv-sat.ll

285 lines

67 lines

3070 lines

2981 lines

802 lines

half-round-conv-sat.ll

285 lines

744 lines

41 lines

70 lines

42 lines

rvv/

ceil-vp.ll

97 lines

fixed-vector-fpext-vp.ll

16 lines

fixed-vector-fptrunc-vp.ll

16 lines

fixed-vector-trunc-vp.ll

318 lines

fixed-vectors-ceil-vp.ll

145 lines

fixed-vectors-floor-vp.ll

145 lines

fixed-vectors-fp2i-sat.ll

300 lines

fixed-vectors-fptosi-vp.ll

39 lines

fixed-vectors-fptoui-vp.ll

39 lines

fixed-vectors-reduction-fp-vp.ll

46 lines

fixed-vectors-reduction-int-vp.ll

23 lines

fixed-vectors-reduction-mask-vp.ll

32 lines

fixed-vectors-round-vp.ll

145 lines

fixed-vectors-roundeven-vp.ll

145 lines

fixed-vectors-roundtozero-vp.ll

145 lines

fixed-vectors-setcc-fp-vp.ll

103 lines

fixed-vectors-setcc-int-vp.ll

190 lines

fixed-vectors-sext-vp.ll

32 lines

fixed-vectors-sitofp-vp.ll

39 lines

fixed-vectors-strided-vpload.ll

194 lines

fixed-vectors-strided-vpstore.ll

64 lines

fixed-vectors-uitofp-vp.ll

39 lines

fixed-vectors-vadd-vp.ll

126 lines

fixed-vectors-vcopysign-vp.ll

79 lines

fixed-vectors-vfabs-vp.ll

39 lines

fixed-vectors-vfma-vp.ll

156 lines

fixed-vectors-vfmax-vp.ll

79 lines

fixed-vectors-vfmin-vp.ll

79 lines

fixed-vectors-vfmuladd-vp.ll

156 lines

fixed-vectors-vfneg-vp.ll

39 lines

fixed-vectors-vfsqrt-vp.ll

39 lines

fixed-vectors-vmax-vp.ll

80 lines

fixed-vectors-vmaxu-vp.ll

80 lines

fixed-vectors-vmin-vp.ll

80 lines

fixed-vectors-vminu-vp.ll

80 lines

fixed-vectors-vpgather.ll

596 lines

fixed-vectors-vpload.ll

50 lines

fixed-vectors-vpmerge.ll

177 lines

fixed-vectors-vpscatter.ll

236 lines

fixed-vectors-vpstore.ll

22 lines

fixed-vectors-vselect-vp.ll

171 lines

fixed-vectors-zext-vp.ll

32 lines

97 lines

97 lines

97 lines

97 lines

264 lines

229 lines

180 lines

244 lines

112 lines

39 lines

127 lines

127 lines

39 lines

20 lines

59 lines

59 lines

165 lines

39 lines

112 lines

112 lines

112 lines

112 lines

312 lines

76 lines

206 lines

252 lines

90 lines

54 lines

vreductions-int-vp.ll

66 lines

vreductions-mask-vp.ll

29 lines

106 lines

41 lines

59 lines

190 lines

59 lines

41 lines

selectcc-to-shiftand.ll

76 lines

shift-masked-shamt.ll

26 lines

569 lines

111 lines

148 lines

100 lines

Diff 467252

llvm/lib/Target/RISCV/RISCVISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 9,488 Lines • ▼ Show 20 Lines	case RISCVISD::SELECT_CC: {
// (select c, y, -1) -> -!c \| y		// (select c, y, -1) -> -!c \| y
if (isAllOnesConstant(FalseV)) {		if (isAllOnesConstant(FalseV)) {
SDValue C = DAG.getSetCC(DL, VT, LHS, RHS,		SDValue C = DAG.getSetCC(DL, VT, LHS, RHS,
ISD::getSetCCInverse(CCVal, VT));		ISD::getSetCCInverse(CCVal, VT));
SDValue Neg = DAG.getNegative(C, DL, VT);		SDValue Neg = DAG.getNegative(C, DL, VT);
return DAG.getNode(ISD::OR, DL, VT, Neg, TrueV);		return DAG.getNode(ISD::OR, DL, VT, Neg, TrueV);
}		}

		// (select c, 0, y) -> -!c & y
		if (isNullConstant(TrueV)) {
		SDValue C = DAG.getSetCC(DL, VT, LHS, RHS,
		ISD::getSetCCInverse(CCVal, VT));
		SDValue Neg = DAG.getNegative(C, DL, VT);
		return DAG.getNode(ISD::AND, DL, VT, Neg, FalseV);
		}
		// (select c, y, 0) -> -c & y
		if (isNullConstant(FalseV)) {
		craig.topperUnsubmitted Not Done Reply Inline Actions Too much copy/paste. :) This should be `isNullConstant` craig.topper: Too much copy/paste. :) This should be `isNullConstant`
		reamesAuthorUnsubmitted Done Reply Inline Actions Eek, how did I miss that? I swear I looked at the test diffs too! reames: Eek, how did I miss that? I swear I looked at the test diffs too!
		SDValue C = DAG.getSetCC(DL, VT, LHS, RHS, CCVal);
		SDValue Neg = DAG.getNegative(C, DL, VT);
		return DAG.getNode(ISD::AND, DL, VT, Neg, TrueV);
		}


return SDValue();		return SDValue();
}		}
case RISCVISD::BR_CC: {		case RISCVISD::BR_CC: {
SDValue LHS = N->getOperand(1);		SDValue LHS = N->getOperand(1);
SDValue RHS = N->getOperand(2);		SDValue RHS = N->getOperand(2);
SDValue CC = N->getOperand(3);		SDValue CC = N->getOperand(3);
SDLoc DL(N);		SDLoc DL(N);

▲ Show 20 Lines • Show All 3,506 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[RISCV] Use branchless form for selects with 0 in either armClosedPublic

Details

Diff Detail

Event Timeline

Large Diff

Revision Contents

Diff 467252

llvm/lib/Target/RISCV/RISCVISelLowering.cpp

llvm/test/CodeGen/RISCV/alu64.ll

llvm/test/CodeGen/RISCV/bittest.ll

llvm/test/CodeGen/RISCV/compress-opt-select.ll

llvm/test/CodeGen/RISCV/double-convert.ll

llvm/test/CodeGen/RISCV/double-round-conv-sat.ll

llvm/test/CodeGen/RISCV/float-convert.ll

llvm/test/CodeGen/RISCV/float-round-conv-sat.ll

llvm/test/CodeGen/RISCV/forced-atomics.ll

llvm/test/CodeGen/RISCV/fpclamptosat.ll

llvm/test/CodeGen/RISCV/fpclamptosat_vec.ll

llvm/test/CodeGen/RISCV/half-convert.ll

llvm/test/CodeGen/RISCV/half-round-conv-sat.ll

llvm/test/CodeGen/RISCV/rotl-rotr.ll

llvm/test/CodeGen/RISCV/rv32zbb-zbkb.ll

llvm/test/CodeGen/RISCV/rv32zbs.ll

llvm/test/CodeGen/RISCV/rv64zbb.ll

llvm/test/CodeGen/RISCV/rvv/ceil-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vector-fpext-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vector-fptrunc-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vector-trunc-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-ceil-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-floor-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-fp2i-sat.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-fptosi-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-fptoui-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-reduction-fp-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-reduction-int-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-reduction-mask-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-round-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-roundeven-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-roundtozero-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-setcc-fp-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-setcc-int-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-sext-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-sitofp-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-strided-vpload.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-strided-vpstore.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-uitofp-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vadd-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vcopysign-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfabs-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfma-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfmax-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfmin-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfmuladd-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfneg-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vfsqrt-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vmax-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vmaxu-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vmin-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vminu-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vpgather.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vpload.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vpmerge.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vpscatter.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vpstore.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vselect-vp.ll

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-zext-vp.ll

llvm/test/CodeGen/RISCV/rvv/floor-vp.ll

llvm/test/CodeGen/RISCV/rvv/round-vp.ll

llvm/test/CodeGen/RISCV/rvv/roundeven-vp.ll

llvm/test/CodeGen/RISCV/rvv/roundtozero-vp.ll

llvm/test/CodeGen/RISCV/rvv/setcc-fp-vp.ll

llvm/test/CodeGen/RISCV/rvv/setcc-int-vp.ll

llvm/test/CodeGen/RISCV/rvv/strided-vpload.ll

llvm/test/CodeGen/RISCV/rvv/strided-vpstore.ll

llvm/test/CodeGen/RISCV/rvv/vadd-vp.ll

llvm/test/CodeGen/RISCV/rvv/vfabs-vp.ll

llvm/test/CodeGen/RISCV/rvv/vfma-vp.ll

llvm/test/CodeGen/RISCV/rvv/vfmuladd-vp.ll

llvm/test/CodeGen/RISCV/rvv/vfneg-vp.ll

[RISCV] Use branchless form for selects with 0 in either arm
ClosedPublic