This is an archive of the discontinued LLVM Phabricator instance.


Differential D144777

[InstCombine] Fold signbit test of a pow2 or zero
Closed · Public

Authored by junaire on Feb 24 2023, 10:28 PM.


Details

Reviewers

spatel
nikic
RKSimon
goldstein.w.n

Commits

rGf88436c3f3b0: [InstCombine] Fold signbit test of a pow2 or zero

Summary

Alive2: https://alive2.llvm.org/ce/z/_J5q3S
Closes: https://github.com/llvm/llvm-project/issues/60957

Signed-off-by: Jun Zhang <jun@junz.org>

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	60,090 ms	x64 debian > Clang.Driver::emit-reproducer.c
	60,590 ms	x64 debian > Clang.Driver::fsanitize.c
	80 ms	x64 debian > LLVM.Transforms/InstCombine::fold-signbit-test-power2.ll
	60,060 ms	x64 debian > libFuzzer.libFuzzer::fuzzer-leak.test
	60,050 ms	x64 debian > libFuzzer.libFuzzer::minimize_crash.test
		View Full Test Results (6 Failed)

Event Timeline

junaire created this revision. Feb 24 2023, 10:28 PM

Herald added a project: Restricted Project. Feb 24 2023, 10:28 PM

Herald added a subscriber: hiraditya.

junaire requested review of this revision. Feb 24 2023, 10:28 PM

Herald added a project: Restricted Project. Feb 24 2023, 10:28 PM

Herald added a subscriber: llvm-commits.

junaire added reviewers: spatel, nikic, RKSimon. Feb 24 2023, 10:30 PM

Herald added a subscriber: StephenFan. Feb 24 2023, 10:30 PM

goldstein.w.n added a subscriber: goldstein.w.n. Feb 24 2023, 11:02 PM

goldstein.w.n added inline comments.

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1343	There are other power of 2 patterns. Maybe use `isKnownPowerOf2`?

goldstein.w.n added inline comments. Feb 24 2023, 11:07 PM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1343	disregard, misunderstood the patch.

goldstein.w.n added inline comments. Feb 24 2023, 11:09 PM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1343	`m_Sub(m_Zero(), m_Value(X))` -> `m_Neg(m_Value(X))`? Also do you need `m_OneUse`? This shouldn't ever create more instructions.

Address comments.

junaire added inline comments. Feb 24 2023, 11:31 PM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp

1343

Also do you need m_OneUse? This shouldn't ever create more instructions.

I thought it was unnecessary at first, but then it caused a regression in smax_xor_pow2_neg,
so I added it.

@@ -137,7 +137,10 @@ define i8 @smax_xor_pow2_neg(i8 %x, i8 %y) {
 ; CHECK-NEXT:    [[CMP:%.*]] = icmp eq i8 [[Y:%.*]], -128
 ; CHECK-NEXT:    br i1 [[CMP]], label [[NEG:%.*]], label [[POS:%.*]]
 ; CHECK:       neg:
-; CHECK-NEXT:    [[R:%.*]] = and i8 [[X:%.*]], 127
+; CHECK-NEXT:    [[NY:%.*]] = sub i8 0, [[Y]]
+; CHECK-NEXT:    [[YP2:%.*]] = and i8 [[NY]], [[Y]]
+; CHECK-NEXT:    [[X_XOR:%.*]] = xor i8 [[YP2]], [[X:%.*]]
+; CHECK-NEXT:    [[R:%.*]] = call i8 @llvm.smax.i8(i8 [[X]], i8 [[X_XOR]])
 ; CHECK-NEXT:    ret i8 [[R]]
 ; CHECK:       pos:
 ; CHECK-NEXT:    call void @barrier()
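
Editorial note: the two forms of the `neg` block in the diff above compute the same value, because that block is only reached when `Y == -128`, where `Y & -Y` is the sign mask. The following is a minimal standalone sketch, not LLVM code, that checks this exhaustively for i8. The function names are hypothetical, and the narrowing casts assume two's-complement wraparound (guaranteed since C++20).

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Unfolded form of the `neg` block: smax(x, x ^ (y & -y)) with y == -128.
inline std::int8_t negBlockUnfolded(std::int8_t x) {
  const std::uint8_t yp2 = 0x80; // bit pattern of (y & -y) for y == -128
  const auto xXor =
      static_cast<std::int8_t>(static_cast<std::uint8_t>(x) ^ yp2);
  return std::max(x, xXor); // signed maximum
}

// Folded form of the `neg` block: x & 127.
inline std::int8_t negBlockFolded(std::int8_t x) {
  return static_cast<std::int8_t>(static_cast<std::uint8_t>(x) & 0x7F);
}

// Counts the i8 inputs on which the two forms disagree.
inline int countNegBlockMismatches() {
  int mismatches = 0;
  for (int v = -128; v <= 127; ++v) {
    const auto x = static_cast<std::int8_t>(v);
    if (negBlockUnfolded(x) != negBlockFolded(x))
      ++mismatches;
  }
  return mismatches;
}
```

Calling `countNegBlockMismatches()` should return zero. The longer CHECK lines are therefore a missed simplification in that block rather than a change in behavior.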

junaire added a reviewer: goldstein.w.n. Feb 24 2023, 11:31 PM

Update the comment.

Harbormaster completed remote builds in B215901: Diff 500383. Feb 25 2023, 4:43 AM

Please can you add vector test coverage?

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1343	Use m_c_And (and add suitable test coverage)?

Address comments, thanks @RKSimon

In D144777#4153394, @RKSimon wrote:

Please can you add vector test coverage?

Oops, missed this comment, will do.

spatel added inline comments. Feb 26 2023, 6:38 AM

llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll
14	This isn't testing what you expected because the `and` operands will be commuted before they reach the transform in this patch. This would be easier to see if you pre-commit the tests with baseline results (no pre-commit Phab review is needed for that NFC change). grep for "thwart complexity-based canonicalization" in the test directory to see how to create a test that handles the commuted pattern.
36	See comment about commuting - same as above.

Add vector tests.

Harbormaster completed remote builds in B216056: Diff 500564. Feb 26 2023, 7:27 AM

Hi @spatel, thanks for your comments. I updated the tests according to your
suggestions. However, everything stopped folding after I applied div instructions to
each operand of the and instruction.

Can you take a look? Did I miss something, or is my fold pattern wrong?

Harbormaster completed remote builds in B216078: Diff 500587. Feb 26 2023, 9:13 AM

In D144777#4153685, @junaire wrote:

Hi @spatel, thanks for your comments. I updated the tests according to your
suggestions. However, everything stopped folding after I applied div instructions to
each operand of the and instruction.

Can you take a look? Did I miss something, or is my fold pattern wrong?

There shouldn't be any extra instruction between the `sub` (negate) and the `and`. You just need one extra binary-op instruction to create "%x", so it stays operand 0 of the `and`. You don't need any extra instructions if you want "%x" to be operand 1.

I think it would be better to add the transform inside of InstCombinerImpl::foldICmpAndConstant().

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1341	"CI" should be "MinSignedC" or something like that in these comments and the code to be more specific.
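
Editorial note: the operand-ordering point in the reply above can be pictured with a toy model of complexity-based canonicalization for commutative operators. This is a sketch under assumptions, not LLVM's implementation. The `Kind` categories and rank numbers are hypothetical, chosen only to reproduce the behavior visible in this review: a negate outranks a function argument, and another binary operator outranks a negate.

```cpp
#include <cassert>
#include <utility>

// Hypothetical operand categories for the illustration.
enum class Kind { Constant, Argument, Negate, OtherInstruction };

// ASSUMPTION: illustrative ranks; only their relative order matters here.
inline int complexity(Kind k) {
  switch (k) {
  case Kind::Constant:         return 1;
  case Kind::Argument:         return 3;
  case Kind::Negate:           return 4;
  case Kind::OtherInstruction: return 5;
  }
  return 0;
}

struct Operands {
  Kind op0;
  Kind op1;
};

// Puts the more complex operand first, as a canonicalizer for a
// commutative operator such as `and` might do.
inline Operands canonicalizeCommutative(Operands ops) {
  if (complexity(ops.op0) < complexity(ops.op1))
    std::swap(ops.op0, ops.op1);
  return ops;
}
```

Under this model, `and %x, %neg` with `%x` an argument is commuted so that the negate becomes operand 0. If `%x` is produced by one extra binary operator, such as the `mul` in the updated tests, it stays in operand 0 whichever way the test writes the operands.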

junaire mentioned this in rGcf491a165f23: Precommit test for D144777, NFC. Feb 26 2023, 8:25 PM

Rebase

In D144777#4154057, @spatel wrote:

In D144777#4153685, @junaire wrote:

Hi @spatel, thanks for your comments. I updated the tests according to your
suggestions. However, everything stopped folding after I applied div instructions to
each operand of the and instruction.

Can you take a look? Did I miss something, or is my fold pattern wrong?

There shouldn't be any extra instruction between the `sub` (negate) and the `and`. You just need one extra binary-op instruction to create "%x", so it stays operand 0 of the `and`. You don't need any extra instructions if you want "%x" to be operand 1.

I think it would be better to add the transform inside of InstCombinerImpl::foldICmpAndConstant().

Thanks for your comments, they are very helpful. I pushed an NFC change with the precommit test in https://github.com/llvm/llvm-project/commit/cf491a165f239abfa7ab9e707f5cbd1861a6cb20 and moved the fold pattern to the place you suggested.

Please take a look :)

Harbormaster completed remote builds in B216133: Diff 500655. Feb 26 2023, 9:20 PM

We should have 2 more tests: (1) extra use of the sub and (2) extra use of the and.
As noted earlier, this patch should not require a "m_OneUse" limitation. I realize that it looks like a regression for the existing test, but that should be ok when viewed globally: we are reducing to an equality compare, and GVN, CVP, or some other pass will reduce that to the optimal form. It's just lucky (or unlucky) that the min/max folds added with D144606 are able to reduce it all within InstCombine.

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1862	Formatting: Value *X;
llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll
3–4	It's easier to follow the test progression if we make the test names slightly more specific. This is a signbit test for "isNegative", so "pow2_or_zero_is_negative". To provide a little more coverage, you could change some of the tests to include variations of that like:

define i1 @pow2_or_zero_is_negative(i8 %x) {
  %negx = sub i8 0, %x
  %pow2_or_zero = and i8 %negx, %x
  %cmp = icmp ugt i8 %pow2_or_zero, 127
  ret i1 %cmp
}

That should already fold as expected without having to change anything in this patch, but it's good to show that we can handle it.
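
Editorial note: the comment above relies on `icmp ugt i8 %v, 127` being another spelling of a sign-bit test. The following standalone sketch models the four compare forms that appear in this review's tests (`slt 0`, `ugt 127`, `sgt -1`, `ult -128`) and checks each against `v < 0` for every i8 value. The names `Pred`, `evalICmp`, `isSignBitTest` and `classificationHolds` are hypothetical; this illustrates the idea and is not LLVM's `isSignBitCheck`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical subset of integer compare predicates.
enum class Pred { SLT, SGT, ULT, UGT };

// Evaluates `icmp <pred> i8 v, c`.
inline bool evalICmp(Pred p, std::int8_t v, std::int8_t c) {
  const auto uv = static_cast<std::uint8_t>(v);
  const auto uc = static_cast<std::uint8_t>(c);
  switch (p) {
  case Pred::SLT: return v < c;
  case Pred::SGT: return v > c;
  case Pred::ULT: return uv < uc;
  case Pred::UGT: return uv > uc;
  }
  return false;
}

// Returns true if (p, c) is one of the modeled sign-bit tests and reports
// whether the compare is true for negative or for non-negative inputs.
inline bool isSignBitTest(Pred p, std::int8_t c, bool &trueIfNegative) {
  switch (p) {
  case Pred::SLT: trueIfNegative = true;  return c == 0;
  case Pred::UGT: trueIfNegative = true;  return c == 127;
  case Pred::SGT: trueIfNegative = false; return c == -1;
  case Pred::ULT: trueIfNegative = false; return c == -128;
  }
  return false;
}

// Brute-force check that the classification agrees with the compare result.
inline bool classificationHolds(Pred p, std::int8_t c) {
  bool trueIfNegative = false;
  if (!isSignBitTest(p, c, trueIfNegative))
    return false;
  for (int i = -128; i <= 127; ++i) {
    const auto v = static_cast<std::int8_t>(i);
    if (evalICmp(p, v, c) != ((v < 0) == trueIfNegative))
      return false;
  }
  return true;
}
```

For example, `classificationHolds(Pred::UGT, 127)` should hold, which is why the `ugt` variant needs no extra code in the patch once the fold sits behind a general sign-bit check.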

spatel added inline comments. Feb 27 2023, 5:40 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1826	Re-use this logic (put the new code inside of here)?

Reuse existing code and add more tests.

I'm just a beginner with the middle-end, so bear with me if I get things wrong!

spatel added inline comments. Feb 27 2023, 7:01 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
1835–1836	It could go either way, but I prefer "C"-style notation (consistent with the comment above) for its compactness:

// (X & -X) < 0 --> X == MinSignedC
// (X & -X) > -1 --> X != MinSignedC
llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll
74–76	This is "is_not_negative"
122	Let's not duplicate everything for unsigned predicates. It's sufficient to change one or two of the above tests to confirm that we handle the various forms of signbit tests.
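
Editorial note: the two rewrite rules in the C-style comment above can be checked exhaustively at 8 bits. The following is a minimal standalone sketch with hypothetical function names, using wrapping negation to mirror `sub i8 0, %x`. It complements the Alive2 proof linked in the summary and does not replace it.

```cpp
#include <cassert>
#include <cstdint>

// Computes (x & -x) on i8 with wrapping negation. The result is zero or
// the lowest set bit of x.
inline std::int8_t pow2OrZero(std::int8_t x) {
  const auto ux = static_cast<std::uint8_t>(x);
  const auto neg = static_cast<std::uint8_t>(0u - ux); // wraps modulo 256
  return static_cast<std::int8_t>(ux & neg);
}

// Checks both rules for every i8 value:
//   (X & -X) < 0   <=>  X == MinSignedC
//   (X & -X) > -1  <=>  X != MinSignedC
inline bool foldHoldsForAllI8() {
  const int minSigned = -128;
  for (int i = -128; i <= 127; ++i) {
    const auto x = static_cast<std::int8_t>(i);
    const std::int8_t p = pow2OrZero(x);
    if ((p < 0) != (x == minSigned))
      return false;
    if ((p > -1) != (x != minSigned))
      return false;
  }
  return true;
}
```

The intuition is that `X & -X` isolates the lowest set bit of `X`, so the sign bit can only be set in the result when it is the only set bit, which is the minimum signed value.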

Harbormaster completed remote builds in B216211: Diff 500773. Feb 27 2023, 7:35 AM

Update tests.

Update tests

Make sure the comments are properly aligned!

Harbormaster completed remote builds in B216221: Diff 500788. Feb 27 2023, 8:32 AM

LGTM - thanks!
See inline comments for a small adjustment to the tests.

llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll
25	I'd rather not add this "is_not_negative / ult" instruction to this test. It is confusing given the test name.
56	I'd rather not add this "is_not_negative / ult" instruction to this test. It is confusing given the test name.

spatel accepted this revision. Feb 27 2023, 10:00 AM

This revision is now accepted and ready to land. Feb 27 2023, 10:00 AM

Update

This revision was landed with ongoing or failed builds. Feb 27 2023, 11:53 PM

Closed by commit rGf88436c3f3b0: [InstCombine] Fold signbit test of a pow2 or zero (authored by junaire).

This revision was automatically updated to reflect the committed changes.

junaire added a commit: rGf88436c3f3b0: [InstCombine] Fold signbit test of a pow2 or zero.

Thanks to everyone who helped me review the patch! My first contribution to the optimization world, nice!

Harbormaster completed remote builds in B216404: Diff 501041. Feb 28 2023, 12:41 AM

Revision Contents

	Path	Size
	llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp	9 lines
	llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll	112 lines
	llvm/test/Transforms/InstCombine/minmax-of-xor-x.ll	9 lines

Diff 500788

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp


Instruction *InstCombinerImpl::foldICmpWithConstant(ICmpInst &Cmp) {
Value *A, *B;		Value *A, *B;
ConstantInt *CI, *CI2; // I = icmp ugt (add (add A, B), CI2), CI		ConstantInt *CI, *CI2; // I = icmp ugt (add (add A, B), CI2), CI
if (Pred == ICmpInst::ICMP_UGT && match(Op1, m_ConstantInt(CI)) &&		if (Pred == ICmpInst::ICMP_UGT && match(Op1, m_ConstantInt(CI)) &&
match(Op0, m_Add(m_Add(m_Value(A), m_Value(B)), m_ConstantInt(CI2))))		match(Op0, m_Add(m_Add(m_Value(A), m_Value(B)), m_ConstantInt(CI2))))
if (Instruction *Res = processUGT_ADDCST_ADD(Cmp, A, B, CI2, CI, *this))		if (Instruction *Res = processUGT_ADDCST_ADD(Cmp, A, B, CI2, CI, *this))
return Res;		return Res;

// icmp(phi(C1, C2, ...), C) -> phi(icmp(C1, C), icmp(C2, C), ...).		// icmp(phi(C1, C2, ...), C) -> phi(icmp(C1, C), icmp(C2, C), ...).
Constant *C = dyn_cast<Constant>(Op1);		Constant *C = dyn_cast<Constant>(Op1);
if (!C)		if (!C)
return nullptr;		return nullptr;

if (auto *Phi = dyn_cast<PHINode>(Op0))		if (auto *Phi = dyn_cast<PHINode>(Op0))
if (all_of(Phi->operands(), [](Value *V) { return isa<Constant>(V); })) {		if (all_of(Phi->operands(), [](Value *V) { return isa<Constant>(V); })) {
Type *Ty = Cmp.getType();		Type *Ty = Cmp.getType();
Builder.SetInsertPoint(Phi);		Builder.SetInsertPoint(Phi);
PHINode *NewPhi =		PHINode *NewPhi =
Builder.CreatePHI(Ty, Phi->getNumOperands());		Builder.CreatePHI(Ty, Phi->getNumOperands());
for (BasicBlock *Predecessor : predecessors(Phi->getParent())) {		for (BasicBlock *Predecessor : predecessors(Phi->getParent())) {
Instruction *InstCombinerImpl::foldICmpAndConstant(ICmpInst &Cmp,		Instruction *InstCombinerImpl::foldICmpAndConstant(ICmpInst &Cmp,
BinaryOperator *And,		BinaryOperator *And,
const APInt &C) {		const APInt &C) {
if (Instruction *I = foldICmpAndConstConst(Cmp, And, C))		if (Instruction *I = foldICmpAndConstConst(Cmp, And, C))
return I;		return I;

const ICmpInst::Predicate Pred = Cmp.getPredicate();		const ICmpInst::Predicate Pred = Cmp.getPredicate();
bool TrueIfNeg;		bool TrueIfNeg;
if (isSignBitCheck(Pred, C, TrueIfNeg)) {		if (isSignBitCheck(Pred, C, TrueIfNeg)) {
// ((X - 1) & ~X) < 0 --> X == 0		// ((X - 1) & ~X) < 0 --> X == 0
// ((X - 1) & ~X) >= 0 --> X != 0		// ((X - 1) & ~X) >= 0 --> X != 0
Value *X;		Value *X;
if (match(And->getOperand(0), m_Add(m_Value(X), m_AllOnes())) &&		if (match(And->getOperand(0), m_Add(m_Value(X), m_AllOnes())) &&
match(And->getOperand(1), m_Not(m_Specific(X)))) {		match(And->getOperand(1), m_Not(m_Specific(X)))) {
auto NewPred = TrueIfNeg ? CmpInst::ICMP_EQ : CmpInst::ICMP_NE;		auto NewPred = TrueIfNeg ? CmpInst::ICMP_EQ : CmpInst::ICMP_NE;
return new ICmpInst(NewPred, X, ConstantInt::getNullValue(X->getType()));		return new ICmpInst(NewPred, X, ConstantInt::getNullValue(X->getType()));
}		}
		// (X & -X) < 0 --> X == MinSignedC
		// (X & -X) > -1 --> X != MinSignedC
		if (match(And, m_c_And(m_Neg(m_Value(X)), m_Deferred(X)))) {
		Constant *MinSignedC = ConstantInt::get(
		X->getType(),
		APInt::getSignedMinValue(X->getType()->getScalarSizeInBits()));
		auto NewPred = TrueIfNeg ? CmpInst::ICMP_EQ : CmpInst::ICMP_NE;
		return new ICmpInst(NewPred, X, MinSignedC);
		}
}		}

// TODO: These all require that Y is constant too, so refactor with the above.		// TODO: These all require that Y is constant too, so refactor with the above.

// Try to optimize things like "A[i] & 42 == 0" to index computations.		// Try to optimize things like "A[i] & 42 == 0" to index computations.
Value *X = And->getOperand(0);		Value *X = And->getOperand(0);
Value *Y = And->getOperand(1);		Value *Y = And->getOperand(1);
if (auto *C2 = dyn_cast<ConstantInt>(Y))		if (auto *C2 = dyn_cast<ConstantInt>(Y))
if (auto *LI = dyn_cast<LoadInst>(X))		if (auto *LI = dyn_cast<LoadInst>(X))
if (auto *GEP = dyn_cast<GetElementPtrInst>(LI->getOperand(0)))		if (auto *GEP = dyn_cast<GetElementPtrInst>(LI->getOperand(0)))
if (auto *GV = dyn_cast<GlobalVariable>(GEP->getOperand(0)))		if (auto *GV = dyn_cast<GlobalVariable>(GEP->getOperand(0)))
if (Instruction *Res =		if (Instruction *Res =
foldCmpLoadFromIndexedGlobal(LI, GEP, GV, Cmp, C2))		foldCmpLoadFromIndexedGlobal(LI, GEP, GV, Cmp, C2))
return Res;		return Res;

if (!Cmp.isEquality())		if (!Cmp.isEquality())
return nullptr;		return nullptr;

// X & -C == -C -> X > u ~C		// X & -C == -C -> X > u ~C
// X & -C != -C -> X <= u ~C		// X & -C != -C -> X <= u ~C
// iff C is a power of 2		// iff C is a power of 2
if (Cmp.getOperand(1) == Y && C.isNegatedPowerOf2()) {		if (Cmp.getOperand(1) == Y && C.isNegatedPowerOf2()) {
auto NewPred =		auto NewPred =
Pred == CmpInst::ICMP_EQ ? CmpInst::ICMP_UGT : CmpInst::ICMP_ULE;		Pred == CmpInst::ICMP_EQ ? CmpInst::ICMP_UGT : CmpInst::ICMP_ULE;
return new ICmpInst(NewPred, X, SubOne(cast<Constant>(Cmp.getOperand(1))));		return new ICmpInst(NewPred, X, SubOne(cast<Constant>(Cmp.getOperand(1))));
}		}


llvm/test/Transforms/InstCombine/fold-signbit-test-power2.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -passes=instcombine -S | FileCheck %s			; RUN: opt < %s -passes=instcombine -S | FileCheck %s

	; icmp slt (and X, -X), 0 --> icmp eq (X, MinSignC)			declare void @use(i8)
	define i1 @pow2_or_zero1(i8 %x) {			declare void @use_i1(i1)
	; CHECK-LABEL: @pow2_or_zero1(			declare void @use_i1_vec(<2 x i1>)
	; CHECK-NEXT: [[NEG:%.*]] = sub i8 0, [[X:%.*]]
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and i8 [[NEG]], [[X]]			; (X & -X) < 0 --> X == MinSignC
	; CHECK-NEXT: [[CMP:%.*]] = icmp slt i8 [[POW2_OR_ZERO]], 0			; (X & X) > -1 --> X != MinSignC

				define i1 @pow2_or_zero_is_negative(i8 %x) {
				; CHECK-LABEL: @pow2_or_zero_is_negative1(
				; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[X:%.*]], -128
				; CHECK-NEXT: [[CMP_2:%.*]] = icmp eq i8 [[X]], -128
				; CHECK-NEXT: call void @use_i1(i1 [[CMP_2]])
				; CHECK-NEXT: [[CMP_3:%.*]] = icmp ne i8 [[X]], -128
				; CHECK-NEXT: call void @use_i1(i1 [[CMP_3]])
	; CHECK-NEXT: ret i1 [[CMP]]			; CHECK-NEXT: ret i1 [[CMP]]
	;			;
	%neg = sub i8 0, %x			%neg = sub i8 0, %x
	%pow2_or_zero = and i8 %x, %neg			%pow2_or_zero = and i8 %x, %neg
	%cmp = icmp slt i8 %pow2_or_zero, 0			%cmp = icmp slt i8 %pow2_or_zero, 0
				%cmp.2 = icmp ugt i8 %pow2_or_zero, 127
				call void @use_i1(i1 %cmp.2)
				%cmp.3 = icmp ult i8 %pow2_or_zero, -128
				call void @use_i1(i1 %cmp.3)
	ret i1 %cmp			ret i1 %cmp
	}			}

	; icmp slt (and -X, X), 0 --> icmp eq (X, MinSignC)			define i1 @pow2_or_zero_is_negative_commute(i8 %A) {
	define i1 @pow2_or_zero1_commute(i8 %A) {			; CHECK-LABEL: @pow2_or_zero_is_negative1_commute(
	; CHECK-LABEL: @pow2_or_zero1_commute(			; CHECK-NEXT: [[X:%.*]] = mul i8 [[A:%.*]], 42
	; CHECK-NEXT: [[X:%.*]] = sdiv i8 42, [[A:%.*]]			; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[X]], -128
	; CHECK-NEXT: [[NEG:%.*]] = sub nsw i8 0, [[X]]
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and i8 [[X]], [[NEG]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp slt i8 [[POW2_OR_ZERO]], 0
	; CHECK-NEXT: ret i1 [[CMP]]			; CHECK-NEXT: ret i1 [[CMP]]
	;			;
	%x = sdiv i8 42, %A ; thwart complexity-based canonicalization			%x = mul i8 42, %A ; thwart complexity-based canonicalization
	%neg = sub i8 0, %x			%neg = sub i8 0, %x
	%pow2_or_zero = and i8 %neg, %x			%pow2_or_zero = and i8 %neg, %x
	%cmp = icmp slt i8 %pow2_or_zero, 0			%cmp = icmp slt i8 %pow2_or_zero, 0
	ret i1 %cmp			ret i1 %cmp
	}			}

	define <2 x i1> @pow2_or_zero1_vec(<2 x i8> %x) {			define <2 x i1> @pow2_or_zero_is_negative_vec(<2 x i8> %x) {
	; CHECK-LABEL: @pow2_or_zero1_vec(			; CHECK-LABEL: @pow2_or_zero_is_negative_vec1(
	; CHECK-NEXT: [[NEG:%.*]] = sub <2 x i8> zeroinitializer, [[X:%.*]]			; CHECK-NEXT: [[CMP:%.*]] = icmp eq <2 x i8> [[X:%.*]], <i8 -128, i8 -128>
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and <2 x i8> [[NEG]], [[X]]			; CHECK-NEXT: [[CMP_2:%.*]] = icmp eq <2 x i8> [[X]], <i8 -128, i8 -128>
	; CHECK-NEXT: [[CMP:%.*]] = icmp slt <2 x i8> [[POW2_OR_ZERO]], zeroinitializer			; CHECK-NEXT: call void @use_i1_vec(<2 x i1> [[CMP_2]])
				; CHECK-NEXT: call void @use_i1_vec(<2 x i1> [[CMP_2]])
	; CHECK-NEXT: ret <2 x i1> [[CMP]]			; CHECK-NEXT: ret <2 x i1> [[CMP]]
	;			;
	%neg = sub <2 x i8> <i8 0, i8 0>, %x			%neg = sub <2 x i8> <i8 0, i8 0>, %x
	%pow2_or_zero = and <2 x i8> %x, %neg			%pow2_or_zero = and <2 x i8> %x, %neg
	%cmp = icmp slt <2 x i8> %pow2_or_zero, <i8 0, i8 0>			%cmp = icmp slt <2 x i8> %pow2_or_zero, <i8 0, i8 0>
				%cmp.2 = icmp ugt <2 x i8> %pow2_or_zero, <i8 127, i8 127>
				call void @use_i1_vec(<2 x i1> %cmp.2)
				%cmp.3 = icmp ult <2 x i8> %pow2_or_zero, <i8 -128, i8 -128>
				call void @use_i1_vec(<2 x i1> %cmp.2)
	ret <2 x i1> %cmp			ret <2 x i1> %cmp
	}			}

				define <2 x i1> @pow2_or_zero_is_negative_vec_commute(<2 x i8> %A) {
	define <2 x i1> @pow2_or_zero1_vec_commute(<2 x i8> %A) {			; CHECK-LABEL: @pow2_or_zero_is_negative_vec1_commute(
	; CHECK-LABEL: @pow2_or_zero1_vec_commute(
	; CHECK-NEXT: [[X:%.*]] = mul <2 x i8> [[A:%.*]], <i8 42, i8 42>			; CHECK-NEXT: [[X:%.*]] = mul <2 x i8> [[A:%.*]], <i8 42, i8 42>
	; CHECK-NEXT: [[NEG:%.*]] = sub <2 x i8> zeroinitializer, [[X]]			; CHECK-NEXT: [[CMP:%.*]] = icmp eq <2 x i8> [[X]], <i8 -128, i8 -128>
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and <2 x i8> [[X]], [[NEG]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp slt <2 x i8> [[POW2_OR_ZERO]], zeroinitializer
	; CHECK-NEXT: ret <2 x i1> [[CMP]]			; CHECK-NEXT: ret <2 x i1> [[CMP]]
	;			;
	%x = mul <2 x i8> <i8 42, i8 42>, %A ; thwart complexity-based canonicalization			%x = mul <2 x i8> <i8 42, i8 42>, %A ; thwart complexity-based canonicalization
	%neg = sub <2 x i8> <i8 0, i8 0>, %x			%neg = sub <2 x i8> <i8 0, i8 0>, %x
	%pow2_or_zero = and <2 x i8> %neg, %x			%pow2_or_zero = and <2 x i8> %neg, %x
	%cmp = icmp slt <2 x i8> %pow2_or_zero, <i8 0, i8 0>			%cmp = icmp slt <2 x i8> %pow2_or_zero, <i8 0, i8 0>
	ret <2 x i1> %cmp			ret <2 x i1> %cmp
	}			}

	; icmp sgt (and X, -X), -1 --> icmp ne (X, MinSignC)			define i1 @pow2_or_zero_is_not_negative(i8 %x) {
	define i1 @pow2_or_zero2(i8 %x) {			; CHECK-LABEL: @pow2_or_zero_is_not_negative2(
	; CHECK-LABEL: @pow2_or_zero2(			; CHECK-NEXT: [[CMP:%.*]] = icmp ne i8 [[X:%.*]], -128
	; CHECK-NEXT: [[NEG:%.*]] = sub i8 0, [[X:%.*]]
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and i8 [[NEG]], [[X]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp sgt i8 [[POW2_OR_ZERO]], -1
	; CHECK-NEXT: ret i1 [[CMP]]			; CHECK-NEXT: ret i1 [[CMP]]
	;			;
	%neg = sub i8 0, %x			%neg = sub i8 0, %x
	%pow2_or_zero = and i8 %x, %neg			%pow2_or_zero = and i8 %x, %neg
	%cmp = icmp sgt i8 %pow2_or_zero, -1			%cmp = icmp sgt i8 %pow2_or_zero, -1
	ret i1 %cmp			ret i1 %cmp
	}			}

	; icmp sgt (and -X, X), -1 --> icmp ne (X, MinSignC)			define i1 @pow2_or_zero_is_not_negative_commute(i8 %A) {
	define i1 @pow2_or_zero2_commute(i8 %A) {			; CHECK-LABEL: @pow2_or_zero_is_not_negative2_commute(
	; CHECK-LABEL: @pow2_or_zero2_commute(
	; CHECK-NEXT: [[X:%.*]] = mul i8 [[A:%.*]], 42			; CHECK-NEXT: [[X:%.*]] = mul i8 [[A:%.*]], 42
	; CHECK-NEXT: [[NEG:%.*]] = sub i8 0, [[X]]			; CHECK-NEXT: [[CMP:%.*]] = icmp ne i8 [[X]], -128
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and i8 [[X]], [[NEG]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp sgt i8 [[POW2_OR_ZERO]], -1
	; CHECK-NEXT: ret i1 [[CMP]]			; CHECK-NEXT: ret i1 [[CMP]]
	;			;
	%x = mul i8 42, %A ; thwart complexity-based canonicalization			%x = mul i8 42, %A ; thwart complexity-based canonicalization
	%neg = sub i8 0, %x			%neg = sub i8 0, %x
	%pow2_or_zero = and i8 %neg, %x			%pow2_or_zero = and i8 %neg, %x
	%cmp = icmp sgt i8 %pow2_or_zero, -1			%cmp = icmp sgt i8 %pow2_or_zero, -1
	ret i1 %cmp			ret i1 %cmp
	}			}

	define <2 x i1> @pow2_or_zero2_vec(<2 x i8> %x) {			define <2 x i1> @pow2_or_zero_is_not_negative_vec(<2 x i8> %x) {
	; CHECK-LABEL: @pow2_or_zero2_vec(			; CHECK-LABEL: @pow2_or_zero_is_not_negative_vec2(
	; CHECK-NEXT: [[NEG:%.*]] = sub <2 x i8> zeroinitializer, [[X:%.*]]			; CHECK-NEXT: [[CMP:%.*]] = icmp ne <2 x i8> [[X:%.*]], <i8 -128, i8 -128>
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and <2 x i8> [[NEG]], [[X]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp sgt <2 x i8> [[POW2_OR_ZERO]], <i8 -1, i8 -1>
	; CHECK-NEXT: ret <2 x i1> [[CMP]]			; CHECK-NEXT: ret <2 x i1> [[CMP]]
	;			;
	%neg = sub <2 x i8> <i8 0, i8 0>, %x			%neg = sub <2 x i8> <i8 0, i8 0>, %x
	%pow2_or_zero = and <2 x i8> %x, %neg			%pow2_or_zero = and <2 x i8> %x, %neg
	%cmp = icmp sgt <2 x i8> %pow2_or_zero, <i8 -1, i8 -1>			%cmp = icmp sgt <2 x i8> %pow2_or_zero, <i8 -1, i8 -1>
	ret <2 x i1> %cmp			ret <2 x i1> %cmp
	}			}

	define <2 x i1> @pow2_or_zero2_vec_commute(<2 x i8> %A) {			define <2 x i1> @pow2_or_zero_is_not_negative_vec_commute(<2 x i8> %A) {
	; CHECK-LABEL: @pow2_or_zero2_vec_commute(			; CHECK-LABEL: @pow2_or_zero_is_not_negative_vec2_commute(
	; CHECK-NEXT: [[X:%.*]] = mul <2 x i8> [[A:%.*]], <i8 42, i8 42>			; CHECK-NEXT: [[X:%.*]] = mul <2 x i8> [[A:%.*]], <i8 42, i8 42>
	; CHECK-NEXT: [[NEG:%.*]] = sub <2 x i8> zeroinitializer, [[X]]			; CHECK-NEXT: [[CMP:%.*]] = icmp ne <2 x i8> [[X]], <i8 -128, i8 -128>
	; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and <2 x i8> [[X]], [[NEG]]
	; CHECK-NEXT: [[CMP:%.*]] = icmp sgt <2 x i8> [[POW2_OR_ZERO]], <i8 -1, i8 -1>
	; CHECK-NEXT: ret <2 x i1> [[CMP]]			; CHECK-NEXT: ret <2 x i1> [[CMP]]
	;			;
	%x = mul <2 x i8> <i8 42, i8 42>, %A ; thwart complexity-based canonicalization			%x = mul <2 x i8> <i8 42, i8 42>, %A ; thwart complexity-based canonicalization
	%neg = sub <2 x i8> <i8 0, i8 0>, %x			%neg = sub <2 x i8> <i8 0, i8 0>, %x
	%pow2_or_zero = and <2 x i8> %neg, %x			%pow2_or_zero = and <2 x i8> %neg, %x
	%cmp = icmp sgt <2 x i8> %pow2_or_zero, <i8 -1, i8 -1>			%cmp = icmp sgt <2 x i8> %pow2_or_zero, <i8 -1, i8 -1>
	ret <2 x i1> %cmp			ret <2 x i1> %cmp
	}			}

				define i1 @pow2_or_zero_is_negative_extra_use(i8 %x) {
				; CHECK-LABEL: @pow2_or_zero_is_negative3(
				; CHECK-NEXT: [[NEG:%.*]] = sub i8 0, [[X:%.*]]
				; CHECK-NEXT: call void @use(i8 [[NEG]])
				; CHECK-NEXT: [[POW2_OR_ZERO:%.*]] = and i8 [[NEG]], [[X]]
				; CHECK-NEXT: call void @use(i8 [[POW2_OR_ZERO]])
				; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[X]], -128
				; CHECK-NEXT: ret i1 [[CMP]]
				;
				%neg = sub i8 0, %x
				call void @use(i8 %neg)
				%pow2_or_zero = and i8 %x, %neg
				call void @use(i8 %pow2_or_zero)
				%cmp = icmp slt i8 %pow2_or_zero, 0
				ret i1 %cmp
				}

llvm/test/Transforms/InstCombine/minmax-of-xor-x.ll

;
%yp2 = and <2 x i8> %y, %ny		%yp2 = and <2 x i8> %y, %ny
%x_xor = xor <2 x i8> %x, %yp2		%x_xor = xor <2 x i8> %x, %yp2
%r = call <2 x i8> @llvm.smin.v2i8(<2 x i8> %x, <2 x i8> %x_xor)		%r = call <2 x i8> @llvm.smin.v2i8(<2 x i8> %x, <2 x i8> %x_xor)
ret <2 x i8> %r		ret <2 x i8> %r
}		}

define i8 @smax_xor_pow2_neg(i8 %x, i8 %y) {		define i8 @smax_xor_pow2_neg(i8 %x, i8 %y) {
; CHECK-LABEL: @smax_xor_pow2_neg(		; CHECK-LABEL: @smax_xor_pow2_neg(
; CHECK-NEXT: [[NY:%.*]] = sub i8 0, [[Y:%.*]]		; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[Y:%.*]], -128
; CHECK-NEXT: [[YP2:%.*]] = and i8 [[NY]], [[Y]]
; CHECK-NEXT: [[CMP:%.*]] = icmp slt i8 [[YP2]], 0
; CHECK-NEXT: br i1 [[CMP]], label [[NEG:%.*]], label [[POS:%.*]]		; CHECK-NEXT: br i1 [[CMP]], label [[NEG:%.*]], label [[POS:%.*]]
; CHECK: neg:		; CHECK: neg:
; CHECK-NEXT: [[R:%.*]] = and i8 [[X:%.*]], 127		; CHECK-NEXT: [[NY:%.*]] = sub i8 0, [[Y]]
		; CHECK-NEXT: [[YP2:%.*]] = and i8 [[NY]], [[Y]]
		; CHECK-NEXT: [[X_XOR:%.*]] = xor i8 [[YP2]], [[X:%.*]]
		; CHECK-NEXT: [[R:%.*]] = call i8 @llvm.smax.i8(i8 [[X]], i8 [[X_XOR]])
; CHECK-NEXT: ret i8 [[R]]		; CHECK-NEXT: ret i8 [[R]]
; CHECK: pos:		; CHECK: pos:
; CHECK-NEXT: call void @barrier()		; CHECK-NEXT: call void @barrier()
; CHECK-NEXT: ret i8 0		; CHECK-NEXT: ret i8 0
;		;
%ny = sub i8 0, %y		%ny = sub i8 0, %y
%yp2 = and i8 %y, %ny		%yp2 = and i8 %y, %ny
%cmp = icmp slt i8 %yp2, 0		%cmp = icmp slt i8 %yp2, 0