Download Raw Diff

Details

Reviewers

spatel
suyog
nlewycky
• rafael
dexonsmith

Commits

rG3a8c2c1e6cb2: This patch implements optimization as mentioned in PR19753: Optimize…
rL213678: This patch implements optimization as mentioned in PR19753: Optimize…

Summary

This patch rectifies icmp instruction combine problem seen because of previous optimization patch for 'ashr/lshr exact' as mentioned in bug 19958.
I admit i wrote the previous patch keeping only equality in mind (not considering all cases is bad practice, will be careful in future).

This patch for now handles 'icmp eq/ne' only for 'ashr/lsher exact'. Added relevant test cases.

For other icmp instructions, some observations:

define i1 @f(i32 %x) {

%y = lshr exact i32 8, %x
%cmp = icmp ult i32 %y, 2
ret i1 %cmp

}

Here, in above example, as %x increases, %y decreases and hence comparison with same icmp instruction was giving wrong output.

while in ,

define i1 @f(i32 %x) {

%y = lshr exact i32 -8, %x
%cmp = icmp ult i32 %y, -2
ret i1 %cmp

}

as %x increases, %y also increases mathematically. Hence comparison with same icmp instruction is fine in this case.

This was not captured in original patch and hence wrong output. This can be handled in following two ways:

if const1,const2 are both positive, then icmp instruction should be swapped (something like getSwappedPredicate()). if const1,const2 are both negative, then keep the icmp instruction as it is. Note : getInversePredicate() won't work.
if const1/const2 are both positive, swap the operands, keeping the icmp instruction as it is. No change if const1,const2 are both negative.

Note : Cases where const1 and const2 have opposite sign - one is positive other is negative are always true/false.

This is handled by simplifyicmp call and it never reaches our patch. I checked it with both ashr/lshr for this case.

I will verify if above two approaches for other icmp instructions are universally true for both ashr/lshr and come up with another patch.
Added a TODO for the same.

Please review this patch which handles equality for ashr/lshr exact.
Any comments/suggestions are most welcomed.

Thanks.

Suyog

Diff Detail

Repository: rL LLVM

Event Timeline

suyog updated this revision to Diff 10235.Jun 9 2014, 4:07 AM

suyog retitled this revision from to PR19958 wrong code at -O1 and above on x86_64-linux-gnu (InstCombine).

suyog updated this object.

suyog edited the test plan for this revision. (Show Details)

suyog added reviewers: • rafael, spatel, nlewycky.

suyog added a subscriber: Unknown Object (MLST).

Please check for typos: lhsr -> lshr

I'm not sure what LLVM test case policy is, but I think it would be good to have contra-test cases too. Eg, a test case that doesn't specify 'exact', a test case that isn't eq/ne - these would ensure that your code is not firing when it wasn't intended. Also, can you include a 'ne' test?

Corrected typo.

Added test cases for

exact ne
no exact
no ne/eq

Please help in reviewing if this looks good or any more test cases are required.

Thanks

Suyog

Minor nit: fear the 80-column police
// (icmp eq/ne (ashr exact const2, A), const1) -> icmp eq/ne A, Log2(const2/const1)

This line's indentation looks wrong too:
return new ICmpInst(I.getPredicate(), A,

Major nit:
unsigned shift = Quotient.logBase2();

What if Quotient isn't power of 2?

I think this leads to miscompiled code:
$ cat 19958.ll
@a = common global i32 0, align 4
define i1 @main() {
%a = load i32* @a, align 4
%shr = lshr exact i32 80, %a
%cmp = icmp eq i32 %shr, 41 ; NOTE: non-power-of-2 comparison
ret i1 %cmp
}
$ ./opt 19958.ll -S | ./llc | ./clang -x assembler - -o a.out ; ./a.out ; echo $?
0
$ ./opt -instcombine 19958.ll -S | ./llc | ./clang -x assembler - -o a.out ; ./a.out ; echo $?
1

I like the additional test cases, but they would be better if the constants were at the limits rather than just random numbers in the middle of the possible values.

Thanks for the review.

Updated patch to handle exact division and exact log2. We were handling exact division and exact log2 for ashr in instsimplify, and not for lshr.

Now we are handling for both. Updated test cases which includes constants to the extreme limit and not something random in between.
Also took care of 80 lines clang-format. :)

Please help in reviewing the patch.

Gentle Ping !!

Sorry for the delay. LGTM...but considering that versions of this patch have caused miscompiles twice and I'm pretty new here, I think we should have someone with more instcombine experience give final approval in case I've missed anything.

nicholas added a subscriber: nicholas.Jul 5 2014, 7:04 PM

nicholas added inline comments.

lib/Transforms/InstCombine/InstCombineCompares.cpp
2349 ↗	(On Diff #10383)	APInt::isMinValue() is faster than APInt::operator== even though it's less clear to read.
2350 ↗	(On Diff #10383)	APInt::sdivrem is as fast as a single sdiv or srem. Please use it then text the 'rem' results.
test/Transforms/InstCombine/icmp.ll
1427 ↗	(On Diff #10383)	The bugs from last time were due to icmp non-equality comparisons, right? Are those tests already in this code or should you be adding tests to make sure you don't miscompile those cases here?

Hi Duncan, Nick, Rafael, Sanjay,

I have modified the code to calculate shift by Log2(Const2) - Log2(Const1)
instead of Log2(Const2/Const1) to avoid expensive division operation as per Duncan's suggestion.
Also handled few more conditions for 0.
Added additional test cases as well.

(I am doing this for icmp eq/ne only for now,
other icmp instructions are TODO item as they need some verification)

Can you please help in reviewing this modified patch.

Your suggestions/comments are most awaited. :)

Thanks,
Suyog

Hi Duncan,

Thanks for your explanation. I have handled case for 'lshr' separately now in the patch attached.

I have combined the logic for ashr/lshr as well as exact/non-exact as most of the logic is same.
Wherever there are special cases, i have handled them separately. I have added the logic for the
cases which were handled by 'instsimplify' to keep the logical flow as discussed.

I have combined the logic, since the 'match' functions are expensive, same for the log,
and our whole point of calculating difference of log was to avoid expensive division operation.

Please help in reviewing the patch. Your comments/suggestions are most welcomed :)

Hi Duncan,

Thanks for your review.
Couldn't spot the miscompiles earlier because i didn't comment out
the 'simplify' call. Now i modified the code and tested almost every
combination.

I have modified the code as per your suggestions :

Made comments short and crisp as per your suggestion.
included lambda helper functions to return appropriate constant and icmp instruction.
included logic for both constants -1 for ashr as suggested (In my opinion if both constants are equal and not -1, then final icmp would be icmp eq/ne A, 0. The Predicate won't get inversed and will remain same. You had suggested using lambda function here to get inversed predicate. Correct me if my analysis is wrong.)
used 'bool IsAShr' to avoid recalculation.
removed LShrOperator block which was causing miscompiles.
removed unnecessary shl/lshr distinction for exact and non exact
moved all test cases to new file icmp-ashr.ll
used msb_high/low instead of +ve/-ve and opposite msb
tried including all combinations of ne/eq + exact/non-exact + lshr/ashr

Please help in reviewing the patch.

Your comments/suggestions are valuable and most awaited. :)

Thanks,
Suyog

Hi Duncan,

Made changes as per your suggestions :

Wrote the whole logic in separate function. Didn't make it 'static', took 'cue' from other 'Foldicmp' functions.
Removed TODO, will probably raise a separate PR for it.
Updated comments, removed extra braces.
defined variables just before their use.

Can you please see if this looks good to you?
Your comments/suggestions are most welcomed.

Thanks,
Suyog

Hi Duncan, Nick

As there is no guidance in the coding standards,
i looked at few files in InstCombine and comment style in them and
i found it better to include braces, wherever there is a comment inside 'if'.

if ( ) {
// comment
}

I removed/didn't add braces where there is no comment and a single statement block.

Please see if this looks good to you !!

Accepting this as per review in
http://comments.gmane.org/gmane.comp.compilers.llvm.cvs/191371

This revision is now accepted and ready to land.Jul 22 2014, 12:21 PM

Closed by commit rL213678 (authored by @suyog).

silvas mentioned this in D5518: Added InstCombine transformation for combining two instructions icmp ult/ule/uge/ugt (ashr/lshr (Const2) %A), (Const1).Oct 8 2020, 7:17 PM

Diff 11773

llvm/trunk/lib/Transforms/InstCombine/InstCombine.h

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	public:
Instruction *visitICmpInst(ICmpInst &I);		Instruction *visitICmpInst(ICmpInst &I);
Instruction *visitICmpInstWithCastAndCast(ICmpInst &ICI);		Instruction *visitICmpInstWithCastAndCast(ICmpInst &ICI);
Instruction visitICmpInstWithInstAndIntCst(ICmpInst &ICI, Instruction LHS,		Instruction visitICmpInstWithInstAndIntCst(ICmpInst &ICI, Instruction LHS,
ConstantInt *RHS);		ConstantInt *RHS);
Instruction FoldICmpDivCst(ICmpInst &ICI, BinaryOperator DivI,		Instruction FoldICmpDivCst(ICmpInst &ICI, BinaryOperator DivI,
ConstantInt *DivRHS);		ConstantInt *DivRHS);
Instruction FoldICmpShrCst(ICmpInst &ICI, BinaryOperator DivI,		Instruction FoldICmpShrCst(ICmpInst &ICI, BinaryOperator DivI,
ConstantInt *DivRHS);		ConstantInt *DivRHS);
		Instruction FoldICmpCstShrCst(ICmpInst &I, Value Op, Value *A,
		ConstantInt CI1, ConstantInt CI2);
Instruction FoldICmpAddOpCst(Instruction &ICI, Value X, ConstantInt *CI,		Instruction FoldICmpAddOpCst(Instruction &ICI, Value X, ConstantInt *CI,
ICmpInst::Predicate Pred);		ICmpInst::Predicate Pred);
Instruction FoldGEPICmp(GEPOperator GEPLHS, Value *RHS,		Instruction FoldGEPICmp(GEPOperator GEPLHS, Value *RHS,
ICmpInst::Predicate Cond, Instruction &I);		ICmpInst::Predicate Cond, Instruction &I);
Instruction FoldShiftByConstant(Value Op0, Constant *Op1,		Instruction FoldShiftByConstant(Value Op0, Constant *Op1,
BinaryOperator &I);		BinaryOperator &I);
Instruction *commonCastTransforms(CastInst &CI);		Instruction *commonCastTransforms(CastInst &CI);
Instruction *commonPointerCastTransforms(CastInst &CI);		Instruction *commonPointerCastTransforms(CastInst &CI);
▲ Show 20 Lines • Show All 222 Lines • Show Last 20 Lines

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

Show First 20 Lines • Show All 1,038 Lines • ▼ Show 20 Lines	if (Shr->hasOneUse()) {

Value *And = Builder->CreateAnd(Shr->getOperand(0),		Value *And = Builder->CreateAnd(Shr->getOperand(0),
Mask, Shr->getName()+".mask");		Mask, Shr->getName()+".mask");
return new ICmpInst(ICI.getPredicate(), And, ShiftedCmpRHS);		return new ICmpInst(ICI.getPredicate(), And, ShiftedCmpRHS);
}		}
return nullptr;		return nullptr;
}		}

		/// FoldICmpCstShrCst - Handle "(icmp eq/ne (ashr/lshr const2, A), const1)" ->
		/// (icmp eq/ne A, Log2(const2/const1)) ->
		/// (icmp eq/ne A, Log2(const2) - Log2(const1)).
		Instruction InstCombiner::FoldICmpCstShrCst(ICmpInst &I, Value Op, Value *A,
		ConstantInt *CI1,
		ConstantInt *CI2) {
		assert(I.isEquality() && "Cannot fold icmp gt/lt");

		auto getConstant = [&I, this](bool IsTrue) {
		if (I.getPredicate() == I.ICMP_NE)
		IsTrue = !IsTrue;
		return ReplaceInstUsesWith(I, ConstantInt::get(I.getType(), IsTrue));
		};

		auto getICmp = [&I](CmpInst::Predicate Pred, Value LHS, Value RHS) {
		if (I.getPredicate() == I.ICMP_NE)
		Pred = CmpInst::getInversePredicate(Pred);
		return new ICmpInst(Pred, LHS, RHS);
		};

		APInt AP1 = CI1->getValue();
		APInt AP2 = CI2->getValue();

		if (!AP1) {
		if (!AP2) {
		// Both Constants are 0.
		return getConstant(true);
		}

		if (cast<BinaryOperator>(Op)->isExact())
		return getConstant(false);

		if (AP2.isNegative()) {
		// MSB is set, so a lshr with a large enough 'A' would be undefined.
		return getConstant(false);
		}

		// 'A' must be large enough to shift out the highest set bit.
		return getICmp(I.ICMP_UGT, A,
		ConstantInt::get(A->getType(), AP2.logBase2()));
		}

		if (!AP2) {
		// Shifting 0 by any value gives 0.
		return getConstant(false);
		}

		bool IsAShr = isa<AShrOperator>(Op);
		if (AP1 == AP2) {
		if (AP1.isAllOnesValue() && IsAShr) {
		// Arithmatic shift of -1 is always -1.
		return getConstant(true);
		}
		return getICmp(I.ICMP_EQ, A, ConstantInt::getNullValue(A->getType()));
		}

		if (IsAShr) {
		if (AP1.isNegative() != AP2.isNegative()) {
		// Arithmetic shift will never change the sign.
		return getConstant(false);
		}
		// Both the constants are negative, take their positive to calculate
		// log.
		if (AP1.isNegative()) {
		AP1 = -AP1;
		AP2 = -AP2;
		}
		}

		if (AP1.ugt(AP2)) {
		// Right-shifting will not increase the value.
		return getConstant(false);
		}

		// Get the distance between the highest bit that's set.
		int Shift = AP2.logBase2() - AP1.logBase2();

		// Use lshr here, since we've canonicalized to +ve numbers.
		if (AP1 == AP2.lshr(Shift))
		return getICmp(I.ICMP_EQ, A, ConstantInt::get(A->getType(), Shift));

		// Shifting const2 will never be equal to const1.
		return getConstant(false);
		}

/// visitICmpInstWithInstAndIntCst - Handle "icmp (instr, intcst)".		/// visitICmpInstWithInstAndIntCst - Handle "icmp (instr, intcst)".
///		///
Instruction *InstCombiner::visitICmpInstWithInstAndIntCst(ICmpInst &ICI,		Instruction *InstCombiner::visitICmpInstWithInstAndIntCst(ICmpInst &ICI,
Instruction *LHSI,		Instruction *LHSI,
ConstantInt *RHS) {		ConstantInt *RHS) {
const APInt &RHSV = RHS->getValue();		const APInt &RHSV = RHS->getValue();

▲ Show 20 Lines • Show All 1,409 Lines • ▼ Show 20 Lines	case ICmpInst::ICMP_UGE:
return new ICmpInst(ICmpInst::ICMP_UGT, Op0,		return new ICmpInst(ICmpInst::ICMP_UGT, Op0,
Builder->getInt(CI->getValue()-1));		Builder->getInt(CI->getValue()-1));
case ICmpInst::ICMP_SGE:		case ICmpInst::ICMP_SGE:
assert(!CI->isMinValue(true)); // A >=s MIN -> TRUE		assert(!CI->isMinValue(true)); // A >=s MIN -> TRUE
return new ICmpInst(ICmpInst::ICMP_SGT, Op0,		return new ICmpInst(ICmpInst::ICMP_SGT, Op0,
Builder->getInt(CI->getValue()-1));		Builder->getInt(CI->getValue()-1));
}		}

		// (icmp eq/ne (ashr/lshr const2, A), const1)
		if (I.isEquality()) {
		ConstantInt *CI2;
		if (match(Op0, m_AShr(m_ConstantInt(CI2), m_Value(A))) \|\|
		match(Op0, m_LShr(m_ConstantInt(CI2), m_Value(A)))) {
		return FoldICmpCstShrCst(I, Op0, A, CI, CI2);
		}
		}

// If this comparison is a normal comparison, it demands all		// If this comparison is a normal comparison, it demands all
// bits, if it is a sign bit comparison, it only demands the sign bit.		// bits, if it is a sign bit comparison, it only demands the sign bit.
bool UnusedBit;		bool UnusedBit;
isSignBit = isSignBitCheck(I.getPredicate(), CI, UnusedBit);		isSignBit = isSignBitCheck(I.getPredicate(), CI, UnusedBit);
}		}

// See if we can fold the comparison based on range information we can get		// See if we can fold the comparison based on range information we can get
// by checking whether bits are known to be zero or one in the input.		// by checking whether bits are known to be zero or one in the input.
▲ Show 20 Lines • Show All 1,182 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/icmp-shr.ll

				; RUN: opt < %s -instcombine -S \| FileCheck %s

				target datalayout = "e-p:64:64:64-p1:16:16:16-p2:32:32:32-p3:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64"

				; CHECK-LABEL: @exact_lshr_eq_both_zero
				; CHECK-NEXT: ret i1 true
				define i1 @exact_lshr_eq_both_zero(i8 %a) {
				%shr = lshr exact i8 0, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_both_zero
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_eq_both_zero(i8 %a) {
				%shr = ashr exact i8 0, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_both_zero
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_lshr_eq_both_zero(i8 %a) {
				%shr = lshr i8 0, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_both_zero
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_ashr_eq_both_zero(i8 %a) {
				%shr = ashr i8 0, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_both_zero
				; CHECK-NEXT: ret i1 false
				define i1 @exact_lshr_ne_both_zero(i8 %a) {
				%shr = lshr exact i8 0, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_both_zero
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_ne_both_zero(i8 %a) {
				%shr = ashr exact i8 0, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_both_zero
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_lshr_ne_both_zero(i8 %a) {
				%shr = lshr i8 0, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_both_zero
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_ashr_ne_both_zero(i8 %a) {
				%shr = ashr i8 0, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_last_zero
				; CHECK-NEXT: ret i1 false
				define i1 @exact_lshr_eq_last_zero(i8 %a) {
				%shr = lshr exact i8 128, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_last_zero
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_eq_last_zero(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_last_zero
				; CHECK-NEXT: ret i1 true
				define i1 @exact_lshr_ne_last_zero(i8 %a) {
				%shr = lshr exact i8 128, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_last_zero
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_ne_last_zero(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_last_zero
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_lshr_eq_last_zero(i8 %a) {
				%shr = lshr i8 128, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_last_zero
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_ashr_eq_last_zero(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_last_zero
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_lshr_ne_last_zero(i8 %a) {
				%shr = lshr i8 128, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_last_zero
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_ashr_ne_last_zero(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_eq_msb_low_last_zero
				; CHECK-NEXT: icmp ugt i8 %a, 6
				define i1 @lshr_eq_msb_low_last_zero(i8 %a) {
				%shr = lshr i8 127, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_eq_msb_low_second_zero
				; CHECK-NEXT: icmp ugt i8 %a, 6
				define i1 @ashr_eq_msb_low_second_zero(i8 %a) {
				%shr = ashr i8 127, %a
				%cmp = icmp eq i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_ne_msb_low_last_zero
				; CHECK-NEXT: icmp ult i8 %a, 7
				define i1 @lshr_ne_msb_low_last_zero(i8 %a) {
				%shr = lshr i8 127, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_ne_msb_low_second_zero
				; CHECK-NEXT: icmp ult i8 %a, 7
				define i1 @ashr_ne_msb_low_second_zero(i8 %a) {
				%shr = ashr i8 127, %a
				%cmp = icmp ne i8 %shr, 0
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_eq_first_zero
				; CHECK-NEXT: ret i1 false
				define i1 @lshr_eq_first_zero(i8 %a) {
				%shr = lshr i8 0, %a
				%cmp = icmp eq i8 %shr, 2
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_eq_first_zero
				; CHECK-NEXT: ret i1 false
				define i1 @ashr_eq_first_zero(i8 %a) {
				%shr = ashr i8 0, %a
				%cmp = icmp eq i8 %shr, 2
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_ne_first_zero
				; CHECK-NEXT: ret i1 true
				define i1 @lshr_ne_first_zero(i8 %a) {
				%shr = lshr i8 0, %a
				%cmp = icmp ne i8 %shr, 2
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_ne_first_zero
				; CHECK-NEXT: ret i1 true
				define i1 @ashr_ne_first_zero(i8 %a) {
				%shr = ashr i8 0, %a
				%cmp = icmp ne i8 %shr, 2
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_eq_both_minus1
				; CHECK-NEXT: ret i1 true
				define i1 @ashr_eq_both_minus1(i8 %a) {
				%shr = ashr i8 -1, %a
				%cmp = icmp eq i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_ne_both_minus1
				; CHECK-NEXT: ret i1 false
				define i1 @ashr_ne_both_minus1(i8 %a) {
				%shr = ashr i8 -1, %a
				%cmp = icmp ne i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_both_minus1
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_eq_both_minus1(i8 %a) {
				%shr = ashr exact i8 -1, %a
				%cmp = icmp eq i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_both_minus1
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_ne_both_minus1(i8 %a) {
				%shr = ashr exact i8 -1, %a
				%cmp = icmp ne i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_eq_both_equal
				; CHECK-NEXT: icmp eq i8 %a, 0
				define i1 @ashr_eq_both_equal(i8 %a) {
				%shr = ashr i8 128, %a
				%cmp = icmp eq i8 %shr, 128
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_ne_both_equal
				; CHECK-NEXT: icmp ne i8 %a, 0
				define i1 @ashr_ne_both_equal(i8 %a) {
				%shr = ashr i8 128, %a
				%cmp = icmp ne i8 %shr, 128
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_eq_both_equal
				; CHECK-NEXT: icmp eq i8 %a, 0
				define i1 @lshr_eq_both_equal(i8 %a) {
				%shr = lshr i8 127, %a
				%cmp = icmp eq i8 %shr, 127
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_ne_both_equal
				; CHECK-NEXT: icmp ne i8 %a, 0
				define i1 @lshr_ne_both_equal(i8 %a) {
				%shr = lshr i8 127, %a
				%cmp = icmp ne i8 %shr, 127
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_both_equal
				; CHECK-NEXT: icmp eq i8 %a, 0
				define i1 @exact_ashr_eq_both_equal(i8 %a) {
				%shr = ashr exact i8 128, %a
				%cmp = icmp eq i8 %shr, 128
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_both_equal
				; CHECK-NEXT: icmp ne i8 %a, 0
				define i1 @exact_ashr_ne_both_equal(i8 %a) {
				%shr = ashr exact i8 128, %a
				%cmp = icmp ne i8 %shr, 128
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_both_equal
				; CHECK-NEXT: icmp eq i8 %a, 0
				define i1 @exact_lshr_eq_both_equal(i8 %a) {
				%shr = lshr exact i8 126, %a
				%cmp = icmp eq i8 %shr, 126
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_both_equal
				; CHECK-NEXT: icmp ne i8 %a, 0
				define i1 @exact_lshr_ne_both_equal(i8 %a) {
				%shr = lshr exact i8 126, %a
				%cmp = icmp ne i8 %shr, 126
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_opposite_msb
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_eq_opposite_msb(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_eq_opposite_msb
				; CHECK-NEXT: ret i1 false
				define i1 @ashr_eq_opposite_msb(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_opposite_msb
				; CHECK-NEXT: icmp eq i8 %a, 7
				define i1 @exact_lshr_eq_opposite_msb(i8 %a) {
				%shr = lshr exact i8 -128, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_eq_opposite_msb
				; CHECK-NEXT: icmp eq i8 %a, 7
				define i1 @lshr_eq_opposite_msb(i8 %a) {
				%shr = lshr i8 -128, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_opposite_msb
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_ne_opposite_msb(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @ashr_ne_opposite_msb
				; CHECK-NEXT: ret i1 true
				define i1 @ashr_ne_opposite_msb(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_opposite_msb
				; CHECK-NEXT: icmp ne i8 %a, 7
				define i1 @exact_lshr_ne_opposite_msb(i8 %a) {
				%shr = lshr exact i8 -128, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @lshr_ne_opposite_msb
				; CHECK-NEXT: icmp ne i8 %a, 7
				define i1 @lshr_ne_opposite_msb(i8 %a) {
				%shr = lshr i8 -128, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_shift_gt
				; CHECK-NEXT : ret i1 false
				define i1 @exact_ashr_eq_shift_gt(i8 %a) {
				%shr = ashr exact i8 -2, %a
				%cmp = icmp eq i8 %shr, -8
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_shift_gt
				; CHECK-NEXT : ret i1 true
				define i1 @exact_ashr_ne_shift_gt(i8 %a) {
				%shr = ashr exact i8 -2, %a
				%cmp = icmp ne i8 %shr, -8
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_shift_gt
				; CHECK-NEXT : ret i1 false
				define i1 @nonexact_ashr_eq_shift_gt(i8 %a) {
				%shr = ashr i8 -2, %a
				%cmp = icmp eq i8 %shr, -8
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_shift_gt
				; CHECK-NEXT : ret i1 true
				define i1 @nonexact_ashr_ne_shift_gt(i8 %a) {
				%shr = ashr i8 -2, %a
				%cmp = icmp ne i8 %shr, -8
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_shift_gt
				; CHECK-NEXT: ret i1 false
				define i1 @exact_lshr_eq_shift_gt(i8 %a) {
				%shr = lshr exact i8 2, %a
				%cmp = icmp eq i8 %shr, 8
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_shift_gt
				; CHECK-NEXT: ret i1 true
				define i1 @exact_lshr_ne_shift_gt(i8 %a) {
				%shr = lshr exact i8 2, %a
				%cmp = icmp ne i8 %shr, 8
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_shift_gt
				; CHECK-NEXT : ret i1 false
				define i1 @nonexact_lshr_eq_shift_gt(i8 %a) {
				%shr = lshr i8 2, %a
				%cmp = icmp eq i8 %shr, 8
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_shift_gt
				; CHECK-NEXT : ret i1 true
				define i1 @nonexact_lshr_ne_shift_gt(i8 %a) {
				%shr = ashr i8 2, %a
				%cmp = icmp ne i8 %shr, 8
				ret i1 %cmp
				}



				; CHECK-LABEL: @exact_ashr_eq
				; CHECK-NEXT: icmp eq i8 %a, 7
				define i1 @exact_ashr_eq(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp eq i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne
				; CHECK-NEXT: icmp ne i8 %a, 7
				define i1 @exact_ashr_ne(i8 %a) {
				%shr = ashr exact i8 -128, %a
				%cmp = icmp ne i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq
				; CHECK-NEXT: icmp eq i8 %a, 2
				define i1 @exact_lshr_eq(i8 %a) {
				%shr = lshr exact i8 4, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne
				; CHECK-NEXT: icmp ne i8 %a, 2
				define i1 @exact_lshr_ne(i8 %a) {
				%shr = lshr exact i8 4, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq
				; CHECK-NEXT: icmp eq i8 %a, 7
				define i1 @nonexact_ashr_eq(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp eq i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne
				; CHECK-NEXT: icmp ne i8 %a, 7
				define i1 @nonexact_ashr_ne(i8 %a) {
				%shr = ashr i8 -128, %a
				%cmp = icmp ne i8 %shr, -1
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq
				; CHECK-NEXT: icmp eq i8 %a, 2
				define i1 @nonexact_lshr_eq(i8 %a) {
				%shr = lshr i8 4, %a
				%cmp = icmp eq i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne
				; CHECK-NEXT: icmp ne i8 %a, 2
				define i1 @nonexact_lshr_ne(i8 %a) {
				%shr = lshr i8 4, %a
				%cmp = icmp ne i8 %shr, 1
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_exactdiv
				; CHECK-NEXT: icmp eq i8 %a, 4
				define i1 @exact_lshr_eq_exactdiv(i8 %a) {
				%shr = lshr exact i8 80, %a
				%cmp = icmp eq i8 %shr, 5
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_exactdiv
				; CHECK-NEXT: icmp ne i8 %a, 4
				define i1 @exact_lshr_ne_exactdiv(i8 %a) {
				%shr = lshr exact i8 80, %a
				%cmp = icmp ne i8 %shr, 5
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_exactdiv
				; CHECK-NEXT: icmp eq i8 %a, 4
				define i1 @nonexact_lshr_eq_exactdiv(i8 %a) {
				%shr = lshr i8 80, %a
				%cmp = icmp eq i8 %shr, 5
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_exactdiv
				; CHECK-NEXT: icmp ne i8 %a, 4
				define i1 @nonexact_lshr_ne_exactdiv(i8 %a) {
				%shr = lshr i8 80, %a
				%cmp = icmp ne i8 %shr, 5
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_exactdiv
				; CHECK-NEXT: icmp eq i8 %a, 4
				define i1 @exact_ashr_eq_exactdiv(i8 %a) {
				%shr = ashr exact i8 -80, %a
				%cmp = icmp eq i8 %shr, -5
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_exactdiv
				; CHECK-NEXT: icmp ne i8 %a, 4
				define i1 @exact_ashr_ne_exactdiv(i8 %a) {
				%shr = ashr exact i8 -80, %a
				%cmp = icmp ne i8 %shr, -5
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_exactdiv
				; CHECK-NEXT: icmp eq i8 %a, 4
				define i1 @nonexact_ashr_eq_exactdiv(i8 %a) {
				%shr = ashr i8 -80, %a
				%cmp = icmp eq i8 %shr, -5
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_exactdiv
				; CHECK-NEXT: icmp ne i8 %a, 4
				define i1 @nonexact_ashr_ne_exactdiv(i8 %a) {
				%shr = ashr i8 -80, %a
				%cmp = icmp ne i8 %shr, -5
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_noexactdiv
				; CHECK-NEXT: ret i1 false
				define i1 @exact_lshr_eq_noexactdiv(i8 %a) {
				%shr = lshr exact i8 80, %a
				%cmp = icmp eq i8 %shr, 31
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_noexactdiv
				; CHECK-NEXT: ret i1 true
				define i1 @exact_lshr_ne_noexactdiv(i8 %a) {
				%shr = lshr exact i8 80, %a
				%cmp = icmp ne i8 %shr, 31
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_noexactdiv
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_lshr_eq_noexactdiv(i8 %a) {
				%shr = lshr i8 80, %a
				%cmp = icmp eq i8 %shr, 31
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_noexactdiv
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_lshr_ne_noexactdiv(i8 %a) {
				%shr = lshr i8 80, %a
				%cmp = icmp ne i8 %shr, 31
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_noexactdiv
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_eq_noexactdiv(i8 %a) {
				%shr = ashr exact i8 -80, %a
				%cmp = icmp eq i8 %shr, -31
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_noexactdiv
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_ne_noexactdiv(i8 %a) {
				%shr = ashr exact i8 -80, %a
				%cmp = icmp ne i8 %shr, -31
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_noexactdiv
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_ashr_eq_noexactdiv(i8 %a) {
				%shr = ashr i8 -80, %a
				%cmp = icmp eq i8 %shr, -31
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_noexactdiv
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_ashr_ne_noexactdiv(i8 %a) {
				%shr = ashr i8 -80, %a
				%cmp = icmp ne i8 %shr, -31
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_eq_noexactlog
				; CHECK-NEXT: ret i1 false
				define i1 @exact_lshr_eq_noexactlog(i8 %a) {
				%shr = lshr exact i8 90, %a
				%cmp = icmp eq i8 %shr, 30
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_lshr_ne_noexactlog
				; CHECK-NEXT: ret i1 true
				define i1 @exact_lshr_ne_noexactlog(i8 %a) {
				%shr = lshr exact i8 90, %a
				%cmp = icmp ne i8 %shr, 30
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_eq_noexactlog
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_lshr_eq_noexactlog(i8 %a) {
				%shr = lshr i8 90, %a
				%cmp = icmp eq i8 %shr, 30
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_lshr_ne_noexactlog
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_lshr_ne_noexactlog(i8 %a) {
				%shr = lshr i8 90, %a
				%cmp = icmp ne i8 %shr, 30
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_eq_noexactlog
				; CHECK-NEXT: ret i1 false
				define i1 @exact_ashr_eq_noexactlog(i8 %a) {
				%shr = ashr exact i8 -90, %a
				%cmp = icmp eq i8 %shr, -30
				ret i1 %cmp
				}

				; CHECK-LABEL: @exact_ashr_ne_noexactlog
				; CHECK-NEXT: ret i1 true
				define i1 @exact_ashr_ne_noexactlog(i8 %a) {
				%shr = ashr exact i8 -90, %a
				%cmp = icmp ne i8 %shr, -30
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_eq_noexactlog
				; CHECK-NEXT: ret i1 false
				define i1 @nonexact_ashr_eq_noexactlog(i8 %a) {
				%shr = ashr i8 -90, %a
				%cmp = icmp eq i8 %shr, -30
				ret i1 %cmp
				}

				; CHECK-LABEL: @nonexact_ashr_ne_noexactlog
				; CHECK-NEXT: ret i1 true
				define i1 @nonexact_ashr_ne_noexactlog(i8 %a) {
				%shr = ashr i8 -90, %a
				%cmp = icmp ne i8 %shr, -30
				ret i1 %cmp
				}

This is an archive of the discontinued LLVM Phabricator instance.

PR19958 wrong code at -O1 and above on x86_64-linux-gnu (InstCombine)
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 11773

llvm/trunk/lib/Transforms/InstCombine/InstCombine.h

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

llvm/trunk/test/Transforms/InstCombine/icmp-shr.ll

This is an archive of the discontinued LLVM Phabricator instance.

PR19958 wrong code at -O1 and above on x86_64-linux-gnu (InstCombine)ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 11773

llvm/trunk/lib/Transforms/InstCombine/InstCombine.h

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

llvm/trunk/test/Transforms/InstCombine/icmp-shr.ll

PR19958 wrong code at -O1 and above on x86_64-linux-gnu (InstCombine)
ClosedPublic