Download Raw Diff

Details

Reviewers

nikic
spatel
efriedma
xbolva00
RKSimon

Commits

rGfb38b7aab3f2: [InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow…
rL370347: [InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow…

Summary

(-1 u/ %x) u< %y is one of (3?) common ways to check that
some unsigned multiplication (will not) overflow.
Currently, we don't catch it. We could:

----------------------------------------
Name: no overflow
  %o0 = udiv i4 -1, %x
  %r = icmp ult i4 %o0, %y
=>
  %o0 = udiv i4 -1, %x
  %n0 = umul_overflow i4 %x, %y
  %r = extractvalue {i4, i1} %n0, 1

Done: 1
Optimization is correct!

----------------------------------------
Name: no overflow, swapped
  %o0 = udiv i4 -1, %x
  %r = icmp ugt i4 %y, %o0
=>
  %o0 = udiv i4 -1, %x
  %n0 = umul_overflow i4 %x, %y
  %r = extractvalue {i4, i1} %n0, 1

Done: 1
Optimization is correct!

----------------------------------------
Name: overflow
  %o0 = udiv i4 -1, %x
  %r = icmp uge i4 %o0, %y
=>
  %o0 = udiv i4 -1, %x
  %n0 = umul_overflow i4 %x, %y
  %n1 = extractvalue {i4, i1} %n0, 1
  %r = xor %n1, -1

Done: 1
Optimization is correct!

----------------------------------------
Name: overflow
  %o0 = udiv i4 -1, %x
  %r = icmp ule i4 %y, %o0
=>
  %o0 = udiv i4 -1, %x
  %n0 = umul_overflow i4 %x, %y
  %n1 = extractvalue {i4, i1} %n0, 1
  %r = xor %n1, -1

Done: 1
Optimization is correct!

As it can be observed from tests, while simply forming the @llvm.umul.with.overflow
is easy, if we were looking for the inverted answer, then more work needs to be done
to cleanup the now-pointless control-flow that was guarding against division-by-zero.
This is being addressed in follow-up patches.

Diff Detail

Repository: rL LLVM

Event Timeline

lebedev.ri created this revision.Jul 23 2019, 6:59 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 23 2019, 6:59 AM

lebedev.ri added a child revision: D65144: [InstCombine] Fold '((%x * %y) u/ %x) != %y' to '@llvm.umul.with.overflow' + overflow bit extraction.Jul 23 2019, 6:59 AM

xbolva00 added inline comments.Jul 23 2019, 7:14 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3333 ↗	(On Diff #211286)	Just ‘x’ instead of ‘%x’ in the comments?
3351 ↗	(On Diff #211286)	“.. whether overflow happens or not” ?
3363 ↗	(On Diff #211286)	Early return maybe better?

lebedev.ri mentioned this in D65147: [SimplifyCFG] FoldTwoEntryPHINode(): don't bailout on i1 PHI's if we can hoist a 'not' from incoming values.Jul 23 2019, 7:23 AM

lebedev.ri mentioned this in D65150: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` overflow bit.Jul 23 2019, 7:32 AM

lebedev.ri mentioned this in D65151: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` inverted overflow bit.

lebedev.ri added a child revision: D65147: [SimplifyCFG] FoldTwoEntryPHINode(): don't bailout on i1 PHI's if we can hoist a 'not' from incoming values.

lebedev.ri added inline comments.Jul 23 2019, 7:54 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3363 ↗	(On Diff #211286)	Then the diff in the very next patch (D65144) will have to re-indent, i wonder if this is more readable as-is..

NFC, improve comments.

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3351 ↗	(On Diff #211286)	Non-native english here, i think the current variant is more correct. We obviously are checking whether or not the overflow happens, so the suggested comment isn't wrong. But the question here is whether we are asking "if this overflows return 1 else return 0" or "if this overflows return 0 else return 1", and i wouldn't say i'd infer it from suggested wording.

xbolva00 added inline comments.Jul 23 2019, 8:46 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3363 ↗	(On Diff #211286)	Oh yea, +1

lebedev.ri marked 3 inline comments as done.Jul 23 2019, 8:47 AM

LGTM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3349 ↗	(On Diff #211305)	Side note: This seems pretty awkward and potentially bug prone. Having looked through the relatively few uses of this matcher, most cases either don't care (because they handle equalities) or would benefit from returning the swapped predicate. I think we should change the semantics to remove this potential footgun...
3370 ↗	(On Diff #211305)	Technically increases instruction count, but that seems justified here, especially as not's will usually be folded.

This revision is now accepted and ready to land.Jul 27 2019, 1:00 PM

Thank you for the review.

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3349 ↗	(On Diff #211305)	... i.e. you suggest that `m_c_ICmp()` should swap predicate as needed?
3370 ↗	(On Diff #211305)	Yes, i think this is a very rare case where this is ok.

NFC, added a comment about increasing instruction count.

nikic added inline comments.Jul 28 2019, 1:59 AM

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3349 ↗	(On Diff #211305)	Yes. `m_c_ICmp(Pred, X, Y)` should match `X Pred Y` or an equivalent commutative form, which is `Y SwappedPred X`, not `Y Pred X` (apart from the degenerate case of equality). At least that's what I would intuitively expect and what would be useful in practice (such as here).

lebedev.ri marked 3 inline comments as done.Jul 28 2019, 2:05 AM

lebedev.ri added inline comments.

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
3349 ↗	(On Diff #211305)	Got it. Looks like right now there is only 20 uses of `m_c_ICmp()` so it seems doable. Filed https://bugs.llvm.org/show_bug.cgi?id=42801

xbolva00 accepted this revision.Aug 11 2019, 11:30 AM

lebedev.ri added a parent revision: D66098: [CostModel] Model all `extractvalue`s as free..Aug 29 2019, 4:59 AM

Rebased, NFC.

Closed by commit rL370347: [InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow… (authored by lebedevri). · Explain WhyAug 29 2019, 5:46 AM

This revision was automatically updated to reflect the committed changes.

Diffusion mentioned this in rL370349: [SimplifyCFG] FoldTwoEntryPHINode(): don't bailout on i1 PHI's if we can hoist….

Diffusion mentioned this in rL370350: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with..Aug 29 2019, 5:52 AM

Diffusion mentioned this in rL370351: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with..

lebedev.ri mentioned this in rG9f35d2b56404: [SimplifyCFG] FoldTwoEntryPHINode(): don't bailout on i1 PHI's if we can hoist….Aug 29 2019, 5:52 AM

lebedev.ri mentioned this in rGaaf6ab4410b8: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with..

lebedev.ri mentioned this in rGc58478685410: [InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with..

spatel mentioned this in D131452: [InstCombine] avoid generating mul intrinsic that lowers as a libcall.Aug 9 2022, 5:35 AM

Diff 217850

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

Show First 20 Lines • Show All 3,519 Lines • ▼ Show 20 Lines	foldShiftIntoShiftInAnotherHandOfAndInICmp(ICmpInst &I, const SimplifyQuery SQ,
Value *T0 = XShiftOpcode == Instruction::BinaryOps::LShr		Value *T0 = XShiftOpcode == Instruction::BinaryOps::LShr
? Builder.CreateLShr(X, NewShAmt)		? Builder.CreateLShr(X, NewShAmt)
: Builder.CreateShl(X, NewShAmt);		: Builder.CreateShl(X, NewShAmt);
Value *T1 = Builder.CreateAnd(T0, Y);		Value *T1 = Builder.CreateAnd(T0, Y);
return Builder.CreateICmp(I.getPredicate(), T1,		return Builder.CreateICmp(I.getPredicate(), T1,
Constant::getNullValue(WidestTy));		Constant::getNullValue(WidestTy));
}		}

		/// Fold
		/// (-1 u/ x) u< y
		/// to
		/// @llvm.umul.with.overflow(x, y) plus extraction of overflow bit
		/// Note that the comparison is commutative, while inverted (u>=) predicate
		/// will mean that we are looking for the opposite answer.
		static Value *
		foldUnsignedMultiplicationOverflowCheck(ICmpInst &I,
		InstCombiner::BuilderTy &Builder) {
		ICmpInst::Predicate Pred;
		Value X, Y;
		bool NeedNegation;
		// Look for: (-1 u/ x) u</u>= y
		if (!I.isEquality() &&
		match(&I, m_c_ICmp(Pred, m_OneUse(m_UDiv(m_AllOnes(), m_Value(X))),
		m_Value(Y)))) {
		// Canonicalize as-if y was on RHS.
		if (I.getOperand(1) != Y)
		Pred = I.getSwappedPredicate();

		// Are we checking that overflow does not happen, or does happen?
		switch (Pred) {
		case ICmpInst::Predicate::ICMP_ULT:
		NeedNegation = false;
		break; // OK
		case ICmpInst::Predicate::ICMP_UGE:
		NeedNegation = true;
		break; // OK
		default:
		return nullptr; // Wrong predicate.
		}
		} else
		return nullptr;

		Function *F = Intrinsic::getDeclaration(
		I.getModule(), Intrinsic::umul_with_overflow, X->getType());
		CallInst *Call = Builder.CreateCall(F, {X, Y}, "umul");
		Value *Res = Builder.CreateExtractValue(Call, 1, "umul.ov");
		if (NeedNegation) // This technically increases instruction count.
		Res = Builder.CreateNot(Res, "umul.not.ov");

		return Res;
		}

/// Try to fold icmp (binop), X or icmp X, (binop).		/// Try to fold icmp (binop), X or icmp X, (binop).
/// TODO: A large part of this logic is duplicated in InstSimplify's		/// TODO: A large part of this logic is duplicated in InstSimplify's
/// simplifyICmpWithBinOp(). We should be able to share that and avoid the code		/// simplifyICmpWithBinOp(). We should be able to share that and avoid the code
/// duplication.		/// duplication.
Instruction *InstCombiner::foldICmpBinOp(ICmpInst &I) {		Instruction *InstCombiner::foldICmpBinOp(ICmpInst &I) {
Value Op0 = I.getOperand(0), Op1 = I.getOperand(1);		Value Op0 = I.getOperand(0), Op1 = I.getOperand(1);

// Special logic for binary operators.		// Special logic for binary operators.
▲ Show 20 Lines • Show All 333 Lines • ▼ Show 20 Lines	if (BO0) {
auto BitwiseAnd = m_c_And(m_Value(), LSubOne);		auto BitwiseAnd = m_c_And(m_Value(), LSubOne);

if (match(BO0, BitwiseAnd) && Pred == ICmpInst::ICMP_ULT) {		if (match(BO0, BitwiseAnd) && Pred == ICmpInst::ICMP_ULT) {
auto *Zero = Constant::getNullValue(BO0->getType());		auto *Zero = Constant::getNullValue(BO0->getType());
return new ICmpInst(ICmpInst::ICMP_NE, Op1, Zero);		return new ICmpInst(ICmpInst::ICMP_NE, Op1, Zero);
}		}
}		}

		if (Value *V = foldUnsignedMultiplicationOverflowCheck(I, Builder))
		return replaceInstUsesWith(I, V);

if (Value *V = foldICmpWithLowBitMaskedVal(I, Builder))		if (Value *V = foldICmpWithLowBitMaskedVal(I, Builder))
return replaceInstUsesWith(I, V);		return replaceInstUsesWith(I, V);

if (Value *V = foldICmpWithTruncSignExtendedVal(I, Builder))		if (Value *V = foldICmpWithTruncSignExtendedVal(I, Builder))
return replaceInstUsesWith(I, V);		return replaceInstUsesWith(I, V);

if (Value *V = foldShiftIntoShiftInAnotherHandOfAndInICmp(I, SQ, Builder))		if (Value *V = foldShiftIntoShiftInAnotherHandOfAndInICmp(I, SQ, Builder))
return replaceInstUsesWith(I, V);		return replaceInstUsesWith(I, V);
▲ Show 20 Lines • Show All 2,084 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-lack-of-overflow-check-via-udiv-of-allones.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt %s -instcombine -S \| FileCheck %s			; RUN: opt %s -instcombine -S \| FileCheck %s

	; Fold			; Fold
	; (-1 u/ %x) u>= %y			; (-1 u/ %x) u>= %y
	; to			; to
	; @llvm.umul.with.overflow(%x, %y) + extractvalue + not			; @llvm.umul.with.overflow(%x, %y) + extractvalue + not

	define i1 @t0_basic(i8 %x, i8 %y) {			define i1 @t0_basic(i8 %x, i8 %y) {
	; CHECK-LABEL: @t0_basic(			; CHECK-LABEL: @t0_basic(
	; CHECK-NEXT: [[T0:%.]] = udiv i8 -1, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { i8, i1 } @llvm.umul.with.overflow.i8(i8 [[X:%.]], i8 [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp uge i8 [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { i8, i1 } [[UMUL]], 1
	; CHECK-NEXT: ret i1 [[R]]			; CHECK-NEXT: [[UMUL_NOT_OV:%.*]] = xor i1 [[UMUL_OV]], true
				; CHECK-NEXT: ret i1 [[UMUL_NOT_OV]]
	;			;
	%t0 = udiv i8 -1, %x			%t0 = udiv i8 -1, %x
	%r = icmp uge i8 %t0, %y			%r = icmp uge i8 %t0, %y
	ret i1 %r			ret i1 %r
	}			}

	define <2 x i1> @t1_vec(<2 x i8> %x, <2 x i8> %y) {			define <2 x i1> @t1_vec(<2 x i8> %x, <2 x i8> %y) {
	; CHECK-LABEL: @t1_vec(			; CHECK-LABEL: @t1_vec(
	; CHECK-NEXT: [[T0:%.]] = udiv <2 x i8> <i8 -1, i8 -1>, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { <2 x i8>, <2 x i1> } @llvm.umul.with.overflow.v2i8(<2 x i8> [[X:%.]], <2 x i8> [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp uge <2 x i8> [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { <2 x i8>, <2 x i1> } [[UMUL]], 1
	; CHECK-NEXT: ret <2 x i1> [[R]]			; CHECK-NEXT: [[UMUL_NOT_OV:%.*]] = xor <2 x i1> [[UMUL_OV]], <i1 true, i1 true>
				; CHECK-NEXT: ret <2 x i1> [[UMUL_NOT_OV]]
	;			;
	%t0 = udiv <2 x i8> <i8 -1, i8 -1>, %x			%t0 = udiv <2 x i8> <i8 -1, i8 -1>, %x
	%r = icmp uge <2 x i8> %t0, %y			%r = icmp uge <2 x i8> %t0, %y
	ret <2 x i1> %r			ret <2 x i1> %r
	}			}

	define <3 x i1> @t2_vec_undef(<3 x i8> %x, <3 x i8> %y) {			define <3 x i1> @t2_vec_undef(<3 x i8> %x, <3 x i8> %y) {
	; CHECK-LABEL: @t2_vec_undef(			; CHECK-LABEL: @t2_vec_undef(
	; CHECK-NEXT: [[T0:%.]] = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { <3 x i8>, <3 x i1> } @llvm.umul.with.overflow.v3i8(<3 x i8> [[X:%.]], <3 x i8> [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp uge <3 x i8> [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { <3 x i8>, <3 x i1> } [[UMUL]], 1
	; CHECK-NEXT: ret <3 x i1> [[R]]			; CHECK-NEXT: [[UMUL_NOT_OV:%.*]] = xor <3 x i1> [[UMUL_OV]], <i1 true, i1 true, i1 true>
				; CHECK-NEXT: ret <3 x i1> [[UMUL_NOT_OV]]
	;			;
	%t0 = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, %x			%t0 = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, %x
	%r = icmp uge <3 x i8> %t0, %y			%r = icmp uge <3 x i8> %t0, %y
	ret <3 x i1> %r			ret <3 x i1> %r
	}			}

	declare i8 @gen8()			declare i8 @gen8()

	define i1 @t3_commutative(i8 %x) {			define i1 @t3_commutative(i8 %x) {
	; CHECK-LABEL: @t3_commutative(			; CHECK-LABEL: @t3_commutative(
	; CHECK-NEXT: [[T0:%.]] = udiv i8 -1, [[X:%.]]
	; CHECK-NEXT: [[Y:%.*]] = call i8 @gen8()			; CHECK-NEXT: [[Y:%.*]] = call i8 @gen8()
	; CHECK-NEXT: [[R:%.*]] = icmp ule i8 [[Y]], [[T0]]			; CHECK-NEXT: [[UMUL:%.]] = call { i8, i1 } @llvm.umul.with.overflow.i8(i8 [[X:%.]], i8 [[Y]])
	; CHECK-NEXT: ret i1 [[R]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { i8, i1 } [[UMUL]], 1
				; CHECK-NEXT: [[UMUL_NOT_OV:%.*]] = xor i1 [[UMUL_OV]], true
				; CHECK-NEXT: ret i1 [[UMUL_NOT_OV]]
	;			;
	%t0 = udiv i8 -1, %x			%t0 = udiv i8 -1, %x
	%y = call i8 @gen8()			%y = call i8 @gen8()
	%r = icmp ule i8 %y, %t0 ; swapped			%r = icmp ule i8 %y, %t0 ; swapped
	ret i1 %r			ret i1 %r
	}			}

	; Negative tests			; Negative tests
	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-overflow-check-via-udiv-of-allones.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt %s -instcombine -S \| FileCheck %s			; RUN: opt %s -instcombine -S \| FileCheck %s

	; Fold			; Fold
	; (-1 u/ %x) u< %y			; (-1 u/ %x) u< %y
	; to			; to
	; @llvm.umul.with.overflow(%x, %y) + extractvalue			; @llvm.umul.with.overflow(%x, %y) + extractvalue

	define i1 @t0_basic(i8 %x, i8 %y) {			define i1 @t0_basic(i8 %x, i8 %y) {
	; CHECK-LABEL: @t0_basic(			; CHECK-LABEL: @t0_basic(
	; CHECK-NEXT: [[T0:%.]] = udiv i8 -1, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { i8, i1 } @llvm.umul.with.overflow.i8(i8 [[X:%.]], i8 [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp ult i8 [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { i8, i1 } [[UMUL]], 1
	; CHECK-NEXT: ret i1 [[R]]			; CHECK-NEXT: ret i1 [[UMUL_OV]]
	;			;
	%t0 = udiv i8 -1, %x			%t0 = udiv i8 -1, %x
	%r = icmp ult i8 %t0, %y			%r = icmp ult i8 %t0, %y
	ret i1 %r			ret i1 %r
	}			}

	define <2 x i1> @t1_vec(<2 x i8> %x, <2 x i8> %y) {			define <2 x i1> @t1_vec(<2 x i8> %x, <2 x i8> %y) {
	; CHECK-LABEL: @t1_vec(			; CHECK-LABEL: @t1_vec(
	; CHECK-NEXT: [[T0:%.]] = udiv <2 x i8> <i8 -1, i8 -1>, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { <2 x i8>, <2 x i1> } @llvm.umul.with.overflow.v2i8(<2 x i8> [[X:%.]], <2 x i8> [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp ult <2 x i8> [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { <2 x i8>, <2 x i1> } [[UMUL]], 1
	; CHECK-NEXT: ret <2 x i1> [[R]]			; CHECK-NEXT: ret <2 x i1> [[UMUL_OV]]
	;			;
	%t0 = udiv <2 x i8> <i8 -1, i8 -1>, %x			%t0 = udiv <2 x i8> <i8 -1, i8 -1>, %x
	%r = icmp ult <2 x i8> %t0, %y			%r = icmp ult <2 x i8> %t0, %y
	ret <2 x i1> %r			ret <2 x i1> %r
	}			}

	define <3 x i1> @t2_vec_undef(<3 x i8> %x, <3 x i8> %y) {			define <3 x i1> @t2_vec_undef(<3 x i8> %x, <3 x i8> %y) {
	; CHECK-LABEL: @t2_vec_undef(			; CHECK-LABEL: @t2_vec_undef(
	; CHECK-NEXT: [[T0:%.]] = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, [[X:%.]]			; CHECK-NEXT: [[UMUL:%.]] = call { <3 x i8>, <3 x i1> } @llvm.umul.with.overflow.v3i8(<3 x i8> [[X:%.]], <3 x i8> [[Y:%.*]])
	; CHECK-NEXT: [[R:%.]] = icmp ult <3 x i8> [[T0]], [[Y:%.]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { <3 x i8>, <3 x i1> } [[UMUL]], 1
	; CHECK-NEXT: ret <3 x i1> [[R]]			; CHECK-NEXT: ret <3 x i1> [[UMUL_OV]]
	;			;
	%t0 = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, %x			%t0 = udiv <3 x i8> <i8 -1, i8 undef, i8 -1>, %x
	%r = icmp ult <3 x i8> %t0, %y			%r = icmp ult <3 x i8> %t0, %y
	ret <3 x i1> %r			ret <3 x i1> %r
	}			}

	declare i8 @gen8()			declare i8 @gen8()

	define i1 @t3_commutative(i8 %x) {			define i1 @t3_commutative(i8 %x) {
	; CHECK-LABEL: @t3_commutative(			; CHECK-LABEL: @t3_commutative(
	; CHECK-NEXT: [[T0:%.]] = udiv i8 -1, [[X:%.]]
	; CHECK-NEXT: [[Y:%.*]] = call i8 @gen8()			; CHECK-NEXT: [[Y:%.*]] = call i8 @gen8()
	; CHECK-NEXT: [[R:%.*]] = icmp ugt i8 [[Y]], [[T0]]			; CHECK-NEXT: [[UMUL:%.]] = call { i8, i1 } @llvm.umul.with.overflow.i8(i8 [[X:%.]], i8 [[Y]])
	; CHECK-NEXT: ret i1 [[R]]			; CHECK-NEXT: [[UMUL_OV:%.*]] = extractvalue { i8, i1 } [[UMUL]], 1
				; CHECK-NEXT: ret i1 [[UMUL_OV]]
	;			;
	%t0 = udiv i8 -1, %x			%t0 = udiv i8 -1, %x
	%y = call i8 @gen8()			%y = call i8 @gen8()
	%r = icmp ugt i8 %y, %t0 ; swapped			%r = icmp ugt i8 %y, %t0 ; swapped
	ret i1 %r			ret i1 %r
	}			}

	; Negative tests			; Negative tests
	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/PhaseOrdering/unsigned-multiply-overflow-check.ll

	Show All 12 Lines
	; #include <cstdint>			; #include <cstdint>
	;			;
	; using size_type = std::size_t;			; using size_type = std::size_t;
	; bool will_not_overflow(size_type size, size_type nmemb) {			; bool will_not_overflow(size_type size, size_type nmemb) {
	; return (size != 0 && (nmemb > std::numeric_limits<size_type>::max() / size));			; return (size != 0 && (nmemb > std::numeric_limits<size_type>::max() / size));
	; }			; }

	define i1 @will_not_overflow(i64 %arg, i64 %arg1) {			define i1 @will_not_overflow(i64 %arg, i64 %arg1) {
	; ALL-LABEL: @will_not_overflow(			; SIMPLIFYCFG-LABEL: @will_not_overflow(
	; ALL-NEXT: bb:			; SIMPLIFYCFG-NEXT: bb:
	; ALL-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0			; SIMPLIFYCFG-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0
	; ALL-NEXT: br i1 [[T0]], label [[BB5:%.]], label [[BB2:%.]]			; SIMPLIFYCFG-NEXT: br i1 [[T0]], label [[BB5:%.]], label [[BB2:%.]]
	; ALL: bb2:			; SIMPLIFYCFG: bb2:
	; ALL-NEXT: [[T3:%.*]] = udiv i64 -1, [[ARG]]			; SIMPLIFYCFG-NEXT: [[T3:%.*]] = udiv i64 -1, [[ARG]]
	; ALL-NEXT: [[T4:%.]] = icmp ult i64 [[T3]], [[ARG1:%.]]			; SIMPLIFYCFG-NEXT: [[T4:%.]] = icmp ult i64 [[T3]], [[ARG1:%.]]
	; ALL-NEXT: br label [[BB5]]			; SIMPLIFYCFG-NEXT: br label [[BB5]]
	; ALL: bb5:			; SIMPLIFYCFG: bb5:
	; ALL-NEXT: [[T6:%.]] = phi i1 [ false, [[BB:%.]] ], [ [[T4]], [[BB2]] ]			; SIMPLIFYCFG-NEXT: [[T6:%.]] = phi i1 [ false, [[BB:%.]] ], [ [[T4]], [[BB2]] ]
	; ALL-NEXT: ret i1 [[T6]]			; SIMPLIFYCFG-NEXT: ret i1 [[T6]]
				;
				; INSTCOMBINEONLY-LABEL: @will_not_overflow(
				; INSTCOMBINEONLY-NEXT: bb:
				; INSTCOMBINEONLY-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0
				; INSTCOMBINEONLY-NEXT: br i1 [[T0]], label [[BB5:%.]], label [[BB2:%.]]
				; INSTCOMBINEONLY: bb2:
				; INSTCOMBINEONLY-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
				; INSTCOMBINEONLY-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINEONLY-NEXT: br label [[BB5]]
				; INSTCOMBINEONLY: bb5:
				; INSTCOMBINEONLY-NEXT: [[T6:%.]] = phi i1 [ false, [[BB:%.]] ], [ [[UMUL_OV]], [[BB2]] ]
				; INSTCOMBINEONLY-NEXT: ret i1 [[T6]]
				;
				; INSTCOMBINESIMPLIFYCFGONLY-LABEL: @will_not_overflow(
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: bb:
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: [[T6:%.*]] = select i1 [[T0]], i1 false, i1 [[UMUL_OV]]
				; INSTCOMBINESIMPLIFYCFGONLY-NEXT: ret i1 [[T6]]
				;
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-LABEL: @will_not_overflow(
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: bb:
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: [[T0:%.]] = icmp ne i64 [[ARG:%.]], 0
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: [[T6:%.*]] = and i1 [[UMUL_OV]], [[T0]]
				; INSTCOMBINESIMPLIFYCFGINSTCOMBINE-NEXT: ret i1 [[T6]]
				;
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-LABEL: @will_not_overflow(
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: bb:
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: [[T6:%.*]] = select i1 [[T0]], i1 false, i1 [[UMUL_OV]]
				; INSTCOMBINESIMPLIFYCFGCOSTLYONLY-NEXT: ret i1 [[T6]]
				;
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-LABEL: @will_not_overflow(
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: bb:
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: [[T0:%.]] = icmp ne i64 [[ARG:%.]], 0
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: [[T6:%.*]] = and i1 [[UMUL_OV]], [[T0]]
				; INSTCOMBINESIMPLIFYCFGCOSTLYINSTCOMBINE-NEXT: ret i1 [[T6]]
	;			;
	bb:			bb:
	%t0 = icmp eq i64 %arg, 0			%t0 = icmp eq i64 %arg, 0
	br i1 %t0, label %bb5, label %bb2			br i1 %t0, label %bb5, label %bb2

	bb2: ; preds = %bb			bb2: ; preds = %bb
	%t3 = udiv i64 -1, %arg			%t3 = udiv i64 -1, %arg
	%t4 = icmp ult i64 %t3, %arg1			%t4 = icmp ult i64 %t3, %arg1
	Show All 20 Lines
	; SIMPLIFYCFG-NEXT: [[T7:%.*]] = xor i1 [[T6]], true			; SIMPLIFYCFG-NEXT: [[T7:%.*]] = xor i1 [[T6]], true
	; SIMPLIFYCFG-NEXT: ret i1 [[T7]]			; SIMPLIFYCFG-NEXT: ret i1 [[T7]]
	;			;
	; INSTCOMBINE-LABEL: @will_overflow(			; INSTCOMBINE-LABEL: @will_overflow(
	; INSTCOMBINE-NEXT: bb:			; INSTCOMBINE-NEXT: bb:
	; INSTCOMBINE-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0			; INSTCOMBINE-NEXT: [[T0:%.]] = icmp eq i64 [[ARG:%.]], 0
	; INSTCOMBINE-NEXT: br i1 [[T0]], label [[BB5:%.]], label [[BB2:%.]]			; INSTCOMBINE-NEXT: br i1 [[T0]], label [[BB5:%.]], label [[BB2:%.]]
	; INSTCOMBINE: bb2:			; INSTCOMBINE: bb2:
	; INSTCOMBINE-NEXT: [[T3:%.*]] = udiv i64 -1, [[ARG]]			; INSTCOMBINE-NEXT: [[UMUL:%.]] = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 [[ARG]], i64 [[ARG1:%.]])
	; INSTCOMBINE-NEXT: [[T4:%.]] = icmp uge i64 [[T3]], [[ARG1:%.]]			; INSTCOMBINE-NEXT: [[UMUL_OV:%.*]] = extractvalue { i64, i1 } [[UMUL]], 1
				; INSTCOMBINE-NEXT: [[PHITMP:%.*]] = xor i1 [[UMUL_OV]], true
	; INSTCOMBINE-NEXT: br label [[BB5]]			; INSTCOMBINE-NEXT: br label [[BB5]]
	; INSTCOMBINE: bb5:			; INSTCOMBINE: bb5:
	; INSTCOMBINE-NEXT: [[T6:%.]] = phi i1 [ true, [[BB:%.]] ], [ [[T4]], [[BB2]] ]			; INSTCOMBINE-NEXT: [[T6:%.]] = phi i1 [ true, [[BB:%.]] ], [ [[PHITMP]], [[BB2]] ]
	; INSTCOMBINE-NEXT: ret i1 [[T6]]			; INSTCOMBINE-NEXT: ret i1 [[T6]]
	;			;
	bb:			bb:
	%t0 = icmp eq i64 %arg, 0			%t0 = icmp eq i64 %arg, 0
	br i1 %t0, label %bb5, label %bb2			br i1 %t0, label %bb5, label %bb2

	bb2: ; preds = %bb			bb2: ; preds = %bb
	%t3 = udiv i64 -1, %arg			%t3 = udiv i64 -1, %arg
	%t4 = icmp ult i64 %t3, %arg1			%t4 = icmp ult i64 %t3, %arg1
	br label %bb5			br label %bb5

	bb5: ; preds = %bb2, %bb			bb5: ; preds = %bb2, %bb
	%t6 = phi i1 [ false, %bb ], [ %t4, %bb2 ]			%t6 = phi i1 [ false, %bb ], [ %t4, %bb2 ]
	%t7 = xor i1 %t6, true			%t7 = xor i1 %t6, true
	ret i1 %t7			ret i1 %t7
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow bit extraction
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 217850

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-lack-of-overflow-check-via-udiv-of-allones.ll

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-overflow-check-via-udiv-of-allones.ll

llvm/trunk/test/Transforms/PhaseOrdering/unsigned-multiply-overflow-check.ll

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow bit extractionClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 217850

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-lack-of-overflow-check-via-udiv-of-allones.ll

llvm/trunk/test/Transforms/InstCombine/unsigned-mul-overflow-check-via-udiv-of-allones.ll

llvm/trunk/test/Transforms/PhaseOrdering/unsigned-multiply-overflow-check.ll

[InstCombine] Fold '(-1 u/ %x) u< %y' to '@llvm.umul.with.overflow' + overflow bit extraction
ClosedPublic