This is an archive of the discontinued LLVM Phabricator instance.

[InstSimplify] don't let poison inhibit an easy fold
ClosedPublic

Authored by spatel on Oct 6 2017, 10:34 AM.

Download Raw Diff

Details

Reviewers

majnemer
efriedma
craig.topper
nlopes

Summary

D38591 offers one way to avoid the assert in PR34838, but we could just explicitly handle this pattern in InstSimplify to make life easier for InstCombine. This also avoids using computeKnownBits() if we don't have to and gets known bad code reduced faster, so we're not wasting time on it.

Diff Detail

Event Timeline

spatel created this revision.Oct 6 2017, 10:34 AM

Herald added a subscriber: mcrosier. · View Herald TranscriptOct 6 2017, 10:34 AM

spatel mentioned this in D38591: [InstCombine] don't assert that InstSimplify has removed a known true/false cmp (PR34838).Oct 6 2017, 10:37 AM

If I understand correctly, the reason computeKnownBits can't handle this is that it doesn't know what to do with a known poison value? We could just solve the issue in computeKnownBits: currently, it says there are no known bits when it detects a shift overflow, but it could just say, for example, that all the bits are known zero (since the result of computeKnownBits is only meaningful if the value isn't poison).

In D38637#890899, @efriedma wrote:

If I understand correctly, the reason computeKnownBits can't handle this is that it doesn't know what to do with a known poison value? We could just solve the issue in computeKnownBits: currently, it says there are no known bits when it detects a shift overflow, but it could just say, for example, that all the bits are known zero (since the result of computeKnownBits is only meaningful if the value isn't poison).

Ah, I thought that wasn't an option. I remember some bug report related to undef handling in computeKnownBits that made we think we have to be conservative, but I'm not locating it. We have these comments in computeKnownBitsFromShiftOperator():

// If there is conflict between Known.Zero and Known.One, this must be an
// overflowing left shift, so the shift result is undefined. Clear Known
// bits so that other code could propagate this undef.

...

// If the shift amount could be greater than or equal to the bit-width of the LHS, the
// value could be undef, so we don't know anything about it.

...

// If there are no compatible shift amounts, then we've proven that the shift
// amount must be >= the BitWidth, and the result is undefined. We could
// return anything we'd like, but we need to make sure the sets of known bits
// stay disjoint (it should be better for some other code to actually
// propagate the undef than to pick a value here using known bits).

Also, this is in the header comment for computeKnownBits():
/ NOTE: we cannot consider 'undef' to be "IsZero" here. The problem is that
/ we cannot optimize based on the assumption that it is zero without changing
/ it to be an explicit zero. If we don't change it to zero, other code could
/ optimized based on the contradictory assumption that it is non-zero.

So since we don't know what the caller will do with the result, we're conservative. Is it different if something is known to produce poison rather than undef?

The exact definition of poison is still getting refined, but it's different from undef. undef is a bit-wise property, which is why ComputeKnownBits has to be careful around it. poison works differently; essentially, any arithmetic or logical operation which has poison as an input produces poison, no matter what the other input is. So it doesn't matter what ComputeKnownBits returns for a known poison value.

In D38637#891147, @efriedma wrote:

The exact definition of poison is still getting refined, but it's different from undef. undef is a bit-wise property, which is why ComputeKnownBits has to be careful around it. poison works differently; essentially, any arithmetic or logical operation which has poison as an input produces poison, no matter what the other input is. So it doesn't matter what ComputeKnownBits returns for a known poison value.

Thanks! Then, it seems clear I can abandon the InstCombine fix, and I'll redo this one to work in value tracking directly.

Patch updated:
Have computeKnownBitsFromShiftOperator() return a zero constant when we discover a conflict in known bits. This allows InstSimplify to fold compares.

spatel added inline comments.Oct 7 2017, 8:55 AM

lib/Analysis/ValueTracking.cpp
822–824 ↗	(On Diff #118138)	Oops - this comment doesn't make sense. An overshift produces poison too. Removing this check would mean we're going to fall through to the expensive check below more often though. Do we want to do that or should I just fix the comment?

Patch updated:
Fix bogus comment about undef and add a TODO for a potential follow-up patch.

My only potential concern here is that we could end up blocking optimizations because we're folding to undef rather than zero... but that's probably rare enough that it doesn't matter. LGTM.

lib/Analysis/ValueTracking.cpp
822–824 ↗	(On Diff #118138)	IIRC the old version of this comment is just outdated; we recently adjusted LangRef to be a bit more aggressive with shifts because we have some transforms which depend on it being poison rather than undef. Probably worth investigating getting rid of this at some point; I expect there are some interesting shifts we could analyze.

This revision is now accepted and ready to land.Oct 11 2017, 7:11 PM

Closed with rL315595

spatel mentioned this in D40649: [InstCombine] Don't crash on out of bounds shifts.Nov 30 2017, 8:59 AM

Revision Contents

Path

Size

lib/

Analysis/

InstructionSimplify.cpp

22 lines

test/

Transforms/

InstSimplify/

icmp-constant.ll

26 lines

Diff 118034

lib/Analysis/InstructionSimplify.cpp

	Show First 20 Lines • Show All 2,399 Lines • ▼ Show 20 Lines
	}			}

	static Value simplifyICmpWithConstant(CmpInst::Predicate Pred, Value LHS,			static Value simplifyICmpWithConstant(CmpInst::Predicate Pred, Value LHS,
	Value *RHS) {			Value *RHS) {
	const APInt *C;			const APInt *C;
	if (!match(RHS, m_APInt(C)))			if (!match(RHS, m_APInt(C)))
	return nullptr;			return nullptr;

				Type *CmpTy = GetCompareTy(RHS);
	// Rule out tautological comparisons (eg., ult 0 or uge 0).			// Rule out tautological comparisons (eg., ult 0 or uge 0).
	ConstantRange RHS_CR = ConstantRange::makeExactICmpRegion(Pred, *C);			ConstantRange RHS_CR = ConstantRange::makeExactICmpRegion(Pred, *C);
	if (RHS_CR.isEmptySet())			if (RHS_CR.isEmptySet())
	return ConstantInt::getFalse(GetCompareTy(RHS));			return ConstantInt::getFalse(CmpTy);
	if (RHS_CR.isFullSet())			if (RHS_CR.isFullSet())
	return ConstantInt::getTrue(GetCompareTy(RHS));			return ConstantInt::getTrue(CmpTy);

	// Find the range of possible values for binary operators.			// Find the range of possible values for binary operators.
	unsigned Width = C->getBitWidth();			unsigned Width = C->getBitWidth();
	APInt Lower = APInt(Width, 0);			APInt Lower = APInt(Width, 0);
	APInt Upper = APInt(Width, 0);			APInt Upper = APInt(Width, 0);
	if (auto *BO = dyn_cast<BinaryOperator>(LHS))			if (auto *BO = dyn_cast<BinaryOperator>(LHS))
	setLimitsForBinOp(*BO, Lower, Upper);			setLimitsForBinOp(*BO, Lower, Upper);

	ConstantRange LHS_CR =			ConstantRange LHS_CR =
	Lower != Upper ? ConstantRange(Lower, Upper) : ConstantRange(Width, true);			Lower != Upper ? ConstantRange(Lower, Upper) : ConstantRange(Width, true);

	if (auto *I = dyn_cast<Instruction>(LHS))			if (auto *I = dyn_cast<Instruction>(LHS))
	if (auto *Ranges = I->getMetadata(LLVMContext::MD_range))			if (auto *Ranges = I->getMetadata(LLVMContext::MD_range))
	LHS_CR = LHS_CR.intersectWith(getConstantRangeFromMetadata(*Ranges));			LHS_CR = LHS_CR.intersectWith(getConstantRangeFromMetadata(*Ranges));

	if (!LHS_CR.isFullSet()) {			if (!LHS_CR.isFullSet()) {
	if (RHS_CR.contains(LHS_CR))			if (RHS_CR.contains(LHS_CR))
	return ConstantInt::getTrue(GetCompareTy(RHS));			return ConstantInt::getTrue(CmpTy);
	if (RHS_CR.inverse().contains(LHS_CR))			if (RHS_CR.inverse().contains(LHS_CR))
	return ConstantInt::getFalse(GetCompareTy(RHS));			return ConstantInt::getFalse(CmpTy);
				}

				// Shift-left doesn't easily conform to range reduction, but we can still
				// check if the inserted zero bits make this comparison true or false.
				const APInt *ShiftAmtC;
				if (match(LHS, m_Shl(m_Value(), m_APInt(ShiftAmtC))) &&
				C->lshr(ShiftAmtC).shl(ShiftAmtC) != *C) {
				// icmp eq (shl X, ShiftAmtC), C --> false if any low bits of C are set
				if (Pred == ICmpInst::ICMP_EQ)
				return ConstantInt::getFalse(CmpTy);
				// icmp ne (shl X, ShiftAmtC), C --> true if any low bits of C are set
				if (Pred == ICmpInst::ICMP_NE)
				return ConstantInt::getTrue(CmpTy);
	}			}

	return nullptr;			return nullptr;
	}			}

	/// TODO: A large part of this logic is duplicated in InstCombine's			/// TODO: A large part of this logic is duplicated in InstCombine's
	/// foldICmpBinOp(). We should be able to share that and avoid the code			/// foldICmpBinOp(). We should be able to share that and avoid the code
	/// duplication.			/// duplication.
	▲ Show 20 Lines • Show All 2,361 Lines • Show Last 20 Lines

test/Transforms/InstSimplify/icmp-constant.ll

	Show First 20 Lines • Show All 565 Lines • ▼ Show 20 Lines
	; CHECK-LABEL: @add_nsw_pos_const5_splat_vec(			; CHECK-LABEL: @add_nsw_pos_const5_splat_vec(
	; CHECK-NEXT: ret <2 x i1> <i1 true, i1 true>			; CHECK-NEXT: ret <2 x i1> <i1 true, i1 true>
	;			;
	%add = add nsw <2 x i32> %x, <i32 42, i32 42>			%add = add nsw <2 x i32> %x, <i32 42, i32 42>
	%cmp = icmp ne <2 x i32> %add, <i32 -2147483607, i32 -2147483607>			%cmp = icmp ne <2 x i32> %add, <i32 -2147483607, i32 -2147483607>
	ret <2 x i1> %cmp			ret <2 x i1> %cmp
	}			}

				; PR34838 - https://bugs.llvm.org/show_bug.cgi?id=34838
				; The shift is known to create poison, but that doesn't mean we can't simplify the cmp.

				define i1 @ne_shl_low_bits_set(i8 %x) {
				; CHECK-LABEL: @ne_shl_low_bits_set(
				; CHECK-NEXT: ret i1 true
				;
				%zx = zext i8 %x to i16 ; zx = 0x00xx
				%xor = xor i16 %zx, 32767 ; xor = 0x7fyy
				%sub = sub nsw i16 %zx, %xor ; sub = 0x80zz (the top bit is known one)
				%sh = shl nsw i16 %sub, 2 ; oops! this shl can't be nsw; that's POISON
				%cmp = icmp ne i16 %sh, 1
				ret i1 %cmp
				}

				define i1 @eq_shl_low_bits_set(i8 %x) {
				; CHECK-LABEL: @eq_shl_low_bits_set(
				; CHECK-NEXT: ret i1 false
				;
				%clear_high_bit = and i8 %x, 127
				%set_next_high_bits = or i8 %clear_high_bit, 112 ; 0x70
				%poison_shift = shl nsw i8 %set_next_high_bits, 3
				%cmp = icmp eq i8 %poison_shift, 15
				ret i1 %cmp
				}