This is an archive of the discontinued LLVM Phabricator instance.

ValueTracking: Put DataLayout reference into the Query structure, NFC.
ClosedPublic

Authored by MatzeB on Jan 14 2016, 3:46 PM.

Details

Summary

After r251146, computeKnownBits() is called multiple times for every instruction in the program, which resulted in a 3% compile-time regression. This patch tries to get some of that compile time back by optimizing the function:

Put DataLayout reference into the Query structure. It looks nicer and improves the compile time of a typical clang -O3 -emit-llvm run by ~0.6% for me.
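
For readers without the diff in front of them, the shape of the change is roughly the following. This is a simplified sketch, not the actual ValueTracking.cpp code; the real Query has more members and the helpers have more parameters.

  // Simplified sketch of the change, not the actual LLVM code.
  #include "llvm/ADT/APInt.h"
  #include "llvm/IR/DataLayout.h"
  #include "llvm/IR/Instruction.h"
  #include "llvm/IR/Value.h"
  using namespace llvm;

  namespace {
  struct Query {
    const DataLayout &DL;    // new member added by this patch
    const Instruction *CxtI; // other members (assumption cache, DT, ...) elided
  };

  // Before: DL is threaded through every helper as an explicit argument.
  void computeKnownBitsBefore(const Value *V, APInt &KnownZero,
                              APInt &KnownOne, const DataLayout &DL,
                              unsigned Depth, const Query &Q);

  // After: DL travels inside Q and is read as Q.DL where it is needed, so
  // every call site passes one fewer argument.
  void computeKnownBitsAfter(const Value *V, APInt &KnownZero,
                             APInt &KnownOne, unsigned Depth, const Query &Q);
  } // end anonymous namespace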

Diff Detail

Repository
rL LLVM

Event Timeline

MatzeB updated this revision to Diff 44940.Jan 14 2016, 3:46 PM
MatzeB retitled this revision from to ValueTracking: Put DataLayout reference into the Query structure, NFC..
MatzeB updated this object.
MatzeB added reviewers: hfinkel, majnemer, sanjoy.
MatzeB set the repository for this revision to rL LLVM.
MatzeB added a subscriber: llvm-commits.

IIUC, you just save a pointer argument to computeKnownBits, right? Is that the only reason for the speedup? That's pretty surprising to me.

lib/Analysis/ValueTracking.cpp
461 (On Diff #44940)

The changes in this function seem unrelated to the DataLayout; this is a cleanup that you could commit separately, I feel.

In D16205#327411, @joker.eph wrote:

IIUC, you just save a pointer argument to computeKnownBits, right? Is that the only reason for the speedup? That's pretty surprising to me.

Yes, it's "just" the pointer argument, but the code here is called often and recursively, so this can add up to a smaller stack size and reduced reloading/spilling.
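
To make that concrete, here is a hypothetical illustration (not the actual ValueTracking code) of the effect on a recursive helper: with DL as its own parameter it is live across every recursive call and has to survive each call in a callee-saved register or a stack slot, whereas once it sits in the Query that is forwarded anyway, it is only a field load away at the places that actually use it.

  // Hypothetical illustration, not the actual ValueTracking code.
  #include "llvm/IR/DataLayout.h"
  #include "llvm/IR/User.h"
  #include "llvm/IR/Value.h"
  #include "llvm/Support/Casting.h"
  using namespace llvm;

  namespace {
  struct Query { const DataLayout &DL; }; // minimal stand-in for the real one

  // Before: DL is live across the recursive call, so it must be kept in a
  // callee-saved register or spilled on entry and reloaded before the call.
  unsigned walkBefore(const Value *V, const DataLayout &DL, unsigned Depth) {
    const auto *U = dyn_cast<User>(V);
    if (Depth == 6 || !U || U->getNumOperands() == 0)
      return Depth;
    return walkBefore(U->getOperand(0), DL, Depth + 1); // DL forwarded each time
  }

  // After: only Q (which was forwarded already) stays live across the call;
  // DL is reached as Q.DL where the layout is actually consulted, so one
  // fewer value has to be preserved per recursion level.
  unsigned walkAfter(const Value *V, unsigned Depth, const Query &Q) {
    const auto *U = dyn_cast<User>(V);
    if (Depth == 6 || !U || U->getNumOperands() == 0)
      return Depth;
    (void)Q.DL.getPointerSizeInBits(); // e.g. where a pointer size is needed
    return walkAfter(U->getOperand(0), Depth + 1, Q);
  }
  } // end anonymous namespace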

lib/Analysis/ValueTracking.cpp
461 (On Diff #44940)

This is necessary because llvm::isValidAssumeForContext() constructed an ad-hoc Query to transmit these two values; however, we do not have a DataLayout at that point and don't actually need it here anyway.
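
To illustrate the constraint (hypothetical names, not the actual code): once Query carries a DataLayout reference, it can no longer be built ad hoc at a call site that has no DataLayout available, so the helper takes the two values it needs directly.

  // Hypothetical sketch, not the actual LLVM code; the helper name is invented.
  #include "llvm/IR/DataLayout.h"
  #include "llvm/IR/Dominators.h"
  #include "llvm/IR/Instruction.h"
  using namespace llvm;

  namespace {
  struct Query {
    const DataLayout &DL;    // a reference member must be bound at construction
    const Instruction *CxtI;
    const DominatorTree *DT;
  };

  // A caller that only has CxtI and DT can no longer wrap them in a Query,
  // and this path never uses the layout anyway, so the helper simply takes
  // the two values directly:
  bool isValidAssumeForContextHelper(const Instruction *Inv,
                                     const Instruction *CxtI,
                                     const DominatorTree *DT);
  } // end anonymous namespace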

mehdi_amini accepted this revision.Jan 14 2016, 4:04 PM
mehdi_amini added a reviewer: mehdi_amini.

LGTM.
(I added the DataLayout argument to computeKnownBits last year; I'm fine with adding it to the query.)

This revision is now accepted and ready to land.Jan 14 2016, 4:04 PM
reames added a subscriber: reames.Jan 14 2016, 4:35 PM

I have no objection to the patch, but I am curious to know why you see a speedup here. I'd really expect a combination of inlining and register allocation tricks* to cut the cost of the argument passing to near zero. If it isn't, that seems like something we should fix in the compiler. (I'm assuming you're compiling with a recent Clang here?)

  * In many of these cases, the argument register doesn't need to be preserved over the inner call because a) it's not a callee-saved register and b) it's not used after the return.

Have you looked at the resulting assembly to see where the time is spent?

Philip

I looked at it a bit in valgrind/callgrind. There are indeed 0.7% fewer instructions executed overall.
Looking at the assembly, the DataLayout parameter is spilled at the beginning of each function and reloaded in front of nearly all recursive calls. According to valgrind, this saves 2-3% of the instructions in the various computeKnownBits functions, and, surprisingly, these functions are high enough in the profile that this adds up to the 0.7% saved overall.

This revision was automatically updated to reflect the committed changes.