This is an archive of the discontinued LLVM Phabricator instance.

[DAGCombine] Remove AND in SETCC if we can prove they are unneeded
AbandonedPublic

Authored by dmgreen on Mar 2 2018, 12:53 PM.

Download Raw Diff

Details

Reviewers

efriedma
spatel
john.brawn
samparker
craig.topper
RKSimon
javed.absar

Summary

This is a possible solution for the remainder of PR35875. The idea is that if
we have a SETCC of two ANDs with the same mask, and we can calculate
that the non-mask bits of the operands of the AND are the same, we can
drop the ANDs from the SETCC.
i.e (x&C < y&C) => x<y for any s/u </>/=/etc.

Here is an attempt to prove such a thing at the IR level:
https://rise4fun.com/Alive/ha2

So that seems to be fine. Except that the code in question, the ANDs come
from type legalising a SETCC node, with values that come from anyext loads.
These don't provide upper known bits, even though they are likely to become
zext loads.

Hence all this plumbing to find and return anyext loads, treating them as
zeroext loads during computeKnownBits and adjusting the loads to zeroext
if that is useful for this case.

The left hand side of these tests are not yet checked in. As you can hopefully
see, it helps out quite a bit here. If there are better ways to solve this
problem, I'm all ears.

Diff Detail

Event Timeline

dmgreen created this revision.Mar 2 2018, 12:53 PM

Herald added subscribers: fedor.sergeev, kbarton, javed.absar and 4 others. · View Herald TranscriptMar 2 2018, 12:53 PM

dmgreen edited the summary of this revision. (Show Details)Mar 2 2018, 12:53 PM

spatel mentioned this in rL326659: [InstCombine] add tests for notnotsub; NFC.Mar 3 2018, 9:23 AM

rogfer01 added a subscriber: rogfer01.Mar 4 2018, 11:59 PM

nhaehnle removed a subscriber: nhaehnle.Mar 5 2018, 2:54 AM

samparker added inline comments.Mar 6 2018, 1:26 AM

include/llvm/CodeGen/SelectionDAG.h
1385	If you reorder the default parameters, you could reduce the number of changes needed in the backends.
lib/CodeGen/SelectionDAG/TargetLowering.cpp
2682	Is this safe? Shouldn't the load be checked for a single use or that all the uses on your known bits path?

FWIW - I haven't looked closely at this patch (it might be good independently of PR35875), but I think we can optimize the motivating case even more than what's shown here by reducing the IR:
https://bugs.llvm.org/show_bug.cgi?id=35875#c5

I was asked not to add DemandedElts to individual cases in computeKnownBits until you have test coverage - this should probably be the same for AnyToZeroExtLoads. That will be easier if you move the AnyToZeroExtLoads arg to the end to make use of the default nullptr.

Random idea - would a more general approach be for us to compute undefined bits as well as one/zero bits - either adding them to the KnownBits struct or add it separately?

Thanks all for the reviews. I'm currently hoping that the transform Sanjay suggested will take away my main motivation for this patch. Here are some responses in any case:

I was asked not to add DemandedElts to individual cases in computeKnownBits until you have test coverage - this should probably be the same for AnyToZeroExtLoads.

OK. Noted. By this you mean unit tests for computeKnownBits? Sounds like a sensible idea. I'm guessing the nature of ISel makes this difficult?

Random idea - would a more general approach be for us to compute undefined bits as well as one/zero bits - either adding them to the KnownBits struct or add it separately?

In this specific case (SETCC(AND(%x, C) , AND(%y, C)) where we know that the ~C bits of %x == %y, then the AND can be removed) - we need to know the bits are a specific value. We can't have the two sides being undef and then choosing to differently zero/signext for example. In the case I was looking, x and y were both and(xor(8bit load, -1), 255), so the top bits are all 1 on both sides, so long as we choose to zeroext. I think we need to know the bits, and know which nodes to change to force those bits to be specified. It might well be a good idea for other situations.

include/llvm/CodeGen/SelectionDAG.h
1385	Yeah, makes sense. That's probably best. I was trying to keep depth as the last argument for some reason.
lib/CodeGen/SelectionDAG/TargetLowering.cpp
2682	I don't know a huge amount about the internals of selection dag. My understanding was that if they are anyext then we cannot presume the values of the bits, but then neither can anything else. So long as we set the values of the bits (as we do here), then all other optimisations have to then use the bits as 0. If I could have just changed the existing LD's to zeroextends, that would be simpler. But I believe the nodes are immutable. So I don't think we have to check for single uses as the only bits we are changing are the upper bits from unspecified to 0's. Which isn't something that anything else can rely upon, without itself doing the unspecified -> set transform.

In D44043#1031702, @dmgreen wrote:

I was asked not to add DemandedElts to individual cases in computeKnownBits until you have test coverage - this should probably be the same for AnyToZeroExtLoads.

OK. Noted. By this you mean unit tests for computeKnownBits? Sounds like a sensible idea. I'm guessing the nature of ISel makes this difficult?

Most of my tests for these cases come from combines that make use of SelectionDAG::SignBitIsZero which uses computeKnownBits - converting uitofp to sitofp etc. in x86 combine-*.ll tests have some good examples.

@dmgreen Do you still need this patch or should it be abandoned?

Herald added a reviewer: javed.absar. · View Herald TranscriptSep 29 2018, 3:59 AM

Herald added subscribers: jsji, jvesely. · View Herald Transcript

Yep sure, with 52177, I no longer have a motivating case for this.

Revision Contents

Path

Size

include/

llvm/

CodeGen/

SelectionDAG.h

11 lines

TargetLowering.h

9 lines

lib/

CodeGen/

SelectionDAG/

SelectionDAG.cpp

212 lines

TargetLowering.cpp

69 lines

Target/

AArch64/

AArch64ISelLowering.h

8 lines

AArch64ISelLowering.cpp

11 lines

AMDGPU/

AMDGPUISelLowering.h

9 lines

AMDGPUISelLowering.cpp

11 lines

ARM/

ARMISelLowering.h

10 lines

ARMISelLowering.cpp

16 lines

Lanai/

LanaiISelLowering.h

8 lines

LanaiISelLowering.cpp

9 lines

PowerPC/

PPCISelLowering.h

10 lines

PPCISelLowering.cpp

9 lines

Sparc/

SparcISelLowering.h

10 lines

SparcISelLowering.cpp

15 lines

SystemZ/

SystemZISelLowering.h

9 lines

SystemZISelLowering.cpp

16 lines

X86/

X86ISelLowering.h

10 lines

X86ISelLowering.cpp

22 lines

X86InstrCompiler.td

4 lines

XCore/

XCoreISelLowering.h

10 lines

XCoreISelLowering.cpp

9 lines

test/

CodeGen/

Thumb/

setcc_xor.ll

149 lines

Diff 136826

include/llvm/CodeGen/SelectionDAG.h

Show First 20 Lines • Show All 1,374 Lines • ▼ Show 20 Lines	public:
bool MaskedValueIsZero(SDValue Op, const APInt &Mask, unsigned Depth = 0)		bool MaskedValueIsZero(SDValue Op, const APInt &Mask, unsigned Depth = 0)
const;		const;

/// Determine which bits of Op are known to be either zero or one and return		/// Determine which bits of Op are known to be either zero or one and return
/// them in Known. For vectors, the known bits are those that are shared by		/// them in Known. For vectors, the known bits are those that are shared by
/// every vector element.		/// every vector element.
/// Targets can implement the computeKnownBitsForTargetNode method in the		/// Targets can implement the computeKnownBitsForTargetNode method in the
/// TargetLowering class to allow target nodes to be understood.		/// TargetLowering class to allow target nodes to be understood.
void computeKnownBits(SDValue Op, KnownBits &Known, unsigned Depth = 0) const;		void
		computeKnownBits(SDValue Op, KnownBits &Known,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads = nullptr,
		samparkerUnsubmitted Not Done Reply Inline Actions If you reorder the default parameters, you could reduce the number of changes needed in the backends. samparker: If you reorder the default parameters, you could reduce the number of changes needed in the…
		dmgreenAuthorUnsubmitted Not Done Reply Inline Actions Yeah, makes sense. That's probably best. I was trying to keep depth as the last argument for some reason. dmgreen: Yeah, makes sense. That's probably best. I was trying to keep depth as the last argument for…
		unsigned Depth = 0) const;

/// Determine which bits of Op are known to be either zero or one and return		/// Determine which bits of Op are known to be either zero or one and return
/// them in Known. The DemandedElts argument allows us to only collect the		/// them in Known. The DemandedElts argument allows us to only collect the
/// known bits that are shared by the requested vector elements.		/// known bits that are shared by the requested vector elements.
/// Targets can implement the computeKnownBitsForTargetNode method in the		/// Targets can implement the computeKnownBitsForTargetNode method in the
/// TargetLowering class to allow target nodes to be understood.		/// TargetLowering class to allow target nodes to be understood.
void computeKnownBits(SDValue Op, KnownBits &Known, const APInt &DemandedElts,		void
		computeKnownBits(SDValue Op, KnownBits &Known, const APInt &DemandedElts,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads = nullptr,
unsigned Depth = 0) const;		unsigned Depth = 0) const;

/// Used to represent the possible overflow behavior of an operation.		/// Used to represent the possible overflow behavior of an operation.
/// Never: the operation cannot overflow.		/// Never: the operation cannot overflow.
/// Always: the operation will always overflow.		/// Always: the operation will always overflow.
/// Sometime: the operation may or may not overflow.		/// Sometime: the operation may or may not overflow.
enum OverflowKind {		enum OverflowKind {
OFK_Never,		OFK_Never,
OFK_Sometime,		OFK_Sometime,
▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

include/llvm/CodeGen/TargetLowering.h

Show First 20 Lines • Show All 2,735 Lines • ▼ Show 20 Lines	public:
bool SimplifyDemandedVectorElts(SDValue Op, const APInt &DemandedElts,		bool SimplifyDemandedVectorElts(SDValue Op, const APInt &DemandedElts,
APInt &KnownUndef, APInt &KnownZero,		APInt &KnownUndef, APInt &KnownZero,
DAGCombinerInfo &DCI) const;		DAGCombinerInfo &DCI) const;

/// Determine which of the bits specified in Mask are known to be either zero		/// Determine which of the bits specified in Mask are known to be either zero
/// or one and return them in the KnownZero/KnownOne bitsets. The DemandedElts		/// or one and return them in the KnownZero/KnownOne bitsets. The DemandedElts
/// argument allows us to only collect the known bits that are shared by the		/// argument allows us to only collect the known bits that are shared by the
/// requested vector elements.		/// requested vector elements.
virtual void computeKnownBitsForTargetNode(const SDValue Op,		virtual void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth = 0) const;		unsigned Depth = 0) const;

/// Determine which of the bits of FrameIndex \p FIOp are known to be 0.		/// Determine which of the bits of FrameIndex \p FIOp are known to be 0.
/// Default implementation computes low bits based on alignment		/// Default implementation computes low bits based on alignment
/// information. This should preserve known bits passed into it.		/// information. This should preserve known bits passed into it.
virtual void computeKnownBitsForFrameIndex(const SDValue FIOp,		virtual void computeKnownBitsForFrameIndex(const SDValue FIOp,
KnownBits &Known,		KnownBits &Known,
const APInt &DemandedElts,		const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
▲ Show 20 Lines • Show All 838 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/SelectionDAG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,108 Lines • ▼ Show 20 Lines
}		}

/// MaskedValueIsZero - Return true if 'V & Mask' is known to be zero. We use		/// MaskedValueIsZero - Return true if 'V & Mask' is known to be zero. We use
/// this predicate to simplify operations downstream. Mask is known to be zero		/// this predicate to simplify operations downstream. Mask is known to be zero
/// for bits that V cannot have.		/// for bits that V cannot have.
bool SelectionDAG::MaskedValueIsZero(SDValue Op, const APInt &Mask,		bool SelectionDAG::MaskedValueIsZero(SDValue Op, const APInt &Mask,
unsigned Depth) const {		unsigned Depth) const {
KnownBits Known;		KnownBits Known;
computeKnownBits(Op, Known, Depth);		computeKnownBits(Op, Known, nullptr, Depth);
return Mask.isSubsetOf(Known.Zero);		return Mask.isSubsetOf(Known.Zero);
}		}

/// Helper function that checks to see if a node is a constant or a		/// Helper function that checks to see if a node is a constant or a
/// build vector of splat constants at least within the demanded elts.		/// build vector of splat constants at least within the demanded elts.
static ConstantSDNode *isConstOrDemandedConstSplat(SDValue N,		static ConstantSDNode *isConstOrDemandedConstSplat(SDValue N,
const APInt &DemandedElts) {		const APInt &DemandedElts) {
if (ConstantSDNode *CN = dyn_cast<ConstantSDNode>(N))		if (ConstantSDNode *CN = dyn_cast<ConstantSDNode>(N))
Show All 26 Lines	if (ShAmt.ult(V.getScalarValueSizeInBits()))
return &ShAmt;		return &ShAmt;
}		}
return nullptr;		return nullptr;
}		}

/// Determine which bits of Op are known to be either zero or one and return		/// Determine which bits of Op are known to be either zero or one and return
/// them in Known. For vectors, the known bits are those that are shared by		/// them in Known. For vectors, the known bits are those that are shared by
/// every vector element.		/// every vector element.
void SelectionDAG::computeKnownBits(SDValue Op, KnownBits &Known,		void SelectionDAG::computeKnownBits(
unsigned Depth) const {		SDValue Op, KnownBits &Known,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads, unsigned Depth) const {
EVT VT = Op.getValueType();		EVT VT = Op.getValueType();
APInt DemandedElts = VT.isVector()		APInt DemandedElts = VT.isVector()
? APInt::getAllOnesValue(VT.getVectorNumElements())		? APInt::getAllOnesValue(VT.getVectorNumElements())
: APInt(1, 1);		: APInt(1, 1);
computeKnownBits(Op, Known, DemandedElts, Depth);		computeKnownBits(Op, Known, DemandedElts, AnyToZeroExtLoads, Depth);
}		}

/// Determine which bits of Op are known to be either zero or one and return		/// Determine which bits of Op are known to be either zero or one and return
/// them in Known. The DemandedElts argument allows us to only collect the known		/// them in Known. The DemandedElts argument allows us to only collect the known
/// bits that are shared by the requested vector elements.		/// bits that are shared by the requested vector elements. AnyToZeroExtLoads
void SelectionDAG::computeKnownBits(SDValue Op, KnownBits &Known,		/// optionally allows anyext loads to be treated as zeroext (top bits zero) and
const APInt &DemandedElts,		/// returned in the SmallPtrSet. They should be converted to zeroext loads for
unsigned Depth) const {		/// the returned KnownBits to be valid.
		void SelectionDAG::computeKnownBits(
		SDValue Op, KnownBits &Known, const APInt &DemandedElts,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads, unsigned Depth) const {
unsigned BitWidth = Op.getScalarValueSizeInBits();		unsigned BitWidth = Op.getScalarValueSizeInBits();

Known = KnownBits(BitWidth); // Don't know anything.		Known = KnownBits(BitWidth); // Don't know anything.

if (auto *C = dyn_cast<ConstantSDNode>(Op)) {		if (auto *C = dyn_cast<ConstantSDNode>(Op)) {
// We know all of the bits for a constant!		// We know all of the bits for a constant!
Known.One = C->getAPIntValue();		Known.One = C->getAPIntValue();
Known.Zero = ~Known.One;		Known.Zero = ~Known.One;
Show All 22 Lines	case ISD::BUILD_VECTOR:
assert(NumElts == Op.getValueType().getVectorNumElements() &&		assert(NumElts == Op.getValueType().getVectorNumElements() &&
"Unexpected vector size");		"Unexpected vector size");
Known.Zero.setAllBits(); Known.One.setAllBits();		Known.Zero.setAllBits(); Known.One.setAllBits();
for (unsigned i = 0, e = Op.getNumOperands(); i != e; ++i) {		for (unsigned i = 0, e = Op.getNumOperands(); i != e; ++i) {
if (!DemandedElts[i])		if (!DemandedElts[i])
continue;		continue;

SDValue SrcOp = Op.getOperand(i);		SDValue SrcOp = Op.getOperand(i);
computeKnownBits(SrcOp, Known2, Depth + 1);		computeKnownBits(SrcOp, Known2, AnyToZeroExtLoads, Depth + 1);

// BUILD_VECTOR can implicitly truncate sources, we must handle this.		// BUILD_VECTOR can implicitly truncate sources, we must handle this.
if (SrcOp.getValueSizeInBits() != BitWidth) {		if (SrcOp.getValueSizeInBits() != BitWidth) {
assert(SrcOp.getValueSizeInBits() > BitWidth &&		assert(SrcOp.getValueSizeInBits() > BitWidth &&
"Expected BUILD_VECTOR implicit truncation");		"Expected BUILD_VECTOR implicit truncation");
Known2 = Known2.trunc(BitWidth);		Known2 = Known2.trunc(BitWidth);
}		}

Show All 30 Lines	for (unsigned i = 0; i != NumElts; ++i) {
if ((unsigned)M < NumElts)		if ((unsigned)M < NumElts)
DemandedLHS.setBit((unsigned)M % NumElts);		DemandedLHS.setBit((unsigned)M % NumElts);
else		else
DemandedRHS.setBit((unsigned)M % NumElts);		DemandedRHS.setBit((unsigned)M % NumElts);
}		}
// Known bits are the values that are shared by every demanded element.		// Known bits are the values that are shared by every demanded element.
if (!!DemandedLHS) {		if (!!DemandedLHS) {
SDValue LHS = Op.getOperand(0);		SDValue LHS = Op.getOperand(0);
computeKnownBits(LHS, Known2, DemandedLHS, Depth + 1);		computeKnownBits(LHS, Known2, DemandedLHS, AnyToZeroExtLoads, Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
if (!!DemandedRHS) {		if (!!DemandedRHS) {
SDValue RHS = Op.getOperand(1);		SDValue RHS = Op.getOperand(1);
computeKnownBits(RHS, Known2, DemandedRHS, Depth + 1);		computeKnownBits(RHS, Known2, DemandedRHS, AnyToZeroExtLoads, Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
break;		break;
}		}
case ISD::CONCAT_VECTORS: {		case ISD::CONCAT_VECTORS: {
// Split DemandedElts and test each of the demanded subvectors.		// Split DemandedElts and test each of the demanded subvectors.
Known.Zero.setAllBits(); Known.One.setAllBits();		Known.Zero.setAllBits(); Known.One.setAllBits();
EVT SubVectorVT = Op.getOperand(0).getValueType();		EVT SubVectorVT = Op.getOperand(0).getValueType();
unsigned NumSubVectorElts = SubVectorVT.getVectorNumElements();		unsigned NumSubVectorElts = SubVectorVT.getVectorNumElements();
unsigned NumSubVectors = Op.getNumOperands();		unsigned NumSubVectors = Op.getNumOperands();
for (unsigned i = 0; i != NumSubVectors; ++i) {		for (unsigned i = 0; i != NumSubVectors; ++i) {
APInt DemandedSub = DemandedElts.lshr(i * NumSubVectorElts);		APInt DemandedSub = DemandedElts.lshr(i * NumSubVectorElts);
DemandedSub = DemandedSub.trunc(NumSubVectorElts);		DemandedSub = DemandedSub.trunc(NumSubVectorElts);
if (!!DemandedSub) {		if (!!DemandedSub) {
SDValue Sub = Op.getOperand(i);		SDValue Sub = Op.getOperand(i);
computeKnownBits(Sub, Known2, DemandedSub, Depth + 1);		computeKnownBits(Sub, Known2, DemandedSub, AnyToZeroExtLoads,
		Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
}		}
break;		break;
}		}
case ISD::INSERT_SUBVECTOR: {		case ISD::INSERT_SUBVECTOR: {
// If we know the element index, demand any elements from the subvector and		// If we know the element index, demand any elements from the subvector and
// the remainder from the src its inserted into, otherwise demand them all.		// the remainder from the src its inserted into, otherwise demand them all.
SDValue Src = Op.getOperand(0);		SDValue Src = Op.getOperand(0);
SDValue Sub = Op.getOperand(1);		SDValue Sub = Op.getOperand(1);
ConstantSDNode *SubIdx = dyn_cast<ConstantSDNode>(Op.getOperand(2));		ConstantSDNode *SubIdx = dyn_cast<ConstantSDNode>(Op.getOperand(2));
unsigned NumSubElts = Sub.getValueType().getVectorNumElements();		unsigned NumSubElts = Sub.getValueType().getVectorNumElements();
if (SubIdx && SubIdx->getAPIntValue().ule(NumElts - NumSubElts)) {		if (SubIdx && SubIdx->getAPIntValue().ule(NumElts - NumSubElts)) {
Known.One.setAllBits();		Known.One.setAllBits();
Known.Zero.setAllBits();		Known.Zero.setAllBits();
uint64_t Idx = SubIdx->getZExtValue();		uint64_t Idx = SubIdx->getZExtValue();
APInt DemandedSubElts = DemandedElts.extractBits(NumSubElts, Idx);		APInt DemandedSubElts = DemandedElts.extractBits(NumSubElts, Idx);
if (!!DemandedSubElts) {		if (!!DemandedSubElts) {
computeKnownBits(Sub, Known, DemandedSubElts, Depth + 1);		computeKnownBits(Sub, Known, DemandedSubElts, AnyToZeroExtLoads,
		Depth + 1);
if (Known.isUnknown())		if (Known.isUnknown())
break; // early-out.		break; // early-out.
}		}
APInt SubMask = APInt::getBitsSet(NumElts, Idx, Idx + NumSubElts);		APInt SubMask = APInt::getBitsSet(NumElts, Idx, Idx + NumSubElts);
APInt DemandedSrcElts = DemandedElts & ~SubMask;		APInt DemandedSrcElts = DemandedElts & ~SubMask;
if (!!DemandedSrcElts) {		if (!!DemandedSrcElts) {
computeKnownBits(Src, Known2, DemandedSrcElts, Depth + 1);		computeKnownBits(Src, Known2, DemandedSrcElts, AnyToZeroExtLoads,
		Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
} else {		} else {
computeKnownBits(Sub, Known, Depth + 1);		computeKnownBits(Sub, Known, AnyToZeroExtLoads, Depth + 1);
if (Known.isUnknown())		if (Known.isUnknown())
break; // early-out.		break; // early-out.
computeKnownBits(Src, Known2, Depth + 1);		computeKnownBits(Src, Known2, AnyToZeroExtLoads, Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
break;		break;
}		}
case ISD::EXTRACT_SUBVECTOR: {		case ISD::EXTRACT_SUBVECTOR: {
// If we know the element index, just demand that subvector elements,		// If we know the element index, just demand that subvector elements,
// otherwise demand them all.		// otherwise demand them all.
SDValue Src = Op.getOperand(0);		SDValue Src = Op.getOperand(0);
ConstantSDNode *SubIdx = dyn_cast<ConstantSDNode>(Op.getOperand(1));		ConstantSDNode *SubIdx = dyn_cast<ConstantSDNode>(Op.getOperand(1));
unsigned NumSrcElts = Src.getValueType().getVectorNumElements();		unsigned NumSrcElts = Src.getValueType().getVectorNumElements();
if (SubIdx && SubIdx->getAPIntValue().ule(NumSrcElts - NumElts)) {		if (SubIdx && SubIdx->getAPIntValue().ule(NumSrcElts - NumElts)) {
// Offset the demanded elts by the subvector index.		// Offset the demanded elts by the subvector index.
uint64_t Idx = SubIdx->getZExtValue();		uint64_t Idx = SubIdx->getZExtValue();
APInt DemandedSrc = DemandedElts.zext(NumSrcElts).shl(Idx);		APInt DemandedSrc = DemandedElts.zext(NumSrcElts).shl(Idx);
computeKnownBits(Src, Known, DemandedSrc, Depth + 1);		computeKnownBits(Src, Known, DemandedSrc, AnyToZeroExtLoads, Depth + 1);
} else {		} else {
computeKnownBits(Src, Known, Depth + 1);		computeKnownBits(Src, Known, AnyToZeroExtLoads, Depth + 1);
}		}
break;		break;
}		}
case ISD::BITCAST: {		case ISD::BITCAST: {
SDValue N0 = Op.getOperand(0);		SDValue N0 = Op.getOperand(0);
EVT SubVT = N0.getValueType();		EVT SubVT = N0.getValueType();
unsigned SubBitWidth = SubVT.getScalarSizeInBits();		unsigned SubBitWidth = SubVT.getScalarSizeInBits();

// Ignore bitcasts from unsupported types.		// Ignore bitcasts from unsupported types.
if (!(SubVT.isInteger() \|\| SubVT.isFloatingPoint()))		if (!(SubVT.isInteger() \|\| SubVT.isFloatingPoint()))
break;		break;

// Fast handling of 'identity' bitcasts.		// Fast handling of 'identity' bitcasts.
if (BitWidth == SubBitWidth) {		if (BitWidth == SubBitWidth) {
computeKnownBits(N0, Known, DemandedElts, Depth + 1);		computeKnownBits(N0, Known, DemandedElts, AnyToZeroExtLoads, Depth + 1);
break;		break;
}		}

// Support big-endian targets when it becomes useful.		// Support big-endian targets when it becomes useful.
bool IsLE = getDataLayout().isLittleEndian();		bool IsLE = getDataLayout().isLittleEndian();
if (!IsLE)		if (!IsLE)
break;		break;

// Bitcast 'small element' vector to 'large element' scalar/vector.		// Bitcast 'small element' vector to 'large element' scalar/vector.
if ((BitWidth % SubBitWidth) == 0) {		if ((BitWidth % SubBitWidth) == 0) {
assert(N0.getValueType().isVector() && "Expected bitcast from vector");		assert(N0.getValueType().isVector() && "Expected bitcast from vector");

// Collect known bits for the (larger) output by collecting the known		// Collect known bits for the (larger) output by collecting the known
// bits from each set of sub elements and shift these into place.		// bits from each set of sub elements and shift these into place.
// We need to separately call computeKnownBits for each set of		// We need to separately call computeKnownBits for each set of
// sub elements as the knownbits for each is likely to be different.		// sub elements as the knownbits for each is likely to be different.
unsigned SubScale = BitWidth / SubBitWidth;		unsigned SubScale = BitWidth / SubBitWidth;
APInt SubDemandedElts(NumElts * SubScale, 0);		APInt SubDemandedElts(NumElts * SubScale, 0);
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
if (DemandedElts[i])		if (DemandedElts[i])
SubDemandedElts.setBit(i * SubScale);		SubDemandedElts.setBit(i * SubScale);

for (unsigned i = 0; i != SubScale; ++i) {		for (unsigned i = 0; i != SubScale; ++i) {
computeKnownBits(N0, Known2, SubDemandedElts.shl(i),		computeKnownBits(N0, Known2, SubDemandedElts.shl(i), AnyToZeroExtLoads,
Depth + 1);		Depth + 1);
Known.One \|= Known2.One.zext(BitWidth).shl(SubBitWidth * i);		Known.One \|= Known2.One.zext(BitWidth).shl(SubBitWidth * i);
Known.Zero \|= Known2.Zero.zext(BitWidth).shl(SubBitWidth * i);		Known.Zero \|= Known2.Zero.zext(BitWidth).shl(SubBitWidth * i);
}		}
}		}

// Bitcast 'large element' scalar/vector to 'small element' vector.		// Bitcast 'large element' scalar/vector to 'small element' vector.
if ((SubBitWidth % BitWidth) == 0) {		if ((SubBitWidth % BitWidth) == 0) {
assert(Op.getValueType().isVector() && "Expected bitcast to vector");		assert(Op.getValueType().isVector() && "Expected bitcast to vector");

// Collect known bits for the (smaller) output by collecting the known		// Collect known bits for the (smaller) output by collecting the known
// bits from the overlapping larger input elements and extracting the		// bits from the overlapping larger input elements and extracting the
// sub sections we actually care about.		// sub sections we actually care about.
unsigned SubScale = SubBitWidth / BitWidth;		unsigned SubScale = SubBitWidth / BitWidth;
APInt SubDemandedElts(NumElts / SubScale, 0);		APInt SubDemandedElts(NumElts / SubScale, 0);
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
if (DemandedElts[i])		if (DemandedElts[i])
SubDemandedElts.setBit(i / SubScale);		SubDemandedElts.setBit(i / SubScale);

computeKnownBits(N0, Known2, SubDemandedElts, Depth + 1);		computeKnownBits(N0, Known2, SubDemandedElts, AnyToZeroExtLoads,
		Depth + 1);

Known.Zero.setAllBits(); Known.One.setAllBits();		Known.Zero.setAllBits(); Known.One.setAllBits();
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
if (DemandedElts[i]) {		if (DemandedElts[i]) {
unsigned Offset = (i % SubScale) * BitWidth;		unsigned Offset = (i % SubScale) * BitWidth;
Known.One &= Known2.One.lshr(Offset).trunc(BitWidth);		Known.One &= Known2.One.lshr(Offset).trunc(BitWidth);
Known.Zero &= Known2.Zero.lshr(Offset).trunc(BitWidth);		Known.Zero &= Known2.Zero.lshr(Offset).trunc(BitWidth);
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
}		}
}		}
break;		break;
}		}
case ISD::AND:		case ISD::AND:
// If either the LHS or the RHS are Zero, the result is zero.		// If either the LHS or the RHS are Zero, the result is zero.
computeKnownBits(Op.getOperand(1), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// Output known-1 bits are only known if set in both the LHS & RHS.		// Output known-1 bits are only known if set in both the LHS & RHS.
Known.One &= Known2.One;		Known.One &= Known2.One;
// Output known-0 are known to be clear if zero in either the LHS \| RHS.		// Output known-0 are known to be clear if zero in either the LHS \| RHS.
Known.Zero \|= Known2.Zero;		Known.Zero \|= Known2.Zero;
break;		break;
case ISD::OR:		case ISD::OR:
computeKnownBits(Op.getOperand(1), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// Output known-0 bits are only known if clear in both the LHS & RHS.		// Output known-0 bits are only known if clear in both the LHS & RHS.
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
// Output known-1 are known to be set if set in either the LHS \| RHS.		// Output known-1 are known to be set if set in either the LHS \| RHS.
Known.One \|= Known2.One;		Known.One \|= Known2.One;
break;		break;
case ISD::XOR: {		case ISD::XOR: {
computeKnownBits(Op.getOperand(1), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// Output known-0 bits are known if clear or set in both the LHS & RHS.		// Output known-0 bits are known if clear or set in both the LHS & RHS.
APInt KnownZeroOut = (Known.Zero & Known2.Zero) \| (Known.One & Known2.One);		APInt KnownZeroOut = (Known.Zero & Known2.Zero) \| (Known.One & Known2.One);
// Output known-1 are known to be set if set in only one of the LHS, RHS.		// Output known-1 are known to be set if set in only one of the LHS, RHS.
Known.One = (Known.Zero & Known2.One) \| (Known.One & Known2.Zero);		Known.One = (Known.Zero & Known2.One) \| (Known.One & Known2.Zero);
Known.Zero = KnownZeroOut;		Known.Zero = KnownZeroOut;
break;		break;
}		}
case ISD::MUL: {		case ISD::MUL: {
computeKnownBits(Op.getOperand(1), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// If low bits are zero in either operand, output low known-0 bits.		// If low bits are zero in either operand, output low known-0 bits.
// Also compute a conservative estimate for high known-0 bits.		// Also compute a conservative estimate for high known-0 bits.
// More trickiness is possible, but this is sufficient for the		// More trickiness is possible, but this is sufficient for the
// interesting case of alignment computation.		// interesting case of alignment computation.
unsigned TrailZ = Known.countMinTrailingZeros() +		unsigned TrailZ = Known.countMinTrailingZeros() +
Known2.countMinTrailingZeros();		Known2.countMinTrailingZeros();
unsigned LeadZ = std::max(Known.countMinLeadingZeros() +		unsigned LeadZ = std::max(Known.countMinLeadingZeros() +
Known2.countMinLeadingZeros(),		Known2.countMinLeadingZeros(),
BitWidth) - BitWidth;		BitWidth) - BitWidth;

Known.resetAll();		Known.resetAll();
Known.Zero.setLowBits(std::min(TrailZ, BitWidth));		Known.Zero.setLowBits(std::min(TrailZ, BitWidth));
Known.Zero.setHighBits(std::min(LeadZ, BitWidth));		Known.Zero.setHighBits(std::min(LeadZ, BitWidth));
break;		break;
}		}
case ISD::UDIV: {		case ISD::UDIV: {
// For the purposes of computing leading zeros we can conservatively		// For the purposes of computing leading zeros we can conservatively
// treat a udiv as a logical right shift by the power of 2 known to		// treat a udiv as a logical right shift by the power of 2 known to
// be less than the denominator.		// be less than the denominator.
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned LeadZ = Known2.countMinLeadingZeros();		unsigned LeadZ = Known2.countMinLeadingZeros();

computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned RHSMaxLeadingZeros = Known2.countMaxLeadingZeros();		unsigned RHSMaxLeadingZeros = Known2.countMaxLeadingZeros();
if (RHSMaxLeadingZeros != BitWidth)		if (RHSMaxLeadingZeros != BitWidth)
LeadZ = std::min(BitWidth, LeadZ + BitWidth - RHSMaxLeadingZeros - 1);		LeadZ = std::min(BitWidth, LeadZ + BitWidth - RHSMaxLeadingZeros - 1);

Known.Zero.setHighBits(LeadZ);		Known.Zero.setHighBits(LeadZ);
break;		break;
}		}
case ISD::SELECT:		case ISD::SELECT:
case ISD::VSELECT:		case ISD::VSELECT:
computeKnownBits(Op.getOperand(2), Known, DemandedElts, Depth+1);		computeKnownBits(Op.getOperand(2), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth+1);		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// Only known if known in both the LHS and RHS.		// Only known if known in both the LHS and RHS.
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
break;		break;
case ISD::SELECT_CC:		case ISD::SELECT_CC:
computeKnownBits(Op.getOperand(3), Known, DemandedElts, Depth+1);		computeKnownBits(Op.getOperand(3), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
computeKnownBits(Op.getOperand(2), Known2, DemandedElts, Depth+1);		computeKnownBits(Op.getOperand(2), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// Only known if known in both the LHS and RHS.		// Only known if known in both the LHS and RHS.
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
break;		break;
case ISD::SMULO:		case ISD::SMULO:
case ISD::UMULO:		case ISD::UMULO:
case ISD::ATOMIC_CMP_SWAP_WITH_SUCCESS:		case ISD::ATOMIC_CMP_SWAP_WITH_SUCCESS:
Show All 12 Lines	case ISD::SETCC:
// If we know the result of a setcc has the top bits zero, use this info.		// If we know the result of a setcc has the top bits zero, use this info.
if (TLI->getBooleanContents(Op.getOperand(0).getValueType()) ==		if (TLI->getBooleanContents(Op.getOperand(0).getValueType()) ==
TargetLowering::ZeroOrOneBooleanContent &&		TargetLowering::ZeroOrOneBooleanContent &&
BitWidth > 1)		BitWidth > 1)
Known.Zero.setBitsFrom(1);		Known.Zero.setBitsFrom(1);
break;		break;
case ISD::SHL:		case ISD::SHL:
if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {		if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned Shift = ShAmt->getZExtValue();		unsigned Shift = ShAmt->getZExtValue();
Known.Zero <<= Shift;		Known.Zero <<= Shift;
Known.One <<= Shift;		Known.One <<= Shift;
// Low bits are known zero.		// Low bits are known zero.
Known.Zero.setLowBits(Shift);		Known.Zero.setLowBits(Shift);
}		}
break;		break;
case ISD::SRL:		case ISD::SRL:
if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {		if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned Shift = ShAmt->getZExtValue();		unsigned Shift = ShAmt->getZExtValue();
Known.Zero.lshrInPlace(Shift);		Known.Zero.lshrInPlace(Shift);
Known.One.lshrInPlace(Shift);		Known.One.lshrInPlace(Shift);
// High bits are known zero.		// High bits are known zero.
Known.Zero.setHighBits(Shift);		Known.Zero.setHighBits(Shift);
} else if (auto *BV = dyn_cast<BuildVectorSDNode>(Op.getOperand(1))) {		} else if (auto *BV = dyn_cast<BuildVectorSDNode>(Op.getOperand(1))) {
// If the shift amount is a vector of constants see if we can bound		// If the shift amount is a vector of constants see if we can bound
// the number of upper zero bits.		// the number of upper zero bits.
Show All 12 Lines	if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {
break;		break;
}		}

Known.Zero.setHighBits(ShiftAmountMin);		Known.Zero.setHighBits(ShiftAmountMin);
}		}
break;		break;
case ISD::SRA:		case ISD::SRA:
if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {		if (const APInt *ShAmt = getValidShiftAmountConstant(Op)) {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned Shift = ShAmt->getZExtValue();		unsigned Shift = ShAmt->getZExtValue();
// Sign extend known zero/one bit (else is unknown).		// Sign extend known zero/one bit (else is unknown).
Known.Zero.ashrInPlace(Shift);		Known.Zero.ashrInPlace(Shift);
Known.One.ashrInPlace(Shift);		Known.One.ashrInPlace(Shift);
}		}
break;		break;
case ISD::SIGN_EXTEND_INREG: {		case ISD::SIGN_EXTEND_INREG: {
EVT EVT = cast<VTSDNode>(Op.getOperand(1))->getVT();		EVT EVT = cast<VTSDNode>(Op.getOperand(1))->getVT();
unsigned EBits = EVT.getScalarSizeInBits();		unsigned EBits = EVT.getScalarSizeInBits();

// Sign extension. Compute the demanded bits in the result that are not		// Sign extension. Compute the demanded bits in the result that are not
// present in the input.		// present in the input.
APInt NewBits = APInt::getHighBitsSet(BitWidth, BitWidth - EBits);		APInt NewBits = APInt::getHighBitsSet(BitWidth, BitWidth - EBits);

APInt InSignMask = APInt::getSignMask(EBits);		APInt InSignMask = APInt::getSignMask(EBits);
APInt InputDemandedBits = APInt::getLowBitsSet(BitWidth, EBits);		APInt InputDemandedBits = APInt::getLowBitsSet(BitWidth, EBits);

// If the sign extended bits are demanded, we know that the sign		// If the sign extended bits are demanded, we know that the sign
// bit is demanded.		// bit is demanded.
InSignMask = InSignMask.zext(BitWidth);		InSignMask = InSignMask.zext(BitWidth);
if (NewBits.getBoolValue())		if (NewBits.getBoolValue())
InputDemandedBits \|= InSignMask;		InputDemandedBits \|= InSignMask;

computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known.One &= InputDemandedBits;		Known.One &= InputDemandedBits;
Known.Zero &= InputDemandedBits;		Known.Zero &= InputDemandedBits;

// If the sign bit of the input is known set or clear, then we know the		// If the sign bit of the input is known set or clear, then we know the
// top bits of the result.		// top bits of the result.
if (Known.Zero.intersects(InSignMask)) { // Input sign bit known clear		if (Known.Zero.intersects(InSignMask)) { // Input sign bit known clear
Known.Zero \|= NewBits;		Known.Zero \|= NewBits;
Known.One &= ~NewBits;		Known.One &= ~NewBits;
} else if (Known.One.intersects(InSignMask)) { // Input sign bit known set		} else if (Known.One.intersects(InSignMask)) { // Input sign bit known set
Known.One \|= NewBits;		Known.One \|= NewBits;
Known.Zero &= ~NewBits;		Known.Zero &= ~NewBits;
} else { // Input sign bit unknown		} else { // Input sign bit unknown
Known.Zero &= ~NewBits;		Known.Zero &= ~NewBits;
Known.One &= ~NewBits;		Known.One &= ~NewBits;
}		}
break;		break;
}		}
case ISD::CTTZ:		case ISD::CTTZ:
case ISD::CTTZ_ZERO_UNDEF: {		case ISD::CTTZ_ZERO_UNDEF: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If we have a known 1, its position is our upper bound.		// If we have a known 1, its position is our upper bound.
unsigned PossibleTZ = Known2.countMaxTrailingZeros();		unsigned PossibleTZ = Known2.countMaxTrailingZeros();
unsigned LowBits = Log2_32(PossibleTZ) + 1;		unsigned LowBits = Log2_32(PossibleTZ) + 1;
Known.Zero.setBitsFrom(LowBits);		Known.Zero.setBitsFrom(LowBits);
break;		break;
}		}
case ISD::CTLZ:		case ISD::CTLZ:
case ISD::CTLZ_ZERO_UNDEF: {		case ISD::CTLZ_ZERO_UNDEF: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If we have a known 1, its position is our upper bound.		// If we have a known 1, its position is our upper bound.
unsigned PossibleLZ = Known2.countMaxLeadingZeros();		unsigned PossibleLZ = Known2.countMaxLeadingZeros();
unsigned LowBits = Log2_32(PossibleLZ) + 1;		unsigned LowBits = Log2_32(PossibleLZ) + 1;
Known.Zero.setBitsFrom(LowBits);		Known.Zero.setBitsFrom(LowBits);
break;		break;
}		}
case ISD::CTPOP: {		case ISD::CTPOP: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If we know some of the bits are zero, they can't be one.		// If we know some of the bits are zero, they can't be one.
unsigned PossibleOnes = Known2.countMaxPopulation();		unsigned PossibleOnes = Known2.countMaxPopulation();
Known.Zero.setBitsFrom(Log2_32(PossibleOnes) + 1);		Known.Zero.setBitsFrom(Log2_32(PossibleOnes) + 1);
break;		break;
}		}
case ISD::LOAD: {		case ISD::LOAD: {
LoadSDNode *LD = cast<LoadSDNode>(Op);		LoadSDNode *LD = cast<LoadSDNode>(Op);
// If this is a ZEXTLoad and we are looking at the loaded value.		// If this is a ZEXTLoad and we are looking at the loaded value.
if (ISD::isZEXTLoad(Op.getNode()) && Op.getResNo() == 0) {		if (ISD::isZEXTLoad(Op.getNode()) && Op.getResNo() == 0) {
EVT VT = LD->getMemoryVT();		EVT VT = LD->getMemoryVT();
unsigned MemBits = VT.getScalarSizeInBits();		unsigned MemBits = VT.getScalarSizeInBits();
Known.Zero.setBitsFrom(MemBits);		Known.Zero.setBitsFrom(MemBits);
		// If this is a EXTLoad and we are passed AnyToZeroExtLoads, treat
		// the load as zero extended.
		} else if (AnyToZeroExtLoads && ISD::isEXTLoad(Op.getNode()) &&
		Op.getResNo() == 0) {
		EVT VT = LD->getMemoryVT();
		unsigned MemBits = VT.getScalarSizeInBits();
		Known.Zero.setBitsFrom(MemBits);
		AnyToZeroExtLoads->insert(LD);
} else if (const MDNode *Ranges = LD->getRanges()) {		} else if (const MDNode *Ranges = LD->getRanges()) {
if (LD->getExtensionType() == ISD::NON_EXTLOAD)		if (LD->getExtensionType() == ISD::NON_EXTLOAD)
computeKnownBitsFromRangeMetadata(*Ranges, Known);		computeKnownBitsFromRangeMetadata(*Ranges, Known);
}		}
break;		break;
}		}
case ISD::ZERO_EXTEND_VECTOR_INREG: {		case ISD::ZERO_EXTEND_VECTOR_INREG: {
EVT InVT = Op.getOperand(0).getValueType();		EVT InVT = Op.getOperand(0).getValueType();
APInt InDemandedElts = DemandedElts.zext(InVT.getVectorNumElements());		APInt InDemandedElts = DemandedElts.zext(InVT.getVectorNumElements());
computeKnownBits(Op.getOperand(0), Known, InDemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, InDemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known = Known.zext(BitWidth);		Known = Known.zext(BitWidth);
Known.Zero.setBitsFrom(InVT.getScalarSizeInBits());		Known.Zero.setBitsFrom(InVT.getScalarSizeInBits());
break;		break;
}		}
case ISD::ZERO_EXTEND: {		case ISD::ZERO_EXTEND: {
EVT InVT = Op.getOperand(0).getValueType();		EVT InVT = Op.getOperand(0).getValueType();
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known = Known.zext(BitWidth);		Known = Known.zext(BitWidth);
Known.Zero.setBitsFrom(InVT.getScalarSizeInBits());		Known.Zero.setBitsFrom(InVT.getScalarSizeInBits());
break;		break;
}		}
// TODO ISD::SIGN_EXTEND_VECTOR_INREG		// TODO ISD::SIGN_EXTEND_VECTOR_INREG
case ISD::SIGN_EXTEND: {		case ISD::SIGN_EXTEND: {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
// If the sign bit is known to be zero or one, then sext will extend		// If the sign bit is known to be zero or one, then sext will extend
// it to the top bits, else it will just zext.		// it to the top bits, else it will just zext.
Known = Known.sext(BitWidth);		Known = Known.sext(BitWidth);
break;		break;
}		}
case ISD::ANY_EXTEND: {		case ISD::ANY_EXTEND: {
computeKnownBits(Op.getOperand(0), Known, Depth+1);		computeKnownBits(Op.getOperand(0), Known, AnyToZeroExtLoads, Depth + 1);
Known = Known.zext(BitWidth);		Known = Known.zext(BitWidth);
break;		break;
}		}
case ISD::TRUNCATE: {		case ISD::TRUNCATE: {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known = Known.trunc(BitWidth);		Known = Known.trunc(BitWidth);
break;		break;
}		}
case ISD::AssertZext: {		case ISD::AssertZext: {
EVT VT = cast<VTSDNode>(Op.getOperand(1))->getVT();		EVT VT = cast<VTSDNode>(Op.getOperand(1))->getVT();
APInt InMask = APInt::getLowBitsSet(BitWidth, VT.getSizeInBits());		APInt InMask = APInt::getLowBitsSet(BitWidth, VT.getSizeInBits());
computeKnownBits(Op.getOperand(0), Known, Depth+1);		computeKnownBits(Op.getOperand(0), Known, AnyToZeroExtLoads, Depth + 1);
Known.Zero \|= (~InMask);		Known.Zero \|= (~InMask);
Known.One &= (~Known.Zero);		Known.One &= (~Known.Zero);
break;		break;
}		}
case ISD::FGETSIGN:		case ISD::FGETSIGN:
// All bits are zero except the low bit.		// All bits are zero except the low bit.
Known.Zero.setBitsFrom(1);		Known.Zero.setBitsFrom(1);
break;		break;
Show All 14 Lines	if (ConstantSDNode *CLHS = isConstOrConstSplat(Op.getOperand(0))) {
// We know that the top bits of C-X are clear if X contains less bits		// We know that the top bits of C-X are clear if X contains less bits
// than C (i.e. no wrap-around can happen). For example, 20-X is		// than C (i.e. no wrap-around can happen). For example, 20-X is
// positive if we can prove that X is >= 0 and < 16.		// positive if we can prove that X is >= 0 and < 16.
if (CLHS->getAPIntValue().isNonNegative()) {		if (CLHS->getAPIntValue().isNonNegative()) {
unsigned NLZ = (CLHS->getAPIntValue()+1).countLeadingZeros();		unsigned NLZ = (CLHS->getAPIntValue()+1).countLeadingZeros();
// NLZ can't be BitWidth with no sign bit		// NLZ can't be BitWidth with no sign bit
APInt MaskV = APInt::getHighBitsSet(BitWidth, NLZ+1);		APInt MaskV = APInt::getHighBitsSet(BitWidth, NLZ+1);
computeKnownBits(Op.getOperand(1), Known2, DemandedElts,		computeKnownBits(Op.getOperand(1), Known2, DemandedElts,
Depth + 1);		AnyToZeroExtLoads, Depth + 1);

// If all of the MaskV bits are known to be zero, then we know the		// If all of the MaskV bits are known to be zero, then we know the
// output top bits are zero, because we now know that the output is		// output top bits are zero, because we now know that the output is
// from [0-C].		// from [0-C].
if ((Known2.Zero & MaskV) == MaskV) {		if ((Known2.Zero & MaskV) == MaskV) {
unsigned NLZ2 = CLHS->getAPIntValue().countLeadingZeros();		unsigned NLZ2 = CLHS->getAPIntValue().countLeadingZeros();
// Top bits known zero.		// Top bits known zero.
Known.Zero.setHighBits(NLZ2);		Known.Zero.setHighBits(NLZ2);
}		}
}		}
}		}

// If low bits are know to be zero in both operands, then we know they are		// If low bits are know to be zero in both operands, then we know they are
// going to be 0 in the result. Both addition and complement operations		// going to be 0 in the result. Both addition and complement operations
// preserve the low zero bits.		// preserve the low zero bits.
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned KnownZeroLow = Known2.countMinTrailingZeros();		unsigned KnownZeroLow = Known2.countMinTrailingZeros();
if (KnownZeroLow == 0)		if (KnownZeroLow == 0)
break;		break;

computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
KnownZeroLow = std::min(KnownZeroLow, Known2.countMinTrailingZeros());		KnownZeroLow = std::min(KnownZeroLow, Known2.countMinTrailingZeros());
Known.Zero.setLowBits(KnownZeroLow);		Known.Zero.setLowBits(KnownZeroLow);
break;		break;
}		}
case ISD::UADDO:		case ISD::UADDO:
case ISD::SADDO:		case ISD::SADDO:
case ISD::ADDCARRY:		case ISD::ADDCARRY:
if (Op.getResNo() == 1) {		if (Op.getResNo() == 1) {
Show All 10 Lines	void SelectionDAG::computeKnownBits(
case ISD::ADDE: {		case ISD::ADDE: {
// Output known-0 bits are known if clear or set in both the low clear bits		// Output known-0 bits are known if clear or set in both the low clear bits
// common to both LHS & RHS. For example, 8+(X<<3) is known to have the		// common to both LHS & RHS. For example, 8+(X<<3) is known to have the
// low 3 bits clear.		// low 3 bits clear.
// Output known-0 bits are also known if the top bits of each input are		// Output known-0 bits are also known if the top bits of each input are
// known to be clear. For example, if one input has the top 10 bits clear		// known to be clear. For example, if one input has the top 10 bits clear
// and the other has the top 8 bits clear, we know the top 7 bits of the		// and the other has the top 8 bits clear, we know the top 7 bits of the
// output must be clear.		// output must be clear.
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
unsigned KnownZeroHigh = Known2.countMinLeadingZeros();		unsigned KnownZeroHigh = Known2.countMinLeadingZeros();
unsigned KnownZeroLow = Known2.countMinTrailingZeros();		unsigned KnownZeroLow = Known2.countMinTrailingZeros();

computeKnownBits(Op.getOperand(1), Known2, DemandedElts,		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
Depth + 1);		Depth + 1);
KnownZeroHigh = std::min(KnownZeroHigh, Known2.countMinLeadingZeros());		KnownZeroHigh = std::min(KnownZeroHigh, Known2.countMinLeadingZeros());
KnownZeroLow = std::min(KnownZeroLow, Known2.countMinTrailingZeros());		KnownZeroLow = std::min(KnownZeroLow, Known2.countMinTrailingZeros());

if (Opcode == ISD::ADDE \|\| Opcode == ISD::ADDCARRY) {		if (Opcode == ISD::ADDE \|\| Opcode == ISD::ADDCARRY) {
// With ADDE and ADDCARRY, a carry bit may be added in, so we can only		// With ADDE and ADDCARRY, a carry bit may be added in, so we can only
// use this information if we know (at least) that the low two bits are		// use this information if we know (at least) that the low two bits are
// clear. We then return to the caller that the low bit is unknown but		// clear. We then return to the caller that the low bit is unknown but
// that other bits are known zero.		// that other bits are known zero.
if (KnownZeroLow >= 2)		if (KnownZeroLow >= 2)
Known.Zero.setBits(1, KnownZeroLow);		Known.Zero.setBits(1, KnownZeroLow);
break;		break;
}		}

Known.Zero.setLowBits(KnownZeroLow);		Known.Zero.setLowBits(KnownZeroLow);
if (KnownZeroHigh > 1)		if (KnownZeroHigh > 1)
Known.Zero.setHighBits(KnownZeroHigh - 1);		Known.Zero.setHighBits(KnownZeroHigh - 1);
break;		break;
}		}
case ISD::SREM:		case ISD::SREM:
if (ConstantSDNode *Rem = isConstOrConstSplat(Op.getOperand(1))) {		if (ConstantSDNode *Rem = isConstOrConstSplat(Op.getOperand(1))) {
const APInt &RA = Rem->getAPIntValue().abs();		const APInt &RA = Rem->getAPIntValue().abs();
if (RA.isPowerOf2()) {		if (RA.isPowerOf2()) {
APInt LowBits = RA - 1;		APInt LowBits = RA - 1;
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts,
		AnyToZeroExtLoads, Depth + 1);

// The low bits of the first operand are unchanged by the srem.		// The low bits of the first operand are unchanged by the srem.
Known.Zero = Known2.Zero & LowBits;		Known.Zero = Known2.Zero & LowBits;
Known.One = Known2.One & LowBits;		Known.One = Known2.One & LowBits;

// If the first operand is non-negative or has all low bits zero, then		// If the first operand is non-negative or has all low bits zero, then
// the upper bits are all zero.		// the upper bits are all zero.
if (Known2.Zero[BitWidth-1] \|\| ((Known2.Zero & LowBits) == LowBits))		if (Known2.Zero[BitWidth-1] \|\| ((Known2.Zero & LowBits) == LowBits))
Known.Zero \|= ~LowBits;		Known.Zero \|= ~LowBits;

// If the first operand is negative and not all low bits are zero, then		// If the first operand is negative and not all low bits are zero, then
// the upper bits are all one.		// the upper bits are all one.
if (Known2.One[BitWidth-1] && ((Known2.One & LowBits) != 0))		if (Known2.One[BitWidth-1] && ((Known2.One & LowBits) != 0))
Known.One \|= ~LowBits;		Known.One \|= ~LowBits;
assert((Known.Zero & Known.One) == 0&&"Bits known to be one AND zero?");		assert((Known.Zero & Known.One) == 0&&"Bits known to be one AND zero?");
}		}
}		}
break;		break;
case ISD::UREM: {		case ISD::UREM: {
if (ConstantSDNode *Rem = isConstOrConstSplat(Op.getOperand(1))) {		if (ConstantSDNode *Rem = isConstOrConstSplat(Op.getOperand(1))) {
const APInt &RA = Rem->getAPIntValue();		const APInt &RA = Rem->getAPIntValue();
if (RA.isPowerOf2()) {		if (RA.isPowerOf2()) {
APInt LowBits = (RA - 1);		APInt LowBits = (RA - 1);
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts,
		AnyToZeroExtLoads, Depth + 1);

// The upper bits are all zero, the lower ones are unchanged.		// The upper bits are all zero, the lower ones are unchanged.
Known.Zero = Known2.Zero \| ~LowBits;		Known.Zero = Known2.Zero \| ~LowBits;
Known.One = Known2.One & LowBits;		Known.One = Known2.One & LowBits;
break;		break;
}		}
}		}

// Since the result is less than or equal to either operand, any leading		// Since the result is less than or equal to either operand, any leading
// zero bits in either operand must also exist in the result.		// zero bits in either operand must also exist in the result.
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

uint32_t Leaders =		uint32_t Leaders =
std::max(Known.countMinLeadingZeros(), Known2.countMinLeadingZeros());		std::max(Known.countMinLeadingZeros(), Known2.countMinLeadingZeros());
Known.resetAll();		Known.resetAll();
Known.Zero.setHighBits(Leaders);		Known.Zero.setHighBits(Leaders);
break;		break;
}		}
case ISD::EXTRACT_ELEMENT: {		case ISD::EXTRACT_ELEMENT: {
computeKnownBits(Op.getOperand(0), Known, Depth+1);		computeKnownBits(Op.getOperand(0), Known, AnyToZeroExtLoads, Depth + 1);
const unsigned Index = Op.getConstantOperandVal(1);		const unsigned Index = Op.getConstantOperandVal(1);
const unsigned BitWidth = Op.getValueSizeInBits();		const unsigned BitWidth = Op.getValueSizeInBits();

// Remove low part of known bits mask		// Remove low part of known bits mask
Known.Zero = Known.Zero.getHiBits(Known.Zero.getBitWidth() - Index * BitWidth);		Known.Zero = Known.Zero.getHiBits(Known.Zero.getBitWidth() - Index * BitWidth);
Known.One = Known.One.getHiBits(Known.One.getBitWidth() - Index * BitWidth);		Known.One = Known.One.getHiBits(Known.One.getBitWidth() - Index * BitWidth);

// Remove high part of known bit mask		// Remove high part of known bit mask
Show All 11 Lines	case ISD::EXTRACT_VECTOR_ELT: {
// anything about the extended bits.		// anything about the extended bits.
if (BitWidth > EltBitWidth)		if (BitWidth > EltBitWidth)
Known = Known.trunc(EltBitWidth);		Known = Known.trunc(EltBitWidth);
ConstantSDNode *ConstEltNo = dyn_cast<ConstantSDNode>(EltNo);		ConstantSDNode *ConstEltNo = dyn_cast<ConstantSDNode>(EltNo);
if (ConstEltNo && ConstEltNo->getAPIntValue().ult(NumSrcElts)) {		if (ConstEltNo && ConstEltNo->getAPIntValue().ult(NumSrcElts)) {
// If we know the element index, just demand that vector element.		// If we know the element index, just demand that vector element.
unsigned Idx = ConstEltNo->getZExtValue();		unsigned Idx = ConstEltNo->getZExtValue();
APInt DemandedElt = APInt::getOneBitSet(NumSrcElts, Idx);		APInt DemandedElt = APInt::getOneBitSet(NumSrcElts, Idx);
computeKnownBits(InVec, Known, DemandedElt, Depth + 1);		computeKnownBits(InVec, Known, DemandedElt, AnyToZeroExtLoads, Depth + 1);
} else {		} else {
// Unknown element index, so ignore DemandedElts and demand them all.		// Unknown element index, so ignore DemandedElts and demand them all.
computeKnownBits(InVec, Known, Depth + 1);		computeKnownBits(InVec, Known, AnyToZeroExtLoads, Depth + 1);
}		}
if (BitWidth > EltBitWidth)		if (BitWidth > EltBitWidth)
Known = Known.zext(BitWidth);		Known = Known.zext(BitWidth);
break;		break;
}		}
case ISD::INSERT_VECTOR_ELT: {		case ISD::INSERT_VECTOR_ELT: {
SDValue InVec = Op.getOperand(0);		SDValue InVec = Op.getOperand(0);
SDValue InVal = Op.getOperand(1);		SDValue InVal = Op.getOperand(1);
SDValue EltNo = Op.getOperand(2);		SDValue EltNo = Op.getOperand(2);

ConstantSDNode *CEltNo = dyn_cast<ConstantSDNode>(EltNo);		ConstantSDNode *CEltNo = dyn_cast<ConstantSDNode>(EltNo);
if (CEltNo && CEltNo->getAPIntValue().ult(NumElts)) {		if (CEltNo && CEltNo->getAPIntValue().ult(NumElts)) {
// If we know the element index, split the demand between the		// If we know the element index, split the demand between the
// source vector and the inserted element.		// source vector and the inserted element.
Known.Zero = Known.One = APInt::getAllOnesValue(BitWidth);		Known.Zero = Known.One = APInt::getAllOnesValue(BitWidth);
unsigned EltIdx = CEltNo->getZExtValue();		unsigned EltIdx = CEltNo->getZExtValue();

// If we demand the inserted element then add its common known bits.		// If we demand the inserted element then add its common known bits.
if (DemandedElts[EltIdx]) {		if (DemandedElts[EltIdx]) {
computeKnownBits(InVal, Known2, Depth + 1);		computeKnownBits(InVal, Known2, AnyToZeroExtLoads, Depth + 1);
Known.One &= Known2.One.zextOrTrunc(Known.One.getBitWidth());		Known.One &= Known2.One.zextOrTrunc(Known.One.getBitWidth());
Known.Zero &= Known2.Zero.zextOrTrunc(Known.Zero.getBitWidth());		Known.Zero &= Known2.Zero.zextOrTrunc(Known.Zero.getBitWidth());
}		}

// If we demand the source vector then add its common known bits, ensuring		// If we demand the source vector then add its common known bits, ensuring
// that we don't demand the inserted element.		// that we don't demand the inserted element.
APInt VectorElts = DemandedElts & ~(APInt::getOneBitSet(NumElts, EltIdx));		APInt VectorElts = DemandedElts & ~(APInt::getOneBitSet(NumElts, EltIdx));
if (!!VectorElts) {		if (!!VectorElts) {
computeKnownBits(InVec, Known2, VectorElts, Depth + 1);		computeKnownBits(InVec, Known2, VectorElts, AnyToZeroExtLoads,
		Depth + 1);
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
}		}
} else {		} else {
// Unknown element index, so ignore DemandedElts and demand them all.		// Unknown element index, so ignore DemandedElts and demand them all.
computeKnownBits(InVec, Known, Depth + 1);		computeKnownBits(InVec, Known, AnyToZeroExtLoads, Depth + 1);
computeKnownBits(InVal, Known2, Depth + 1);		computeKnownBits(InVal, Known2, AnyToZeroExtLoads, Depth + 1);
Known.One &= Known2.One.zextOrTrunc(Known.One.getBitWidth());		Known.One &= Known2.One.zextOrTrunc(Known.One.getBitWidth());
Known.Zero &= Known2.Zero.zextOrTrunc(Known.Zero.getBitWidth());		Known.Zero &= Known2.Zero.zextOrTrunc(Known.Zero.getBitWidth());
}		}
break;		break;
}		}
case ISD::BITREVERSE: {		case ISD::BITREVERSE: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero = Known2.Zero.reverseBits();		Known.Zero = Known2.Zero.reverseBits();
Known.One = Known2.One.reverseBits();		Known.One = Known2.One.reverseBits();
break;		break;
}		}
case ISD::BSWAP: {		case ISD::BSWAP: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero = Known2.Zero.byteSwap();		Known.Zero = Known2.Zero.byteSwap();
Known.One = Known2.One.byteSwap();		Known.One = Known2.One.byteSwap();
break;		break;
}		}
case ISD::ABS: {		case ISD::ABS: {
computeKnownBits(Op.getOperand(0), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// If the source's MSB is zero then we know the rest of the bits already.		// If the source's MSB is zero then we know the rest of the bits already.
if (Known2.isNonNegative()) {		if (Known2.isNonNegative()) {
Known.Zero = Known2.Zero;		Known.Zero = Known2.Zero;
Known.One = Known2.One;		Known.One = Known2.One;
break;		break;
}		}

// We only know that the absolute values's MSB will be zero iff there is		// We only know that the absolute values's MSB will be zero iff there is
// a set bit that isn't the sign bit (otherwise it could be INT_MIN).		// a set bit that isn't the sign bit (otherwise it could be INT_MIN).
Known2.One.clearSignBit();		Known2.One.clearSignBit();
if (Known2.One.getBoolValue()) {		if (Known2.One.getBoolValue()) {
Known.Zero = APInt::getSignMask(BitWidth);		Known.Zero = APInt::getSignMask(BitWidth);
break;		break;
}		}
break;		break;
}		}
case ISD::UMIN: {		case ISD::UMIN: {
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);		Depth + 1);
		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);

// UMIN - we know that the result will have the maximum of the		// UMIN - we know that the result will have the maximum of the
// known zero leading bits of the inputs.		// known zero leading bits of the inputs.
unsigned LeadZero = Known.countMinLeadingZeros();		unsigned LeadZero = Known.countMinLeadingZeros();
LeadZero = std::max(LeadZero, Known2.countMinLeadingZeros());		LeadZero = std::max(LeadZero, Known2.countMinLeadingZeros());

Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero.setHighBits(LeadZero);		Known.Zero.setHighBits(LeadZero);
break;		break;
}		}
case ISD::UMAX: {		case ISD::UMAX: {
computeKnownBits(Op.getOperand(0), Known, DemandedElts,		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
Depth + 1);		Depth + 1);
computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);

// UMAX - we know that the result will have the maximum of the		// UMAX - we know that the result will have the maximum of the
// known one leading bits of the inputs.		// known one leading bits of the inputs.
unsigned LeadOne = Known.countMinLeadingOnes();		unsigned LeadOne = Known.countMinLeadingOnes();
LeadOne = std::max(LeadOne, Known2.countMinLeadingOnes());		LeadOne = std::max(LeadOne, Known2.countMinLeadingOnes());

Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
Known.One &= Known2.One;		Known.One &= Known2.One;
Show All 27 Lines	if (CstLow && CstHigh) {
if (ValueLow.isNonNegative() && ValueHigh.isNonNegative()) {		if (ValueLow.isNonNegative() && ValueHigh.isNonNegative()) {
Known.Zero.setHighBits(MinSignBits);		Known.Zero.setHighBits(MinSignBits);
break;		break;
}		}
}		}
}		}

// Fallback - just get the shared known bits of the operands.		// Fallback - just get the shared known bits of the operands.
computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(0), Known, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
if (Known.isUnknown()) break; // Early-out		if (Known.isUnknown()) break; // Early-out
computeKnownBits(Op.getOperand(1), Known2, DemandedElts, Depth + 1);		computeKnownBits(Op.getOperand(1), Known2, DemandedElts, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
Known.One &= Known2.One;		Known.One &= Known2.One;
break;		break;
}		}
case ISD::FrameIndex:		case ISD::FrameIndex:
case ISD::TargetFrameIndex:		case ISD::TargetFrameIndex:
TLI->computeKnownBitsForFrameIndex(Op, Known, DemandedElts, *this, Depth);		TLI->computeKnownBitsForFrameIndex(Op, Known, DemandedElts, *this, Depth);
break;		break;

default:		default:
if (Opcode < ISD::BUILTIN_OP_END)		if (Opcode < ISD::BUILTIN_OP_END)
break;		break;
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
case ISD::INTRINSIC_WO_CHAIN:		case ISD::INTRINSIC_WO_CHAIN:
case ISD::INTRINSIC_W_CHAIN:		case ISD::INTRINSIC_W_CHAIN:
case ISD::INTRINSIC_VOID:		case ISD::INTRINSIC_VOID:
// Allow the target to implement this method for its nodes.		// Allow the target to implement this method for its nodes.
TLI->computeKnownBitsForTargetNode(Op, Known, DemandedElts, *this, Depth);		TLI->computeKnownBitsForTargetNode(Op, Known, DemandedElts, *this,
		AnyToZeroExtLoads, Depth);
break;		break;
}		}

assert(!Known.hasConflict() && "Bits known to be one AND zero?");		assert(!Known.hasConflict() && "Bits known to be one AND zero?");
}		}

SelectionDAG::OverflowKind SelectionDAG::computeOverflowKind(SDValue N0,		SelectionDAG::OverflowKind SelectionDAG::computeOverflowKind(SDValue N0,
SDValue N1) const {		SDValue N1) const {
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	case ISD::ADDC:
// is, at worst, one more bit than the inputs.		// is, at worst, one more bit than the inputs.
Tmp = ComputeNumSignBits(Op.getOperand(0), Depth+1);		Tmp = ComputeNumSignBits(Op.getOperand(0), Depth+1);
if (Tmp == 1) return 1; // Early out.		if (Tmp == 1) return 1; // Early out.

// Special case decrementing a value (ADD X, -1):		// Special case decrementing a value (ADD X, -1):
if (ConstantSDNode *CRHS = dyn_cast<ConstantSDNode>(Op.getOperand(1)))		if (ConstantSDNode *CRHS = dyn_cast<ConstantSDNode>(Op.getOperand(1)))
if (CRHS->isAllOnesValue()) {		if (CRHS->isAllOnesValue()) {
KnownBits Known;		KnownBits Known;
computeKnownBits(Op.getOperand(0), Known, Depth+1);		computeKnownBits(Op.getOperand(0), Known, nullptr, Depth + 1);

// If the input is known to be 0 or 1, the output is 0/-1, which is all		// If the input is known to be 0 or 1, the output is 0/-1, which is all
// sign bits set.		// sign bits set.
if ((Known.Zero \| 1).isAllOnesValue())		if ((Known.Zero \| 1).isAllOnesValue())
return VTBits;		return VTBits;

// If we are subtracting one from a positive number, there is no carry		// If we are subtracting one from a positive number, there is no carry
// out of the result.		// out of the result.
if (Known.isNonNegative())		if (Known.isNonNegative())
return Tmp;		return Tmp;
}		}

Tmp2 = ComputeNumSignBits(Op.getOperand(1), Depth+1);		Tmp2 = ComputeNumSignBits(Op.getOperand(1), Depth+1);
if (Tmp2 == 1) return 1;		if (Tmp2 == 1) return 1;
return std::min(Tmp, Tmp2)-1;		return std::min(Tmp, Tmp2)-1;

case ISD::SUB:		case ISD::SUB:
Tmp2 = ComputeNumSignBits(Op.getOperand(1), Depth+1);		Tmp2 = ComputeNumSignBits(Op.getOperand(1), Depth+1);
if (Tmp2 == 1) return 1;		if (Tmp2 == 1) return 1;

// Handle NEG.		// Handle NEG.
if (ConstantSDNode *CLHS = isConstOrConstSplat(Op.getOperand(0)))		if (ConstantSDNode *CLHS = isConstOrConstSplat(Op.getOperand(0)))
if (CLHS->isNullValue()) {		if (CLHS->isNullValue()) {
KnownBits Known;		KnownBits Known;
computeKnownBits(Op.getOperand(1), Known, Depth+1);		computeKnownBits(Op.getOperand(1), Known, nullptr, Depth + 1);
// If the input is known to be 0 or 1, the output is 0/-1, which is all		// If the input is known to be 0 or 1, the output is 0/-1, which is all
// sign bits set.		// sign bits set.
if ((Known.Zero \| 1).isAllOnesValue())		if ((Known.Zero \| 1).isAllOnesValue())
return VTBits;		return VTBits;

// If the input is known to be positive (the sign bit is known clear),		// If the input is known to be positive (the sign bit is known clear),
// the output of the NEG has the same number of sign bits as the input.		// the output of the NEG has the same number of sign bits as the input.
if (Known.isNonNegative())		if (Known.isNonNegative())
▲ Show 20 Lines • Show All 149 Lines • ▼ Show 20 Lines	unsigned NumBits =
TLI->ComputeNumSignBitsForTargetNode(Op, DemandedElts, *this, Depth);		TLI->ComputeNumSignBitsForTargetNode(Op, DemandedElts, *this, Depth);
if (NumBits > 1)		if (NumBits > 1)
FirstAnswer = std::max(FirstAnswer, NumBits);		FirstAnswer = std::max(FirstAnswer, NumBits);
}		}

// Finally, if we can prove that the top bits of the result are 0's or 1's,		// Finally, if we can prove that the top bits of the result are 0's or 1's,
// use this information.		// use this information.
KnownBits Known;		KnownBits Known;
computeKnownBits(Op, Known, DemandedElts, Depth);		computeKnownBits(Op, Known, DemandedElts, nullptr, Depth);

APInt Mask;		APInt Mask;
if (Known.isNonNegative()) { // sign bit is 0		if (Known.isNonNegative()) { // sign bit is 0
Mask = Known.Zero;		Mask = Known.Zero;
} else if (Known.isNegative()) { // sign bit is 1;		} else if (Known.isNegative()) { // sign bit is 1;
Mask = Known.One;		Mask = Known.One;
} else {		} else {
// Nothing known.		// Nothing known.
▲ Show 20 Lines • Show All 4,823 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/TargetLowering.cpp

Show First 20 Lines • Show All 523 Lines • ▼ Show 20 Lines	bool TargetLowering::SimplifyDemandedBits(SDValue Op,
}		}

// Other users may use these bits.		// Other users may use these bits.
EVT VT = Op.getValueType();		EVT VT = Op.getValueType();
if (!Op.getNode()->hasOneUse() && !AssumeSingleUse) {		if (!Op.getNode()->hasOneUse() && !AssumeSingleUse) {
if (Depth != 0) {		if (Depth != 0) {
// If not at the root, Just compute the Known bits to		// If not at the root, Just compute the Known bits to
// simplify things downstream.		// simplify things downstream.
TLO.DAG.computeKnownBits(Op, Known, Depth);		TLO.DAG.computeKnownBits(Op, Known, nullptr, Depth);
return false;		return false;
}		}
// If this is the root being simplified, allow it to have multiple uses,		// If this is the root being simplified, allow it to have multiple uses,
// just set the NewMask to all bits.		// just set the NewMask to all bits.
NewMask = APInt::getAllOnesValue(BitWidth);		NewMask = APInt::getAllOnesValue(BitWidth);
} else if (DemandedMask == 0) {		} else if (DemandedMask == 0) {
// Not demanding any bits from Op.		// Not demanding any bits from Op.
if (!Op.isUndef())		if (!Op.isUndef())
Show All 34 Lines	case ISD::AND:
// If the RHS is a constant, check to see if the LHS would be zero without		// If the RHS is a constant, check to see if the LHS would be zero without
// using the bits from the RHS. Below, we use knowledge about the RHS to		// using the bits from the RHS. Below, we use knowledge about the RHS to
// simplify the LHS, here we're using information from the LHS to simplify		// simplify the LHS, here we're using information from the LHS to simplify
// the RHS.		// the RHS.
if (ConstantSDNode *RHSC = isConstOrConstSplat(Op.getOperand(1))) {		if (ConstantSDNode *RHSC = isConstOrConstSplat(Op.getOperand(1))) {
SDValue Op0 = Op.getOperand(0);		SDValue Op0 = Op.getOperand(0);
KnownBits LHSKnown;		KnownBits LHSKnown;
// Do not increment Depth here; that can cause an infinite loop.		// Do not increment Depth here; that can cause an infinite loop.
TLO.DAG.computeKnownBits(Op0, LHSKnown, Depth);		TLO.DAG.computeKnownBits(Op0, LHSKnown, nullptr, Depth);
// If the LHS already has zeros where RHSC does, this 'and' is dead.		// If the LHS already has zeros where RHSC does, this 'and' is dead.
if ((LHSKnown.Zero & NewMask) == (~RHSC->getAPIntValue() & NewMask))		if ((LHSKnown.Zero & NewMask) == (~RHSC->getAPIntValue() & NewMask))
return TLO.CombineTo(Op, Op0);		return TLO.CombineTo(Op, Op0);

// If any of the set bits in the RHS are known zero on the LHS, shrink		// If any of the set bits in the RHS are known zero on the LHS, shrink
// the constant.		// the constant.
if (ShrinkDemandedConstant(Op, ~LHSKnown.Zero & NewMask, TLO))		if (ShrinkDemandedConstant(Op, ~LHSKnown.Zero & NewMask, TLO))
return true;		return true;
▲ Show 20 Lines • Show All 607 Lines • ▼ Show 20 Lines	if (!TLO.LegalOperations() && !VT.isVector() &&
unsigned ShVal = Op.getValueSizeInBits() - 1;		unsigned ShVal = Op.getValueSizeInBits() - 1;
SDValue ShAmt = TLO.DAG.getConstant(ShVal, dl, VT);		SDValue ShAmt = TLO.DAG.getConstant(ShVal, dl, VT);
return TLO.CombineTo(Op, TLO.DAG.getNode(ISD::SHL, dl, VT, Sign, ShAmt));		return TLO.CombineTo(Op, TLO.DAG.getNode(ISD::SHL, dl, VT, Sign, ShAmt));
}		}
}		}
// If this is a bitcast, let computeKnownBits handle it. Only do this on a		// If this is a bitcast, let computeKnownBits handle it. Only do this on a
// recursive call where Known may be useful to the caller.		// recursive call where Known may be useful to the caller.
if (Depth > 0) {		if (Depth > 0) {
TLO.DAG.computeKnownBits(Op, Known, Depth);		TLO.DAG.computeKnownBits(Op, Known, nullptr, Depth);
return false;		return false;
}		}
break;		break;
case ISD::ADD:		case ISD::ADD:
case ISD::MUL:		case ISD::MUL:
case ISD::SUB: {		case ISD::SUB: {
// Add, Sub, and Mul don't demand any bits in positions beyond that		// Add, Sub, and Mul don't demand any bits in positions beyond that
// of the highest bit demanded of them.		// of the highest bit demanded of them.
Show All 36 Lines	if (C && !C->isAllOnesValue() && !C->isOne() &&
SDValue NewOp = TLO.DAG.getNode(Op.getOpcode(), dl, VT, Op0, Neg1, Flags);		SDValue NewOp = TLO.DAG.getNode(Op.getOpcode(), dl, VT, Op0, Neg1, Flags);
return TLO.CombineTo(Op, NewOp);		return TLO.CombineTo(Op, NewOp);
}		}

LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
}		}
default:		default:
// Just use computeKnownBits to compute output bits.		// Just use computeKnownBits to compute output bits.
TLO.DAG.computeKnownBits(Op, Known, Depth);		TLO.DAG.computeKnownBits(Op, Known, nullptr, Depth);
break;		break;
}		}

// If we know the value of all of the demanded bits, return this as a		// If we know the value of all of the demanded bits, return this as a
// constant.		// constant.
if (NewMask.isSubsetOf(Known.Zero\|Known.One)) {		if (NewMask.isSubsetOf(Known.Zero\|Known.One)) {
// Avoid folding to a constant if any OpaqueConstant is involved.		// Avoid folding to a constant if any OpaqueConstant is involved.
const SDNode *N = Op.getNode();		const SDNode *N = Op.getNode();
▲ Show 20 Lines • Show All 295 Lines • ▼ Show 20 Lines	bool TargetLowering::SimplifyDemandedVectorElts(
}		}

assert((KnownUndef & KnownZero) == 0 && "Elements flagged as undef AND zero");		assert((KnownUndef & KnownZero) == 0 && "Elements flagged as undef AND zero");
return false;		return false;
}		}

/// Determine which of the bits specified in Mask are known to be either zero or		/// Determine which of the bits specified in Mask are known to be either zero or
/// one and return them in the Known.		/// one and return them in the Known.
void TargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void TargetLowering::computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
assert((Op.getOpcode() >= ISD::BUILTIN_OP_END \|\|		assert((Op.getOpcode() >= ISD::BUILTIN_OP_END \|\|
Op.getOpcode() == ISD::INTRINSIC_WO_CHAIN \|\|		Op.getOpcode() == ISD::INTRINSIC_WO_CHAIN \|\|
Op.getOpcode() == ISD::INTRINSIC_W_CHAIN \|\|		Op.getOpcode() == ISD::INTRINSIC_W_CHAIN \|\|
Op.getOpcode() == ISD::INTRINSIC_VOID) &&		Op.getOpcode() == ISD::INTRINSIC_VOID) &&
"Should use MaskedValueIsZero if you don't know whether Op"		"Should use MaskedValueIsZero if you don't know whether Op"
" is a target node!");		" is a target node!");
Known.resetAll();		Known.resetAll();
}		}
▲ Show 20 Lines • Show All 1,048 Lines • ▼ Show 20 Lines	if (VT.getScalarType() != MVT::i1) {
DCI.AddToWorklist(N0.getNode());		DCI.AddToWorklist(N0.getNode());
// FIXME: If running after legalize, we probably can't do this.		// FIXME: If running after legalize, we probably can't do this.
ISD::NodeType ExtendCode = getExtendForContent(getBooleanContents(OpVT));		ISD::NodeType ExtendCode = getExtendForContent(getBooleanContents(OpVT));
N0 = DAG.getNode(ExtendCode, dl, VT, N0);		N0 = DAG.getNode(ExtendCode, dl, VT, N0);
}		}
return N0;		return N0;
}		}

		// Test if both operands are AND's, with the same mask, and the
		// non-mask bits are the same (often all zero). In this case we
		// can drop the ands.
		if (N0.getOpcode() == ISD::AND && N1.getOpcode() == ISD::AND &&
		N0.getNode()->hasOneUse() && N1.getNode()->hasOneUse()) {
		SDValue N0LHS = N0.getOperand(0);
		SDValue N0RHS = N0.getOperand(1);
		SDValue N1LHS = N1.getOperand(0);
		SDValue N1RHS = N1.getOperand(1);

		if (isa<ConstantSDNode>(N0LHS))
		std::swap(N0LHS, N0RHS);
		if (isa<ConstantSDNode>(N1LHS))
		std::swap(N1LHS, N1RHS);

		if (!isa<ConstantSDNode>(N0RHS) \|\| !isa<ConstantSDNode>(N1RHS))
		return SDValue();

		APInt AndMask = cast<ConstantSDNode>(N0RHS)->getAPIntValue();
		if (cast<ConstantSDNode>(N1RHS)->getAPIntValue() != AndMask)
		return SDValue();

		KnownBits KB0, KB1;
		llvm::SmallPtrSet<LoadSDNode *, 4> AnyToZeroExtLoads;
		DAG.computeKnownBits(N0LHS, KB0, &AnyToZeroExtLoads);
		DAG.computeKnownBits(N1LHS, KB1, &AnyToZeroExtLoads);

		// Check we know something about all bits
		if (!(KB0.Zero \| KB0.One \| AndMask).isAllOnesValue())
		return SDValue();

		// All non-mask bits must be the same from N0 and N1.
		if ((KB0.Zero & ~AndMask) != (KB1.Zero & ~AndMask) \|\|
		(KB0.One & ~AndMask) != (KB1.One & ~AndMask))
		return SDValue();

		// Transform anyext loads -> zeroext loads
		for (auto *LD : AnyToZeroExtLoads) {
		SDValue Load = DAG.getExtLoad(
		ISD::ZEXTLOAD, SDLoc(LD), LD->getValueType(0), LD->getChain(),
		LD->getBasePtr(), LD->getPointerInfo(), LD->getMemoryVT(),
		LD->getAlignment(), LD->getMemOperand()->getFlags(), LD->getAAInfo());
		DAG.ReplaceAllUsesOfValueWith(SDValue(LD, 0), Load);
		samparkerUnsubmitted Not Done Reply Inline Actions Is this safe? Shouldn't the load be checked for a single use or that all the uses on your known bits path? samparker: Is this safe? Shouldn't the load be checked for a single use or that all the uses on your known…
		dmgreenAuthorUnsubmitted Not Done Reply Inline Actions I don't know a huge amount about the internals of selection dag. My understanding was that if they are anyext then we cannot presume the values of the bits, but then neither can anything else. So long as we set the values of the bits (as we do here), then all other optimisations have to then use the bits as 0. If I could have just changed the existing LD's to zeroextends, that would be simpler. But I believe the nodes are immutable. So I don't think we have to check for single uses as the only bits we are changing are the upper bits from unspecified to 0's. Which isn't something that anything else can rely upon, without itself doing the unspecified -> set transform. dmgreen: I don't know a huge amount about the internals of selection dag. My understanding was that if…
		if (LD == N0LHS.getNode())
		N0LHS = Load;
		if (LD == N1LHS.getNode())
		N1LHS = Load;
		}

		return DAG.getSetCC(dl, VT, N0LHS, N1LHS, Cond);
		}

// Could not fold it.		// Could not fold it.
return SDValue();		return SDValue();
}		}

/// Returns true (and the GlobalValue and the offset) if the node is a		/// Returns true (and the GlobalValue and the offset) if the node is a
/// GlobalAddress + offset.		/// GlobalAddress + offset.
bool TargetLowering::isGAPlusOffset(SDNode N, const GlobalValue &GA,		bool TargetLowering::isGAPlusOffset(SDNode N, const GlobalValue &GA,
int64_t &Offset) const {		int64_t &Offset) const {
▲ Show 20 Lines • Show All 1,577 Lines • Show Last 20 Lines

lib/Target/AArch64/AArch64ISelLowering.h

Show First 20 Lines • Show All 244 Lines • ▼ Show 20 Lines	public:
/// Selects the correct CCAssignFn for a given CallingConvention value.		/// Selects the correct CCAssignFn for a given CallingConvention value.
CCAssignFn *CCAssignFnForCall(CallingConv::ID CC, bool IsVarArg) const;		CCAssignFn *CCAssignFnForCall(CallingConv::ID CC, bool IsVarArg) const;

/// Selects the correct CCAssignFn for a given CallingConvention value.		/// Selects the correct CCAssignFn for a given CallingConvention value.
CCAssignFn *CCAssignFnForReturn(CallingConv::ID CC) const;		CCAssignFn *CCAssignFnForReturn(CallingConv::ID CC) const;

/// Determine which of the bits specified in Mask are known to be either zero		/// Determine which of the bits specified in Mask are known to be either zero
/// or one and return them in the KnownZero/KnownOne bitsets.		/// or one and return them in the KnownZero/KnownOne bitsets.
void computeKnownBitsForTargetNode(const SDValue Op, KnownBits &Known,		void computeKnownBitsForTargetNode(
const APInt &DemandedElts,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

bool targetShrinkDemandedConstant(SDValue Op, const APInt &Demanded,		bool targetShrinkDemandedConstant(SDValue Op, const APInt &Demanded,
TargetLoweringOpt &TLO) const override;		TargetLoweringOpt &TLO) const override;

MVT getScalarShiftAmountTy(const DataLayout &DL, EVT) const override;		MVT getScalarShiftAmountTy(const DataLayout &DL, EVT) const override;

/// Returns true if the target allows unaligned memory accesses of the		/// Returns true if the target allows unaligned memory accesses of the
/// specified type.		/// specified type.
▲ Show 20 Lines • Show All 407 Lines • Show Last 20 Lines

lib/Target/AArch64/AArch64ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 959 Lines • ▼ Show 20 Lines	if (!C)
return false;		return false;
uint64_t Imm = C->getZExtValue();		uint64_t Imm = C->getZExtValue();
return optimizeLogicalImm(Op, Size, Imm, Demanded, TLO, NewOpc);		return optimizeLogicalImm(Op, Size, Imm, Demanded, TLO, NewOpc);
}		}

/// computeKnownBitsForTargetNode - Determine which of the bits specified in		/// computeKnownBitsForTargetNode - Determine which of the bits specified in
/// Mask are known to be either zero or one and return them Known.		/// Mask are known to be either zero or one and return them Known.
void AArch64TargetLowering::computeKnownBitsForTargetNode(		void AArch64TargetLowering::computeKnownBitsForTargetNode(
const SDValue Op, KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts, const SelectionDAG &DAG, unsigned Depth) const {		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
		unsigned Depth) const {
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default:		default:
break;		break;
case AArch64ISD::CSEL: {		case AArch64ISD::CSEL: {
KnownBits Known2;		KnownBits Known2;
DAG.computeKnownBits(Op->getOperand(0), Known, Depth + 1);		DAG.computeKnownBits(Op->getOperand(0), Known, AnyToZeroExtLoads,
DAG.computeKnownBits(Op->getOperand(1), Known2, Depth + 1);		Depth + 1);
		DAG.computeKnownBits(Op->getOperand(1), Known2, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
Known.One &= Known2.One;		Known.One &= Known2.One;
break;		break;
}		}
case ISD::INTRINSIC_W_CHAIN: {		case ISD::INTRINSIC_W_CHAIN: {
ConstantSDNode *CN = cast<ConstantSDNode>(Op->getOperand(1));		ConstantSDNode *CN = cast<ConstantSDNode>(Op->getOperand(1));
Intrinsic::ID IntID = static_cast<Intrinsic::ID>(CN->getZExtValue());		Intrinsic::ID IntID = static_cast<Intrinsic::ID>(CN->getZExtValue());
switch (IntID) {		switch (IntID) {
▲ Show 20 Lines • Show All 10,155 Lines • Show Last 20 Lines

lib/Target/AMDGPU/AMDGPUISelLowering.h

Show First 20 Lines • Show All 221 Lines • ▼ Show 20 Lines	SDValue getRecipEstimate(SDValue Operand, SelectionDAG &DAG, int Enabled,
int &RefinementSteps) const override;		int &RefinementSteps) const override;

virtual SDNode PostISelFolding(MachineSDNode N,		virtual SDNode PostISelFolding(MachineSDNode N,
SelectionDAG &DAG) const = 0;		SelectionDAG &DAG) const = 0;

/// \brief Determine which of the bits specified in \p Mask are known to be		/// \brief Determine which of the bits specified in \p Mask are known to be
/// either zero or one and return them in the \p KnownZero and \p KnownOne		/// either zero or one and return them in the \p KnownZero and \p KnownOne
/// bitsets.		/// bitsets.
void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

unsigned ComputeNumSignBitsForTargetNode(SDValue Op, const APInt &DemandedElts,		unsigned ComputeNumSignBitsForTargetNode(SDValue Op, const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

/// \brief Helper function that adds Reg to the LiveIn list of the DAG's		/// \brief Helper function that adds Reg to the LiveIn list of the DAG's
/// MachineFunction.		/// MachineFunction.
///		///
▲ Show 20 Lines • Show All 335 Lines • Show Last 20 Lines

lib/Target/AMDGPU/AMDGPUISelLowering.cpp

Show First 20 Lines • Show All 4,122 Lines • ▼ Show 20 Lines	SDValue AMDGPUTargetLowering::getRecipEstimate(SDValue Operand,

// TODO: There is also f64 rcp instruction, but the documentation is less		// TODO: There is also f64 rcp instruction, but the documentation is less
// clear on its precision.		// clear on its precision.

return SDValue();		return SDValue();
}		}

void AMDGPUTargetLowering::computeKnownBitsForTargetNode(		void AMDGPUTargetLowering::computeKnownBitsForTargetNode(
const SDValue Op, KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts, const SelectionDAG &DAG, unsigned Depth) const {		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
		unsigned Depth) const {

Known.resetAll(); // Don't know anything.		Known.resetAll(); // Don't know anything.

unsigned Opc = Op.getOpcode();		unsigned Opc = Op.getOpcode();

switch (Opc) {		switch (Opc) {
default:		default:
break;		break;
Show All 22 Lines	case AMDGPUISD::FP16_ZEXT: {

// High bits are zero.		// High bits are zero.
Known.Zero = APInt::getHighBitsSet(BitWidth, BitWidth - 16);		Known.Zero = APInt::getHighBitsSet(BitWidth, BitWidth - 16);
break;		break;
}		}
case AMDGPUISD::MUL_U24:		case AMDGPUISD::MUL_U24:
case AMDGPUISD::MUL_I24: {		case AMDGPUISD::MUL_I24: {
KnownBits LHSKnown, RHSKnown;		KnownBits LHSKnown, RHSKnown;
DAG.computeKnownBits(Op.getOperand(0), LHSKnown, Depth + 1);		DAG.computeKnownBits(Op.getOperand(0), LHSKnown, AnyToZeroExtLoads,
DAG.computeKnownBits(Op.getOperand(1), RHSKnown, Depth + 1);		Depth + 1);
		DAG.computeKnownBits(Op.getOperand(1), RHSKnown, AnyToZeroExtLoads,
		Depth + 1);

unsigned TrailZ = LHSKnown.countMinTrailingZeros() +		unsigned TrailZ = LHSKnown.countMinTrailingZeros() +
RHSKnown.countMinTrailingZeros();		RHSKnown.countMinTrailingZeros();
Known.Zero.setLowBits(std::min(TrailZ, 32u));		Known.Zero.setLowBits(std::min(TrailZ, 32u));

unsigned LHSValBits = 32 - std::max(LHSKnown.countMinSignBits(), 8u);		unsigned LHSValBits = 32 - std::max(LHSKnown.countMinSignBits(), 8u);
unsigned RHSValBits = 32 - std::max(RHSKnown.countMinSignBits(), 8u);		unsigned RHSValBits = 32 - std::max(RHSKnown.countMinSignBits(), 8u);
unsigned MaxValBits = std::min(LHSValBits + RHSValBits, 32u);		unsigned MaxValBits = std::min(LHSValBits + RHSValBits, 32u);
▲ Show 20 Lines • Show All 68 Lines • Show Last 20 Lines

lib/Target/ARM/ARMISelLowering.h

Show First 20 Lines • Show All 377 Lines • ▼ Show 20 Lines	public:

/// getPostIndexedAddressParts - returns true by value, base pointer and		/// getPostIndexedAddressParts - returns true by value, base pointer and
/// offset pointer and addressing mode by reference if this node can be		/// offset pointer and addressing mode by reference if this node can be
/// combined with a load / store to form a post-indexed load / store.		/// combined with a load / store to form a post-indexed load / store.
bool getPostIndexedAddressParts(SDNode N, SDNode Op, SDValue &Base,		bool getPostIndexedAddressParts(SDNode N, SDNode Op, SDValue &Base,
SDValue &Offset, ISD::MemIndexedMode &AM,		SDValue &Offset, ISD::MemIndexedMode &AM,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;

void computeKnownBitsForTargetNode(const SDValue Op, KnownBits &Known,		void computeKnownBitsForTargetNode(
const APInt &DemandedElts,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth) const override;		unsigned Depth) const override;


bool ExpandInlineAsm(CallInst *CI) const override;		bool ExpandInlineAsm(CallInst *CI) const override;

ConstraintType getConstraintType(StringRef Constraint) const override;		ConstraintType getConstraintType(StringRef Constraint) const override;

/// Examine constraint string and operand type and determine a weight value.		/// Examine constraint string and operand type and determine a weight value.
/// The operand object must already have been set up with the operand type.		/// The operand object must already have been set up with the operand type.
ConstraintWeight getSingleConstraintMatchWeight(		ConstraintWeight getSingleConstraintMatchWeight(
AsmOperandInfo &info, const char *constraint) const override;		AsmOperandInfo &info, const char *constraint) const override;
▲ Show 20 Lines • Show All 405 Lines • Show Last 20 Lines

lib/Target/ARM/ARMISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 13,375 Lines • ▼ Show 20 Lines	if (Ptr != Base) {
if (Ptr != Base)		if (Ptr != Base)
return false;		return false;
}		}

AM = isInc ? ISD::POST_INC : ISD::POST_DEC;		AM = isInc ? ISD::POST_INC : ISD::POST_DEC;
return true;		return true;
}		}

void ARMTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void ARMTargetLowering::computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
unsigned BitWidth = Known.getBitWidth();		unsigned BitWidth = Known.getBitWidth();
Known.resetAll();		Known.resetAll();
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default: break;		default: break;
case ARMISD::ADDC:		case ARMISD::ADDC:
case ARMISD::ADDE:		case ARMISD::ADDE:
case ARMISD::SUBC:		case ARMISD::SUBC:
case ARMISD::SUBE:		case ARMISD::SUBE:
// Special cases when we convert a carry to a boolean.		// Special cases when we convert a carry to a boolean.
if (Op.getResNo() == 0) {		if (Op.getResNo() == 0) {
SDValue LHS = Op.getOperand(0);		SDValue LHS = Op.getOperand(0);
SDValue RHS = Op.getOperand(1);		SDValue RHS = Op.getOperand(1);
// (ADDE 0, 0, C) will give us a single bit.		// (ADDE 0, 0, C) will give us a single bit.
if (Op->getOpcode() == ARMISD::ADDE && isNullConstant(LHS) &&		if (Op->getOpcode() == ARMISD::ADDE && isNullConstant(LHS) &&
isNullConstant(RHS)) {		isNullConstant(RHS)) {
Known.Zero \|= APInt::getHighBitsSet(BitWidth, BitWidth - 1);		Known.Zero \|= APInt::getHighBitsSet(BitWidth, BitWidth - 1);
return;		return;
}		}
}		}
break;		break;
case ARMISD::CMOV: {		case ARMISD::CMOV: {
// Bits are known zero/one if known on the LHS and RHS.		// Bits are known zero/one if known on the LHS and RHS.
DAG.computeKnownBits(Op.getOperand(0), Known, Depth+1);		DAG.computeKnownBits(Op.getOperand(0), Known, AnyToZeroExtLoads, Depth + 1);
if (Known.isUnknown())		if (Known.isUnknown())
return;		return;

KnownBits KnownRHS;		KnownBits KnownRHS;
DAG.computeKnownBits(Op.getOperand(1), KnownRHS, Depth+1);		DAG.computeKnownBits(Op.getOperand(1), KnownRHS, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero &= KnownRHS.Zero;		Known.Zero &= KnownRHS.Zero;
Known.One &= KnownRHS.One;		Known.One &= KnownRHS.One;
return;		return;
}		}
case ISD::INTRINSIC_W_CHAIN: {		case ISD::INTRINSIC_W_CHAIN: {
ConstantSDNode *CN = cast<ConstantSDNode>(Op->getOperand(1));		ConstantSDNode *CN = cast<ConstantSDNode>(Op->getOperand(1));
Intrinsic::ID IntID = static_cast<Intrinsic::ID>(CN->getZExtValue());		Intrinsic::ID IntID = static_cast<Intrinsic::ID>(CN->getZExtValue());
switch (IntID) {		switch (IntID) {
default: return;		default: return;
case Intrinsic::arm_ldaex:		case Intrinsic::arm_ldaex:
case Intrinsic::arm_ldrex: {		case Intrinsic::arm_ldrex: {
EVT VT = cast<MemIntrinsicSDNode>(Op)->getMemoryVT();		EVT VT = cast<MemIntrinsicSDNode>(Op)->getMemoryVT();
unsigned MemBits = VT.getScalarSizeInBits();		unsigned MemBits = VT.getScalarSizeInBits();
Known.Zero \|= APInt::getHighBitsSet(BitWidth, BitWidth - MemBits);		Known.Zero \|= APInt::getHighBitsSet(BitWidth, BitWidth - MemBits);
return;		return;
}		}
}		}
}		}
case ARMISD::BFI: {		case ARMISD::BFI: {
// Conservatively, we can recurse down the first operand		// Conservatively, we can recurse down the first operand
// and just mask out all affected bits.		// and just mask out all affected bits.
DAG.computeKnownBits(Op.getOperand(0), Known, Depth + 1);		DAG.computeKnownBits(Op.getOperand(0), Known, AnyToZeroExtLoads, Depth + 1);

// The operand to BFI is already a mask suitable for removing the bits it		// The operand to BFI is already a mask suitable for removing the bits it
// sets.		// sets.
ConstantSDNode *CI = cast<ConstantSDNode>(Op.getOperand(2));		ConstantSDNode *CI = cast<ConstantSDNode>(Op.getOperand(2));
const APInt &Mask = CI->getAPIntValue();		const APInt &Mask = CI->getAPIntValue();
Known.Zero &= Mask;		Known.Zero &= Mask;
Known.One &= Mask;		Known.One &= Mask;
return;		return;
▲ Show 20 Lines • Show All 1,369 Lines • Show Last 20 Lines

lib/Target/Lanai/LanaiISelLowering.h

Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	public:
getSingleConstraintMatchWeight(AsmOperandInfo &Info,		getSingleConstraintMatchWeight(AsmOperandInfo &Info,
const char *Constraint) const override;		const char *Constraint) const override;
void LowerAsmOperandForConstraint(SDValue Op, std::string &Constraint,		void LowerAsmOperandForConstraint(SDValue Op, std::string &Constraint,
std::vector<SDValue> &Ops,		std::vector<SDValue> &Ops,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;

SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;		SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;

void computeKnownBitsForTargetNode(const SDValue Op, KnownBits &Known,		void computeKnownBitsForTargetNode(
const APInt &DemandedElts,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

private:		private:
SDValue LowerCCCCallTo(SDValue Chain, SDValue Callee,		SDValue LowerCCCCallTo(SDValue Chain, SDValue Callee,
CallingConv::ID CallConv, bool IsVarArg,		CallingConv::ID CallConv, bool IsVarArg,
bool IsTailCall,		bool IsTailCall,
const SmallVectorImpl<ISD::OutputArg> &Outs,		const SmallVectorImpl<ISD::OutputArg> &Outs,
const SmallVectorImpl<SDValue> &OutVals,		const SmallVectorImpl<SDValue> &OutVals,
const SmallVectorImpl<ISD::InputArg> &Ins,		const SmallVectorImpl<ISD::InputArg> &Ins,
Show All 34 Lines

lib/Target/Lanai/LanaiISelLowering.cpp

Show First 20 Lines • Show All 1,497 Lines • ▼ Show 20 Lines	case ISD::SUB:
return PerformSUBCombine(N, DCI);		return PerformSUBCombine(N, DCI);
}		}

return SDValue();		return SDValue();
}		}

void LanaiTargetLowering::computeKnownBitsForTargetNode(		void LanaiTargetLowering::computeKnownBitsForTargetNode(
const SDValue Op, KnownBits &Known, const APInt &DemandedElts,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const SelectionDAG &DAG, unsigned Depth) const {		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
		unsigned Depth) const {
unsigned BitWidth = Known.getBitWidth();		unsigned BitWidth = Known.getBitWidth();
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default:		default:
break;		break;
case LanaiISD::SETCC:		case LanaiISD::SETCC:
Known = KnownBits(BitWidth);		Known = KnownBits(BitWidth);
Known.Zero.setBits(1, BitWidth);		Known.Zero.setBits(1, BitWidth);
break;		break;
case LanaiISD::SELECT_CC:		case LanaiISD::SELECT_CC:
KnownBits Known2;		KnownBits Known2;
DAG.computeKnownBits(Op->getOperand(0), Known, Depth + 1);		DAG.computeKnownBits(Op->getOperand(0), Known, AnyToZeroExtLoads,
DAG.computeKnownBits(Op->getOperand(1), Known2, Depth + 1);		Depth + 1);
		DAG.computeKnownBits(Op->getOperand(1), Known2, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
Known.One &= Known2.One;		Known.One &= Known2.One;
break;		break;
}		}
}		}

lib/Target/PowerPC/PPCISelLowering.h

Show First 20 Lines • Show All 653 Lines • ▼ Show 20 Lines	public:
SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;		SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;

SDValue BuildSDIVPow2(SDNode *N, const APInt &Divisor, SelectionDAG &DAG,		SDValue BuildSDIVPow2(SDNode *N, const APInt &Divisor, SelectionDAG &DAG,
std::vector<SDNode > Created) const override;		std::vector<SDNode > Created) const override;

unsigned getRegisterByName(const char* RegName, EVT VT,		unsigned getRegisterByName(const char* RegName, EVT VT,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;

void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

unsigned getPrefLoopAlignment(MachineLoop *ML) const override;		unsigned getPrefLoopAlignment(MachineLoop *ML) const override;

bool shouldInsertFencesForAtomic(const Instruction *I) const override {		bool shouldInsertFencesForAtomic(const Instruction *I) const override {
return true;		return true;
}		}

Instruction emitLeadingFence(IRBuilder<> &Builder, Instruction Inst,		Instruction emitLeadingFence(IRBuilder<> &Builder, Instruction Inst,
▲ Show 20 Lines • Show All 463 Lines • Show Last 20 Lines

lib/Target/PowerPC/PPCISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 12,928 Lines • ▼ Show 20 Lines	PPCTargetLowering::BuildSDIVPow2(SDNode *N, const APInt &Divisor,

return Op;		return Op;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Inline Assembly Support		// Inline Assembly Support
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void PPCTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void PPCTargetLowering::computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
Known.resetAll();		Known.resetAll();
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default: break;		default: break;
case PPCISD::LBRX: {		case PPCISD::LBRX: {
// lhbrx is known to have the top bits cleared out.		// lhbrx is known to have the top bits cleared out.
if (cast<VTSDNode>(Op.getOperand(2))->getVT() == MVT::i16)		if (cast<VTSDNode>(Op.getOperand(2))->getVT() == MVT::i16)
Known.Zero = 0xFFFF0000;		Known.Zero = 0xFFFF0000;
break;		break;
▲ Show 20 Lines • Show All 994 Lines • Show Last 20 Lines

lib/Target/Sparc/SparcISelLowering.h

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	public:
SparcTargetLowering(const TargetMachine &TM, const SparcSubtarget &STI);		SparcTargetLowering(const TargetMachine &TM, const SparcSubtarget &STI);
SDValue LowerOperation(SDValue Op, SelectionDAG &DAG) const override;		SDValue LowerOperation(SDValue Op, SelectionDAG &DAG) const override;

bool useSoftFloat() const override;		bool useSoftFloat() const override;

/// computeKnownBitsForTargetNode - Determine which of the bits specified		/// computeKnownBitsForTargetNode - Determine which of the bits specified
/// in Mask are known to be either zero or one and return them in the		/// in Mask are known to be either zero or one and return them in the
/// KnownZero/KnownOne bitsets.		/// KnownZero/KnownOne bitsets.
void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

MachineBasicBlock *		MachineBasicBlock *
EmitInstrWithCustomInserter(MachineInstr &MI,		EmitInstrWithCustomInserter(MachineInstr &MI,
MachineBasicBlock *MBB) const override;		MachineBasicBlock *MBB) const override;

const char *getTargetNodeName(unsigned Opcode) const override;		const char *getTargetNodeName(unsigned Opcode) const override;

ConstraintType getConstraintType(StringRef Constraint) const override;		ConstraintType getConstraintType(StringRef Constraint) const override;
▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

lib/Target/Sparc/SparcISelLowering.cpp

Show First 20 Lines • Show All 1,877 Lines • ▼ Show 20 Lines	EVT SparcTargetLowering::getSetCCResultType(const DataLayout &, LLVMContext &,
if (!VT.isVector())		if (!VT.isVector())
return MVT::i32;		return MVT::i32;
return VT.changeVectorElementTypeToInteger();		return VT.changeVectorElementTypeToInteger();
}		}

/// isMaskedValueZeroForTargetNode - Return true if 'Op & Mask' is known to		/// isMaskedValueZeroForTargetNode - Return true if 'Op & Mask' is known to
/// be zero. Op is expected to be a target specific node. Used by DAG		/// be zero. Op is expected to be a target specific node. Used by DAG
/// combiner.		/// combiner.
void SparcTargetLowering::computeKnownBitsForTargetNode		void SparcTargetLowering::computeKnownBitsForTargetNode(
(const SDValue Op,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
KnownBits &Known,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const APInt &DemandedElts,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
KnownBits Known2;		KnownBits Known2;
Known.resetAll();		Known.resetAll();

switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default: break;		default: break;
case SPISD::SELECT_ICC:		case SPISD::SELECT_ICC:
case SPISD::SELECT_XCC:		case SPISD::SELECT_XCC:
case SPISD::SELECT_FCC:		case SPISD::SELECT_FCC:
DAG.computeKnownBits(Op.getOperand(1), Known, Depth+1);		DAG.computeKnownBits(Op.getOperand(1), Known, AnyToZeroExtLoads, Depth + 1);
DAG.computeKnownBits(Op.getOperand(0), Known2, Depth+1);		DAG.computeKnownBits(Op.getOperand(0), Known2, AnyToZeroExtLoads,
		Depth + 1);

// Only known if known in both the LHS and RHS.		// Only known if known in both the LHS and RHS.
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
break;		break;
}		}
}		}

▲ Show 20 Lines • Show All 1,694 Lines • Show Last 20 Lines

lib/Target/SystemZ/SystemZISelLowering.h

Show First 20 Lines • Show All 486 Lines • ▼ Show 20 Lines	public:
SDValue LowerReturn(SDValue Chain, CallingConv::ID CallConv, bool IsVarArg,		SDValue LowerReturn(SDValue Chain, CallingConv::ID CallConv, bool IsVarArg,
const SmallVectorImpl<ISD::OutputArg> &Outs,		const SmallVectorImpl<ISD::OutputArg> &Outs,
const SmallVectorImpl<SDValue> &OutVals, const SDLoc &DL,		const SmallVectorImpl<SDValue> &OutVals, const SDLoc &DL,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;
SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;		SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;

/// Determine which of the bits specified in Mask are known to be either		/// Determine which of the bits specified in Mask are known to be either
/// zero or one and return them in the KnownZero/KnownOne bitsets.		/// zero or one and return them in the KnownZero/KnownOne bitsets.
void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

ISD::NodeType getExtendForAtomicOps() const override {		ISD::NodeType getExtendForAtomicOps() const override {
return ISD::ANY_EXTEND;		return ISD::ANY_EXTEND;
}		}

bool supportSwiftError() const override {		bool supportSwiftError() const override {
return true;		return true;
}		}
▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

lib/Target/SystemZ/SystemZISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,570 Lines • ▼ Show 20 Lines	SDValue SystemZTargetLowering::PerformDAGCombine(SDNode *N,
case ISD::ROTL: return combineSHIFTROT(N, DCI);		case ISD::ROTL: return combineSHIFTROT(N, DCI);
case SystemZISD::BR_CCMASK: return combineBR_CCMASK(N, DCI);		case SystemZISD::BR_CCMASK: return combineBR_CCMASK(N, DCI);
case SystemZISD::SELECT_CCMASK: return combineSELECT_CCMASK(N, DCI);		case SystemZISD::SELECT_CCMASK: return combineSELECT_CCMASK(N, DCI);
}		}

return SDValue();		return SDValue();
}		}

void		void SystemZTargetLowering::computeKnownBitsForTargetNode(
SystemZTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
KnownBits &Known,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const APInt &DemandedElts,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
unsigned BitWidth = Known.getBitWidth();		unsigned BitWidth = Known.getBitWidth();

Known.resetAll();		Known.resetAll();
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
case SystemZISD::SELECT_CCMASK: {		case SystemZISD::SELECT_CCMASK: {
KnownBits TrueKnown(BitWidth), FalseKnown(BitWidth);		KnownBits TrueKnown(BitWidth), FalseKnown(BitWidth);
DAG.computeKnownBits(Op.getOperand(0), TrueKnown, Depth + 1);		DAG.computeKnownBits(Op.getOperand(0), TrueKnown, AnyToZeroExtLoads,
DAG.computeKnownBits(Op.getOperand(1), FalseKnown, Depth + 1);		Depth + 1);
		DAG.computeKnownBits(Op.getOperand(1), FalseKnown, AnyToZeroExtLoads,
		Depth + 1);
Known.Zero = TrueKnown.Zero & FalseKnown.Zero;		Known.Zero = TrueKnown.Zero & FalseKnown.Zero;
Known.One = TrueKnown.One & FalseKnown.One;		Known.One = TrueKnown.One & FalseKnown.One;
break;		break;
}		}

default:		default:
break;		break;
}		}
▲ Show 20 Lines • Show All 1,194 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.h

Show First 20 Lines • Show All 834 Lines • ▼ Show 20 Lines	public:
EVT getSetCCResultType(const DataLayout &DL, LLVMContext &Context,		EVT getSetCCResultType(const DataLayout &DL, LLVMContext &Context,
EVT VT) const override;		EVT VT) const override;

bool targetShrinkDemandedConstant(SDValue Op, const APInt &Demanded,		bool targetShrinkDemandedConstant(SDValue Op, const APInt &Demanded,
TargetLoweringOpt &TLO) const override;		TargetLoweringOpt &TLO) const override;

/// Determine which of the bits specified in Mask are known to be either		/// Determine which of the bits specified in Mask are known to be either
/// zero or one and return them in the KnownZero/KnownOne bitsets.		/// zero or one and return them in the KnownZero/KnownOne bitsets.
void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

/// Determine the number of bits in the operation that are sign bits.		/// Determine the number of bits in the operation that are sign bits.
unsigned ComputeNumSignBitsForTargetNode(SDValue Op,		unsigned ComputeNumSignBitsForTargetNode(SDValue Op,
const APInt &DemandedElts,		const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
unsigned Depth) const override;		unsigned Depth) const override;

SDValue unwrapAddress(SDValue N) const override;		SDValue unwrapAddress(SDValue N) const override;
▲ Show 20 Lines • Show All 683 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 28,131 Lines • ▼ Show 20 Lines	X86TargetLowering::targetShrinkDemandedConstant(SDValue Op,

// Replace the constant with the zero extend mask.		// Replace the constant with the zero extend mask.
SDLoc DL(Op);		SDLoc DL(Op);
SDValue NewC = TLO.DAG.getConstant(ZeroExtendMask, DL, VT);		SDValue NewC = TLO.DAG.getConstant(ZeroExtendMask, DL, VT);
SDValue NewOp = TLO.DAG.getNode(ISD::AND, DL, VT, Op.getOperand(0), NewC);		SDValue NewOp = TLO.DAG.getNode(ISD::AND, DL, VT, Op.getOperand(0), NewC);
return TLO.CombineTo(Op, NewOp);		return TLO.CombineTo(Op, NewOp);
}		}

void X86TargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void X86TargetLowering::computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
unsigned BitWidth = Known.getBitWidth();		unsigned BitWidth = Known.getBitWidth();
unsigned Opc = Op.getOpcode();		unsigned Opc = Op.getOpcode();
EVT VT = Op.getValueType();		EVT VT = Op.getValueType();
assert((Opc >= ISD::BUILTIN_OP_END \|\|		assert((Opc >= ISD::BUILTIN_OP_END \|\|
Opc == ISD::INTRINSIC_WO_CHAIN \|\|		Opc == ISD::INTRINSIC_WO_CHAIN \|\|
Opc == ISD::INTRINSIC_W_CHAIN \|\|		Opc == ISD::INTRINSIC_W_CHAIN \|\|
Opc == ISD::INTRINSIC_VOID) &&		Opc == ISD::INTRINSIC_VOID) &&
"Should use MaskedValueIsZero if you don't know whether Op"		"Should use MaskedValueIsZero if you don't know whether Op"
Show All 11 Lines	case X86ISD::MOVMSK: {
break;		break;
}		}
case X86ISD::PEXTRB:		case X86ISD::PEXTRB:
case X86ISD::PEXTRW: {		case X86ISD::PEXTRW: {
SDValue Src = Op.getOperand(0);		SDValue Src = Op.getOperand(0);
EVT SrcVT = Src.getValueType();		EVT SrcVT = Src.getValueType();
APInt DemandedElt = APInt::getOneBitSet(SrcVT.getVectorNumElements(),		APInt DemandedElt = APInt::getOneBitSet(SrcVT.getVectorNumElements(),
Op.getConstantOperandVal(1));		Op.getConstantOperandVal(1));
DAG.computeKnownBits(Src, Known, DemandedElt, Depth + 1);		DAG.computeKnownBits(Src, Known, DemandedElt, AnyToZeroExtLoads, Depth + 1);
Known = Known.zextOrTrunc(BitWidth);		Known = Known.zextOrTrunc(BitWidth);
Known.Zero.setBitsFrom(SrcVT.getScalarSizeInBits());		Known.Zero.setBitsFrom(SrcVT.getScalarSizeInBits());
break;		break;
}		}
case X86ISD::VSHLI:		case X86ISD::VSHLI:
case X86ISD::VSRLI: {		case X86ISD::VSRLI: {
if (auto *ShiftImm = dyn_cast<ConstantSDNode>(Op.getOperand(1))) {		if (auto *ShiftImm = dyn_cast<ConstantSDNode>(Op.getOperand(1))) {
if (ShiftImm->getAPIntValue().uge(VT.getScalarSizeInBits())) {		if (ShiftImm->getAPIntValue().uge(VT.getScalarSizeInBits())) {
Known.setAllZero();		Known.setAllZero();
break;		break;
}		}

DAG.computeKnownBits(Op.getOperand(0), Known, DemandedElts, Depth + 1);		DAG.computeKnownBits(Op.getOperand(0), Known, DemandedElts,
		AnyToZeroExtLoads, Depth + 1);
unsigned ShAmt = ShiftImm->getZExtValue();		unsigned ShAmt = ShiftImm->getZExtValue();
if (Opc == X86ISD::VSHLI) {		if (Opc == X86ISD::VSHLI) {
Known.Zero <<= ShAmt;		Known.Zero <<= ShAmt;
Known.One <<= ShAmt;		Known.One <<= ShAmt;
// Low bits are known zero.		// Low bits are known zero.
Known.Zero.setLowBits(ShAmt);		Known.Zero.setLowBits(ShAmt);
} else {		} else {
Known.Zero.lshrInPlace(ShAmt);		Known.Zero.lshrInPlace(ShAmt);
Show All 11 Lines	case X86ISD::VZEXT: {

EVT SrcVT = N0.getValueType();		EVT SrcVT = N0.getValueType();
unsigned InNumElts = SrcVT.getVectorNumElements();		unsigned InNumElts = SrcVT.getVectorNumElements();
unsigned InBitWidth = SrcVT.getScalarSizeInBits();		unsigned InBitWidth = SrcVT.getScalarSizeInBits();
assert(InNumElts >= NumElts && "Illegal VZEXT input");		assert(InNumElts >= NumElts && "Illegal VZEXT input");

Known = KnownBits(InBitWidth);		Known = KnownBits(InBitWidth);
APInt DemandedSrcElts = APInt::getLowBitsSet(InNumElts, NumElts);		APInt DemandedSrcElts = APInt::getLowBitsSet(InNumElts, NumElts);
DAG.computeKnownBits(N0, Known, DemandedSrcElts, Depth + 1);		DAG.computeKnownBits(N0, Known, DemandedSrcElts, AnyToZeroExtLoads,
		Depth + 1);
Known = Known.zext(BitWidth);		Known = Known.zext(BitWidth);
Known.Zero.setBitsFrom(InBitWidth);		Known.Zero.setBitsFrom(InBitWidth);
break;		break;
}		}
case X86ISD::CMOV: {		case X86ISD::CMOV: {
DAG.computeKnownBits(Op.getOperand(1), Known, Depth+1);		DAG.computeKnownBits(Op.getOperand(1), Known, AnyToZeroExtLoads, Depth + 1);
// If we don't know any bits, early out.		// If we don't know any bits, early out.
if (Known.isUnknown())		if (Known.isUnknown())
break;		break;
KnownBits Known2;		KnownBits Known2;
DAG.computeKnownBits(Op.getOperand(0), Known2, Depth+1);		DAG.computeKnownBits(Op.getOperand(0), Known2, AnyToZeroExtLoads,
		Depth + 1);

// Only known if known in both the LHS and RHS.		// Only known if known in both the LHS and RHS.
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
break;		break;
}		}
case X86ISD::UDIVREM8_ZEXT_HREG:		case X86ISD::UDIVREM8_ZEXT_HREG:
// TODO: Support more than just the zero extended bits?		// TODO: Support more than just the zero extended bits?
▲ Show 20 Lines • Show All 11,103 Lines • Show Last 20 Lines

lib/Target/X86/X86InstrCompiler.td

	Show First 20 Lines • Show All 1,318 Lines • ▼ Show 20 Lines
	// into "disjoint bits" pseudo ops.			// into "disjoint bits" pseudo ops.

	// Treat an 'or' node is as an 'add' if the or'ed bits are known to be zero.			// Treat an 'or' node is as an 'add' if the or'ed bits are known to be zero.
	def or_is_add : PatFrag<(ops node:$lhs, node:$rhs), (or node:$lhs, node:$rhs),[{			def or_is_add : PatFrag<(ops node:$lhs, node:$rhs), (or node:$lhs, node:$rhs),[{
	if (ConstantSDNode *CN = dyn_cast<ConstantSDNode>(N->getOperand(1)))			if (ConstantSDNode *CN = dyn_cast<ConstantSDNode>(N->getOperand(1)))
	return CurDAG->MaskedValueIsZero(N->getOperand(0), CN->getAPIntValue());			return CurDAG->MaskedValueIsZero(N->getOperand(0), CN->getAPIntValue());

	KnownBits Known0;			KnownBits Known0;
	CurDAG->computeKnownBits(N->getOperand(0), Known0, 0);			CurDAG->computeKnownBits(N->getOperand(0), Known0, nullptr, 0);
	KnownBits Known1;			KnownBits Known1;
	CurDAG->computeKnownBits(N->getOperand(1), Known1, 0);			CurDAG->computeKnownBits(N->getOperand(1), Known1, nullptr, 0);
	return (~Known0.Zero & ~Known1.Zero) == 0;			return (~Known0.Zero & ~Known1.Zero) == 0;
	}]>;			}]>;


	// (or x1, x2) -> (add x1, x2) if two operands are known not to share bits.			// (or x1, x2) -> (add x1, x2) if two operands are known not to share bits.
	// Try this before the selecting to OR.			// Try this before the selecting to OR.
	let AddedComplexity = 5, SchedRW = [WriteALU] in {			let AddedComplexity = 5, SchedRW = [WriteALU] in {

	▲ Show 20 Lines • Show All 738 Lines • Show Last 20 Lines

lib/Target/XCore/XCoreISelLowering.h

Show First 20 Lines • Show All 194 Lines • ▼ Show 20 Lines	getRegForInlineAsmConstraint(const TargetRegisterInfo *TRI,
StringRef Constraint, MVT VT) const override;		StringRef Constraint, MVT VT) const override;

// Expand specifics		// Expand specifics
SDValue TryExpandADDWithMul(SDNode *Op, SelectionDAG &DAG) const;		SDValue TryExpandADDWithMul(SDNode *Op, SelectionDAG &DAG) const;
SDValue ExpandADDSUB(SDNode *Op, SelectionDAG &DAG) const;		SDValue ExpandADDSUB(SDNode *Op, SelectionDAG &DAG) const;

SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;		SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;

void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
		SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;

SDValue		SDValue
LowerFormalArguments(SDValue Chain, CallingConv::ID CallConv, bool isVarArg,		LowerFormalArguments(SDValue Chain, CallingConv::ID CallConv, bool isVarArg,
const SmallVectorImpl<ISD::InputArg> &Ins,		const SmallVectorImpl<ISD::InputArg> &Ins,
const SDLoc &dl, SelectionDAG &DAG,		const SDLoc &dl, SelectionDAG &DAG,
SmallVectorImpl<SDValue> &InVals) const override;		SmallVectorImpl<SDValue> &InVals) const override;

SDValue		SDValue
Show All 20 Lines

lib/Target/XCore/XCoreISelLowering.cpp

Show First 20 Lines • Show All 1,813 Lines • ▼ Show 20 Lines	if (LoadSDNode *LD = dyn_cast<LoadSDNode>(ST->getValue())) {
}		}
}		}
break;		break;
}		}
}		}
return SDValue();		return SDValue();
}		}

void XCoreTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void XCoreTargetLowering::computeKnownBitsForTargetNode(
KnownBits &Known,		const SDValue Op, KnownBits &Known, const APInt &DemandedElts,
const APInt &DemandedElts,		const SelectionDAG &DAG, SmallPtrSetImpl<LoadSDNode > AnyToZeroExtLoads,
const SelectionDAG &DAG,
unsigned Depth) const {		unsigned Depth) const {
Known.resetAll();		Known.resetAll();
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
default: break;		default: break;
case XCoreISD::LADD:		case XCoreISD::LADD:
case XCoreISD::LSUB:		case XCoreISD::LSUB:
if (Op.getResNo() == 1) {		if (Op.getResNo() == 1) {
// Top bits of carry / borrow are clear.		// Top bits of carry / borrow are clear.
Known.Zero = APInt::getHighBitsSet(Known.getBitWidth(),		Known.Zero = APInt::getHighBitsSet(Known.getBitWidth(),
▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

test/CodeGen/Thumb/setcc_xor.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -mtriple=thumbv6m-eabi -asm-verbose=false %s -o - \| FileCheck %s --check-prefix=CHECK-V6M			; RUN: llc -mtriple=thumbv6m-eabi -asm-verbose=false %s -o - \| FileCheck %s --check-prefix=CHECK-V6M
	; RUN: llc -mtriple=thumbv7m-eabi -asm-verbose=false %s -o - \| FileCheck %s --check-prefix=CHECK-V7M			; RUN: llc -mtriple=thumbv7m-eabi -asm-verbose=false %s -o - \| FileCheck %s --check-prefix=CHECK-V7M

	define i8 @test1(i8 zeroext %x, i8 zeroext %y) {			define i8 @test1(i8 zeroext %x, i8 zeroext %y) {
	; CHECK-V6M-LABEL: test1:			; CHECK-V6M-LABEL: test1:
	; CHECK-V6M: movs r2, #255			; CHECK-V6M: mvns r0, r0
	; CHECK-V6M-NEXT: mov r3, r2
	; CHECK-V6M-NEXT: bics r3, r1
	; CHECK-V6M-NEXT: bics r2, r0
	; CHECK-V6M-NEXT: mvns r0, r0
	; CHECK-V6M-NEXT: mvns r1, r1			; CHECK-V6M-NEXT: mvns r1, r1
	; CHECK-V6M-NEXT: cmp r2, r3			; CHECK-V6M-NEXT: cmp r0, r1
	; CHECK-V6M-NEXT: bls .LBB0_2			; CHECK-V6M-NEXT: bls .LBB0_2
	; CHECK-V6M-NEXT: mov r0, r1			; CHECK-V6M-NEXT: mov r0, r1
	; CHECK-V6M-NEXT: .LBB0_2:			; CHECK-V6M-NEXT: .LBB0_2:
	; CHECK-V6M-NEXT: bx lr			; CHECK-V6M-NEXT: bx lr
	;			;
	; CHECK-V7M-LABEL: test1:			; CHECK-V7M-LABEL: test1:
	; CHECK-V7M: mvns r1, r1			; CHECK-V7M: mvns r2, r0
	; CHECK-V7M-NEXT: mvns r0, r0			; CHECK-V7M-NEXT: mvns r0, r1
	; CHECK-V7M-NEXT: uxtb r2, r1			; CHECK-V7M-NEXT: cmp r2, r0
	; CHECK-V7M-NEXT: uxtb r3, r0
	; CHECK-V7M-NEXT: cmp r3, r2
	; CHECK-V7M-NEXT: it ls			; CHECK-V7M-NEXT: it ls
	; CHECK-V7M-NEXT: movls r1, r0			; CHECK-V7M-NEXT: movls r0, r2
	; CHECK-V7M-NEXT: mov r0, r1
	; CHECK-V7M-NEXT: bx lr			; CHECK-V7M-NEXT: bx lr
	entry:			entry:
	%nx = xor i8 %x, 255			%nx = xor i8 %x, 255
	%ny = xor i8 %y, 255			%ny = xor i8 %y, 255
	%cmp = icmp ule i8 %nx, %ny			%cmp = icmp ule i8 %nx, %ny
	%sel = select i1 %cmp, i8 %nx, i8 %ny			%sel = select i1 %cmp, i8 %nx, i8 %ny
	ret i8 %sel			ret i8 %sel
	}			}

	define void @test2(i8* %X, i8* %Y) {			define void @test2(i8* %X, i8* %Y) {
	; CHECK-V6M-LABEL: test2:			; CHECK-V6M-LABEL: test2:
	; CHECK-V6M: .save {r4, r5, r7, lr}			; CHECK-V6M: .save {r4, lr}
	; CHECK-V6M-NEXT: push {r4, r5, r7, lr}			; CHECK-V6M-NEXT: push {r4, lr}
	; CHECK-V6M-NEXT: ldrb r2, [r1]			; CHECK-V6M-NEXT: ldrb r2, [r0]
	; CHECK-V6M-NEXT: movs r4, #255
	; CHECK-V6M-NEXT: mov r5, r4
	; CHECK-V6M-NEXT: bics r5, r2
	; CHECK-V6M-NEXT: ldrb r3, [r0]
	; CHECK-V6M-NEXT: bics r4, r3
	; CHECK-V6M-NEXT: mvns r3, r3
	; CHECK-V6M-NEXT: mvns r2, r2			; CHECK-V6M-NEXT: mvns r2, r2
	; CHECK-V6M-NEXT: cmp r4, r5			; CHECK-V6M-NEXT: ldrb r3, [r1]
	; CHECK-V6M-NEXT: mov r4, r3			; CHECK-V6M-NEXT: mvns r3, r3
	; CHECK-V6M-NEXT: blo .LBB1_2			; CHECK-V6M-NEXT: cmp r2, r3
	; CHECK-V6M-NEXT: mov r4, r2			; CHECK-V6M-NEXT: mov r4, r2
				; CHECK-V6M-NEXT: blo .LBB1_2
				; CHECK-V6M-NEXT: mov r4, r3
	; CHECK-V6M-NEXT: .LBB1_2:			; CHECK-V6M-NEXT: .LBB1_2:
	; CHECK-V6M-NEXT: subs r3, r3, r4			; CHECK-V6M-NEXT: subs r2, r2, r4
	; CHECK-V6M-NEXT: strb r3, [r0]			; CHECK-V6M-NEXT: strb r2, [r0]
	; CHECK-V6M-NEXT: subs r0, r2, r4			; CHECK-V6M-NEXT: subs r0, r3, r4
	; CHECK-V6M-NEXT: strb r0, [r1]			; CHECK-V6M-NEXT: strb r0, [r1]
	; CHECK-V6M-NEXT: pop {r4, r5, r7, pc}			; CHECK-V6M-NEXT: pop {r4, pc}
	;			;
	; CHECK-V7M-LABEL: test2:			; CHECK-V7M-LABEL: test2:
	; CHECK-V7M: .save {r7, lr}			; CHECK-V7M: .save {r7, lr}
	; CHECK-V7M-NEXT: push {r7, lr}			; CHECK-V7M-NEXT: push {r7, lr}
	; CHECK-V7M-NEXT: ldrb r3, [r0]			; CHECK-V7M-NEXT: ldrb r3, [r1]
	; CHECK-V7M-NEXT: ldrb r2, [r1]			; CHECK-V7M-NEXT: ldrb r2, [r0]
	; CHECK-V7M-NEXT: mvns r3, r3			; CHECK-V7M-NEXT: mvn.w lr, r3
	; CHECK-V7M-NEXT: mvn.w lr, r2			; CHECK-V7M-NEXT: mvn.w r12, r2
	; CHECK-V7M-NEXT: uxtb r2, r3			; CHECK-V7M-NEXT: cmp r12, lr
	; CHECK-V7M-NEXT: uxtb.w r12, lr			; CHECK-V7M-NEXT: mov r3, lr
	; CHECK-V7M-NEXT: cmp r2, r12
	; CHECK-V7M-NEXT: mov r2, lr
	; CHECK-V7M-NEXT: it lo			; CHECK-V7M-NEXT: it lo
	; CHECK-V7M-NEXT: movlo r2, r3			; CHECK-V7M-NEXT: movlo r3, r12
	; CHECK-V7M-NEXT: subs r3, r3, r2			; CHECK-V7M-NEXT: sub.w r2, r12, r3
	; CHECK-V7M-NEXT: strb r3, [r0]			; CHECK-V7M-NEXT: strb r2, [r0]
	; CHECK-V7M-NEXT: sub.w r0, lr, r2			; CHECK-V7M-NEXT: sub.w r0, lr, r3
	; CHECK-V7M-NEXT: strb r0, [r1]			; CHECK-V7M-NEXT: strb r0, [r1]
	; CHECK-V7M-NEXT: pop {r7, pc}			; CHECK-V7M-NEXT: pop {r7, pc}
	entry:			entry:
	%x = load i8, i8* %X			%x = load i8, i8* %X
	%y = load i8, i8* %Y			%y = load i8, i8* %Y
	%nx = xor i8 %x, 255			%nx = xor i8 %x, 255
	%ny = xor i8 %y, 255			%ny = xor i8 %y, 255
	%cmp = icmp ult i8 %nx, %ny			%cmp = icmp ult i8 %nx, %ny
	%sel = select i1 %cmp, i8 %nx, i8 %ny			%sel = select i1 %cmp, i8 %nx, i8 %ny
	%xr = sub i8 %nx, %sel			%xr = sub i8 %nx, %sel
	%yr = sub i8 %ny, %sel			%yr = sub i8 %ny, %sel
	store i8 %xr, i8* %X			store i8 %xr, i8* %X
	store i8 %yr, i8* %Y			store i8 %yr, i8* %Y
	ret void			ret void
	}			}


	define void @testloop(i32 %I, i8* nocapture readonly %A, i8* nocapture %B) {			define void @testloop(i32 %I, i8* nocapture readonly %A, i8* nocapture %B) {
	; CHECK-V6M-LABEL: testloop:			; CHECK-V6M-LABEL: testloop:
	; CHECK-V6M: .save {r4, r5, r6, r7, lr}			; CHECK-V6M: .save {r4, r5, r6, lr}
	; CHECK-V6M-NEXT: push {r4, r5, r6, r7, lr}			; CHECK-V6M-NEXT: push {r4, r5, r6, lr}
	; CHECK-V6M-NEXT: .pad #4
	; CHECK-V6M-NEXT: sub sp, #4
	; CHECK-V6M-NEXT: cmp r0, #1			; CHECK-V6M-NEXT: cmp r0, #1
	; CHECK-V6M-NEXT: blt .LBB2_6			; CHECK-V6M-NEXT: blt .LBB2_6
	; CHECK-V6M-NEXT: .LBB2_1:			; CHECK-V6M-NEXT: .LBB2_1:
				; CHECK-V6M-NEXT: ldrb r3, [r1]
				; CHECK-V6M-NEXT: mvns r4, r3
	; CHECK-V6M-NEXT: ldrb r3, [r1, #2]			; CHECK-V6M-NEXT: ldrb r3, [r1, #2]
	; CHECK-V6M-NEXT: movs r6, #255
	; CHECK-V6M-NEXT: mov r5, r6
	; CHECK-V6M-NEXT: bics r5, r3
	; CHECK-V6M-NEXT: ldrb r4, [r1]
	; CHECK-V6M-NEXT: mov r7, r6
	; CHECK-V6M-NEXT: bics r7, r4
	; CHECK-V6M-NEXT: mvns r4, r4
	; CHECK-V6M-NEXT: mvns r3, r3			; CHECK-V6M-NEXT: mvns r3, r3
	; CHECK-V6M-NEXT: cmp r7, r5			; CHECK-V6M-NEXT: cmp r4, r3
	; CHECK-V6M-NEXT: mov r5, r4			; CHECK-V6M-NEXT: mov r5, r4
	; CHECK-V6M-NEXT: blo .LBB2_3			; CHECK-V6M-NEXT: blo .LBB2_3
	; CHECK-V6M-NEXT: mov r5, r3			; CHECK-V6M-NEXT: mov r5, r3
	; CHECK-V6M-NEXT: .LBB2_3:			; CHECK-V6M-NEXT: .LBB2_3:
	; CHECK-V6M-NEXT: str r3, [sp]			; CHECK-V6M-NEXT: ldrb r6, [r1, #1]
	; CHECK-V6M-NEXT: uxtb r3, r5			; CHECK-V6M-NEXT: mvns r6, r6
	; CHECK-V6M-NEXT: ldrb r7, [r1, #1]			; CHECK-V6M-NEXT: cmp r5, r6
	; CHECK-V6M-NEXT: bics r6, r7
	; CHECK-V6M-NEXT: mvns r7, r7
	; CHECK-V6M-NEXT: cmp r3, r6
	; CHECK-V6M-NEXT: blo .LBB2_5			; CHECK-V6M-NEXT: blo .LBB2_5
	; CHECK-V6M-NEXT: mov r5, r7			; CHECK-V6M-NEXT: mov r5, r6
	; CHECK-V6M-NEXT: .LBB2_5:			; CHECK-V6M-NEXT: .LBB2_5:
	; CHECK-V6M-NEXT: strb r5, [r2]			; CHECK-V6M-NEXT: strb r5, [r2]
	; CHECK-V6M-NEXT: subs r3, r4, r5			; CHECK-V6M-NEXT: subs r4, r4, r5
	; CHECK-V6M-NEXT: strb r3, [r2, #1]			; CHECK-V6M-NEXT: strb r4, [r2, #1]
	; CHECK-V6M-NEXT: subs r3, r7, r5			; CHECK-V6M-NEXT: subs r4, r6, r5
	; CHECK-V6M-NEXT: strb r3, [r2, #2]			; CHECK-V6M-NEXT: strb r4, [r2, #2]
	; CHECK-V6M-NEXT: ldr r3, [sp]
	; CHECK-V6M-NEXT: subs r3, r3, r5			; CHECK-V6M-NEXT: subs r3, r3, r5
	; CHECK-V6M-NEXT: strb r3, [r2, #3]			; CHECK-V6M-NEXT: strb r3, [r2, #3]
	; CHECK-V6M-NEXT: adds r2, r2, #4			; CHECK-V6M-NEXT: adds r2, r2, #4
	; CHECK-V6M-NEXT: adds r1, r1, #3			; CHECK-V6M-NEXT: adds r1, r1, #3
	; CHECK-V6M-NEXT: subs r0, r0, #1			; CHECK-V6M-NEXT: subs r0, r0, #1
	; CHECK-V6M-NEXT: bne .LBB2_1			; CHECK-V6M-NEXT: bne .LBB2_1
	; CHECK-V6M-NEXT: .LBB2_6:			; CHECK-V6M-NEXT: .LBB2_6:
	; CHECK-V6M-NEXT: add sp, #4			; CHECK-V6M-NEXT: pop {r4, r5, r6, pc}
	; CHECK-V6M-NEXT: pop {r4, r5, r6, r7, pc}
	;			;
	; CHECK-V7M-LABEL: testloop:			; CHECK-V7M-LABEL: testloop:
	; CHECK-V7M: .save {r4, r5, r6, r7, lr}			; CHECK-V7M: .save {r4, r5, r7, lr}
	; CHECK-V7M-NEXT: push {r4, r5, r6, r7, lr}			; CHECK-V7M-NEXT: push {r4, r5, r7, lr}
	; CHECK-V7M-NEXT: cmp r0, #1			; CHECK-V7M-NEXT: cmp r0, #1
	; CHECK-V7M-NEXT: blt .LBB2_2			; CHECK-V7M-NEXT: blt .LBB2_2
	; CHECK-V7M-NEXT: .LBB2_1:			; CHECK-V7M-NEXT: .LBB2_1:
	; CHECK-V7M-NEXT: ldrb.w lr, [r1]			; CHECK-V7M-NEXT: ldrb.w lr, [r1]
	; CHECK-V7M-NEXT: ldrb r3, [r1, #2]			; CHECK-V7M-NEXT: ldrb r4, [r1, #2]
	; CHECK-V7M-NEXT: ldrb.w r12, [r1, #1]			; CHECK-V7M-NEXT: ldrb.w r12, [r1, #1]
	; CHECK-V7M-NEXT: adds r1, #3			; CHECK-V7M-NEXT: adds r1, #3
	; CHECK-V7M-NEXT: mvn.w r4, lr			; CHECK-V7M-NEXT: mvn.w r5, lr
	; CHECK-V7M-NEXT: mvns r7, r3			; CHECK-V7M-NEXT: mvn.w lr, r4
	; CHECK-V7M-NEXT: uxtb r5, r4			; CHECK-V7M-NEXT: cmp r5, lr
	; CHECK-V7M-NEXT: uxtb r6, r7			; CHECK-V7M-NEXT: mov r4, lr
	; CHECK-V7M-NEXT: cmp r5, r6			; CHECK-V7M-NEXT: mvn.w r3, r12
	; CHECK-V7M-NEXT: mov r3, r7
	; CHECK-V7M-NEXT: mvn.w r5, r12
	; CHECK-V7M-NEXT: it lo			; CHECK-V7M-NEXT: it lo
	; CHECK-V7M-NEXT: movlo r3, r4			; CHECK-V7M-NEXT: movlo r4, r5
	; CHECK-V7M-NEXT: uxtb r6, r5			; CHECK-V7M-NEXT: cmp r4, r3
	; CHECK-V7M-NEXT: uxtb.w lr, r3
	; CHECK-V7M-NEXT: cmp lr, r6
	; CHECK-V7M-NEXT: it hs			; CHECK-V7M-NEXT: it hs
	; CHECK-V7M-NEXT: movhs r3, r5			; CHECK-V7M-NEXT: movhs r4, r3
	; CHECK-V7M-NEXT: subs r0, #1			; CHECK-V7M-NEXT: subs r0, #1
	; CHECK-V7M-NEXT: sub.w r6, r4, r3			; CHECK-V7M-NEXT: sub.w r3, r3, r4
	; CHECK-V7M-NEXT: strb r3, [r2]			; CHECK-V7M-NEXT: strb r4, [r2]
	; CHECK-V7M-NEXT: strb r6, [r2, #1]			; CHECK-V7M-NEXT: sub.w r5, r5, r4
	; CHECK-V7M-NEXT: sub.w r6, r5, r3			; CHECK-V7M-NEXT: strb r5, [r2, #1]
	; CHECK-V7M-NEXT: strb r6, [r2, #2]			; CHECK-V7M-NEXT: strb r3, [r2, #2]
	; CHECK-V7M-NEXT: sub.w r3, r7, r3			; CHECK-V7M-NEXT: sub.w r3, lr, r4
	; CHECK-V7M-NEXT: strb r3, [r2, #3]			; CHECK-V7M-NEXT: strb r3, [r2, #3]
	; CHECK-V7M-NEXT: add.w r2, r2, #4			; CHECK-V7M-NEXT: add.w r2, r2, #4
	; CHECK-V7M-NEXT: bne .LBB2_1			; CHECK-V7M-NEXT: bne .LBB2_1
	; CHECK-V7M-NEXT: .LBB2_2:			; CHECK-V7M-NEXT: .LBB2_2:
	; CHECK-V7M-NEXT: pop {r4, r5, r6, r7, pc}			; CHECK-V7M-NEXT: pop {r4, r5, r7, pc}
	entry:			entry:
	%cmp74 = icmp sgt i32 %I, 0			%cmp74 = icmp sgt i32 %I, 0
	br i1 %cmp74, label %for.body.preheader, label %for.cond.cleanup			br i1 %cmp74, label %for.body.preheader, label %for.cond.cleanup

	for.body.preheader:			for.body.preheader:
	br label %for.body			br label %for.body

	for.body:			for.body:
	Show All 34 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[DAGCombine] Remove AND in SETCC if we can prove they are unneededAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 136826

include/llvm/CodeGen/SelectionDAG.h

include/llvm/CodeGen/TargetLowering.h

lib/CodeGen/SelectionDAG/SelectionDAG.cpp

lib/CodeGen/SelectionDAG/TargetLowering.cpp

lib/Target/AArch64/AArch64ISelLowering.h

lib/Target/AArch64/AArch64ISelLowering.cpp

lib/Target/AMDGPU/AMDGPUISelLowering.h

lib/Target/AMDGPU/AMDGPUISelLowering.cpp

lib/Target/ARM/ARMISelLowering.h

lib/Target/ARM/ARMISelLowering.cpp

lib/Target/Lanai/LanaiISelLowering.h

lib/Target/Lanai/LanaiISelLowering.cpp

lib/Target/PowerPC/PPCISelLowering.h

lib/Target/PowerPC/PPCISelLowering.cpp

lib/Target/Sparc/SparcISelLowering.h

lib/Target/Sparc/SparcISelLowering.cpp

lib/Target/SystemZ/SystemZISelLowering.h

lib/Target/SystemZ/SystemZISelLowering.cpp

lib/Target/X86/X86ISelLowering.h

lib/Target/X86/X86ISelLowering.cpp

lib/Target/X86/X86InstrCompiler.td

lib/Target/XCore/XCoreISelLowering.h

lib/Target/XCore/XCoreISelLowering.cpp

test/CodeGen/Thumb/setcc_xor.ll

[DAGCombine] Remove AND in SETCC if we can prove they are unneeded
AbandonedPublic