This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
docs/
1
LangRef.rst
-
include/llvm/
-
llvm/
-
Analysis/
-
ValueTracking.h
-
CodeGen/GlobalISel/
-
GlobalISel/
-
IRTranslator.h
-
IR/
-
Constant.h
-
lib/
-
Analysis/
-
CodeMetrics.cpp
1/3
ValueTracking.cpp
-
CodeGen/SelectionDAG/
-
SelectionDAG/
-
FastISel.cpp
-
SelectionDAGBuilder.h
1
SelectionDAGBuilder.cpp
-
SelectionDAGISel.cpp
-
IR/
1
Constants.cpp
-
Transforms/
-
Utils/
-
SimplifyCFG.cpp
-
Vectorize/
-
LoopVectorizationLegality.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
-
critical-edge-split-2.ll
1/3
divide-constant.ll

Differential D63036

LLVM IR constant expressions never trap.
Changes PlannedPublic

Authored by efriedma on Jun 7 2019, 5:30 PM.

Download Raw Diff

Details

Reviewers

hfinkel
chandlerc
jdoerfert
aemerson

Summary

Currently, constants have a property "Constant::canTrap", which is whether they contain a division that might have undefined behavior. If an instruction has a canTrap constant expression as an operand, and that constant expression contains a division with undefined behavior, the instruction has undefined behavior. For PHI nodes, the behavior is only undefined along the corresponding edges. This isn't documented anywhere in LangRef, but we use it to avoid certain transforms in a few optimization passes. For example, isSafeToSpeculativelyExecute checks whether instructions have a canTrap operand.

In practice, canTrap is almost never true: the only way create such an expression is to do something strange with the address of a global, so the denominator of a division is a complex constant expression. This means we have a lot of complexity with very little test coverage. So it would be nice if we could simplify the rules here.

This patch proposes to give up on the whole "canTrap" thing, and redefine the meaning of division in constant expressions. With this patch, if a constant expression divides by zero, or contains an overflowing divide, the result is poison. This simplifies a bunch of code. It also fixes an infinite loop bug involving a canTrap constant, a PHI, and an unsplittable critical edge. The downside is a slight performance hit: if we do end up with a divide constant expression with a complex denominator, we generate extra code to avoid the trap.

There are a few ways to reduce the performance hit that I haven't tried to implement. On architectures where division never traps, we could avoid generating extra code. We could also try to avoid constant-folding divide instructions that would result in a complex constant expression.

Diff Detail

Repository: rL LLVM

Event Timeline

efriedma created this revision.Jun 7 2019, 5:30 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 7 2019, 5:30 PM

Overall, I think that this makes sense. Thanks for proposing this.

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
3224	Please add a comment here explaining that you're guarding against both x/0 and INT_MIN/-1.
test/CodeGen/X86/divide-constant.ll
301	Can you check known bits? I feel like we should somehow know that `ptrtoint(@g)` isn't zero. For a test case, we can always do `ptrtoint(@g1)/(ptrtoint(@g2)-123456)` or similar.

jdoerfert added inline comments.Jun 8 2019, 9:29 AM

docs/LangRef.rst
3441	I'm not really happy with this formulation. It basically says that we somewhere define things to be UB and here we say that it is not UB if it is a constant.
lib/Analysis/ValueTracking.cpp
489	`V == I` implies `isa<Instruction>(V)` and you can use `isSafeToSpeculativelyExecute(I)`

Looks much better, the GISel changes are fine.

efriedma marked 2 inline comments as done.Jun 10 2019, 2:06 PM

efriedma added inline comments.

lib/Analysis/ValueTracking.cpp
489	I think you're reading this backwards; isSafeToSpeculativelyExecute only executes if `V != I`
test/CodeGen/X86/divide-constant.ll
301	I don't think there's any way to prove the value is non-zero here; it's extern_weak.

cameron.mcinally added a subscriber: cameron.mcinally.Jun 10 2019, 2:14 PM

hfinkel added inline comments.Jun 10 2019, 2:30 PM

test/CodeGen/X86/divide-constant.ll
301	Indeed. I missed that. I did mean the comment more generally - it seems like we should know that non-weak globals aren't zero. That having been said, looking at the implementation of SelectionDAG::isKnownNeverZero and SelectionDAG::computeKnownBits, etc. they don't seem to know anything about globals, so I suppose that enhancement there would also be needed in order to have an impact on this lowering.

nikic added a subscriber: nikic.Jun 10 2019, 2:33 PM

jdoerfert added inline comments.Jun 10 2019, 3:10 PM

lib/Analysis/ValueTracking.cpp
489	I was, :(

Address review comments, minor code cleanup, fix ConstantExpr::getAsInstruction, fix regression tests.

Is this still an RFC or by now an accumulation of changes we actually want to make? I added to comments assuming the latter.

lib/IR/Constants.cpp
2966	This can be reasonably separated or you could probably work with the `ValueOperands` or `Ops` array. At least I don't (immediately) see that we need to keep this connected to the RFC patch.
test/Transforms/LoopVectorize/X86/masked_load_store.ll
1505 ↗	(On Diff #203924)	I'm unsure about the sentence that talks about history. I think I'd prefer a statement about the semantic we have, thus, "constant expressions never trap, check ...". But I don't feel strongly about this.

Addressed review comments. Added release note. I think this patch contains all the changes necessary to reflect the change to IR semantics.

I'll send a brief email to llvmdev now, so everyone's aware this is changing.

I commented on the llvmdev thread, but instead of moving this complexity around, I'd really rather see it go away. We should never have supported div/rem constant expressions in the first place...

-Chris

mcberg2017 added a subscriber: mcberg2017.Jun 14 2019, 4:47 PM

efriedma planned changes to this revision.Mar 3 2022, 10:28 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 3 2022, 10:28 AM

Herald added a subscriber: pengfei. · View Herald Transcript

Revision Contents

Path

Size

docs/

LangRef.rst

4 lines

include/

llvm/

Analysis/

ValueTracking.h

2 lines

CodeGen/

GlobalISel/

IRTranslator.h

12 lines

IR/

Constant.h

4 lines

lib/

Analysis/

CodeMetrics.cpp

3 lines

ValueTracking.cpp

14 lines

CodeGen/

SelectionDAG/

FastISel.cpp

12 lines

SelectionDAGBuilder.h

9 lines

SelectionDAGBuilder.cpp

44 lines

SelectionDAGISel.cpp

43 lines

IR/

Constants.cpp

36 lines

Transforms/

Utils/

SimplifyCFG.cpp

61 lines

Vectorize/

LoopVectorizationLegality.cpp

26 lines

test/

CodeGen/

X86/

critical-edge-split-2.ll

2 lines

divide-constant.ll

390 lines

Diff 203646

docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,430 Lines • ▼ Show 20 Lines	``insertvalue (VAL, ELT, IDX0, IDX1, ...)``
The index list is interpreted in a similar manner as indices in a		The index list is interpreted in a similar manner as indices in a
':ref:`getelementptr <i_getelementptr>`' operation. At least one index		':ref:`getelementptr <i_getelementptr>`' operation. At least one index
value must be specified.		value must be specified.
``OPCODE (LHS, RHS)``		``OPCODE (LHS, RHS)``
Perform the specified operation of the LHS and RHS constants. OPCODE		Perform the specified operation of the LHS and RHS constants. OPCODE
may be any of the :ref:`binary <binaryops>` or :ref:`bitwise		may be any of the :ref:`binary <binaryops>` or :ref:`bitwise
binary <bitwiseops>` operations. The constraints on operands are		binary <bitwiseops>` operations. The constraints on operands are
the same as those for the corresponding instruction (e.g. no bitwise		the same as those for the corresponding instruction (e.g. no bitwise
operations on floating-point values are allowed).		operations on floating-point values are allowed). Constant expressions
		never have undefined behavior; division operations that would have
		undefined behavior instead produce poison.
		jdoerfertUnsubmitted Not Done Reply Inline Actions I'm not really happy with this formulation. It basically says that we somewhere define things to be UB and here we say that it is not UB if it is a constant. jdoerfert: I'm not really happy with this formulation. It basically says that we somewhere define things…

Other Values		Other Values
============		============

.. _inlineasmexprs:		.. _inlineasmexprs:

Inline Assembler Expressions		Inline Assembler Expressions
----------------------------		----------------------------
▲ Show 20 Lines • Show All 13,797 Lines • Show Last 20 Lines

include/llvm/Analysis/ValueTracking.h

Show First 20 Lines • Show All 382 Lines • ▼ Show 20 Lines	class Value;
///		///
/// If the CtxI is NOT specified this method only looks at the instruction		/// If the CtxI is NOT specified this method only looks at the instruction
/// itself and its operands, so if this method returns true, it is safe to		/// itself and its operands, so if this method returns true, it is safe to
/// move the instruction as long as the correct dominance relationships for		/// move the instruction as long as the correct dominance relationships for
/// the operands and users hold.		/// the operands and users hold.
///		///
/// This method can return true for instructions that read memory;		/// This method can return true for instructions that read memory;
/// for such instructions, moving them may change the resulting value.		/// for such instructions, moving them may change the resulting value.
bool isSafeToSpeculativelyExecute(const Value *V,		bool isSafeToSpeculativelyExecute(const Instruction *I,
const Instruction *CtxI = nullptr,		const Instruction *CtxI = nullptr,
const DominatorTree *DT = nullptr);		const DominatorTree *DT = nullptr);

/// Returns true if the result or effects of the given instructions \p I		/// Returns true if the result or effects of the given instructions \p I
/// depend on or influence global memory.		/// depend on or influence global memory.
/// Memory dependence arises for example if the instruction reads from		/// Memory dependence arises for example if the instruction reads from
/// memory or may produce effects or undefined behaviour. Memory dependent		/// memory or may produce effects or undefined behaviour. Memory dependent
/// instructions generally cannot be reorderd with respect to other memory		/// instructions generally cannot be reorderd with respect to other memory
▲ Show 20 Lines • Show All 237 Lines • Show Last 20 Lines

include/llvm/CodeGen/GlobalISel/IRTranslator.h

Show First 20 Lines • Show All 331 Lines • ▼ Show 20 Lines	private:
bool translateOr(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateOr(const User &U, MachineIRBuilder &MIRBuilder) {
return translateBinaryOp(TargetOpcode::G_OR, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_OR, U, MIRBuilder);
}		}
bool translateXor(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateXor(const User &U, MachineIRBuilder &MIRBuilder) {
return translateBinaryOp(TargetOpcode::G_XOR, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_XOR, U, MIRBuilder);
}		}

bool translateUDiv(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateUDiv(const User &U, MachineIRBuilder &MIRBuilder) {
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(U))
		return false;
return translateBinaryOp(TargetOpcode::G_UDIV, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_UDIV, U, MIRBuilder);
}		}
bool translateSDiv(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateSDiv(const User &U, MachineIRBuilder &MIRBuilder) {
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(U))
		return false;
return translateBinaryOp(TargetOpcode::G_SDIV, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SDIV, U, MIRBuilder);
}		}
bool translateURem(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateURem(const User &U, MachineIRBuilder &MIRBuilder) {
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(U))
		return false;
return translateBinaryOp(TargetOpcode::G_UREM, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_UREM, U, MIRBuilder);
}		}
bool translateSRem(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateSRem(const User &U, MachineIRBuilder &MIRBuilder) {
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(U))
		return false;
return translateBinaryOp(TargetOpcode::G_SREM, U, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SREM, U, MIRBuilder);
}		}
bool translateIntToPtr(const User &U, MachineIRBuilder &MIRBuilder) {		bool translateIntToPtr(const User &U, MachineIRBuilder &MIRBuilder) {
return translateCast(TargetOpcode::G_INTTOPTR, U, MIRBuilder);		return translateCast(TargetOpcode::G_INTTOPTR, U, MIRBuilder);
}		}
bool translatePtrToInt(const User &U, MachineIRBuilder &MIRBuilder) {		bool translatePtrToInt(const User &U, MachineIRBuilder &MIRBuilder) {
return translateCast(TargetOpcode::G_PTRTOINT, U, MIRBuilder);		return translateCast(TargetOpcode::G_PTRTOINT, U, MIRBuilder);
}		}
▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

include/llvm/IR/Constant.h

Show First 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	public:
/// Return true if this is a vector constant that includes any undefined		/// Return true if this is a vector constant that includes any undefined
/// elements.		/// elements.
bool containsUndefElement() const;		bool containsUndefElement() const;

/// Return true if this is a vector constant that includes any constant		/// Return true if this is a vector constant that includes any constant
/// expressions.		/// expressions.
bool containsConstantExpression() const;		bool containsConstantExpression() const;

/// Return true if evaluation of this constant could trap. This is true for
/// things like constant expressions that could divide by zero.
bool canTrap() const;

/// Return true if the value can vary between threads.		/// Return true if the value can vary between threads.
bool isThreadDependent() const;		bool isThreadDependent() const;

/// Return true if the value is dependent on a dllimport variable.		/// Return true if the value is dependent on a dllimport variable.
bool isDLLImportDependent() const;		bool isDLLImportDependent() const;

/// Return true if the constant has users other than constant expressions and		/// Return true if the constant has users other than constant expressions and
/// other dangling things.		/// other dangling things.
▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

lib/Analysis/CodeMetrics.cpp

Show All 28 Lines	appendSpeculatableOperands(const Value *V,
SmallPtrSetImpl<const Value *> &Visited,		SmallPtrSetImpl<const Value *> &Visited,
SmallVectorImpl<const Value *> &Worklist) {		SmallVectorImpl<const Value *> &Worklist) {
const User *U = dyn_cast<User>(V);		const User *U = dyn_cast<User>(V);
if (!U)		if (!U)
return;		return;

for (const Value *Operand : U->operands())		for (const Value *Operand : U->operands())
if (Visited.insert(Operand).second)		if (Visited.insert(Operand).second)
if (isSafeToSpeculativelyExecute(Operand))		if (!isa<Instruction>(Operand) \|\|
		isSafeToSpeculativelyExecute(cast<Instruction>(Operand)))
Worklist.push_back(Operand);		Worklist.push_back(Operand);
}		}

static void completeEphemeralValues(SmallPtrSetImpl<const Value *> &Visited,		static void completeEphemeralValues(SmallPtrSetImpl<const Value *> &Visited,
SmallVectorImpl<const Value *> &Worklist,		SmallVectorImpl<const Value *> &Worklist,
SmallPtrSetImpl<const Value *> &EphValues) {		SmallPtrSetImpl<const Value *> &EphValues) {
// Note: We don't speculate PHIs here, so we'll miss instruction chains kept		// Note: We don't speculate PHIs here, so we'll miss instruction chains kept
// alive only by ephemeral values.		// alive only by ephemeral values.
▲ Show 20 Lines • Show All 150 Lines • Show Last 20 Lines

lib/Analysis/ValueTracking.cpp

Show First 20 Lines • Show All 479 Lines • ▼ Show 20 Lines	while (!WorkSet.empty()) {

// If all uses of this value are ephemeral, then so is this value.		// If all uses of this value are ephemeral, then so is this value.
if (llvm::all_of(V->users(), [&](const User *U) {		if (llvm::all_of(V->users(), [&](const User *U) {
return EphValues.count(U);		return EphValues.count(U);
})) {		})) {
if (V == E)		if (V == E)
return true;		return true;

if (V == I \|\| isSafeToSpeculativelyExecute(V)) {		if (V == I \|\| !isa<Instruction>(V) \|\|
		isSafeToSpeculativelyExecute(cast<Instruction>(V))) {
		jdoerfertUnsubmitted Not Done Reply Inline Actions `V == I` implies `isa<Instruction>(V)` and you can use `isSafeToSpeculativelyExecute(I)` jdoerfert: `V == I` implies `isa<Instruction>(V)` and you can use `isSafeToSpeculativelyExecute(I)`
		efriedmaAuthorUnsubmitted Done Reply Inline Actions I think you're reading this backwards; isSafeToSpeculativelyExecute only executes if `V != I` efriedma: I think you're reading this backwards; isSafeToSpeculativelyExecute only executes if `V != I`
		jdoerfertUnsubmitted Not Done Reply Inline Actions I was, :( jdoerfert: I was, :(
EphValues.insert(V);		EphValues.insert(V);
if (const User *U = dyn_cast<User>(V))		if (const User *U = dyn_cast<User>(V))
for (User::const_op_iterator J = U->op_begin(), JE = U->op_end();		for (User::const_op_iterator J = U->op_begin(), JE = U->op_end();
J != JE; ++J)		J != JE; ++J)
WorkSet.push_back(*J);		WorkSet.push_back(*J);
}		}
}		}
}		}
▲ Show 20 Lines • Show All 3,393 Lines • ▼ Show 20 Lines	for (const User *U : V->users()) {
if (!II) return false;		if (!II) return false;

if (!II->isLifetimeStartOrEnd())		if (!II->isLifetimeStartOrEnd())
return false;		return false;
}		}
return true;		return true;
}		}

bool llvm::isSafeToSpeculativelyExecute(const Value *V,		bool llvm::isSafeToSpeculativelyExecute(const Instruction *Inst,
const Instruction *CtxI,		const Instruction *CtxI,
const DominatorTree *DT) {		const DominatorTree *DT) {
const Operator *Inst = dyn_cast<Operator>(V);
if (!Inst)
return false;

for (unsigned i = 0, e = Inst->getNumOperands(); i != e; ++i)
if (Constant *C = dyn_cast<Constant>(Inst->getOperand(i)))
if (C->canTrap())
return false;

switch (Inst->getOpcode()) {		switch (Inst->getOpcode()) {
default:		default:
return true;		return true;
case Instruction::UDiv:		case Instruction::UDiv:
case Instruction::URem: {		case Instruction::URem: {
// x / y is undefined if y == 0.		// x / y is undefined if y == 0.
const APInt *V;		const APInt *V;
if (match(Inst->getOperand(1), m_APInt(V)))		if (match(Inst->getOperand(1), m_APInt(V)))
▲ Show 20 Lines • Show All 1,793 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/FastISel.cpp

Show First 20 Lines • Show All 1,811 Lines • ▼ Show 20 Lines	if (match(I, m_FNeg(m_Value(X))))
return selectFNeg(I, X);		return selectFNeg(I, X);
return selectBinaryOp(I, ISD::FSUB);		return selectBinaryOp(I, ISD::FSUB);
}		}
case Instruction::Mul:		case Instruction::Mul:
return selectBinaryOp(I, ISD::MUL);		return selectBinaryOp(I, ISD::MUL);
case Instruction::FMul:		case Instruction::FMul:
return selectBinaryOp(I, ISD::FMUL);		return selectBinaryOp(I, ISD::FMUL);
case Instruction::SDiv:		case Instruction::SDiv:
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(I))
		return false;
return selectBinaryOp(I, ISD::SDIV);		return selectBinaryOp(I, ISD::SDIV);
case Instruction::UDiv:		case Instruction::UDiv:
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(I))
		return false;
return selectBinaryOp(I, ISD::UDIV);		return selectBinaryOp(I, ISD::UDIV);
case Instruction::FDiv:		case Instruction::FDiv:
return selectBinaryOp(I, ISD::FDIV);		return selectBinaryOp(I, ISD::FDIV);
case Instruction::SRem:		case Instruction::SRem:
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(I))
		return false;
return selectBinaryOp(I, ISD::SREM);		return selectBinaryOp(I, ISD::SREM);
case Instruction::URem:		case Instruction::URem:
		// Non-trapping div for ConstantExpr not yet implemented.
		if (isa<ConstantExpr>(I))
		return false;
return selectBinaryOp(I, ISD::UREM);		return selectBinaryOp(I, ISD::UREM);
case Instruction::FRem:		case Instruction::FRem:
return selectBinaryOp(I, ISD::FREM);		return selectBinaryOp(I, ISD::FREM);
case Instruction::Shl:		case Instruction::Shl:
return selectBinaryOp(I, ISD::SHL);		return selectBinaryOp(I, ISD::SHL);
case Instruction::LShr:		case Instruction::LShr:
return selectBinaryOp(I, ISD::SRL);		return selectBinaryOp(I, ISD::SRL);
case Instruction::AShr:		case Instruction::AShr:
▲ Show 20 Lines • Show All 650 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h

Show First 20 Lines • Show All 872 Lines • ▼ Show 20 Lines	private:
void visitCallBr(const CallBrInst &I);		void visitCallBr(const CallBrInst &I);
void visitResume(const ResumeInst &I);		void visitResume(const ResumeInst &I);

void visitUnary(const User &I, unsigned Opcode);		void visitUnary(const User &I, unsigned Opcode);
void visitFNeg(const User &I) { visitUnary(I, ISD::FNEG); }		void visitFNeg(const User &I) { visitUnary(I, ISD::FNEG); }

void visitBinary(const User &I, unsigned Opcode);		void visitBinary(const User &I, unsigned Opcode);
void visitShift(const User &I, unsigned Opcode);		void visitShift(const User &I, unsigned Opcode);
		void visitDivRem(const User &I, unsigned Opcode);
void visitAdd(const User &I) { visitBinary(I, ISD::ADD); }		void visitAdd(const User &I) { visitBinary(I, ISD::ADD); }
void visitFAdd(const User &I) { visitBinary(I, ISD::FADD); }		void visitFAdd(const User &I) { visitBinary(I, ISD::FADD); }
void visitSub(const User &I) { visitBinary(I, ISD::SUB); }		void visitSub(const User &I) { visitBinary(I, ISD::SUB); }
void visitFSub(const User &I);		void visitFSub(const User &I);
void visitMul(const User &I) { visitBinary(I, ISD::MUL); }		void visitMul(const User &I) { visitBinary(I, ISD::MUL); }
void visitFMul(const User &I) { visitBinary(I, ISD::FMUL); }		void visitFMul(const User &I) { visitBinary(I, ISD::FMUL); }
void visitURem(const User &I) { visitBinary(I, ISD::UREM); }		void visitURem(const User &I) { visitDivRem(I, ISD::UREM); }
void visitSRem(const User &I) { visitBinary(I, ISD::SREM); }		void visitSRem(const User &I) { visitDivRem(I, ISD::SREM); }
void visitFRem(const User &I) { visitBinary(I, ISD::FREM); }		void visitFRem(const User &I) { visitBinary(I, ISD::FREM); }
void visitUDiv(const User &I) { visitBinary(I, ISD::UDIV); }		void visitUDiv(const User &I) { visitDivRem(I, ISD::UDIV); }
void visitSDiv(const User &I);		void visitSDiv(const User &I) { visitDivRem(I, ISD::SDIV); }
void visitFDiv(const User &I) { visitBinary(I, ISD::FDIV); }		void visitFDiv(const User &I) { visitBinary(I, ISD::FDIV); }
void visitAnd (const User &I) { visitBinary(I, ISD::AND); }		void visitAnd (const User &I) { visitBinary(I, ISD::AND); }
void visitOr (const User &I) { visitBinary(I, ISD::OR); }		void visitOr (const User &I) { visitBinary(I, ISD::OR); }
void visitXor (const User &I) { visitBinary(I, ISD::XOR); }		void visitXor (const User &I) { visitBinary(I, ISD::XOR); }
void visitShl (const User &I) { visitShift(I, ISD::SHL); }		void visitShl (const User &I) { visitShift(I, ISD::SHL); }
void visitLShr(const User &I) { visitShift(I, ISD::SRL); }		void visitLShr(const User &I) { visitShift(I, ISD::SRL); }
void visitAShr(const User &I) { visitShift(I, ISD::SRA); }		void visitAShr(const User &I) { visitShift(I, ISD::SRA); }
void visitICmp(const User &I);		void visitICmp(const User &I);
▲ Show 20 Lines • Show All 200 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,200 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::visitShift(const User &I, unsigned Opcode) {
Flags.setExact(exact);		Flags.setExact(exact);
Flags.setNoSignedWrap(nsw);		Flags.setNoSignedWrap(nsw);
Flags.setNoUnsignedWrap(nuw);		Flags.setNoUnsignedWrap(nuw);
SDValue Res = DAG.getNode(Opcode, getCurSDLoc(), Op1.getValueType(), Op1, Op2,		SDValue Res = DAG.getNode(Opcode, getCurSDLoc(), Op1.getValueType(), Op1, Op2,
Flags);		Flags);
setValue(&I, Res);		setValue(&I, Res);
}		}

void SelectionDAGBuilder::visitSDiv(const User &I) {		void SelectionDAGBuilder::visitDivRem(const User &I, unsigned Opcode) {
		if (!isa<Constant>(I))
		return visitBinary(I, Opcode);

		// Constants aren't allowed to trap, so we have to do something
		// a bit trickier.
		//
		// FIXME: Some targets have a cheap non-trapping div.
SDValue Op1 = getValue(I.getOperand(0));		SDValue Op1 = getValue(I.getOperand(0));
SDValue Op2 = getValue(I.getOperand(1));		SDValue Op2 = getValue(I.getOperand(1));
		SDLoc dl(getCurSDLoc());
SDNodeFlags Flags;		EVT VT = Op1.getValueType();
Flags.setExact(isa<PossiblyExactOperator>(&I) &&		if (Opcode == ISD::UDIV \|\| Opcode == ISD::UREM) {
cast<PossiblyExactOperator>(&I)->isExact());		Op2 = DAG.getNode(ISD::UMAX, dl, VT, Op2, DAG.getConstant(1, dl, VT));
setValue(&I, DAG.getNode(ISD::SDIV, getCurSDLoc(), Op1.getValueType(), Op1,		} else {
Op2, Flags));		auto &TLI = DAG.getTargetLoweringInfo();
		hfinkelUnsubmitted Not Done Reply Inline Actions Please add a comment here explaining that you're guarding against both x/0 and INT_MIN/-1. hfinkel: Please add a comment here explaining that you're guarding against both x/0 and INT_MIN/-1.
		EVT CCVT =
		TLI.getSetCCResultType(DAG.getDataLayout(), *DAG.getContext(), VT);
		SDValue IsZero =
		DAG.getSetCC(dl, CCVT, Op2, DAG.getConstant(0, dl, VT), ISD::SETEQ);
		SDValue IsNegOne =
		DAG.getSetCC(dl, CCVT, Op2, DAG.getAllOnesConstant(dl, VT), ISD::SETEQ);
		auto IntMin = APInt::getSignedMinValue(VT.getScalarSizeInBits());
		SDValue IsIntMin = DAG.getSetCC(
		dl, CCVT, Op1, DAG.getConstant(IntMin, dl, VT), ISD::SETEQ);
		SDValue IsIntMinOverNegOne =
		DAG.getNode(ISD::AND, dl, CCVT, IsNegOne, IsIntMin);
		SDValue IsInvalid =
		DAG.getNode(ISD::OR, dl, CCVT, IsZero, IsIntMinOverNegOne);
		ISD::NodeType SelectOpCode = VT.isVector() ? ISD::VSELECT : ISD::SELECT;
		Op2 = DAG.getNode(SelectOpCode, dl, VT, IsInvalid,
		DAG.getConstant(1, dl, VT), Op2);
		}

		SDNodeFlags DivFlags;
		if (auto *ExactOp = dyn_cast<PossiblyExactOperator>(&I))
		DivFlags.setExact(ExactOp->isExact());
		SDValue BinNodeValue = DAG.getNode(Opcode, dl, VT, Op1, Op2, DivFlags);
		setValue(&I, BinNodeValue);
}		}

void SelectionDAGBuilder::visitICmp(const User &I) {		void SelectionDAGBuilder::visitICmp(const User &I) {
ICmpInst::Predicate predicate = ICmpInst::BAD_ICMP_PREDICATE;		ICmpInst::Predicate predicate = ICmpInst::BAD_ICMP_PREDICATE;
if (const ICmpInst *IC = dyn_cast<ICmpInst>(&I))		if (const ICmpInst *IC = dyn_cast<ICmpInst>(&I))
predicate = IC->getPredicate();		predicate = IC->getPredicate();
else if (const ConstantExpr *IC = dyn_cast<ConstantExpr>(&I))		else if (const ConstantExpr *IC = dyn_cast<ConstantExpr>(&I))
predicate = ICmpInst::Predicate(IC->getPredicate());		predicate = ICmpInst::Predicate(IC->getPredicate());
▲ Show 20 Lines • Show All 7,700 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

Show First 20 Lines • Show All 336 Lines • ▼ Show 20 Lines	void SelectionDAGISel::getAnalysisUsage(AnalysisUsage &AU) const {
AU.addPreserved<GCModuleInfo>();		AU.addPreserved<GCModuleInfo>();
AU.addRequired<TargetLibraryInfoWrapperPass>();		AU.addRequired<TargetLibraryInfoWrapperPass>();
AU.addRequired<TargetTransformInfoWrapperPass>();		AU.addRequired<TargetTransformInfoWrapperPass>();
if (UseMBPI && OptLevel != CodeGenOpt::None)		if (UseMBPI && OptLevel != CodeGenOpt::None)
AU.addRequired<BranchProbabilityInfoWrapperPass>();		AU.addRequired<BranchProbabilityInfoWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

/// SplitCriticalSideEffectEdges - Look for critical edges with a PHI value that
/// may trap on it. In this case we have to split the edge so that the path
/// through the predecessor block that doesn't go to the phi block doesn't
/// execute the possibly trapping instruction. If available, we pass domtree
/// and loop info to be updated when we split critical edges. This is because
/// SelectionDAGISel preserves these analyses.
/// This is required for correctness, so it must be done at -O0.
///
static void SplitCriticalSideEffectEdges(Function &Fn, DominatorTree *DT,
LoopInfo *LI) {
// Loop for blocks with phi nodes.
for (BasicBlock &BB : Fn) {
PHINode *PN = dyn_cast<PHINode>(BB.begin());
if (!PN) continue;

ReprocessBlock:
// For each block with a PHI node, check to see if any of the input values
// are potentially trapping constant expressions. Constant expressions are
// the only potentially trapping value that can occur as the argument to a
// PHI.
for (BasicBlock::iterator I = BB.begin(); (PN = dyn_cast<PHINode>(I)); ++I)
for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {
ConstantExpr *CE = dyn_cast<ConstantExpr>(PN->getIncomingValue(i));
if (!CE \|\| !CE->canTrap()) continue;

// The only case we have to worry about is when the edge is critical.
// Since this block has a PHI Node, we assume it has multiple input
// edges: check to see if the pred has multiple successors.
BasicBlock *Pred = PN->getIncomingBlock(i);
if (Pred->getTerminator()->getNumSuccessors() == 1)
continue;

// Okay, we have to split this edge.
SplitCriticalEdge(
Pred->getTerminator(), GetSuccessorNumber(Pred, &BB),
CriticalEdgeSplittingOptions(DT, LI).setMergeIdenticalEdges());
goto ReprocessBlock;
}
}
}

static void computeUsesMSVCFloatingPoint(const Triple &TT, const Function &F,		static void computeUsesMSVCFloatingPoint(const Triple &TT, const Function &F,
MachineModuleInfo &MMI) {		MachineModuleInfo &MMI) {
// Only needed for MSVC		// Only needed for MSVC
if (!TT.isKnownWindowsMSVCEnvironment())		if (!TT.isKnownWindowsMSVCEnvironment())
return;		return;

// If it's already set, nothing to do.		// If it's already set, nothing to do.
if (MMI.usesMSVCFloatingPoint())		if (MMI.usesMSVCFloatingPoint())
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	bool SelectionDAGISel::runOnMachineFunction(MachineFunction &mf) {
ORE = make_unique<OptimizationRemarkEmitter>(&Fn);		ORE = make_unique<OptimizationRemarkEmitter>(&Fn);
auto *DTWP = getAnalysisIfAvailable<DominatorTreeWrapperPass>();		auto *DTWP = getAnalysisIfAvailable<DominatorTreeWrapperPass>();
DominatorTree *DT = DTWP ? &DTWP->getDomTree() : nullptr;		DominatorTree *DT = DTWP ? &DTWP->getDomTree() : nullptr;
auto *LIWP = getAnalysisIfAvailable<LoopInfoWrapperPass>();		auto *LIWP = getAnalysisIfAvailable<LoopInfoWrapperPass>();
LoopInfo *LI = LIWP ? &LIWP->getLoopInfo() : nullptr;		LoopInfo *LI = LIWP ? &LIWP->getLoopInfo() : nullptr;

LLVM_DEBUG(dbgs() << "\n\n\n=== " << Fn.getName() << "\n");		LLVM_DEBUG(dbgs() << "\n\n\n=== " << Fn.getName() << "\n");

SplitCriticalSideEffectEdges(const_cast<Function &>(Fn), DT, LI);

CurDAG->init(MF, ORE, this, LibInfo,		CurDAG->init(MF, ORE, this, LibInfo,
getAnalysisIfAvailable<LegacyDivergenceAnalysis>());		getAnalysisIfAvailable<LegacyDivergenceAnalysis>());
FuncInfo->set(Fn, *MF, CurDAG);		FuncInfo->set(Fn, *MF, CurDAG);
SwiftError->setFunction(*MF);		SwiftError->setFunction(*MF);

// Now get the optional analyzes if we want to.		// Now get the optional analyzes if we want to.
// This is based on the possibly changed OptLevel (after optnone is taken		// This is based on the possibly changed OptLevel (after optnone is taken
// into account). That's unfortunate but OK because it just means we won't		// into account). That's unfortunate but OK because it just means we won't
▲ Show 20 Lines • Show All 3,175 Lines • Show Last 20 Lines

lib/IR/Constants.cpp

Show First 20 Lines • Show All 402 Lines • ▼ Show 20 Lines	#endif
// The constant should remove itself from our use list...		// The constant should remove itself from our use list...
assert((use_empty() \|\| user_back() != V) && "Constant not removed!");		assert((use_empty() \|\| user_back() != V) && "Constant not removed!");
}		}

// Value has no outstanding references it is safe to delete it now...		// Value has no outstanding references it is safe to delete it now...
delete this;		delete this;
}		}

static bool canTrapImpl(const Constant *C,
SmallPtrSetImpl<const ConstantExpr *> &NonTrappingOps) {
assert(C->getType()->isFirstClassType() && "Cannot evaluate aggregate vals!");
// The only thing that could possibly trap are constant exprs.
const ConstantExpr *CE = dyn_cast<ConstantExpr>(C);
if (!CE)
return false;

// ConstantExpr traps if any operands can trap.
for (unsigned i = 0, e = C->getNumOperands(); i != e; ++i) {
if (ConstantExpr *Op = dyn_cast<ConstantExpr>(CE->getOperand(i))) {
if (NonTrappingOps.insert(Op).second && canTrapImpl(Op, NonTrappingOps))
return true;
}
}

// Otherwise, only specific operations can trap.
switch (CE->getOpcode()) {
default:
return false;
case Instruction::UDiv:
case Instruction::SDiv:
case Instruction::URem:
case Instruction::SRem:
// Div and rem can trap if the RHS is not known to be non-zero.
if (!isa<ConstantInt>(CE->getOperand(1)) \|\|CE->getOperand(1)->isNullValue())
return true;
return false;
}
}

bool Constant::canTrap() const {
SmallPtrSet<const ConstantExpr *, 4> NonTrappingOps;
return canTrapImpl(this, NonTrappingOps);
}

/// Check if C contains a GlobalValue for which Predicate is true.		/// Check if C contains a GlobalValue for which Predicate is true.
static bool		static bool
ConstHasGlobalValuePredicate(const Constant *C,		ConstHasGlobalValuePredicate(const Constant *C,
bool (Predicate)(const GlobalValue )) {		bool (Predicate)(const GlobalValue )) {
SmallPtrSet<const Constant *, 8> Visited;		SmallPtrSet<const Constant *, 8> Visited;
SmallVector<const Constant *, 8> WorkList;		SmallVector<const Constant *, 8> WorkList;
WorkList.push_back(C);		WorkList.push_back(C);
Visited.insert(C);		Visited.insert(C);
▲ Show 20 Lines • Show All 2,503 Lines • ▼ Show 20 Lines	Value ConstantExpr::handleOperandChangeImpl(Value From, Value *ToV) {
// Update to the new value.		// Update to the new value.
return getContext().pImpl->ExprConstants.replaceOperandsInPlace(		return getContext().pImpl->ExprConstants.replaceOperandsInPlace(
NewOps, this, From, To, NumUpdated, OperandNo);		NewOps, this, From, To, NumUpdated, OperandNo);
}		}

Instruction *ConstantExpr::getAsInstruction() {		Instruction *ConstantExpr::getAsInstruction() {
SmallVector<Value *, 4> ValueOperands(op_begin(), op_end());		SmallVector<Value *, 4> ValueOperands(op_begin(), op_end());
ArrayRef<Value*> Ops(ValueOperands);		ArrayRef<Value*> Ops(ValueOperands);

jdoerfertUnsubmitted Not Done Reply Inline Actions This can be reasonably separated or you could probably work with the `ValueOperands` or `Ops` array. At least I don't (immediately) see that we need to keep this connected to the RFC patch. jdoerfert: This can be reasonably separated or you could probably work with the `ValueOperands` or `Ops`…
switch (getOpcode()) {		switch (getOpcode()) {
case Instruction::Trunc:		case Instruction::Trunc:
case Instruction::ZExt:		case Instruction::ZExt:
case Instruction::SExt:		case Instruction::SExt:
case Instruction::FPTrunc:		case Instruction::FPTrunc:
case Instruction::FPExt:		case Instruction::FPExt:
case Instruction::UIToFP:		case Instruction::UIToFP:
case Instruction::SIToFP:		case Instruction::SIToFP:
▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

lib/Transforms/Utils/SimplifyCFG.cpp

Show First 20 Lines • Show All 302 Lines • ▼ Show 20 Lines
}		}

/// Compute an abstract "cost" of speculating the given instruction,		/// Compute an abstract "cost" of speculating the given instruction,
/// which is assumed to be safe to speculate. TCC_Free means cheap,		/// which is assumed to be safe to speculate. TCC_Free means cheap,
/// TCC_Basic means less cheap, and TCC_Expensive means prohibitively		/// TCC_Basic means less cheap, and TCC_Expensive means prohibitively
/// expensive.		/// expensive.
static unsigned ComputeSpeculationCost(const User *I,		static unsigned ComputeSpeculationCost(const User *I,
const TargetTransformInfo &TTI) {		const TargetTransformInfo &TTI) {
assert(isSafeToSpeculativelyExecute(I) &&		assert(!isa<Instruction>(I) \|\|
		isSafeToSpeculativelyExecute(cast<Instruction>(I)) &&
"Instruction is not safe to speculatively execute!");		"Instruction is not safe to speculatively execute!");
return TTI.getUserCost(I);		return TTI.getUserCost(I);
}		}

/// If we have a merge point of an "if condition" as accepted above,		/// If we have a merge point of an "if condition" as accepted above,
/// return true if the specified value dominates the block. We		/// return true if the specified value dominates the block. We
/// don't handle the true generality of domination here, just a special case		/// don't handle the true generality of domination here, just a special case
/// which works well enough for us.		/// which works well enough for us.
Show All 18 Lines	static bool DominatesMergePoint(Value V, BasicBlock BB,
// It is possible to hit a zero-cost cycle (phi/gep instructions for example),		// It is possible to hit a zero-cost cycle (phi/gep instructions for example),
// so limit the recursion depth.		// so limit the recursion depth.
// TODO: While this recursion limit does prevent pathological behavior, it		// TODO: While this recursion limit does prevent pathological behavior, it
// would be better to track visited instructions to avoid cycles.		// would be better to track visited instructions to avoid cycles.
if (Depth == MaxSpeculationDepth)		if (Depth == MaxSpeculationDepth)
return false;		return false;

Instruction *I = dyn_cast<Instruction>(V);		Instruction *I = dyn_cast<Instruction>(V);
if (!I) {		if (!I)
// Non-instructions all dominate instructions, but not all constantexprs
// can be executed unconditionally.
if (ConstantExpr *C = dyn_cast<ConstantExpr>(V))
if (C->canTrap())
return false;
return true;		return true;
}
BasicBlock *PBB = I->getParent();		BasicBlock *PBB = I->getParent();

// We don't want to allow weird loops that might have the "if condition" in		// We don't want to allow weird loops that might have the "if condition" in
// the bottom of this block.		// the bottom of this block.
if (PBB == BB)		if (PBB == BB)
return false;		return false;

// If this instruction is defined in a block that contains an unconditional		// If this instruction is defined in a block that contains an unconditional
▲ Show 20 Lines • Show All 1,011 Lines • ▼ Show 20 Lines	for (PHINode &PN : Succ->phis()) {
if (BB1V == BB2V)		if (BB1V == BB2V)
continue;		continue;

// Check for passingValueIsAlwaysUndefined here because we would rather		// Check for passingValueIsAlwaysUndefined here because we would rather
// eliminate undefined control flow then converting it to a select.		// eliminate undefined control flow then converting it to a select.
if (passingValueIsAlwaysUndefined(BB1V, &PN) \|\|		if (passingValueIsAlwaysUndefined(BB1V, &PN) \|\|
passingValueIsAlwaysUndefined(BB2V, &PN))		passingValueIsAlwaysUndefined(BB2V, &PN))
return Changed;		return Changed;

if (isa<ConstantExpr>(BB1V) && !isSafeToSpeculativelyExecute(BB1V))
return Changed;
if (isa<ConstantExpr>(BB2V) && !isSafeToSpeculativelyExecute(BB2V))
return Changed;
}		}
}		}

// Okay, it is safe to hoist the terminator.		// Okay, it is safe to hoist the terminator.
Instruction *NT = I1->clone();		Instruction *NT = I1->clone();
BIParent->getInstList().insert(BI->getIterator(), NT);		BIParent->getInstList().insert(BI->getIterator(), NT);
if (!NT->getType()->isVoidTy()) {		if (!NT->getType()->isVoidTy()) {
I1->replaceAllUsesWith(NT);		I1->replaceAllUsesWith(NT);
▲ Show 20 Lines • Show All 657 Lines • ▼ Show 20 Lines	if (passingValueIsAlwaysUndefined(OrigV, &PN) \|\|
return false;		return false;

HaveRewritablePHIs = true;		HaveRewritablePHIs = true;
ConstantExpr *OrigCE = dyn_cast<ConstantExpr>(OrigV);		ConstantExpr *OrigCE = dyn_cast<ConstantExpr>(OrigV);
ConstantExpr *ThenCE = dyn_cast<ConstantExpr>(ThenV);		ConstantExpr *ThenCE = dyn_cast<ConstantExpr>(ThenV);
if (!OrigCE && !ThenCE)		if (!OrigCE && !ThenCE)
continue; // Known safe and cheap.		continue; // Known safe and cheap.

if ((ThenCE && !isSafeToSpeculativelyExecute(ThenCE)) \|\|
(OrigCE && !isSafeToSpeculativelyExecute(OrigCE)))
return false;
unsigned OrigCost = OrigCE ? ComputeSpeculationCost(OrigCE, TTI) : 0;		unsigned OrigCost = OrigCE ? ComputeSpeculationCost(OrigCE, TTI) : 0;
unsigned ThenCost = ThenCE ? ComputeSpeculationCost(ThenCE, TTI) : 0;		unsigned ThenCost = ThenCE ? ComputeSpeculationCost(ThenCE, TTI) : 0;
unsigned MaxCost =		unsigned MaxCost =
2 * PHINodeFoldingThreshold * TargetTransformInfo::TCC_Basic;		2 * PHINodeFoldingThreshold * TargetTransformInfo::TCC_Basic;
if (OrigCost + ThenCost > MaxCost)		if (OrigCost + ThenCost > MaxCost)
return false;		return false;

// Account for the cost of an unfolded ConstantExpr which could end up		// Account for the cost of an unfolded ConstantExpr which could end up
▲ Show 20 Lines • Show All 392 Lines • ▼ Show 20 Lines	static bool SimplifyCondBranchToTwoReturns(BranchInst *BI,
// Unwrap any PHI nodes in the return blocks.		// Unwrap any PHI nodes in the return blocks.
if (PHINode *TVPN = dyn_cast_or_null<PHINode>(TrueValue))		if (PHINode *TVPN = dyn_cast_or_null<PHINode>(TrueValue))
if (TVPN->getParent() == TrueSucc)		if (TVPN->getParent() == TrueSucc)
TrueValue = TVPN->getIncomingValueForBlock(BI->getParent());		TrueValue = TVPN->getIncomingValueForBlock(BI->getParent());
if (PHINode *FVPN = dyn_cast_or_null<PHINode>(FalseValue))		if (PHINode *FVPN = dyn_cast_or_null<PHINode>(FalseValue))
if (FVPN->getParent() == FalseSucc)		if (FVPN->getParent() == FalseSucc)
FalseValue = FVPN->getIncomingValueForBlock(BI->getParent());		FalseValue = FVPN->getIncomingValueForBlock(BI->getParent());

// In order for this transformation to be safe, we must be able to
// unconditionally execute both operands to the return. This is
// normally the case, but we could have a potentially-trapping
// constant expression that prevents this transformation from being
// safe.
if (ConstantExpr *TCV = dyn_cast_or_null<ConstantExpr>(TrueValue))
if (TCV->canTrap())
return false;
if (ConstantExpr *FCV = dyn_cast_or_null<ConstantExpr>(FalseValue))
if (FCV->canTrap())
return false;

// Okay, we collected all the mapped values and checked them for sanity, and		// Okay, we collected all the mapped values and checked them for sanity, and
// defined to really do this transformation. First, update the CFG.		// defined to really do this transformation. First, update the CFG.
TrueSucc->removePredecessor(BI->getParent());		TrueSucc->removePredecessor(BI->getParent());
FalseSucc->removePredecessor(BI->getParent());		FalseSucc->removePredecessor(BI->getParent());

// Insert select instructions where needed.		// Insert select instructions where needed.
Value *BrCond = BI->getCondition();		Value *BrCond = BI->getCondition();
if (TrueValue) {		if (TrueValue) {
▲ Show 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	for (auto I = BB->begin(); Cond != &*I; ++I) {
// Account for the cost of duplicating this instruction into each		// Account for the cost of duplicating this instruction into each
// predecessor.		// predecessor.
NumBonusInsts += PredCount;		NumBonusInsts += PredCount;
// Early exits once we reach the limit.		// Early exits once we reach the limit.
if (NumBonusInsts > BonusInstThreshold)		if (NumBonusInsts > BonusInstThreshold)
return false;		return false;
}		}

// Cond is known to be a compare or binary operator. Check to make sure that
// neither operand is a potentially-trapping constant expression.
if (ConstantExpr *CE = dyn_cast<ConstantExpr>(Cond->getOperand(0)))
if (CE->canTrap())
return false;
if (ConstantExpr *CE = dyn_cast<ConstantExpr>(Cond->getOperand(1)))
if (CE->canTrap())
return false;

// Finally, don't infinitely unroll conditional loops.		// Finally, don't infinitely unroll conditional loops.
BasicBlock *TrueDest = BI->getSuccessor(0);		BasicBlock *TrueDest = BI->getSuccessor(0);
BasicBlock *FalseDest = (BI->isConditional()) ? BI->getSuccessor(1) : nullptr;		BasicBlock *FalseDest = (BI->isConditional()) ? BI->getSuccessor(1) : nullptr;
if (TrueDest == BB \|\| FalseDest == BB)		if (TrueDest == BB \|\| FalseDest == BB)
return false;		return false;

for (pred_iterator PI = pred_begin(BB), E = pred_end(BB); PI != E; ++PI) {		for (pred_iterator PI = pred_begin(BB), E = pred_end(BB); PI != E; ++PI) {
BasicBlock PredBlock = PI;		BasicBlock PredBlock = PI;
▲ Show 20 Lines • Show All 585 Lines • ▼ Show 20 Lines	if (BlockIsSimpleEnoughToThreadThrough(BB)) {
}		}
}		}

BI->setCondition(NewPN);		BI->setCondition(NewPN);
return true;		return true;
}		}
}		}

if (auto *CE = dyn_cast<ConstantExpr>(BI->getCondition()))
if (CE->canTrap())
return false;

// If both branches are conditional and both contain stores to the same		// If both branches are conditional and both contain stores to the same
// address, remove the stores from the conditionals and create a conditional		// address, remove the stores from the conditionals and create a conditional
// merged store at the end.		// merged store at the end.
if (MergeCondStores && mergeConditionalStores(PBI, BI, DL))		if (MergeCondStores && mergeConditionalStores(PBI, BI, DL))
return true;		return true;

// If this is a conditional branch in an empty block, and if any		// If this is a conditional branch in an empty block, and if any
// predecessors are a conditional branch to one of our destinations,		// predecessors are a conditional branch to one of our destinations,
Show All 24 Lines	static bool SimplifyCondBranchToCondBranch(BranchInst PBI, BranchInst BI,
// isn't BB itself. If so, this is an infinite loop that will		// isn't BB itself. If so, this is an infinite loop that will
// keep getting unwound.		// keep getting unwound.
if (PBI->getSuccessor(PBIOp) == BB)		if (PBI->getSuccessor(PBIOp) == BB)
return false;		return false;

// Do not perform this transformation if it would require		// Do not perform this transformation if it would require
// insertion of a large number of select instructions. For targets		// insertion of a large number of select instructions. For targets
// without predication/cmovs, this is a big pessimization.		// without predication/cmovs, this is a big pessimization.

// Also do not perform this transformation if any phi node in the common
// destination block can trap when reached by BB or PBB (PR17073). In that
// case, it would be unsafe to hoist the operation into a select instruction.

BasicBlock *CommonDest = PBI->getSuccessor(PBIOp);		BasicBlock *CommonDest = PBI->getSuccessor(PBIOp);
unsigned NumPhis = 0;		unsigned NumPhis = 0;
for (BasicBlock::iterator II = CommonDest->begin(); isa<PHINode>(II);		for (BasicBlock::iterator II = CommonDest->begin(); isa<PHINode>(II);
++II, ++NumPhis) {		++II, ++NumPhis) {
if (NumPhis > 2) // Disable this xform.		if (NumPhis > 2) // Disable this xform.
return false;		return false;

PHINode *PN = cast<PHINode>(II);
Value *BIV = PN->getIncomingValueForBlock(BB);
if (ConstantExpr *CE = dyn_cast<ConstantExpr>(BIV))
if (CE->canTrap())
return false;

unsigned PBBIdx = PN->getBasicBlockIndex(PBI->getParent());
Value *PBIV = PN->getIncomingValue(PBBIdx);
if (ConstantExpr *CE = dyn_cast<ConstantExpr>(PBIV))
if (CE->canTrap())
return false;
}		}

// Finally, if everything is ok, fold the branches to logical ops.		// Finally, if everything is ok, fold the branches to logical ops.
BasicBlock *OtherDest = BI->getSuccessor(BIOp ^ 1);		BasicBlock *OtherDest = BI->getSuccessor(BIOp ^ 1);

LLVM_DEBUG(dbgs() << "FOLDING BRs:" << *PBI->getParent()		LLVM_DEBUG(dbgs() << "FOLDING BRs:" << *PBI->getParent()
<< "AND: " << *BI->getParent());		<< "AND: " << *BI->getParent());

▲ Show 20 Lines • Show All 2,797 Lines • Show Last 20 Lines

lib/Transforms/Vectorize/LoopVectorizationLegality.cpp

Show First 20 Lines • Show All 372 Lines • ▼ Show 20 Lines	static bool isUniformLoopNest(Loop Lp, Loop OuterLp) {
// Check if nested loops are uniform.		// Check if nested loops are uniform.
for (Loop SubLp : Lp)		for (Loop SubLp : Lp)
if (!isUniformLoopNest(SubLp, OuterLp))		if (!isUniformLoopNest(SubLp, OuterLp))
return false;		return false;

return true;		return true;
}		}

/// Check whether it is safe to if-convert this phi node.
///
/// Phi nodes with constant expressions that can trap are not safe to if
/// convert.
static bool canIfConvertPHINodes(BasicBlock *BB) {
for (PHINode &Phi : BB->phis()) {
for (Value *V : Phi.incoming_values())
if (auto *C = dyn_cast<Constant>(V))
if (C->canTrap())
return false;
}
return true;
}

static Type convertPointerToIntegerType(const DataLayout &DL, Type Ty) {		static Type convertPointerToIntegerType(const DataLayout &DL, Type Ty) {
if (Ty->isPointerTy())		if (Ty->isPointerTy())
return DL.getIntPtrType(Ty);		return DL.getIntPtrType(Ty);

// It is possible that char's or short's overflow when we ask for the loop's		// It is possible that char's or short's overflow when we ask for the loop's
// trip count, work around this by changing the type size.		// trip count, work around this by changing the type size.
if (Ty->getScalarSizeInBits() < 32)		if (Ty->getScalarSizeInBits() < 32)
return Type::getInt32Ty(Ty->getContext());		return Type::getInt32Ty(Ty->getContext());
▲ Show 20 Lines • Show All 469 Lines • ▼ Show 20 Lines	bool LoopVectorizationLegality::blockNeedsPredication(BasicBlock *BB) {
return LoopAccessInfo::blockNeedsPredication(BB, TheLoop, DT);		return LoopAccessInfo::blockNeedsPredication(BB, TheLoop, DT);
}		}

bool LoopVectorizationLegality::blockCanBePredicated(		bool LoopVectorizationLegality::blockCanBePredicated(
BasicBlock BB, SmallPtrSetImpl<Value > &SafePtrs) {		BasicBlock BB, SmallPtrSetImpl<Value > &SafePtrs) {
const bool IsAnnotatedParallel = TheLoop->isAnnotatedParallel();		const bool IsAnnotatedParallel = TheLoop->isAnnotatedParallel();

for (Instruction &I : *BB) {		for (Instruction &I : *BB) {
// Check that we don't have a constant expression that can trap as operand.
for (Value *Operand : I.operands()) {
if (auto *C = dyn_cast<Constant>(Operand))
if (C->canTrap())
return false;
}
// We might be able to hoist the load.		// We might be able to hoist the load.
if (I.mayReadFromMemory()) {		if (I.mayReadFromMemory()) {
auto *LI = dyn_cast<LoadInst>(&I);		auto *LI = dyn_cast<LoadInst>(&I);
if (!LI)		if (!LI)
return false;		return false;
if (!SafePtrs.count(LI->getPointerOperand())) {		if (!SafePtrs.count(LI->getPointerOperand())) {
// !llvm.mem.parallel_loop_access implies if-conversion safety.		// !llvm.mem.parallel_loop_access implies if-conversion safety.
// Otherwise, record that the load needs (real or emulated) masking		// Otherwise, record that the load needs (real or emulated) masking
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	for (BasicBlock *BB : TheLoop->blocks()) {
if (blockNeedsPredication(BB)) {		if (blockNeedsPredication(BB)) {
if (!blockCanBePredicated(BB, SafePointes)) {		if (!blockCanBePredicated(BB, SafePointes)) {
reportVectorizationFailure(		reportVectorizationFailure(
"Control flow cannot be substituted for a select",		"Control flow cannot be substituted for a select",
"control flow cannot be substituted for a select",		"control flow cannot be substituted for a select",
"NoCFGForSelect", BB->getTerminator());		"NoCFGForSelect", BB->getTerminator());
return false;		return false;
}		}
} else if (BB != Header && !canIfConvertPHINodes(BB)) {
reportVectorizationFailure(
"Control flow cannot be substituted for a select",
"control flow cannot be substituted for a select",
"NoCFGForSelect", BB->getTerminator());
return false;
}		}
}		}

// We can if-convert this loop.		// We can if-convert this loop.
return true;		return true;
}		}

// Helper function to canVectorizeLoopNestCFG.		// Helper function to canVectorizeLoopNestCFG.
▲ Show 20 Lines • Show All 241 Lines • Show Last 20 Lines

test/CodeGen/X86/critical-edge-split-2.ll

	Show All 15 Lines
	; CHECK-NEXT: jne .LBB0_2			; CHECK-NEXT: jne .LBB0_2
	; CHECK-NEXT: # %bb.1: # %cond.false.i			; CHECK-NEXT: # %bb.1: # %cond.false.i
	; CHECK-NEXT: movl $g_4, %eax			; CHECK-NEXT: movl $g_4, %eax
	; CHECK-NEXT: movl $g_2+4, %ecx			; CHECK-NEXT: movl $g_2+4, %ecx
	; CHECK-NEXT: xorl %esi, %esi			; CHECK-NEXT: xorl %esi, %esi
	; CHECK-NEXT: cmpq %rax, %rcx			; CHECK-NEXT: cmpq %rax, %rcx
	; CHECK-NEXT: sete %sil			; CHECK-NEXT: sete %sil
	; CHECK-NEXT: movl $1, %eax			; CHECK-NEXT: movl $1, %eax
				; CHECK-NEXT: cmovnel %eax, %esi
				; CHECK-NEXT: movl $1, %eax
	; CHECK-NEXT: xorl %edx, %edx			; CHECK-NEXT: xorl %edx, %edx
	; CHECK-NEXT: divl %esi			; CHECK-NEXT: divl %esi
	; CHECK-NEXT: movl %edx, %eax			; CHECK-NEXT: movl %edx, %eax
	; CHECK-NEXT: .LBB0_2: # %cond.end.i			; CHECK-NEXT: .LBB0_2: # %cond.end.i
	; CHECK-NEXT: # kill: def $ax killed $ax killed $eax			; CHECK-NEXT: # kill: def $ax killed $ax killed $eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	entry:			entry:
	br i1 %C, label %cond.end.i, label %cond.false.i			br i1 %C, label %cond.end.i, label %cond.false.i
	Show All 9 Lines

test/CodeGen/X86/divide-constant.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux-gnu -verify-machineinstrs \| FileCheck %s -check-prefix=SDAG
				; RUN: llc < %s -mtriple=x86_64-linux-gnu -fast-isel -verify-machineinstrs \| FileCheck %s -check-prefix=FAST
				; RUN: llc < %s -mtriple=x86_64-linux-gnu -global-isel -global-isel-abort=0 -verify-machineinstrs \| FileCheck %s -check-prefix=GLOBAL

				@g1 = extern_weak global i32
				@g2 = extern_weak global i32

				define i32 @test1(i1 %c) {
				; SDAG-LABEL: test1:
				; SDAG: # %bb.0: # %entry
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; SDAG-NEXT: movl $g2, %esi
				; SDAG-NEXT: movl $g2, %ecx
				; SDAG-NEXT: notl %ecx
				; SDAG-NEXT: orl %eax, %ecx
				; SDAG-NEXT: sete %al
				; SDAG-NEXT: testl %esi, %esi
				; SDAG-NEXT: sete %cl
				; SDAG-NEXT: orb %al, %cl
				; SDAG-NEXT: movl $1, %ecx
				; SDAG-NEXT: cmovnel %ecx, %esi
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: cltd
				; SDAG-NEXT: idivl %esi
				; SDAG-NEXT: testb $1, %dil
				; SDAG-NEXT: je .LBB0_2
				; SDAG-NEXT: # %bb.1:
				; SDAG-NEXT: movl %eax, %ecx
				; SDAG-NEXT: .LBB0_2: # %cond.end.i
				; SDAG-NEXT: movl %ecx, %eax
				; SDAG-NEXT: retq
				;
				; FAST-LABEL: test1:
				; FAST: # %bb.0: # %entry
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; FAST-NEXT: movl $g2, %esi
				; FAST-NEXT: movl $g2, %ecx
				; FAST-NEXT: notl %ecx
				; FAST-NEXT: orl %eax, %ecx
				; FAST-NEXT: sete %al
				; FAST-NEXT: testl %esi, %esi
				; FAST-NEXT: sete %cl
				; FAST-NEXT: orb %al, %cl
				; FAST-NEXT: movl $1, %ecx
				; FAST-NEXT: cmovnel %ecx, %esi
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: cltd
				; FAST-NEXT: idivl %esi
				; FAST-NEXT: testb $1, %dil
				; FAST-NEXT: je .LBB0_2
				; FAST-NEXT: # %bb.1:
				; FAST-NEXT: movl %eax, %ecx
				; FAST-NEXT: .LBB0_2: # %cond.end.i
				; FAST-NEXT: movl %ecx, %eax
				; FAST-NEXT: retq
				;
				; GLOBAL-LABEL: test1:
				; GLOBAL: # %bb.0: # %entry
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; GLOBAL-NEXT: movl $g2, %esi
				; GLOBAL-NEXT: movl $g2, %ecx
				; GLOBAL-NEXT: notl %ecx
				; GLOBAL-NEXT: orl %eax, %ecx
				; GLOBAL-NEXT: sete %al
				; GLOBAL-NEXT: testl %esi, %esi
				; GLOBAL-NEXT: sete %cl
				; GLOBAL-NEXT: orb %al, %cl
				; GLOBAL-NEXT: movl $1, %ecx
				; GLOBAL-NEXT: cmovnel %ecx, %esi
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: cltd
				; GLOBAL-NEXT: idivl %esi
				; GLOBAL-NEXT: testb $1, %dil
				; GLOBAL-NEXT: je .LBB0_2
				; GLOBAL-NEXT: # %bb.1:
				; GLOBAL-NEXT: movl %eax, %ecx
				; GLOBAL-NEXT: .LBB0_2: # %cond.end.i
				; GLOBAL-NEXT: movl %ecx, %eax
				; GLOBAL-NEXT: retq
				entry:
				br i1 %c, label %cond.end.i, label %cond.false.i

				cond.false.i:
				br label %cond.end.i

				cond.end.i:
				%r = phi i32 [ sdiv (i32 ptrtoint (i32* @g1 to i32), i32 ptrtoint (i32* @g2 to i32)), %entry ], [ 1, %cond.false.i ]
				ret i32 %r
				}

				define i32 @test2(i1 %c) {
				; SDAG-LABEL: test2:
				; SDAG: # %bb.0: # %entry
				; SDAG-NEXT: movl $g2, %esi
				; SDAG-NEXT: cmpl $1, %esi
				; SDAG-NEXT: movl $1, %ecx
				; SDAG-NEXT: cmovbel %ecx, %esi
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: xorl %edx, %edx
				; SDAG-NEXT: divl %esi
				; SDAG-NEXT: testb $1, %dil
				; SDAG-NEXT: je .LBB1_2
				; SDAG-NEXT: # %bb.1:
				; SDAG-NEXT: movl %eax, %ecx
				; SDAG-NEXT: .LBB1_2: # %cond.end.i
				; SDAG-NEXT: movl %ecx, %eax
				; SDAG-NEXT: retq
				;
				; FAST-LABEL: test2:
				; FAST: # %bb.0: # %entry
				; FAST-NEXT: movl $g2, %esi
				; FAST-NEXT: cmpl $1, %esi
				; FAST-NEXT: movl $1, %ecx
				; FAST-NEXT: cmovbel %ecx, %esi
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: xorl %edx, %edx
				; FAST-NEXT: divl %esi
				; FAST-NEXT: testb $1, %dil
				; FAST-NEXT: je .LBB1_2
				; FAST-NEXT: # %bb.1:
				; FAST-NEXT: movl %eax, %ecx
				; FAST-NEXT: .LBB1_2: # %cond.end.i
				; FAST-NEXT: movl %ecx, %eax
				; FAST-NEXT: retq
				;
				; GLOBAL-LABEL: test2:
				; GLOBAL: # %bb.0: # %entry
				; GLOBAL-NEXT: movl $g2, %esi
				; GLOBAL-NEXT: cmpl $1, %esi
				; GLOBAL-NEXT: movl $1, %ecx
				; GLOBAL-NEXT: cmovbel %ecx, %esi
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: xorl %edx, %edx
				; GLOBAL-NEXT: divl %esi
				; GLOBAL-NEXT: testb $1, %dil
				; GLOBAL-NEXT: je .LBB1_2
				; GLOBAL-NEXT: # %bb.1:
				; GLOBAL-NEXT: movl %eax, %ecx
				; GLOBAL-NEXT: .LBB1_2: # %cond.end.i
				; GLOBAL-NEXT: movl %ecx, %eax
				; GLOBAL-NEXT: retq
				entry:
				br i1 %c, label %cond.end.i, label %cond.false.i

				cond.false.i:
				br label %cond.end.i

				cond.end.i:
				%r = phi i32 [ udiv (i32 ptrtoint (i32* @g1 to i32), i32 ptrtoint (i32* @g2 to i32)), %entry ], [ 1, %cond.false.i ]
				ret i32 %r
				}

				define i32 @test3(i1 %c) {
				; SDAG-LABEL: test3:
				; SDAG: # %bb.0: # %entry
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; SDAG-NEXT: movl $g2, %esi
				; SDAG-NEXT: movl $g2, %ecx
				; SDAG-NEXT: notl %ecx
				; SDAG-NEXT: orl %eax, %ecx
				; SDAG-NEXT: sete %al
				; SDAG-NEXT: testl %esi, %esi
				; SDAG-NEXT: sete %cl
				; SDAG-NEXT: orb %al, %cl
				; SDAG-NEXT: movl $1, %ecx
				; SDAG-NEXT: cmovnel %ecx, %esi
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: cltd
				; SDAG-NEXT: idivl %esi
				; SDAG-NEXT: testb $1, %dil
				; SDAG-NEXT: je .LBB2_2
				; SDAG-NEXT: # %bb.1:
				; SDAG-NEXT: movl %edx, %ecx
				; SDAG-NEXT: .LBB2_2: # %cond.end.i
				; SDAG-NEXT: movl %ecx, %eax
				; SDAG-NEXT: retq
				;
				; FAST-LABEL: test3:
				; FAST: # %bb.0: # %entry
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; FAST-NEXT: movl $g2, %esi
				; FAST-NEXT: movl $g2, %ecx
				; FAST-NEXT: notl %ecx
				; FAST-NEXT: orl %eax, %ecx
				; FAST-NEXT: sete %al
				; FAST-NEXT: testl %esi, %esi
				; FAST-NEXT: sete %cl
				; FAST-NEXT: orb %al, %cl
				; FAST-NEXT: movl $1, %ecx
				; FAST-NEXT: cmovnel %ecx, %esi
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: cltd
				; FAST-NEXT: idivl %esi
				; FAST-NEXT: testb $1, %dil
				; FAST-NEXT: je .LBB2_2
				; FAST-NEXT: # %bb.1:
				; FAST-NEXT: movl %edx, %ecx
				; FAST-NEXT: .LBB2_2: # %cond.end.i
				; FAST-NEXT: movl %ecx, %eax
				; FAST-NEXT: retq
				;
				; GLOBAL-LABEL: test3:
				; GLOBAL: # %bb.0: # %entry
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; GLOBAL-NEXT: movl $g2, %esi
				; GLOBAL-NEXT: movl $g2, %ecx
				; GLOBAL-NEXT: notl %ecx
				; GLOBAL-NEXT: orl %eax, %ecx
				; GLOBAL-NEXT: sete %al
				; GLOBAL-NEXT: testl %esi, %esi
				; GLOBAL-NEXT: sete %cl
				; GLOBAL-NEXT: orb %al, %cl
				; GLOBAL-NEXT: movl $1, %ecx
				; GLOBAL-NEXT: cmovnel %ecx, %esi
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: cltd
				; GLOBAL-NEXT: idivl %esi
				; GLOBAL-NEXT: testb $1, %dil
				; GLOBAL-NEXT: je .LBB2_2
				; GLOBAL-NEXT: # %bb.1:
				; GLOBAL-NEXT: movl %edx, %ecx
				; GLOBAL-NEXT: .LBB2_2: # %cond.end.i
				; GLOBAL-NEXT: movl %ecx, %eax
				; GLOBAL-NEXT: retq
				entry:
				br i1 %c, label %cond.end.i, label %cond.false.i

				cond.false.i:
				br label %cond.end.i

				cond.end.i:
				%r = phi i32 [ srem (i32 ptrtoint (i32* @g1 to i32), i32 ptrtoint (i32* @g2 to i32)), %entry ], [ 1, %cond.false.i ]
				ret i32 %r
				}

				define i32 @test4(i1 %c) {
				; SDAG-LABEL: test4:
				; SDAG: # %bb.0: # %entry
				; SDAG-NEXT: movl $g2, %esi
				; SDAG-NEXT: cmpl $1, %esi
				; SDAG-NEXT: movl $1, %ecx
				; SDAG-NEXT: cmovbel %ecx, %esi
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: xorl %edx, %edx
				; SDAG-NEXT: divl %esi
				; SDAG-NEXT: testb $1, %dil
				; SDAG-NEXT: je .LBB3_2
				; SDAG-NEXT: # %bb.1:
				; SDAG-NEXT: movl %edx, %ecx
				; SDAG-NEXT: .LBB3_2: # %cond.end.i
				; SDAG-NEXT: movl %ecx, %eax
				; SDAG-NEXT: retq
				;
				; FAST-LABEL: test4:
				; FAST: # %bb.0: # %entry
				; FAST-NEXT: movl $g2, %esi
				; FAST-NEXT: cmpl $1, %esi
				; FAST-NEXT: movl $1, %ecx
				; FAST-NEXT: cmovbel %ecx, %esi
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: xorl %edx, %edx
				; FAST-NEXT: divl %esi
				; FAST-NEXT: testb $1, %dil
				; FAST-NEXT: je .LBB3_2
				; FAST-NEXT: # %bb.1:
				; FAST-NEXT: movl %edx, %ecx
				; FAST-NEXT: .LBB3_2: # %cond.end.i
				; FAST-NEXT: movl %ecx, %eax
				; FAST-NEXT: retq
				;
				; GLOBAL-LABEL: test4:
				; GLOBAL: # %bb.0: # %entry
				; GLOBAL-NEXT: movl $g2, %esi
				; GLOBAL-NEXT: cmpl $1, %esi
				; GLOBAL-NEXT: movl $1, %ecx
				; GLOBAL-NEXT: cmovbel %ecx, %esi
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: xorl %edx, %edx
				; GLOBAL-NEXT: divl %esi
				; GLOBAL-NEXT: testb $1, %dil
				; GLOBAL-NEXT: je .LBB3_2
				; GLOBAL-NEXT: # %bb.1:
				; GLOBAL-NEXT: movl %edx, %ecx
				; GLOBAL-NEXT: .LBB3_2: # %cond.end.i
				; GLOBAL-NEXT: movl %ecx, %eax
				; GLOBAL-NEXT: retq
				entry:
				br i1 %c, label %cond.end.i, label %cond.false.i

				cond.false.i:
				br label %cond.end.i

				cond.end.i:
				%r = phi i32 [ urem (i32 ptrtoint (i32* @g1 to i32), i32 ptrtoint (i32* @g2 to i32)), %entry ], [ 1, %cond.false.i ]
				hfinkelUnsubmitted Not Done Reply Inline Actions Can you check known bits? I feel like we should somehow know that `ptrtoint(@g)` isn't zero. For a test case, we can always do `ptrtoint(@g1)/(ptrtoint(@g2)-123456)` or similar. hfinkel: Can you check known bits? I feel like we should somehow know that `ptrtoint(@g)` isn't zero.
				efriedmaAuthorUnsubmitted Done Reply Inline Actions I don't think there's any way to prove the value is non-zero here; it's extern_weak. efriedma: I don't think there's any way to prove the value is non-zero here; it's extern_weak.
				hfinkelUnsubmitted Not Done Reply Inline Actions Indeed. I missed that. I did mean the comment more generally - it seems like we should know that non-weak globals aren't zero. That having been said, looking at the implementation of SelectionDAG::isKnownNeverZero and SelectionDAG::computeKnownBits, etc. they don't seem to know anything about globals, so I suppose that enhancement there would also be needed in order to have an impact on this lowering. hfinkel: Indeed. I missed that. I did mean the comment more generally - it seems like we should know…
				ret i32 %r
				}

				define i32 @test5(i32 %c) {
				; SDAG-LABEL: test5:
				; SDAG: # %bb.0: # %entry
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; SDAG-NEXT: movl $g2, %ecx
				; SDAG-NEXT: movl $g2, %edx
				; SDAG-NEXT: notl %edx
				; SDAG-NEXT: orl %eax, %edx
				; SDAG-NEXT: sete %al
				; SDAG-NEXT: testl %ecx, %ecx
				; SDAG-NEXT: sete %dl
				; SDAG-NEXT: orb %al, %dl
				; SDAG-NEXT: movl $1, %eax
				; SDAG-NEXT: cmovnel %eax, %ecx
				; SDAG-NEXT: movl $g1, %eax
				; SDAG-NEXT: cltd
				; SDAG-NEXT: idivl %ecx
				; SDAG-NEXT: #APP
				; SDAG-NEXT: #NO_APP
				; SDAG-NEXT: .Ltmp0: # Block address taken
				; SDAG-NEXT: .LBB4_1: # %cond.false.i
				; SDAG-NEXT: movl $1, %eax
				; SDAG-NEXT: .LBB4_2: # %cond.end.i
				; SDAG-NEXT: retq
				;
				; FAST-LABEL: test5:
				; FAST: # %bb.0: # %entry
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; FAST-NEXT: movl $g2, %ecx
				; FAST-NEXT: movl $g2, %edx
				; FAST-NEXT: notl %edx
				; FAST-NEXT: orl %eax, %edx
				; FAST-NEXT: sete %al
				; FAST-NEXT: testl %ecx, %ecx
				; FAST-NEXT: sete %dl
				; FAST-NEXT: orb %al, %dl
				; FAST-NEXT: movl $1, %eax
				; FAST-NEXT: cmovnel %eax, %ecx
				; FAST-NEXT: movl $g1, %eax
				; FAST-NEXT: cltd
				; FAST-NEXT: idivl %ecx
				; FAST-NEXT: #APP
				; FAST-NEXT: #NO_APP
				; FAST-NEXT: .Ltmp0: # Block address taken
				; FAST-NEXT: .LBB4_1: # %cond.false.i
				; FAST-NEXT: movl $1, %eax
				; FAST-NEXT: .LBB4_2: # %cond.end.i
				; FAST-NEXT: retq
				;
				; GLOBAL-LABEL: test5:
				; GLOBAL: # %bb.0: # %entry
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: xorl $-2147483648, %eax # imm = 0x80000000
				; GLOBAL-NEXT: movl $g2, %ecx
				; GLOBAL-NEXT: movl $g2, %edx
				; GLOBAL-NEXT: notl %edx
				; GLOBAL-NEXT: orl %eax, %edx
				; GLOBAL-NEXT: sete %al
				; GLOBAL-NEXT: testl %ecx, %ecx
				; GLOBAL-NEXT: sete %dl
				; GLOBAL-NEXT: orb %al, %dl
				; GLOBAL-NEXT: movl $1, %eax
				; GLOBAL-NEXT: cmovnel %eax, %ecx
				; GLOBAL-NEXT: movl $g1, %eax
				; GLOBAL-NEXT: cltd
				; GLOBAL-NEXT: idivl %ecx
				; GLOBAL-NEXT: #APP
				; GLOBAL-NEXT: #NO_APP
				; GLOBAL-NEXT: .Ltmp0: # Block address taken
				; GLOBAL-NEXT: .LBB4_1: # %cond.false.i
				; GLOBAL-NEXT: movl $1, %eax
				; GLOBAL-NEXT: .LBB4_2: # %cond.end.i
				; GLOBAL-NEXT: retq
				entry:
				callbr void asm "", "r,X"(i32 %c, i8 *blockaddress(@test5, %cond.false.i))
				to label %cond.false.i [label %cond.end.i]

				cond.false.i:
				br label %cond.end.i

				cond.end.i:
				%r = phi i32 [ sdiv (i32 ptrtoint (i32* @g1 to i32), i32 ptrtoint (i32* @g2 to i32)), %entry ], [ 1, %cond.false.i ]
				ret i32 %r
				}

This is an archive of the discontinued LLVM Phabricator instance.

LLVM IR constant expressions never trap.Changes PlannedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 203646

docs/LangRef.rst

include/llvm/Analysis/ValueTracking.h

include/llvm/CodeGen/GlobalISel/IRTranslator.h

include/llvm/IR/Constant.h

lib/Analysis/CodeMetrics.cpp

lib/Analysis/ValueTracking.cpp

lib/CodeGen/SelectionDAG/FastISel.cpp

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

lib/IR/Constants.cpp

lib/Transforms/Utils/SimplifyCFG.cpp

lib/Transforms/Vectorize/LoopVectorizationLegality.cpp

test/CodeGen/X86/critical-edge-split-2.ll

test/CodeGen/X86/divide-constant.ll

LLVM IR constant expressions never trap.
Changes PlannedPublic