This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/Analysis/
-
llvm/
-
Analysis/
-
ScalarEvolution.h
-
ScalarEvolutionExpressions.h
-
lib/
-
Analysis/
-
ScalarEvolution.cpp
-
Transforms/Scalar/
-
Scalar/
-
LoopRerollPass.cpp

Differential D29790

[SCEV] Remove the walk of the entire expression subgraph on every lookup of a cached SCEV for a particular value.
Needs ReviewPublic

Authored by chandlerc on Feb 9 2017, 2:08 PM.

Download Raw Diff

This revision needs review, but there are no reviewers specified.

Details

Reviewers: None

Summary

This was added waaaaay back in r185843 to handle the issue where
a SCEVUnknown wrapped around an IR value is deleted when the IR value is
deleted. It becomes null and makes the entire expression subgraph
containing it invalid. The fix was to check for these null nodes in the
subgraph on every query because SCEVs don't have uselists to update.
Unfortunately, this means querying every SCEV in a large graph is
trivially quadratic. Bad news. A great example of this is the unittest
ScalarEvolutionTests::SCEVAddExpr which becomes quite slow with a larger
number of expressions. When the loop was at 1000, the test took well
over 20 seconds for me.

Unfortunately, we *really* don't want to add uselist based invalidation
here because it will make removing instruction walk their subgraphs!
Especially considering that a common pattern is to delete the IR and
then forget the loop, this would be a big waste.

So this patch introduces a different approach. We add a generation count
to ScalarEvolution, and for composite expression SCEVs (SCEVExpr base
class in this patch, but a better name would be welocme) we track the
generation count at which the SCEV was valid. Then, each time the value
handle invalidates a SCEVUnknown node at the leaves of a subgraph it
also moves the generation count forward. When we validate, we check the
subgraphs with old generation counts, *and update the generation count
where valid*. We also clear out the part of the subgraph that we
trivially know is invalid rather than leaving it around to consume
memory. This essentially allows the validation of a subgraph to be
cached *between* queries to SCEVs.

Now, as long as the deletion of the IR and the queries to SCEV are
*batched*, the quadratic component falls away. This seems very likely to
be true in practice and certainly is true in the unittest. That unittest
with the loop size set to 1000 is over 5x faster with this patch.
Naturally, I can magnify this difference by making the size of the graph
larger. =]

When discussing this, there were some initial concerns around memory
overhead, but because this is isolated in compound expressions,
I suspect it will be fine. The largest overhead is likely from the unary
cast expressions, but even there it seems likely to be a reasonable cost
given the other things cached for every SCEV.

Unfortunately (but maybe fortunately?), this makes SCEV *much* more strict
about caching SCEVs across IR mutations because that can cause it to build new
expressions out of these cached SCEVs with skewed generations. The only failure
of this kind in 'check-llvm' was loop reroll which I've fixed here to forget
the loop after it deletes tons of instructions and not cache things across that
deletion.

However, if you run with this patch and assertions across the test suite, the
unroller also triggers the assert that catches skewed generations when building
new expresisons. This one is especially concerning as it appears to happen
before the unroller modifies any IR, making it look like some other pass forgot
to call forgetLoop? Unclear. It's also generally unclear whether we want this
degree of strictness, or if we don't, how/when we should re-validate cached
SCEVs.

I'm sadly out of time to keep hacking on this, but I wanted to send it out in
case others are interested in pushing this forward.

Diff Detail

Build Status

Buildable 3842
Build 3842: arc lint + arc unit

Event Timeline

chandlerc created this revision.Feb 9 2017, 2:08 PM

Herald added subscribers: mzolotukhin, mcrosier. · View Herald TranscriptFeb 9 2017, 2:08 PM

Revision Contents

Path

Size

include/

llvm/

Analysis/

ScalarEvolution.h

30 lines

ScalarEvolutionExpressions.h

116 lines

lib/

Analysis/

ScalarEvolution.cpp

226 lines

Transforms/

Scalar/

LoopRerollPass.cpp

35 lines

Diff 87877

include/llvm/Analysis/ScalarEvolution.h

Show First 20 Lines • Show All 480 Lines • ▼ Show 20 Lines	private:
/// The dominator tree.		/// The dominator tree.
///		///
DominatorTree &DT;		DominatorTree &DT;

/// The loop information for the function we are currently analyzing.		/// The loop information for the function we are currently analyzing.
///		///
LoopInfo &LI;		LoopInfo &LI;

		/// The generation count of the analysis.
		///
		/// This is used by various parts of the analysis to track which SCEVs have
		/// been lazily updated after some partial invalidation event.
		uint64_t Generation;

/// This SCEV is used to represent unknown trip counts and things.		/// This SCEV is used to represent unknown trip counts and things.
std::unique_ptr<SCEVCouldNotCompute> CouldNotCompute;		std::unique_ptr<SCEVCouldNotCompute> CouldNotCompute;

/// The typedef for HasRecMap.		/// The typedef for HasRecMap.
///		///
typedef DenseMap<const SCEV *, bool> HasRecMapType;		typedef DenseMap<const SCEV *, bool> HasRecMapType;

/// This is a cache to record whether a SCEV contains any scAddRecExpr.		/// This is a cache to record whether a SCEV contains any scAddRecExpr.
▲ Show 20 Lines • Show All 543 Lines • ▼ Show 20 Lines	private:
Optional<APInt> computeConstantDifference(const SCEV LHS, const SCEV RHS);		Optional<APInt> computeConstantDifference(const SCEV LHS, const SCEV RHS);

/// Drop memoized information computed for S.		/// Drop memoized information computed for S.
void forgetMemoizedResults(const SCEV *S);		void forgetMemoizedResults(const SCEV *S);

/// Return an existing SCEV for V if there is one, otherwise return nullptr.		/// Return an existing SCEV for V if there is one, otherwise return nullptr.
const SCEV getExistingSCEV(Value V);		const SCEV getExistingSCEV(Value V);

/// Return false iff given SCEV contains a SCEVUnknown with NULL value-		/// Validate a SCEV's subgraph.
/// pointer.		///
bool checkValidity(const SCEV *S) const;		/// This will ensure that the given SCEV and all SCEVs it transitively
		/// references are valid. If any part of the subgraph is detected to be
		/// invalid, this will remove those SCEVs from the analysis and return false.
		/// Note that this may not remove all invalid SCEVs from the analysis, but
		/// will definitively remove `S` if invalid and will remove others
		/// optimistically.
		bool validateSCEVSubgraph(const SCEV *S);

/// Return true if `ExtendOpTy`({`Start`,+,`Step`}) can be proved to be		/// Return true if `ExtendOpTy`({`Start`,+,`Step`}) can be proved to be
/// equal to {`ExtendOpTy`(`Start`),+,`ExtendOpTy`(`Step`)}. This is		/// equal to {`ExtendOpTy`(`Start`),+,`ExtendOpTy`(`Step`)}. This is
/// equivalent to proving no signed (resp. unsigned) wrap in		/// equivalent to proving no signed (resp. unsigned) wrap in
/// {`Start`,+,`Step`} if `ExtendOpTy` is `SCEVSignExtendExpr`		/// {`Start`,+,`Step`} if `ExtendOpTy` is `SCEVSignExtendExpr`
/// (resp. `SCEVZeroExtendExpr`).		/// (resp. `SCEVZeroExtendExpr`).
///		///
template <typename ExtendOpTy>		template <typename ExtendOpTy>
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	public:
/// Return true if the SCEV is a scAddRecExpr or it contains		/// Return true if the SCEV is a scAddRecExpr or it contains
/// scAddRecExpr. The result will be cached in HasRecMap.		/// scAddRecExpr. The result will be cached in HasRecMap.
///		///
bool containsAddRecurrence(const SCEV *S);		bool containsAddRecurrence(const SCEV *S);

/// Return the Value set from which the SCEV expr is generated.		/// Return the Value set from which the SCEV expr is generated.
SetVector<ValueOffsetPair> getSCEVValues(const SCEV S);		SetVector<ValueOffsetPair> getSCEVValues(const SCEV S);

		/// Increment the SCEV generation.
		///
		/// This shifts the generation count forward.
		void incrementSCEVGeneration();

/// Erase Value from ValueExprMap and ExprValueMap.		/// Erase Value from ValueExprMap and ExprValueMap.
void eraseValueFromMap(Value *V);		void eraseValueFromMap(Value *V);

/// Return a SCEV expression for the full generality of the specified		/// Return a SCEV expression for the full generality of the specified
/// expression.		/// expression.
const SCEV getSCEV(Value V);		const SCEV getSCEV(Value V);

const SCEV getConstant(ConstantInt V);		const SCEV getConstant(ConstantInt V);
▲ Show 20 Lines • Show All 477 Lines • ▼ Show 20 Lines	private:
/// the stride and the knowledge of NSW/NUW flags on the recurrence.		/// the stride and the knowledge of NSW/NUW flags on the recurrence.
bool doesIVOverflowOnGT(const SCEV RHS, const SCEV Stride, bool IsSigned,		bool doesIVOverflowOnGT(const SCEV RHS, const SCEV Stride, bool IsSigned,
bool NoWrap);		bool NoWrap);

/// Get add expr already created or create a new one		/// Get add expr already created or create a new one
const SCEV getOrCreateAddExpr(SmallVectorImpl<const SCEV > &Ops,		const SCEV getOrCreateAddExpr(SmallVectorImpl<const SCEV > &Ops,
SCEV::NoWrapFlags Flags);		SCEV::NoWrapFlags Flags);

		SCEV lookupUniqueSCEV(const FoldingSetNodeID &ID, void &IP) {
		if (SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP))
		if (validateSCEVSubgraph(S))
		return S;
		return nullptr;
		}

private:		private:
FoldingSet<SCEV> UniqueSCEVs;		FoldingSet<SCEV> UniqueSCEVs;
FoldingSet<SCEVPredicate> UniquePreds;		FoldingSet<SCEVPredicate> UniquePreds;
BumpPtrAllocator SCEVAllocator;		BumpPtrAllocator SCEVAllocator;

/// The head of a linked list of all SCEVUnknown values that have been		/// The head of a linked list of all SCEVUnknown values that have been
/// allocated. This is used by releaseMemory to locate them all and call		/// allocated. This is used by releaseMemory to locate them all and call
/// their destructors.		/// their destructors.
▲ Show 20 Lines • Show All 137 Lines • Show Last 20 Lines

include/llvm/Analysis/ScalarEvolutionExpressions.h

Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	public:
Type *getType() const { return V->getType(); }		Type *getType() const { return V->getType(); }

/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scConstant;		return S->getSCEVType() == scConstant;
}		}
};		};

		/// This is the base class for all expression classes.
		///
		/// The common aspect to them is that they reference other SCEVs as part of
		/// their definition and so we need to be able to invalidate them when part
		/// of their expression graph changes.
		class SCEVExpr : public SCEV {
		private:
		// This is mutable as it inherently doesn't make up part of the SCEV state.
		mutable uint64_t Generation;

		public:
		explicit SCEVExpr(const FoldingSetNodeIDRef ID, unsigned SCEVTy,
		uint64_t CurrentGeneration)
		: SCEV(ID, SCEVTy), Generation(CurrentGeneration) {}

		uint64_t getGeneration() const { return Generation; }

		void setGeneration(uint64_t NewGen) const { Generation = NewGen; }

		/// Methods for support type inquiry through isa, cast, and dyn_cast:
		static inline bool classof(const SCEV *S) {
		switch (S->getSCEVType()) {
		case scTruncate:
		case scZeroExtend:
		case scSignExtend:
		case scAddExpr:
		case scMulExpr:
		case scSMaxExpr:
		case scUMaxExpr:
		case scAddRecExpr:
		case scUDivExpr:
		return true;

		case scUnknown:
		case scConstant:
		case scCouldNotCompute:
		return false;
		}
		llvm_unreachable("Uncovered SCEV type!");
		}

		#ifndef NDEBUG
		/// Helper to assert (in debug builds) that SCEVs have the current
		/// gerenation.
		static void verifyGeneration(ArrayRef<const SCEV *> SCEVs,
		uint64_t CurrentGeneration);
		#endif
		};

/// This is the base class for unary cast operator classes.		/// This is the base class for unary cast operator classes.
class SCEVCastExpr : public SCEV {		class SCEVCastExpr : public SCEVExpr {
protected:		protected:
const SCEV *Op;		const SCEV *Op;
Type *Ty;		Type *Ty;

SCEVCastExpr(const FoldingSetNodeIDRef ID,		SCEVCastExpr(const FoldingSetNodeIDRef ID,
unsigned SCEVTy, const SCEV op, Type ty);		unsigned SCEVTy, const SCEV op, Type ty,
		uint64_t CurrentGeneration);

public:		public:
const SCEV *getOperand() const { return Op; }		const SCEV *getOperand() const { return Op; }
Type *getType() const { return Ty; }		Type *getType() const { return Ty; }

/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scTruncate \|\|		return S->getSCEVType() == scTruncate \|\|
S->getSCEVType() == scZeroExtend \|\|		S->getSCEVType() == scZeroExtend \|\|
S->getSCEVType() == scSignExtend;		S->getSCEVType() == scSignExtend;
}		}
};		};

/// This class represents a truncation of an integer value to a		/// This class represents a truncation of an integer value to a
/// smaller integer value.		/// smaller integer value.
class SCEVTruncateExpr : public SCEVCastExpr {		class SCEVTruncateExpr : public SCEVCastExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVTruncateExpr(const FoldingSetNodeIDRef ID,		SCEVTruncateExpr(const FoldingSetNodeIDRef ID,
const SCEV op, Type ty);		const SCEV op, Type ty,
		uint64_t CurrentGeneration);

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scTruncate;		return S->getSCEVType() == scTruncate;
}		}
};		};

/// This class represents a zero extension of a small integer value		/// This class represents a zero extension of a small integer value
/// to a larger integer value.		/// to a larger integer value.
class SCEVZeroExtendExpr : public SCEVCastExpr {		class SCEVZeroExtendExpr : public SCEVCastExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVZeroExtendExpr(const FoldingSetNodeIDRef ID,		SCEVZeroExtendExpr(const FoldingSetNodeIDRef ID,
const SCEV op, Type ty);		const SCEV op, Type ty,
		uint64_t CurrentGeneration);

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scZeroExtend;		return S->getSCEVType() == scZeroExtend;
}		}
};		};

/// This class represents a sign extension of a small integer value		/// This class represents a sign extension of a small integer value
/// to a larger integer value.		/// to a larger integer value.
class SCEVSignExtendExpr : public SCEVCastExpr {		class SCEVSignExtendExpr : public SCEVCastExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVSignExtendExpr(const FoldingSetNodeIDRef ID,		SCEVSignExtendExpr(const FoldingSetNodeIDRef ID,
const SCEV op, Type ty);		const SCEV op, Type ty,
		uint64_t CurrentGeneration);

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scSignExtend;		return S->getSCEVType() == scSignExtend;
}		}
};		};


/// This node is a base class providing common functionality for		/// This node is a base class providing common functionality for
/// n'ary operators.		/// n'ary operators.
class SCEVNAryExpr : public SCEV {		class SCEVNAryExpr : public SCEVExpr {
protected:		protected:
// Since SCEVs are immutable, ScalarEvolution allocates operand		// Since SCEVs are immutable, ScalarEvolution allocates operand
// arrays with its SCEVAllocator, so this class just needs a simple		// arrays with its SCEVAllocator, so this class just needs a simple
// pointer rather than a more elaborate vector-like data structure.		// pointer rather than a more elaborate vector-like data structure.
// This also avoids the need for a non-trivial destructor.		// This also avoids the need for a non-trivial destructor.
const SCEV const Operands;		const SCEV const Operands;
size_t NumOperands;		size_t NumOperands;

SCEVNAryExpr(const FoldingSetNodeIDRef ID,		SCEVNAryExpr(const FoldingSetNodeIDRef ID, enum SCEVTypes T,
enum SCEVTypes T, const SCEV const O, size_t N)		const SCEV const O, size_t N, uint64_t CurrentGeneration)
: SCEV(ID, T), Operands(O), NumOperands(N) {}		: SCEVExpr(ID, T, CurrentGeneration), Operands(O), NumOperands(N) {
		#ifndef NDEBUG
		verifyGeneration(makeArrayRef(Operands, NumOperands), CurrentGeneration);
		#endif
		}

public:		public:
size_t getNumOperands() const { return NumOperands; }		size_t getNumOperands() const { return NumOperands; }
const SCEV *getOperand(unsigned i) const {		const SCEV *getOperand(unsigned i) const {
assert(i < NumOperands && "Operand index out of range!");		assert(i < NumOperands && "Operand index out of range!");
return Operands[i];		return Operands[i];
}		}

Show All 32 Lines	static inline bool classof(const SCEV *S) {
S->getSCEVType() == scAddRecExpr;		S->getSCEVType() == scAddRecExpr;
}		}
};		};

/// This node is the base class for n'ary commutative operators.		/// This node is the base class for n'ary commutative operators.
class SCEVCommutativeExpr : public SCEVNAryExpr {		class SCEVCommutativeExpr : public SCEVNAryExpr {
protected:		protected:
SCEVCommutativeExpr(const FoldingSetNodeIDRef ID,		SCEVCommutativeExpr(const FoldingSetNodeIDRef ID,
enum SCEVTypes T, const SCEV const O, size_t N)		enum SCEVTypes T, const SCEV const O, size_t N,
: SCEVNAryExpr(ID, T, O, N) {}		uint64_t CurrentGeneration)
		: SCEVNAryExpr(ID, T, O, N, CurrentGeneration) {}

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scAddExpr \|\|		return S->getSCEVType() == scAddExpr \|\|
S->getSCEVType() == scMulExpr \|\|		S->getSCEVType() == scMulExpr \|\|
S->getSCEVType() == scSMaxExpr \|\|		S->getSCEVType() == scSMaxExpr \|\|
S->getSCEVType() == scUMaxExpr;		S->getSCEVType() == scUMaxExpr;
}		}

/// Set flags for a non-recurrence without clearing previously set flags.		/// Set flags for a non-recurrence without clearing previously set flags.
void setNoWrapFlags(NoWrapFlags Flags) {		void setNoWrapFlags(NoWrapFlags Flags) {
SubclassData \|= Flags;		SubclassData \|= Flags;
}		}
};		};


/// This node represents an addition of some number of SCEVs.		/// This node represents an addition of some number of SCEVs.
class SCEVAddExpr : public SCEVCommutativeExpr {		class SCEVAddExpr : public SCEVCommutativeExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVAddExpr(const FoldingSetNodeIDRef ID,		SCEVAddExpr(const FoldingSetNodeIDRef ID,
const SCEV const O, size_t N)		const SCEV const O, size_t N,
: SCEVCommutativeExpr(ID, scAddExpr, O, N) {		uint64_t CurrentGeneration)
		: SCEVCommutativeExpr(ID, scAddExpr, O, N, CurrentGeneration) {
}		}

public:		public:
Type *getType() const {		Type *getType() const {
// Use the type of the last operand, which is likely to be a pointer		// Use the type of the last operand, which is likely to be a pointer
// type, if there is one. This doesn't usually matter, but it can help		// type, if there is one. This doesn't usually matter, but it can help
// reduce casts when the expressions are expanded.		// reduce casts when the expressions are expanded.
return getOperand(getNumOperands() - 1)->getType();		return getOperand(getNumOperands() - 1)->getType();
}		}

/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scAddExpr;		return S->getSCEVType() == scAddExpr;
}		}
};		};


/// This node represents multiplication of some number of SCEVs.		/// This node represents multiplication of some number of SCEVs.
class SCEVMulExpr : public SCEVCommutativeExpr {		class SCEVMulExpr : public SCEVCommutativeExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVMulExpr(const FoldingSetNodeIDRef ID,		SCEVMulExpr(const FoldingSetNodeIDRef ID,
const SCEV const O, size_t N)		const SCEV const O, size_t N,
: SCEVCommutativeExpr(ID, scMulExpr, O, N) {		uint64_t CurrentGeneration)
		: SCEVCommutativeExpr(ID, scMulExpr, O, N, CurrentGeneration) {
}		}

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scMulExpr;		return S->getSCEVType() == scMulExpr;
}		}
};		};


/// This class represents a binary unsigned division operation.		/// This class represents a binary unsigned division operation.
class SCEVUDivExpr : public SCEV {		class SCEVUDivExpr : public SCEVExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

const SCEV *LHS;		const SCEV *LHS;
const SCEV *RHS;		const SCEV *RHS;
SCEVUDivExpr(const FoldingSetNodeIDRef ID, const SCEV lhs, const SCEV rhs)		SCEVUDivExpr(const FoldingSetNodeIDRef ID, const SCEV lhs, const SCEV rhs,
: SCEV(ID, scUDivExpr), LHS(lhs), RHS(rhs) {}		uint64_t CurrentGeneration)
		: SCEVExpr(ID, scUDivExpr, CurrentGeneration), LHS(lhs), RHS(rhs) {
		#ifndef NDEBUG
		verifyGeneration({LHS, RHS}, CurrentGeneration);
		#endif
		}

public:		public:
const SCEV *getLHS() const { return LHS; }		const SCEV *getLHS() const { return LHS; }
const SCEV *getRHS() const { return RHS; }		const SCEV *getRHS() const { return RHS; }

Type *getType() const {		Type *getType() const {
// In most cases the types of LHS and RHS will be the same, but in some		// In most cases the types of LHS and RHS will be the same, but in some
// crazy cases one or the other may be a pointer. ScalarEvolution doesn't		// crazy cases one or the other may be a pointer. ScalarEvolution doesn't
Show All 19 Lines	#endif
/// All operands of an AddRec are required to be loop invariant.		/// All operands of an AddRec are required to be loop invariant.
///		///
class SCEVAddRecExpr : public SCEVNAryExpr {		class SCEVAddRecExpr : public SCEVNAryExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

const Loop *L;		const Loop *L;

SCEVAddRecExpr(const FoldingSetNodeIDRef ID,		SCEVAddRecExpr(const FoldingSetNodeIDRef ID,
const SCEV const O, size_t N, const Loop *l)		const SCEV const O, size_t N, const Loop *l,
: SCEVNAryExpr(ID, scAddRecExpr, O, N), L(l) {}		uint64_t CurrentGeneration)
		: SCEVNAryExpr(ID, scAddRecExpr, O, N, CurrentGeneration), L(l) {}

public:		public:
const SCEV *getStart() const { return Operands[0]; }		const SCEV *getStart() const { return Operands[0]; }
const Loop *getLoop() const { return L; }		const Loop *getLoop() const { return L; }

/// Constructs and returns the recurrence indicating how much this		/// Constructs and returns the recurrence indicating how much this
/// expression steps by. If this is a polynomial of degree N, it		/// expression steps by. If this is a polynomial of degree N, it
/// returns a chrec of degree N-1. We cannot determine whether		/// returns a chrec of degree N-1. We cannot determine whether
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	public:
}		}
};		};

/// This class represents a signed maximum selection.		/// This class represents a signed maximum selection.
class SCEVSMaxExpr : public SCEVCommutativeExpr {		class SCEVSMaxExpr : public SCEVCommutativeExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVSMaxExpr(const FoldingSetNodeIDRef ID,		SCEVSMaxExpr(const FoldingSetNodeIDRef ID,
const SCEV const O, size_t N)		const SCEV const O, size_t N,
: SCEVCommutativeExpr(ID, scSMaxExpr, O, N) {		uint64_t CurrentGeneration)
		: SCEVCommutativeExpr(ID, scSMaxExpr, O, N, CurrentGeneration) {
// Max never overflows.		// Max never overflows.
setNoWrapFlags((NoWrapFlags)(FlagNUW \| FlagNSW));		setNoWrapFlags((NoWrapFlags)(FlagNUW \| FlagNSW));
}		}

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scSMaxExpr;		return S->getSCEVType() == scSMaxExpr;
}		}
};		};


/// This class represents an unsigned maximum selection.		/// This class represents an unsigned maximum selection.
class SCEVUMaxExpr : public SCEVCommutativeExpr {		class SCEVUMaxExpr : public SCEVCommutativeExpr {
friend class ScalarEvolution;		friend class ScalarEvolution;

SCEVUMaxExpr(const FoldingSetNodeIDRef ID,		SCEVUMaxExpr(const FoldingSetNodeIDRef ID,
const SCEV const O, size_t N)		const SCEV const O, size_t N,
: SCEVCommutativeExpr(ID, scUMaxExpr, O, N) {		uint64_t CurrentGeneration)
		: SCEVCommutativeExpr(ID, scUMaxExpr, O, N, CurrentGeneration) {
// Max never overflows.		// Max never overflows.
setNoWrapFlags((NoWrapFlags)(FlagNUW \| FlagNSW));		setNoWrapFlags((NoWrapFlags)(FlagNUW \| FlagNSW));
}		}

public:		public:
/// Methods for support type inquiry through isa, cast, and dyn_cast:		/// Methods for support type inquiry through isa, cast, and dyn_cast:
static inline bool classof(const SCEV *S) {		static inline bool classof(const SCEV *S) {
return S->getSCEVType() == scUMaxExpr;		return S->getSCEVType() == scUMaxExpr;
▲ Show 20 Lines • Show All 343 Lines • Show Last 20 Lines

lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 332 Lines • ▼ Show 20 Lines	bool SCEVCouldNotCompute::classof(const SCEV *S) {
return S->getSCEVType() == scCouldNotCompute;		return S->getSCEVType() == scCouldNotCompute;
}		}

const SCEV ScalarEvolution::getConstant(ConstantInt V) {		const SCEV ScalarEvolution::getConstant(ConstantInt V) {
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scConstant);		ID.AddInteger(scConstant);
ID.AddPointer(V);		ID.AddPointer(V);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
SCEV *S = new (SCEVAllocator) SCEVConstant(ID.Intern(SCEVAllocator), V);		SCEV *S = new (SCEVAllocator) SCEVConstant(ID.Intern(SCEVAllocator), V);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

const SCEV *ScalarEvolution::getConstant(const APInt &Val) {		const SCEV *ScalarEvolution::getConstant(const APInt &Val) {
return getConstant(ConstantInt::get(getContext(), Val));		return getConstant(ConstantInt::get(getContext(), Val));
}		}

const SCEV *		const SCEV *
ScalarEvolution::getConstant(Type *Ty, uint64_t V, bool isSigned) {		ScalarEvolution::getConstant(Type *Ty, uint64_t V, bool isSigned) {
IntegerType *ITy = cast<IntegerType>(getEffectiveSCEVType(Ty));		IntegerType *ITy = cast<IntegerType>(getEffectiveSCEVType(Ty));
return getConstant(ConstantInt::get(ITy, V, isSigned));		return getConstant(ConstantInt::get(ITy, V, isSigned));
}		}

SCEVCastExpr::SCEVCastExpr(const FoldingSetNodeIDRef ID,		#ifndef NDEBUG
unsigned SCEVTy, const SCEV op, Type ty)		void SCEVExpr::verifyGeneration(ArrayRef<const SCEV *> SCEVs,
: SCEV(ID, SCEVTy), Op(op), Ty(ty) {}		uint64_t CurrentGeneration) {
		for (const SCEV *S : SCEVs) {
SCEVTruncateExpr::SCEVTruncateExpr(const FoldingSetNodeIDRef ID,		switch (S->getSCEVType()) {
const SCEV op, Type ty)		case scUnknown:
: SCEVCastExpr(ID, scTruncate, op, ty) {		case scConstant:
		// No generation.
		continue;

		case scTruncate:
		case scZeroExtend:
		case scSignExtend:
		case scAddExpr:
		case scMulExpr:
		case scSMaxExpr:
		case scUMaxExpr:
		case scAddRecExpr:
		case scUDivExpr:
		assert(cast<SCEVExpr>(S)->getGeneration() == CurrentGeneration &&
		"Unexpected generation!");
		continue;

		case scCouldNotCompute:
		llvm_unreachable("Attempt to use a SCEVCouldNotCompute object!");
		}
		llvm_unreachable("Unknown SCEV kind!");
		}
		}
		#endif

		SCEVCastExpr::SCEVCastExpr(const FoldingSetNodeIDRef ID, unsigned SCEVTy,
		const SCEV op, Type ty, uint64_t CurrentGeneration)
		: SCEVExpr(ID, SCEVTy, CurrentGeneration), Op(op), Ty(ty) {
		#ifndef NDEBUG
		verifyGeneration({Op}, CurrentGeneration);
		#endif
		}

		SCEVTruncateExpr::SCEVTruncateExpr(const FoldingSetNodeIDRef ID, const SCEV *op,
		Type *ty, uint64_t CurrentGeneration)
		: SCEVCastExpr(ID, scTruncate, op, ty, CurrentGeneration) {
assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&		assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&
(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&		(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&
"Cannot truncate non-integer value!");		"Cannot truncate non-integer value!");
}		}

SCEVZeroExtendExpr::SCEVZeroExtendExpr(const FoldingSetNodeIDRef ID,		SCEVZeroExtendExpr::SCEVZeroExtendExpr(const FoldingSetNodeIDRef ID,
const SCEV op, Type ty)		const SCEV op, Type ty,
: SCEVCastExpr(ID, scZeroExtend, op, ty) {		uint64_t CurrentGeneration)
		: SCEVCastExpr(ID, scZeroExtend, op, ty, CurrentGeneration) {
assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&		assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&
(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&		(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&
"Cannot zero extend non-integer value!");		"Cannot zero extend non-integer value!");
}		}

SCEVSignExtendExpr::SCEVSignExtendExpr(const FoldingSetNodeIDRef ID,		SCEVSignExtendExpr::SCEVSignExtendExpr(const FoldingSetNodeIDRef ID,
const SCEV op, Type ty)		const SCEV op, Type ty,
: SCEVCastExpr(ID, scSignExtend, op, ty) {		uint64_t CurrentGeneration)
		: SCEVCastExpr(ID, scSignExtend, op, ty, CurrentGeneration) {
assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&		assert((Op->getType()->isIntegerTy() \|\| Op->getType()->isPointerTy()) &&
(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&		(Ty->isIntegerTy() \|\| Ty->isPointerTy()) &&
"Cannot sign extend non-integer value!");		"Cannot sign extend non-integer value!");
}		}

void SCEVUnknown::deleted() {		void SCEVUnknown::deleted() {
// Clear this SCEVUnknown from various maps.		// Clear this SCEVUnknown from various maps.
SE->forgetMemoizedResults(this);		SE->forgetMemoizedResults(this);

// Remove this SCEVUnknown from the uniquing map.		// Remove this SCEVUnknown from the uniquing map.
SE->UniqueSCEVs.RemoveNode(this);		SE->UniqueSCEVs.RemoveNode(this);

		// Increment the generation so that users will be updated when next queried.
		SE->incrementSCEVGeneration();

// Release the value.		// Release the value.
setValPtr(nullptr);		setValPtr(nullptr);
}		}

void SCEVUnknown::allUsesReplacedWith(Value *New) {		void SCEVUnknown::allUsesReplacedWith(Value *New) {
// Clear this SCEVUnknown from various maps.		// Clear this SCEVUnknown from various maps.
SE->forgetMemoizedResults(this);		SE->forgetMemoizedResults(this);

▲ Show 20 Lines • Show All 757 Lines • ▼ Show 20 Lines	assert(isSCEVable(Ty) &&
"This is not a conversion to a SCEVable type!");		"This is not a conversion to a SCEVable type!");
Ty = getEffectiveSCEVType(Ty);		Ty = getEffectiveSCEVType(Ty);

FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scTruncate);		ID.AddInteger(scTruncate);
ID.AddPointer(Op);		ID.AddPointer(Op);
ID.AddPointer(Ty);		ID.AddPointer(Ty);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;

// Fold if the operand is constant.		// Fold if the operand is constant.
if (const SCEVConstant *SC = dyn_cast<SCEVConstant>(Op))		if (const SCEVConstant *SC = dyn_cast<SCEVConstant>(Op))
return getConstant(		return getConstant(
cast<ConstantInt>(ConstantExpr::getTrunc(SC->getValue(), Ty)));		cast<ConstantInt>(ConstantExpr::getTrunc(SC->getValue(), Ty)));

// trunc(trunc(x)) --> trunc(x)		// trunc(trunc(x)) --> trunc(x)
if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op))		if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op))
▲ Show 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	for (const SCEV *Op : AddRec->operands())
Operands.push_back(getTruncateExpr(Op, Ty));		Operands.push_back(getTruncateExpr(Op, Ty));
return getAddRecExpr(Operands, AddRec->getLoop(), SCEV::FlagAnyWrap);		return getAddRecExpr(Operands, AddRec->getLoop(), SCEV::FlagAnyWrap);
}		}

// The cast wasn't folded; create an explicit cast node. We can reuse		// The cast wasn't folded; create an explicit cast node. We can reuse
// the existing insert position since if we get here, we won't have		// the existing insert position since if we get here, we won't have
// made any changes which would invalidate it.		// made any changes which would invalidate it.
SCEV *S = new (SCEVAllocator) SCEVTruncateExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator) SCEVTruncateExpr(ID.Intern(SCEVAllocator),
Op, Ty);		Op, Ty, Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

// Get the limit of a recurrence such that incrementing by Step cannot cause		// Get the limit of a recurrence such that incrementing by Step cannot cause
// signed overflow as long as the value of the recurrence within the		// signed overflow as long as the value of the recurrence within the
// loop does not exceed this limit before incrementing.		// loop does not exceed this limit before incrementing.
static const SCEV getSignedOverflowLimitForStep(const SCEV Step,		static const SCEV getSignedOverflowLimitForStep(const SCEV Step,
▲ Show 20 Lines • Show All 226 Lines • ▼ Show 20 Lines	for (unsigned Delta : {-2, -1, 1, 2}) {
const SCEV *PreStart = getConstant(StartAI - Delta);		const SCEV *PreStart = getConstant(StartAI - Delta);

FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scAddRecExpr);		ID.AddInteger(scAddRecExpr);
ID.AddPointer(PreStart);		ID.AddPointer(PreStart);
ID.AddPointer(Step);		ID.AddPointer(Step);
ID.AddPointer(L);		ID.AddPointer(L);
void *IP = nullptr;		void *IP = nullptr;
const auto *PreAR =		const auto PreAR = static_cast<SCEVAddRecExpr >(lookupUniqueSCEV(ID, IP));
static_cast<SCEVAddRecExpr *>(UniqueSCEVs.FindNodeOrInsertPos(ID, IP));

// Give up if we don't already have the add recurrence we need because		// Give up if we don't already have the add recurrence we need because
// actually constructing an add recurrence is relatively expensive.		// actually constructing an add recurrence is relatively expensive.
if (PreAR && PreAR->getNoWrapFlags(WrapType)) { // proves (2)		if (PreAR && PreAR->getNoWrapFlags(WrapType)) { // proves (2)
const SCEV *DeltaS = getConstant(StartC->getType(), Delta);		const SCEV *DeltaS = getConstant(StartC->getType(), Delta);
ICmpInst::Predicate Pred = ICmpInst::BAD_ICMP_PREDICATE;		ICmpInst::Predicate Pred = ICmpInst::BAD_ICMP_PREDICATE;
const SCEV *Limit = ExtendOpTraits<ExtendOpTy>::getOverflowLimitForStep(		const SCEV *Limit = ExtendOpTraits<ExtendOpTy>::getOverflowLimitForStep(
DeltaS, &Pred, this);		DeltaS, &Pred, this);
Show All 24 Lines	const SCEV ScalarEvolution::getZeroExtendExpr(const SCEV Op,

// Before doing any expensive analysis, check to see if we've already		// Before doing any expensive analysis, check to see if we've already
// computed a SCEV for this Op and Ty.		// computed a SCEV for this Op and Ty.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scZeroExtend);		ID.AddInteger(scZeroExtend);
ID.AddPointer(Op);		ID.AddPointer(Op);
ID.AddPointer(Ty);		ID.AddPointer(Ty);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;

// zext(trunc(x)) --> zext(x) or x or trunc(x)		// zext(trunc(x)) --> zext(x) or x or trunc(x)
if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op)) {		if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op)) {
// It's possible the bits taken off by the truncate were all zero bits. If		// It's possible the bits taken off by the truncate were all zero bits. If
// so, we should be able to simplify this further.		// so, we should be able to simplify this further.
const SCEV *X = ST->getOperand();		const SCEV *X = ST->getOperand();
ConstantRange CR = getUnsignedRange(X);		ConstantRange CR = getUnsignedRange(X);
unsigned TruncBits = getTypeSizeInBits(ST->getType());		unsigned TruncBits = getTypeSizeInBits(ST->getType());
▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines	if (SA->hasNoUnsignedWrap()) {
for (const auto *Op : SA->operands())		for (const auto *Op : SA->operands())
Ops.push_back(getZeroExtendExpr(Op, Ty));		Ops.push_back(getZeroExtendExpr(Op, Ty));
return getAddExpr(Ops, SCEV::FlagNUW);		return getAddExpr(Ops, SCEV::FlagNUW);
}		}
}		}

// The cast wasn't folded; create an explicit cast node.		// The cast wasn't folded; create an explicit cast node.
// Recompute the insert position, as it may have been invalidated.		// Recompute the insert position, as it may have been invalidated.
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
SCEV *S = new (SCEVAllocator) SCEVZeroExtendExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator) SCEVZeroExtendExpr(ID.Intern(SCEVAllocator),
Op, Ty);		Op, Ty, Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

const SCEV ScalarEvolution::getSignExtendExpr(const SCEV Op,		const SCEV ScalarEvolution::getSignExtendExpr(const SCEV Op,
Type *Ty) {		Type *Ty) {
assert(getTypeSizeInBits(Op->getType()) < getTypeSizeInBits(Ty) &&		assert(getTypeSizeInBits(Op->getType()) < getTypeSizeInBits(Ty) &&
"This is not an extending conversion!");		"This is not an extending conversion!");
Show All 16 Lines	const SCEV ScalarEvolution::getSignExtendExpr(const SCEV Op,

// Before doing any expensive analysis, check to see if we've already		// Before doing any expensive analysis, check to see if we've already
// computed a SCEV for this Op and Ty.		// computed a SCEV for this Op and Ty.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scSignExtend);		ID.AddInteger(scSignExtend);
ID.AddPointer(Op);		ID.AddPointer(Op);
ID.AddPointer(Ty);		ID.AddPointer(Ty);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;

// sext(trunc(x)) --> sext(x) or x or trunc(x)		// sext(trunc(x)) --> sext(x) or x or trunc(x)
if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op)) {		if (const SCEVTruncateExpr *ST = dyn_cast<SCEVTruncateExpr>(Op)) {
// It's possible the bits taken off by the truncate were all sign bits. If		// It's possible the bits taken off by the truncate were all sign bits. If
// so, we should be able to simplify this further.		// so, we should be able to simplify this further.
const SCEV *X = ST->getOperand();		const SCEV *X = ST->getOperand();
ConstantRange CR = getSignedRange(X);		ConstantRange CR = getSignedRange(X);
unsigned TruncBits = getTypeSizeInBits(ST->getType());		unsigned TruncBits = getTypeSizeInBits(ST->getType());
▲ Show 20 Lines • Show All 175 Lines • ▼ Show 20 Lines	const SCEV ScalarEvolution::getSignExtendExpr(const SCEV Op,

// If the input value is provably positive and we could not simplify		// If the input value is provably positive and we could not simplify
// away the sext build a zext instead.		// away the sext build a zext instead.
if (isKnownNonNegative(Op))		if (isKnownNonNegative(Op))
return getZeroExtendExpr(Op, Ty);		return getZeroExtendExpr(Op, Ty);

// The cast wasn't folded; create an explicit cast node.		// The cast wasn't folded; create an explicit cast node.
// Recompute the insert position, as it may have been invalidated.		// Recompute the insert position, as it may have been invalidated.
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
SCEV *S = new (SCEVAllocator) SCEVSignExtendExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator) SCEVSignExtendExpr(ID.Intern(SCEVAllocator),
Op, Ty);		Op, Ty, Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

/// getAnyExtendExpr - Return a SCEV for the given operand extended with		/// getAnyExtendExpr - Return a SCEV for the given operand extended with
/// unspecified bits out to the given type.		/// unspecified bits out to the given type.
///		///
const SCEV ScalarEvolution::getAnyExtendExpr(const SCEV Op,		const SCEV ScalarEvolution::getAnyExtendExpr(const SCEV Op,
▲ Show 20 Lines • Show All 548 Lines • ▼ Show 20 Lines
const SCEV *		const SCEV *
ScalarEvolution::getOrCreateAddExpr(SmallVectorImpl<const SCEV *> &Ops,		ScalarEvolution::getOrCreateAddExpr(SmallVectorImpl<const SCEV *> &Ops,
SCEV::NoWrapFlags Flags) {		SCEV::NoWrapFlags Flags) {
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scAddExpr);		ID.AddInteger(scAddExpr);
for (unsigned i = 0, e = Ops.size(); i != e; ++i)		for (unsigned i = 0, e = Ops.size(); i != e; ++i)
ID.AddPointer(Ops[i]);		ID.AddPointer(Ops[i]);
void *IP = nullptr;		void *IP = nullptr;
SCEVAddExpr *S =		auto S = static_cast<SCEVAddExpr >(lookupUniqueSCEV(ID, IP));
static_cast<SCEVAddExpr *>(UniqueSCEVs.FindNodeOrInsertPos(ID, IP));
if (!S) {		if (!S) {
const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());		const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());
std::uninitialized_copy(Ops.begin(), Ops.end(), O);		std::uninitialized_copy(Ops.begin(), Ops.end(), O);
S = new (SCEVAllocator)		S = new (SCEVAllocator)
SCEVAddExpr(ID.Intern(SCEVAllocator), O, Ops.size());		SCEVAddExpr(ID.Intern(SCEVAllocator), O, Ops.size(), Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
}		}
S->setNoWrapFlags(Flags);		S->setNoWrapFlags(Flags);
return S;		return S;
}		}

static uint64_t umul_ov(uint64_t i, uint64_t j, bool &Overflow) {		static uint64_t umul_ov(uint64_t i, uint64_t j, bool &Overflow) {
uint64_t k = i*j;		uint64_t k = i*j;
▲ Show 20 Lines • Show All 269 Lines • ▼ Show 20 Lines	#endif

// Okay, it looks like we really DO need an mul expr. Check to see if we		// Okay, it looks like we really DO need an mul expr. Check to see if we
// already have one, otherwise create a new one.		// already have one, otherwise create a new one.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scMulExpr);		ID.AddInteger(scMulExpr);
for (unsigned i = 0, e = Ops.size(); i != e; ++i)		for (unsigned i = 0, e = Ops.size(); i != e; ++i)
ID.AddPointer(Ops[i]);		ID.AddPointer(Ops[i]);
void *IP = nullptr;		void *IP = nullptr;
SCEVMulExpr *S =		auto S = static_cast<SCEVMulExpr >(lookupUniqueSCEV(ID, IP));
static_cast<SCEVMulExpr *>(UniqueSCEVs.FindNodeOrInsertPos(ID, IP));
if (!S) {		if (!S) {
const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());		const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());
std::uninitialized_copy(Ops.begin(), Ops.end(), O);		std::uninitialized_copy(Ops.begin(), Ops.end(), O);
S = new (SCEVAllocator) SCEVMulExpr(ID.Intern(SCEVAllocator),		S = new (SCEVAllocator) SCEVMulExpr(ID.Intern(SCEVAllocator),
O, Ops.size());		O, Ops.size(), Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
}		}
S->setNoWrapFlags(Flags);		S->setNoWrapFlags(Flags);
return S;		return S;
}		}

/// Get a canonical unsigned division expression, or something simpler if		/// Get a canonical unsigned division expression, or something simpler if
/// possible.		/// possible.
▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines	if (const SCEVConstant *RHSC = dyn_cast<SCEVConstant>(RHS)) {
}		}
}		}

FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scUDivExpr);		ID.AddInteger(scUDivExpr);
ID.AddPointer(LHS);		ID.AddPointer(LHS);
ID.AddPointer(RHS);		ID.AddPointer(RHS);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
SCEV *S = new (SCEVAllocator) SCEVUDivExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator) SCEVUDivExpr(ID.Intern(SCEVAllocator),
LHS, RHS);		LHS, RHS, Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

static const APInt gcd(const SCEVConstant C1, const SCEVConstant C2) {		static const APInt gcd(const SCEVConstant C1, const SCEVConstant C2) {
APInt A = C1->getAPInt().abs();		APInt A = C1->getAPInt().abs();
APInt B = C2->getAPInt().abs();		APInt B = C2->getAPInt().abs();
uint32_t ABW = A.getBitWidth();		uint32_t ABW = A.getBitWidth();
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	#endif
// Okay, it looks like we really DO need an addrec expr. Check to see if we		// Okay, it looks like we really DO need an addrec expr. Check to see if we
// already have one, otherwise create a new one.		// already have one, otherwise create a new one.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scAddRecExpr);		ID.AddInteger(scAddRecExpr);
for (unsigned i = 0, e = Operands.size(); i != e; ++i)		for (unsigned i = 0, e = Operands.size(); i != e; ++i)
ID.AddPointer(Operands[i]);		ID.AddPointer(Operands[i]);
ID.AddPointer(L);		ID.AddPointer(L);
void *IP = nullptr;		void *IP = nullptr;
SCEVAddRecExpr *S =		auto S = static_cast<SCEVAddRecExpr >(lookupUniqueSCEV(ID, IP));
static_cast<SCEVAddRecExpr *>(UniqueSCEVs.FindNodeOrInsertPos(ID, IP));
if (!S) {		if (!S) {
const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Operands.size());		const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Operands.size());
std::uninitialized_copy(Operands.begin(), Operands.end(), O);		std::uninitialized_copy(Operands.begin(), Operands.end(), O);
S = new (SCEVAllocator) SCEVAddRecExpr(ID.Intern(SCEVAllocator),		S = new (SCEVAllocator) SCEVAddRecExpr(ID.Intern(SCEVAllocator),
O, Operands.size(), L);		O, Operands.size(), L, Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
}		}
S->setNoWrapFlags(Flags);		S->setNoWrapFlags(Flags);
return S;		return S;
}		}

const SCEV *		const SCEV *
ScalarEvolution::getGEPExpr(GEPOperator *GEP,		ScalarEvolution::getGEPExpr(GEPOperator *GEP,
▲ Show 20 Lines • Show All 134 Lines • ▼ Show 20 Lines	#endif

// Okay, it looks like we really DO need an smax expr. Check to see if we		// Okay, it looks like we really DO need an smax expr. Check to see if we
// already have one, otherwise create a new one.		// already have one, otherwise create a new one.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scSMaxExpr);		ID.AddInteger(scSMaxExpr);
for (unsigned i = 0, e = Ops.size(); i != e; ++i)		for (unsigned i = 0, e = Ops.size(); i != e; ++i)
ID.AddPointer(Ops[i]);		ID.AddPointer(Ops[i]);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());		const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());
std::uninitialized_copy(Ops.begin(), Ops.end(), O);		std::uninitialized_copy(Ops.begin(), Ops.end(), O);
SCEV *S = new (SCEVAllocator) SCEVSMaxExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator) SCEVSMaxExpr(ID.Intern(SCEVAllocator),
O, Ops.size());		O, Ops.size(), Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

const SCEV ScalarEvolution::getUMaxExpr(const SCEV LHS,		const SCEV ScalarEvolution::getUMaxExpr(const SCEV LHS,
const SCEV *RHS) {		const SCEV *RHS) {
SmallVector<const SCEV *, 2> Ops = {LHS, RHS};		SmallVector<const SCEV *, 2> Ops = {LHS, RHS};
return getUMaxExpr(Ops);		return getUMaxExpr(Ops);
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	#endif

// Okay, it looks like we really DO need a umax expr. Check to see if we		// Okay, it looks like we really DO need a umax expr. Check to see if we
// already have one, otherwise create a new one.		// already have one, otherwise create a new one.
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scUMaxExpr);		ID.AddInteger(scUMaxExpr);
for (unsigned i = 0, e = Ops.size(); i != e; ++i)		for (unsigned i = 0, e = Ops.size(); i != e; ++i)
ID.AddPointer(Ops[i]);		ID.AddPointer(Ops[i]);
void *IP = nullptr;		void *IP = nullptr;
if (const SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) return S;		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) return S;
const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());		const SCEV *O = SCEVAllocator.Allocate<const SCEV >(Ops.size());
std::uninitialized_copy(Ops.begin(), Ops.end(), O);		std::uninitialized_copy(Ops.begin(), Ops.end(), O);
SCEV *S = new (SCEVAllocator) SCEVUMaxExpr(ID.Intern(SCEVAllocator),		SCEV *S = new (SCEVAllocator)
O, Ops.size());		SCEVUMaxExpr(ID.Intern(SCEVAllocator), O, Ops.size(), Generation);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
return S;		return S;
}		}

const SCEV ScalarEvolution::getSMinExpr(const SCEV LHS,		const SCEV ScalarEvolution::getSMinExpr(const SCEV LHS,
const SCEV *RHS) {		const SCEV *RHS) {
// ~smax(~x, ~y) == smin(x, y).		// ~smax(~x, ~y) == smin(x, y).
return getNotSCEV(getSMaxExpr(getNotSCEV(LHS), getNotSCEV(RHS)));		return getNotSCEV(getSMaxExpr(getNotSCEV(LHS), getNotSCEV(RHS)));
Show All 27 Lines	const SCEV ScalarEvolution::getUnknown(Value V) {
// here. createSCEV only calls getUnknown after checking for all other		// here. createSCEV only calls getUnknown after checking for all other
// interesting possibilities, and any other code that calls getUnknown		// interesting possibilities, and any other code that calls getUnknown
// is doing so in order to hide a value from SCEV canonicalization.		// is doing so in order to hide a value from SCEV canonicalization.

FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scUnknown);		ID.AddInteger(scUnknown);
ID.AddPointer(V);		ID.AddPointer(V);
void *IP = nullptr;		void *IP = nullptr;
if (SCEV *S = UniqueSCEVs.FindNodeOrInsertPos(ID, IP)) {		if (const SCEV *S = lookupUniqueSCEV(ID, IP)) {
assert(cast<SCEVUnknown>(S)->getValue() == V &&		assert(cast<SCEVUnknown>(S)->getValue() == V &&
"Stale SCEVUnknown in uniquing map!");		"Stale SCEVUnknown in uniquing map!");
return S;		return S;
}		}
SCEV *S = new (SCEVAllocator) SCEVUnknown(ID.Intern(SCEVAllocator), V, this,		SCEV *S = new (SCEVAllocator) SCEVUnknown(ID.Intern(SCEVAllocator), V, this,
FirstUnknown);		FirstUnknown);
FirstUnknown = cast<SCEVUnknown>(S);		FirstUnknown = cast<SCEVUnknown>(S);
UniqueSCEVs.InsertNode(S, IP);		UniqueSCEVs.InsertNode(S, IP);
Show All 33 Lines	Type ScalarEvolution::getEffectiveSCEVType(Type Ty) const {
assert(Ty->isPointerTy() && "Unexpected non-pointer non-integer type!");		assert(Ty->isPointerTy() && "Unexpected non-pointer non-integer type!");
return getDataLayout().getIntPtrType(Ty);		return getDataLayout().getIntPtrType(Ty);
}		}

const SCEV *ScalarEvolution::getCouldNotCompute() {		const SCEV *ScalarEvolution::getCouldNotCompute() {
return CouldNotCompute.get();		return CouldNotCompute.get();
}		}

bool ScalarEvolution::checkValidity(const SCEV *S) const {		bool ScalarEvolution::validateSCEVSubgraph(const SCEV *S) {
bool ContainsNulls = SCEVExprContains(S, [](const SCEV *S) {		// We do a depth-first search of the SCEV expression DAG looking for
auto *SU = dyn_cast<SCEVUnknown>(S);		// potentially invalid expressions. Any expression with a generation older
return SU && SU->getValue() == nullptr;		// than the current one needs to be recursively validated. Once recursively
});		// validated, we can update the generation count to prune subsequent walks.
		SmallVector<std::pair<const SCEV *, size_t>, 8> Stack;
		size_t Index = 0;
		for (;;) {
		switch (S->getSCEVType()) {
		case scConstant:
		break;
		case scUnknown: {
		const auto *SU = cast<SCEVUnknown>(S);
		if (SU->getValue() == nullptr) {
		while (!Stack.empty()) {
		const SCEV *S = Stack.pop_back_val().first;
		if (SetVector<ValueOffsetPair> *SV = getSCEVValues(S))
		for (auto &VO : *SV)
		ValueExprMap.erase(VO.first);
		forgetMemoizedResults(S);
		}
		return false;
		}
		break;
		}
		case scTruncate:
		case scZeroExtend:
		case scSignExtend: {
		const auto *Cast = cast<SCEVCastExpr>(S);
		if (Index != 0) {
		// Finished visiting subgraph and it remained valid, update generation
		// and break.
		Cast->setGeneration(Generation);
		break;
		}
		// First visit, test and if necessary recurse.
		if (Cast->getGeneration() == Generation)
		break;

		Stack.push_back({S, Index + 1});
		S = Cast->getOperand();
		Index = 0;
		continue;
		}
		case scAddExpr:
		case scMulExpr:
		case scSMaxExpr:
		case scUMaxExpr:
		case scAddRecExpr: {
		const auto *NAry = cast<SCEVNAryExpr>(S);
		if (Index == NAry->getNumOperands()) {
		NAry->setGeneration(Generation);
		break;
		}
		if (NAry->getGeneration() == Generation)
		break;

		Stack.push_back({S, Index + 1});
		S = NAry->getOperand(Index);
		Index = 0;
		continue;
		}
		case scUDivExpr: {
		const auto *UDiv = cast<SCEVUDivExpr>(S);
		if (Index == 2) {
		UDiv->setGeneration(Generation);
		break;
		}

		if (Index == 0 && UDiv->getGeneration() == Generation)
		break;

return !ContainsNulls;		Stack.push_back({S, Index + 1});
		S = Index == 0 ? UDiv->getLHS() : UDiv->getRHS();
		Index = 0;
		continue;
		}
		case scCouldNotCompute:
		llvm_unreachable("Attempt to use a SCEVCouldNotCompute object!");
		default:
		llvm_unreachable("Unknown SCEV kind!");
		}

		if (Stack.empty())
		break;
		std::tie(S, Index) = Stack.pop_back_val();
		}

		return true;
}		}

bool ScalarEvolution::containsAddRecurrence(const SCEV *S) {		bool ScalarEvolution::containsAddRecurrence(const SCEV *S) {
HasRecMapType::iterator I = HasRecMap.find(S);		HasRecMapType::iterator I = HasRecMap.find(S);
if (I != HasRecMap.end())		if (I != HasRecMap.end())
return I->second;		return I->second;

bool FoundAddRec = SCEVExprContains(S, isa<SCEVAddRecExpr, const SCEV *>);		bool FoundAddRec = SCEVExprContains(S, isa<SCEVAddRecExpr, const SCEV *>);
Show All 31 Lines	if (VerifySCEVMap) {
// Check there is no dangling Value in the set returned.		// Check there is no dangling Value in the set returned.
for (const auto &VE : SI->second)		for (const auto &VE : SI->second)
assert(ValueExprMap.count(VE.first));		assert(ValueExprMap.count(VE.first));
}		}
#endif		#endif
return &SI->second;		return &SI->second;
}		}

		void ScalarEvolution::incrementSCEVGeneration() {
		// Note that we never wrap the generation count. This would require calling
		// this function 2^64 times which would require doing nothing else for a Unix
		// epoch. However, there is no semantic problem wrapping here as the
		// generation count works in the modular space rather than requiring an
		// ordering.
		assert(Generation != UINT64_MAX && "Likely memory corruption: the generation "
		"count has reached the max value!");
		++Generation;
		}

/// Erase Value from ValueExprMap and ExprValueMap. ValueExprMap.erase(V)		/// Erase Value from ValueExprMap and ExprValueMap. ValueExprMap.erase(V)
/// cannot be used separately. eraseValueFromMap should be used to remove		/// cannot be used separately. eraseValueFromMap should be used to remove
/// V from ValueExprMap and ExprValueMap at the same time.		/// V from ValueExprMap and ExprValueMap at the same time.
void ScalarEvolution::eraseValueFromMap(Value *V) {		void ScalarEvolution::eraseValueFromMap(Value *V) {
ValueExprMapType::iterator I = ValueExprMap.find_as(V);		ValueExprMapType::iterator I = ValueExprMap.find_as(V);
if (I != ValueExprMap.end()) {		if (I != ValueExprMap.end()) {
const SCEV *S = I->second;		const SCEV *S = I->second;
// Remove {V, 0} from the set of ExprValueMap[S]		// Remove {V, 0} from the set of ExprValueMap[S]
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
}		}

const SCEV ScalarEvolution::getExistingSCEV(Value V) {		const SCEV ScalarEvolution::getExistingSCEV(Value V) {
assert(isSCEVable(V->getType()) && "Value is not SCEVable!");		assert(isSCEVable(V->getType()) && "Value is not SCEVable!");

ValueExprMapType::iterator I = ValueExprMap.find_as(V);		ValueExprMapType::iterator I = ValueExprMap.find_as(V);
if (I != ValueExprMap.end()) {		if (I != ValueExprMap.end()) {
const SCEV *S = I->second;		const SCEV *S = I->second;
if (checkValidity(S))		if (validateSCEVSubgraph(S))
return S;		return S;
eraseValueFromMap(V);
forgetMemoizedResults(S);
}		}
return nullptr;		return nullptr;
}		}

/// Return a SCEV corresponding to -V = -1*V		/// Return a SCEV corresponding to -V = -1*V
///		///
const SCEV ScalarEvolution::getNegativeSCEV(const SCEV V,		const SCEV ScalarEvolution::getNegativeSCEV(const SCEV V,
SCEV::NoWrapFlags Flags) {		SCEV::NoWrapFlags Flags) {
▲ Show 20 Lines • Show All 5,929 Lines • ▼ Show 20 Lines

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ScalarEvolution Class Implementation		// ScalarEvolution Class Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

ScalarEvolution::ScalarEvolution(Function &F, TargetLibraryInfo &TLI,		ScalarEvolution::ScalarEvolution(Function &F, TargetLibraryInfo &TLI,
AssumptionCache &AC, DominatorTree &DT,		AssumptionCache &AC, DominatorTree &DT,
LoopInfo &LI)		LoopInfo &LI)
: F(F), TLI(TLI), AC(AC), DT(DT), LI(LI),		: F(F), TLI(TLI), AC(AC), DT(DT), LI(LI), Generation(0),
CouldNotCompute(new SCEVCouldNotCompute()),		CouldNotCompute(new SCEVCouldNotCompute()),
WalkingBEDominatingConds(false), ProvingSplitPredicate(false),		WalkingBEDominatingConds(false), ProvingSplitPredicate(false),
ValuesAtScopes(64), LoopDispositions(64), BlockDispositions(64),		ValuesAtScopes(64), LoopDispositions(64), BlockDispositions(64),
FirstUnknown(nullptr) {		FirstUnknown(nullptr) {

// To use guards for proving predicates, we need to scan every instruction in		// To use guards for proving predicates, we need to scan every instruction in
// relevant basic blocks, and not just terminators. Doing this is a waste of		// relevant basic blocks, and not just terminators. Doing this is a waste of
// time if the IR does not actually contain any calls to		// time if the IR does not actually contain any calls to
// @llvm.experimental.guard, so do a quick check and remember this beforehand.		// @llvm.experimental.guard, so do a quick check and remember this beforehand.
//		//
// This pessimizes the case where a pass that preserves ScalarEvolution wants		// This pessimizes the case where a pass that preserves ScalarEvolution wants
// to _add_ guards to the module when there weren't any before, and wants		// to _add_ guards to the module when there weren't any before, and wants
// ScalarEvolution to optimize based on those guards. For now we prefer to be		// ScalarEvolution to optimize based on those guards. For now we prefer to be
// efficient in lieu of being smart in that rather obscure case.		// efficient in lieu of being smart in that rather obscure case.

auto *GuardDecl = F.getParent()->getFunction(		auto *GuardDecl = F.getParent()->getFunction(
Intrinsic::getName(Intrinsic::experimental_guard));		Intrinsic::getName(Intrinsic::experimental_guard));
HasGuards = GuardDecl && !GuardDecl->use_empty();		HasGuards = GuardDecl && !GuardDecl->use_empty();
}		}

ScalarEvolution::ScalarEvolution(ScalarEvolution &&Arg)		ScalarEvolution::ScalarEvolution(ScalarEvolution &&Arg)
: F(Arg.F), HasGuards(Arg.HasGuards), TLI(Arg.TLI), AC(Arg.AC), DT(Arg.DT),		: F(Arg.F), HasGuards(Arg.HasGuards), TLI(Arg.TLI), AC(Arg.AC), DT(Arg.DT),
LI(Arg.LI), CouldNotCompute(std::move(Arg.CouldNotCompute)),		LI(Arg.LI), Generation(Arg.Generation),
		CouldNotCompute(std::move(Arg.CouldNotCompute)),
ValueExprMap(std::move(Arg.ValueExprMap)),		ValueExprMap(std::move(Arg.ValueExprMap)),
PendingLoopPredicates(std::move(Arg.PendingLoopPredicates)),		PendingLoopPredicates(std::move(Arg.PendingLoopPredicates)),
WalkingBEDominatingConds(false), ProvingSplitPredicate(false),		WalkingBEDominatingConds(false), ProvingSplitPredicate(false),
BackedgeTakenCounts(std::move(Arg.BackedgeTakenCounts)),		BackedgeTakenCounts(std::move(Arg.BackedgeTakenCounts)),
PredicatedBackedgeTakenCounts(		PredicatedBackedgeTakenCounts(
std::move(Arg.PredicatedBackedgeTakenCounts)),		std::move(Arg.PredicatedBackedgeTakenCounts)),
ConstantEvolutionLoopExitValue(		ConstantEvolutionLoopExitValue(
std::move(Arg.ConstantEvolutionLoopExitValue)),		std::move(Arg.ConstantEvolutionLoopExitValue)),
▲ Show 20 Lines • Show All 980 Lines • Show Last 20 Lines

lib/Transforms/Scalar/LoopRerollPass.cpp

Show First 20 Lines • Show All 359 Lines • ▼ Show 20 Lines	struct DAGRootTracker {
LoopControlIV(LoopCtrlIV) {}		LoopControlIV(LoopCtrlIV) {}

/// Stage 1: Find all the DAG roots for the induction variable.		/// Stage 1: Find all the DAG roots for the induction variable.
bool findRoots();		bool findRoots();
/// Stage 2: Validate if the found roots are valid.		/// Stage 2: Validate if the found roots are valid.
bool validate(ReductionTracker &Reductions);		bool validate(ReductionTracker &Reductions);
/// Stage 3: Assuming validate() returned true, perform the		/// Stage 3: Assuming validate() returned true, perform the
/// replacement.		/// replacement.
/// @param IterCount The maximum iteration count of L.		void replace();
void replace(const SCEV *IterCount);

protected:		protected:
typedef MapVector<Instruction*, BitVector> UsesTy;		typedef MapVector<Instruction*, BitVector> UsesTy;

void findRootsRecursive(Instruction *IVU,		void findRootsRecursive(Instruction *IVU,
SmallInstructionSet SubsumedInsts);		SmallInstructionSet SubsumedInsts);
bool findRootsBase(Instruction *IVU, SmallInstructionSet SubsumedInsts);		bool findRootsBase(Instruction *IVU, SmallInstructionSet SubsumedInsts);
bool collectPossibleRoots(Instruction *Base,		bool collectPossibleRoots(Instruction *Base,
Show All 13 Lines	protected:
UsesTy::iterator nextInstr(int Val, UsesTy &In,		UsesTy::iterator nextInstr(int Val, UsesTy &In,
const SmallInstructionSet &Exclude,		const SmallInstructionSet &Exclude,
UsesTy::iterator *StartI=nullptr);		UsesTy::iterator *StartI=nullptr);
bool isBaseInst(Instruction *I);		bool isBaseInst(Instruction *I);
bool isRootInst(Instruction *I);		bool isRootInst(Instruction *I);
bool instrDependsOn(Instruction *I,		bool instrDependsOn(Instruction *I,
UsesTy::iterator Start,		UsesTy::iterator Start,
UsesTy::iterator End);		UsesTy::iterator End);
void replaceIV(Instruction Inst, Instruction IV, const SCEV *IterCount);		void replaceIV(Instruction Inst, Instruction IV);
void updateNonLoopCtrlIncr();		void updateNonLoopCtrlIncr();

LoopReroll *Parent;		LoopReroll *Parent;

// Members of Parent, replicated here for brevity.		// Members of Parent, replicated here for brevity.
Loop *L;		Loop *L;
ScalarEvolution *SE;		ScalarEvolution *SE;
AliasAnalysis *AA;		AliasAnalysis *AA;
Show All 30 Lines	bool isCompareUsedByBranch(Instruction *I) {
return false;		return false;
return I->hasOneUse() && TI->getOperand(0) == I;		return I->hasOneUse() && TI->getOperand(0) == I;
};		};

bool isLoopControlIV(Loop L, Instruction IV);		bool isLoopControlIV(Loop L, Instruction IV);
void collectPossibleIVs(Loop *L, SmallInstructionVector &PossibleIVs);		void collectPossibleIVs(Loop *L, SmallInstructionVector &PossibleIVs);
void collectPossibleReductions(Loop *L,		void collectPossibleReductions(Loop *L,
ReductionTracker &Reductions);		ReductionTracker &Reductions);
bool reroll(Instruction IV, Loop L, BasicBlock Header, const SCEV IterCount,		bool reroll(Instruction IV, Loop L, BasicBlock *Header,
ReductionTracker &Reductions);		ReductionTracker &Reductions);
};		};
}		}

char LoopReroll::ID = 0;		char LoopReroll::ID = 0;
INITIALIZE_PASS_BEGIN(LoopReroll, "loop-reroll", "Reroll loops", false, false)		INITIALIZE_PASS_BEGIN(LoopReroll, "loop-reroll", "Reroll loops", false, false)
INITIALIZE_PASS_DEPENDENCY(LoopPass)		INITIALIZE_PASS_DEPENDENCY(LoopPass)
INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
▲ Show 20 Lines • Show All 957 Lines • ▼ Show 20 Lines	bool LoopReroll::DAGRootTracker::validate(ReductionTracker &Reductions) {
}		}

DEBUG(dbgs() << "LRR: Matched all iteration increments for " <<		DEBUG(dbgs() << "LRR: Matched all iteration increments for " <<
*IV << "\n");		*IV << "\n");

return true;		return true;
}		}

void LoopReroll::DAGRootTracker::replace(const SCEV *IterCount) {		void LoopReroll::DAGRootTracker::replace() {
BasicBlock *Header = L->getHeader();		BasicBlock *Header = L->getHeader();
// Remove instructions associated with non-base iterations.		// Remove instructions associated with non-base iterations.
for (BasicBlock::reverse_iterator J = Header->rbegin(), JE = Header->rend();		for (BasicBlock::reverse_iterator J = Header->rbegin(), JE = Header->rend();
J != JE;) {		J != JE;) {
unsigned I = Uses[&*J].find_first();		unsigned I = Uses[&*J].find_first();
if (I > 0 && I < IL_All) {		if (I > 0 && I < IL_All) {
DEBUG(dbgs() << "LRR: removing: " << *J << "\n");		DEBUG(dbgs() << "LRR: removing: " << *J << "\n");
J++->eraseFromParent();		J++->eraseFromParent();
continue;		continue;
}		}

++J;		++J;
}		}
		// Flush the cached information about this loop in SCEV as we've mutated it
		// heavily.
		SE->forgetLoop(L);

bool HasTwoIVs = LoopControlIV && LoopControlIV != IV;		bool HasTwoIVs = LoopControlIV && LoopControlIV != IV;

if (HasTwoIVs) {		if (HasTwoIVs) {
updateNonLoopCtrlIncr();		updateNonLoopCtrlIncr();
replaceIV(LoopControlIV, LoopControlIV, IterCount);		replaceIV(LoopControlIV, LoopControlIV);
} else		} else
// We need to create a new induction variable for each different BaseInst.		// We need to create a new induction variable for each different BaseInst.
for (auto &DRS : RootSets)		for (auto &DRS : RootSets)
// Insert the new induction variable.		// Insert the new induction variable.
replaceIV(DRS.BaseInst, IV, IterCount);		replaceIV(DRS.BaseInst, IV);

SimplifyInstructionsInBlock(Header, TLI);		SimplifyInstructionsInBlock(Header, TLI);
DeleteDeadPHIs(Header, TLI);		DeleteDeadPHIs(Header, TLI);
}		}

// For non-loop-control IVs, we only need to update the last increment		// For non-loop-control IVs, we only need to update the last increment
// with right amount, then we are done.		// with right amount, then we are done.
void LoopReroll::DAGRootTracker::updateNonLoopCtrlIncr() {		void LoopReroll::DAGRootTracker::updateNonLoopCtrlIncr() {
Show All 20 Lines	for (auto *LoopInc : LoopIncs) {
} else		} else
NewInc = SE->getUDivExpr(COp, ScaleSCEV);		NewInc = SE->getUDivExpr(COp, ScaleSCEV);

LoopInc->setOperand(1, dyn_cast<SCEVConstant>(NewInc)->getValue());		LoopInc->setOperand(1, dyn_cast<SCEVConstant>(NewInc)->getValue());
}		}
}		}

void LoopReroll::DAGRootTracker::replaceIV(Instruction *Inst,		void LoopReroll::DAGRootTracker::replaceIV(Instruction *Inst,
Instruction *InstIV,		Instruction *InstIV) {
const SCEV *IterCount) {
BasicBlock *Header = L->getHeader();		BasicBlock *Header = L->getHeader();
int64_t Inc = IVToIncMap[InstIV];		int64_t Inc = IVToIncMap[InstIV];
bool NeedNewIV = InstIV == LoopControlIV;		bool NeedNewIV = InstIV == LoopControlIV;
bool Negative = !NeedNewIV && Inc < 0;		bool Negative = !NeedNewIV && Inc < 0;

const SCEVAddRecExpr *RealIVSCEV = cast<SCEVAddRecExpr>(SE->getSCEV(Inst));		const SCEVAddRecExpr *RealIVSCEV = cast<SCEVAddRecExpr>(SE->getSCEV(Inst));
const SCEV *Start = RealIVSCEV->getStart();		const SCEV *Start = RealIVSCEV->getStart();

Show All 20 Lines	const SCEV *NewIVSCEV =

for (auto &KV : Uses)		for (auto &KV : Uses)
if (KV.second.find_first() == 0)		if (KV.second.find_first() == 0)
KV.first->replaceUsesOfWith(Inst, NewIV);		KV.first->replaceUsesOfWith(Inst, NewIV);

if (BranchInst *BI = dyn_cast<BranchInst>(Header->getTerminator())) {		if (BranchInst *BI = dyn_cast<BranchInst>(Header->getTerminator())) {
// FIXME: Why do we need this check?		// FIXME: Why do we need this check?
if (Uses[BI].find_first() == IL_All) {		if (Uses[BI].find_first() == IL_All) {
		const SCEV *LIBETC = SE->getBackedgeTakenCount(L);
		const SCEV *IterCount =
		SE->getAddExpr(LIBETC, SE->getOne(LIBETC->getType()));
const SCEV ICSCEV = RealIVSCEV->evaluateAtIteration(IterCount, SE);		const SCEV ICSCEV = RealIVSCEV->evaluateAtIteration(IterCount, SE);

if (NeedNewIV)		if (NeedNewIV)
ICSCEV = SE->getMulExpr(IterCount,		ICSCEV = SE->getMulExpr(IterCount,
SE->getConstant(IterCount->getType(), Scale));		SE->getConstant(IterCount->getType(), Scale));

// Iteration count SCEV minus or plus 1		// Iteration count SCEV minus or plus 1
const SCEV *MinusPlus1SCEV =		const SCEV *MinusPlus1SCEV =
▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines
// cannot reorder those side-effect-producing instructions, and rerolling		// cannot reorder those side-effect-producing instructions, and rerolling
// fails.		// fails.
//		//
// Finally, we make sure that all loop instructions are either loop increment		// Finally, we make sure that all loop instructions are either loop increment
// roots, belong to simple latch code, parts of validated reductions, part of		// roots, belong to simple latch code, parts of validated reductions, part of
// f(%iv) or part of some f(%iv.i). If all of that is true (and all reductions		// f(%iv) or part of some f(%iv.i). If all of that is true (and all reductions
// have been validated), then we reroll the loop.		// have been validated), then we reroll the loop.
bool LoopReroll::reroll(Instruction IV, Loop L, BasicBlock *Header,		bool LoopReroll::reroll(Instruction IV, Loop L, BasicBlock *Header,
const SCEV *IterCount,
ReductionTracker &Reductions) {		ReductionTracker &Reductions) {
DAGRootTracker DAGRoots(this, L, IV, SE, AA, TLI, DT, LI, PreserveLCSSA,		DAGRootTracker DAGRoots(this, L, IV, SE, AA, TLI, DT, LI, PreserveLCSSA,
IVToIncMap, LoopControlIV);		IVToIncMap, LoopControlIV);

if (!DAGRoots.findRoots())		if (!DAGRoots.findRoots())
return false;		return false;
DEBUG(dbgs() << "LRR: Found all root induction increments for: " <<		DEBUG(dbgs() << "LRR: Found all root induction increments for: " <<
*IV << "\n");		*IV << "\n");

if (!DAGRoots.validate(Reductions))		if (!DAGRoots.validate(Reductions))
return false;		return false;
if (!Reductions.validateSelected())		if (!Reductions.validateSelected())
return false;		return false;
// At this point, we've validated the rerolling, and we're committed to		// At this point, we've validated the rerolling, and we're committed to
// making changes!		// making changes!

Reductions.replaceSelected();		Reductions.replaceSelected();
DAGRoots.replace(IterCount);		DAGRoots.replace();

++NumRerolledLoops;		++NumRerolledLoops;
return true;		return true;
}		}

bool LoopReroll::runOnLoop(Loop *L, LPPassManager &LPM) {		bool LoopReroll::runOnLoop(Loop *L, LPPassManager &LPM) {
if (skipLoop(L))		if (skipLoop(L))
return false;		return false;
Show All 12 Lines	bool LoopReroll::runOnLoop(Loop *L, LPPassManager &LPM) {

// For now, we'll handle only single BB loops.		// For now, we'll handle only single BB loops.
if (L->getNumBlocks() > 1)		if (L->getNumBlocks() > 1)
return false;		return false;

if (!SE->hasLoopInvariantBackedgeTakenCount(L))		if (!SE->hasLoopInvariantBackedgeTakenCount(L))
return false;		return false;

const SCEV *LIBETC = SE->getBackedgeTakenCount(L);
const SCEV *IterCount = SE->getAddExpr(LIBETC, SE->getOne(LIBETC->getType()));
DEBUG(dbgs() << "\n Before Reroll:\n" << *(L->getHeader()) << "\n");		DEBUG(dbgs() << "\n Before Reroll:\n" << *(L->getHeader()) << "\n");
DEBUG(dbgs() << "LRR: iteration count = " << *IterCount << "\n");		DEBUG(dbgs() << "LRR: iteration count = "
		<< *SE->getAddExpr(
		SE->getBackedgeTakenCount(L),
		SE->getOne(SE->getBackedgeTakenCount(L)->getType()))
		<< "\n");

// First, we need to find the induction variable with respect to which we can		// First, we need to find the induction variable with respect to which we can
// reroll (there may be several possible options).		// reroll (there may be several possible options).
SmallInstructionVector PossibleIVs;		SmallInstructionVector PossibleIVs;
IVToIncMap.clear();		IVToIncMap.clear();
LoopControlIV = nullptr;		LoopControlIV = nullptr;
collectPossibleIVs(L, PossibleIVs);		collectPossibleIVs(L, PossibleIVs);

if (PossibleIVs.empty()) {		if (PossibleIVs.empty()) {
DEBUG(dbgs() << "LRR: No possible IVs found\n");		DEBUG(dbgs() << "LRR: No possible IVs found\n");
return false;		return false;
}		}

ReductionTracker Reductions;		ReductionTracker Reductions;
collectPossibleReductions(L, Reductions);		collectPossibleReductions(L, Reductions);
bool Changed = false;		bool Changed = false;

// For each possible IV, collect the associated possible set of 'root' nodes		// For each possible IV, collect the associated possible set of 'root' nodes
// (i+1, i+2, etc.).		// (i+1, i+2, etc.).
for (Instruction *PossibleIV : PossibleIVs)		for (Instruction *PossibleIV : PossibleIVs)
if (reroll(PossibleIV, L, Header, IterCount, Reductions)) {		if (reroll(PossibleIV, L, Header, Reductions)) {
Changed = true;		Changed = true;
break;		break;
}		}
DEBUG(dbgs() << "\n After Reroll:\n" << *(L->getHeader()) << "\n");		DEBUG(dbgs() << "\n After Reroll:\n" << *(L->getHeader()) << "\n");

// Trip count of L has changed so SE must be re-evaluated.		// Trip count of L has changed so SE must be re-evaluated.
if (Changed)		if (Changed)
SE->forgetLoop(L);		SE->forgetLoop(L);

return Changed;		return Changed;
}		}