This is an archive of the discontinued LLVM Phabricator instance.

Paths

- include/llvm/Transforms/Utils/LoopUtils.h
- lib/Transforms/Scalar/LICM.cpp
- test/Transforms/LICM/guards.ll
- test/Transforms/LICM/hoist-mustexec.ll
- test/Transforms/LICM/hoist-nounwind.ll
- test/Transforms/LICM/preheader-safe.ll

Differential D50377

[LICM] Use ICFLoopSafetyInfo in LICM
ClosedPublic

Authored by mkazantsev on Aug 7 2018, 2:46 AM.

Details

Reviewers

skatkov
reames
apilipenko
dstenb

Commits

rG69f6dfa0f8c5: [LICM] Use ICFLoopSafetyInfo in LICM
rL346201: [LICM] Use ICFLoopSafetyInfo in LICM

Summary

This patch makes LICM use ICFLoopSafetyInfo, a smarter version of
LoopSafetyInfo that leverages Implicit Control Flow Tracking to keep
track of throwing instructions and give less pessimistic answers to
queries related to throws.

ICFLoopSafetyInfo itself was introduced in rL344601. This patch
enables it in LICM only.
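
To make "less pessimistic" concrete, here is a minimal sketch of the idea, assuming a toy IR. `ToyInst`, `ToyBlock`, `ToyICFTracker` and `blockMayThrow` are hypothetical names invented for this illustration and are not the LLVM classes or their API. The block-granular query only knows whether anything in the block may throw, while the tracker remembers where the first such instruction sits and can therefore answer per position.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Toy stand-ins for LLVM IR; these are NOT the real LLVM classes.
struct ToyInst {
  bool MayThrow; // stands in for "has implicit control flow"
};
using ToyBlock = std::vector<ToyInst>;

// Hypothetical tracker: lazily caches, per block, the index of the first
// instruction that may throw (or the block size if there is none).
class ToyICFTracker {
  std::unordered_map<const ToyBlock *, std::size_t> FirstICF;

public:
  std::size_t firstICF(const ToyBlock &BB) {
    auto It = FirstICF.find(&BB);
    if (It != FirstICF.end())
      return It->second;
    std::size_t Idx = 0;
    while (Idx < BB.size() && !BB[Idx].MayThrow)
      ++Idx;
    FirstICF[&BB] = Idx;
    return Idx;
  }

  // True if an instruction that may throw comes strictly before Pos.
  bool mayThrowBefore(const ToyBlock &BB, std::size_t Pos) {
    assert(Pos < BB.size() && "position out of range");
    return firstICF(BB) < Pos;
  }
};

// Block-granular answer, for comparison: does anything in the block throw?
inline bool blockMayThrow(const ToyBlock &BB) {
  for (const ToyInst &I : BB)
    if (I.MayThrow)
      return true;
  return false;
}
```

For a block whose third instruction may throw, `blockMayThrow` reports a throw for the whole block, whereas `mayThrowBefore` still reports that the first two instructions are reached without passing a throwing instruction.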

Diff Detail

Event Timeline

mkazantsev created this revision. Aug 7 2018, 2:46 AM

Herald added a subscriber: zzheng. Aug 7 2018, 2:46 AM

I will split this into a functional change and a bunch of NFC's.

mkazantsev mentioned this in D50426: [NFC][MustExecute] Rework API to start making better analysis, part 2. Aug 7 2018, 10:47 PM

mkazantsev added inline comments. Aug 8 2018, 12:34 AM

test/Transforms/LICM/funclet.ll
60 ↗	(On Diff #159470)	Looks like we've exposed some bug in LICM.

Fixed bug (missing fill of color map in the beginning), rebased on top of API NFC.

mkazantsev added a parent revision: D50426: [NFC][MustExecute] Rework API to start making better analysis, part 2. Aug 8 2018, 3:50 AM

mkazantsev added a child revision: D50501: [LICM] Hoist guards with invariant conditions. Aug 9 2018, 3:13 AM

I'm really worried about LoopSafetyInfo getting more nontrivial state
(Philip already raised a question in D50426 about CurLoop potentially becoming invalid, and here we get even more with addition of ICF).

Before this change LoopSafetyInfo was one-time-per-Loop-worker + constant queries after that, and original interface conveyed that.
Now every query becomes a potentially mutating worker, with a cache (ICF) that causes correctness problems if not being invalidated properly.

I have a feeling that changes suggested to the interface do not fully reflect the new semantics.
I can easily see how people can forget to invalidateBlock.
(and, btw, are we sure that invalidateBlock is not needed in places that use LoopSafetyInfo other than LICM?)

I don't know what a better solution would be, though I do suspect this is a problem frequently seen in LLVM, so there should already be solutions
to invalidation of stale information after IR mutation.

Feel free to ignore all the above if it does not bother you :)
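
The stale-cache concern raised in this comment can be shown with a minimal sketch, assuming a toy IR. `ToyInst`, `ToyBlock` and `CachedMayThrow` are hypothetical; only the name `invalidateBlock` is borrowed from the discussion, and nothing here is the LLVM implementation. Once a query has populated the cache, a later mutation of the block is invisible until the block is invalidated.

```cpp
#include <cassert>
#include <unordered_map>
#include <vector>

// Toy stand-ins for LLVM IR; not the real classes.
struct ToyInst {
  bool MayThrow;
};
using ToyBlock = std::vector<ToyInst>;

// Hypothetical analysis whose queries fill a per-block cache.
class CachedMayThrow {
  std::unordered_map<const ToyBlock *, bool> Cache;

public:
  // A "const-looking" query that actually mutates internal state.
  bool blockMayThrow(const ToyBlock &BB) {
    auto It = Cache.find(&BB);
    if (It != Cache.end())
      return It->second; // may be stale if BB changed since the last query
    bool Result = false;
    for (const ToyInst &I : BB)
      Result = Result || I.MayThrow;
    Cache[&BB] = Result;
    return Result;
  }

  // Must be called by whoever mutates BB; forgetting it is the hazard.
  void invalidateBlock(const ToyBlock &BB) {
    assert(!BB.empty() && "toy model expects non-empty blocks");
    Cache.erase(&BB);
  }
};
```

In this model, appending a throwing instruction after the first query leaves `blockMayThrow` answering `false` until `invalidateBlock` is called, which is exactly the kind of silent miscompile risk the comment worries about.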

It does bother me. I see no other way to make it fast. We can turn off caching in ICF and make it super-expensive but correct. That people may forget to invalidate something is always a problem, we've seen a lot of it in SCEV, but what are the alternatives?

mkazantsev planned changes to this revision. Aug 14 2018, 9:29 PM

mkazantsev edited parent revisions, added: D50558: [MustExecute] Fix algorithmic bug in isGuaranteedToExecute. PR38514; removed: D50426: [NFC][MustExecute] Rework API to start making better analysis, part 2. Aug 15 2018, 1:00 AM

mkazantsev updated this revision to Diff 161002. Aug 16 2018, 4:49 AM

mkazantsev added a child revision: D50838: [NFC] Remove function isGuaranteedToExecute. Aug 16 2018, 5:25 AM

mkazantsev updated this revision to Diff 161012. Aug 16 2018, 6:15 AM

mkazantsev added a child revision: D50890: [NFC] Factor out predecessors collection into a separate method. Aug 17 2018, 3:59 AM

mkazantsev added inline comments. Aug 20 2018, 9:42 PM

include/llvm/Analysis/MustExecute.h
88 ↗	(On Diff #161012)	Infi -> Info

reames requested changes to this revision. Aug 20 2018, 10:15 PM

reames added inline comments.

lib/Analysis/MustExecute.cpp
32 ↗	(On Diff #161012)	We could choose to be more precise here by recomputing MayThrow lazily after invalidation.
lib/Transforms/Scalar/LICM.cpp
411	Invalidation is needed eagerly here.
485	BUG. Missing invalidation.
1296	There needs to be some invalidations somewhere within this function.
lib/Transforms/Scalar/LoopUnswitch.cpp
514 ↗	(On Diff #161012)	The fact there is no invalidation in loop unswitch is severely suspicious.

This revision now requires changes to proceed. Aug 20 2018, 10:15 PM

mkazantsev removed a child revision: D50890: [NFC] Factor out predecessors collection into a separate method. Aug 21 2018, 12:10 AM

mkazantsev added a child revision: D50891: [LICM] Hoist guards from non-header blocks.

mkazantsev removed a child revision: D50501: [LICM] Hoist guards with invariant conditions. Aug 21 2018, 12:20 AM

mkazantsev added a child revision: D50888: [NFC][LICM] Remove too conservative IsMustExecute variable. Aug 22 2018, 2:13 AM

Rebased, added cache invalidation in LICM. Still WIP because I need to understand where we need to invalidate in loop unswitching.

Still WIP until invalidation in LoopUnswitching is done properly (or proved to be unnecessary).

Added dependency to D51523. Though there is no functional dependency between them, I don't feel comfortable letting it be merged without the validation we have there.

Added more invalidation. Base fuzzing suite passes ok.

fedor.sergeev added inline comments. Aug 31 2018, 1:15 AM

lib/Transforms/Scalar/LoopUnswitch.cpp
525 ↗	(On Diff #163463)	Since we only need SafetyInfo for SanitizeMemory and it has already been converted to a pointer, perhaps it would be cleaner to: (1) initialize SafetyInfo to non-null only under SanitizeMemory; (2) operate on SafetyInfo (e.g. dropping caches) only under if (SafetyInfo); (3) set SafetyInfo to null at the end of this function (just to avoid leaving a dangling pointer).
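
The suggestion in this inline comment could look roughly like the following sketch. `ToySafetyInfo` and `ToyUnswitch` are hypothetical stand-ins, not the LoopUnswitch code, and the sketch swaps the raw pointer discussed above for `std::unique_ptr` so that resetting it both nulls and frees the object.

```cpp
#include <cassert>
#include <memory>

// Toy stand-in for the safety analysis; not the LLVM class.
struct ToySafetyInfo {
  bool HasCache = false;
  void compute() { HasCache = true; }
  void dropCaches() { HasCache = false; }
};

class ToyUnswitch {
  // Null unless the current run actually needs the analysis.
  std::unique_ptr<ToySafetyInfo> SafetyInfo;

public:
  // Returns true if the analysis was set up for this run.
  bool processLoop(bool SanitizeMemory) {
    assert(!SafetyInfo && "state leaked from a previous run");
    if (SanitizeMemory) {
      SafetyInfo.reset(new ToySafetyInfo());
      SafetyInfo->compute();
    }

    // ... the transformation would mutate the loop here ...

    if (SafetyInfo)
      SafetyInfo->dropCaches(); // only touch it when it exists
    bool Used = SafetyInfo != nullptr;
    SafetyInfo.reset(); // nothing stale survives the call
    return Used;
  }

  bool hasSafetyInfo() const { return SafetyInfo != nullptr; }
};
```

Every use is guarded by the null check, so code paths that never need the analysis cannot touch a stale cache by accident.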

mkazantsev added inline comments. Sep 2 2018, 8:01 PM

lib/Transforms/Scalar/LoopUnswitch.cpp
525 ↗	(On Diff #163463)	Good idea. I would rather not do it in this patch because I want everything that is not connected to the logic to stay straightforward. Will prepare a follow-up.

Another round of bugs found.

I am generally concerned about the lack of progress towards a clearly correct patch here. I think we need to revisit how this is being approached and find a way to split this into individually obviously correct pieces.

One possible split (not the only one) would be the following:

Add ICF info to LoopSafetyInfo, but don't remove HeaderMayThrow approach. Enable the use of ICF (for asserts only) under an off by default flag. (Flag could be a param to constructor or cl::opt?)
For each pass, review in isolation. Update tests to use off by default flag.
Once *all* passes are done, change default and remove flag. (But still only ICF for asserts!)
Wait 1 week.
Remove HeaderMayThrow, and enhance logic.
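
One possible reading of the "ICF for asserts only, under an off-by-default flag" step is sketched below, assuming a toy IR. `UseICFForAsserts`, `headerMayThrow`, `mayThrowBefore` and `isGuaranteedToReach` are hypothetical names for this illustration and are neither an LLVM option nor LLVM functions. The conservative answer is always the one returned; the precise analysis only cross-checks it.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-ins for LLVM IR; not the real classes.
struct ToyInst {
  bool MayThrow;
};
using ToyBlock = std::vector<ToyInst>;

// Hypothetical flag, off by default.
static bool UseICFForAsserts = false;

// Old, block-granular approach: does anything in the header throw?
inline bool headerMayThrow(const ToyBlock &Header) {
  for (const ToyInst &I : Header)
    if (I.MayThrow)
      return true;
  return false;
}

// New, position-aware approach: does anything before Pos throw?
inline bool mayThrowBefore(const ToyBlock &Header, std::size_t Pos) {
  for (std::size_t I = 0; I < Pos && I < Header.size(); ++I)
    if (Header[I].MayThrow)
      return true;
  return false;
}

// Always returns the conservative answer. Under the flag, the precise
// analysis is consulted only to assert that it never contradicts a
// "guaranteed" verdict from the conservative one.
inline bool isGuaranteedToReach(const ToyBlock &Header, std::size_t Pos) {
  bool Conservative = !headerMayThrow(Header);
  if (UseICFForAsserts) {
    bool Precise = !mayThrowBefore(Header, Pos);
    assert((!Conservative || Precise) &&
           "precise analysis must be at least as permissive");
    (void)Precise;
  }
  return Conservative;
}
```

With this shape, turning the flag on cannot change generated code, so each pass can be reviewed in isolation before the old logic is removed.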

lib/Transforms/Scalar/LICM.cpp
1296	It doesn't look like my previous comment has been addressed. There's still a bug here.
lib/Transforms/Scalar/LoopUnswitch.cpp
187 ↗	(On Diff #163463)	Default init to nullptr.
1558 ↗	(On Diff #163463)	Why? I think this is unnecessary.

This revision now requires changes to proceed. Sep 10 2018, 2:01 PM

After digging deeper into the code, I grow more and more convinced that there was no bug to start with, and dangling pointers to instructions that are not tracked by ICF's map (even if they are tracked by OrderedInstructions) cause no problems.

I need to revisit the underlying patches and state clearly when we need to invalidate the ICF. My plan is to discard all invalidation from this code (except for guard motion). There was initially no bug there, and now the attempt to fix this non-existent bug has made the code totally unreadable.

mkazantsev added a parent revision: D51923: [NFC] Add validation to Ordered Instructions. Sep 11 2018, 3:58 AM

https://reviews.llvm.org/D51664 This patch claims to make the OrderedInstructions auto-invalidable. If we build on top of that (and if it works :)), we may give up all invalidation stuff we currently have and only invalidate when we remove ICF instructions from blocks.

mkazantsev removed a parent revision: D51923: [NFC] Add validation to Ordered Instructions. Sep 11 2018, 10:32 PM

fhahn mentioned this in D51664: [IR] Lazily number instructions for local dominance queries. Sep 12 2018, 2:33 PM

Rebased on top of D51664. This patch is really helpful here because it makes the invalidation of OrderedInstructions automatic. With that, we *only* need to invalidate whenever we may actually change the first ICF instruction or when we erase a basic block. I've therefore discarded most of the invalidation we added in the previous steps because it is actually not needed. We don't need to invalidate when we insert Phis or do stuff like that.
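
As an illustration of what "automatic invalidation" of an instruction ordering can mean, here is a toy sketch that is not taken from D51664. `ToyInst`, `ToyBlock` and `comesBefore` are hypothetical. Each block carries a validity bit that every insertion clears, and ordering queries renumber lazily when the bit is unset, so callers never invalidate by hand.

```cpp
#include <cassert>
#include <list>

struct ToyBlock;

// Toy instruction: remembers its block and a cached position number.
struct ToyInst {
  ToyBlock *Parent;
  unsigned Order;
};

struct ToyBlock {
  std::list<ToyInst> Insts; // std::list keeps element addresses stable
  bool OrderValid = false;

  // Every mutation clears the validity bit; no caller action is needed.
  ToyInst *pushFront() {
    Insts.push_front(ToyInst{this, 0});
    OrderValid = false;
    return &Insts.front();
  }
  ToyInst *pushBack() {
    Insts.push_back(ToyInst{this, 0});
    OrderValid = false;
    return &Insts.back();
  }

  void renumber() {
    unsigned N = 0;
    for (ToyInst &I : Insts)
      I.Order = N++;
    OrderValid = true;
  }
};

// Local ordering query: renumbers lazily if the cached numbers are stale.
inline bool comesBefore(ToyInst *A, ToyInst *B) {
  assert(A->Parent == B->Parent && "instructions must share a block");
  if (!A->Parent->OrderValid)
    A->Parent->renumber();
  return A->Order < B->Order;
}
```

Because staleness is detected inside the query, the only invalidation left to the client in such a design concerns state the ordering cache does not know about, such as which instruction is the first one with implicit control flow.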

I also think that the verification we have now is robust: it's up to OrderedInstructions to assert correctness of its auto-invalidation, and for the ICF map we already have pretty nice assertions that will tell us if we failed to invalidate properly.

mkazantsev added a parent revision: D52017: [NFC] Introduce surgical invalidation of IPT. Sep 13 2018, 8:59 PM

Ping?

Actually, review can wait until D51664 is merged. Might need a rebase after that.

mkazantsev removed a parent revision: D52017: [NFC] Introduce surgical invalidation of IPT. Oct 15 2018, 10:46 PM

I've checked in the underlying ICF logic in a separate class as NFC, see rL344601. The plan is to enable it in different passes one by one. This one enables it in LICM.

Sorry, I can likely not give any valuable input for this review.

mkazantsev added a reviewer: apilipenko. Oct 28 2018, 8:43 PM

Current patch seems to have comments like "this block doesn't need invalidation". Maybe sink this decision down to the SafetyInfo implementation? It should reduce the complexity of the user: just call invalidate every time you delete/insert instructions and you would be fine.

lib/Transforms/Scalar/LICM.cpp
407–409	I'd suggest introducing a helper for removing the instruction. We need to do some invalidation every time an instruction is removed. Having a helper would help to avoid missing invalidation in new code.
1522–1525	Having invalidation in Promoter's callbacks (like instructionDeleted) seems more natural. We invalidate CurAST there.
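
A helper of the kind suggested in these inline comments might look like the sketch below. It is a toy model: `ToyBlock`, `ToySafetyInfo` and `eraseInstruction` are hypothetical, and only the method name `dropCachedInfo` is borrowed from the diff. The point is that removal and invalidation happen in one place, so new call sites cannot forget the second half.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <vector>

// Toy block: instructions are just integers here.
struct ToyBlock {
  std::vector<int> Insts;
};

// Toy safety info that remembers which blocks it has cached facts about.
struct ToySafetyInfo {
  std::set<const ToyBlock *> Cached;

  void query(const ToyBlock &BB) { Cached.insert(&BB); }
  void dropCachedInfo(const ToyBlock &BB) { Cached.erase(&BB); }
  bool hasCachedInfo(const ToyBlock &BB) const {
    return Cached.count(&BB) != 0;
  }
};

// Hypothetical helper: the only sanctioned way to delete an instruction.
// It invalidates the cached facts for the block before mutating it.
inline void eraseInstruction(ToyBlock &BB, std::size_t Index,
                             ToySafetyInfo &SI) {
  assert(Index < BB.Insts.size() && "index out of range");
  SI.dropCachedInfo(BB);
  BB.Insts.erase(BB.Insts.begin() + static_cast<std::ptrdiff_t>(Index));
}
```

As the author's reply notes, erasing is not the only mutation that matters, so a helper like this would cover deletions but not moves or insertions.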

mkazantsev added inline comments. Nov 1 2018, 2:47 AM

lib/Transforms/Scalar/LICM.cpp
407–409	We don't always do invalidation by eraseFromParent. There are cases when we move instructions, insert instructions or call `run` on promoter.

Addressed comments.

Herald added a subscriber: jfb. Nov 1 2018, 3:59 AM

mkazantsev marked an inline comment as done. Nov 1 2018, 4:00 AM

Rebased

Updated test/CodeGen/AMDGPU/build-vector-insert-elt-infloop.ll

Without this patch, SafetyInfo falsely assumes that this loop is throwing and therefore prevents the promoter from eliminating the store. With smarter analysis, it is legal to eliminate the store, but the test doesn't expect that. Make the store volatile to ensure that it doesn't get removed.

Herald added subscribers: nhaehnle, jvesely. Nov 1 2018, 6:03 PM

BTW this test is invalid, it has a load from nullptr and therefore contains UB.

apilipenko accepted this revision. Nov 2 2018, 4:21 PM

apilipenko added inline comments.

lib/Transforms/Scalar/LICM.cpp
518	I think we can invalidate once. I know that in previous comments I asked to move the decision of whether invalidation is needed into the implementation of the SafetyInfo, but calling invalidation two times in a row is simple enough to fix in the caller.

mkazantsev added inline comments. Nov 2 2018, 9:00 PM

lib/Transforms/Scalar/LICM.cpp
518	It is free anyways.

nhaehnle removed a subscriber: nhaehnle. Nov 5 2018, 4:03 AM

This revision was not accepted when it landed; it landed in state Needs Review. Nov 5 2018, 6:48 PM

Closed by commit rL346201: [LICM] Use ICFLoopSafetyInfo in LICM (authored by mkazantsev).

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
include/llvm/Transforms/Utils/LoopUtils.h	7 lines
lib/Transforms/Scalar/LICM.cpp	31 lines
test/Transforms/LICM/ (four files, names not shown)	6 lines, 147 lines, 29 lines, 21 lines

Diff 169806

include/llvm/Transforms/Utils/LoopUtils.h

(103 lines not shown)
 /// reverse depth first order w.r.t the DominatorTree. This allows us to visit
 /// uses before definitions, allowing us to sink a loop body in one pass without
 /// iteration. Takes DomTreeNode, AliasAnalysis, LoopInfo, DominatorTree,
 /// DataLayout, TargetLibraryInfo, Loop, AliasSet information for all
 /// instructions of the loop and loop safety information as
 /// arguments. Diagnostics is emitted via \p ORE. It returns changed status.
 bool sinkRegion(DomTreeNode *, AliasAnalysis *, LoopInfo *, DominatorTree *,
                 TargetLibraryInfo *, TargetTransformInfo *, Loop *,
-                AliasSetTracker *, LoopSafetyInfo *,
+                AliasSetTracker *, ICFLoopSafetyInfo *,
                 OptimizationRemarkEmitter *ORE);

 /// Walk the specified region of the CFG (defined by all blocks
 /// dominated by the specified block, and that are in the current loop) in depth
 /// first order w.r.t the DominatorTree. This allows us to visit definitions
 /// before uses, allowing us to hoist a loop body in one pass without iteration.
 /// Takes DomTreeNode, AliasAnalysis, LoopInfo, DominatorTree, DataLayout,
 /// TargetLibraryInfo, Loop, AliasSet information for all instructions of the
 /// loop and loop safety information as arguments. Diagnostics is emitted via \p
 /// ORE. It returns changed status.
 bool hoistRegion(DomTreeNode *, AliasAnalysis *, LoopInfo *, DominatorTree *,
                  TargetLibraryInfo *, Loop *, AliasSetTracker *,
-                 LoopSafetyInfo *, OptimizationRemarkEmitter *ORE);
+                 ICFLoopSafetyInfo *, OptimizationRemarkEmitter *ORE);

 /// This function deletes dead loops. The caller of this function needs to
 /// guarantee that the loop is infact dead.
 /// The function requires a bunch or prerequisites to be present:
 ///   - The loop needs to be in LCSSA form
 ///   - The loop needs to have a Preheader
 ///   - A unique dedicated exit block must exist
 ///
(12 lines not shown)
 /// LoopInfo, DominatorTree, Loop, AliasSet information for all instructions
 /// of the loop and loop safety information as arguments.
 /// Diagnostics is emitted via \p ORE. It returns changed status.
 bool promoteLoopAccessesToScalars(const SmallSetVector<Value *, 8> &,
                                   SmallVectorImpl<BasicBlock *> &,
                                   SmallVectorImpl<Instruction *> &,
                                   PredIteratorCache &, LoopInfo *,
                                   DominatorTree *, const TargetLibraryInfo *,
-                                  Loop *, AliasSetTracker *, LoopSafetyInfo *,
+                                  Loop *, AliasSetTracker *,
+                                  ICFLoopSafetyInfo *,
                                   OptimizationRemarkEmitter *);

 /// Does a BFS from a given node to all of its children inside a given loop.
 /// The returned vector of nodes includes the starting point.
 SmallVector<DomTreeNode *, 16> collectChildrenInLoop(DomTreeNode *N,
                                                      const Loop *CurLoop);

 /// Returns the instructions that use values defined in the loop.
(92 lines not shown)

lib/Transforms/Scalar/LICM.cpp

(97 lines not shown)
 LICMN2Theshold("licm-n2-threshold", cl::Hidden, cl::init(0),
                cl::desc("How many instruction to cross product using AA"));

 static bool inSubLoop(BasicBlock *BB, Loop *CurLoop, LoopInfo *LI);
 static bool isNotUsedOrFreeInLoop(const Instruction &I, const Loop *CurLoop,
                                   const LoopSafetyInfo *SafetyInfo,
                                   TargetTransformInfo *TTI, bool &FreeInLoop);
 static void hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,
-                  LoopSafetyInfo *SafetyInfo,
+                  ICFLoopSafetyInfo *SafetyInfo,
                   OptimizationRemarkEmitter *ORE);
 static bool sink(Instruction &I, LoopInfo *LI, DominatorTree *DT,
-                 const Loop *CurLoop, LoopSafetyInfo *SafetyInfo,
+                 const Loop *CurLoop, ICFLoopSafetyInfo *SafetyInfo,
                  OptimizationRemarkEmitter *ORE, bool FreeInLoop);
 static bool isSafeToExecuteUnconditionally(Instruction &Inst,
                                            const DominatorTree *DT,
                                            const Loop *CurLoop,
                                            const LoopSafetyInfo *SafetyInfo,
                                            OptimizationRemarkEmitter *ORE,
                                            const Instruction *CtxI = nullptr);
 static bool pointerInvalidatedByLoop(MemoryLocation MemLoc,
(144 lines not shown, inside: bool LoopInvariantCodeMotion::runOnLoop()
   assert(L->isLCSSAForm(*DT) && "Loop is not in LCSSA form.");

   std::unique_ptr<AliasSetTracker> CurAST = collectAliasInfoForLoop(L, LI, AA);

   // Get the preheader block to move instructions into...
   BasicBlock *Preheader = L->getLoopPreheader();

   // Compute loop safety information.
-  SimpleLoopSafetyInfo SafetyInfo;
+  ICFLoopSafetyInfo SafetyInfo(DT);
   SafetyInfo.computeLoopSafetyInfo(L);

   // We want to visit all of the instructions in this loop... that are not parts
   // of our subloops (they have already had their invariants hoisted out of
   // their loop, into this loop, so there is no need to process the BODIES of
   // the subloops).
   //
   // Traverse the body of the loop in depth first order on the dominator tree so
(90 lines not shown)
 /// Walk the specified region of the CFG (defined by all blocks dominated by
 /// the specified block, and that are in the current loop) in reverse depth
 /// first order w.r.t the DominatorTree. This allows us to visit uses before
 /// definitions, allowing us to sink a loop body in one pass without iteration.
 ///
 bool llvm::sinkRegion(DomTreeNode *N, AliasAnalysis *AA, LoopInfo *LI,
                       DominatorTree *DT, TargetLibraryInfo *TLI,
                       TargetTransformInfo *TTI, Loop *CurLoop,
-                      AliasSetTracker *CurAST, LoopSafetyInfo *SafetyInfo,
+                      AliasSetTracker *CurAST, ICFLoopSafetyInfo *SafetyInfo,
                       OptimizationRemarkEmitter *ORE) {

   // Verify inputs.
   assert(N != nullptr && AA != nullptr && LI != nullptr && DT != nullptr &&
          CurLoop != nullptr && CurAST && SafetyInfo != nullptr &&
          "Unexpected input to sinkRegion");

   // We want to visit children before parents. We will enque all the parents
(13 lines not shown, inside: for (BasicBlock::iterator II = BB->end(); II != BB->begin();) {)
       Instruction &I = *--II;

       // If the instruction is dead, we would try to sink it because it isn't
       // used in the loop, instead, just delete it.
       if (isInstructionTriviallyDead(&I, TLI)) {
         LLVM_DEBUG(dbgs() << "LICM deleting dead inst: " << I << '\n');
         salvageDebugInfo(I);
         ++II;
+        SafetyInfo->dropCachedInfo(I.getParent());
         CurAST->deleteValue(&I);
         I.eraseFromParent();
[Inline comment] apilipenko: I'd suggest introducing a helper for removing the instruction. We need to do some invalidation every time an instruction is removed. Having a helper would help to avoid missing invalidation in new code.
[Inline comment] mkazantsev: We don't always do invalidation by eraseFromParent. There are cases when we move instructions, insert instructions or call `run` on promoter.
         Changed = true;
         continue;
[Inline comment] reames: Invalidation is needed eagerly here.
       }

       // Check to see if we can sink this instruction to the exit blocks
       // of the loop. We can do this if the all users of the instruction are
       // outside of the loop. In this case, it doesn't even matter if the
       // operands of the instruction are loop invariant.
       //
       bool FreeInLoop = false;
       if (isNotUsedOrFreeInLoop(I, CurLoop, SafetyInfo, TTI, FreeInLoop) &&
           canSinkOrHoistInst(I, AA, DT, CurLoop, CurAST, true, ORE) &&
           !I.mayHaveSideEffects()) {
         if (sink(I, LI, DT, CurLoop, SafetyInfo, ORE, FreeInLoop)) {
           if (!FreeInLoop) {
             ++II;
+            SafetyInfo->dropCachedInfo(I.getParent());
             CurAST->deleteValue(&I);
             I.eraseFromParent();
           }
           Changed = true;
         }
       }
     }
   }
   return Changed;
 }

 /// Walk the specified region of the CFG (defined by all blocks dominated by
 /// the specified block, and that are in the current loop) in depth first
 /// order w.r.t the DominatorTree. This allows us to visit definitions before
 /// uses, allowing us to hoist a loop body in one pass without iteration.
 ///
 bool llvm::hoistRegion(DomTreeNode *N, AliasAnalysis *AA, LoopInfo *LI,
                        DominatorTree *DT, TargetLibraryInfo *TLI, Loop *CurLoop,
-                       AliasSetTracker *CurAST, LoopSafetyInfo *SafetyInfo,
+                       AliasSetTracker *CurAST, ICFLoopSafetyInfo *SafetyInfo,
                        OptimizationRemarkEmitter *ORE) {
   // Verify inputs.
   assert(N != nullptr && AA != nullptr && LI != nullptr && DT != nullptr &&
          CurLoop != nullptr && CurAST != nullptr && SafetyInfo != nullptr &&
          "Unexpected input to hoistRegion");

   // We want to visit parents before children. We will enque all the parents
   // before their children in the worklist and process the worklist in order.
(24 lines not shown, inside: for (BasicBlock::iterator II = BB->begin(), E = BB->end(); II != E;) {)
       // just fold it.
       if (Constant *C = ConstantFoldInstruction(
               &I, I.getModule()->getDataLayout(), TLI)) {
         LLVM_DEBUG(dbgs() << "LICM folding inst: " << I << "  --> " << *C
                           << '\n');
         CurAST->copyValue(&I, C);
         I.replaceAllUsesWith(C);
         if (isInstructionTriviallyDead(&I, TLI)) {
+          SafetyInfo->dropCachedInfo(I.getParent());
           CurAST->deleteValue(&I);
           I.eraseFromParent();
[Inline comment] reames: BUG. Missing invalidation.
         }
         Changed = true;
         continue;
       }

       // Try hoisting the instruction out to the preheader. We can only do
       // this if all of the operands of the instruction are loop invariant and
       // if it is safe to hoist the instruction.
(13 lines not shown, inside: for (BasicBlock::iterator II = BB->begin(), E = BB->end(); II != E;) {)
       // converting it to a reciprocal multiplication.
       if (I.getOpcode() == Instruction::FDiv &&
           CurLoop->isLoopInvariant(I.getOperand(1)) &&
           I.hasAllowReciprocal()) {
         auto Divisor = I.getOperand(1);
         auto One = llvm::ConstantFP::get(Divisor->getType(), 1.0);
         auto ReciprocalDivisor = BinaryOperator::CreateFDiv(One, Divisor);
         ReciprocalDivisor->setFastMathFlags(I.getFastMathFlags());
+        SafetyInfo->dropCachedInfo(I.getParent());
[Inline comment] apilipenko: I think we can invalidate once. I know, in previous comments I was asking to move the complexity of the decision making whether the invalidation needed or not to the implementation of the SafetyInfo, but calling invalidation two times in a row is simple enough to fix in the caller.
[Inline comment] mkazantsev: It is free anyways.
         ReciprocalDivisor->insertBefore(&I);

         auto Product =
             BinaryOperator::CreateFMul(I.getOperand(0), ReciprocalDivisor);
         Product->setFastMathFlags(I.getFastMathFlags());
         Product->insertAfter(&I);
         I.replaceAllUsesWith(Product);
         I.eraseFromParent();
▲ Show 20 Lines • Show All 467 Lines • ▼ Show 20 Lines
}		}

/// When an instruction is found to only be used outside of the loop, this		/// When an instruction is found to only be used outside of the loop, this
/// function moves it to the exit blocks and patches up SSA form as needed.		/// function moves it to the exit blocks and patches up SSA form as needed.
/// This method is guaranteed to remove the original instruction from its		/// This method is guaranteed to remove the original instruction from its
/// position, and may either delete it or move it to outside of the loop.		/// position, and may either delete it or move it to outside of the loop.
///		///
static bool sink(Instruction &I, LoopInfo LI, DominatorTree DT,		static bool sink(Instruction &I, LoopInfo LI, DominatorTree DT,
const Loop CurLoop, LoopSafetyInfo SafetyInfo,		const Loop CurLoop, ICFLoopSafetyInfo SafetyInfo,
OptimizationRemarkEmitter *ORE, bool FreeInLoop) {		OptimizationRemarkEmitter *ORE, bool FreeInLoop) {
LLVM_DEBUG(dbgs() << "LICM sinking instruction: " << I << "\n");		LLVM_DEBUG(dbgs() << "LICM sinking instruction: " << I << "\n");
ORE->emit([&]() {		ORE->emit([&]() {
return OptimizationRemark(DEBUG_TYPE, "InstSunk", &I)		return OptimizationRemark(DEBUG_TYPE, "InstSunk", &I)
<< "sinking " << ore::NV("Inst", &I);		<< "sinking " << ore::NV("Inst", &I);
});		});
bool Changed = false;		bool Changed = false;
if (isa<LoadInst>(I))		if (isa<LoadInst>(I))
▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines	for (auto *UI : Users) {

PHINode *PN = cast<PHINode>(User);		PHINode *PN = cast<PHINode>(User);
assert(ExitBlockSet.count(PN->getParent()) &&		assert(ExitBlockSet.count(PN->getParent()) &&
"The LCSSA PHI is not in an exit block!");		"The LCSSA PHI is not in an exit block!");
// The PHI must be trivially replaceable.		// The PHI must be trivially replaceable.
Instruction *New = sinkThroughTriviallyReplaceablePHI(PN, &I, LI, SunkCopies,		Instruction *New = sinkThroughTriviallyReplaceablePHI(PN, &I, LI, SunkCopies,
SafetyInfo, CurLoop);		SafetyInfo, CurLoop);
PN->replaceAllUsesWith(New);		PN->replaceAllUsesWith(New);
		SafetyInfo->dropCachedInfo(I.getParent());
		SafetyInfo->dropCachedInfo(New->getParent());
PN->eraseFromParent();		PN->eraseFromParent();
Changed = true;		Changed = true;
}		}
return Changed;		return Changed;
}		}

/// When an instruction is found to only use loop invariant operands that		/// When an instruction is found to only use loop invariant operands that
/// is safe to hoist, this instruction is called to do the dirty work.		/// is safe to hoist, this instruction is called to do the dirty work.
///		///
static void hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,		static void hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,
LoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE) {		ICFLoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE) {
auto *Preheader = CurLoop->getLoopPreheader();		auto *Preheader = CurLoop->getLoopPreheader();
LLVM_DEBUG(dbgs() << "LICM hoisting to " << Preheader->getName() << ": " << I		LLVM_DEBUG(dbgs() << "LICM hoisting to " << Preheader->getName() << ": " << I
<< "\n");		<< "\n");
ORE->emit([&]() {		ORE->emit([&]() {
return OptimizationRemark(DEBUG_TYPE, "Hoisted", &I) << "hoisting "		return OptimizationRemark(DEBUG_TYPE, "Hoisted", &I) << "hoisting "
<< ore::NV("Inst", &I);		<< ore::NV("Inst", &I);
});		});

// Metadata can be dependent on conditions we are hoisting above.		// Metadata can be dependent on conditions we are hoisting above.
// Conservatively strip all metadata on the instruction unless we were		// Conservatively strip all metadata on the instruction unless we were
// guaranteed to execute I if we entered the loop, in which case the metadata		// guaranteed to execute I if we entered the loop, in which case the metadata
// is valid in the loop preheader.		// is valid in the loop preheader.
if (I.hasMetadataOtherThanDebugLoc() &&		if (I.hasMetadataOtherThanDebugLoc() &&
// The check on hasMetadataOtherThanDebugLoc is to prevent us from burning		// The check on hasMetadataOtherThanDebugLoc is to prevent us from burning
// time in isGuaranteedToExecute if we don't actually have anything to		// time in isGuaranteedToExecute if we don't actually have anything to
// drop. It is a compile time optimization, not required for correctness.		// drop. It is a compile time optimization, not required for correctness.
!SafetyInfo->isGuaranteedToExecute(I, DT, CurLoop))		!SafetyInfo->isGuaranteedToExecute(I, DT, CurLoop))
I.dropUnknownNonDebugMetadata();		I.dropUnknownNonDebugMetadata();

		// Invalidation of Preheader is not needed because it is not a part of the
		// loop.
		SafetyInfo->dropCachedInfo(I.getParent());
// Move the new node to the Preheader, before its terminator.		// Move the new node to the Preheader, before its terminator.
I.moveBefore(Preheader->getTerminator());		I.moveBefore(Preheader->getTerminator());

// Do not retain debug locations when we are moving instructions to different		// Do not retain debug locations when we are moving instructions to different
// basic blocks, because we want to avoid jumpy line tables. Calls, however,		// basic blocks, because we want to avoid jumpy line tables. Calls, however,
// need to retain their debug locs because they may be inlined.		// need to retain their debug locs because they may be inlined.
// FIXME: How do we retain source locations without causing poor debugging		// FIXME: How do we retain source locations without causing poor debugging
// behavior?		// behavior?
▲ Show 20 Lines • Show All 145 Lines • ▼ Show 20 Lines
/// the stores in the loop, looking for stores to Must pointers which are		/// the stores in the loop, looking for stores to Must pointers which are
/// loop invariant.		/// loop invariant.
///		///
bool llvm::promoteLoopAccessesToScalars(		bool llvm::promoteLoopAccessesToScalars(
const SmallSetVector<Value *, 8> &PointerMustAliases,		const SmallSetVector<Value *, 8> &PointerMustAliases,
SmallVectorImpl<BasicBlock *> &ExitBlocks,		SmallVectorImpl<BasicBlock *> &ExitBlocks,
SmallVectorImpl<Instruction *> &InsertPts, PredIteratorCache &PIC,		SmallVectorImpl<Instruction *> &InsertPts, PredIteratorCache &PIC,
LoopInfo *LI, DominatorTree *DT, const TargetLibraryInfo *TLI,		LoopInfo *LI, DominatorTree *DT, const TargetLibraryInfo *TLI,
Loop *CurLoop, AliasSetTracker *CurAST, LoopSafetyInfo *SafetyInfo,		Loop *CurLoop, AliasSetTracker *CurAST, ICFLoopSafetyInfo *SafetyInfo,
OptimizationRemarkEmitter *ORE) {		OptimizationRemarkEmitter *ORE) {
// Verify inputs.		// Verify inputs.
assert(LI != nullptr && DT != nullptr && CurLoop != nullptr &&		assert(LI != nullptr && DT != nullptr && CurLoop != nullptr &&
CurAST != nullptr && SafetyInfo != nullptr &&		CurAST != nullptr && SafetyInfo != nullptr &&
"Unexpected Input to promoteLoopAccessesToScalars");		"Unexpected Input to promoteLoopAccessesToScalars");

Value *SomePtr = *PointerMustAliases.begin();		Value *SomePtr = *PointerMustAliases.begin();
		reames (inline comment, not done): There needs to be some invalidations somewhere within this function.
		reames (inline comment, not done): It doesn't look like my previous comment has been addressed. There's still a bug here.
BasicBlock *Preheader = CurLoop->getLoopPreheader();		BasicBlock *Preheader = CurLoop->getLoopPreheader();

// It is not safe to promote a load/store from the loop if the load/store is		// It is not safe to promote a load/store from the loop if the load/store is
// conditional. For example, turning:		// conditional. For example, turning:
//		//
// for () { if (c) *P += 1; }		// for () { if (c) *P += 1; }
//		//
// into:		// into:
▲ Show 20 Lines • Show All 209 Lines • ▼ Show 20 Lines	bool llvm::promoteLoopAccessesToScalars(
if (SawUnorderedAtomic)		if (SawUnorderedAtomic)
PreheaderLoad->setOrdering(AtomicOrdering::Unordered);		PreheaderLoad->setOrdering(AtomicOrdering::Unordered);
PreheaderLoad->setAlignment(Alignment);		PreheaderLoad->setAlignment(Alignment);
PreheaderLoad->setDebugLoc(DL);		PreheaderLoad->setDebugLoc(DL);
if (AATags)		if (AATags)
PreheaderLoad->setAAMetadata(AATags);		PreheaderLoad->setAAMetadata(AATags);
SSA.AddAvailableValue(Preheader, PreheaderLoad);		SSA.AddAvailableValue(Preheader, PreheaderLoad);

		// Drop all cached info regarding LoopUses.
		for (auto *I : LoopUses)
		SafetyInfo->dropCachedInfo(I->getParent());

		apilipenko (inline comment, done): Having invalidation in Promoter's callbacks (like instructionDeleted) seems more natural. We invalidate CurAST there.
// Rewrite all the loads in the loop and remember all the definitions from		// Rewrite all the loads in the loop and remember all the definitions from
// stores in the loop.		// stores in the loop.
Promoter.run(LoopUses);		Promoter.run(LoopUses);

// If the SSAUpdater didn't use the load in the preheader, just zap it now.		// If the SSAUpdater didn't use the load in the preheader, just zap it now.
if (PreheaderLoad->use_empty())		if (PreheaderLoad->use_empty())
		// Invalidation of SafetyInfo is not needed since PreheaderLoad is not in
		// the loop and we should never make queries to it.
PreheaderLoad->eraseFromParent();		PreheaderLoad->eraseFromParent();

return true;		return true;
}		}

/// Returns an owning pointer to an alias set which incorporates aliasing info		/// Returns an owning pointer to an alias set which incorporates aliasing info
/// from L and all subloops of L.		/// from L and all subloops of L.
/// FIXME: In new pass manager, there is no helper function to handle loop		/// FIXME: In new pass manager, there is no helper function to handle loop
▲ Show 20 Lines • Show All 129 Lines • Show Last 20 Lines
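
The inline discussion above turns on one point: once safety information is cached per basic block, every transformation that moves or deletes an instruction in a loop block has to drop the cached entry for that block. The following is a minimal sketch of that pattern only, not the actual ICFLoopSafetyInfo implementation. `ToyInst`, `ToyBlock`, `ToySafetyCache`, `firstThrower`, and `hoistOut` are hypothetical names invented for illustration; only the name `dropCachedInfo` is borrowed from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Toy stand-ins for LLVM's Instruction and BasicBlock (hypothetical types).
struct ToyInst {
  std::string Name;
  bool MayThrow;
};

struct ToyBlock {
  std::vector<ToyInst> Insts;
};

// Caches, per block, the index of the first instruction that may throw
// (-1 if the block contains no such instruction).
class ToySafetyCache {
  std::unordered_map<const ToyBlock *, int> FirstThrow;

public:
  // Returns the cached index if present, otherwise scans the block once.
  int firstThrower(const ToyBlock &BB) {
    auto It = FirstThrow.find(&BB);
    if (It != FirstThrow.end())
      return It->second; // May be stale if BB changed without invalidation.
    int Result = -1;
    for (std::size_t Idx = 0; Idx < BB.Insts.size(); ++Idx) {
      if (BB.Insts[Idx].MayThrow) {
        Result = static_cast<int>(Idx);
        break;
      }
    }
    FirstThrow[&BB] = Result;
    return Result;
  }

  // Plays the role dropCachedInfo has in the patch: forget everything
  // known about BB so that the next query recomputes it.
  void dropCachedInfo(const ToyBlock &BB) { FirstThrow.erase(&BB); }
};

// Removes the instruction at Idx from BB, as hoisting would, and
// invalidates the cache for BB before the block is modified.
ToyInst hoistOut(ToyBlock &BB, std::size_t Idx, ToySafetyCache &Cache) {
  Cache.dropCachedInfo(BB);
  ToyInst Moved = BB.Insts[Idx];
  BB.Insts.erase(BB.Insts.begin() + static_cast<std::ptrdiff_t>(Idx));
  return Moved;
}
```

In this toy model, skipping the `dropCachedInfo` call in `hoistOut` would leave a stale index in the cache, which is the class of bug the reviewers ask about for `promoteLoopAccessesToScalars`.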

test/Transforms/LICM/guards.ll

Show First 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	loop:
call void (i1, ...) @llvm.experimental.guard(i1 %cond) ["deopt" (i32 0)]		call void (i1, ...) @llvm.experimental.guard(i1 %cond) ["deopt" (i32 0)]
%val = load i32, i32* %ptr		%val = load i32, i32* %ptr
store i32 0, i32* %ptr		store i32 0, i32* %ptr
%x.inc = add i32 %x, %val		%x.inc = add i32 %x, %val
br label %loop		br label %loop
}		}


; TODO: We can also hoist this load and guard from mustexec non-header block.		; TODO: We can also hoist this guard from mustexec non-header block.
define void @test4(i1 %c, i32* %p) {		define void @test4(i1 %c, i32* %p) {

; CHECK-LABEL: @test4(		; CHECK-LABEL: @test4(
; CHECK-LABEL: entry:		; CHECK-LABEL: entry:
; CHECK-LABEL: loop:
; CHECK-LABEL: backedge:
; CHECK: %a = load i32, i32* %p		; CHECK: %a = load i32, i32* %p
; CHECK: %invariant_cond = icmp ne i32 %a, 100		; CHECK: %invariant_cond = icmp ne i32 %a, 100
		; CHECK-LABEL: loop:
		; CHECK-LABEL: backedge:
; CHECK: call void (i1, ...) @llvm.experimental.guard(i1 %invariant_cond)		; CHECK: call void (i1, ...) @llvm.experimental.guard(i1 %invariant_cond)

entry:		entry:
br label %loop		br label %loop

loop:		loop:
%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]		%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]
%iv.next = add i32 %iv, 1		%iv.next = add i32 %iv, 1
▲ Show 20 Lines • Show All 162 Lines • Show Last 20 Lines

test/Transforms/LICM/hoist-mustexec.ll

Show First 20 Lines • Show All 450 Lines • ▼ Show 20 Lines	backedge:
%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]		%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]
%iv.next = add i32 %iv, %merge		%iv.next = add i32 %iv, %merge
%loop.cond = icmp ult i32 %iv.next, %load		%loop.cond = icmp ult i32 %iv.next, %load
br i1 %loop.cond, label %loop, label %exit		br i1 %loop.cond, label %loop, label %exit

exit:		exit:
ret void		ret void
}		}

		; Check that we can hoist a mustexecute load from backedge even if something
		; throws after it.
		define void @test_hoist_from_backedge_01(i32* %p, i32 %n) {

		; CHECK-LABEL: @test_hoist_from_backedge_01(
		; CHECK: entry:
		; CHECK-NEXT: %load = load i32, i32* %p
		; CHECK-NOT: load i32

		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]
		%dummy = phi i32 [ 0, %entry ], [ %merge, %backedge ]
		%cond = icmp slt i32 %iv, %n
		br i1 %cond, label %if.true, label %if.false

		if.true:
		%a = add i32 %iv, %iv
		br label %backedge

		if.false:
		%b = mul i32 %iv, %iv
		br label %backedge

		backedge:
		%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]
		%iv.next = add i32 %iv, %merge
		%load = load i32, i32* %p
		call void @may_throw()
		%loop.cond = icmp ult i32 %iv.next, %load
		br i1 %loop.cond, label %loop, label %exit

		exit:
		ret void
		}

		; Check that we don't hoist the load if something before it can throw.
		define void @test_hoist_from_backedge_02(i32* %p, i32 %n) {

		; CHECK-LABEL: @test_hoist_from_backedge_02(
		; CHECK: entry:
		; CHECK: loop:
		; CHECK: %load = load i32, i32* %p

		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]
		%dummy = phi i32 [ 0, %entry ], [ %merge, %backedge ]
		%cond = icmp slt i32 %iv, %n
		br i1 %cond, label %if.true, label %if.false

		if.true:
		%a = add i32 %iv, %iv
		br label %backedge

		if.false:
		%b = mul i32 %iv, %iv
		br label %backedge

		backedge:
		%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]
		%iv.next = add i32 %iv, %merge
		call void @may_throw()
		%load = load i32, i32* %p
		%loop.cond = icmp ult i32 %iv.next, %load
		br i1 %loop.cond, label %loop, label %exit

		exit:
		ret void
		}

		define void @test_hoist_from_backedge_03(i32* %p, i32 %n) {

		; CHECK-LABEL: @test_hoist_from_backedge_03(
		; CHECK: entry:
		; CHECK: loop:
		; CHECK: %load = load i32, i32* %p

		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]
		%dummy = phi i32 [ 0, %entry ], [ %merge, %backedge ]
		%cond = icmp slt i32 %iv, %n
		br i1 %cond, label %if.true, label %if.false

		if.true:
		%a = add i32 %iv, %iv
		br label %backedge

		if.false:
		%b = mul i32 %iv, %iv
		call void @may_throw()
		br label %backedge

		backedge:
		%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]
		%iv.next = add i32 %iv, %merge
		%load = load i32, i32* %p
		%loop.cond = icmp ult i32 %iv.next, %load
		br i1 %loop.cond, label %loop, label %exit

		exit:
		ret void
		}

		define void @test_hoist_from_backedge_04(i32* %p, i32 %n) {

		; CHECK-LABEL: @test_hoist_from_backedge_04(
		; CHECK: entry:
		; CHECK: loop:
		; CHECK: %load = load i32, i32* %p

		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ]
		%dummy = phi i32 [ 0, %entry ], [ %merge, %backedge ]
		call void @may_throw()
		%cond = icmp slt i32 %iv, %n
		br i1 %cond, label %if.true, label %if.false

		if.true:
		%a = add i32 %iv, %iv
		br label %backedge

		if.false:
		%b = mul i32 %iv, %iv
		br label %backedge

		backedge:
		%merge = phi i32 [ %a, %if.true ], [ %b, %if.false ]
		%iv.next = add i32 %iv, %merge
		%load = load i32, i32* %p
		%loop.cond = icmp ult i32 %iv.next, %load
		br i1 %loop.cond, label %loop, label %exit

		exit:
		ret void
		}
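
The first two backedge tests above differ only in whether `@may_throw()` comes after or before the load within the same block. A rough sketch of that single-block ordering rule follows. It is an assumption-laden simplification, not LLVM code: `ToyOp`, `ToyInstr`, and `noThrowBefore` are hypothetical names, and the sketch ignores throwing instructions in other blocks, which is what tests 03 and 04 exercise.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical, simplified instruction model; these are not LLVM types.
enum class ToyOp { Load, Call, Other };

struct ToyInstr {
  ToyOp Op;
  bool MayThrow;
};

// Returns true if no instruction located before position Idx in the block
// may throw. Within one block, this is the condition under which reaching
// the block's start implies reaching the instruction at Idx.
bool noThrowBefore(const std::vector<ToyInstr> &Block, std::size_t Idx) {
  for (std::size_t I = 0; I < Idx && I < Block.size(); ++I)
    if (Block[I].MayThrow)
      return false;
  return true;
}
```

Under this model, a block shaped like the backedge of `test_hoist_from_backedge_01` (load, then throwing call) passes the check for the load, while one shaped like `test_hoist_from_backedge_02` (throwing call, then load) does not.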

test/Transforms/LICM/hoist-nounwind.ll

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	for.body:
%inc = add i32 %add, %div		%inc = add i32 %add, %div
%cmp = icmp slt i32 %inc, %N		%cmp = icmp slt i32 %inc, %N
br i1 %cmp, label %for.body, label %for.cond.cleanup		br i1 %cmp, label %for.body, label %for.cond.cleanup

for.cond.cleanup:		for.cond.cleanup:
ret i32 0		ret i32 0
}		}

; Don't hoist load past volatile load.		; Hoist a non-volatile load past volatile load.
define i32 @test3(i32* noalias nocapture readonly %a, i32* %v) nounwind uwtable {		define i32 @test3(i32* noalias nocapture readonly %a, i32* %v) nounwind uwtable {
; CHECK-LABEL: @test3(		; CHECK-LABEL: @test3(
entry:		entry:
br label %for.body		br label %for.body

		; CHECK: load i32
		; CHECK: for.body:
; CHECK: load volatile i32		; CHECK: load volatile i32
; CHECK-NEXT: load i32		; CHECK-NOT: load
for.body:		for.body:
%i.06 = phi i32 [ 0, %entry ], [ %inc, %for.body ]		%i.06 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
%x.05 = phi i32 [ 0, %entry ], [ %add, %for.body ]		%x.05 = phi i32 [ 0, %entry ], [ %add, %for.body ]
%xxx = load volatile i32, i32* %v, align 4		%xxx = load volatile i32, i32* %v, align 4
%i1 = load i32, i32* %a, align 4		%i1 = load i32, i32* %a, align 4
%add = add nsw i32 %i1, %x.05		%add = add nsw i32 %i1, %x.05
%inc = add nuw nsw i32 %i.06, 1		%inc = add nuw nsw i32 %i.06, 1
%exitcond = icmp eq i32 %inc, 1000		%exitcond = icmp eq i32 %inc, 1000
br i1 %exitcond, label %for.cond.cleanup, label %for.body		br i1 %exitcond, label %for.cond.cleanup, label %for.body

for.cond.cleanup:		for.cond.cleanup:
ret i32 %add		ret i32 %add
}		}

		; Don't hoist a volatile load past volatile load.
		define i32 @test4(i32* noalias nocapture readonly %a, i32* %v) nounwind uwtable {
		; CHECK-LABEL: @test4(
		entry:
		br label %for.body

		; CHECK: for.body:
		; CHECK: load volatile i32
		; CHECK-NEXT: load volatile i32
		for.body:
		%i.06 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
		%x.05 = phi i32 [ 0, %entry ], [ %add, %for.body ]
		%xxx = load volatile i32, i32* %v, align 4
		%i1 = load volatile i32, i32* %a, align 4
		%add = add nsw i32 %i1, %x.05
		%inc = add nuw nsw i32 %i.06, 1
		%exitcond = icmp eq i32 %inc, 1000
		br i1 %exitcond, label %for.cond.cleanup, label %for.body

		for.cond.cleanup:
		ret i32 %add
		}
		No newline at end of file

test/Transforms/LICM/preheader-safe.ll

Show First 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	loop: ; preds = %entry, %for.inc
%div = udiv i64 %x, %y		%div = udiv i64 %x, %y
br i1 %cond, label %loop-if, label %exit		br i1 %cond, label %loop-if, label %exit
loop-if:		loop-if:
call void @use(i64 %div)		call void @use(i64 %div)
br label %loop		br label %loop
exit:		exit:
ret void		ret void
}		}

		; Positive test - can hoist something that happens before thrower.
		define void @nothrow_header_pos(i64 %x, i64 %y, i1 %cond) {
		; CHECK-LABEL: nothrow_header_pos
		; CHECK-LABEL: entry
		; CHECK: %div = udiv i64 %x, %y
		; CHECK-LABEL: loop
		; CHECK: call void @use(i64 %div)
		entry:
		br label %loop
		loop: ; preds = %entry, %for.inc
		br label %loop-if
		loop-if:
		%div = udiv i64 %x, %y
		call void @use(i64 %div)
		br label %loop
		}


; Negative test - can't move out of throwing block		; Negative test - can't move out of throwing block
define void @nothrow_header_neg(i64 %x, i64 %y, i1 %cond) {		define void @nothrow_header_neg(i64 %x, i64 %y, i1 %cond) {
; CHECK-LABEL: nothrow_header_neg		; CHECK-LABEL: nothrow_header_neg
; CHECK-LABEL: entry		; CHECK-LABEL: entry
; CHECK-LABEL: loop		; CHECK-LABEL: loop
		; CHECK: call void @maythrow()
; CHECK: %div = udiv i64 %x, %y		; CHECK: %div = udiv i64 %x, %y
; CHECK: call void @use(i64 %div)		; CHECK: call void @use(i64 %div)
entry:		entry:
br label %loop		br label %loop
loop: ; preds = %entry, %for.inc		loop: ; preds = %entry, %for.inc
br label %loop-if		br label %loop-if
loop-if:		loop-if:
		call void @maythrow()
%div = udiv i64 %x, %y		%div = udiv i64 %x, %y
call void @use(i64 %div)		call void @use(i64 %div)
br label %loop		br label %loop
}		}

Diff 169806