This is an archive of the discontinued LLVM Phabricator instance.

[LICM] Introduce a finer granularity option to compute early exits.
Needs Revision · Public

Authored by trentxintong on May 1 2017, 4:03 PM.


Details

Reviewers

hfinkel
• dberlin
efriedma
sanjoy

Summary

Instead of keeping a single variable that indicates whether the loop has any
early exits, we keep the set of all early exits. This improves LICM's ability to
move instructions out of the loop based on is-guaranteed-to-execute.

I do not want to compute early exits in LIR and LoopUnswitch because it is
probably not as beneficial there as in LICM, and it is a pain to keep the SafetyInfo
up to date in LoopUnswitch, where instructions can get deleted.
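
To make the difference in the summary concrete, here is a minimal sketch, not the patch's code. It assumes a toy model in which the loop body is flattened to a straight-line list; `ToyInst`, `guaranteedCoarse` and `guaranteedFine` are hypothetical names that do not exist in LLVM.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model (not LLVM's API): a loop body flattened to a straight-line list,
// where each entry records whether that instruction may leave the loop early
// (e.g. a call that might throw or never return).
struct ToyInst {
  bool MayExitEarly;
};

// Coarse scheme: one answer for the whole body. An early exit anywhere
// blocks the "guaranteed to execute" conclusion for every instruction.
bool guaranteedCoarse(const std::vector<ToyInst> &Body, std::size_t /*Pos*/) {
  for (const ToyInst &I : Body)
    if (I.MayExitEarly)
      return false;
  return true;
}

// Finer scheme: only early exits that come before position Pos matter.
bool guaranteedFine(const std::vector<ToyInst> &Body, std::size_t Pos) {
  assert(Pos < Body.size() && "position out of range");
  for (std::size_t i = 0; i < Pos; ++i)
    if (Body[i].MayExitEarly)
      return false;
  return true;
}
```

With an early exit at position 2 of a four-instruction body, the coarse check rejects every position, while the finer check still accepts positions 0 through 2 and rejects only position 3.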

Diff Detail

Build Status

Buildable 7554
Build 7554: arc lint + arc unit

Event Timeline

trentxintong created this revision.May 1 2017, 4:03 PM

Harbormaster completed remote builds in B6036: Diff 97357.May 1 2017, 4:03 PM

Fix comments

trentxintong added reviewers: hfinkel, • dberlin, sanjoy, efriedma.May 1 2017, 4:10 PM

trentxintong added a subscriber: llvm-commits.

• dberlin added inline comments.May 1 2017, 4:23 PM

include/llvm/Transforms/Utils/LoopUtils.h
62	Would it trouble you too much to move this OBBMAP into Transforms/Utils as a class called "OrderedInstructions" or something. (IE provides a dominates call, handles calling dt->dominates if they are in different bbs, or checking ordering if they are in the same bb?) We should encapsulate this. I'm happy to clean up the existing users if you are willing to encapsulate it :)

trentxintong added inline comments.May 1 2017, 4:26 PM

include/llvm/Transforms/Utils/LoopUtils.h
62	I am thinking about that as well. I will do it
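
A rough sketch of the encapsulation being asked for, under toy assumptions: blocks and instructions are plain integers, the dominator tree is an immediate-dominator array, and `ToyOrderedInstructions` is a hypothetical stand-in rather than the class from this patch. It dispatches to a cached per-block ordering when both instructions share a block and to a block-level dominance walk otherwise.

```cpp
#include <cassert>
#include <unordered_map>
#include <utility>
#include <vector>

// Toy stand-ins, not LLVM types. Blocks are ints; IDom[B] is the immediate
// dominator of block B, and the entry block is its own immediate dominator.
struct ToyInst {
  int Id;
  int Block;
};

class ToyOrderedInstructions {
  std::vector<int> IDom;
  // Block -> instruction ids in program order.
  std::unordered_map<int, std::vector<int>> Layout;
  // Lazily built cache: Block -> (instruction id -> position in block).
  mutable std::unordered_map<int, std::unordered_map<int, int>> Cache;

  // Walk up the immediate-dominator chain from B looking for A.
  bool blockDominates(int A, int B) const {
    for (;;) {
      if (A == B)
        return true;
      if (IDom[B] == B)
        return false; // reached the entry block without meeting A
      B = IDom[B];
    }
  }

public:
  ToyOrderedInstructions(std::vector<int> IDom,
                         std::unordered_map<int, std::vector<int>> Layout)
      : IDom(std::move(IDom)), Layout(std::move(Layout)) {}

  // Different blocks: ask the dominator tree. Same block: compare cached
  // positions (strict, so an instruction does not dominate itself here).
  bool dominates(const ToyInst &A, const ToyInst &B) const {
    if (A.Block != B.Block)
      return blockDominates(A.Block, B.Block);
    auto It = Cache.find(A.Block);
    if (It == Cache.end()) {
      std::unordered_map<int, int> Pos;
      const std::vector<int> &Insts = Layout.at(A.Block);
      for (int i = 0; i < static_cast<int>(Insts.size()); ++i)
        Pos[Insts[i]] = i;
      It = Cache.emplace(A.Block, std::move(Pos)).first;
    }
    return It->second.at(A.Id) < It->second.at(B.Id);
  }

  // Must be called by the transformation whenever a block's contents change.
  void invalidateBlock(int Block) { Cache.erase(Block); }
};
```

The cache is rebuilt on the next same-block query after `invalidateBlock`, which is the same contract the real class places on its callers.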

Add a dominance check interface that uses caching for instructions within the same basic block.

I do not particularly like the name. I will see whether I can come up with a better one later.

Harbormaster completed remote builds in B6040: Diff 97371.May 1 2017, 5:33 PM

Herald added a subscriber: mgorny. · View Herald TranscriptMay 1 2017, 5:33 PM

Address comments

Ping

efriedma added inline comments.Jun 13 2017, 2:01 PM

include/llvm/Transforms/Utils/OrderedInstructions.h
41	Not sure I like this constructor... easy to misuse.
lib/Transforms/Utils/LoopUtils.cpp
1086	See the FIXME below... but I guess there's another way to do this check: if Inst dominates the backedge, then it's guaranteed to execute even if the loop is infinite. A bit more general than checking that the parent is exactly the header.

efriedma added inline comments.Jun 13 2017, 2:48 PM

lib/Transforms/Utils/LoopUtils.cpp
1086	Err, wait, no, sorry, the check would be that Inst dominates the backedge, and there isn't any infinite loop between the start of the loop and the backedge. I guess just checking whether Inst is in the header is good enough for now.

Minor comment and code modifications. NFC.

Edit comments

Harbormaster completed remote builds in B7331: Diff 102936.Jun 17 2017, 6:38 AM

Remove the default constructor for OrderedInstructions.

Harbormaster completed remote builds in B7332: Diff 102937.Jun 17 2017, 6:51 AM

Herald added a subscriber: mzolotukhin. · View Herald TranscriptJun 17 2017, 6:51 AM

Update more comments

efriedma added inline comments.Jun 19 2017, 2:01 PM

include/llvm/Transforms/Utils/LoopUtils.h
60	Use AssertingVH here?
test/Transforms/LICM/loop-early-exits.ll
83	Could you add a testcase with a udiv after a nothrow call, to make it clear that an early exit doesn't necessarily involve unwinding?

Address comments

Fix a bug in how early exits are updated. Apparently, LICM can simplify or delete early exits,
e.g. calls that are simplifiable but that we still treat as early exits.

Add the deleteExit interface so the early exit set can be updated. EarlyExits are now kept in a DenseSet,
which means iteration order is not deterministic. This is OK because we only need to check that the
instruction of interest dominates ALL of them.

The test case that uncovered the bug is Transforms/LICM/pr32129.ll.

AssertingVH catching a bug. Cool. :)

Some places you're adding calls to invalidateBlock; other places you're calling deleteExit. Why do we need to call one, but not the other, in LICM?

Address comments by reworking when SafetyInfo->deleteExit and
SafetyInfo->OrderInstruction.invalidateBlock are called.

The general rule for updating OrderedInstruction and EarlyExits in SafetyInfo is as
follows: before we delete an instruction in the loop, we invalidate the OrderedInstruction
cache for the block it is in, as OrderedInstruction may still hold a pointer to the instruction.
In addition, we check whether it is an early exit. If it is, we remove it from the
early exit list.

LICM can delete instructions in the loop for various reasons, including: the instruction
is simplified and removed, the instruction is sunk out of the loop, or the instruction is
hoisted out of the loop.
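
The update rule above can be sketched as follows. This is a toy illustration only: `ToySafetyInfo` and `eraseInstruction` are hypothetical, instructions and blocks are integers, and the ordering cache is a plain map rather than OrderedInstructions.

```cpp
#include <algorithm>
#include <cassert>
#include <unordered_map>
#include <unordered_set>
#include <vector>

// Toy stand-in for the safety info in this patch (not the LLVM struct).
struct ToySafetyInfo {
  std::unordered_set<int> EarlyExits;                   // instruction ids
  std::unordered_map<int, std::vector<int>> OrderCache; // block -> cached order

  bool isExit(int Inst) const { return EarlyExits.count(Inst) != 0; }
  void deleteExit(int Inst) { EarlyExits.erase(Inst); }
  void invalidateBlock(int Block) { OrderCache.erase(Block); }
};

// Hypothetical helper: every deletion goes through here, so the cached
// ordering and the early exit set are updated before the instruction goes.
void eraseInstruction(ToySafetyInfo &SI,
                      std::unordered_map<int, std::vector<int>> &Blocks,
                      int Block, int Inst) {
  // 1. The cached ordering may still refer to Inst; drop it first.
  SI.invalidateBlock(Block);
  // 2. If Inst was recorded as an early exit, forget it (no-op otherwise).
  SI.deleteExit(Inst);
  // 3. Only now remove the instruction itself.
  std::vector<int> &Insts = Blocks.at(Block);
  Insts.erase(std::remove(Insts.begin(), Insts.end(), Inst), Insts.end());
}
```

Funnelling deletions through one helper is a design choice of this sketch; the patch itself inserts the two calls at each deletion site in LICM.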

Minor cosmetic change

Minor assert fix

Minor change in OrderedInstruction invalidation.

Harbormaster completed remote builds in B7553: Diff 103817.Jun 23 2017, 6:16 PM

Hmm ... last version did not compile. This should be good now.

To reiterate:

When we move an instruction, e.g. when hoisting, we invalidate the block it is moved to as well.

Additionally, we check whether it is an early exit. If it is, we remove it from the
early exit list.

LICM can delete instructions in the loop for various reasons, including: the instruction
is simplified and removed, the instruction is sunk out of the loop, or the instruction is
hoisted out of the loop.

@efriedma, can you please finish the review when you have time? Thanks!

Ping. Thanks!

LGTM.

As a followup to this, could you look into making OrderedInstructions and OrderedBasicBlock use AssertingVH, to try and catch mistakes?

This revision is now accepted and ready to land.Jun 28 2017, 12:16 PM

efriedma requested changes to this revision.Jun 28 2017, 12:21 PM

efriedma added inline comments.

lib/Transforms/Utils/LoopUtils.cpp
1053	Actually, one more issue we probably need to address. This is potentially linear in the number of calls in the loop, since we don't try to prune EarlyExits at all. This makes LICM potentially O(n^2) overall. Can we store this information in some more efficient way?

This revision now requires changes to proceed.Jun 28 2017, 12:21 PM

trentxintong added inline comments.Jun 28 2017, 1:42 PM

lib/Transforms/Utils/LoopUtils.cpp
1053	We could certainly prune the early exits by discovering them while walking the dominator tree. If an instruction dominates an early exit (Inst A), it certainly dominates all the other early exits dominated by Inst A, so we do not need to include those early exits in the list. The drawback is that if an early exit is deleted, which is possible but unlikely, we need to invalidate the early exit list (we could try to recompute everything, but it is probably not worth it). By doing this, we potentially save a lot of OI.dominates checks. The worst case is still O(n^2) for a loop with many basic blocks that do not dominate each other, each of which has early exits ... but this happens infrequently in practice IMO.

Thinking about it a bit more, instead of storing a list of instructions which can exit the loop early, you could store a list of basic blocks which are guaranteed to execute, and a mapping from those basic blocks to the first instruction in each basic block which could exit early. That would allow "constant-time" lookups, at the expense of a bit more computation upfront.

I agree it doesn't make sense to try to recompute if an early exit is erased; that should be very rare. (Granted, it might be a little bit more common than it should be: it looks like isGuaranteedToTransferExecutionToSuccessor is missing a few important cases which isInstructionTriviallyDead catches.)

In D32720#794481, @efriedma wrote:

Thinking about it a bit more, instead of storing a list of instructions which can exit the loop early, you could store a list of basic blocks which are guaranteed to execute, and a mapping from those basic blocks to the first instruction in each basic block which could exit early. That would allow "constant-time" lookups, at the expense of a bit more computation upfront.

I agree it doesn't make sense to try to recompute if an early exit is erased; that should be very rare. (Granted, it might be a little bit more common than it should be: it looks like isGuaranteedToTransferExecutionToSuccessor is missing a few important cases which isInstructionTriviallyDead catches.)

I think that's a good idea. We need to walk the DT and keep pushing the blocks that are guaranteed to execute into a list. When we hit an early exit instruction, we do not need to process the children of its block in the DT, and we keep a pointer to the instruction.

This way, we only need to look up this DenseSet of basic blocks and possibly do some fiddling with basic blocks that are only partially guaranteed to execute.
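
One way the walk described here might look, as a sketch under stated assumptions: `ToyBlock`, `collectGuaranteed` and `isGuaranteed` are hypothetical, and each block carries a precomputed `OnEveryIteration` flag (for example "dominates the loop latch") that this sketch takes as given rather than computing.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Toy block (not an LLVM type). OnEveryIteration is assumed to be known
// already; MayExitEarly has one entry per instruction in the block.
struct ToyBlock {
  bool OnEveryIteration;
  std::vector<bool> MayExitEarly;
  std::vector<int> DomChildren; // children in the dominator tree
};

constexpr int NoExit = -1;

// Block -> index of its first early exit, or NoExit. A block is present in
// the map only if it is guaranteed to be entered.
using GuaranteedMap = std::unordered_map<int, int>;

void collectGuaranteed(const std::vector<ToyBlock> &Blocks, int B,
                       GuaranteedMap &Out) {
  const ToyBlock &BB = Blocks[B];
  if (!BB.OnEveryIteration)
    return; // nothing below this block in the DT can be guaranteed either
  int First = NoExit;
  for (std::size_t i = 0; i < BB.MayExitEarly.size(); ++i)
    if (BB.MayExitEarly[i]) {
      First = static_cast<int>(i);
      break;
    }
  Out[B] = First;
  if (First != NoExit)
    return; // an early exit here: do not process the DT children
  for (int C : BB.DomChildren)
    collectGuaranteed(Blocks, C, Out);
}

// Lookup: one hash probe plus one comparison.
bool isGuaranteed(const GuaranteedMap &G, int Block, int Index) {
  auto It = G.find(Block);
  if (It == G.end())
    return false;
  return It->second == NoExit || Index <= It->second;
}
```

Instructions up to and including the first early exit of a recorded block count as guaranteed; anything after it, and anything in blocks the walk never reached, does not.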

• dberlin added inline comments.Jun 29 2017, 7:29 PM

lib/Transforms/Utils/LoopUtils.cpp
1053	Unless I'm misunderstanding, and maybe I am (!), I believe this can be made optimal.

Here's the normal way: the ordering is completely described by the DFS in/out numbers of the dom tree + local numbers. It should completely suffice to, instead of just ordering the in/out of the DT nodes, model in/out of the entire block. IE:

  Count = 0
  number(root of DT)

  number(DTNode):
    In = Count
    for each DT child:
      number(child)
    for each instruction in this block:
      InstNum[I] = ++Count
    Out = Count

(or something similar)

The early exits you dominate are, in the same block, any early exit with a number greater than yours, and for other blocks, any block with an in > your in and out < your out. This is trivially stored in a sorted SmallVector. You dominate all the early exits only if, using the above comparator, lower bound says you would appear first in the SmallVector.

I believe you can get away with a lot less, but I'm not positive ATM: by definition, anything that dominates all the early exits must be in a block that has a DT level number <= all of theirs. If all the exits are terminators, then you don't need the local numbering or DFS numbering at all, I think (I rely on Eli to point out any flaw in my reasoning, since I'm dashing this off pretty quickly). You just need the entries with the lowest DT level (in case they are siblings in the dominator tree). Then it would just be:

  if DT level number < lowest DT level number of an exit:
    you dominate all exits
  if DT level number == lowest DT level number:
    if vector.size() == 1 && exit block == instruction block:
      you dominate all exits
    else:
      you do not (because there must be a sibling you don't dominate)

I think one or the other of the above should work. The DT level numbers are now available in the DomTree :)

Actually, that won't entirely work without the DFS numbers.
You can shortcut the false cases using the level numbers (IE it's not possible for Inst to have level number 5, an early exit have level number 3, and you dominate it), but you still need to know what part of the dom tree you are in.

So you still have to store and stabbing query the dom tree node DFS intervals.
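
For reference, a small sketch of the DFS interval idea at block granularity. It is one possible reading of the comments above, not code from the patch: the names are hypothetical, the dominator tree is a child-list array, and same-block instruction ordering (the "local numbers") is left out. The min/max "envelope" over the exit blocks is an assumption of this sketch; it works because a block dominates every exit block exactly when its interval contains each exit block's interval.

```cpp
#include <algorithm>
#include <cassert>
#include <climits>
#include <vector>

struct DFSNumbers {
  std::vector<int> In, Out;
};

// Assign entry/exit timestamps by a DFS over the dominator tree.
static void numberSubtree(const std::vector<std::vector<int>> &Children, int N,
                          int &Clock, DFSNumbers &Nums) {
  Nums.In[N] = Clock++;
  for (int C : Children[N])
    numberSubtree(Children, C, Clock, Nums);
  Nums.Out[N] = Clock++;
}

DFSNumbers computeDFSNumbers(const std::vector<std::vector<int>> &Children,
                             int Root) {
  DFSNumbers Nums{std::vector<int>(Children.size()),
                  std::vector<int>(Children.size())};
  int Clock = 0;
  numberSubtree(Children, Root, Clock, Nums);
  return Nums;
}

// A dominates B iff A's interval contains B's interval (reflexive).
bool blockDominates(const DFSNumbers &N, int A, int B) {
  return N.In[A] <= N.In[B] && N.Out[B] <= N.Out[A];
}

// Smallest In and largest Out over all blocks that contain an early exit.
struct ExitEnvelope {
  int MinIn = INT_MAX;
  int MaxOut = INT_MIN;
};

ExitEnvelope computeEnvelope(const DFSNumbers &N,
                             const std::vector<int> &ExitBlocks) {
  ExitEnvelope E;
  for (int B : ExitBlocks) {
    E.MinIn = std::min(E.MinIn, N.In[B]);
    E.MaxOut = std::max(E.MaxOut, N.Out[B]);
  }
  return E;
}

// Constant-time query: does block A dominate every exit block?
bool dominatesAllExitBlocks(const DFSNumbers &N, const ExitEnvelope &E, int A) {
  return N.In[A] <= E.MinIn && E.MaxOut <= N.Out[A];
}
```

The envelope has to be recomputed if an exit block is added or removed, which mirrors the invalidation concern raised earlier in the thread.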

Thanks for the suggestions and review so far. I am in middle of a project. I will put this on hold for a couple of weeks and will come back to it.

sanjoy resigned from this revision.Jan 29 2022, 5:44 PM

Herald added a subscriber: asbirlea. · View Herald TranscriptJan 29 2022, 5:44 PM

FYI, all the tests in this patch are already handled with the current licm. Do we still need this patch?

$ opt -S -licm loop-early-exits.ll

; ModuleID = 'licm.ll'
source_filename = "licm.ll"

declare void @use(i64)

declare void @use_nothing()

; Function Attrs: nounwind
declare void @call_nothrow() #0

define void @throw_header1(i64 %x, i64 %y, i1* %cond) {
entry:
  %div = udiv i64 %x, %y
  br label %loop

loop:                                             ; preds = %loop, %entry
  call void @use(i64 %div)
  br label %loop
}

define void @throw_header2(i64 %x, i64 %y, i1* %cond) {
entry:
  br label %loop

loop:                                             ; preds = %loop, %entry
  call void @use_nothing()
  %div = udiv i64 %x, %y
  call void @use(i64 %div)
  br label %loop
}

define void @throw_body1(i64 %x, i64 %y, i1* %cond) {
entry:
  %div = udiv i64 %x, %y
  br label %loop

loop:                                             ; preds = %body, %entry
  br label %body

body:                                             ; preds = %loop
  call void @use(i64 %div)
  br i1 false, label %loop, label %exit

exit:                                             ; preds = %body
  ret void
}

define void @throw_body2(i64 %x, i64 %y, i1* %cond) {
entry:
  br label %loop

loop:                                             ; preds = %body, %entry
  br label %body

body:                                             ; preds = %loop
  call void @use_nothing()
  %div = udiv i64 %x, %y
  call void @use(i64 %div)
  br i1 false, label %loop, label %exit

exit:                                             ; preds = %body
  ret void
}

define void @throw_body3(i64 %x, i64 %y, i1* %cond) {
entry:
  br label %loop

loop:                                             ; preds = %body, %entry
  br label %body

body:                                             ; preds = %loop
  call void @call_nothrow()
  %div = udiv i64 %x, %y
  call void @use(i64 %div)
  br i1 false, label %loop, label %exit

exit:                                             ; preds = %body
  ret void
}

attributes #0 = { nounwind }

Herald added a project: Restricted Project. · View Herald TranscriptJul 28 2022, 8:11 PM

Revision Contents

Path                                                   Size
include/llvm/Transforms/Utils/LoopUtils.h              28 lines
include/llvm/Transforms/Utils/OrderedInstructions.h    11 lines
lib/Transforms/Scalar/LICM.cpp                         87 lines
lib/Transforms/Scalar/LoopIdiomRecognize.cpp           2 lines
lib/Transforms/Scalar/LoopUnswitch.cpp                 10 lines
lib/Transforms/Utils/LoopUtils.cpp                     46 lines
lib/Transforms/Utils/OrderedInstructions.cpp           1 line
test/Transforms/LICM/loop-early-exits.ll               107 lines
test/Transforms/LICM/preheader-safe.ll                 14 lines

Diff 103818

include/llvm/Transforms/Utils/LoopUtils.h

#include "llvm/Analysis/EHPersonalities.h"		#include "llvm/Analysis/EHPersonalities.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/InstrTypes.h"		#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/ValueHandle.h"		#include "llvm/IR/ValueHandle.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
		#include "llvm/Transforms/Utils/OrderedInstructions.h"

namespace llvm {		namespace llvm {

class AliasSet;		class AliasSet;
class AliasSetTracker;		class AliasSetTracker;
class BasicBlock;		class BasicBlock;
class DataLayout;		class DataLayout;
class Loop;		class Loop;
class LoopInfo;		class LoopInfo;
class OptimizationRemarkEmitter;		class OptimizationRemarkEmitter;
class PredicatedScalarEvolution;		class PredicatedScalarEvolution;
class PredIteratorCache;		class PredIteratorCache;
class ScalarEvolution;		class ScalarEvolution;
class SCEV;		class SCEV;
class TargetLibraryInfo;		class TargetLibraryInfo;
class TargetTransformInfo;		class TargetTransformInfo;

/// \brief Captures loop safety information.		/// \brief Captures loop safety information.
/// It keep information for loop & its header may throw exception.		/// It keep information for loop & its header may throw exception.
struct LoopSafetyInfo {		struct LoopSafetyInfo {
bool MayThrow = false; // The current loop contains an instruction which		bool MayThrow = false; // The current loop contains an instruction which
// may throw.		// may throw.
bool HeaderMayThrow = false; // Same as previous, but specific to loop header		bool HeaderMayThrow = false; // Same as previous, but specific to loop header

		// Whether we have computed all the early exits.
		bool ComputedEarlyExits = false;
		// The early exits in the loop, excluding loop exits.
		// These are calls that might throw, infinite loop, etc.
		DenseSet<AssertingVH<Instruction>> EarlyExits;

		// Utility to check for dominance information with caching.
		OrderedInstructions OI;

// Used to update funclet bundle operands.		// Used to update funclet bundle operands.
DenseMap<BasicBlock *, ColorVector> BlockColors;		DenseMap<BasicBlock *, ColorVector> BlockColors;

LoopSafetyInfo() = default;		// Delete the specified early exit. This can happen if the early exit
		// is removed from the loop.
		void deleteExit(Instruction *E) { EarlyExits.erase(E); }

		// Return true if this instruction is an early exit, false otherwise.
		bool isExit(Instruction *E) { return EarlyExits.count(E); }

		// Constructor.
		LoopSafetyInfo(DominatorTree *DT) : OI(DT) {}
};		};

/// The RecurrenceDescriptor is used to identify recurrences variables in a		/// The RecurrenceDescriptor is used to identify recurrences variables in a
/// loop. Reduction is a special case of recurrence that has uses of the		/// loop. Reduction is a special case of recurrence that has uses of the
/// recurrence variable outside the loop. The method isReductionPHI identifies		/// recurrence variable outside the loop. The method isReductionPHI identifies
/// reductions that are basic recurrences.		/// reductions that are basic recurrences.
///		///
/// Basic recurrences are defined as the summation, product, OR, AND, XOR, min,		/// Basic recurrences are defined as the summation, product, OR, AND, XOR, min,
▲ Show 20 Lines • Show All 373 Lines • ▼ Show 20 Lines	bool promoteLoopAccessesToScalars(AliasSet &, SmallVectorImpl<BasicBlock *> &,
DominatorTree *, const TargetLibraryInfo *,		DominatorTree *, const TargetLibraryInfo *,
Loop *, AliasSetTracker *, LoopSafetyInfo *,		Loop *, AliasSetTracker *, LoopSafetyInfo *,
OptimizationRemarkEmitter *);		OptimizationRemarkEmitter *);

/// \brief Computes safety information for a loop		/// \brief Computes safety information for a loop
/// checks loop body & header for the possibility of may throw		/// checks loop body & header for the possibility of may throw
/// exception, it takes LoopSafetyInfo and loop as argument.		/// exception, it takes LoopSafetyInfo and loop as argument.
/// Updates safety information in LoopSafetyInfo argument.		/// Updates safety information in LoopSafetyInfo argument.
void computeLoopSafetyInfo(LoopSafetyInfo *, Loop *);		/// In case ComputeEarlyExits is true, all the early exit points are recorded.
		void computeLoopSafetyInfo(LoopSafetyInfo *, Loop *,
		bool ComputeEarlyExits = false);

/// Returns true if the instruction in a loop is guaranteed to execute at least		/// Returns true if the instruction in a loop is guaranteed to execute at least
/// once.		/// once.
bool isGuaranteedToExecute(const Instruction &Inst, const DominatorTree *DT,		bool isGuaranteedToExecute(const Instruction &Inst, const DominatorTree *DT,
const Loop *CurLoop,		const Loop *CurLoop, LoopSafetyInfo *SafetyInfo);
const LoopSafetyInfo *SafetyInfo);

/// \brief Returns the instructions that use values defined in the loop.		/// \brief Returns the instructions that use values defined in the loop.
SmallVector<Instruction *, 8> findDefsUsedOutsideOfLoop(Loop *L);		SmallVector<Instruction *, 8> findDefsUsedOutsideOfLoop(Loop *L);

/// \brief Find string metadata for loop		/// \brief Find string metadata for loop
///		///
/// If it has a value (e.g. {"llvm.distribute", 1} return the value as an		/// If it has a value (e.g. {"llvm.distribute", 1} return the value as an
/// operand or null otherwise. If the string metadata is not found return		/// operand or null otherwise. If the string metadata is not found return

include/llvm/Transforms/Utils/OrderedInstructions.h

	//===- llvm/Transforms/Utils/OrderedInstructions.h -------------*- C++ -*-===//			//===- llvm/Transforms/Utils/OrderedInstructions.h -------------*- C++ -*-===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines an efficient way to check for dominance relation between 2			// This file defines an efficient way to check for dominance relation between 2
	// instructions.			// instructions.
	//			//
	// This interface dispatches to appropriate dominance check given 2
	// instructions, i.e. in case the instructions are in the same basic block,
	// OrderedBasicBlock (with instruction numbering and caching) are used.
	// Otherwise, dominator tree is used.
	//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H			#ifndef LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H
	#define LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H			#define LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H

	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"
	#include "llvm/Analysis/OrderedBasicBlock.h"			#include "llvm/Analysis/OrderedBasicBlock.h"
	#include "llvm/IR/Dominators.h"			#include "llvm/IR/Dominators.h"
	#include "llvm/IR/Operator.h"			#include "llvm/IR/Operator.h"

	namespace llvm {			namespace llvm {

				/// This interface dispatches to appropriate dominance check given 2
				/// instructions, i.e. in case the instructions are in the same basic block,
				/// OrderedBasicBlock (with instruction numbering and caching) are used.
				/// Otherwise, dominator tree is used. This interface relies on the
				/// transformations to invalidate the basic blocks in case instructions in it
				/// are changed.
	class OrderedInstructions {			class OrderedInstructions {
	/// Used to check dominance for instructions in same basic block.			/// Used to check dominance for instructions in same basic block.
	mutable DenseMap<const BasicBlock *, std::unique_ptr<OrderedBasicBlock>>			mutable DenseMap<const BasicBlock *, std::unique_ptr<OrderedBasicBlock>>
	OBBMap;			OBBMap;

	/// The dominator tree of the parent function.			/// The dominator tree of the parent function.
	DominatorTree *DT;			DominatorTree *DT;

	public:			public:
	/// Constructor.			/// Constructor.
	OrderedInstructions(DominatorTree *DT) : DT(DT) {}			OrderedInstructions(DominatorTree *DT) : DT(DT) {}

	/// Return true if first instruction dominates the second.			/// Return true if first instruction dominates the second.
	bool dominates(const Instruction *, const Instruction *) const;			bool dominates(const Instruction *, const Instruction *) const;

	/// Invalidate the OrderedBasicBlock cache when its basic block changes.			/// Invalidate the OrderedBasicBlock cache when its basic block changes.
	void invalidateBlock(BasicBlock *BB) { OBBMap.erase(BB); }			void invalidateBlock(BasicBlock *BB) { OBBMap.erase(BB); }
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H			#endif // LLVM_TRANSFORMS_UTILS_ORDEREDINSTRUCTIONS_H

lib/Transforms/Scalar/LICM.cpp

Show First 20 Lines • Show All 85 Lines • ▼ Show 20 Lines	static cl::opt<uint32_t> MaxNumUsesTraversed(
"licm-max-num-uses-traversed", cl::Hidden, cl::init(8),		"licm-max-num-uses-traversed", cl::Hidden, cl::init(8),
cl::desc("Max num uses visited for identifying load "		cl::desc("Max num uses visited for identifying load "
"invariance in loop using invariant start (default = 8)"));		"invariance in loop using invariant start (default = 8)"));

static bool inSubLoop(BasicBlock *BB, Loop *CurLoop, LoopInfo *LI);		static bool inSubLoop(BasicBlock *BB, Loop *CurLoop, LoopInfo *LI);
static bool isNotUsedInLoop(const Instruction &I, const Loop *CurLoop,		static bool isNotUsedInLoop(const Instruction &I, const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo);		const LoopSafetyInfo *SafetyInfo);
static bool hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,		static bool hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE);
OptimizationRemarkEmitter *ORE);
static bool sink(Instruction &I, const LoopInfo *LI, const DominatorTree *DT,		static bool sink(Instruction &I, const LoopInfo *LI, const DominatorTree *DT,
const Loop *CurLoop, AliasSetTracker *CurAST,		const Loop *CurLoop, AliasSetTracker *CurAST,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE);
OptimizationRemarkEmitter *ORE);
static bool isSafeToExecuteUnconditionally(Instruction &Inst,		static bool isSafeToExecuteUnconditionally(Instruction &Inst,
const DominatorTree *DT,		const DominatorTree *DT,
const Loop *CurLoop,		const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo,
OptimizationRemarkEmitter *ORE,		OptimizationRemarkEmitter *ORE,
const Instruction *CtxI = nullptr);		const Instruction *CtxI = nullptr);
static bool pointerInvalidatedByLoop(Value *V, uint64_t Size,		static bool pointerInvalidatedByLoop(Value *V, uint64_t Size,
const AAMDNodes &AAInfo,		const AAMDNodes &AAInfo,
AliasSetTracker *CurAST);		AliasSetTracker *CurAST);
static Instruction *		static Instruction *
CloneInstructionInExitBlock(Instruction &I, BasicBlock &ExitBlock, PHINode &PN,		CloneInstructionInExitBlock(Instruction &I, BasicBlock &ExitBlock, PHINode &PN,
const LoopInfo *LI,		const LoopInfo *LI,
▲ Show 20 Lines • Show All 126 Lines • ▼ Show 20 Lines	bool LoopInvariantCodeMotion::runOnLoop(Loop L, AliasAnalysis AA,

assert(L->isLCSSAForm(*DT) && "Loop is not in LCSSA form.");		assert(L->isLCSSAForm(*DT) && "Loop is not in LCSSA form.");

AliasSetTracker *CurAST = collectAliasInfoForLoop(L, LI, AA);		AliasSetTracker *CurAST = collectAliasInfoForLoop(L, LI, AA);

// Get the preheader block to move instructions into...		// Get the preheader block to move instructions into...
BasicBlock *Preheader = L->getLoopPreheader();		BasicBlock *Preheader = L->getLoopPreheader();

// Compute loop safety information.		// Compute loop safety information. Along with all the early exits.
LoopSafetyInfo SafetyInfo;		LoopSafetyInfo SafetyInfo(DT);
computeLoopSafetyInfo(&SafetyInfo, L);		computeLoopSafetyInfo(&SafetyInfo, L, true);

// We want to visit all of the instructions in this loop... that are not parts		// We want to visit all of the instructions in this loop... that are not parts
// of our subloops (they have already had their invariants hoisted out of		// of our subloops (they have already had their invariants hoisted out of
// their loop, into this loop, so there is no need to process the BODIES of		// their loop, into this loop, so there is no need to process the BODIES of
// the subloops).		// the subloops).
//		//
// Traverse the body of the loop in depth first order on the dominator tree so		// Traverse the body of the loop in depth first order on the dominator tree so
// that we are guaranteed to see definitions before we see uses. This allows		// that we are guaranteed to see definitions before we see uses. This allows
▲ Show 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	for (BasicBlock::iterator II = BB->end(); II != BB->begin();) {
Instruction &I = *--II;		Instruction &I = *--II;

// If the instruction is dead, we would try to sink it because it isn't used		// If the instruction is dead, we would try to sink it because it isn't used
// in the loop, instead, just delete it.		// in the loop, instead, just delete it.
if (isInstructionTriviallyDead(&I, TLI)) {		if (isInstructionTriviallyDead(&I, TLI)) {
DEBUG(dbgs() << "LICM deleting dead inst: " << I << '\n');		DEBUG(dbgs() << "LICM deleting dead inst: " << I << '\n');
++II;		++II;
CurAST->deleteValue(&I);		CurAST->deleteValue(&I);
		// We could have treated this instruction as an early exit, update the
		// early exit list.
		SafetyInfo->deleteExit(&I);
		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(I.getParent());
I.eraseFromParent();		I.eraseFromParent();
Changed = true;		Changed = true;
continue;		continue;
}		}

// Check to see if we can sink this instruction to the exit blocks		// Check to see if we can sink this instruction to the exit blocks
// of the loop. We can do this if the all users of the instruction are		// of the loop. We can do this if the all users of the instruction are
// outside of the loop. In this case, it doesn't even matter if the		// outside of the loop. In this case, it doesn't even matter if the
Show All 39 Lines	for (BasicBlock::iterator II = BB->begin(), E = BB->end(); II != E;) {
// fold it.		// fold it.
if (Constant *C = ConstantFoldInstruction(		if (Constant *C = ConstantFoldInstruction(
&I, I.getModule()->getDataLayout(), TLI)) {		&I, I.getModule()->getDataLayout(), TLI)) {
DEBUG(dbgs() << "LICM folding inst: " << I << " --> " << *C << '\n');		DEBUG(dbgs() << "LICM folding inst: " << I << " --> " << *C << '\n');
CurAST->copyValue(&I, C);		CurAST->copyValue(&I, C);
I.replaceAllUsesWith(C);		I.replaceAllUsesWith(C);
if (isInstructionTriviallyDead(&I, TLI)) {		if (isInstructionTriviallyDead(&I, TLI)) {
CurAST->deleteValue(&I);		CurAST->deleteValue(&I);
		// We could have treated this instruction as an early exit, update the
		// early exit list.
		SafetyInfo->deleteExit(&I);
		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(I.getParent());
I.eraseFromParent();		I.eraseFromParent();
}		}
Changed = true;		Changed = true;
continue;		continue;
}		}

// Attempt to remove floating point division out of the loop by converting		// Attempt to remove floating point division out of the loop by converting
// it to a reciprocal multiplication.		// it to a reciprocal multiplication.
if (I.getOpcode() == Instruction::FDiv &&		if (I.getOpcode() == Instruction::FDiv &&
CurLoop->isLoopInvariant(I.getOperand(1)) &&		CurLoop->isLoopInvariant(I.getOperand(1)) &&
I.hasAllowReciprocal()) {		I.hasAllowReciprocal()) {
auto Divisor = I.getOperand(1);		auto Divisor = I.getOperand(1);
auto One = llvm::ConstantFP::get(Divisor->getType(), 1.0);		auto One = llvm::ConstantFP::get(Divisor->getType(), 1.0);
auto ReciprocalDivisor = BinaryOperator::CreateFDiv(One, Divisor);		auto ReciprocalDivisor = BinaryOperator::CreateFDiv(One, Divisor);
ReciprocalDivisor->setFastMathFlags(I.getFastMathFlags());		ReciprocalDivisor->setFastMathFlags(I.getFastMathFlags());
ReciprocalDivisor->insertBefore(&I);		ReciprocalDivisor->insertBefore(&I);

auto Product = BinaryOperator::CreateFMul(I.getOperand(0),		auto Product = BinaryOperator::CreateFMul(I.getOperand(0),
ReciprocalDivisor);		ReciprocalDivisor);
Product->setFastMathFlags(I.getFastMathFlags());		Product->setFastMathFlags(I.getFastMathFlags());
Product->insertAfter(&I);		Product->insertAfter(&I);
I.replaceAllUsesWith(Product);		I.replaceAllUsesWith(Product);
		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(I.getParent());
		// We do not update the early exit list here, i.e. how can we treat this
		// instruction as an early exit ?
		assert(!SafetyInfo->isExit(&I) && "Invalid early exit");
I.eraseFromParent();		I.eraseFromParent();

hoist(*ReciprocalDivisor, DT, CurLoop, SafetyInfo, ORE);		hoist(*ReciprocalDivisor, DT, CurLoop, SafetyInfo, ORE);
Changed = true;		Changed = true;
continue;		continue;
}		}

// Try hoisting the instruction out to the preheader. We can only do this		// Try hoisting the instruction out to the preheader. We can only do this
Show All 13 Lines	for (DomTreeNode *Child : Children)
Changed |=		Changed |=
hoistRegion(Child, AA, LI, DT, TLI, CurLoop, CurAST, SafetyInfo, ORE);		hoistRegion(Child, AA, LI, DT, TLI, CurLoop, CurAST, SafetyInfo, ORE);
return Changed;		return Changed;
}		}

/// Computes loop safety information, checks loop body & header		/// Computes loop safety information, checks loop body & header
/// for the possibility of may throw exception.		/// for the possibility of may throw exception.
///		///
void llvm::computeLoopSafetyInfo(LoopSafetyInfo *SafetyInfo, Loop *CurLoop) {		void llvm::computeLoopSafetyInfo(LoopSafetyInfo *SafetyInfo, Loop *CurLoop,
		bool ComputeEarlyExits) {
assert(CurLoop != nullptr && "CurLoop cant be null");		assert(CurLoop != nullptr && "CurLoop cant be null");
BasicBlock *Header = CurLoop->getHeader();		BasicBlock *Header = CurLoop->getHeader();
// Setting default safety values.		// Setting default safety values.
SafetyInfo->MayThrow = false;		SafetyInfo->MayThrow = false;
SafetyInfo->HeaderMayThrow = false;		SafetyInfo->HeaderMayThrow = false;
		SafetyInfo->EarlyExits.clear();
		SafetyInfo->ComputedEarlyExits = ComputeEarlyExits;
// Iterate over header and compute safety info.		// Iterate over header and compute safety info.
for (BasicBlock::iterator I = Header->begin(), E = Header->end();		for (BasicBlock::iterator I = Header->begin(), E = Header->end(); I != E;
(I != E) && !SafetyInfo->HeaderMayThrow; ++I)		++I) {
SafetyInfo->HeaderMayThrow |=		bool MayThrow = !isGuaranteedToTransferExecutionToSuccessor(&*I);
!isGuaranteedToTransferExecutionToSuccessor(&*I);		SafetyInfo->HeaderMayThrow |= MayThrow;
		// Exit as soon as we find an instruction that may throw in case we are
		// not computing early exits.
		if (!ComputeEarlyExits && SafetyInfo->HeaderMayThrow)
		break;
		if (MayThrow)
		SafetyInfo->EarlyExits.insert(&*I);
		}

SafetyInfo->MayThrow = SafetyInfo->HeaderMayThrow;		SafetyInfo->MayThrow = SafetyInfo->HeaderMayThrow;
// Iterate over loop instructions and compute safety info.		// Iterate over loop instructions and compute safety info.
// Skip header as it has been computed and stored in HeaderMayThrow.		// Skip header as it has been computed and stored in HeaderMayThrow.
// The first block in loopinfo.Blocks is guaranteed to be the header.		// The first block in loopinfo.Blocks is guaranteed to be the header.
assert(Header == *CurLoop->getBlocks().begin() && "First block must be header");		assert(Header == *CurLoop->getBlocks().begin() && "First block must be header");
for (Loop::block_iterator BB = std::next(CurLoop->block_begin()),		for (Loop::block_iterator BB = std::next(CurLoop->block_begin()),
BBE = CurLoop->block_end();		BBE = CurLoop->block_end();
(BB != BBE) && !SafetyInfo->MayThrow; ++BB)		BB != BBE; ++BB)
for (BasicBlock::iterator I = (BB)->begin(), E = (BB)->end();		for (BasicBlock::iterator I = (BB)->begin(), E = (BB)->end(); I != E;
(I != E) && !SafetyInfo->MayThrow; ++I)		++I) {
SafetyInfo->MayThrow |= !isGuaranteedToTransferExecutionToSuccessor(&*I);		bool MayThrow = !isGuaranteedToTransferExecutionToSuccessor(&*I);
		SafetyInfo->MayThrow |= MayThrow;
		// Exit as soon as we find an instruction that may throw in case we are
		// not computing early exits.
		if (!ComputeEarlyExits && SafetyInfo->MayThrow)
		break;
		if (MayThrow)
		SafetyInfo->EarlyExits.insert(&*I);
		}

// Compute funclet colors if we might sink/hoist in a function with a funclet		// Compute funclet colors if we might sink/hoist in a function with a funclet
// personality routine.		// personality routine.
Function *Fn = CurLoop->getHeader()->getParent();		Function *Fn = CurLoop->getHeader()->getParent();
if (Fn->hasPersonalityFn())		if (Fn->hasPersonalityFn())
if (Constant *PersonalityFn = Fn->getPersonalityFn())		if (Constant *PersonalityFn = Fn->getPersonalityFn())
if (isFuncletEHPersonality(classifyEHPersonality(PersonalityFn)))		if (isFuncletEHPersonality(classifyEHPersonality(PersonalityFn)))
SafetyInfo->BlockColors = colorEHFunclets(*Fn);		SafetyInfo->BlockColors = colorEHFunclets(*Fn);
▲ Show 20 Lines • Show All 279 Lines • ▼ Show 20 Lines

/// When an instruction is found to only be used outside of the loop, this		/// When an instruction is found to only be used outside of the loop, this
/// function moves it to the exit blocks and patches up SSA form as needed.		/// function moves it to the exit blocks and patches up SSA form as needed.
/// This method is guaranteed to remove the original instruction from its		/// This method is guaranteed to remove the original instruction from its
/// position, and may either delete it or move it to outside of the loop.		/// position, and may either delete it or move it to outside of the loop.
///		///
static bool sink(Instruction &I, const LoopInfo *LI, const DominatorTree *DT,		static bool sink(Instruction &I, const LoopInfo *LI, const DominatorTree *DT,
const Loop *CurLoop, AliasSetTracker *CurAST,		const Loop *CurLoop, AliasSetTracker *CurAST,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE) {
OptimizationRemarkEmitter *ORE) {
DEBUG(dbgs() << "LICM sinking instruction: " << I << "\n");		DEBUG(dbgs() << "LICM sinking instruction: " << I << "\n");
ORE->emit(OptimizationRemark(DEBUG_TYPE, "InstSunk", &I)		ORE->emit(OptimizationRemark(DEBUG_TYPE, "InstSunk", &I)
<< "sinking " << ore::NV("Inst", &I));		<< "sinking " << ore::NV("Inst", &I));
bool Changed = false;		bool Changed = false;
if (isa<LoadInst>(I))		if (isa<LoadInst>(I))
++NumMovedLoads;		++NumMovedLoads;
else if (isa<CallInst>(I))		else if (isa<CallInst>(I))
++NumMovedCalls;		++NumMovedCalls;
Show All 40 Lines	while (!I.use_empty()) {
Instruction *New;		Instruction *New;
auto It = SunkCopies.find(ExitBlock);		auto It = SunkCopies.find(ExitBlock);
if (It != SunkCopies.end())		if (It != SunkCopies.end())
New = It->second;		New = It->second;
else		else
New = SunkCopies[ExitBlock] =		New = SunkCopies[ExitBlock] =
CloneInstructionInExitBlock(I, ExitBlock, PN, LI, SafetyInfo);		CloneInstructionInExitBlock(I, ExitBlock, PN, LI, SafetyInfo);

		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(PN->getParent());
PN->replaceAllUsesWith(New);		PN->replaceAllUsesWith(New);
PN->eraseFromParent();		PN->eraseFromParent();
}		}


		// We could have treated this instruction as an early exit, update the
		// early exit list.
		SafetyInfo->deleteExit(&I);
		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(I.getParent());
CurAST->deleteValue(&I);		CurAST->deleteValue(&I);
I.eraseFromParent();		I.eraseFromParent();
return Changed;		return Changed;
}		}

/// When an instruction is found to only use loop invariant operands that		/// When an instruction is found to only use loop invariant operands that
/// is safe to hoist, this instruction is called to do the dirty work.		/// is safe to hoist, this instruction is called to do the dirty work.
///		///
static bool hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,		static bool hoist(Instruction &I, const DominatorTree *DT, const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo, OptimizationRemarkEmitter *ORE) {
OptimizationRemarkEmitter *ORE) {
auto *Preheader = CurLoop->getLoopPreheader();		auto *Preheader = CurLoop->getLoopPreheader();
DEBUG(dbgs() << "LICM hoisting to " << Preheader->getName() << ": " << I		DEBUG(dbgs() << "LICM hoisting to " << Preheader->getName() << ": " << I
<< "\n");		<< "\n");
ORE->emit(OptimizationRemark(DEBUG_TYPE, "Hoisted", &I)		ORE->emit(OptimizationRemark(DEBUG_TYPE, "Hoisted", &I)
<< "hoisting " << ore::NV("Inst", &I));		<< "hoisting " << ore::NV("Inst", &I));

// Metadata can be dependent on conditions we are hoisting above.		// Metadata can be dependent on conditions we are hoisting above.
// Conservatively strip all metadata on the instruction unless we were		// Conservatively strip all metadata on the instruction unless we were
// guaranteed to execute I if we entered the loop, in which case the metadata		// guaranteed to execute I if we entered the loop, in which case the metadata
// is valid in the loop preheader.		// is valid in the loop preheader.
if (I.hasMetadataOtherThanDebugLoc() &&		if (I.hasMetadataOtherThanDebugLoc() &&
// The check on hasMetadataOtherThanDebugLoc is to prevent us from burning		// The check on hasMetadataOtherThanDebugLoc is to prevent us from burning
// time in isGuaranteedToExecute if we don't actually have anything to		// time in isGuaranteedToExecute if we don't actually have anything to
// drop. It is a compile time optimization, not required for correctness.		// drop. It is a compile time optimization, not required for correctness.
!isGuaranteedToExecute(I, DT, CurLoop, SafetyInfo))		!isGuaranteedToExecute(I, DT, CurLoop, SafetyInfo))
I.dropUnknownNonDebugMetadata();		I.dropUnknownNonDebugMetadata();

		// We could have treated this instruction as an early exit, update the
		// early exit list.
		SafetyInfo->deleteExit(&I);
		// The ordered instruction list for this block is no longer valid.
		SafetyInfo->OI.invalidateBlock(Preheader);
		SafetyInfo->OI.invalidateBlock(I.getParent());
// Move the new node to the Preheader, before its terminator.		// Move the new node to the Preheader, before its terminator.
I.moveBefore(Preheader->getTerminator());		I.moveBefore(Preheader->getTerminator());

// Do not retain debug locations when we are moving instructions to different		// Do not retain debug locations when we are moving instructions to different
// basic blocks, because we want to avoid jumpy line tables. Calls, however,		// basic blocks, because we want to avoid jumpy line tables. Calls, however,
// need to retain their debug locs because they may be inlined.		// need to retain their debug locs because they may be inlined.
// FIXME: How do we retain source locations without causing poor debugging		// FIXME: How do we retain source locations without causing poor debugging
// behavior?		// behavior?
Show All 9 Lines
}		}

/// Only sink or hoist an instruction if it is not a trapping instruction,		/// Only sink or hoist an instruction if it is not a trapping instruction,
/// or if the instruction is known not to trap when moved to the preheader.		/// or if the instruction is known not to trap when moved to the preheader.
/// or if it is a trapping instruction and is guaranteed to execute.		/// or if it is a trapping instruction and is guaranteed to execute.
static bool isSafeToExecuteUnconditionally(Instruction &Inst,		static bool isSafeToExecuteUnconditionally(Instruction &Inst,
const DominatorTree *DT,		const DominatorTree *DT,
const Loop *CurLoop,		const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo,		LoopSafetyInfo *SafetyInfo,
OptimizationRemarkEmitter *ORE,		OptimizationRemarkEmitter *ORE,
const Instruction *CtxI) {		const Instruction *CtxI) {
if (isSafeToSpeculativelyExecute(&Inst, CtxI, DT))		if (isSafeToSpeculativelyExecute(&Inst, CtxI, DT))
return true;		return true;

bool GuaranteedToExecute =		bool GuaranteedToExecute =
isGuaranteedToExecute(Inst, DT, CurLoop, SafetyInfo);		isGuaranteedToExecute(Inst, DT, CurLoop, SafetyInfo);

▲ Show 20 Lines • Show All 477 Lines • Show Last 20 Lines

lib/Transforms/Scalar/LoopIdiomRecognize.cpp

Show First 20 Lines • Show All 277 Lines • ▼ Show 20 Lines	bool LoopIdiomRecognize::runOnCountableLoop() {
DEBUG(dbgs() << "loop-idiom Scanning: F["		DEBUG(dbgs() << "loop-idiom Scanning: F["
<< CurLoop->getHeader()->getParent()->getName() << "] Loop %"		<< CurLoop->getHeader()->getParent()->getName() << "] Loop %"
<< CurLoop->getHeader()->getName() << "\n");		<< CurLoop->getHeader()->getName() << "\n");

bool MadeChange = false;		bool MadeChange = false;

// The following transforms hoist stores/memsets into the loop pre-header.		// The following transforms hoist stores/memsets into the loop pre-header.
// Give up if the loop has instructions may throw.		// Give up if the loop has instructions may throw.
LoopSafetyInfo SafetyInfo;		LoopSafetyInfo SafetyInfo(DT);
computeLoopSafetyInfo(&SafetyInfo, CurLoop);		computeLoopSafetyInfo(&SafetyInfo, CurLoop);
if (SafetyInfo.MayThrow)		if (SafetyInfo.MayThrow)
return MadeChange;		return MadeChange;

// Scan all the blocks in the loop that are not in subloops.		// Scan all the blocks in the loop that are not in subloops.
for (auto *BB : CurLoop->getBlocks()) {		for (auto *BB : CurLoop->getBlocks()) {
// Ignore blocks in subloops.		// Ignore blocks in subloops.
if (LI->getLoopFor(BB) != CurLoop)		if (LI->getLoopFor(BB) != CurLoop)
▲ Show 20 Lines • Show All 1,391 Lines • Show Last 20 Lines

lib/Transforms/Scalar/LoopUnswitch.cpp

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	class LoopUnswitch : public LoopPass {
bool redoLoop;		bool redoLoop;

Loop *currentLoop;		Loop *currentLoop;
DominatorTree *DT;		DominatorTree *DT;
BasicBlock *loopHeader;		BasicBlock *loopHeader;
BasicBlock *loopPreheader;		BasicBlock *loopPreheader;

bool SanitizeMemory;		bool SanitizeMemory;
LoopSafetyInfo SafetyInfo;

// LoopBlocks contains all of the basic blocks of the loop, including the		// LoopBlocks contains all of the basic blocks of the loop, including the
// preheader of the loop, the body of the loop, and the exit blocks of the		// preheader of the loop, the body of the loop, and the exit blocks of the
// loop, in that order.		// loop, in that order.
std::vector<BasicBlock*> LoopBlocks;		std::vector<BasicBlock*> LoopBlocks;
// NewBlocks contained cloned copy of basic blocks from LoopBlocks.		// NewBlocks contained cloned copy of basic blocks from LoopBlocks.
std::vector<BasicBlock*> NewBlocks;		std::vector<BasicBlock*> NewBlocks;

bool hasBranchDivergence;		bool hasBranchDivergence;

public:		public:
static char ID; // Pass ID, replacement for typeid		static char ID; // Pass ID, replacement for typeid
explicit LoopUnswitch(bool Os = false, bool hasBranchDivergence = false) :		explicit LoopUnswitch(bool Os = false, bool hasBranchDivergence = false) :
LoopPass(ID), OptimizeForSize(Os), redoLoop(false),		LoopPass(ID), OptimizeForSize(Os), redoLoop(false),
currentLoop(nullptr), DT(nullptr), loopHeader(nullptr),		currentLoop(nullptr), DT(nullptr), loopHeader(nullptr),
loopPreheader(nullptr), hasBranchDivergence(hasBranchDivergence) {		loopPreheader(nullptr), hasBranchDivergence(hasBranchDivergence) {
initializeLoopUnswitchPass(*PassRegistry::getPassRegistry());		initializeLoopUnswitchPass(*PassRegistry::getPassRegistry());
}		}

bool runOnLoop(Loop *L, LPPassManager &LPM) override;		bool runOnLoop(Loop *L, LPPassManager &LPM) override;
bool processCurrentLoop();		bool processCurrentLoop(LoopSafetyInfo *SafetyInfo);
bool isUnreachableDueToPreviousUnswitching(BasicBlock *);		bool isUnreachableDueToPreviousUnswitching(BasicBlock *);
/// This transformation requires natural loop information & requires that		/// This transformation requires natural loop information & requires that
/// loop preheaders be inserted into the CFG.		/// loop preheaders be inserted into the CFG.
///		///
void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.addRequired<AssumptionCacheTracker>();		AU.addRequired<AssumptionCacheTracker>();
AU.addRequired<TargetTransformInfoWrapperPass>();		AU.addRequired<TargetTransformInfoWrapperPass>();
if (hasBranchDivergence)		if (hasBranchDivergence)
▲ Show 20 Lines • Show All 298 Lines • ▼ Show 20 Lines	AC = &getAnalysis<AssumptionCacheTracker>().getAssumptionCache(
*L->getHeader()->getParent());		*L->getHeader()->getParent());
LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();		LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();
LPM = &LPM_Ref;		LPM = &LPM_Ref;
DT = &getAnalysis<DominatorTreeWrapperPass>().getDomTree();		DT = &getAnalysis<DominatorTreeWrapperPass>().getDomTree();
currentLoop = L;		currentLoop = L;
Function *F = currentLoop->getHeader()->getParent();		Function *F = currentLoop->getHeader()->getParent();

SanitizeMemory = F->hasFnAttribute(Attribute::SanitizeMemory);		SanitizeMemory = F->hasFnAttribute(Attribute::SanitizeMemory);
		LoopSafetyInfo SafetyInfo(DT);
if (SanitizeMemory)		if (SanitizeMemory)
computeLoopSafetyInfo(&SafetyInfo, L);		computeLoopSafetyInfo(&SafetyInfo, L);

bool Changed = false;		bool Changed = false;
do {		do {
assert(currentLoop->isLCSSAForm(*DT));		assert(currentLoop->isLCSSAForm(*DT));
redoLoop = false;		redoLoop = false;
Changed |= processCurrentLoop();		Changed |= processCurrentLoop(&SafetyInfo);
} while(redoLoop);		} while(redoLoop);

// FIXME: Reconstruct dom info, because it is not preserved properly.		// FIXME: Reconstruct dom info, because it is not preserved properly.
if (Changed)		if (Changed)
DT->recalculate(*F);		DT->recalculate(*F);
return Changed;		return Changed;
}		}

Show All 22 Lines	while (currentLoop->contains(DomBB)) {

if (DT->dominates(UnreachableSucc, BB))		if (DT->dominates(UnreachableSucc, BB))
return true;		return true;
}		}
return false;		return false;
}		}

/// Do actual work and unswitch loop if possible and profitable.		/// Do actual work and unswitch loop if possible and profitable.
bool LoopUnswitch::processCurrentLoop() {		bool LoopUnswitch::processCurrentLoop(LoopSafetyInfo *SafetyInfo) {
bool Changed = false;		bool Changed = false;

initLoopData();		initLoopData();

// If LoopSimplify was unable to form a preheader, don't do any unswitching.		// If LoopSimplify was unable to form a preheader, don't do any unswitching.
if (!loopPreheader)		if (!loopPreheader)
return false;		return false;

▲ Show 20 Lines • Show All 78 Lines • ▼ Show 20 Lines	for (Loop::block_iterator I = currentLoop->block_begin(),

// Unswitching on a potentially uninitialized predicate is not		// Unswitching on a potentially uninitialized predicate is not
// MSan-friendly. Limit this to the cases when the original predicate is		// MSan-friendly. Limit this to the cases when the original predicate is
// guaranteed to execute, to avoid creating a use-of-uninitialized-value		// guaranteed to execute, to avoid creating a use-of-uninitialized-value
// in the code that did not have one.		// in the code that did not have one.
// This is a workaround for the discrepancy between LLVM IR and MSan		// This is a workaround for the discrepancy between LLVM IR and MSan
// semantics. See PR28054 for more details.		// semantics. See PR28054 for more details.
if (SanitizeMemory &&		if (SanitizeMemory &&
!isGuaranteedToExecute(*TI, DT, currentLoop, &SafetyInfo))		!isGuaranteedToExecute(*TI, DT, currentLoop, SafetyInfo))
continue;		continue;

if (BranchInst *BI = dyn_cast<BranchInst>(TI)) {		if (BranchInst *BI = dyn_cast<BranchInst>(TI)) {
// Some branches may be rendered unreachable because of previous		// Some branches may be rendered unreachable because of previous
// unswitching.		// unswitching.
// Unswitch only those branches that are reachable.		// Unswitch only those branches that are reachable.
if (isUnreachableDueToPreviousUnswitching(*I))		if (isUnreachableDueToPreviousUnswitching(*I))
continue;		continue;
▲ Show 20 Lines • Show All 867 Lines • Show Last 20 Lines

lib/Transforms/Utils/LoopUtils.cpp

Show All 11 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Transforms/Utils/LoopUtils.h"		#include "llvm/Transforms/Utils/LoopUtils.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/BasicAliasAnalysis.h"		#include "llvm/Analysis/BasicAliasAnalysis.h"
#include "llvm/Analysis/GlobalsModRef.h"		#include "llvm/Analysis/GlobalsModRef.h"
#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/LoopPass.h"		#include "llvm/Analysis/LoopPass.h"
		#include "llvm/Analysis/OrderedBasicBlock.h"
#include "llvm/Analysis/ScalarEvolution.h"		#include "llvm/Analysis/ScalarEvolution.h"
#include "llvm/Analysis/ScalarEvolutionAliasAnalysis.h"		#include "llvm/Analysis/ScalarEvolutionAliasAnalysis.h"
#include "llvm/Analysis/ScalarEvolutionExpander.h"		#include "llvm/Analysis/ScalarEvolutionExpander.h"
#include "llvm/Analysis/ScalarEvolutionExpressions.h"		#include "llvm/Analysis/ScalarEvolutionExpressions.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
▲ Show 20 Lines • Show All 1,009 Lines • ▼ Show 20 Lines	Optional<const MDOperand *> llvm::findStringMetadataForLoop(Loop *TheLoop,
}		}
return None;		return None;
}		}

/// Returns true if the instruction in a loop is guaranteed to execute at least		/// Returns true if the instruction in a loop is guaranteed to execute at least
/// once.		/// once.
bool llvm::isGuaranteedToExecute(const Instruction &Inst,		bool llvm::isGuaranteedToExecute(const Instruction &Inst,
const DominatorTree *DT, const Loop *CurLoop,		const DominatorTree *DT, const Loop *CurLoop,
const LoopSafetyInfo *SafetyInfo) {		LoopSafetyInfo *SafetyInfo) {
// We have to check to make sure that the instruction dominates all		// If we have computed early exits, use them.
// of the exit blocks. If it doesn't, then there is a path out of the loop		if (SafetyInfo->ComputedEarlyExits) {
// which does not execute this instruction, so we can't hoist it.		// Check whether the instruction dominates all early exits. If it doesn't,
		// then there is a path out of the loop which does not execute this
		// instruction and it's not guaranteed to execute.
		for (Instruction *ExitInst : SafetyInfo->EarlyExits)
		if (!SafetyInfo->OI.dominates(&Inst, ExitInst))
		efriedma: Actually, one more issue we probably need to address. This is potentially linear in the number of calls in the loop, since we don't try to prune EarlyExits at all. This makes LICM potentially O(n^2) overall. Can we store this information in some more efficient way?
		trentxintong (author): We could certainly prune the early exits by discovering them while walking the dominator tree. If an instruction dominates an early exit (Inst A), it certainly dominates all the other early exits dominated by Inst A, so we do not need to include those early exits in the list. The drawback is that if an early exit is deleted, which is possible but unlikely, we need to invalidate the early exit list (we could try to recompute everything, but that is probably not worth it). By doing this, we potentially save a lot of OI.dominates checks. The worst case is still O(n^2) for a loop with many basic blocks that do not dominate each other and each of which has early exits, but this happens infrequently in practice IMO.
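The pruning rule in the comment above can be sketched as follows. This is only an illustration, not code from the patch: `pruneDominatedExits` and the integer "instructions" are hypothetical, the dominance test is passed in as a callback, and the filter is deliberately a simple quadratic pass rather than the dominator tree walk the comment proposes.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical helper: keeps only the early exits that no other early exit
// dominates. By transitivity, an instruction that dominates every kept exit
// also dominates every dropped one. Dominates(A, B) is assumed to be a strict
// dominance relation (never true for A == B).
std::vector<int>
pruneDominatedExits(const std::vector<int> &Exits,
                    const std::function<bool(int, int)> &Dominates) {
  std::vector<int> Kept;
  for (std::size_t I = 0; I < Exits.size(); ++I) {
    bool Dominated = false;
    for (std::size_t J = 0; J < Exits.size() && !Dominated; ++J)
      Dominated = (I != J) && Dominates(Exits[J], Exits[I]);
    if (!Dominated)
      Kept.push_back(Exits[I]);
  }
  return Kept;
}
```

In a straight-line block, where dominance is just "comes earlier", only the first early exit survives. When no exit dominates another (the sibling-block case the comment mentions), nothing is pruned and the list stays the same size.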
		dberlin: Unless i'm misunderstanding, and maybe i am (!), I believe this can be made optimal:

		Here's the normal way: The ordering is completely described by the DFS in/out numbers of the dom tree + local numbers. It should completely suffice to, instead of just ordering the in/out of the DT nodes, model in/out of the entire block. IE

		    Count = 0
		    number(root of DT)

		    number(DTNode):
		      In = Count
		      for each DT child:
		        number(child)
		      for each instruction in this block
		        InstNum[I] = ++Count
		      Out = Count

		(or something similar)

		The early exits you dominate is, in the same block, any early exit with a number greater than yours, and for other blocks, any block with an in > your in and out < your out. This is trivially stored in a sorted smallvector. You dominate all the early exits only if using the above comparator, lower bound says you would appear first in the smallvector.

		I believe you can get away with a lot less, but not positive ATM: By definition, anything that dominates all the early exits must be in a block that has a DT level number <= than all of them. If all the exits are terminators, than you don't need the local numbering or DFS numbering at all, i think (I rely on Eli to point out any flaw in my reasoning, since i'm dashing this off pretty quickly). You just need the entries with the lowest DT level (in case they are siblings in the dominator tree). Then it would just be:

		    if DT level number < lowest DT level number of an exit, you dominate all exits
		    if DT level number == lowest DT level number then:
		      if vector.size() == 1 && exit block == instruction block
		        you dominate all exits
		      else
		        you do not (because there must be a sibling you don't dominate)

		I think one or the other of the above should work. The DT level numbers are now available in the DomTree :)
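A minimal sketch of the first numbering scheme in the comment above, on a toy dominator tree. Everything here is hypothetical (`ToyDomNode`, `ToyInstRef`, `numberSubtree`, `toyDominates`, `makeExampleTree`); none of it is LLVM's API. It also departs from the pseudocode in two ways, both assumptions made so that strict interval comparisons work: the counter is bumped when `In` and `Out` are recorded, and a block's own instructions are numbered before its children.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy model of a dominator-tree node and its basic block.
struct ToyDomNode {
  std::size_t NumInsts = 0;          // instructions in this block
  std::vector<std::size_t> Children; // dominator-tree children (node indices)
  unsigned In = 0, Out = 0;          // interval covering the whole subtree
  std::vector<unsigned> InstNum;     // number assigned to each instruction
};

struct ToyInstRef {
  std::size_t Block; // index of the containing node
  std::size_t Index; // position inside that block
};

// One DFS over the tree assigns In/Out to every node and a local number to
// every instruction.
void numberSubtree(std::vector<ToyDomNode> &Tree, std::size_t Node,
                   unsigned &Count) {
  ToyDomNode &N = Tree[Node];
  N.In = ++Count;
  N.InstNum.clear();
  for (std::size_t I = 0; I < N.NumInsts; ++I)
    N.InstNum.push_back(++Count);
  for (std::size_t Child : N.Children)
    numberSubtree(Tree, Child, Count);
  N.Out = ++Count;
}

// Constant-time strict dominance query: local numbers inside one block,
// interval nesting across blocks.
bool toyDominates(const std::vector<ToyDomNode> &Tree, ToyInstRef A,
                  ToyInstRef B) {
  assert(A.Block < Tree.size() && B.Block < Tree.size());
  if (A.Block == B.Block)
    return Tree[A.Block].InstNum[A.Index] < Tree[B.Block].InstNum[B.Index];
  const ToyDomNode &X = Tree[A.Block];
  const ToyDomNode &Y = Tree[B.Block];
  return X.In < Y.In && Y.Out < X.Out;
}

// Example: node 0 (two instructions) dominates sibling nodes 1 and 2.
std::vector<ToyDomNode> makeExampleTree() {
  std::vector<ToyDomNode> Tree(3);
  Tree[0].NumInsts = 2;
  Tree[0].Children = {1, 2};
  Tree[1].NumInsts = 1;
  Tree[2].NumInsts = 1;
  unsigned Count = 0;
  numberSubtree(Tree, 0, Count);
  return Tree;
}
```

With this numbering, any instruction in node 0 dominates the instructions of nodes 1 and 2, while the two siblings do not dominate each other. The sorted-vector lookup for "dominates all early exits" described in the comment is not shown here.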
		return false;
		} else {
// If the instruction is in the header block for the loop (which is very		// If the instruction is in the header block for the loop (which is very
// common), it is always guaranteed to dominate the exit blocks. Since this		// common), it is always guaranteed to dominate the exit blocks. Since this
// is a common case, and can save some work, check it now.		// is a common case, and can save some work, check it now.
if (Inst.getParent() == CurLoop->getHeader())		if (Inst.getParent() == CurLoop->getHeader())
// If there's a throw in the header block, we can't guarantee we'll reach		// If there's a throw in the header block, we can't guarantee we'll reach
// Inst.		// Inst.
return !SafetyInfo->HeaderMayThrow;		return !SafetyInfo->HeaderMayThrow;

// Somewhere in this loop there is an instruction which may throw and make us		// Somewhere in this loop there is an instruction which may throw and make
// exit the loop.		// us exit the loop.
if (SafetyInfo->MayThrow)		if (SafetyInfo->MayThrow)
return false;		return false;
		}

		// We have to check to make sure that the instruction dominates all
		// of the exit blocks. If it doesn't, then there is a path out of the loop
		// which does not execute this instruction.
// Get the exit blocks for the current loop.		// Get the exit blocks for the current loop.
SmallVector<BasicBlock *, 8> ExitBlocks;		SmallVector<BasicBlock *, 8> ExitBlocks;
CurLoop->getExitBlocks(ExitBlocks);		CurLoop->getExitBlocks(ExitBlocks);

// Verify that the block dominates each of the exit blocks of the loop.		// Verify that the block dominates each of the exit blocks of the loop.
for (BasicBlock *ExitBlock : ExitBlocks)		for (BasicBlock *ExitBlock : ExitBlocks)
if (!DT->dominates(Inst.getParent(), ExitBlock))		if (!DT->dominates(Inst.getParent(), ExitBlock))
return false;		return false;

// As a degenerate case, if the loop is statically infinite then we haven't		// As a degenerate case, if the loop is statically infinite then we haven't
// proven anything since there are no exit blocks.		// proven anything since there are no exit blocks.
if (ExitBlocks.empty())		// However, we also special case instruction from the header as the header
		// is always guaranteed to execute.
		if (ExitBlocks.empty() && Inst.getParent() != CurLoop->getHeader())
		efriedma: See the FIXME below... but I guess there's another way to do this check: if Inst dominates the backedge, then it's guaranteed to execute even if the loop is infinite. A bit more general than checking that the parent is exactly the header.
		efriedma: Err, wait, no, sorry, the check would be that Inst dominates the backedge, and there isn't any infinite loop between the start of the loop and the backedge. I guess just checking whether Inst is in the header is good enough for now.
return false;		return false;

// FIXME: In general, we have to prove that the loop isn't an infinite loop.		// FIXME: In general, we have to prove that the loop isn't an infinite loop.
// See http::llvm.org/PR24078 . (The "ExitBlocks.empty()" check above is		// See http::llvm.org/PR24078 . (The "ExitBlocks.empty()" check above is
// just a special case of this.)		// just a special case of this.)
return true;		return true;
}		}
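
To illustrate the difference between the old flag-based check and the early-exit check, here is a toy model restricted to a single header block, where "dominates" reduces to "comes earlier in the block". The names `ToyInst`, `guaranteedCoarse`, and `guaranteedFine` are hypothetical and are not part of the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical single-block model: each instruction only records whether it
// might fail to transfer execution to its successor (throw, exit, and so on).
struct ToyInst {
  bool MayNotReachSuccessor;
};

// Coarse check, in the spirit of HeaderMayThrow: one flag for the whole block,
// so the position of the queried instruction is ignored.
bool guaranteedCoarse(const std::vector<ToyInst> &Header, std::size_t) {
  for (const ToyInst &I : Header)
    if (I.MayNotReachSuccessor)
      return false;
  return true;
}

// Finer check, in the spirit of the EarlyExits list: only early exits that
// come before the instruction at Idx can prevent it from executing.
bool guaranteedFine(const std::vector<ToyInst> &Header, std::size_t Idx) {
  assert(Idx < Header.size());
  for (std::size_t I = 0; I < Idx; ++I)
    if (Header[I].MayNotReachSuccessor)
      return false;
  return true;
}
```

A block shaped like `udiv; call` answers false under the coarse check but true under the finer one for the `udiv`, which mirrors the `throw_header1` test. A block shaped like `call; udiv; call` answers false under both, which mirrors `throw_header2`.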

▲ Show 20 Lines • Show All 232 Lines • Show Last 20 Lines

lib/Transforms/Utils/OrderedInstructions.cpp

	Show All 13 Lines
	#include "llvm/Transforms/Utils/OrderedInstructions.h"			#include "llvm/Transforms/Utils/OrderedInstructions.h"
	using namespace llvm;			using namespace llvm;

	/// Given 2 instructions, use OrderedBasicBlock to check for dominance relation			/// Given 2 instructions, use OrderedBasicBlock to check for dominance relation
	/// if the instructions are in the same basic block. Otherwise, use dominator			/// if the instructions are in the same basic block. Otherwise, use dominator
	/// tree.			/// tree.
	bool OrderedInstructions::dominates(const Instruction *InstA,			bool OrderedInstructions::dominates(const Instruction *InstA,
	const Instruction *InstB) const {			const Instruction *InstB) const {
				assert(DT && "Uninitialized dominator tree");
	const BasicBlock *IBB = InstA->getParent();			const BasicBlock *IBB = InstA->getParent();
	// Use ordered basic block to do dominance check in case the 2 instructions			// Use ordered basic block to do dominance check in case the 2 instructions
	// are in the same basic block.			// are in the same basic block.
	if (IBB == InstB->getParent()) {			if (IBB == InstB->getParent()) {
	auto OBB = OBBMap.find(IBB);			auto OBB = OBBMap.find(IBB);
	if (OBB == OBBMap.end())			if (OBB == OBBMap.end())
	OBB = OBBMap.insert({IBB, make_unique<OrderedBasicBlock>(IBB)}).first;			OBB = OBBMap.insert({IBB, make_unique<OrderedBasicBlock>(IBB)}).first;
	return OBB->second->dominates(InstA, InstB);			return OBB->second->dominates(InstA, InstB);
	} else {			} else {
	return DT->dominates(InstA->getParent(), InstB->getParent());			return DT->dominates(InstA->getParent(), InstB->getParent());
	}			}
	}			}
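
The caching pattern above, and the reason the LICM changes call `invalidateBlock` before moving or erasing instructions, can be sketched with a toy cache. This is an assumption-laden illustration: `ToyBlock` and `ToyOrderedInstructions` are hypothetical, instructions are plain integer ids, and the per-block order is computed eagerly on first use, which is a simplification rather than a description of how `OrderedBasicBlock` works.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <utility>
#include <vector>

using ToyBlock = std::vector<int>; // instruction ids in program order

// Hypothetical cache: remembers the position of every instruction in a block
// the first time the block is queried, until the block is invalidated.
class ToyOrderedInstructions {
  mutable std::unordered_map<const ToyBlock *,
                             std::unordered_map<int, std::size_t>>
      Cache;

public:
  // Returns true if instruction A appears before instruction B in BB.
  bool comesBefore(const ToyBlock &BB, int A, int B) const {
    auto It = Cache.find(&BB);
    if (It == Cache.end()) {
      std::unordered_map<int, std::size_t> Pos;
      for (std::size_t I = 0; I < BB.size(); ++I)
        Pos[BB[I]] = I;
      It = Cache.emplace(&BB, std::move(Pos)).first;
    }
    return It->second.at(A) < It->second.at(B);
  }

  // Must be called whenever instructions in BB are moved, added, or erased;
  // otherwise later queries would answer from stale positions.
  void invalidateBlock(const ToyBlock &BB) { Cache.erase(&BB); }
};
```

If a block `{10, 20, 30}` is queried and then reordered to `{30, 10, 20}` without invalidation, the cache would still report 10 before 30. Calling `invalidateBlock` first forces the order to be recomputed on the next query.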

test/Transforms/LICM/loop-early-exits.ll

This file was added.

				; RUN: opt -S -licm < %s | FileCheck %s

				declare void @use(i64 %a)
				declare void @use_nothing()
				declare void @call_nothrow() nounwind

				; We can move this udiv out of the loop as it comes before
				; the call instruction that may throw.
				define void @throw_header1(i64 %x, i64 %y, i1* %cond) {
				; CHECK-LABEL: throw_header1
				; CHECK: %div = udiv i64 %x, %y
				; CHECK-LABEL: loop
				; CHECK: call void @use(i64 %div)
				entry:
				br label %loop

				loop: ; preds = %entry, %for.inc
				%div = udiv i64 %x, %y
				call void @use(i64 %div)
				br label %loop
				}

				; We cannot move this udiv out of the loop as it comes after
				; the call instruction that may throw.
				define void @throw_header2(i64 %x, i64 %y, i1* %cond) {
				; CHECK-LABEL: throw_header2
				; CHECK-LABEL: loop
				; CHECK: call void @use_nothing()
				; CHECK: %div = udiv i64 %x, %y
				entry:
				br label %loop

				loop: ; preds = %entry, %for.inc
				call void @use_nothing()
				%div = udiv i64 %x, %y
				call void @use(i64 %div)
				br label %loop
				}

				; We can move this udiv out of the loop as it comes before
				; the call instruction that may throw.
				define void @throw_body1(i64 %x, i64 %y, i1* %cond) {
				; CHECK-LABEL: throw_body1
				; CHECK: %div = udiv i64 %x, %y
				; CHECK-LABEL: loop
				entry:
				br label %loop

				loop: ; preds = %entry, %for.inc
				br label %body

				body:
				%div = udiv i64 %x, %y
				call void @use(i64 %div)
				br i1 undef, label %loop, label %exit

				exit:
				ret void
				}

				; We cannot move this udiv out of the loop as it comes after
				; the call instruction that may throw.
				define void @throw_body2(i64 %x, i64 %y, i1* %cond) {
				; CHECK-LABEL: throw_body2
				; CHECK-LABEL: loop
				; CHECK: call void @use_nothing()
				; CHECK: %div = udiv i64 %x, %y
				entry:
				br label %loop

				loop: ; preds = %entry, %for.inc
				br label %body

				body:
				call void @use_nothing()
				%div = udiv i64 %x, %y
				call void @use(i64 %div)
				br i1 undef, label %loop, label %exit

				exit:
				ret void
				}

				efriedma: Could you add a testcase with a udiv after a nothrow call, to make it clear that an early exit doesn't necessarily involve unwinding?

				; We cannot move this udiv out of the loop as it comes after
				; a call instruction that may not transfer execution to its successor,
				; even though it does not throw.
				define void @throw_body3(i64 %x, i64 %y, i1* %cond) {
				; CHECK-LABEL: throw_body3
				; CHECK-LABEL: body
				; CHECK: call void @call_nothrow()
				; CHECK: %div = udiv i64 %x, %y
				entry:
				br label %loop

				loop: ; preds = %entry, %for.inc
				br label %body

				body:
				call void @call_nothrow()
				%div = udiv i64 %x, %y
				call void @use(i64 %div)
				br i1 undef, label %loop, label %exit

				exit:
				ret void
				}
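
The five tests above all check one rule: within a block that is otherwise guaranteed to execute, an instruction can be hoisted on that basis only if no earlier instruction in the block might exit early. A rough standalone model of that check is sketched below. `ModelInst`, `firstEarlyExit`, and `precedesAllEarlyExits` are hypothetical names, not part of the patch, and the `MayExitEarly` flags are assumptions chosen to mirror the test expectations (including treating the `nounwind` call as a possible early exit).

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical model of one instruction in a loop block.
struct ModelInst {
  std::string Name;
  // True if the instruction may throw or otherwise fail to transfer
  // execution to its successor (assumption: set by hand in this sketch).
  bool MayExitEarly;
};

// Index of the first instruction that may exit early, or the block size if
// there is none.
std::size_t firstEarlyExit(const std::vector<ModelInst> &Block) {
  for (std::size_t I = 0; I < Block.size(); ++I)
    if (Block[I].MayExitEarly)
      return I;
  return Block.size();
}

// True if the instruction at Idx is reached before any possible early exit,
// i.e. it executes whenever the block is entered.
bool precedesAllEarlyExits(const std::vector<ModelInst> &Block,
                           std::size_t Idx) {
  return Idx <= firstEarlyExit(Block);
}
```

Under this model the udiv in throw_header1 and throw_body1 sits at index 0 ahead of the call and passes the check, while in throw_header2, throw_body2, and throw_body3 it sits at index 1 behind a call flagged as a possible early exit and fails it.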

test/Transforms/LICM/preheader-safe.ll

[15 lines not shown]
 loop: ; preds = %entry, %for.inc
   %div = udiv i64 %x, %y
   br label %loop2

 loop2:
   call void @use_nothrow(i64 %div)
   br label %loop
 }
-; Negative test
-define void @throw_header(i64 %x, i64 %y, i1* %cond) {
-; CHECK-LABEL: throw_header
-; CHECK-LABEL: loop
-; CHECK: %div = udiv i64 %x, %y
-; CHECK: call void @use(i64 %div)
-entry:
-  br label %loop
-
-loop: ; preds = %entry, %for.inc
-  %div = udiv i64 %x, %y
-  call void @use(i64 %div)
-  br label %loop
-}

 ; The header is known no throw, but the loop is not. We can
 ; still lift out of the header.
 define void @nothrow_header(i64 %x, i64 %y, i1 %cond) {
 ; CHECK-LABEL: nothrow_header
 ; CHECK-LABEL: entry
 ; CHECK: %div = udiv i64 %x, %y
 ; CHECK-LABEL: loop
[28 lines not shown]

Revision Contents

Diff 103818

include/llvm/Transforms/Utils/LoopUtils.h

include/llvm/Transforms/Utils/OrderedInstructions.h

lib/Transforms/Scalar/LICM.cpp

lib/Transforms/Scalar/LoopIdiomRecognize.cpp

lib/Transforms/Scalar/LoopUnswitch.cpp

lib/Transforms/Utils/LoopUtils.cpp

lib/Transforms/Utils/OrderedInstructions.cpp

test/Transforms/LICM/loop-early-exits.ll

test/Transforms/LICM/preheader-safe.ll
