This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/Transforms/
-
llvm/
-
Transforms/
-
Scalar.h
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
1/6
LoopUnrollPass.cpp
-
test/Transforms/LoopUnroll/
-
Transforms/
-
LoopUnroll/
-
peel-loop-pgo.ll

Differential D36258

Disable loop peeling during full unrolling pass.
ClosedPublic

Authored by tejohnson on Aug 2 2017, 10:12 PM.

Download Raw Diff

Details

Reviewers

chandlerc
davidxl

Commits

rG9a18a6f08b02: Disable loop peeling during full unrolling pass.
rL309966: Disable loop peeling during full unrolling pass.

Summary

Peeling should not occur during the full unrolling invocation early
in the pipeline, but rather later with partial and runtime loop
unrolling. The later loop unrolling invocation will also eventually
utilize profile summary and branch frequency information, which
we would like to use to control peeling. And for ThinLTO we want
to delay peeling until the backend (post thin link) phase, just as
we do for most types of unrolling.

Ensure peeling doesn't occur during the full unrolling invocation
by adding a parameter to the shared implementation function, similar
to the way partial and runtime loop unrolling are disabled.

Performance results for ThinLTO suggest this has a neutral to positive
effect on some internal benchmarks.

Diff Detail

Build Status

Buildable 8954
Build 8954: arc lint + arc unit

Event Timeline

tejohnson created this revision.Aug 2 2017, 10:12 PM

Herald added subscribers: mehdi_amini, mzolotukhin. · View Herald TranscriptAug 2 2017, 10:12 PM

Perhaps it is cleaner to

pass the information whether it is a full unroll to tryToUnroll
add a internal option FullUnrollAllowPeeling and make it off by default? This will be similar to UnrollAllowPeeling flag, but takes precedence if it is full unroll.

lib/Transforms/Scalar/LoopUnrollPass.cpp
204	Move this closer to line 188 above?

In D36258#830096, @davidxl wrote:

Perhaps it is cleaner to

pass the information whether it is a full unroll to tryToUnroll

add a internal option FullUnrollAllowPeeling and make it off by default? This will be similar to UnrollAllowPeeling flag, but takes precedence if it is full unroll.

While I'm somewhat fond of refactoring these interfaces to be less of a mess, would it be better in a follow-up patch? I would also hope it can tackle more than just peeling but encompass several of the numerous parameters.

lib/Transforms/Scalar/LoopUnrollPass.cpp
204	I think this location is more consistent with the surrounding code... I'm reluctant to deviate here from what seems like a very consistent pattern.
1289	Shouldn't this pass `AllowPeeling` since you added that variable above?

davidxl added inline comments.Aug 2 2017, 11:03 PM

lib/Transforms/Scalar/LoopUnrollPass.cpp
204	Right. They all should be interleaved so that settings to the same flag should be side by side, but that is not relevant here.

In D36258#830100, @chandlerc wrote:

In D36258#830096, @davidxl wrote:

Perhaps it is cleaner to

pass the information whether it is a full unroll to tryToUnroll

add a internal option FullUnrollAllowPeeling and make it off by default? This will be similar to UnrollAllowPeeling flag, but takes precedence if it is full unroll.

While I'm somewhat fond of refactoring these interfaces to be less of a mess, would it be better in a follow-up patch? I would also hope it can tackle more than just peeling but encompass several of the numerous parameters.

Right, I wanted to be consistent with what is there for other unrolling features and how we disable them during full unrolling.

lib/Transforms/Scalar/LoopUnrollPass.cpp
204	Right, I put it here to be consistent.
1289	Woops, added that variable then realized I could just pass None directly and forgot to remove it. I'll remove the variable.

Remove extraneous variable

lgtm

This revision is now accepted and ready to land.Aug 3 2017, 10:49 AM

Closed by commit rL309966: Disable loop peeling during full unrolling pass. (authored by tejohnson). · Explain WhyAug 3 2017, 10:53 AM

This revision was automatically updated to reflect the committed changes.

FYI, this patch causes a somewhat serious regression on one of our internal benchmarks. Given we have a custom pass pipeline, it's definitely our responsibility to adapt to upstream changes not the other way around, but it would have helped us recognize the problem if either of two things had happened along with this patch:

The patch had been separated into two pieces: adding the new control know without changing the default and then a small patch changing the default.
The change had been announced in some way. An email to llvm-dev probably would have been sufficient.

Given we're hardly the only ones with custom pass pipelines, I'd encourage everyone involved (author, reviewers, etc..) to think about the upgrade path in similar cases which arise in the future.

In D36258#878507, @reames wrote:

FYI, this patch causes a somewhat serious regression on one of our internal benchmarks. Given we have a custom pass pipeline, it's definitely our responsibility to adapt to upstream changes not the other way around, but it would have helped us recognize the problem if either of two things had happened along with this patch:

The patch had been separated into two pieces: adding the new control know without changing the default and then a small patch changing the default.

The change had been announced in some way. An email to llvm-dev probably would have been sufficient.

Given we're hardly the only ones with custom pass pipelines, I'd encourage everyone involved (author, reviewers, etc..) to think about the upgrade path in similar cases which arise in the future.

That's clearly interesting. This change, however, seems like it should be more-or-less a nop. If you're going to fully unroll, why would peeling help? Do you end up with a shorter critical path through the branches if we "peel" first?

In D36258#878511, @hfinkel wrote:

That's clearly interesting. This change, however, seems like it should be more-or-less a nop. If you're going to fully unroll, why would peeling help? Do you end up with a shorter critical path through the branches if we "peel" first?

In our pass pipeline, we'd end up with loop-unrolling being quite a bit earlier than upstream. This wasn't so much an intentional choice as an accident of history that we really hadn't evaluated up until now. My best guess is that the peeling was accidentally exposing information to other transformations and that the actual unrolling was incidental to the performance swing. Someone on my team is investigating that a bit further to see what actually happened; what I just said was just an educated guess.

Revision Contents

Path

Size

include/

llvm/

Transforms/

Scalar.h

2 lines

lib/

Transforms/

Scalar/

LoopUnrollPass.cpp

50 lines

test/

Transforms/

LoopUnroll/

peel-loop-pgo.ll

5 lines

Diff 109484

include/llvm/Transforms/Scalar.h

	Show First 20 Lines • Show All 178 Lines • ▼ Show 20 Lines
	Pass *createLoopInstSimplifyPass();			Pass *createLoopInstSimplifyPass();

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// LoopUnroll - This pass is a simple loop unrolling pass.			// LoopUnroll - This pass is a simple loop unrolling pass.
	//			//
	Pass *createLoopUnrollPass(int OptLevel = 2, int Threshold = -1, int Count = -1,			Pass *createLoopUnrollPass(int OptLevel = 2, int Threshold = -1, int Count = -1,
	int AllowPartial = -1, int Runtime = -1,			int AllowPartial = -1, int Runtime = -1,
	int UpperBound = -1);			int UpperBound = -1, int AllowPeeling = -1);
	// Create an unrolling pass for full unrolling that uses exact trip count only.			// Create an unrolling pass for full unrolling that uses exact trip count only.
	Pass *createSimpleLoopUnrollPass(int OptLevel = 2);			Pass *createSimpleLoopUnrollPass(int OptLevel = 2);

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// LoopReroll - This pass is a simple loop rerolling pass.			// LoopReroll - This pass is a simple loop rerolling pass.
	//			//
	Pass *createLoopRerollPass();			Pass *createLoopRerollPass();
	▲ Show 20 Lines • Show All 378 Lines • Show Last 20 Lines

lib/Transforms/Scalar/LoopUnrollPass.cpp

Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines
static const unsigned NoThreshold = UINT_MAX;		static const unsigned NoThreshold = UINT_MAX;

/// Gather the various unrolling parameters based on the defaults, compiler		/// Gather the various unrolling parameters based on the defaults, compiler
/// flags, TTI overrides and user specified parameters.		/// flags, TTI overrides and user specified parameters.
static TargetTransformInfo::UnrollingPreferences gatherUnrollingPreferences(		static TargetTransformInfo::UnrollingPreferences gatherUnrollingPreferences(
Loop *L, ScalarEvolution &SE, const TargetTransformInfo &TTI, int OptLevel,		Loop *L, ScalarEvolution &SE, const TargetTransformInfo &TTI, int OptLevel,
Optional<unsigned> UserThreshold, Optional<unsigned> UserCount,		Optional<unsigned> UserThreshold, Optional<unsigned> UserCount,
Optional<bool> UserAllowPartial, Optional<bool> UserRuntime,		Optional<bool> UserAllowPartial, Optional<bool> UserRuntime,
Optional<bool> UserUpperBound) {		Optional<bool> UserUpperBound, Optional<bool> UserAllowPeeling) {
TargetTransformInfo::UnrollingPreferences UP;		TargetTransformInfo::UnrollingPreferences UP;

// Set up the defaults		// Set up the defaults
UP.Threshold = OptLevel > 2 ? 300 : 150;		UP.Threshold = OptLevel > 2 ? 300 : 150;
UP.MaxPercentThresholdBoost = 400;		UP.MaxPercentThresholdBoost = 400;
UP.OptSizeThreshold = 0;		UP.OptSizeThreshold = 0;
UP.PartialThreshold = 150;		UP.PartialThreshold = 150;
UP.PartialOptSizeThreshold = 0;		UP.PartialOptSizeThreshold = 0;
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	static TargetTransformInfo::UnrollingPreferences gatherUnrollingPreferences(
if (UserCount.hasValue())		if (UserCount.hasValue())
UP.Count = *UserCount;		UP.Count = *UserCount;
if (UserAllowPartial.hasValue())		if (UserAllowPartial.hasValue())
UP.Partial = *UserAllowPartial;		UP.Partial = *UserAllowPartial;
if (UserRuntime.hasValue())		if (UserRuntime.hasValue())
UP.Runtime = *UserRuntime;		UP.Runtime = *UserRuntime;
if (UserUpperBound.hasValue())		if (UserUpperBound.hasValue())
UP.UpperBound = *UserUpperBound;		UP.UpperBound = *UserUpperBound;
		if (UserAllowPeeling.hasValue())
		davidxlUnsubmitted Not Done Reply Inline Actions Move this closer to line 188 above? davidxl: Move this closer to line 188 above?
		chandlercUnsubmitted Not Done Reply Inline Actions I think this location is more consistent with the surrounding code... I'm reluctant to deviate here from what seems like a very consistent pattern. chandlerc: I think this location is more consistent with the surrounding code... I'm reluctant to deviate…
		davidxlUnsubmitted Not Done Reply Inline Actions Right. They all should be interleaved so that settings to the same flag should be side by side, but that is not relevant here. davidxl: Right. They all should be interleaved so that settings to the same flag should be side by side…
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions Right, I put it here to be consistent. tejohnson: Right, I put it here to be consistent.
		UP.AllowPeeling = *UserAllowPeeling;

return UP;		return UP;
}		}

namespace {		namespace {
/// A struct to densely store the state of an instruction after unrolling at		/// A struct to densely store the state of an instruction after unrolling at
/// each iteration.		/// each iteration.
///		///
▲ Show 20 Lines • Show All 710 Lines • ▼ Show 20 Lines	#endif
if (UP.Count > UP.MaxCount)		if (UP.Count > UP.MaxCount)
UP.Count = UP.MaxCount;		UP.Count = UP.MaxCount;
DEBUG(dbgs() << " partially unrolling with count: " << UP.Count << "\n");		DEBUG(dbgs() << " partially unrolling with count: " << UP.Count << "\n");
if (UP.Count < 2)		if (UP.Count < 2)
UP.Count = 0;		UP.Count = 0;
return ExplicitUnroll;		return ExplicitUnroll;
}		}

static bool tryToUnrollLoop(Loop L, DominatorTree &DT, LoopInfo LI,		static bool tryToUnrollLoop(
ScalarEvolution &SE, const TargetTransformInfo &TTI,		Loop L, DominatorTree &DT, LoopInfo LI, ScalarEvolution &SE,
AssumptionCache &AC, OptimizationRemarkEmitter &ORE,		const TargetTransformInfo &TTI, AssumptionCache &AC,
bool PreserveLCSSA, int OptLevel,		OptimizationRemarkEmitter &ORE, bool PreserveLCSSA, int OptLevel,
Optional<unsigned> ProvidedCount,		Optional<unsigned> ProvidedCount, Optional<unsigned> ProvidedThreshold,
Optional<unsigned> ProvidedThreshold,		Optional<bool> ProvidedAllowPartial, Optional<bool> ProvidedRuntime,
Optional<bool> ProvidedAllowPartial,		Optional<bool> ProvidedUpperBound, Optional<bool> ProvidedAllowPeeling) {
Optional<bool> ProvidedRuntime,
Optional<bool> ProvidedUpperBound) {
DEBUG(dbgs() << "Loop Unroll: F[" << L->getHeader()->getParent()->getName()		DEBUG(dbgs() << "Loop Unroll: F[" << L->getHeader()->getParent()->getName()
<< "] Loop %" << L->getHeader()->getName() << "\n");		<< "] Loop %" << L->getHeader()->getName() << "\n");
if (HasUnrollDisablePragma(L))		if (HasUnrollDisablePragma(L))
return false;		return false;
if (!L->isLoopSimplifyForm()) {		if (!L->isLoopSimplifyForm()) {
DEBUG(		DEBUG(
dbgs() << " Not unrolling loop which is not in loop-simplify form.\n");		dbgs() << " Not unrolling loop which is not in loop-simplify form.\n");
return false;		return false;
}		}

unsigned NumInlineCandidates;		unsigned NumInlineCandidates;
bool NotDuplicatable;		bool NotDuplicatable;
bool Convergent;		bool Convergent;
TargetTransformInfo::UnrollingPreferences UP = gatherUnrollingPreferences(		TargetTransformInfo::UnrollingPreferences UP = gatherUnrollingPreferences(
L, SE, TTI, OptLevel, ProvidedThreshold, ProvidedCount,		L, SE, TTI, OptLevel, ProvidedThreshold, ProvidedCount,
ProvidedAllowPartial, ProvidedRuntime, ProvidedUpperBound);		ProvidedAllowPartial, ProvidedRuntime, ProvidedUpperBound,
		ProvidedAllowPeeling);
// Exit early if unrolling is disabled.		// Exit early if unrolling is disabled.
if (UP.Threshold == 0 && (!UP.Partial \|\| UP.PartialThreshold == 0))		if (UP.Threshold == 0 && (!UP.Partial \|\| UP.PartialThreshold == 0))
return false;		return false;
unsigned LoopSize = ApproximateLoopSize(		unsigned LoopSize = ApproximateLoopSize(
L, NumInlineCandidates, NotDuplicatable, Convergent, TTI, &AC, UP.BEInsns);		L, NumInlineCandidates, NotDuplicatable, Convergent, TTI, &AC, UP.BEInsns);
DEBUG(dbgs() << " Loop Size = " << LoopSize << "\n");		DEBUG(dbgs() << " Loop Size = " << LoopSize << "\n");
if (NotDuplicatable) {		if (NotDuplicatable) {
DEBUG(dbgs() << " Not unrolling loop which contains non-duplicatable"		DEBUG(dbgs() << " Not unrolling loop which contains non-duplicatable"
▲ Show 20 Lines • Show All 85 Lines • ▼ Show 20 Lines

namespace {		namespace {
class LoopUnroll : public LoopPass {		class LoopUnroll : public LoopPass {
public:		public:
static char ID; // Pass ID, replacement for typeid		static char ID; // Pass ID, replacement for typeid
LoopUnroll(int OptLevel = 2, Optional<unsigned> Threshold = None,		LoopUnroll(int OptLevel = 2, Optional<unsigned> Threshold = None,
Optional<unsigned> Count = None,		Optional<unsigned> Count = None,
Optional<bool> AllowPartial = None, Optional<bool> Runtime = None,		Optional<bool> AllowPartial = None, Optional<bool> Runtime = None,
Optional<bool> UpperBound = None)		Optional<bool> UpperBound = None,
		Optional<bool> AllowPeeling = None)
: LoopPass(ID), OptLevel(OptLevel), ProvidedCount(std::move(Count)),		: LoopPass(ID), OptLevel(OptLevel), ProvidedCount(std::move(Count)),
ProvidedThreshold(Threshold), ProvidedAllowPartial(AllowPartial),		ProvidedThreshold(Threshold), ProvidedAllowPartial(AllowPartial),
ProvidedRuntime(Runtime), ProvidedUpperBound(UpperBound) {		ProvidedRuntime(Runtime), ProvidedUpperBound(UpperBound),
		ProvidedAllowPeeling(AllowPeeling) {
initializeLoopUnrollPass(*PassRegistry::getPassRegistry());		initializeLoopUnrollPass(*PassRegistry::getPassRegistry());
}		}

int OptLevel;		int OptLevel;
Optional<unsigned> ProvidedCount;		Optional<unsigned> ProvidedCount;
Optional<unsigned> ProvidedThreshold;		Optional<unsigned> ProvidedThreshold;
Optional<bool> ProvidedAllowPartial;		Optional<bool> ProvidedAllowPartial;
Optional<bool> ProvidedRuntime;		Optional<bool> ProvidedRuntime;
Optional<bool> ProvidedUpperBound;		Optional<bool> ProvidedUpperBound;
		Optional<bool> ProvidedAllowPeeling;

bool runOnLoop(Loop *L, LPPassManager &) override {		bool runOnLoop(Loop *L, LPPassManager &) override {
if (skipLoop(L))		if (skipLoop(L))
return false;		return false;

Function &F = *L->getHeader()->getParent();		Function &F = *L->getHeader()->getParent();

auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();		auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();
LoopInfo *LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();		LoopInfo *LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();
ScalarEvolution &SE = getAnalysis<ScalarEvolutionWrapperPass>().getSE();		ScalarEvolution &SE = getAnalysis<ScalarEvolutionWrapperPass>().getSE();
const TargetTransformInfo &TTI =		const TargetTransformInfo &TTI =
getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);		getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);
auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);		auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);
// For the old PM, we can't use OptimizationRemarkEmitter as an analysis		// For the old PM, we can't use OptimizationRemarkEmitter as an analysis
// pass. Function analyses need to be preserved across loop transformations		// pass. Function analyses need to be preserved across loop transformations
// but ORE cannot be preserved (see comment before the pass definition).		// but ORE cannot be preserved (see comment before the pass definition).
OptimizationRemarkEmitter ORE(&F);		OptimizationRemarkEmitter ORE(&F);
bool PreserveLCSSA = mustPreserveAnalysisID(LCSSAID);		bool PreserveLCSSA = mustPreserveAnalysisID(LCSSAID);

return tryToUnrollLoop(L, DT, LI, SE, TTI, AC, ORE, PreserveLCSSA, OptLevel,		return tryToUnrollLoop(L, DT, LI, SE, TTI, AC, ORE, PreserveLCSSA, OptLevel,
ProvidedCount, ProvidedThreshold,		ProvidedCount, ProvidedThreshold,
ProvidedAllowPartial, ProvidedRuntime,		ProvidedAllowPartial, ProvidedRuntime,
ProvidedUpperBound);		ProvidedUpperBound, ProvidedAllowPeeling);
}		}

/// This transformation requires natural loop information & requires that		/// This transformation requires natural loop information & requires that
/// loop preheaders be inserted into the CFG...		/// loop preheaders be inserted into the CFG...
///		///
void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.addRequired<AssumptionCacheTracker>();		AU.addRequired<AssumptionCacheTracker>();
AU.addRequired<TargetTransformInfoWrapperPass>();		AU.addRequired<TargetTransformInfoWrapperPass>();
// FIXME: Loop passes are required to preserve domtree, and for now we just		// FIXME: Loop passes are required to preserve domtree, and for now we just
// recreate dom info if anything gets unrolled.		// recreate dom info if anything gets unrolled.
getLoopAnalysisUsage(AU);		getLoopAnalysisUsage(AU);
}		}
};		};
}		}

char LoopUnroll::ID = 0;		char LoopUnroll::ID = 0;
INITIALIZE_PASS_BEGIN(LoopUnroll, "loop-unroll", "Unroll loops", false, false)		INITIALIZE_PASS_BEGIN(LoopUnroll, "loop-unroll", "Unroll loops", false, false)
INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)		INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
INITIALIZE_PASS_DEPENDENCY(LoopPass)		INITIALIZE_PASS_DEPENDENCY(LoopPass)
INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
INITIALIZE_PASS_END(LoopUnroll, "loop-unroll", "Unroll loops", false, false)		INITIALIZE_PASS_END(LoopUnroll, "loop-unroll", "Unroll loops", false, false)

Pass *llvm::createLoopUnrollPass(int OptLevel, int Threshold, int Count,		Pass *llvm::createLoopUnrollPass(int OptLevel, int Threshold, int Count,
int AllowPartial, int Runtime,		int AllowPartial, int Runtime, int UpperBound,
int UpperBound) {		int AllowPeeling) {
// TODO: It would make more sense for this function to take the optionals		// TODO: It would make more sense for this function to take the optionals
// directly, but that's dangerous since it would silently break out of tree		// directly, but that's dangerous since it would silently break out of tree
// callers.		// callers.
return new LoopUnroll(		return new LoopUnroll(
OptLevel, Threshold == -1 ? None : Optional<unsigned>(Threshold),		OptLevel, Threshold == -1 ? None : Optional<unsigned>(Threshold),
Count == -1 ? None : Optional<unsigned>(Count),		Count == -1 ? None : Optional<unsigned>(Count),
AllowPartial == -1 ? None : Optional<bool>(AllowPartial),		AllowPartial == -1 ? None : Optional<bool>(AllowPartial),
Runtime == -1 ? None : Optional<bool>(Runtime),		Runtime == -1 ? None : Optional<bool>(Runtime),
UpperBound == -1 ? None : Optional<bool>(UpperBound));		UpperBound == -1 ? None : Optional<bool>(UpperBound),
		AllowPeeling == -1 ? None : Optional<bool>(AllowPeeling));
}		}

Pass *llvm::createSimpleLoopUnrollPass(int OptLevel) {		Pass *llvm::createSimpleLoopUnrollPass(int OptLevel) {
return llvm::createLoopUnrollPass(OptLevel, -1, -1, 0, 0, 0);		return llvm::createLoopUnrollPass(OptLevel, -1, -1, 0, 0, 0, 0);
}		}

PreservedAnalyses LoopFullUnrollPass::run(Loop &L, LoopAnalysisManager &AM,		PreservedAnalyses LoopFullUnrollPass::run(Loop &L, LoopAnalysisManager &AM,
LoopStandardAnalysisResults &AR,		LoopStandardAnalysisResults &AR,
LPMUpdater &Updater) {		LPMUpdater &Updater) {
const auto &FAM =		const auto &FAM =
AM.getResult<FunctionAnalysisManagerLoopProxy>(L, AR).getManager();		AM.getResult<FunctionAnalysisManagerLoopProxy>(L, AR).getManager();
Function *F = L.getHeader()->getParent();		Function *F = L.getHeader()->getParent();
Show All 13 Lines	if (ParentL)
OldLoops.insert(ParentL->begin(), ParentL->end());		OldLoops.insert(ParentL->begin(), ParentL->end());
else		else
OldLoops.insert(AR.LI.begin(), AR.LI.end());		OldLoops.insert(AR.LI.begin(), AR.LI.end());

bool Changed =		bool Changed =
tryToUnrollLoop(&L, AR.DT, &AR.LI, AR.SE, AR.TTI, AR.AC, *ORE,		tryToUnrollLoop(&L, AR.DT, &AR.LI, AR.SE, AR.TTI, AR.AC, *ORE,
/PreserveLCSSA/ true, OptLevel, /Count/ None,		/PreserveLCSSA/ true, OptLevel, /Count/ None,
/Threshold/ None, /AllowPartial/ false,		/Threshold/ None, /AllowPartial/ false,
/Runtime/ false, /UpperBound/ false);		/Runtime/ false, /UpperBound/ false,
		/AllowPeeling/ false);
if (!Changed)		if (!Changed)
return PreservedAnalyses::all();		return PreservedAnalyses::all();

// The parent must not be damaged by unrolling!		// The parent must not be damaged by unrolling!
#ifndef NDEBUG		#ifndef NDEBUG
if (ParentL)		if (ParentL)
ParentL->verifyLoop();		ParentL->verifyLoop();
#endif		#endif
▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines
#ifndef NDEBUG		#ifndef NDEBUG
Loop *ParentL = L.getParentLoop();		Loop *ParentL = L.getParentLoop();
#endif		#endif

// The API here is quite complex to call, but there are only two interesting		// The API here is quite complex to call, but there are only two interesting
// states we support: partial and full (or "simple") unrolling. However, to		// states we support: partial and full (or "simple") unrolling. However, to
// enable these things we actually pass "None" in for the optional to avoid		// enable these things we actually pass "None" in for the optional to avoid
// providing an explicit choice.		// providing an explicit choice.
Optional<bool> AllowPartialParam, RuntimeParam, UpperBoundParam;		Optional<bool> AllowPartialParam, RuntimeParam, UpperBoundParam,
		AllowPeeling;
bool CurChanged = tryToUnrollLoop(		bool CurChanged = tryToUnrollLoop(
&L, DT, &LI, SE, TTI, AC, ORE,		&L, DT, &LI, SE, TTI, AC, ORE,
/PreserveLCSSA/ true, OptLevel, /Count/ None,		/PreserveLCSSA/ true, OptLevel, /Count/ None,
/Threshold/ None, AllowPartialParam, RuntimeParam, UpperBoundParam);		/Threshold/ None, AllowPartialParam, RuntimeParam, UpperBoundParam,
		/AllowPeeling/ None);
		chandlercUnsubmitted Done Reply Inline Actions Shouldn't this pass `AllowPeeling` since you added that variable above? chandlerc: Shouldn't this pass `AllowPeeling` since you added that variable above?
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions Woops, added that variable then realized I could just pass None directly and forgot to remove it. I'll remove the variable. tejohnson: Woops, added that variable then realized I could just pass None directly and forgot to remove…
Changed \|= CurChanged;		Changed \|= CurChanged;

// The parent must not be damaged by unrolling!		// The parent must not be damaged by unrolling!
#ifndef NDEBUG		#ifndef NDEBUG
if (CurChanged && ParentL)		if (CurChanged && ParentL)
ParentL->verifyLoop();		ParentL->verifyLoop();
#endif		#endif
}		}

if (!Changed)		if (!Changed)
return PreservedAnalyses::all();		return PreservedAnalyses::all();

return getLoopPassPreservedAnalyses();		return getLoopPassPreservedAnalyses();
}		}

test/Transforms/LoopUnroll/peel-loop-pgo.ll

	; RUN: opt < %s -S -debug-only=loop-unroll -loop-unroll 2>&1 \| FileCheck %s			; RUN: opt < %s -S -debug-only=loop-unroll -loop-unroll 2>&1 \| FileCheck %s
				; RUN: opt < %s -S -debug-only=loop-unroll -passes='require<opt-remark-emit>,unroll' 2>&1 \| FileCheck %s
	; REQUIRES: asserts			; REQUIRES: asserts

	; Make sure we use the profile information correctly to peel-off 3 iterations			; Make sure we use the profile information correctly to peel-off 3 iterations
	; from the loop, and update the branch weights for the peeled loop properly.			; from the loop, and update the branch weights for the peeled loop properly.

	; CHECK: Loop Unroll: F[basic]			; CHECK: Loop Unroll: F[basic]
	; CHECK: PEELING loop %for.body with iteration count 3!			; CHECK: PEELING loop %for.body with iteration count 3!
	; CHECK: Loop Unroll: F[optsize]			; CHECK: Loop Unroll: F[optsize]
	; CHECK-NOT: PEELING			; CHECK-NOT: PEELING

				; Confirm that no peeling occurs when we are performing full unrolling.
				; RUN: opt < %s -S -debug-only=loop-unroll -passes='require<opt-remark-emit>,loop(unroll-full)' 2>&1 \| FileCheck %s --check-prefix=FULLUNROLL
				; FULLUNROLL-NOT: PEELING

	; CHECK-LABEL: @basic			; CHECK-LABEL: @basic
	; CHECK: br i1 %{{.}}, label %[[NEXT0:.]], label %for.cond.for.end_crit_edge, !prof !1			; CHECK: br i1 %{{.}}, label %[[NEXT0:.]], label %for.cond.for.end_crit_edge, !prof !1
	; CHECK: [[NEXT0]]:			; CHECK: [[NEXT0]]:
	; CHECK: br i1 %{{.}}, label %[[NEXT1:.]], label %for.cond.for.end_crit_edge, !prof !2			; CHECK: br i1 %{{.}}, label %[[NEXT1:.]], label %for.cond.for.end_crit_edge, !prof !2
	; CHECK: [[NEXT1]]:			; CHECK: [[NEXT1]]:
	; CHECK: br i1 %{{.}}, label %[[NEXT2:.]], label %for.cond.for.end_crit_edge, !prof !3			; CHECK: br i1 %{{.}}, label %[[NEXT2:.]], label %for.cond.for.end_crit_edge, !prof !3
	; CHECK: [[NEXT2]]:			; CHECK: [[NEXT2]]:
	; CHECK: br i1 %{{.}}, label %for.body, label %{{.}}, !prof !4			; CHECK: br i1 %{{.}}, label %for.body, label %{{.}}, !prof !4
	▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Disable loop peeling during full unrolling pass.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 109484

include/llvm/Transforms/Scalar.h

lib/Transforms/Scalar/LoopUnrollPass.cpp

test/Transforms/LoopUnroll/peel-loop-pgo.ll

Disable loop peeling during full unrolling pass.
ClosedPublic