This is an archive of the discontinued LLVM Phabricator instance.

[LoopRotate] Add explicit flag to require MSSA.
Abandoned, Public

Authored by asbirlea on Feb 14 2020, 1:07 PM.

Details

Reviewers

dmgreen
fedor.sergeev
nikic
fhahn

Summary

Add an explicit flag to require MSSA when LoopRotate is known to join a
loop pass pipeline with other passes that will require the analysis.
This undoes the pipeline split from D74574, but keeps the analysis from
being run when LoopRotate is run alone in the loop pipeline.
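
To illustrate why requiring the analysis up front avoids the split, here is a minimal toy model. It is not the legacy pass manager's real scheduling algorithm: `ToyLoopPass`, `scheduleToyPipeline` and `toyPipeline` are hypothetical names, and the only rule modelled is the assumption that a function-level analysis cannot be computed inside an open loop pass manager, so requesting a missing one closes the current manager and opens a new one afterwards.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Toy model only: none of these types are LLVM classes.
struct ToyLoopPass {
  std::string Name;
  std::vector<std::string> RequiredFunctionAnalyses;
};

// Schedule loop passes in order and return the printed pipeline structure.
// Assumption: a missing function analysis ends the current loop pass manager,
// runs the analysis, and forces a fresh manager for the requesting pass.
std::vector<std::string>
scheduleToyPipeline(const std::vector<ToyLoopPass> &Passes) {
  std::vector<std::string> Out;
  std::set<std::string> Available;
  bool ManagerOpen = false;
  for (const ToyLoopPass &P : Passes) {
    assert(!P.Name.empty() && "toy passes need a name");
    for (const std::string &A : P.RequiredFunctionAnalyses) {
      if (Available.insert(A).second) { // not computed yet
        ManagerOpen = false;
        Out.push_back(A);
      }
    }
    if (!ManagerOpen) {
      Out.push_back("Loop Pass Manager");
      ManagerOpen = true;
    }
    Out.push_back(P.Name);
  }
  return Out;
}

// LoopRotate followed by LICM, with or without the explicit requirement.
std::vector<std::string> toyPipeline(bool RotateRequiresMSSA) {
  ToyLoopPass Rotate{"Rotate Loops", {}};
  if (RotateRequiresMSSA)
    Rotate.RequiredFunctionAnalyses.push_back("Memory SSA");
  ToyLoopPass LICM{"Loop Invariant Code Motion", {"Memory SSA"}};
  return scheduleToyPipeline({Rotate, LICM});
}
```

In this model, `toyPipeline(false)` yields two loop pass managers with "Memory SSA" between them, while `toyPipeline(true)` computes "Memory SSA" first and keeps both passes in a single manager, which is the shape of the change in the pipeline tests of this revision.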

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

asbirlea created this revision. Feb 14 2020, 1:07 PM

Herald added a project: Restricted Project. Feb 14 2020, 1:07 PM

Herald added subscribers: george.burgess.iv, hiraditya, Prazek.

Harbormaster failed remote builds in B46542: Diff 244745! Feb 14 2020, 1:34 PM

fhahn added inline comments. Feb 19 2020, 2:50 PM

llvm/lib/Transforms/Scalar/LoopRotation.cpp
80	nit: initialise directly in the initialiser list?
88	Would it be slightly simpler to just have RequiresMSSA == true mean that we require and preserve it, and RequiresMSSA == false mean that we neither require nor preserve it? I think both approaches are fine, but in the current pipeline the case where we can preserve MSSA without requiring it is not going to happen, I guess?
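
The two options discussed in this comment can be compared with a small sketch. `ToyUsage`, `usageAsInPatch` and `usageAsSuggested` are hypothetical names and do not use the LLVM `AnalysisUsage` class; the sketch also assumes the existing `EnableMSSALoopDependency` option keeps gating both variants.

```cpp
#include <cassert>
#include <set>
#include <string>

// Toy stand-in for an analysis-usage record; not the LLVM AnalysisUsage class.
struct ToyUsage {
  std::set<std::string> Required;
  std::set<std::string> Preserved;
};

// Policy in the diff under review: preserve whenever MSSA use is enabled,
// require only when the flag is set.
ToyUsage usageAsInPatch(bool EnableMSSA, bool RequiresMSSA) {
  ToyUsage U;
  if (EnableMSSA) {
    if (RequiresMSSA)
      U.Required.insert("MemorySSA");
    U.Preserved.insert("MemorySSA");
  }
  return U;
}

// Reviewer's simpler alternative: the flag controls both at once.
ToyUsage usageAsSuggested(bool EnableMSSA, bool RequiresMSSA) {
  ToyUsage U;
  if (EnableMSSA && RequiresMSSA) {
    U.Required.insert("MemorySSA");
    U.Preserved.insert("MemorySSA");
  }
  // In this variant the two sets can never differ.
  assert(U.Required == U.Preserved);
  return U;
}
```

The only input where the two policies differ is `EnableMSSA == true` with `RequiresMSSA == false`: the patch still marks the analysis as preserved, the alternative does not.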

This re-regresses PR44408 and PR44889 at ToT today.
I am planning to resolve the underlying cause (a new DT being built when doing MSSA updates) before coming back to this. The regression will not occur when that work is done (I have tested this already).

Rebase at ToT.

Herald added subscribers: kerbowa, nhaehnle, jvesely. Sep 1 2020, 1:58 PM

Rebased after the fix to remove DT recomputation in MSSA. I'm still seeing a small regression on one of the tests in the original PRs (0.6% in instructions on mogrify.i), but it's nowhere near the regression that motivated the pipeline split.
I'm not sure if it's worth pushing this forward if the only motivation is re-merging the pipeline in the LPM, unless it's entirely performance neutral.

@nikic: Would you mind checking the compile-time impact for this patch?

Harbormaster completed remote builds in B70300: Diff 289280. Sep 1 2020, 2:28 PM

Compile-time: https://llvm-compile-time-tracker.com/compare.php?from=553833958fdea48e41a11ee7e9c104c903deadf5&to=cfbffacf4188264ec40c0fcb7dbd92d0e23b1b74&stat=instructions Numbers are mixed with some improvements, some regressions. Largest individual regression is on libclamav_unsp.c with 1.8% at O3 and 4.2% for ThinLTO (pre-link).

Thank you for the results! Looking at libclamav_unsp.c, I'm seeing 3.96% in instructions for ThinLTO, which I'm guessing matches the results you see.
The fluctuations may make sense, since with the additional MSSA updates in LoopRotate it can do more or less work, depending on the updates. For instructions per cycle I'm seeing neutral results (0.5%), and wall time (averaged over 20 runs) is also neutral.

Again, I'm not convinced this is worth pushing forward since it only affects the LPM, but since the change I'm undoing here was a previous large release regression in clang9, I'm glad to see the DomTree work paid off and this is now showing mixed results.
I'm fine with whichever decisions the reviewers lean towards.

nikic mentioned this in D99249: [PassManager] Run additional LICM before LoopRotate. Mar 24 2021, 5:48 AM

Rebased
Fresh perf numbers:
https://llvm-compile-time-tracker.com/compare.php?from=760f4c2069d53ace13d20424b7209759a9186090&to=418931903396b23c29131ffd5fa6fd6b54698272&stat=instructions

nikic mentioned this in D99843: [LoopRotate] Don't split loop pass manager. Apr 3 2021, 12:02 PM

nikic mentioned this in rG59a2f67011ba: [LoopRotate] Don't split loop pass manager. Apr 8 2021, 1:05 PM

I believe this can be abandoned now; it's no longer relevant since LICM now runs before LoopRotate, so MSSA is required at that point anyway.

asbirlea abandoned this revision. Apr 13 2021, 11:49 AM

Revision Contents

Path                                                Size
llvm/include/llvm/Transforms/Scalar.h               2 lines
llvm/lib/Transforms/IPO/PassManagerBuilder.cpp      2 lines
llvm/lib/Transforms/Scalar/LoopRotation.cpp         16 lines
llvm/test/CodeGen/AMDGPU/opt-pipeline.ll            21 lines
llvm/test/Other/opt-O2-pipeline.ll                  7 lines
llvm/test/Other/opt-O3-pipeline-enable-matrix.ll    7 lines
llvm/test/Other/opt-O3-pipeline.ll                  7 lines
llvm/test/Other/opt-Os-pipeline.ll                  7 lines
llvm/test/Other/pass-pipelines.ll                   1 line

Diff 289280

llvm/include/llvm/Transforms/Scalar.h

 // LoopReroll - This pass is a simple loop rerolling pass.
 //
 Pass *createLoopRerollPass();

 //===----------------------------------------------------------------------===//
 //
 // LoopRotate - This pass is a simple loop rotating pass.
 //
-Pass *createLoopRotatePass(int MaxHeaderSize = -1);
+Pass *createLoopRotatePass(int MaxHeaderSize = -1, bool RequiresMSSA = false);

 //===----------------------------------------------------------------------===//
 //
 // LoopIdiom - This pass recognizes and replaces idioms in loops.
 //
 Pass *createLoopIdiomPass();

 //===----------------------------------------------------------------------===//

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

 void PassManagerBuilder::addFunctionSimplificationPasses(
 [...]
   if (EnableSimpleLoopUnswitch) {
     // The simple loop unswitch pass relies on separate cleanup passes. Schedule
     // them first so when we re-process a loop they run before other loop
     // passes.
     MPM.add(createLoopInstSimplifyPass());
     MPM.add(createLoopSimplifyCFGPass());
   }
   // Rotate Loop - disable header duplication at -Oz
-  MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1));
+  MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1, true));
   // TODO: Investigate promotion cap for O1.
   MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
   if (EnableSimpleLoopUnswitch)
     MPM.add(createSimpleLoopUnswitchLegacyPass());
   else
     MPM.add(createLoopUnswitchPass(SizeLevel || OptLevel < 3, DivergentTarget));
   // FIXME: We break the loop pass pipeline here in order to do full
   // simplify-cfg. Eventually loop-simplifycfg should be enhanced to replace the

llvm/lib/Transforms/Scalar/LoopRotation.cpp

   if (AR.MSSA)
     PA.preserve<MemorySSAAnalysis>();
   return PA;
 }

 namespace {

 class LoopRotateLegacyPass : public LoopPass {
   unsigned MaxHeaderSize;
+  bool RequiresMSSA;

 public:
   static char ID; // Pass ID, replacement for typeid
-  LoopRotateLegacyPass(int SpecifiedMaxHeaderSize = -1) : LoopPass(ID) {
+  LoopRotateLegacyPass(int SpecifiedMaxHeaderSize = -1, bool ReqMSSA = false)
+      : LoopPass(ID) {
     initializeLoopRotateLegacyPassPass(*PassRegistry::getPassRegistry());
     if (SpecifiedMaxHeaderSize == -1)
       MaxHeaderSize = DefaultRotationThreshold;
     else
       MaxHeaderSize = unsigned(SpecifiedMaxHeaderSize);
+    RequiresMSSA = ReqMSSA;
   }

   // LCSSA form makes instruction renaming easier.
   void getAnalysisUsage(AnalysisUsage &AU) const override {
     AU.addRequired<AssumptionCacheTracker>();
     AU.addRequired<TargetTransformInfoWrapperPass>();
-    if (EnableMSSALoopDependency)
+    if (EnableMSSALoopDependency) {
+      if (RequiresMSSA)
+        AU.addRequired<MemorySSAWrapperPass>();
       AU.addPreserved<MemorySSAWrapperPass>();
+    }
     getLoopAnalysisUsage(AU);
   }

   bool runOnLoop(Loop *L, LPPassManager &LPM) override {
     if (skipLoop(L))
       return false;
     Function &F = *L->getHeader()->getParent();

     auto *LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();
     const auto *TTI = &getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);
     auto *AC = &getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);
     auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();
     auto &SE = getAnalysis<ScalarEvolutionWrapperPass>().getSE();
     const SimplifyQuery SQ = getBestSimplifyQuery(*this, F);
     Optional<MemorySSAUpdater> MSSAU;
     if (EnableMSSALoopDependency) {
-      // Not requiring MemorySSA and getting it only if available will split
-      // the loop pass pipeline when LoopRotate is being run first.
       auto *MSSAA = getAnalysisIfAvailable<MemorySSAWrapperPass>();
       if (MSSAA)
         MSSAU = MemorySSAUpdater(&MSSAA->getMSSA());
     }
     return LoopRotation(L, LI, TTI, AC, &DT, &SE,
                         MSSAU.hasValue() ? MSSAU.getPointer() : nullptr, SQ,
                         false, MaxHeaderSize, false);
   }
 };
 } // end namespace

 char LoopRotateLegacyPass::ID = 0;
 INITIALIZE_PASS_BEGIN(LoopRotateLegacyPass, "loop-rotate", "Rotate Loops",
                       false, false)
 INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
 INITIALIZE_PASS_DEPENDENCY(LoopPass)
 INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
 INITIALIZE_PASS_DEPENDENCY(MemorySSAWrapperPass)
 INITIALIZE_PASS_END(LoopRotateLegacyPass, "loop-rotate", "Rotate Loops", false,
                     false)

-Pass *llvm::createLoopRotatePass(int MaxHeaderSize) {
-  return new LoopRotateLegacyPass(MaxHeaderSize);
+Pass *llvm::createLoopRotatePass(int MaxHeaderSize, bool RequiresMSSA) {
+  return new LoopRotateLegacyPass(MaxHeaderSize, RequiresMSSA);
 }

llvm/test/CodeGen/AMDGPU/opt-pipeline.ll

 ; GCN-O1-NEXT: Block Frequency Analysis
 ; GCN-O1-NEXT: Lazy Branch Probability Analysis
 ; GCN-O1-NEXT: Lazy Block Frequency Analysis
 ; GCN-O1-NEXT: Optimization Remark Emitter
 ; GCN-O1-NEXT: PGOMemOPSize
 ; GCN-O1-NEXT: Simplify the CFG
 ; GCN-O1-NEXT: Reassociate expressions
 ; GCN-O1-NEXT: Dominator Tree Construction
+; GCN-O1-NEXT: Basic Alias Analysis (stateless AA impl)
+; GCN-O1-NEXT: Function Alias Analysis Results
+; GCN-O1-NEXT: Memory SSA
 ; GCN-O1-NEXT: Natural Loop Information
 ; GCN-O1-NEXT: Canonicalize natural loops
 ; GCN-O1-NEXT: LCSSA Verifier
 ; GCN-O1-NEXT: Loop-Closed SSA Form Pass
-; GCN-O1-NEXT: Basic Alias Analysis (stateless AA impl)
-; GCN-O1-NEXT: Function Alias Analysis Results
 ; GCN-O1-NEXT: Scalar Evolution Analysis
 ; GCN-O1-NEXT: Loop Pass Manager
 ; GCN-O1-NEXT: Rotate Loops
-; GCN-O1-NEXT: Memory SSA
-; GCN-O1-NEXT: Loop Pass Manager
 ; GCN-O1-NEXT: Loop Invariant Code Motion
 ; GCN-O1-NEXT: Post-Dominator Tree Construction
 ; GCN-O1-NEXT: Legacy Divergence Analysis
 ; GCN-O1-NEXT: Loop Pass Manager
 ; GCN-O1-NEXT: Unswitch loops
 ; GCN-O1-NEXT: Simplify the CFG
 ; GCN-O1-NEXT: Dominator Tree Construction
 ; GCN-O1-NEXT: Basic Alias Analysis (stateless AA impl)
 [...]
 ; GCN-O2-NEXT: Natural Loop Information
 ; GCN-O2-NEXT: Lazy Branch Probability Analysis
 ; GCN-O2-NEXT: Lazy Block Frequency Analysis
 ; GCN-O2-NEXT: Optimization Remark Emitter
 ; GCN-O2-NEXT: Tail Call Elimination
 ; GCN-O2-NEXT: Simplify the CFG
 ; GCN-O2-NEXT: Reassociate expressions
 ; GCN-O2-NEXT: Dominator Tree Construction
+; GCN-O2-NEXT: Basic Alias Analysis (stateless AA impl)
+; GCN-O2-NEXT: Function Alias Analysis Results
+; GCN-O2-NEXT: Memory SSA
 ; GCN-O2-NEXT: Natural Loop Information
 ; GCN-O2-NEXT: Canonicalize natural loops
 ; GCN-O2-NEXT: LCSSA Verifier
 ; GCN-O2-NEXT: Loop-Closed SSA Form Pass
-; GCN-O2-NEXT: Basic Alias Analysis (stateless AA impl)
-; GCN-O2-NEXT: Function Alias Analysis Results
 ; GCN-O2-NEXT: Scalar Evolution Analysis
 ; GCN-O2-NEXT: Loop Pass Manager
 ; GCN-O2-NEXT: Rotate Loops
-; GCN-O2-NEXT: Memory SSA
-; GCN-O2-NEXT: Loop Pass Manager
 ; GCN-O2-NEXT: Loop Invariant Code Motion
 ; GCN-O2-NEXT: Post-Dominator Tree Construction
 ; GCN-O2-NEXT: Legacy Divergence Analysis
 ; GCN-O2-NEXT: Loop Pass Manager
 ; GCN-O2-NEXT: Unswitch loops
 ; GCN-O2-NEXT: Simplify the CFG
 ; GCN-O2-NEXT: Dominator Tree Construction
 ; GCN-O2-NEXT: Basic Alias Analysis (stateless AA impl)
 [...]
 ; GCN-O3-NEXT: Natural Loop Information
 ; GCN-O3-NEXT: Lazy Branch Probability Analysis
 ; GCN-O3-NEXT: Lazy Block Frequency Analysis
 ; GCN-O3-NEXT: Optimization Remark Emitter
 ; GCN-O3-NEXT: Tail Call Elimination
 ; GCN-O3-NEXT: Simplify the CFG
 ; GCN-O3-NEXT: Reassociate expressions
 ; GCN-O3-NEXT: Dominator Tree Construction
+; GCN-O3-NEXT: Basic Alias Analysis (stateless AA impl)
+; GCN-O3-NEXT: Function Alias Analysis Results
+; GCN-O3-NEXT: Memory SSA
 ; GCN-O3-NEXT: Natural Loop Information
 ; GCN-O3-NEXT: Canonicalize natural loops
 ; GCN-O3-NEXT: LCSSA Verifier
 ; GCN-O3-NEXT: Loop-Closed SSA Form Pass
-; GCN-O3-NEXT: Basic Alias Analysis (stateless AA impl)
-; GCN-O3-NEXT: Function Alias Analysis Results
 ; GCN-O3-NEXT: Scalar Evolution Analysis
 ; GCN-O3-NEXT: Loop Pass Manager
 ; GCN-O3-NEXT: Rotate Loops
-; GCN-O3-NEXT: Memory SSA
-; GCN-O3-NEXT: Loop Pass Manager
 ; GCN-O3-NEXT: Loop Invariant Code Motion
 ; GCN-O3-NEXT: Post-Dominator Tree Construction
 ; GCN-O3-NEXT: Legacy Divergence Analysis
 ; GCN-O3-NEXT: Loop Pass Manager
 ; GCN-O3-NEXT: Unswitch loops
 ; GCN-O3-NEXT: Simplify the CFG
 ; GCN-O3-NEXT: Dominator Tree Construction
 ; GCN-O3-NEXT: Basic Alias Analysis (stateless AA impl)

llvm/test/Other/opt-O2-pipeline.ll

 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis
 ; CHECK-NEXT: Lazy Block Frequency Analysis
 ; CHECK-NEXT: Optimization Remark Emitter
 ; CHECK-NEXT: Tail Call Elimination
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Reassociate expressions
 ; CHECK-NEXT: Dominator Tree Construction
+; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
+; CHECK-NEXT: Function Alias Analysis Results
+; CHECK-NEXT: Memory SSA
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Canonicalize natural loops
 ; CHECK-NEXT: LCSSA Verifier
 ; CHECK-NEXT: Loop-Closed SSA Form Pass
-; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
-; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Scalar Evolution Analysis
 ; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Rotate Loops
-; CHECK-NEXT: Memory SSA
-; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Loop Invariant Code Motion
 ; CHECK-NEXT: Unswitch loops
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Dominator Tree Construction
 ; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
 ; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis

llvm/test/Other/opt-O3-pipeline-enable-matrix.ll

 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis
 ; CHECK-NEXT: Lazy Block Frequency Analysis
 ; CHECK-NEXT: Optimization Remark Emitter
 ; CHECK-NEXT: Tail Call Elimination
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Reassociate expressions
 ; CHECK-NEXT: Dominator Tree Construction
+; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
+; CHECK-NEXT: Function Alias Analysis Results
+; CHECK-NEXT: Memory SSA
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Canonicalize natural loops
 ; CHECK-NEXT: LCSSA Verifier
 ; CHECK-NEXT: Loop-Closed SSA Form Pass
-; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
-; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Scalar Evolution Analysis
 ; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Rotate Loops
-; CHECK-NEXT: Memory SSA
-; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Loop Invariant Code Motion
 ; CHECK-NEXT: Unswitch loops
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Dominator Tree Construction
 ; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
 ; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis

llvm/test/Other/opt-O3-pipeline.ll

 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis
 ; CHECK-NEXT: Lazy Block Frequency Analysis
 ; CHECK-NEXT: Optimization Remark Emitter
 ; CHECK-NEXT: Tail Call Elimination
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Reassociate expressions
 ; CHECK-NEXT: Dominator Tree Construction
+; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
+; CHECK-NEXT: Function Alias Analysis Results
+; CHECK-NEXT: Memory SSA
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Canonicalize natural loops
 ; CHECK-NEXT: LCSSA Verifier
 ; CHECK-NEXT: Loop-Closed SSA Form Pass
-; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
-; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Scalar Evolution Analysis
 ; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Rotate Loops
-; CHECK-NEXT: Memory SSA
-; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Loop Invariant Code Motion
 ; CHECK-NEXT: Unswitch loops
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Dominator Tree Construction
 ; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
 ; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis

llvm/test/Other/opt-Os-pipeline.ll

 ; CHECK-NEXT: Lazy Block Frequency Analysis
 ; CHECK-NEXT: Optimization Remark Emitter
 ; CHECK-NEXT: Combine redundant instructions
 ; CHECK-NEXT: Optimization Remark Emitter
 ; CHECK-NEXT: Tail Call Elimination
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Reassociate expressions
 ; CHECK-NEXT: Dominator Tree Construction
+; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
+; CHECK-NEXT: Function Alias Analysis Results
+; CHECK-NEXT: Memory SSA
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Canonicalize natural loops
 ; CHECK-NEXT: LCSSA Verifier
 ; CHECK-NEXT: Loop-Closed SSA Form Pass
-; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
-; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Scalar Evolution Analysis
 ; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Rotate Loops
-; CHECK-NEXT: Memory SSA
-; CHECK-NEXT: Loop Pass Manager
 ; CHECK-NEXT: Loop Invariant Code Motion
 ; CHECK-NEXT: Unswitch loops
 ; CHECK-NEXT: Simplify the CFG
 ; CHECK-NEXT: Dominator Tree Construction
 ; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
 ; CHECK-NEXT: Function Alias Analysis Results
 ; CHECK-NEXT: Natural Loop Information
 ; CHECK-NEXT: Lazy Branch Probability Analysis

llvm/test/Other/pass-pipelines.ll

 ; CHECK-O2-NEXT: Function Integration/Inlining
 ; CHECK-O2-NEXT: OpenMP specific optimizations
 ; CHECK-O2-NEXT: Deduce function attributes
 ; Next up is the main function pass pipeline. It shouldn't be split up and
 ; should contain the main loop pass pipeline as well.
 ; CHECK-O2-NEXT: FunctionPass Manager
 ; CHECK-O2-NOT: Manager
 ; CHECK-O2: Loop Pass Manager
-; CHECK-O2: Loop Pass Manager
 ; CHECK-O2-NOT: Manager
 ; FIXME: We shouldn't be pulling out to simplify-cfg and instcombine and
 ; causing new loop pass managers.
 ; CHECK-O2: Simplify the CFG
 ; CHECK-O2-NOT: Manager
 ; CHECK-O2: Combine redundant instructions
 ; CHECK-O2-NOT: Manager
 ; CHECK-O2: Loop Pass Manager
