This is an archive of the discontinued LLVM Phabricator instance.

Differential D129599

[SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops
Accepted · Public

Authored by drcut on Jul 12 2022, 3:05 PM.

Details

Reviewers

aeubanks
asbirlea

Commits

rGf756f06cc471: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops

Summary

With profile data, non-trivial LoopUnswitch will only apply on non-cold loops, as unswitching cold loops may not gain much benefit but significantly increase the code size.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

drcut created this revision.Jul 12 2022, 3:05 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 12 2022, 3:05 PM

Herald added subscribers: wenlei, hiraditya. · View Herald Transcript

drcut requested review of this revision.Jul 12 2022, 3:05 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 12 2022, 3:05 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

drcut added reviewers: aeubanks, asbirlea.Jul 12 2022, 3:06 PM

the title should be something like [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops

otherwise this looks like the right approach

llvm/lib/Passes/PassBuilder.cpp
1402–1405	run `git clang-format`?
llvm/lib/Transforms/Scalar/SimpleLoopUnswitch.cpp
3121	as we found offline, this may return `nullptr` in the case of `opt -aa-pipeline=`, so we should bail out in that case. I'd add a `RUN: opt -passes=simple-loop-unswitch -aa-pipeline=` to the test to make sure it doesn't crash (no need to check IR)
3237	just directly pass `nullptr` below
3286	these are specific to the legacy pass and don't need to be here since we're not actually using BFI in the legacy pass
llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll
2	I'd just use `llvm/utils/update_test_checks.py` rather than manually checking loops. Once the test looks good, it should be precommitted so we can see the delta this patch gives

Harbormaster completed remote builds in B174985: Diff 444089.Jul 12 2022, 6:23 PM

[llvm] Implement PGO for SLU
Use PGO in SLU to avoid non-trivial unswitching of cold loops, which increases binary code size without much benefit.

Herald added subscribers: ormris, steven_wu. · View Herald TranscriptAug 4 2022, 11:28 AM

aeubanks added inline comments.Aug 4 2022, 1:56 PM

llvm/lib/Transforms/Scalar/SimpleLoopUnswitch.cpp
3120	still looks like you need to run `git clang-format HEAD~`
llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll
2	the autogenerated CHECKs aren't in here, it should be `; RUN: opt < %s -passes='require<profile-summary>,function(loop-mssa(simple-loop-unswitch<nontrivial>)),print<loops>' -S | FileCheck %s`, then delete the existing CHECKs and run `update_test_checks.py`

Harbormaster completed remote builds in B179354: Diff 450083.Aug 4 2022, 2:25 PM

[llvm] Change test file for PGO on SLU

commit looks good, but the message/title should be updated as I commented before

can you check in just the test now, making sure to regenerate the CHECK lines to work with ToT LLVM (which will unswitch both loops), then rebase this on top of that? so we can see the diff this patch causes. the test commit can be called something like [test][SimpleLoopUnswitch] Precommit test for D129599

Harbormaster completed remote builds in B179393: Diff 450132.Aug 4 2022, 5:49 PM

fhahn added a subscriber: fhahn.Aug 5 2022, 4:12 AM

fhahn added inline comments.

llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll
2	`,print<loops>'` is this needed? It looks like you are not checking the output (which is emitted to stderr)
5	if you are not checking the output, this should probably use `-disable-output`.

aeubanks added inline comments.Aug 5 2022, 8:55 AM

llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll
2	sorry, typo, yeah we don't need `print<loops>`

[test][SimpleLoopUnswitch] Precommit test for D129599

Harbormaster completed remote builds in B179545: Diff 450326.Aug 5 2022, 12:41 PM

drcut mentioned this in rGb5244fb71cae: [test][SimpleLoopUnswitch] Precommit test for D129599.Aug 5 2022, 1:17 PM

[test] Update PGO SLU test

Harbormaster completed remote builds in B179604: Diff 450405.Aug 5 2022, 3:32 PM

lg with the commit title/description change
(one thing you can do is change it in phab, then run arc amend which will pull the commit title/description changes and also add a Reviewed-by: line)

llvm/lib/Transforms/Scalar/SimpleLoopUnswitch.cpp
3217–3231	unintentional change

This revision is now accepted and ready to land.Aug 6 2022, 11:37 AM

Thank you for adding this!

Changes look good. Please address the comment about the title change before committing.

drcut retitled this revision from Apply PGO on SimpleLoopUnswitch to [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops.Aug 8 2022, 11:08 AM

drcut edited the summary of this revision. (Show Details)

This revision was landed with ongoing or failed builds.Aug 8 2022, 11:13 AM

Closed by commit rGf756f06cc471: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops (authored by drcut). · Explain Why

This revision was automatically updated to reflect the committed changes.

drcut added a commit: rGf756f06cc471: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops.

I noticed significant performance degradation (~30%) on a spec benchmark due to this commit. isColdBlock doesn't seem to work as expected, because it considered cold a loop that was in a hot function through the profile.

For example, it could be the case that the loop in question is nested and is vectorizable after being unswitched, so not unswitching it can result in performance degradation if the surrounding function is hot.

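// cond and do_something() are assumed to be declared elsewhere;
// cond is assumed to be invariant in the inner loop.
void hotFunction(int M, int N, int *A, int *B, int *C) {
  for (int j = 0; j < M; j++)
    for (int i = 0; i < N; i++) {
      A[i] = B[i] + C[i];
      if (cond) do_something();
    }
}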

Could you please revert this commit while you work on replacing the isColdBlock check with something stronger?
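
To make the concern above concrete, here is a minimal hand-written sketch of what unswitching does to a loop of this shape. It is not output from the pass. The names `original`, `unswitched`, `doSomething` and `SideEffects` are hypothetical, and `Cond` is passed as a parameter so the example is self-contained.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for do_something(): it only counts its calls.
static int SideEffects = 0;
static void doSomething() { ++SideEffects; }

// Original shape: the loop-invariant test on Cond sits inside the inner loop.
// B and C are assumed to be at least as long as A.
void original(int M, std::vector<int> &A, const std::vector<int> &B,
              const std::vector<int> &C, bool Cond) {
  for (int J = 0; J < M; ++J)
    for (std::size_t I = 0; I < A.size(); ++I) {
      A[I] = B[I] + C[I];
      if (Cond)
        doSomething();
    }
}

// Hand-unswitched shape: the test is hoisted and the inner loop is duplicated.
// The Cond == false copy is a plain element-wise add with no call in its body.
void unswitched(int M, std::vector<int> &A, const std::vector<int> &B,
                const std::vector<int> &C, bool Cond) {
  for (int J = 0; J < M; ++J) {
    if (Cond) {
      for (std::size_t I = 0; I < A.size(); ++I) {
        A[I] = B[I] + C[I];
        doSomething();
      }
    } else {
      for (std::size_t I = 0; I < A.size(); ++I)
        A[I] = B[I] + C[I];
    }
  }
}
```

Both versions compute the same result. The duplicated loop body is the code-size cost the patch tries to avoid, and the call-free copy is the one a vectorizer may be able to handle.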

This revision is now accepted and ready to land.Aug 25 2022, 6:29 AM

@alexgatea Thanks for the feedback.
isColdBlock should be global. One example is in (https://llvm.org/doxygen/ProfileSummaryInfo_8cpp_source.html#l00142), which uses isColdBlock to check whether a function is cold or not. So I assume the issue is that the loop is non-cold in reality but was considered cold in the profile data. I kindly suggest updating the profile data to see if there is still any degradation.
Please correct me if I am wrong. Thanks

In D129599#3749005, @drcut wrote:

@alexgatea Thanks for the feedback.
isColdBlock should be global. One example is in (https://llvm.org/doxygen/ProfileSummaryInfo_8cpp_source.html#l00142), which uses isColdBlock to check whether a function is cold or not. So I assume the issue is that the loop is non-cold in reality but was considered cold in the profile data. I kindly suggest updating the profile data to see if there is still any degradation.
Please correct me if I am wrong. Thanks

I appreciate your prompt response. A couple of points ...
The profile data is updated every time I run the spec benchmark, so it's accurate. And the 30% degradation is significant.
Thank you for pointing out that example; I agree that isColdBlock should be global.
I actually called isFunctionColdInCallGraph on the function in question and it returns false, so this function is hot. I also checked the ColdCount of the loop in question and it is 1 (in particular non-zero); another hot block in this same function has a ColdCount of 2000. My guess is that the block in question is cold relative to other blocks in the function (by a factor of ~2000 I guess) but is still itself significant. And of course, this cold block analysis doesn't take into consideration how much the loop itself is optimizable if unswitched so it could still be beneficial to unswitch even though it is a cold block.
Please let me know your thoughts.

we can change the condition to if the function is cold

In D129599#3749469, @aeubanks wrote:

we can change the condition to if the function is cold

That seems reasonable to me

As suggested by other users, we need to make the skip check more restrictive to avoid performance degradation.

@alexgatea please check whether the new revision solves your problem. Thanks

Harbormaster completed remote builds in B183768: Diff 456138.Aug 27 2022, 12:31 PM

In D129599#3753699, @drcut wrote:

@alexgatea please check whether the new revision solves your problem. Thanks

I have verified that the new revision solves my problem. Thank you!

Avoid applying non-trivial unswitching in cold functions. Compared with the previous PGO solution, this version will still apply non-trivial unswitching to cold loops in hot functions.
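
The difference between the two policies can be sketched with a toy model. This is not LLVM's `ProfileSummaryInfo`: the types, the fixed `ColdThreshold` and the "every block is cold" rule for functions are all assumptions made for illustration, and the real analysis derives its thresholds from the profile summary. The counts 2000 and 1 mirror the ones reported in the thread.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy stand-in, not an LLVM type: one profile count per block, with the
// loop header identified by its index.
struct ToyFunction {
  std::vector<std::uint64_t> BlockCounts;
  std::size_t LoopHeader;
};

// Hypothetical coldness test: a count at or below the threshold is cold.
bool isColdCount(std::uint64_t Count, std::uint64_t ColdThreshold) {
  return Count <= ColdThreshold;
}

// Earlier policy: skip unswitching when the loop header block is cold.
bool skipIfLoopCold(const ToyFunction &F, std::uint64_t ColdThreshold) {
  return isColdCount(F.BlockCounts[F.LoopHeader], ColdThreshold);
}

// Revised policy (toy version): skip only when every block in the
// function is cold.
bool skipIfFunctionCold(const ToyFunction &F, std::uint64_t ColdThreshold) {
  for (std::uint64_t Count : F.BlockCounts)
    if (!isColdCount(Count, ColdThreshold))
      return false;
  return true;
}
```

With block counts `{2000, 1}`, the loop header at index 1 and a threshold of 1, the earlier policy skips the loop while the revised one does not, which is the behavior change described above.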

drcut added a comment.Aug 29 2022, 7:16 AM

This comment was removed by drcut.

Harbormaster completed remote builds in B183920: Diff 456331.Aug 29 2022, 7:57 AM

every patch should be its own phabricator patch, it's not good practice to overwrite one that's been submitted (excluding if the patch was reverted) because people tend to use D### to refer to a patch. can you create a new one?

should have a test with a hot and cold function now, rather than two loops in one function

I will create a new revision for this patch

Sorry for the mistake. I will keep this revision open and link it to the updated revision that addresses the regression.

This revision is now accepted and ready to land.Sep 4 2022, 7:37 AM

revert to previous revision that has been merged into main branch

Harbormaster completed remote builds in B185015: Diff 457861.Sep 4 2022, 9:52 AM

drcut mentioned this in D133275: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold functions.Sep 4 2022, 10:01 AM

drcut added a child revision: D133275: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold functions.

drcut removed a child revision: D133275: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold functions.Sep 6 2022, 7:35 AM

drcut mentioned this in rGfb45f3c9486f: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold functions.Sep 6 2022, 4:13 PM

tejohnson mentioned this in D146383: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loop nests.Mar 19 2023, 8:54 AM

tejohnson mentioned this in rGdfb40d3fd7a2: [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loop nests.Mar 20 2023, 10:15 AM

Revision Contents

Path

Size

llvm/

lib/

Passes/

PassBuilder.cpp

6 lines

Transforms/

Scalar/

SimpleLoopUnswitch.cpp

26 lines

test/

Other/

new-pm-defaults.ll

1 line

new-pm-thinlto-defaults.ll

1 line

new-pm-thinlto-postlink-pgo-defaults.ll

1 line

new-pm-thinlto-postlink-samplepgo-defaults.ll

1 line

new-pm-thinlto-prelink-pgo-defaults.ll

1 line

new-pm-thinlto-prelink-samplepgo-defaults.ll

1 line

Transforms/

LoopPredication/

preserve-bpi.ll

1 line

SimpleLoopUnswitch/

PGO-nontrivial-unswitch.ll

25 lines

nontrivial-unswitch-markloopasdeleted.ll

1 line

Diff 450405

llvm/lib/Passes/PassBuilder.cpp

Show First 20 Lines • Show All 1,393 Lines • ▼ Show 20 Lines	if (Name == "function") {
return Error::success();		return Error::success();
}		}
if (Name == "loop" || Name == "loop-mssa") {		if (Name == "loop" || Name == "loop-mssa") {
LoopPassManager LPM;		LoopPassManager LPM;
if (auto Err = parseLoopPassPipeline(LPM, InnerPipeline))		if (auto Err = parseLoopPassPipeline(LPM, InnerPipeline))
return Err;		return Err;
// Add the nested pass manager with the appropriate adaptor.		// Add the nested pass manager with the appropriate adaptor.
bool UseMemorySSA = (Name == "loop-mssa");		bool UseMemorySSA = (Name == "loop-mssa");
bool UseBFI = llvm::any_of(		bool UseBFI = llvm::any_of(InnerPipeline, [](auto Pipeline) {
InnerPipeline, [](auto Pipeline) { return Pipeline.Name == "licm"; });		return Pipeline.Name.contains("licm") ||
		Pipeline.Name.contains("simple-loop-unswitch");
		});
bool UseBPI = llvm::any_of(InnerPipeline, [](auto Pipeline) {		bool UseBPI = llvm::any_of(InnerPipeline, [](auto Pipeline) {
return Pipeline.Name == "loop-predication";		return Pipeline.Name == "loop-predication";
});		});
FPM.addPass(createFunctionToLoopPassAdaptor(std::move(LPM), UseMemorySSA,		FPM.addPass(createFunctionToLoopPassAdaptor(std::move(LPM), UseMemorySSA,
UseBFI, UseBPI));		UseBFI, UseBPI));
return Error::success();		return Error::success();
}		}
if (auto Count = parseRepeatPassName(Name)) {		if (auto Count = parseRepeatPassName(Name)) {
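
One plausible reason the new `UseBFI` check uses `contains` rather than `==` is that a pass name in a pipeline string can carry parameters, as in `simple-loop-unswitch<nontrivial>` from the test RUN line. The sketch below shows the difference using only the standard library. `anyExact` and `anyContains` are hypothetical helpers, and `std::string_view::find` stands in for `StringRef::contains`.

```cpp
#include <algorithm>
#include <string_view>
#include <vector>

// Exact comparison, in the spirit of the old `Pipeline.Name == "licm"` check.
bool anyExact(const std::vector<std::string_view> &Names,
              std::string_view Want) {
  return std::any_of(Names.begin(), Names.end(),
                     [Want](std::string_view Name) { return Name == Want; });
}

// Substring comparison, in the spirit of the new `Name.contains(...)` check.
bool anyContains(const std::vector<std::string_view> &Names,
                 std::string_view Want) {
  return std::any_of(Names.begin(), Names.end(), [Want](std::string_view Name) {
    return Name.find(Want) != std::string_view::npos;
  });
}
```

For a pipeline containing only `simple-loop-unswitch<nontrivial>`, an exact comparison against `simple-loop-unswitch` finds nothing, while the substring check matches.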

llvm/lib/Transforms/Scalar/SimpleLoopUnswitch.cpp

#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/Sequence.h"		#include "llvm/ADT/Sequence.h"
#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
		#include "llvm/Analysis/BlockFrequencyInfo.h"
#include "llvm/Analysis/CFG.h"		#include "llvm/Analysis/CFG.h"
#include "llvm/Analysis/CodeMetrics.h"		#include "llvm/Analysis/CodeMetrics.h"
#include "llvm/Analysis/GuardUtils.h"		#include "llvm/Analysis/GuardUtils.h"
#include "llvm/Analysis/LoopAnalysisManager.h"		#include "llvm/Analysis/LoopAnalysisManager.h"
#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/LoopIterator.h"		#include "llvm/Analysis/LoopIterator.h"
#include "llvm/Analysis/LoopPass.h"		#include "llvm/Analysis/LoopPass.h"
#include "llvm/Analysis/MemorySSA.h"		#include "llvm/Analysis/MemorySSA.h"
#include "llvm/Analysis/MemorySSAUpdater.h"		#include "llvm/Analysis/MemorySSAUpdater.h"
#include "llvm/Analysis/MustExecute.h"		#include "llvm/Analysis/MustExecute.h"
		#include "llvm/Analysis/ProfileSummaryInfo.h"
#include "llvm/Analysis/ScalarEvolution.h"		#include "llvm/Analysis/ScalarEvolution.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
▲ Show 20 Lines • Show All 3,002 Lines • ▼ Show 20 Lines
/// If `SE` is non-null, we will update that analysis based on the unswitching		/// If `SE` is non-null, we will update that analysis based on the unswitching
/// done.		/// done.
static bool		static bool
unswitchLoop(Loop &L, DominatorTree &DT, LoopInfo &LI, AssumptionCache &AC,		unswitchLoop(Loop &L, DominatorTree &DT, LoopInfo &LI, AssumptionCache &AC,
AAResults &AA, TargetTransformInfo &TTI, bool Trivial,		AAResults &AA, TargetTransformInfo &TTI, bool Trivial,
bool NonTrivial,		bool NonTrivial,
function_ref<void(bool, bool, ArrayRef<Loop *>)> UnswitchCB,		function_ref<void(bool, bool, ArrayRef<Loop *>)> UnswitchCB,
ScalarEvolution *SE, MemorySSAUpdater *MSSAU,		ScalarEvolution *SE, MemorySSAUpdater *MSSAU,
		ProfileSummaryInfo *PSI, BlockFrequencyInfo *BFI,
function_ref<void(Loop &, StringRef)> DestroyLoopCB) {		function_ref<void(Loop &, StringRef)> DestroyLoopCB) {
assert(L.isRecursivelyLCSSAForm(DT, LI) &&		assert(L.isRecursivelyLCSSAForm(DT, LI) &&
"Loops must be in LCSSA form before unswitching.");		"Loops must be in LCSSA form before unswitching.");

// Must be in loop simplified form: we need a preheader and dedicated exits.		// Must be in loop simplified form: we need a preheader and dedicated exits.
if (!L.isLoopSimplifyForm())		if (!L.isLoopSimplifyForm())
return false;		return false;

Show All 20 Lines	bool ContinueWithNonTrivial =
EnableNonTrivialUnswitch || (NonTrivial && !TTI.hasBranchDivergence());		EnableNonTrivialUnswitch || (NonTrivial && !TTI.hasBranchDivergence());
if (!ContinueWithNonTrivial)		if (!ContinueWithNonTrivial)
return false;		return false;

// Skip non-trivial unswitching for optsize functions.		// Skip non-trivial unswitching for optsize functions.
if (L.getHeader()->getParent()->hasOptSize())		if (L.getHeader()->getParent()->hasOptSize())
return false;		return false;

		// Skip cold loops, as unswitching them brings little benefit
		// but increases the code size
		if (PSI && PSI->hasProfileSummary() && BFI &&
		PSI->isColdBlock(L.getHeader(), BFI)) {
		LLVM_DEBUG(dbgs() << " Skip cold loop: " << L << "\n");
		return false;
		}

// Skip non-trivial unswitching for loops that cannot be cloned.		// Skip non-trivial unswitching for loops that cannot be cloned.
if (!L.isSafeToClone())		if (!L.isSafeToClone())
return false;		return false;

// For non-trivial unswitching, because it often creates new loops, we rely on		// For non-trivial unswitching, because it often creates new loops, we rely on
// the pass manager to iterate on the loops rather than trying to immediately		// the pass manager to iterate on the loops rather than trying to immediately
// reach a fixed point. There is no substantial advantage to iterating		// reach a fixed point. There is no substantial advantage to iterating
// internally, and if any of the new loops are simplified enough to contain		// internally, and if any of the new loops are simplified enough to contain
Show All 9 Lines	unswitchLoop(Loop &L, DominatorTree &DT, LoopInfo &LI, AssumptionCache &AC,
return false;		return false;
}		}

PreservedAnalyses SimpleLoopUnswitchPass::run(Loop &L, LoopAnalysisManager &AM,		PreservedAnalyses SimpleLoopUnswitchPass::run(Loop &L, LoopAnalysisManager &AM,
LoopStandardAnalysisResults &AR,		LoopStandardAnalysisResults &AR,
LPMUpdater &U) {		LPMUpdater &U) {
Function &F = *L.getHeader()->getParent();		Function &F = *L.getHeader()->getParent();
(void)F;		(void)F;
		ProfileSummaryInfo *PSI = nullptr;
		if (auto OuterProxy =
		AM.getResult<FunctionAnalysisManagerLoopProxy>(L, AR)
		.getCachedResult<ModuleAnalysisManagerFunctionProxy>(F))
		PSI = OuterProxy->getCachedResult<ProfileSummaryAnalysis>(*F.getParent());
LLVM_DEBUG(dbgs() << "Unswitching loop in " << F.getName() << ": " << L		LLVM_DEBUG(dbgs() << "Unswitching loop in " << F.getName() << ": " << L
<< "\n");		<< "\n");

// Save the current loop name in a variable so that we can report it even		// Save the current loop name in a variable so that we can report it even
// after it has been deleted.		// after it has been deleted.
std::string LoopName = std::string(L.getName());		std::string LoopName = std::string(L.getName());

auto UnswitchCB = [&L, &U, &LoopName](bool CurrentLoopValid,		auto UnswitchCB = [&L, &U, &LoopName](bool CurrentLoopValid,
Show All 30 Lines	PreservedAnalyses SimpleLoopUnswitchPass::run(Loop &L, LoopAnalysisManager &AM,
Optional<MemorySSAUpdater> MSSAU;		Optional<MemorySSAUpdater> MSSAU;
if (AR.MSSA) {		if (AR.MSSA) {
MSSAU = MemorySSAUpdater(AR.MSSA);		MSSAU = MemorySSAUpdater(AR.MSSA);
if (VerifyMemorySSA)		if (VerifyMemorySSA)
AR.MSSA->verifyMemorySSA();		AR.MSSA->verifyMemorySSA();
}		}
if (!unswitchLoop(L, AR.DT, AR.LI, AR.AC, AR.AA, AR.TTI, Trivial, NonTrivial,		if (!unswitchLoop(L, AR.DT, AR.LI, AR.AC, AR.AA, AR.TTI, Trivial, NonTrivial,
UnswitchCB, &AR.SE, MSSAU ? MSSAU.getPointer() : nullptr,		UnswitchCB, &AR.SE, MSSAU ? MSSAU.getPointer() : nullptr,
DestroyLoopCB))		PSI, AR.BFI, DestroyLoopCB))
return PreservedAnalyses::all();		return PreservedAnalyses::all();

if (AR.MSSA && VerifyMemorySSA)		if (AR.MSSA && VerifyMemorySSA)
AR.MSSA->verifyMemorySSA();		AR.MSSA->verifyMemorySSA();

// Historically this pass has had issues with the dominator tree so verify it		// Historically this pass has had issues with the dominator tree so verify it
// in asserts builds.		// in asserts builds.
assert(AR.DT.verify(DominatorTree::VerificationLevel::Fast));		assert(AR.DT.verify(DominatorTree::VerificationLevel::Fast));
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
bool SimpleLoopUnswitchLegacyPass::runOnLoop(Loop *L, LPPassManager &LPM) {		bool SimpleLoopUnswitchLegacyPass::runOnLoop(Loop *L, LPPassManager &LPM) {
if (skipLoop(L))		if (skipLoop(L))
return false;		return false;

Function &F = *L->getHeader()->getParent();		Function &F = *L->getHeader()->getParent();

LLVM_DEBUG(dbgs() << "Unswitching loop in " << F.getName() << ": " << *L		LLVM_DEBUG(dbgs() << "Unswitching loop in " << F.getName() << ": " << *L
<< "\n");		<< "\n");

auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();		auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();
auto &LI = getAnalysis<LoopInfoWrapperPass>().getLoopInfo();		auto &LI = getAnalysis<LoopInfoWrapperPass>().getLoopInfo();
auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);		auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);
auto &AA = getAnalysis<AAResultsWrapperPass>().getAAResults();		auto &AA = getAnalysis<AAResultsWrapperPass>().getAAResults();
auto &TTI = getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);		auto &TTI = getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);
MemorySSA *MSSA = &getAnalysis<MemorySSAWrapperPass>().getMSSA();		MemorySSA *MSSA = &getAnalysis<MemorySSAWrapperPass>().getMSSA();
MemorySSAUpdater MSSAU(MSSA);		MemorySSAUpdater MSSAU(MSSA);

auto *SEWP = getAnalysisIfAvailable<ScalarEvolutionWrapperPass>();		auto *SEWP = getAnalysisIfAvailable<ScalarEvolutionWrapperPass>();
auto *SE = SEWP ? &SEWP->getSE() : nullptr;		auto *SE = SEWP ? &SEWP->getSE() : nullptr;

auto UnswitchCB = [&L, &LPM](bool CurrentLoopValid, bool PartiallyInvariant,		auto UnswitchCB = [&L, &LPM](bool CurrentLoopValid, bool PartiallyInvariant,
ArrayRef<Loop *> NewLoops) {		ArrayRef<Loop *> NewLoops) {
// If we did a non-trivial unswitch, we have added new (cloned) loops.		// If we did a non-trivial unswitch, we have added new (cloned) loops.
for (auto *NewL : NewLoops)		for (auto *NewL : NewLoops)
LPM.addLoop(*NewL);		LPM.addLoop(*NewL);

// If the current loop remains valid, re-add it to the queue. This is		// If the current loop remains valid, re-add it to the queue. This is
// a little wasteful as we'll finish processing the current loop as well,		// a little wasteful as we'll finish processing the current loop as well,
// but it is the best we can do in the old PM.		// but it is the best we can do in the old PM.
if (CurrentLoopValid) {		if (CurrentLoopValid) {
// If the current loop has been unswitched using a partially invariant		// If the current loop has been unswitched using a partially invariant
// condition, we should not re-add the current loop to avoid unswitching		// condition, we should not re-add the current loop to avoid unswitching
// on the same condition again.		// on the same condition again.
if (!PartiallyInvariant)		if (!PartiallyInvariant)
LPM.addLoop(*L);		LPM.addLoop(*L);
} else		} else
LPM.markLoopAsDeleted(*L);		LPM.markLoopAsDeleted(*L);
};		};

auto DestroyLoopCB = [&LPM](Loop &L, StringRef /* Name */) {		auto DestroyLoopCB = [&LPM](Loop &L, StringRef /* Name */) {
LPM.markLoopAsDeleted(L);		LPM.markLoopAsDeleted(L);
};		};

if (VerifyMemorySSA)		if (VerifyMemorySSA)
MSSA->verifyMemorySSA();		MSSA->verifyMemorySSA();
		bool Changed =
bool Changed = unswitchLoop(*L, DT, LI, AC, AA, TTI, true, NonTrivial,		unswitchLoop(*L, DT, LI, AC, AA, TTI, true, NonTrivial, UnswitchCB, SE,
UnswitchCB, SE, &MSSAU, DestroyLoopCB);		&MSSAU, nullptr, nullptr, DestroyLoopCB);

if (VerifyMemorySSA)		if (VerifyMemorySSA)
MSSA->verifyMemorySSA();		MSSA->verifyMemorySSA();

// Historically this pass has had issues with the dominator tree so verify it		// Historically this pass has had issues with the dominator tree so verify it
// in asserts builds.		// in asserts builds.
assert(DT.verify(DominatorTree::VerificationLevel::Fast));		assert(DT.verify(DominatorTree::VerificationLevel::Fast));

return Changed;		return Changed;
}		}

char SimpleLoopUnswitchLegacyPass::ID = 0;		char SimpleLoopUnswitchLegacyPass::ID = 0;
INITIALIZE_PASS_BEGIN(SimpleLoopUnswitchLegacyPass, "simple-loop-unswitch",		INITIALIZE_PASS_BEGIN(SimpleLoopUnswitchLegacyPass, "simple-loop-unswitch",
"Simple unswitch loops", false, false)		"Simple unswitch loops", false, false)
INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)		INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)		INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)
INITIALIZE_PASS_DEPENDENCY(LoopInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(LoopInfoWrapperPass)
INITIALIZE_PASS_DEPENDENCY(LoopPass)		INITIALIZE_PASS_DEPENDENCY(LoopPass)
INITIALIZE_PASS_DEPENDENCY(MemorySSAWrapperPass)		INITIALIZE_PASS_DEPENDENCY(MemorySSAWrapperPass)
INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
INITIALIZE_PASS_END(SimpleLoopUnswitchLegacyPass, "simple-loop-unswitch",		INITIALIZE_PASS_END(SimpleLoopUnswitchLegacyPass, "simple-loop-unswitch",
"Simple unswitch loops", false, false)		"Simple unswitch loops", false, false)

Pass *llvm::createSimpleLoopUnswitchLegacyPass(bool NonTrivial) {		Pass *llvm::createSimpleLoopUnswitchLegacyPass(bool NonTrivial) {
return new SimpleLoopUnswitchLegacyPass(NonTrivial);		return new SimpleLoopUnswitchLegacyPass(NonTrivial);
}		}

llvm/test/Other/new-pm-defaults.ll

	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-EP-LOOP-LATE-NEXT: Running pass: NoOpLoopPass			; CHECK-EP-LOOP-LATE-NEXT: Running pass: NoOpLoopPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass

llvm/test/Other/new-pm-thinlto-defaults.ll

	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass
	; CHECK-O-NEXT: Running pass: LoopFullUnrollPass			; CHECK-O-NEXT: Running pass: LoopFullUnrollPass

llvm/test/Other/new-pm-thinlto-postlink-pgo-defaults.ll

	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass
	; CHECK-O-NEXT: Running pass: LoopFullUnrollPass			; CHECK-O-NEXT: Running pass: LoopFullUnrollPass
	▲ Show 20 Lines • Show All 147 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-postlink-samplepgo-defaults.ll

	Show First 20 Lines • Show All 113 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass
	; CHECK-O-NEXT: Running pass: LoopFullUnrollPass			; CHECK-O-NEXT: Running pass: LoopFullUnrollPass
	▲ Show 20 Lines • Show All 120 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-prelink-pgo-defaults.ll

	Show First 20 Lines • Show All 142 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass
	; CHECK-O-NEXT: Running pass: LoopFullUnrollPass			; CHECK-O-NEXT: Running pass: LoopFullUnrollPass
	▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-prelink-samplepgo-defaults.ll

	Show First 20 Lines • Show All 108 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis			; CHECK-O-NEXT: Running analysis: ScalarEvolutionAnalysis
	; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: InnerAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass			; CHECK-O-NEXT: Running pass: LoopInstSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass			; CHECK-O-NEXT: Running pass: LoopSimplifyCFGPass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: LoopRotatePass			; CHECK-O-NEXT: Running pass: LoopRotatePass
	; CHECK-O-NEXT: Running pass: LICM			; CHECK-O-NEXT: Running pass: LICM
	; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass			; CHECK-O-NEXT: Running pass: SimpleLoopUnswitchPass
				; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: LoopSimplifyPass			; CHECK-O-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O-NEXT: Running pass: LCSSAPass			; CHECK-O-NEXT: Running pass: LCSSAPass
	; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass			; CHECK-O-NEXT: Running pass: LoopIdiomRecognizePass
	; CHECK-O-NEXT: Running pass: IndVarSimplifyPass			; CHECK-O-NEXT: Running pass: IndVarSimplifyPass
	; CHECK-O-NEXT: Running pass: LoopDeletionPass			; CHECK-O-NEXT: Running pass: LoopDeletionPass
	; CHECK-O-NEXT: Running pass: SROAPass on foo			; CHECK-O-NEXT: Running pass: SROAPass on foo
	▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopPredication/preserve-bpi.ll

	; RUN: opt -mtriple=x86_64 -passes='loop-mssa(loop-predication,licm,simple-loop-unswitch<nontrivial>,loop-simplifycfg)' -debug-pass-manager -debug-only=branch-prob -S < %s 2>&1 | FileCheck %s			; RUN: opt -mtriple=x86_64 -passes='loop-mssa(loop-predication,licm,simple-loop-unswitch<nontrivial>,loop-simplifycfg)' -debug-pass-manager -debug-only=branch-prob -S < %s 2>&1 | FileCheck %s

	; REQUIRES: asserts			; REQUIRES: asserts

	; This test is to solely check that we do not run BPI every single time loop			; This test is to solely check that we do not run BPI every single time loop
	; predication is invoked (since BPI is preserved as part of			; predication is invoked (since BPI is preserved as part of
	; LoopStandardAnalysisResults).			; LoopStandardAnalysisResults).
	declare void @llvm.experimental.guard(i1, ...)			declare void @llvm.experimental.guard(i1, ...)

	; CHECK: Running pass: LoopPredicationPass on Loop at depth 1			; CHECK: Running pass: LoopPredicationPass on Loop at depth 1
	; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1			; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1
	; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1			; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1
				; CHECK-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-NEXT: Running pass: LoopPredicationPass on Loop at depth 1			; CHECK-NEXT: Running pass: LoopPredicationPass on Loop at depth 1
	; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1			; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1
	; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1			; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1
	; CHECK-NEXT: Running pass: LoopSimplifyCFGPass on Loop at depth 1			; CHECK-NEXT: Running pass: LoopSimplifyCFGPass on Loop at depth 1


	define i32 @unsigned_loop_0_to_n_ult_check(i32* %array, i32 %length, i32 %n) {			define i32 @unsigned_loop_0_to_n_ult_check(i32* %array, i32 %length, i32 %n) {
	entry:			entry:
	Show All 40 Lines

llvm/test/Transforms/SimpleLoopUnswitch/PGO-nontrivial-unswitch.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py

				aeubanks: I'd just use `llvm/utils/update_test_checks.py` rather than manually checking loops. Once the test looks good it should be precommitted so we can see the delta this patch gives.
				aeubanks: the autogenerated CHECKs aren't in here, it should be `; RUN: opt < %s -passes='require<profile-summary>,function(loop-mssa(simple-loop-unswitch<nontrivial>)),print<loops>' -S | FileCheck %s`, then delete the existing CHECKs and run `update_test_checks.py`
				fhahn: `,print<loops>'` is this needed? It looks like you are not checking the output (which is emitted to stderr)
				aeubanks: sorry, typo, yeah we don't need `print<loops>`
	; RUN: opt < %s -passes='require<profile-summary>,function(loop-mssa(simple-loop-unswitch<nontrivial>))' -S | FileCheck %s			; RUN: opt < %s -passes='require<profile-summary>,function(loop-mssa(simple-loop-unswitch<nontrivial>))' -S | FileCheck %s
	; This test checks for a crash.			; This test checks for a crash.
	; RUN: opt < %s -passes=simple-loop-unswitch -aa-pipeline= -disable-output			; RUN: opt < %s -passes=simple-loop-unswitch -aa-pipeline= -disable-output
				fhahn: if you are not checking the output, this should probably use `-disable-output`.

	declare i32 @a()			declare i32 @a()
	declare i32 @b()			declare i32 @b()

	define void @f1(i32 %i, i1 %cond, i1 %hot_cond, i1 %cold_cond, i1* %ptr) !prof !0 {			define void @f1(i32 %i, i1 %cond, i1 %hot_cond, i1 %cold_cond, i1* %ptr) !prof !0 {
	; CHECK-LABEL: @f1(			; CHECK-LABEL: @f1(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[ENTRY_HOT_LOOP:%.*]]			; CHECK-NEXT: br label [[ENTRY_HOT_LOOP:%.*]]
	Show All 27 Lines
	; CHECK-NEXT: br label [[HOT_LOOP_EXIT_LOOPEXIT]]			; CHECK-NEXT: br label [[HOT_LOOP_EXIT_LOOPEXIT]]
	; CHECK: hot_loop_exit.loopexit:			; CHECK: hot_loop_exit.loopexit:
	; CHECK-NEXT: br label [[HOT_LOOP_EXIT]]			; CHECK-NEXT: br label [[HOT_LOOP_EXIT]]
	; CHECK: hot_loop_exit:			; CHECK: hot_loop_exit:
	; CHECK-NEXT: br label [[ENTRY_COLD_LOOP:%.*]]			; CHECK-NEXT: br label [[ENTRY_COLD_LOOP:%.*]]
	; CHECK: entry_cold_loop:			; CHECK: entry_cold_loop:
	; CHECK-NEXT: br i1 [[COLD_COND:%.*]], label [[COLD_LOOP_BEGIN_PREHEADER:%.*]], label [[COLD_LOOP_EXIT:%.*]], !prof [[PROF16:![0-9]+]]			; CHECK-NEXT: br i1 [[COLD_COND:%.*]], label [[COLD_LOOP_BEGIN_PREHEADER:%.*]], label [[COLD_LOOP_EXIT:%.*]], !prof [[PROF16:![0-9]+]]
	; CHECK: cold_loop_begin.preheader:			; CHECK: cold_loop_begin.preheader:
	; CHECK-NEXT: br i1 [[COND]], label [[COLD_LOOP_BEGIN_PREHEADER_SPLIT_US:%.*]], label [[COLD_LOOP_BEGIN_PREHEADER_SPLIT:%.*]]
	; CHECK: cold_loop_begin.preheader.split.us:
	; CHECK-NEXT: br label [[COLD_LOOP_BEGIN_US:%.*]]
	; CHECK: cold_loop_begin.us:
	; CHECK-NEXT: br label [[COLD_LOOP_A_US:%.*]]
	; CHECK: cold_loop_a.us:
	; CHECK-NEXT: [[TMP2:%.*]] = call i32 @a()
	; CHECK-NEXT: br label [[COLD_LOOP_LATCH_US:%.*]]
	; CHECK: cold_loop_latch.us:
	; CHECK-NEXT: [[V2_US:%.*]] = load i1, i1* [[PTR]], align 1
	; CHECK-NEXT: br i1 [[V2_US]], label [[COLD_LOOP_BEGIN_US]], label [[COLD_LOOP_EXIT_LOOPEXIT_SPLIT_US:%.*]]
	; CHECK: cold_loop_exit.loopexit.split.us:
	; CHECK-NEXT: br label [[COLD_LOOP_EXIT_LOOPEXIT:%.*]]
	; CHECK: cold_loop_begin.preheader.split:
	; CHECK-NEXT: br label [[COLD_LOOP_BEGIN:%.*]]			; CHECK-NEXT: br label [[COLD_LOOP_BEGIN:%.*]]
	; CHECK: cold_loop_begin:			; CHECK: cold_loop_begin:
	; CHECK-NEXT: br label [[COLD_LOOP_B:%.*]]			; CHECK-NEXT: br i1 [[COND]], label [[COLD_LOOP_A:%.*]], label [[COLD_LOOP_B:%.*]]
				; CHECK: cold_loop_a:
				; CHECK-NEXT: [[TMP2:%.*]] = call i32 @a()
				; CHECK-NEXT: br label [[COLD_LOOP_LATCH:%.*]]
	; CHECK: cold_loop_b:			; CHECK: cold_loop_b:
	; CHECK-NEXT: [[TMP3:%.*]] = call i32 @b()			; CHECK-NEXT: [[TMP3:%.*]] = call i32 @b()
	; CHECK-NEXT: br label [[COLD_LOOP_LATCH:%.*]]			; CHECK-NEXT: br label [[COLD_LOOP_LATCH]]
	; CHECK: cold_loop_latch:			; CHECK: cold_loop_latch:
	; CHECK-NEXT: [[V2:%.*]] = load i1, i1* [[PTR]], align 1			; CHECK-NEXT: [[V2:%.*]] = load i1, i1* [[PTR]], align 1
	; CHECK-NEXT: br i1 [[V2]], label [[COLD_LOOP_BEGIN]], label [[COLD_LOOP_EXIT_LOOPEXIT_SPLIT:%.*]]			; CHECK-NEXT: br i1 [[V2]], label [[COLD_LOOP_BEGIN]], label [[COLD_LOOP_EXIT_LOOPEXIT:%.*]]
	; CHECK: cold_loop_exit.loopexit.split:
	; CHECK-NEXT: br label [[COLD_LOOP_EXIT_LOOPEXIT]]
	; CHECK: cold_loop_exit.loopexit:			; CHECK: cold_loop_exit.loopexit:
	; CHECK-NEXT: br label [[COLD_LOOP_EXIT]]			; CHECK-NEXT: br label [[COLD_LOOP_EXIT]]
	; CHECK: cold_loop_exit:			; CHECK: cold_loop_exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %entry_hot_loop			br label %entry_hot_loop

	▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines
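
For readers less familiar with the transform this test exercises: non-trivial unswitching hoists a loop-invariant branch out of the loop and keeps one specialized copy of the loop per branch direction, which is why the old CHECK lines contain both `cold_loop_begin.us` and `cold_loop_begin`. The sketch below is only a source-level C++ analogy, not code from this patch; `loop_original`, `loop_unswitched`, and the bodies of `a` and `b` are hypothetical stand-ins for the IR in `@f1`.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-ins for the test's `declare i32 @a()` / `declare i32 @b()`.
static int a() { return 1; }
static int b() { return 2; }

// Original shape: the loop-invariant `cond` is re-tested on every iteration.
std::vector<int> loop_original(bool cond, int trips) {
  std::vector<int> trace;
  for (int i = 0; i < trips; ++i)
    trace.push_back(cond ? a() : b());
  return trace;
}

// Unswitched shape: `cond` is tested once, and each direction gets its own
// copy of the loop. Same behavior, but roughly twice the loop code.
std::vector<int> loop_unswitched(bool cond, int trips) {
  std::vector<int> trace;
  if (cond) {
    for (int i = 0; i < trips; ++i)
      trace.push_back(a());
  } else {
    for (int i = 0; i < trips; ++i)
      trace.push_back(b());
  }
  return trace;
}
```

The duplicated loop is the code-size cost the summary refers to; for a loop that profile data marks as cold, saving one branch per iteration is unlikely to pay for it.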

llvm/test/Transforms/SimpleLoopUnswitch/nontrivial-unswitch-markloopasdeleted.ll

	Show All 12 Lines
	; cleared. A special case here is that loop_a_inner is destroyed when			; cleared. A special case here is that loop_a_inner is destroyed when
	; unswitching the parent loop.			; unswitching the parent loop.
	;			;
	; The bug solved and verified by this test case was related to the			; The bug solved and verified by this test case was related to the
	; SimpleLoopUnswitch not marking the Loop as removed, so we missed clearing			; SimpleLoopUnswitch not marking the Loop as removed, so we missed clearing
	; the analysis caches.			; the analysis caches.
	;			;
	; CHECK: Running pass: SimpleLoopUnswitchPass on Loop at depth 1 containing: %loop_begin<header>,%loop_b,%loop_b_inner,%loop_b_inner_exit,%loop_a,%loop_a_inner,%loop_a_inner_exit,%latch<latch><exiting>			; CHECK: Running pass: SimpleLoopUnswitchPass on Loop at depth 1 containing: %loop_begin<header>,%loop_b,%loop_b_inner,%loop_b_inner_exit,%loop_a,%loop_a_inner,%loop_a_inner_exit,%latch<latch><exiting>
				; CHECK-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-NEXT: Clearing all analysis results for: loop_a_inner			; CHECK-NEXT: Clearing all analysis results for: loop_a_inner


	; When running loop-distribute the second time we can see that loop_a_inner			; When running loop-distribute the second time we can see that loop_a_inner
	; isn't analysed because the loop no longer exists (instead we find a new loop,			; isn't analysed because the loop no longer exists (instead we find a new loop,
	; loop_a_inner.us). This kind of verifies that it was correct to remove the			; loop_a_inner.us). This kind of verifies that it was correct to remove the
	; loop_a_inner related analysis above.			; loop_a_inner related analysis above.
	;			;
	▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines


Diff 450405
