This is an archive of the discontinued LLVM Phabricator instance.

Differential D110438

[BPI] Keep BPI available in loop passes through LoopStandardAnalysisResults
ClosedPublic

Authored by anna on Sep 24 2021, 11:59 AM.

Details

Reviewers

asbirlea
nikic
modimo
DaniilSuchkov

Commits

rG452714f8f803: [BPI] Keep BPI available in loop passes through LoopStandardAnalysisResults

Summary

This is analogous to D86156 (which preserves "lossy" BFI in loop
passes). Lossy means that the preserved analysis may not be up to date
with respect to new blocks added by loop passes, but BPI will
not contain stale pointers to basic blocks that are deleted by the loop
passes.

This is achieved through BasicBlockCallbackVH in BPI, which calls
eraseBlock that updates the data structures in BPI whenever a basic
block is deleted.
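
The mechanism described above can be sketched outside LLVM as follows. This is a minimal illustration, not the actual BasicBlockCallbackVH or BranchProbabilityInfo code: ToyBlock, ToyFunction, and ToyBranchProbs are hypothetical stand-ins, and only the name eraseBlock is taken from the summary. The analysis registers a deletion callback, so its map never keeps an entry for a deleted block, while newly created blocks simply remain unknown to it (the "lossy" part).

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <map>
#include <memory>
#include <utility>
#include <vector>

// Toy stand-in for a basic block; not an LLVM type.
struct ToyBlock {
  int Id;
};

// Hypothetical analysis caching successor probabilities per block.
class ToyBranchProbs {
public:
  void setEdgeProbs(const ToyBlock *Src, std::vector<double> SuccProbs) {
    assert(Src && "cannot record probabilities for a null block");
    Probs[Src] = std::move(SuccProbs);
  }
  bool knows(const ToyBlock *Src) const { return Probs.count(Src) != 0; }
  // Analogue of the eraseBlock step: drop everything keyed by BB.
  void eraseBlock(const ToyBlock *BB) { Probs.erase(BB); }
  std::size_t numBlocks() const { return Probs.size(); }

private:
  std::map<const ToyBlock *, std::vector<double>> Probs;
};

// Toy function that owns blocks and notifies listeners before deleting one.
class ToyFunction {
public:
  using DeleteCallback = std::function<void(const ToyBlock *)>;

  ToyBlock *createBlock() {
    Blocks.push_back(std::make_unique<ToyBlock>(ToyBlock{NextId++}));
    return Blocks.back().get();
  }
  void onDelete(DeleteCallback CB) { Callbacks.push_back(std::move(CB)); }
  void deleteBlock(const ToyBlock *BB) {
    // Notify listeners while BB is still alive, then destroy it.
    for (const DeleteCallback &CB : Callbacks)
      CB(BB);
    Blocks.erase(std::remove_if(Blocks.begin(), Blocks.end(),
                                [BB](const std::unique_ptr<ToyBlock> &P) {
                                  return P.get() == BB;
                                }),
                 Blocks.end());
  }

private:
  int NextId = 0;
  std::vector<std::unique_ptr<ToyBlock>> Blocks;
  std::vector<DeleteCallback> Callbacks;
};
```

In this sketch, a caller would register `[&](const ToyBlock *BB) { Probs.eraseBlock(BB); }` with `onDelete` once; after that, deleting a block removes its entry, and a block created later is not known to the analysis until someone records probabilities for it.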

This patch does not have any changes in the upstream pipeline, since
none of the loop passes in the pipeline use BPI currently.
However, since BPI wasn't previously preserved in loop passes, the loop
predication pass was computing BPI *on the entire
function* every time it ran in an LPM. This caused a massive compile-time
increase in our downstream LPM invocation, which contained loop predication.
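
To make the cost difference concrete, here is a small sketch that only counts how often a function-wide analysis is computed. It assumes a toy model: ToyBPI, ToyLoopResults, and both counting functions are hypothetical names, and the real LoopStandardAnalysisResults carries many more analyses than this bundle does.

```cpp
#include <cassert>

// Toy stand-in for a function-level analysis result; not an LLVM type.
struct ToyBPI {
  int NumBlocksAnalyzed;
};

// Pretend to analyze every block of the function and count the invocation.
ToyBPI computeToyBPI(int NumBlocks, int &NumComputations) {
  assert(NumBlocks >= 0 && "block count cannot be negative");
  ++NumComputations;
  return ToyBPI{NumBlocks};
}

// Before: every loop pass invocation builds its own function-wide result.
int countRecomputePerLoop(int NumLoops, int NumBlocks) {
  int NumComputations = 0;
  for (int I = 0; I < NumLoops; ++I) {
    ToyBPI Local = computeToyBPI(NumBlocks, NumComputations);
    (void)Local; // the loop pass would consult Local here
  }
  return NumComputations;
}

// Hypothetical bundle handed to each loop pass, loosely modeled on the idea
// of a standard-results struct that carries a BPI pointer.
struct ToyLoopResults {
  const ToyBPI *BPI;
};

// After: compute once per function and share the result through the bundle.
int countSharedAcrossLoops(int NumLoops, int NumBlocks) {
  int NumComputations = 0;
  if (NumLoops == 0)
    return NumComputations; // nothing to run, so nothing is computed
  ToyBPI Shared = computeToyBPI(NumBlocks, NumComputations);
  ToyLoopResults AR{&Shared};
  for (int I = 0; I < NumLoops; ++I) {
    // Each loop pass reads the shared result instead of rebuilding it.
    assert(AR.BPI->NumBlocksAnalyzed == NumBlocks);
  }
  return NumComputations;
}
```

With five loops, the first variant performs five function-wide computations and the second performs one, which is the kind of saving the summary describes.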

See updated test with an invocation of a loop-pipeline containing loop
predication and -debug-pass turned ON.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
180 ms	x64 debian > ORC-x86_64-linux.TestCases/Linux/x86-64::trivial-cxa-atexit.S
200 ms	x64 debian > ORC-x86_64-linux.TestCases/Linux/x86-64::trivial-static-initializer.S
160 ms	x64 debian > ORC-x86_64-linux.TestCases/Linux/x86-64::trivial-tls.S

Event Timeline

anna created this revision. Sep 24 2021, 11:59 AM

Herald added a subscriber: hiraditya. Sep 24 2021, 11:59 AM

anna requested review of this revision. Sep 24 2021, 11:59 AM

Herald added a project: Restricted Project. Sep 24 2021, 11:59 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B125620: Diff 374920. Sep 24 2021, 12:25 PM

LGTM pending the added test case demonstrating that BPI is correctly preserved across loop passes. Also from D86156 we found there was a significant compiler speed up using LazyBlockFrequencyInfo for the passes that needed it. This may also help for LoopPredication's usage of BPI.
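
The lazy-computation idea mentioned in this comment can be illustrated with a generic wrapper that defers an expensive analysis until its first use. This is only a sketch of the general pattern under assumed names (LazyToyAnalysis is hypothetical); it is not how LazyBlockFrequencyInfo is implemented in LLVM.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical wrapper: runs the compute callback on first access only.
template <typename ResultT> class LazyToyAnalysis {
public:
  explicit LazyToyAnalysis(std::function<ResultT()> ComputeFn)
      : Compute(std::move(ComputeFn)) {
    assert(Compute && "a compute callback is required");
  }

  // Returns the cached result, computing it the first time it is needed.
  const ResultT &get() {
    if (!Computed) {
      Cached = Compute();
      Computed = true;
    }
    return Cached;
  }

  bool isComputed() const { return Computed; }

private:
  std::function<ResultT()> Compute;
  ResultT Cached{};
  bool Computed = false;
};
```

A pass that never calls `get()` pays nothing for the analysis, and a pass that calls it repeatedly pays once.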

Changes LGTM. Please add the testcase mentioned in the description before pushing.

This revision is now accepted and ready to land. Sep 27 2021, 9:07 AM

Thank you for the reviews. I will precommit the test case and update this diff. The point to note is that since Loop predication didn't preserve BPI, we were recalculating it every time for the NPM. We are primarily interested in the NPM, so I'll be adding the testcase for this along with -debug-only=branch-prob, which shows that before this change, we calculate BPI every time we call Loop predication.

anna mentioned this in rG16a62d4f3dca: [LoopPredication] Add testcase showing BPI computation. NFC. Sep 27 2021, 1:54 PM

Updated the precommitted testcase and removed the BPI invocation from LoopPredication.

The removed code in LoopPredication (which removes BPI calculation per Loop predication invocation) shows the testcase difference. I was hoping to keep the loop predication change separate to save some downstream merge headaches, but couldn't show a testcase for the standalone original change in this review.

Harbormaster completed remote builds in B126100: Diff 375566. Sep 28 2021, 7:25 AM

In D110438#3027462, @anna wrote:

The removed code in LoopPredication (which removes BPI calculation per Loop predication invocation) shows the testcase difference. I was hoping to keep the loop predication change separate to save some downstream merge headaches, but couldn't show a testcase for the standalone original change in this review.

I think it makes sense as part of this change given LoopStandardAnalysisResults now stores BPI, so calculating it separately in LoopPredication is a waste.

Make sure to update the description before pushing, thanks!

anna edited the summary of this revision. Sep 28 2021, 1:00 PM

We need to explicitly check for loop-predication in PassBuilder, so that UseBranchProbabilityInfo is set to true.
(Without this change, the profitability.ll test fails, since an LPM invocation has UseBranchProbabilityInfo=false by default.)

This is the only change in this diff update, and it is analogous to the UseBFI check just above it (which tests whether any pass in the pipeline is LICM).
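
The flag deduction described here can be sketched in isolation as follows. The pass names and the three conditions come from the PassBuilder.cpp hunk in this revision; ToyPipelineElement, ToyAdaptorFlags, containsPass, and deduceFlags are hypothetical names for illustration, and the sketch uses std::any_of in place of llvm::any_of.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for one parsed element of a pass pipeline.
struct ToyPipelineElement {
  std::string Name;
};

// Hypothetical bundle of the adaptor flags discussed in the review.
struct ToyAdaptorFlags {
  bool UseMemorySSA;
  bool UseBFI;
  bool UseBPI;
};

// True if any element of the inner pipeline has the given pass name.
static bool containsPass(const std::vector<ToyPipelineElement> &Pipeline,
                         const std::string &PassName) {
  assert(!PassName.empty() && "pass name must not be empty");
  return std::any_of(
      Pipeline.begin(), Pipeline.end(),
      [&PassName](const ToyPipelineElement &E) { return E.Name == PassName; });
}

// Deduce the flags from the manager name and the passes nested inside it.
ToyAdaptorFlags deduceFlags(const std::string &ManagerName,
                            const std::vector<ToyPipelineElement> &Inner) {
  ToyAdaptorFlags Flags;
  Flags.UseMemorySSA = (ManagerName == "loop-mssa");
  Flags.UseBFI = containsPass(Inner, "licm");
  Flags.UseBPI = containsPass(Inner, "loop-predication");
  return Flags;
}
```

Under this model, a `loop-mssa(loop-predication,licm)` pipeline enables all three flags, while a plain `loop(licm)` pipeline leaves UseBPI off, so BPI is requested only when a pass that consumes it is present.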

anna added inline comments. Sep 28 2021, 1:04 PM

llvm/lib/Passes/PassBuilder.cpp
1305	Just for ease of review compared to previous LGTM'ed version: this is the (only) update done vs previous diff.

Harbormaster completed remote builds in B126168: Diff 375678. Sep 28 2021, 1:33 PM

asbirlea added inline comments. Sep 28 2021, 3:47 PM

llvm/test/Transforms/LoopPredication/preserve-bpi.ll
1	Does this still need the `require`s before `loop-mssa`?

anna added inline comments. Sep 30 2021, 5:56 AM

llvm/test/Transforms/LoopPredication/preserve-bpi.ll
1	We do not. Thanks. I'll update and land the patch.

This revision was landed with ongoing or failed builds. Sep 30 2021, 7:27 AM

Closed by commit rG452714f8f803: [BPI] Keep BPI available in loop passes through LoopStandardAnalysisResults (authored by anna).

This revision was automatically updated to reflect the committed changes.

anna added a commit: rG452714f8f803: [BPI] Keep BPI available in loop passes through LoopStandardAnalysisResults.

anna mentioned this in D111448: [BPI] Handle incomplete state in BPI. Oct 8 2021, 11:05 AM

anna mentioned this in D111668: [LoopPredication] Calculate profitability without BPI. Oct 12 2021, 12:13 PM

anna mentioned this in rG9403514e7649: [LoopPredication] Calculate profitability without BPI. Oct 19 2021, 11:25 AM

Revision Contents

Path	Size
llvm/include/llvm/Analysis/LoopAnalysisManager.h	1 line
llvm/include/llvm/Transforms/Scalar/LoopPassManager.h	22 lines
llvm/lib/Passes/PassBuilder.cpp	5 lines
llvm/lib/Transforms/Scalar/LoopDistribute.cpp	4 lines
llvm/lib/Transforms/Scalar/LoopLoadElimination.cpp	4 lines
llvm/lib/Transforms/Scalar/LoopPassManager.cpp	8 lines
llvm/lib/Transforms/Scalar/LoopPredication.cpp	6 lines
llvm/lib/Transforms/Utils/LoopVersioning.cpp	4 lines
llvm/lib/Transforms/Vectorize/LoopVectorize.cpp	4 lines
llvm/test/Transforms/LoopPredication/preserve-bpi.ll	9 lines

Diff 375678

llvm/include/llvm/Analysis/LoopAnalysisManager.h

@@ 52 lines not shown @@ struct LoopStandardAnalysisResults {
   AAResults &AA;
   AssumptionCache &AC;
   DominatorTree &DT;
   LoopInfo &LI;
   ScalarEvolution &SE;
   TargetLibraryInfo &TLI;
   TargetTransformInfo &TTI;
   BlockFrequencyInfo *BFI;
+  BranchProbabilityInfo *BPI;
   MemorySSA *MSSA;
 };

 /// Extern template declaration for the analysis set for this IR unit.
 extern template class AllAnalysesOn<Loop>;

 extern template class AnalysisManager<Loop, LoopStandardAnalysisResults &>;
 /// The loop analysis manager.
@@ 94 lines not shown @@

llvm/include/llvm/Transforms/Scalar/LoopPassManager.h

@@ 415 lines not shown @@
 public:
   using PassConceptT =
       detail::PassConcept<Loop, LoopAnalysisManager,
                           LoopStandardAnalysisResults &, LPMUpdater &>;

   explicit FunctionToLoopPassAdaptor(std::unique_ptr<PassConceptT> Pass,
                                      bool UseMemorySSA = false,
                                      bool UseBlockFrequencyInfo = false,
+                                     bool UseBranchProbabilityInfo = false,
                                      bool LoopNestMode = false)
       : Pass(std::move(Pass)), LoopCanonicalizationFPM(),
         UseMemorySSA(UseMemorySSA),
         UseBlockFrequencyInfo(UseBlockFrequencyInfo),
+        UseBranchProbabilityInfo(UseBranchProbabilityInfo),
         LoopNestMode(LoopNestMode) {
     LoopCanonicalizationFPM.addPass(LoopSimplifyPass());
     LoopCanonicalizationFPM.addPass(LCSSAPass());
   }

   /// Runs the loop passes across every loop in the function.
   PreservedAnalyses run(Function &F, FunctionAnalysisManager &AM);
   void printPipeline(raw_ostream &OS,
                      function_ref<StringRef(StringRef)> MapClassName2PassName);

   static bool isRequired() { return true; }

   bool isLoopNestMode() const { return LoopNestMode; }

 private:
   std::unique_ptr<PassConceptT> Pass;

   FunctionPassManager LoopCanonicalizationFPM;

   bool UseMemorySSA = false;
   bool UseBlockFrequencyInfo = false;
+  bool UseBranchProbabilityInfo = false;
   const bool LoopNestMode;
 };

 /// A function to deduce a loop pass type and wrap it in the templated
 /// adaptor.
 ///
 /// If \p Pass is a loop pass, the returned adaptor will be in loop mode.
 template <typename LoopPassT>
 inline std::enable_if_t<is_detected<HasRunOnLoopT, LoopPassT>::value,
                         FunctionToLoopPassAdaptor>
 createFunctionToLoopPassAdaptor(LoopPassT &&Pass, bool UseMemorySSA = false,
-                                bool UseBlockFrequencyInfo = false) {
+                                bool UseBlockFrequencyInfo = false,
+                                bool UseBranchProbabilityInfo = false) {
   using PassModelT =
       detail::PassModel<Loop, LoopPassT, PreservedAnalyses, LoopAnalysisManager,
                         LoopStandardAnalysisResults &, LPMUpdater &>;
   return FunctionToLoopPassAdaptor(
       std::make_unique<PassModelT>(std::forward<LoopPassT>(Pass)), UseMemorySSA,
-      UseBlockFrequencyInfo, false);
+      UseBlockFrequencyInfo, UseBranchProbabilityInfo, false);
 }

 /// If \p Pass is a loop-nest pass, \p Pass will first be wrapped into a
 /// \c LoopPassManager and the returned adaptor will be in loop-nest mode.
 template <typename LoopNestPassT>
 inline std::enable_if_t<!is_detected<HasRunOnLoopT, LoopNestPassT>::value,
                         FunctionToLoopPassAdaptor>
 createFunctionToLoopPassAdaptor(LoopNestPassT &&Pass, bool UseMemorySSA = false,
-                                bool UseBlockFrequencyInfo = false) {
+                                bool UseBlockFrequencyInfo = false,
+                                bool UseBranchProbabilityInfo = false) {
   LoopPassManager LPM;
   LPM.addPass(std::forward<LoopNestPassT>(Pass));
   using PassModelT =
       detail::PassModel<Loop, LoopPassManager, PreservedAnalyses,
                         LoopAnalysisManager, LoopStandardAnalysisResults &,
                         LPMUpdater &>;
   return FunctionToLoopPassAdaptor(std::make_unique<PassModelT>(std::move(LPM)),
-                                   UseMemorySSA, UseBlockFrequencyInfo, true);
+                                   UseMemorySSA, UseBlockFrequencyInfo,
+                                   UseBranchProbabilityInfo, true);
 }

 /// If \p Pass is an instance of \c LoopPassManager, the returned adaptor will
 /// be in loop-nest mode if the pass manager contains only loop-nest passes.
 template <>
 inline FunctionToLoopPassAdaptor
-createFunctionToLoopPassAdaptor<LoopPassManager>(LoopPassManager &&LPM,
-                                                 bool UseMemorySSA,
-                                                 bool UseBlockFrequencyInfo) {
+createFunctionToLoopPassAdaptor<LoopPassManager>(
+    LoopPassManager &&LPM, bool UseMemorySSA, bool UseBlockFrequencyInfo,
+    bool UseBranchProbabilityInfo) {
   // Check if LPM contains any loop pass and if it does not, returns an adaptor
   // in loop-nest mode.
   using PassModelT =
       detail::PassModel<Loop, LoopPassManager, PreservedAnalyses,
                         LoopAnalysisManager, LoopStandardAnalysisResults &,
                         LPMUpdater &>;
   bool LoopNestMode = (LPM.getNumLoopPasses() == 0);
   return FunctionToLoopPassAdaptor(std::make_unique<PassModelT>(std::move(LPM)),
                                    UseMemorySSA, UseBlockFrequencyInfo,
-                                   LoopNestMode);
+                                   UseBranchProbabilityInfo, LoopNestMode);
 }

 /// Pass for printing a loop's contents as textual IR.
 class PrintLoopPass : public PassInfoMixin<PrintLoopPass> {
   raw_ostream &OS;
   std::string Banner;

 public:
@@ 9 lines not shown @@

llvm/lib/Passes/PassBuilder.cpp

@@ 1,296 lines not shown @@ if (!InnerPipeline.empty()) {
     if (Name == "loop" || Name == "loop-mssa") {
       LoopPassManager LPM;
       if (auto Err = parseLoopPassPipeline(LPM, InnerPipeline))
         return Err;
       // Add the nested pass manager with the appropriate adaptor.
       bool UseMemorySSA = (Name == "loop-mssa");
       bool UseBFI = llvm::any_of(
           InnerPipeline, [](auto Pipeline) { return Pipeline.Name == "licm"; });
+      bool UseBPI = llvm::any_of(InnerPipeline, [](auto Pipeline) {
+        return Pipeline.Name == "loop-predication";
+      });
       FPM.addPass(createFunctionToLoopPassAdaptor(std::move(LPM), UseMemorySSA,
-                                                  UseBFI));
+                                                  UseBFI, UseBPI));
       return Error::success();
     }
     if (auto Count = parseRepeatPassName(Name)) {
       FunctionPassManager NestedFPM;
       if (auto Err = parseFunctionPassPipeline(NestedFPM, InnerPipeline))
         return Err;
       FPM.addPass(createRepeatedPass(*Count, std::move(NestedFPM)));
       return Error::success();
@@ 442 lines not shown @@

llvm/lib/Transforms/Scalar/LoopDistribute.cpp

@@ 1,051 lines not shown @@ PreservedAnalyses LoopDistributePass::run(Function &F,
   auto &AA = AM.getResult<AAManager>(F);
   auto &AC = AM.getResult<AssumptionAnalysis>(F);
   auto &TTI = AM.getResult<TargetIRAnalysis>(F);
   auto &TLI = AM.getResult<TargetLibraryAnalysis>(F);

   auto &LAM = AM.getResult<LoopAnalysisManagerFunctionProxy>(F).getManager();
   std::function<const LoopAccessInfo &(Loop &)> GetLAA =
       [&](Loop &L) -> const LoopAccessInfo & {
     LoopStandardAnalysisResults AR = {AA, AC, DT, LI, SE,
-                                      TLI, TTI, nullptr, nullptr};
+                                      TLI, TTI, nullptr, nullptr, nullptr};
     return LAM.getResult<LoopAccessAnalysis>(L, AR);
   };

   bool Changed = runImpl(F, &LI, &DT, &SE, &ORE, GetLAA);
   if (!Changed)
     return PreservedAnalyses::all();
   PreservedAnalyses PA;
   PA.preserve<LoopAnalysis>();
@@ 18 lines not shown @@

llvm/lib/Transforms/Scalar/LoopLoadElimination.cpp

@@ 715 lines not shown @@ PreservedAnalyses LoopLoadEliminationPass::run(Function &F,
   auto &MAMProxy = AM.getResult<ModuleAnalysisManagerFunctionProxy>(F);
   auto PSI = MAMProxy.getCachedResult<ProfileSummaryAnalysis>(F.getParent());
   auto *BFI = (PSI && PSI->hasProfileSummary()) ?
       &AM.getResult<BlockFrequencyAnalysis>(F) : nullptr;

   auto &LAM = AM.getResult<LoopAnalysisManagerFunctionProxy>(F).getManager();
   bool Changed = eliminateLoadsAcrossLoops(
       F, LI, DT, BFI, PSI, &SE, &AC, [&](Loop &L) -> const LoopAccessInfo & {
         LoopStandardAnalysisResults AR = {AA, AC, DT, LI, SE,
-                                          TLI, TTI, nullptr, nullptr};
+                                          TLI, TTI, nullptr, nullptr, nullptr};
         return LAM.getResult<LoopAccessAnalysis>(L, AR);
       });

   if (!Changed)
     return PreservedAnalyses::all();

   PreservedAnalyses PA;
   return PA;
 }

llvm/lib/Transforms/Scalar/LoopPassManager.cpp

 //===- LoopPassManager.cpp - Loop pass management -------------------------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #include "llvm/Transforms/Scalar/LoopPassManager.h"
 #include "llvm/Analysis/AssumptionCache.h"
 #include "llvm/Analysis/BasicAliasAnalysis.h"
 #include "llvm/Analysis/BlockFrequencyInfo.h"
+#include "llvm/Analysis/BranchProbabilityInfo.h"
 #include "llvm/Analysis/GlobalsModRef.h"
 #include "llvm/Analysis/MemorySSA.h"
 #include "llvm/Analysis/ScalarEvolutionAliasAnalysis.h"
 #include "llvm/Analysis/TargetLibraryInfo.h"
 #include "llvm/Support/Debug.h"
 #include "llvm/Support/TimeProfiler.h"

 using namespace llvm;
@@ 181 lines not shown @@ if (LI.empty())
     return PA;

   // Get the analysis results needed by loop passes.
   MemorySSA *MSSA =
       UseMemorySSA ? (&AM.getResult<MemorySSAAnalysis>(F).getMSSA()) : nullptr;
   BlockFrequencyInfo *BFI = UseBlockFrequencyInfo && F.hasProfileData()
                                 ? (&AM.getResult<BlockFrequencyAnalysis>(F))
                                 : nullptr;
+  BranchProbabilityInfo *BPI =
+      UseBranchProbabilityInfo && F.hasProfileData()
+          ? (&AM.getResult<BranchProbabilityAnalysis>(F))
+          : nullptr;
   LoopStandardAnalysisResults LAR = {AM.getResult<AAManager>(F),
                                      AM.getResult<AssumptionAnalysis>(F),
                                      AM.getResult<DominatorTreeAnalysis>(F),
                                      AM.getResult<LoopAnalysis>(F),
                                      AM.getResult<ScalarEvolutionAnalysis>(F),
                                      AM.getResult<TargetLibraryAnalysis>(F),
                                      AM.getResult<TargetIRAnalysis>(F),
                                      BFI,
+                                     BPI,
                                      MSSA};

   // Setup the loop analysis manager from its proxy. It is important that
   // this is only done when there are loops to process and we have built the
   // LoopStandardAnalysisResults object. The loop analyses cached in this
   // manager have access to those analysis results and so it must invalidate
   // itself when they go away.
   auto &LAMFP = AM.getResult<LoopAnalysisManagerFunctionProxy>(F);
@@ 104 lines not shown @@ #endif
   PA.preserveSet<AllAnalysesOn<Loop>>();
   PA.preserve<LoopAnalysisManagerFunctionProxy>();
   // We also preserve the set of standard analyses.
   PA.preserve<DominatorTreeAnalysis>();
   PA.preserve<LoopAnalysis>();
   PA.preserve<ScalarEvolutionAnalysis>();
   if (UseBlockFrequencyInfo && F.hasProfileData())
     PA.preserve<BlockFrequencyAnalysis>();
+  if (UseBranchProbabilityInfo && F.hasProfileData())
+    PA.preserve<BranchProbabilityAnalysis>();
   if (UseMemorySSA)
     PA.preserve<MemorySSAAnalysis>();
   return PA;
 }

 PrintLoopPass::PrintLoopPass() : OS(dbgs()) {}
 PrintLoopPass::PrintLoopPass(raw_ostream &OS, const std::string &Banner)
     : OS(OS), Banner(Banner) {}

 PreservedAnalyses PrintLoopPass::run(Loop &L, LoopAnalysisManager &,
                                      LoopStandardAnalysisResults &,
                                      LPMUpdater &) {
   printLoop(L, OS, Banner);
   return PreservedAnalyses::all();
 }

llvm/lib/Transforms/Scalar/LoopPredication.cpp

@@ 361 lines not shown @@
 Pass *llvm::createLoopPredicationPass() {
   return new LoopPredicationLegacyPass();
 }

 PreservedAnalyses LoopPredicationPass::run(Loop &L, LoopAnalysisManager &AM,
                                            LoopStandardAnalysisResults &AR,
                                            LPMUpdater &U) {
   Function *F = L.getHeader()->getParent();
-  // For the new PM, we also can't use BranchProbabilityInfo as an analysis
-  // pass. Function analyses need to be preserved across loop transformations
-  // but BPI is not preserved, hence a newly built one is needed.
-  BranchProbabilityInfo BPI(*F, AR.LI, &AR.TLI, &AR.DT, nullptr);
   std::unique_ptr<MemorySSAUpdater> MSSAU;
   if (AR.MSSA)
     MSSAU = std::make_unique<MemorySSAUpdater>(AR.MSSA);
-  LoopPredication LP(&AR.AA, &AR.DT, &AR.SE, &AR.LI, &BPI,
+  LoopPredication LP(&AR.AA, &AR.DT, &AR.SE, &AR.LI, AR.BPI,
                      MSSAU ? MSSAU.get() : nullptr);
   if (!LP.runOnLoop(&L))
     return PreservedAnalyses::all();

   auto PA = getLoopPassPreservedAnalyses();
   if (AR.MSSA)
     PA.preserve<MemorySSAAnalysis>();
   return PA;
@@ 885 lines not shown @@

llvm/lib/Transforms/Utils/LoopVersioning.cpp

@@ 351 lines not shown @@ PreservedAnalyses LoopVersioningPass::run(Function &F,
   auto &TTI = AM.getResult<TargetIRAnalysis>(F);
   auto &DT = AM.getResult<DominatorTreeAnalysis>(F);
   auto &TLI = AM.getResult<TargetLibraryAnalysis>(F);
   auto &AA = AM.getResult<AAManager>(F);
   auto &AC = AM.getResult<AssumptionAnalysis>(F);

   auto &LAM = AM.getResult<LoopAnalysisManagerFunctionProxy>(F).getManager();
   auto GetLAA = [&](Loop &L) -> const LoopAccessInfo & {
     LoopStandardAnalysisResults AR = {AA, AC, DT, LI, SE,
-                                      TLI, TTI, nullptr, nullptr};
+                                      TLI, TTI, nullptr, nullptr, nullptr};
     return LAM.getResult<LoopAccessAnalysis>(L, AR);
   };

   if (runImpl(&LI, GetLAA, &DT, &SE))
     return PreservedAnalyses::none();
   return PreservedAnalyses::all();
 }
 } // namespace llvm

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

@@ 10,551 lines not shown @@ PreservedAnalyses LoopVectorizePass::run(Function &F,
   auto &AA = AM.getResult<AAManager>(F);
   auto &AC = AM.getResult<AssumptionAnalysis>(F);
   auto &DB = AM.getResult<DemandedBitsAnalysis>(F);
   auto &ORE = AM.getResult<OptimizationRemarkEmitterAnalysis>(F);

   auto &LAM = AM.getResult<LoopAnalysisManagerFunctionProxy>(F).getManager();
   std::function<const LoopAccessInfo &(Loop &)> GetLAA =
       [&](Loop &L) -> const LoopAccessInfo & {
     LoopStandardAnalysisResults AR = {AA, AC, DT, LI, SE,
-                                      TLI, TTI, nullptr, nullptr};
+                                      TLI, TTI, nullptr, nullptr, nullptr};
     return LAM.getResult<LoopAccessAnalysis>(L, AR);
   };
   auto &MAMProxy = AM.getResult<ModuleAnalysisManagerFunctionProxy>(F);
   ProfileSummaryInfo *PSI =
       MAMProxy.getCachedResult<ProfileSummaryAnalysis>(*F.getParent());
   LoopVectorizeResult Result =
       runImpl(F, SE, LI, TTI, DT, BFI, &TLI, DB, AA, AC, GetLAA, ORE, PSI);
   if (!Result.MadeAnyChange)
@@ 26 lines not shown @@

llvm/test/Transforms/LoopPredication/preserve-bpi.ll

 ; RUN: opt -mtriple=x86_64 -passes='require<scalar-evolution>,require<branch-prob>,loop-mssa(loop-predication,licm,simple-loop-unswitch<nontrivial>,loop-simplifycfg)' -debug-pass-manager -debug-only=branch-prob -S < %s 2>&1 | FileCheck %s
 ; REQUIRES: asserts

+; This test is to solely check that we do not run BPI every single time loop
+; predication is invoked (since BPI is preserved as part of
+; LoopStandardAnalysisResults).
 declare void @llvm.experimental.guard(i1, ...)

 ; CHECK: Running pass: RequireAnalysisPass<{{.*}}BranchProbabilityAnalysis
 ; CHECK-NEXT: Running analysis: BranchProbabilityAnalysis on unsigned_loop_0_to_n_ult_check
 ; CHECK-NEXT: Running analysis: PostDominatorTreeAnalysis on unsigned_loop_0_to_n_ult_check
 ; CHECK-NEXT: ---- Branch Probability Info : unsigned_loop_0_to_n_ult_check ----
 ; CHECK: Running pass: LoopSimplifyPass on unsigned_loop_0_to_n_ult_check
 ; CHECK: Running pass: LoopPredicationPass on Loop at depth 1
-; CHECK-NEXT: ---- Branch Probability Info : unsigned_loop_0_to_n_ult_check ----
-; CHECK: Running pass: LICMPass on Loop at depth 1
+; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1
 ; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1
 ; CHECK-NEXT: Running pass: LoopPredicationPass on Loop at depth 1
-; CHECK-NEXT: ---- Branch Probability Info : unsigned_loop_0_to_n_ult_check ----
-; CHECK: Running pass: LICMPass on Loop at depth 1
+; CHECK-NEXT: Running pass: LICMPass on Loop at depth 1
 ; CHECK-NEXT: Running pass: SimpleLoopUnswitchPass on Loop at depth 1
 ; CHECK-NEXT: Running pass: LoopSimplifyCFGPass on Loop at depth 1
 ; CHECK-NEXT: Invalidating analysis: BranchProbabilityAnalysis on unsigned_loop_0_to_n_ult_check


 define i32 @unsigned_loop_0_to_n_ult_check(i32* %array, i32 %length, i32 %n) {
 entry:
   %tmp5 = icmp eq i32 %n, 0
@@ 39 lines not shown @@
