This is an archive of the discontinued LLVM Phabricator instance.

[LPM] A targeted but somewhat horrible fix to the legacy pass manager's querying of the pass registry.
ClosedPublic

Authored by chandlerc on Jan 27 2015, 3:41 PM.

Download Raw Diff

Details

Reviewers

Commits

rGd24b02335523: Merging r227299: --------------------------------------------------------------…
rG5b0d3e3f3add: [LPM] A targeted but somewhat horrible fix to the legacy pass manager's…
rL227299: [LPM] A targeted but somewhat horrible fix to the legacy pass manager's

Summary

The pass manager relies on the static registry of PassInfo objects to
perform all manner of its functionality. I don't understand why it does
much of this. My very vague understanding is that this registry is
touched both during static initialization *and* while each pass is being
constructed. As a consequence it is hard to make accessing it not
require a acquiring some lock. This lock ends up in the hot path of
setting up, tearing down, and invaliditing analyses in the legacy pass
manager.

On most systems you can observe this as a non-trivial % of the time
spent in 'ninja check-llvm'. However, I haven't really seen it be more
than 1% in extreme cases of compiling more real-world software,
including LTO.

Unfortunately, some of the GPU JITs are seeing this taking essentially
all of their time because they have very small IR running through
a small pass pipeline very many times (at least, this is the vague
understanding I have of it).

This patch tries to minimize the cost of looking up PassInfo objects by
leveraging the fact that the objects themselves are immutable and they
are allocated separately on the heap and so don't have their address
change. It also requires a change I made the last time I tried to debug
this problem which removed the ability to de-register a pass from the
registry. This patch creates a single access path to these objects
inside the PMTopLevelManager which memoizes the result of querying the
registry. This is somewhat gross as I don't really know if
PMTopLevelManager is the *right* place to put it, and I dislike using
a mutable member to memoize things, but it seems to work.

For long-lived pass managers this should completely eliminate
the cost of acquiring locks to look into the pass registry once the
memoized cache is warm. For 'ninja check' I measured about 10% reduction
in CPU time, and about a 1% reduction in total time on a machine with 32
hardware threads. For normal compilation, I don't know how much this
will help, sadly. We will still pay the cost while we populate the
memoized cache. I don't think it will hurt though, and for LTO or
compiles with many small functions it should still be a win.

Diff Detail

Repository: rL LLVM

Event Timeline

chandlerc updated this revision to Diff 18862.Jan 27 2015, 3:41 PM

chandlerc retitled this revision from to [LPM] A targeted but somewhat horrible fix to the legacy pass manager's querying of the pass registry..

chandlerc updated this object.

chandlerc edited the test plan for this revision. (Show Details)

chandlerc added a reviewer: resistor.

chandlerc added a subscriber: Unknown Object (MLST).

Herald added a subscriber: aemerson. · View Herald TranscriptJan 27 2015, 3:41 PM

Hi Chandler,

In my measurements, this captures about half of the regression, i.e. the regression goes from ~20% to ~10%. While the reader/writer locks are gone from the profile, several other PassManager related pieces of code have jumped up to take their place:

—Owen

ThreadProfile2.png80 KBDownload

To summarize the discussion on IRC:

This hot code path doesn't really seem to be due to my patch. If it is, I
can't explain why. I definitely can't explain why without some steps to
reproduce.

This patch essentially erases all the performance hits I can find any way
to measure of the pthread locks, so I'll wait for more information from you
or the others that have measured more significant issues here. Let me know
what you'd like me to do next.

mehdi_amini added a subscriber: mehdi_amini.Jan 27 2015, 10:15 PM

Closed by commit rL227299: [LPM] A targeted but somewhat horrible fix to the legacy pass manager's (authored by chandlerc). · Explain WhyJan 28 2015, 1:49 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

IR/

LegacyPassManagers.h

9 lines

lib/

IR/

LegacyPassManager.cpp

36 lines

Diff 18880

llvm/trunk/include/llvm/IR/LegacyPassManagers.h

Show First 20 Lines • Show All 189 Lines • ▼ Show 20 Lines	public:
/// Collect passes whose last user is P		/// Collect passes whose last user is P
void collectLastUses(SmallVectorImpl<Pass > &LastUses, Pass P);		void collectLastUses(SmallVectorImpl<Pass > &LastUses, Pass P);

/// Find the pass that implements Analysis AID. Search immutable		/// Find the pass that implements Analysis AID. Search immutable
/// passes and all pass managers. If desired pass is not found		/// passes and all pass managers. If desired pass is not found
/// then return NULL.		/// then return NULL.
Pass *findAnalysisPass(AnalysisID AID);		Pass *findAnalysisPass(AnalysisID AID);

		/// Retrieve the PassInfo for an analysis.
		const PassInfo *findAnalysisPassInfo(AnalysisID AID) const;

/// Find analysis usage information for the pass P.		/// Find analysis usage information for the pass P.
AnalysisUsage findAnalysisUsage(Pass P);		AnalysisUsage findAnalysisUsage(Pass P);

virtual ~PMTopLevelManager();		virtual ~PMTopLevelManager();

/// Add immutable pass and initialize it.		/// Add immutable pass and initialize it.
inline void addImmutablePass(ImmutablePass *P) {		inline void addImmutablePass(ImmutablePass *P) {
P->initializePass();		P->initializePass();
Show All 40 Lines	private:
// This inverse map is initialized at PM->run() based on		// This inverse map is initialized at PM->run() based on
// LastUser map.		// LastUser map.
DenseMap<Pass , SmallPtrSet<Pass , 8> > InversedLastUser;		DenseMap<Pass , SmallPtrSet<Pass , 8> > InversedLastUser;

/// Immutable passes are managed by top level manager.		/// Immutable passes are managed by top level manager.
SmallVector<ImmutablePass *, 16> ImmutablePasses;		SmallVector<ImmutablePass *, 16> ImmutablePasses;

DenseMap<Pass , AnalysisUsage > AnUsageMap;		DenseMap<Pass , AnalysisUsage > AnUsageMap;

		/// Collection of PassInfo objects found via analysis IDs and in this top
		/// level manager. This is used to memoize queries to the pass registry.
		/// FIXME: This is an egregious hack because querying the pass registry is
		/// either slow or racy.
		mutable DenseMap<AnalysisID, const PassInfo *> AnalysisPassInfos;
};		};



//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// PMDataManager		// PMDataManager

/// PMDataManager provides the common place to manage the analysis data		/// PMDataManager provides the common place to manage the analysis data
▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/trunk/lib/IR/LegacyPassManager.cpp

Show First 20 Lines • Show All 594 Lines • ▼ Show 20 Lines	void PMTopLevelManager::schedulePass(Pass *P) {
// may be inserted into previous function manager		// may be inserted into previous function manager

// Give pass a chance to prepare the stage.		// Give pass a chance to prepare the stage.
P->preparePassManager(activeStack);		P->preparePassManager(activeStack);

// If P is an analysis pass and it is available then do not		// If P is an analysis pass and it is available then do not
// generate the analysis again. Stale analysis info should not be		// generate the analysis again. Stale analysis info should not be
// available at this point.		// available at this point.
const PassInfo *PI =		const PassInfo *PI = findAnalysisPassInfo(P->getPassID());
PassRegistry::getPassRegistry()->getPassInfo(P->getPassID());
if (PI && PI->isAnalysis() && findAnalysisPass(P->getPassID())) {		if (PI && PI->isAnalysis() && findAnalysisPass(P->getPassID())) {
delete P;		delete P;
return;		return;
}		}

AnalysisUsage *AnUsage = findAnalysisUsage(P);		AnalysisUsage *AnUsage = findAnalysisUsage(P);

bool checkAnalysis = true;		bool checkAnalysis = true;
while (checkAnalysis) {		while (checkAnalysis) {
checkAnalysis = false;		checkAnalysis = false;

const AnalysisUsage::VectorType &RequiredSet = AnUsage->getRequiredSet();		const AnalysisUsage::VectorType &RequiredSet = AnUsage->getRequiredSet();
for (AnalysisUsage::VectorType::const_iterator I = RequiredSet.begin(),		for (AnalysisUsage::VectorType::const_iterator I = RequiredSet.begin(),
E = RequiredSet.end(); I != E; ++I) {		E = RequiredSet.end(); I != E; ++I) {

Pass AnalysisPass = findAnalysisPass(I);		Pass AnalysisPass = findAnalysisPass(I);
if (!AnalysisPass) {		if (!AnalysisPass) {
const PassInfo PI = PassRegistry::getPassRegistry()->getPassInfo(I);		const PassInfo PI = findAnalysisPassInfo(I);

if (!PI) {		if (!PI) {
// Pass P is not in the global PassRegistry		// Pass P is not in the global PassRegistry
dbgs() << "Pass '" << P->getPassName() << "' is not initialized." << "\n";		dbgs() << "Pass '" << P->getPassName() << "' is not initialized." << "\n";
dbgs() << "Verify if there is a pass dependency cycle." << "\n";		dbgs() << "Verify if there is a pass dependency cycle." << "\n";
dbgs() << "Required Passes:" << "\n";		dbgs() << "Required Passes:" << "\n";
for (AnalysisUsage::VectorType::const_iterator I2 = RequiredSet.begin(),		for (AnalysisUsage::VectorType::const_iterator I2 = RequiredSet.begin(),
E = RequiredSet.end(); I2 != E && I2 != I; ++I2) {		E = RequiredSet.end(); I2 != E && I2 != I; ++I2) {
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	Pass *PMTopLevelManager::findAnalysisPass(AnalysisID AID) {
// the most recently registered passes first.		// the most recently registered passes first.
for (SmallVectorImpl<ImmutablePass *>::reverse_iterator I =		for (SmallVectorImpl<ImmutablePass *>::reverse_iterator I =
ImmutablePasses.rbegin(), E = ImmutablePasses.rend(); I != E; ++I) {		ImmutablePasses.rbegin(), E = ImmutablePasses.rend(); I != E; ++I) {
AnalysisID PI = (*I)->getPassID();		AnalysisID PI = (*I)->getPassID();
if (PI == AID)		if (PI == AID)
return *I;		return *I;

// If Pass not found then check the interfaces implemented by Immutable Pass		// If Pass not found then check the interfaces implemented by Immutable Pass
const PassInfo *PassInf =		const PassInfo *PassInf = findAnalysisPassInfo(PI);
PassRegistry::getPassRegistry()->getPassInfo(PI);
assert(PassInf && "Expected all immutable passes to be initialized");		assert(PassInf && "Expected all immutable passes to be initialized");
const std::vector<const PassInfo*> &ImmPI =		const std::vector<const PassInfo*> &ImmPI =
PassInf->getInterfacesImplemented();		PassInf->getInterfacesImplemented();
for (std::vector<const PassInfo*>::const_iterator II = ImmPI.begin(),		for (std::vector<const PassInfo*>::const_iterator II = ImmPI.begin(),
EE = ImmPI.end(); II != EE; ++II) {		EE = ImmPI.end(); II != EE; ++II) {
if ((*II)->getTypeInfo() == AID)		if ((*II)->getTypeInfo() == AID)
return *I;		return *I;
}		}
}		}

return nullptr;		return nullptr;
}		}

		const PassInfo *PMTopLevelManager::findAnalysisPassInfo(AnalysisID AID) const {
		const PassInfo *&PI = AnalysisPassInfos[AID];
		if (!PI)
		PI = PassRegistry::getPassRegistry()->getPassInfo(AID);
		else
		assert(PI == PassRegistry::getPassRegistry()->getPassInfo(AID) &&
		"The pass info pointer changed for an analysis ID!");

		return PI;
		}

// Print passes managed by this top level manager.		// Print passes managed by this top level manager.
void PMTopLevelManager::dumpPasses() const {		void PMTopLevelManager::dumpPasses() const {

if (PassDebugging < Structure)		if (PassDebugging < Structure)
return;		return;

// Print out the immutable passes		// Print out the immutable passes
for (unsigned i = 0, e = ImmutablePasses.size(); i != e; ++i) {		for (unsigned i = 0, e = ImmutablePasses.size(); i != e; ++i) {
Show All 12 Lines
void PMTopLevelManager::dumpArguments() const {		void PMTopLevelManager::dumpArguments() const {

if (PassDebugging < Arguments)		if (PassDebugging < Arguments)
return;		return;

dbgs() << "Pass Arguments: ";		dbgs() << "Pass Arguments: ";
for (SmallVectorImpl<ImmutablePass *>::const_iterator I =		for (SmallVectorImpl<ImmutablePass *>::const_iterator I =
ImmutablePasses.begin(), E = ImmutablePasses.end(); I != E; ++I)		ImmutablePasses.begin(), E = ImmutablePasses.end(); I != E; ++I)
if (const PassInfo *PI =		if (const PassInfo PI = findAnalysisPassInfo((I)->getPassID())) {
PassRegistry::getPassRegistry()->getPassInfo((*I)->getPassID())) {
assert(PI && "Expected all immutable passes to be initialized");		assert(PI && "Expected all immutable passes to be initialized");
if (!PI->isAnalysisGroup())		if (!PI->isAnalysisGroup())
dbgs() << " -" << PI->getPassArgument();		dbgs() << " -" << PI->getPassArgument();
}		}
for (SmallVectorImpl<PMDataManager *>::const_iterator I =		for (SmallVectorImpl<PMDataManager *>::const_iterator I =
PassManagers.begin(), E = PassManagers.end(); I != E; ++I)		PassManagers.begin(), E = PassManagers.end(); I != E; ++I)
(*I)->dumpPassArguments();		(*I)->dumpPassArguments();
dbgs() << "\n";		dbgs() << "\n";
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	void PMDataManager::recordAvailableAnalysis(Pass *P) {
AnalysisID PI = P->getPassID();		AnalysisID PI = P->getPassID();

AvailableAnalysis[PI] = P;		AvailableAnalysis[PI] = P;

assert(!AvailableAnalysis.empty());		assert(!AvailableAnalysis.empty());

// This pass is the current implementation of all of the interfaces it		// This pass is the current implementation of all of the interfaces it
// implements as well.		// implements as well.
const PassInfo *PInf = PassRegistry::getPassRegistry()->getPassInfo(PI);		const PassInfo *PInf = TPM->findAnalysisPassInfo(PI);
if (!PInf) return;		if (!PInf) return;
const std::vector<const PassInfo*> &II = PInf->getInterfacesImplemented();		const std::vector<const PassInfo*> &II = PInf->getInterfacesImplemented();
for (unsigned i = 0, e = II.size(); i != e; ++i)		for (unsigned i = 0, e = II.size(); i != e; ++i)
AvailableAnalysis[II[i]->getTypeInfo()] = P;		AvailableAnalysis[II[i]->getTypeInfo()] = P;
}		}

// Return true if P preserves high level analysis used by other		// Return true if P preserves high level analysis used by other
// passes managed by this manager		// passes managed by this manager
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	dumpPassInfo(P, FREEING_MSG, DBG_STR, Msg);
// If the pass crashes releasing memory, remember this.		// If the pass crashes releasing memory, remember this.
PassManagerPrettyStackEntry X(P);		PassManagerPrettyStackEntry X(P);
TimeRegion PassTimer(getPassTimer(P));		TimeRegion PassTimer(getPassTimer(P));

P->releaseMemory();		P->releaseMemory();
}		}

AnalysisID PI = P->getPassID();		AnalysisID PI = P->getPassID();
if (const PassInfo *PInf = PassRegistry::getPassRegistry()->getPassInfo(PI)) {		if (const PassInfo *PInf = TPM->findAnalysisPassInfo(PI)) {
// Remove the pass itself (if it is not already removed).		// Remove the pass itself (if it is not already removed).
AvailableAnalysis.erase(PI);		AvailableAnalysis.erase(PI);

// Remove all interfaces this pass implements, for which it is also		// Remove all interfaces this pass implements, for which it is also
// listed as the available implementation.		// listed as the available implementation.
const std::vector<const PassInfo*> &II = PInf->getInterfacesImplemented();		const std::vector<const PassInfo*> &II = PInf->getInterfacesImplemented();
for (unsigned i = 0, e = II.size(); i != e; ++i) {		for (unsigned i = 0, e = II.size(); i != e; ++i) {
DenseMap<AnalysisID, Pass*>::iterator Pos =		DenseMap<AnalysisID, Pass*>::iterator Pos =
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	if (!TransferLastUses.empty()) {
TPM->setLastUser(TransferLastUses, My_PM);		TPM->setLastUser(TransferLastUses, My_PM);
TransferLastUses.clear();		TransferLastUses.clear();
}		}

// Now, take care of required analyses that are not available.		// Now, take care of required analyses that are not available.
for (SmallVectorImpl<AnalysisID>::iterator		for (SmallVectorImpl<AnalysisID>::iterator
I = ReqAnalysisNotAvailable.begin(),		I = ReqAnalysisNotAvailable.begin(),
E = ReqAnalysisNotAvailable.end() ;I != E; ++I) {		E = ReqAnalysisNotAvailable.end() ;I != E; ++I) {
const PassInfo PI = PassRegistry::getPassRegistry()->getPassInfo(I);		const PassInfo PI = TPM->findAnalysisPassInfo(I);
Pass *AnalysisPass = PI->createPass();		Pass *AnalysisPass = PI->createPass();
this->addLowerLevelRequiredPass(P, AnalysisPass);		this->addLowerLevelRequiredPass(P, AnalysisPass);
}		}

// Take a note of analysis required and made available by this pass.		// Take a note of analysis required and made available by this pass.
// Remove the analysis not preserved by this pass		// Remove the analysis not preserved by this pass
removeNotPreservedAnalysis(P);		removeNotPreservedAnalysis(P);
recordAvailableAnalysis(P);		recordAvailableAnalysis(P);
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines

void PMDataManager::dumpPassArguments() const {		void PMDataManager::dumpPassArguments() const {
for (SmallVectorImpl<Pass *>::const_iterator I = PassVector.begin(),		for (SmallVectorImpl<Pass *>::const_iterator I = PassVector.begin(),
E = PassVector.end(); I != E; ++I) {		E = PassVector.end(); I != E; ++I) {
if (PMDataManager PMD = (I)->getAsPMDataManager())		if (PMDataManager PMD = (I)->getAsPMDataManager())
PMD->dumpPassArguments();		PMD->dumpPassArguments();
else		else
if (const PassInfo *PI =		if (const PassInfo *PI =
PassRegistry::getPassRegistry()->getPassInfo((*I)->getPassID()))		TPM->findAnalysisPassInfo((*I)->getPassID()))
if (!PI->isAnalysisGroup())		if (!PI->isAnalysisGroup())
dbgs() << " -" << PI->getPassArgument();		dbgs() << " -" << PI->getPassArgument();
}		}
}		}

void PMDataManager::dumpPassInfo(Pass *P, enum PassDebuggingString S1,		void PMDataManager::dumpPassInfo(Pass *P, enum PassDebuggingString S1,
enum PassDebuggingString S2,		enum PassDebuggingString S2,
StringRef Msg) {		StringRef Msg) {
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
void PMDataManager::dumpAnalysisUsage(StringRef Msg, const Pass *P,		void PMDataManager::dumpAnalysisUsage(StringRef Msg, const Pass *P,
const AnalysisUsage::VectorType &Set) const {		const AnalysisUsage::VectorType &Set) const {
assert(PassDebugging >= Details);		assert(PassDebugging >= Details);
if (Set.empty())		if (Set.empty())
return;		return;
dbgs() << (const void)P << std::string(getDepth()2+3, ' ') << Msg << " Analyses:";		dbgs() << (const void)P << std::string(getDepth()2+3, ' ') << Msg << " Analyses:";
for (unsigned i = 0; i != Set.size(); ++i) {		for (unsigned i = 0; i != Set.size(); ++i) {
if (i) dbgs() << ',';		if (i) dbgs() << ',';
const PassInfo *PInf = PassRegistry::getPassRegistry()->getPassInfo(Set[i]);		const PassInfo *PInf = TPM->findAnalysisPassInfo(Set[i]);
if (!PInf) {		if (!PInf) {
// Some preserved passes, such as AliasAnalysis, may not be initialized by		// Some preserved passes, such as AliasAnalysis, may not be initialized by
// all drivers.		// all drivers.
dbgs() << " Uninitialized Pass";		dbgs() << " Uninitialized Pass";
continue;		continue;
}		}
dbgs() << ' ' << PInf->getPassName();		dbgs() << ' ' << PInf->getPassName();
}		}
▲ Show 20 Lines • Show All 423 Lines • ▼ Show 20 Lines	void MPPassManager::addLowerLevelRequiredPass(Pass P, Pass RequiredPass) {
FunctionPassManagerImpl *FPP = OnTheFlyManagers[P];		FunctionPassManagerImpl *FPP = OnTheFlyManagers[P];
if (!FPP) {		if (!FPP) {
FPP = new FunctionPassManagerImpl();		FPP = new FunctionPassManagerImpl();
// FPP is the top level manager.		// FPP is the top level manager.
FPP->setTopLevelManager(FPP);		FPP->setTopLevelManager(FPP);

OnTheFlyManagers[P] = FPP;		OnTheFlyManagers[P] = FPP;
}		}
const PassInfo * RequiredPassPI =		const PassInfo *RequiredPassPI =
PassRegistry::getPassRegistry()->getPassInfo(RequiredPass->getPassID());		TPM->findAnalysisPassInfo(RequiredPass->getPassID());

Pass *FoundPass = nullptr;		Pass *FoundPass = nullptr;
if (RequiredPassPI && RequiredPassPI->isAnalysis()) {		if (RequiredPassPI && RequiredPassPI->isAnalysis()) {
FoundPass =		FoundPass =
((PMTopLevelManager*)FPP)->findAnalysisPass(RequiredPass->getPassID());		((PMTopLevelManager*)FPP)->findAnalysisPass(RequiredPass->getPassID());
}		}
if (!FoundPass) {		if (!FoundPass) {
FoundPass = RequiredPass;		FoundPass = RequiredPass;
▲ Show 20 Lines • Show All 252 Lines • Show Last 20 Lines