Download Raw Diff

Details

Reviewers

nikic
reames
lebedev.ri
efriedma

Commits

rGdbab339ea44e: [SCEV][NFC] Apply mass forgetMemoizedResults queries where possible

Summary

When forgetting multiple SCEVs, rather than doing this one by one, we can
instead use mass updates. We plan to make them more efficient than they
are now, potentially improving compile time.

Diff Detail

Event Timeline

mkazantsev created this revision.Oct 11 2021, 11:00 PM

Herald added subscribers: javed.absar, hiraditya. · View Herald TranscriptOct 11 2021, 11:00 PM

mkazantsev requested review of this revision.Oct 11 2021, 11:00 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 11 2021, 11:01 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

mkazantsev added a parent revision: D111533: [SCEV] Invalidate user SCEVs along with operand SCEVs to avoid cache corruption.Oct 11 2021, 11:01 PM

mkazantsev updated this revision to Diff 378886.Oct 11 2021, 11:29 PM

mkazantsev mentioned this in D111533: [SCEV] Invalidate user SCEVs along with operand SCEVs to avoid cache corruption.

Harbormaster completed remote builds in B128270: Diff 378886.Oct 12 2021, 12:17 AM

This is an improvement in some cases, but less than I'd hoped: https://llvm-compile-time-tracker.com/compare.php?from=5e2c6abab42241c06680941be06fddcb9279e63d&to=18b24fb231b0dcbaf0dc304826974ac1aad77a01&stat=instructions

llvm/lib/Analysis/ScalarEvolution.cpp
7576	We should be collecting these into a vector as well.
12813	You can just pass `S` directly, no need to create SmallVector for single-element ArrayRef.

I think we can do more of this kind, e.g. in forgetLoops. Will look into it more.

llvm/lib/Analysis/ScalarEvolution.cpp
7576	Good point!

Found some more places where it can be done. @nikic could you please check the CT now?

Harbormaster completed remote builds in B128588: Diff 379344.Oct 13 2021, 5:53 AM

Updated version looks better: https://llvm-compile-time-tracker.com/compare.php?from=5e2c6abab42241c06680941be06fddcb9279e63d&to=82329ed73bb4100f3641703982cb661f77b95344&stat=instructions (don't mind SPASS, that's noise)

And here's both patches together: http://llvm-compile-time-tracker.com/compare.php?from=943b3048484b7e3cf04f4d51c23c82fcece2185d&to=82329ed73bb4100f3641703982cb661f77b95344&stat=instructions

llvm/lib/Analysis/ScalarEvolution.cpp
7494	Just wondering, does the problem described in this comment affect the new invalidation mechanism?

mkazantsev added inline comments.Oct 13 2021, 7:51 PM

llvm/lib/Analysis/ScalarEvolution.cpp
7494	If I'm reading this correctly, the new mechanism will drop trip count cache. The notion that we can sometimes forget something not for correctness purposes, but for the sole purpose of getting better results, is interesting. If it doesn't lead to any dangling pointers or directly wrong cached results, maybe we should consider non-recursive invalidation here.

mkazantsev added inline comments.Oct 13 2021, 9:10 PM

llvm/lib/Analysis/ScalarEvolution.cpp
7494	Giving it some more thought, I think this one might be a source of bugs. Imagine that exit count depends on a hypothetical SCEV (not corresponding to any existing instruction directly) which, in turn, uses value being forgotten. When it is destroyed, how do we ensure that we'll not end up with exit count referencing (indirectly) a dangling pointer?

Max, a suggestion review structure wise. If you flip the order of this patch and the one it's currently built on (i.e. by adding a forgetMemoizedResults arrayref wrapper), this can be easily landed as NFC. It won't really have any benefit compile time wise on it's own, but the change itself is not unreasonable. Inverting the review stack this way makes it easier to track the overall state of what's in review.

In fact, if you want to do that, you can consider this a conditional LGTM.

Two minor opportunities to exploit set invalidation for efficiency. I'll leave it to you whether these are worth separate review or not.

llvm/lib/Analysis/ScalarEvolution.cpp
12839	There's an opportunity to exploit the invalidation of sets at a time here by replacing multiple walks, with one walk and a set membership check.
12849	Same here. Though, I don't believe we have an set optimized contains_any already, so this might not be worth bothering with.

Ok, I also think that this one can go first.

mkazantsev removed a parent revision: D111533: [SCEV] Invalidate user SCEVs along with operand SCEVs to avoid cache corruption.Oct 21 2021, 10:34 PM

mkazantsev mentioned this in D112293: [SCEV][NFC] Introduce API for mass forgetMemoizedResults query.Oct 21 2021, 11:26 PM

mkazantsev updated this revision to Diff 381469.Oct 21 2021, 11:28 PM

mkazantsev retitled this revision from [SCEV][NFC] Tackle quadratic CT consumption when forgetting memoized results to [SCEV][NFC] Apply pass forgetMemoizedResults queries where possible.

mkazantsev edited the summary of this revision. (Show Details)

mkazantsev added reviewers: lebedev.ri, efriedma.

mkazantsev added a parent revision: D112293: [SCEV][NFC] Introduce API for mass forgetMemoizedResults query.

mkazantsev added inline comments.

llvm/lib/Analysis/ScalarEvolution.cpp
12839	Good point, I'll try to make this in a follow-up.

mkazantsev retitled this revision from [SCEV][NFC] Apply pass forgetMemoizedResults queries where possible to [SCEV][NFC] Apply mass forgetMemoizedResults queries where possible.Oct 21 2021, 11:29 PM

Harbormaster completed remote builds in B130092: Diff 381469.Oct 21 2021, 11:37 PM

mkazantsev added a child revision: D112294: [SCEV][NFC] Win some compile time from mass forgetMemoizedResults.Oct 22 2021, 12:20 AM

LGTM

This revision is now accepted and ready to land.Oct 22 2021, 10:11 AM

mkazantsev mentioned this in rGa6096b7f9ed3: [SCEV][NFC] Introduce API for mass forgetMemoizedResults query.Oct 24 2021, 11:49 PM

This revision was landed with ongoing or failed builds.Oct 25 2021, 12:05 AM

Closed by commit rGdbab339ea44e: [SCEV][NFC] Apply mass forgetMemoizedResults queries where possible (authored by mkazantsev). · Explain Why

This revision was automatically updated to reflect the committed changes.

mkazantsev added a commit: rGdbab339ea44e: [SCEV][NFC] Apply mass forgetMemoizedResults queries where possible.

Diff 378886

llvm/include/llvm/Analysis/ScalarEvolution.h

Show First 20 Lines • Show All 1,883 Lines • ▼ Show 20 Lines	private:

/// Try to match the Expr as "(L + R)<Flags>".		/// Try to match the Expr as "(L + R)<Flags>".
bool splitBinaryAdd(const SCEV Expr, const SCEV &L, const SCEV *&R,		bool splitBinaryAdd(const SCEV Expr, const SCEV &L, const SCEV *&R,
SCEV::NoWrapFlags &Flags);		SCEV::NoWrapFlags &Flags);

/// Drop memoized information computed for S.		/// Drop memoized information computed for S.
void forgetMemoizedResults(const SCEV *S);		void forgetMemoizedResults(const SCEV *S);

		/// Drop memoized information for all \p SCEVs.
		void forgetMemoizedResults(ArrayRef<const SCEV *> SCEVs);

void forgetMemoizedResultsImpl(const SCEV *S);		void forgetMemoizedResultsImpl(const SCEV *S);

/// Return an existing SCEV for V if there is one, otherwise return nullptr.		/// Return an existing SCEV for V if there is one, otherwise return nullptr.
const SCEV getExistingSCEV(Value V);		const SCEV getExistingSCEV(Value V);

/// Return false iff given SCEV contains a SCEVUnknown with NULL value-		/// Return false iff given SCEV contains a SCEVUnknown with NULL value-
/// pointer.		/// pointer.
bool checkValidity(const SCEV *S) const;		bool checkValidity(const SCEV *S) const;
▲ Show 20 Lines • Show All 307 Lines • Show Last 20 Lines

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,485 Lines • ▼ Show 20 Lines	while (!Worklist.empty()) {
// loop1:		// loop1:
// %pn1 = phi		// %pn1 = phi
// ...		// ...
//		//
// where both loop0 and loop1's backedge taken count uses the SCEV		// where both loop0 and loop1's backedge taken count uses the SCEV
// expression for %v. If we don't have the early stop below then in cases		// expression for %v. If we don't have the early stop below then in cases
// like the above, getBackedgeTakenInfo(loop1) will clear out the trip		// like the above, getBackedgeTakenInfo(loop1) will clear out the trip
// count for loop0 and getBackedgeTakenInfo(loop0) will clear out the trip		// count for loop0 and getBackedgeTakenInfo(loop0) will clear out the trip
// count for loop1, effectively nullifying SCEV's trip count cache.		// count for loop1, effectively nullifying SCEV's trip count cache.
		nikicUnsubmitted Not Done Reply Inline Actions Just wondering, does the problem described in this comment affect the new invalidation mechanism? nikic: Just wondering, does the problem described in this comment affect the new invalidation…
		mkazantsevAuthorUnsubmitted Done Reply Inline Actions If I'm reading this correctly, the new mechanism will drop trip count cache. The notion that we can sometimes forget something not for correctness purposes, but for the sole purpose of getting better results, is interesting. If it doesn't lead to any dangling pointers or directly wrong cached results, maybe we should consider non-recursive invalidation here. mkazantsev: If I'm reading this correctly, the new mechanism will drop trip count cache. The notion that we…
		mkazantsevAuthorUnsubmitted Done Reply Inline Actions Giving it some more thought, I think this one might be a source of bugs. Imagine that exit count depends on a hypothetical SCEV (not corresponding to any existing instruction directly) which, in turn, uses value being forgotten. When it is destroyed, how do we ensure that we'll not end up with exit count referencing (indirectly) a dangling pointer? mkazantsev: Giving it some more thought, I think this one might be a source of bugs. Imagine that exit…
for (auto *U : I->users())		for (auto *U : I->users())
if (auto *I = dyn_cast<Instruction>(U)) {		if (auto *I = dyn_cast<Instruction>(U)) {
auto *LoopForUser = LI.getLoopFor(I->getParent());		auto *LoopForUser = LI.getLoopFor(I->getParent());
if (LoopForUser && L->contains(LoopForUser) &&		if (LoopForUser && L->contains(LoopForUser) &&
Discovered.insert(I).second)		Discovered.insert(I).second)
Worklist.push_back(I);		Worklist.push_back(I);
}		}
}		}
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	for (auto I = PredicatedSCEVRewrites.begin();
if (Entry.second == CurrL)		if (Entry.second == CurrL)
PredicatedSCEVRewrites.erase(I++);		PredicatedSCEVRewrites.erase(I++);
else		else
++I;		++I;
}		}

auto LoopUsersItr = LoopUsers.find(CurrL);		auto LoopUsersItr = LoopUsers.find(CurrL);
if (LoopUsersItr != LoopUsers.end()) {		if (LoopUsersItr != LoopUsers.end()) {
for (auto *S : LoopUsersItr->second)		forgetMemoizedResults(LoopUsersItr->second);
forgetMemoizedResults(S);
LoopUsers.erase(LoopUsersItr);		LoopUsers.erase(LoopUsersItr);
}		}

// Drop information about expressions based on loop-header PHIs.		// Drop information about expressions based on loop-header PHIs.
PushLoopPHIs(CurrL, Worklist);		PushLoopPHIs(CurrL, Worklist);

while (!Worklist.empty()) {		while (!Worklist.empty()) {
Instruction *I = Worklist.pop_back_val();		Instruction *I = Worklist.pop_back_val();
if (!Visited.insert(I).second)		if (!Visited.insert(I).second)
continue;		continue;

ValueExprMapType::iterator It =		ValueExprMapType::iterator It =
ValueExprMap.find_as(static_cast<Value *>(I));		ValueExprMap.find_as(static_cast<Value *>(I));
if (It != ValueExprMap.end()) {		if (It != ValueExprMap.end()) {
eraseValueFromMap(It->first);		eraseValueFromMap(It->first);
forgetMemoizedResults(It->second);		forgetMemoizedResults(It->second);
		nikicUnsubmitted Not Done Reply Inline Actions We should be collecting these into a vector as well. nikic: We should be collecting these into a vector as well.
		mkazantsevAuthorUnsubmitted Done Reply Inline Actions Good point! mkazantsev: Good point!
if (PHINode *PN = dyn_cast<PHINode>(I))		if (PHINode *PN = dyn_cast<PHINode>(I))
ConstantEvolutionLoopExitValue.erase(PN);		ConstantEvolutionLoopExitValue.erase(PN);
}		}

PushDefUseChildren(I, Worklist);		PushDefUseChildren(I, Worklist);
}		}

LoopPropertiesCache.erase(CurrL);		LoopPropertiesCache.erase(CurrL);
Show All 13 Lines	void ScalarEvolution::forgetValue(Value *V) {
Instruction *I = dyn_cast<Instruction>(V);		Instruction *I = dyn_cast<Instruction>(V);
if (!I) return;		if (!I) return;

// Drop information about expressions based on loop-header PHIs.		// Drop information about expressions based on loop-header PHIs.
SmallVector<Instruction *, 16> Worklist;		SmallVector<Instruction *, 16> Worklist;
Worklist.push_back(I);		Worklist.push_back(I);

SmallPtrSet<Instruction *, 8> Visited;		SmallPtrSet<Instruction *, 8> Visited;
		SmallVector<const SCEV *, 8> ToForget;
while (!Worklist.empty()) {		while (!Worklist.empty()) {
I = Worklist.pop_back_val();		I = Worklist.pop_back_val();
if (!Visited.insert(I).second)		if (!Visited.insert(I).second)
continue;		continue;

ValueExprMapType::iterator It =		ValueExprMapType::iterator It =
ValueExprMap.find_as(static_cast<Value *>(I));		ValueExprMap.find_as(static_cast<Value *>(I));
if (It != ValueExprMap.end()) {		if (It != ValueExprMap.end()) {
eraseValueFromMap(It->first);		eraseValueFromMap(It->first);
forgetMemoizedResults(It->second);		ToForget.push_back(It->second);
if (PHINode *PN = dyn_cast<PHINode>(I))		if (PHINode *PN = dyn_cast<PHINode>(I))
ConstantEvolutionLoopExitValue.erase(PN);		ConstantEvolutionLoopExitValue.erase(PN);
}		}

PushDefUseChildren(I, Worklist);		PushDefUseChildren(I, Worklist);
}		}
		forgetMemoizedResults(ToForget);
}		}

void ScalarEvolution::forgetLoopDispositions(const Loop *L) {		void ScalarEvolution::forgetLoopDispositions(const Loop *L) {
LoopDispositions.clear();		LoopDispositions.clear();
}		}

/// Get the exact loop backedge taken count considering all loop exits. A		/// Get the exact loop backedge taken count considering all loop exits. A
/// computable result can only be returned for loops with all exiting blocks		/// computable result can only be returned for loops with all exiting blocks
▲ Show 20 Lines • Show All 5,171 Lines • ▼ Show 20 Lines	bool ScalarEvolution::properlyDominates(const SCEV S, const BasicBlock BB) {
return getBlockDisposition(S, BB) == ProperlyDominatesBlock;		return getBlockDisposition(S, BB) == ProperlyDominatesBlock;
}		}

bool ScalarEvolution::hasOperand(const SCEV S, const SCEV Op) const {		bool ScalarEvolution::hasOperand(const SCEV S, const SCEV Op) const {
return SCEVExprContains(S, [&](const SCEV *Expr) { return Expr == Op; });		return SCEVExprContains(S, [&](const SCEV *Expr) { return Expr == Op; });
}		}

void ScalarEvolution::forgetMemoizedResults(const SCEV *S) {		void ScalarEvolution::forgetMemoizedResults(const SCEV *S) {
		SmallVector<const SCEV *> ToForget;
		ToForget.push_back(S);
		forgetMemoizedResults(ToForget);
		nikicUnsubmitted Not Done Reply Inline Actions You can just pass `S` directly, no need to create SmallVector for single-element ArrayRef. nikic: You can just pass `S` directly, no need to create SmallVector for single-element ArrayRef.
		}

		void ScalarEvolution::forgetMemoizedResults(ArrayRef<const SCEV *> SCEVs) {
SmallPtrSet<const SCEV *, 16> Visited;		SmallPtrSet<const SCEV *, 16> Visited;
SmallVector<const SCEV *, 16> Worklist;		SmallVector<const SCEV *, 16> Worklist;
Visited.insert(S);		for (auto *S : SCEVs)
		if (Visited.insert(S).second)
Worklist.push_back(S);		Worklist.push_back(S);

while (!Worklist.empty()) {		while (!Worklist.empty()) {
const SCEV *Curr = Worklist.pop_back_val();		const SCEV *Curr = Worklist.pop_back_val();
auto Users = SCEVUsers.find(Curr);		auto Users = SCEVUsers.find(Curr);
if (Users != SCEVUsers.end())		if (Users != SCEVUsers.end())
for (auto *User : Users->second)		for (auto *User : Users->second)
if (Visited.insert(User).second)		if (Visited.insert(User).second)
Worklist.push_back(User);		Worklist.push_back(User);
}		}
Show All 9 Lines	void ScalarEvolution::forgetMemoizedResultsImpl(const SCEV *S) {
SignedRanges.erase(S);		SignedRanges.erase(S);
ExprValueMap.erase(S);		ExprValueMap.erase(S);
HasRecMap.erase(S);		HasRecMap.erase(S);
MinTrailingZerosCache.erase(S);		MinTrailingZerosCache.erase(S);

for (auto I = PredicatedSCEVRewrites.begin();		for (auto I = PredicatedSCEVRewrites.begin();
I != PredicatedSCEVRewrites.end();) {		I != PredicatedSCEVRewrites.end();) {
std::pair<const SCEV , const Loop > Entry = I->first;		std::pair<const SCEV , const Loop > Entry = I->first;
if (Entry.first == S)		if (Entry.first == S)
reamesUnsubmitted Not Done Reply Inline Actions There's an opportunity to exploit the invalidation of sets at a time here by replacing multiple walks, with one walk and a set membership check. reames: There's an opportunity to exploit the invalidation of sets at a time here by replacing multiple…
mkazantsevAuthorUnsubmitted Done Reply Inline Actions Good point, I'll try to make this in a follow-up. mkazantsev: Good point, I'll try to make this in a follow-up.
PredicatedSCEVRewrites.erase(I++);		PredicatedSCEVRewrites.erase(I++);
else		else
++I;		++I;
}		}

auto RemoveSCEVFromBackedgeMap =		auto RemoveSCEVFromBackedgeMap =
[S](DenseMap<const Loop *, BackedgeTakenInfo> &Map) {		[S](DenseMap<const Loop *, BackedgeTakenInfo> &Map) {
for (auto I = Map.begin(), E = Map.end(); I != E;) {		for (auto I = Map.begin(), E = Map.end(); I != E;) {
BackedgeTakenInfo &BEInfo = I->second;		BackedgeTakenInfo &BEInfo = I->second;
if (BEInfo.hasOperand(S))		if (BEInfo.hasOperand(S))
reamesUnsubmitted Not Done Reply Inline Actions Same here. Though, I don't believe we have an set optimized contains_any already, so this might not be worth bothering with. reames: Same here. Though, I don't believe we have an set optimized contains_any already, so this…
Map.erase(I++);		Map.erase(I++);
else		else
++I;		++I;
}		}
};		};

RemoveSCEVFromBackedgeMap(BackedgeTakenCounts);		RemoveSCEVFromBackedgeMap(BackedgeTakenCounts);
RemoveSCEVFromBackedgeMap(PredicatedBackedgeTakenCounts);		RemoveSCEVFromBackedgeMap(PredicatedBackedgeTakenCounts);
▲ Show 20 Lines • Show All 930 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SCEV][NFC] Apply mass forgetMemoizedResults queries where possible
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 378886

llvm/include/llvm/Analysis/ScalarEvolution.h

llvm/lib/Analysis/ScalarEvolution.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[SCEV][NFC] Apply mass forgetMemoizedResults queries where possibleClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 378886

llvm/include/llvm/Analysis/ScalarEvolution.h

llvm/lib/Analysis/ScalarEvolution.cpp

[SCEV][NFC] Apply mass forgetMemoizedResults queries where possible
ClosedPublic