This is an archive of the discontinued LLVM Phabricator instance.

Make PredIteratorCache size() logically const. Do not require copying predecessors to get size.
Closed · Public

Authored by dberlin on Mar 12 2017, 9:41 AM.

Details

Reviewers

chandlerc
davide

Commits

rG43dab5a5fbe3: Make PredIteratorCache size() logically const. Do not require copying…
rL297733: Make PredIteratorCache size() logically const. Do not require copying…

Summary

Every single benchmark I can run, on large and small CFGs, fully
connected, etc., across 3 different platforms (x86, ARM, and PPC), says
that the current pred iterator cache is a losing proposition.

I can't find a case where it's faster than just walking preds, and in some cases, it's 5-10% slower.

The slowdown is due to copying the preds.
In the worst case, the cache also degrades into copying the entire CFG.
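
To make the copying cost concrete, here is a minimal sketch, not the LLVM implementation. `Block`, `CopyingPredCache`, and `pointersAfterQueryingAll` are hypothetical stand-ins (`Block` replaces `BasicBlock`), and the sketch assumes one null-terminated copy per queried block, as the `GetPreds` comment in the diff describes.

```cpp
#include <cstddef>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for llvm::BasicBlock: it only knows its predecessors.
struct Block {
  std::vector<Block *> Preds;
};

// Copy-on-first-query cache, loosely modeled on the get() path (assumption).
class CopyingPredCache {
  std::unordered_map<const Block *, std::vector<Block *>> Copies;

public:
  // Returns a null-terminated copy of BB's predecessor list.
  // The copy is made the first time BB is queried and reused afterwards.
  Block *const *get(const Block *BB) {
    auto It = Copies.find(BB);
    if (It == Copies.end()) {
      std::vector<Block *> Copy(BB->Preds.begin(), BB->Preds.end());
      Copy.push_back(nullptr); // null terminator
      It = Copies.emplace(BB, std::move(Copy)).first;
    }
    return It->second.data();
  }

  // Total number of pointers held by the cache, terminators included.
  std::size_t storedPointers() const {
    std::size_t N = 0;
    for (const auto &KV : Copies)
      N += KV.second.size();
    return N;
  }
};

// Builds a fully connected graph of N blocks, queries every block once,
// and reports how many pointers the cache ended up storing.
std::size_t pointersAfterQueryingAll(std::size_t N) {
  std::vector<Block> Blocks(N);
  for (std::size_t I = 0; I < N; ++I)
    for (std::size_t J = 0; J < N; ++J)
      if (I != J)
        Blocks[I].Preds.push_back(&Blocks[J]);

  CopyingPredCache Cache;
  for (const Block &B : Blocks)
    Cache.get(&B);
  return Cache.storedPointers();
}
```

Under these assumptions, querying every block of a fully connected 4-block graph stores 4 × (3 + 1) = 16 pointers: one copy of every edge plus one terminator per block.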

The one operation that is occasionally faster is the cached size.
This patch makes that operation faster by computing the count directly instead of relying on the copies being available.
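
The idea of a "logically const" size query can be sketched outside LLVM as follows. This is an illustration under stated assumptions, not the patched header: `Block` and `PredSizeCache` are hypothetical names, and a `std::forward_list` stands in for the predecessor range so that `std::distance` is a linear walk, as it is assumed to be for `pred_begin`/`pred_end`.

```cpp
#include <cstddef>
#include <forward_list>
#include <iterator>
#include <unordered_map>

// Hypothetical stand-in for llvm::BasicBlock.
struct Block {
  std::forward_list<Block *> Preds;
};

// Size-only cache: it never copies the predecessor list.
class PredSizeCache {
  // mutable: filling the memo table does not change any observable result,
  // so size() can stay const for callers.
  mutable std::unordered_map<const Block *, unsigned> Counts;

public:
  unsigned size(const Block *BB) const {
    auto It = Counts.find(BB);
    if (It != Counts.end())
      return It->second; // cache hit
    // Cache miss: count by walking the list once, then remember the result.
    unsigned N = static_cast<unsigned>(
        std::distance(BB->Preds.begin(), BB->Preds.end()));
    Counts[BB] = N;
    return N;
  }

  // Number of memoized blocks (exposed only for illustration).
  std::size_t cachedEntries() const { return Counts.size(); }

  // Must be called when the graph changes, since counts may be stale.
  void clear() { Counts.clear(); }
};

// Small usage helper: a join block with two predecessors.
unsigned joinBlockPredCount() {
  Block A, B, Join;
  Join.Preds = {&A, &B};
  PredSizeCache Cache;
  const PredSizeCache &View = Cache; // size() works through a const reference
  return View.size(&Join);
}
```

The `mutable` member is what lets `size()` be declared `const` while still memoizing; the trade-off is that the class is no longer safe to query concurrently without external synchronization.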

I'm not even sure that it is enough faster to be worth it. Again, I have
trouble finding cases where this takes long enough in a pass to be
worth caching, compared to a million other things they could cache or
improve.

My suggestion:
We next remove the get() interface.
We do stronger benchmarking of size().
We probably end up killing this entire cache.
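
One way the proposed benchmarking of size() could be set up is sketched below. This is only an illustrative harness, not the benchmark used for the numbers above: `Block`, `Timing`, `timeCounts`, `walkCount`, `CachedCounter`, and `makeChain` are all hypothetical names, and a `std::forward_list` models a predecessor range whose length must be found by walking it.

```cpp
#include <chrono>
#include <cstddef>
#include <forward_list>
#include <iterator>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for llvm::BasicBlock.
struct Block {
  std::forward_list<Block *> Preds;
};

struct Timing {
  std::size_t Total;                // checksum, keeps the loop observable
  std::chrono::nanoseconds Elapsed; // wall-clock time for all rounds
};

// Runs Count over every block Rounds times and measures the elapsed time.
template <typename CountFn>
Timing timeCounts(const std::vector<Block> &Blocks, unsigned Rounds,
                  CountFn Count) {
  auto Start = std::chrono::steady_clock::now();
  std::size_t Total = 0;
  for (unsigned R = 0; R < Rounds; ++R)
    for (const Block &B : Blocks)
      Total += Count(&B);
  auto End = std::chrono::steady_clock::now();
  return {Total,
          std::chrono::duration_cast<std::chrono::nanoseconds>(End - Start)};
}

// Strategy 1: walk the predecessor list on every query.
std::size_t walkCount(const Block *BB) {
  return static_cast<std::size_t>(
      std::distance(BB->Preds.begin(), BB->Preds.end()));
}

// Strategy 2: memoize the count per block.
struct CachedCounter {
  std::unordered_map<const Block *, std::size_t> Counts;
  std::size_t operator()(const Block *BB) {
    auto It = Counts.find(BB);
    if (It != Counts.end())
      return It->second;
    std::size_t N = walkCount(BB);
    Counts[BB] = N;
    return N;
  }
};

// Builds a straight-line chain: block I has block I-1 as its only predecessor.
std::vector<Block> makeChain(std::size_t N) {
  std::vector<Block> Blocks(N);
  for (std::size_t I = 1; I < N; ++I)
    Blocks[I].Preds.push_front(&Blocks[I - 1]);
  return Blocks;
}
```

Calling `timeCounts(Blocks, Rounds, walkCount)` and `timeCounts(Blocks, Rounds, CachedCounter{})` on the same graph should give equal `Total` values, so only `Elapsed` differs. Meaningful numbers would need realistic CFG shapes, many repetitions, and an optimized build.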

Diff Detail

Build Status

Buildable 4726
Build 4726: arc lint + arc unit

Event Timeline

dberlin created this revision. Mar 12 2017, 9:41 AM

Harbormaster completed remote builds in B4726: Diff 91499. Mar 12 2017, 9:41 AM

Herald added a subscriber: aemerson. Mar 12 2017, 9:41 AM

LGTM. I personally like your plan of removing the cache altogether, supported by benchmarks.

This revision is now accepted and ready to land. Mar 12 2017, 1:16 PM

Closed by commit rL297733: Make PredIteratorCache size() logically const. Do not require copying… (authored by dannyb). Mar 14 2017, 4:37 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path: include/llvm/IR/PredIteratorCache.h
Size: 14 lines

Diff 91499

include/llvm/IR/PredIteratorCache.h

[21 unchanged lines collapsed]

namespace llvm {		namespace llvm {

/// PredIteratorCache - This class is an extremely trivial cache for		/// PredIteratorCache - This class is an extremely trivial cache for
/// predecessor iterator queries. This is useful for code that repeatedly		/// predecessor iterator queries. This is useful for code that repeatedly
/// wants the predecessor list for the same blocks.		/// wants the predecessor list for the same blocks.
class PredIteratorCache {		class PredIteratorCache {
/// BlockToPredsMap - Pointer to null-terminated list.		/// BlockToPredsMap - Pointer to null-terminated list.
DenseMap<BasicBlock *, BasicBlock **> BlockToPredsMap;		mutable DenseMap<BasicBlock *, BasicBlock **> BlockToPredsMap;
DenseMap<BasicBlock *, unsigned> BlockToPredCountMap;		mutable DenseMap<BasicBlock *, unsigned> BlockToPredCountMap;

/// Memory - This is the space that holds cached preds.		/// Memory - This is the space that holds cached preds.
BumpPtrAllocator Memory;		BumpPtrAllocator Memory;

private:		private:
/// GetPreds - Get a cached list for the null-terminated predecessor list of		/// GetPreds - Get a cached list for the null-terminated predecessor list of
/// the specified block. This can be used in a loop like this:		/// the specified block. This can be used in a loop like this:
/// for (BasicBlock **PI = PredCache->GetPreds(BB); *PI; ++PI)		/// for (BasicBlock **PI = PredCache->GetPreds(BB); *PI; ++PI)
[10 unchanged lines collapsed; enclosing function: BasicBlock **GetPreds(BasicBlock *BB) {]

BlockToPredCountMap[BB] = PredCache.size() - 1;		BlockToPredCountMap[BB] = PredCache.size() - 1;

Entry = Memory.Allocate<BasicBlock *>(PredCache.size());		Entry = Memory.Allocate<BasicBlock *>(PredCache.size());
std::copy(PredCache.begin(), PredCache.end(), Entry);		std::copy(PredCache.begin(), PredCache.end(), Entry);
return Entry;		return Entry;
}		}

unsigned GetNumPreds(BasicBlock *BB) {		unsigned GetNumPreds(BasicBlock *BB) const {
GetPreds(BB);		auto Result = BlockToPredCountMap.find(BB);
return BlockToPredCountMap[BB];		if (Result != BlockToPredCountMap.end())
		return Result->second;
		return BlockToPredCountMap[BB] = std::distance(pred_begin(BB), pred_end(BB));
}		}

public:		public:
size_t size(BasicBlock *BB) { return GetNumPreds(BB); }		size_t size(BasicBlock *BB) const { return GetNumPreds(BB); }
ArrayRef<BasicBlock *> get(BasicBlock *BB) {		ArrayRef<BasicBlock *> get(BasicBlock *BB) {
return makeArrayRef(GetPreds(BB), GetNumPreds(BB));		return makeArrayRef(GetPreds(BB), GetNumPreds(BB));
}		}

/// clear - Remove all information.		/// clear - Remove all information.
void clear() {		void clear() {
BlockToPredsMap.clear();		BlockToPredsMap.clear();
BlockToPredCountMap.clear();		BlockToPredCountMap.clear();
Memory.Reset();		Memory.Reset();
}		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif		#endif