This is an archive of the discontinued LLVM Phabricator instance.

PDB HashTable: Move TraitsT from class parameter to the methods that need it
ClosedPublic

Authored by thakis on Jul 12 2019, 8:21 AM.

Download Raw Diff

Details

Reviewers

Commits

rG51a52b58930c: PDB HashTable: Move TraitsT from class parameter to the methods that need it
rL365974: PDB HashTable: Move TraitsT from class parameter to the methods that need it

Summary

The traits object is only used by a few methods. Deserializing a hash
table and walking it is possible without the traits object, so it
shouldn't be required to build a dummy object for that use case.

The TraitsT object used to be a function template parameter before
r327647, this restores it to that state.

This makes it clear that the traits object isn't needed at all in 1 of
the current 3 uses of HashTable (and I am going to add another use that
doesn't need it), and that the default PdbHashTraits isn't used outside
of tests.

While here, also re-enable 3 checks in the test that were commented out
(which requires making HashTableInternals templated and giving FooBar
an operator==).

No intended behavior change.

Diff Detail

Event Timeline

thakis created this revision.Jul 12 2019, 8:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 12 2019, 8:21 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

thakis added a child revision: D64641: PDB HashTable: Make iterator type const.Jul 12 2019, 8:36 AM

The traits object is only used by a few methods. Deserializing a hash
table and walking it is possible without the traits object, so it
shouldn't be required to build a dummy object for that use case.

The traits are needed to do any actual hashing. I think I have a bit of a preference for the code as it is before your change. Now all the hash lookup operations have to take an extra parameter, and the caller is responsible for managing the lifetime of a new object that probably should exactly match the lifetime of the hash table.

What do you think of having a traits-less HashTableView or HashTableBase and having HashTable inherit from it? Would that solve your use case?

In D64640#1583265, @rnk wrote:

The traits object is only used by a few methods. Deserializing a hash
table and walking it is possible without the traits object, so it
shouldn't be required to build a dummy object for that use case.

The traits are needed to do any actual hashing.

I believe that's not really true: The has key is always an uint32_t, and assuming that lookupKeyToStorageKey() and storageKeyToLookupKey() are inverses of each other (which currently is true for StringTableHashTraits; for NamedStreamMapTraits I'm not yet sure what the semantics for two streams with the same name are. I believe it's not allowed – passing two identical /natvis: params to lld seems to confuse everyone at least -- and then it'd be true there too). So I currently think (but could be wrong, I'm not sure yet) that the hashtable hash function is overly complicated and should always work just on the uint32_t key and there should be a convenience function to convert a string key to an uint32_t. But that's for another change.

I think I have a bit of a preference for the code as it is before your change. Now all the hash lookup operations have to take an extra parameter, and the caller is responsible for managing the lifetime of a new object that probably should exactly match the lifetime of the hash table.

The callers did manage the traits object before as well. Maybe that wasn't necessary, but see PDBFileBuilder.h and llvm/include/llvm/DebugInfo/PDB/Native/NamedStreamMap.h on the lhs – the traits objects were already member variables. There are 3 distinct call sites to the lookup operations from non-test code.

What do you think of having a traits-less HashTableView or HashTableBase and having HashTable inherit from it? Would that solve your use case?

https://reviews.llvm.org/D64428?id=208735 used to do this. It was imo much messier (see the "drat" comments for some unresolved issues with that approach, but even with them resolved it's pretty messy). I then left the hash table as is and added another traits class that was silently unused in https://reviews.llvm.org/D64428?id=209475 but didn't like that either (see "xxx blah" for an annoying issue there – a function that's silently unused can't be implemented nicely). I think this change is by far the nicest.

In D64640#1583291, @thakis wrote:

In D64640#1583265, @rnk wrote:

The traits object is only used by a few methods. Deserializing a hash
table and walking it is possible without the traits object, so it
shouldn't be required to build a dummy object for that use case.

The traits are needed to do any actual hashing.

I believe that's not really true: The has key is always an uint32_t, and assuming that lookupKeyToStorageKey() and storageKeyToLookupKey() are inverses of each other (which currently is true for StringTableHashTraits; for NamedStreamMapTraits I'm not yet sure what the semantics for two streams with the same name are. I believe it's not allowed – passing two identical /natvis: params to lld seems to confuse everyone at least -- and then it'd be true there too). So I currently think (but could be wrong, I'm not sure yet) that the hashtable hash function is overly complicated and should always work just on the uint32_t key and there should be a convenience function to convert a string key to an uint32_t. But that's for another change.

Hey, I like the sound of that: who needs traits templates, we can just use functions. Insofar as this is a step in that direction, I'm on board.

I think I have a bit of a preference for the code as it is before your change. Now all the hash lookup operations have to take an extra parameter, and the caller is responsible for managing the lifetime of a new object that probably should exactly match the lifetime of the hash table.

The callers did manage the traits object before as well. Maybe that wasn't necessary, but see PDBFileBuilder.h and llvm/include/llvm/DebugInfo/PDB/Native/NamedStreamMap.h on the lhs – the traits objects were already member variables. There are 3 distinct call sites to the lookup operations from non-test code.

What do you think of having a traits-less HashTableView or HashTableBase and having HashTable inherit from it? Would that solve your use case?

https://reviews.llvm.org/D64428?id=208735 used to do this. It was imo much messier (see the "drat" comments for some unresolved issues with that approach, but even with them resolved it's pretty messy). I then left the hash table as is and added another traits class that was silently unused in https://reviews.llvm.org/D64428?id=209475 but didn't like that either (see "xxx blah" for an annoying issue there – a function that's silently unused can't be implemented nicely). I think this change is by far the nicest.

I took a look at the linked stuff, but mostly I'm convinced that you've thought about these other approaches and I'll trust that you've made the right judgement here, looks good to me.

This revision is now accepted and ready to land.Jul 12 2019, 4:03 PM

Closed by commit rL365974: PDB HashTable: Move TraitsT from class parameter to the methods that need it (authored by nico). · Explain WhyJul 12 2019, 4:32 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

DebugInfo/

PDB/

Native/

HashTable.h

63 lines

NamedStreamMap.h

2 lines

PDBFileBuilder.h

2 lines

lib/

DebugInfo/

PDB/

Native/

NamedStreamMap.cpp

7 lines

PDBFileBuilder.cpp

5 lines

unittests/

DebugInfo/

PDB/

HashTableTest.cpp

119 lines

Diff 209492

llvm/include/llvm/DebugInfo/PDB/Native/HashTable.h

Show All 25 Lines
class BinaryStreamReader;		class BinaryStreamReader;
class BinaryStreamWriter;		class BinaryStreamWriter;

namespace pdb {		namespace pdb {

Error readSparseBitVector(BinaryStreamReader &Stream, SparseBitVector<> &V);		Error readSparseBitVector(BinaryStreamReader &Stream, SparseBitVector<> &V);
Error writeSparseBitVector(BinaryStreamWriter &Writer, SparseBitVector<> &Vec);		Error writeSparseBitVector(BinaryStreamWriter &Writer, SparseBitVector<> &Vec);

template <typename ValueT, typename TraitsT> class HashTable;		template <typename ValueT> class HashTable;

template <typename ValueT, typename TraitsT>		template <typename ValueT>
class HashTableIterator		class HashTableIterator
: public iterator_facade_base<HashTableIterator<ValueT, TraitsT>,		: public iterator_facade_base<HashTableIterator<ValueT>,
std::forward_iterator_tag,		std::forward_iterator_tag,
std::pair<uint32_t, ValueT>> {		std::pair<uint32_t, ValueT>> {
friend HashTable<ValueT, TraitsT>;		friend HashTable<ValueT>;

HashTableIterator(const HashTable<ValueT, TraitsT> &Map, uint32_t Index,		HashTableIterator(const HashTable<ValueT> &Map, uint32_t Index,
bool IsEnd)		bool IsEnd)
: Map(&Map), Index(Index), IsEnd(IsEnd) {}		: Map(&Map), Index(Index), IsEnd(IsEnd) {}

public:		public:
HashTableIterator(const HashTable<ValueT, TraitsT> &Map) : Map(&Map) {		HashTableIterator(const HashTable<ValueT> &Map) : Map(&Map) {
int I = Map.Present.find_first();		int I = Map.Present.find_first();
if (I == -1) {		if (I == -1) {
Index = 0;		Index = 0;
IsEnd = true;		IsEnd = true;
} else {		} else {
Index = static_cast<uint32_t>(I);		Index = static_cast<uint32_t>(I);
IsEnd = false;		IsEnd = false;
}		}
Show All 25 Lines	HashTableIterator &operator++() {
IsEnd = true;		IsEnd = true;
return *this;		return *this;
}		}

private:		private:
bool isEnd() const { return IsEnd; }		bool isEnd() const { return IsEnd; }
uint32_t index() const { return Index; }		uint32_t index() const { return Index; }

const HashTable<ValueT, TraitsT> *Map;		const HashTable<ValueT> *Map;
uint32_t Index;		uint32_t Index;
bool IsEnd;		bool IsEnd;
};		};

template <typename T> struct PdbHashTraits {};		template <typename ValueT>

template <> struct PdbHashTraits<uint32_t> {
uint32_t hashLookupKey(uint32_t N) const { return N; }
uint32_t storageKeyToLookupKey(uint32_t N) const { return N; }
uint32_t lookupKeyToStorageKey(uint32_t N) { return N; }
};

template <typename ValueT, typename TraitsT = PdbHashTraits<ValueT>>
class HashTable {		class HashTable {
using iterator = HashTableIterator<ValueT, TraitsT>;		using iterator = HashTableIterator<ValueT>;
friend iterator;		friend iterator;

struct Header {		struct Header {
support::ulittle32_t Size;		support::ulittle32_t Size;
support::ulittle32_t Capacity;		support::ulittle32_t Capacity;
};		};

using BucketList = std::vector<std::pair<uint32_t, ValueT>>;		using BucketList = std::vector<std::pair<uint32_t, ValueT>>;

public:		public:
HashTable() { Buckets.resize(8); }		HashTable() { Buckets.resize(8); }
		explicit HashTable(uint32_t Capacity) {
explicit HashTable(TraitsT Traits) : HashTable(8, std::move(Traits)) {}
HashTable(uint32_t Capacity, TraitsT Traits) : Traits(Traits) {
Buckets.resize(Capacity);		Buckets.resize(Capacity);
}		}

Error load(BinaryStreamReader &Stream) {		Error load(BinaryStreamReader &Stream) {
const Header *H;		const Header *H;
if (auto EC = Stream.readObject(H))		if (auto EC = Stream.readObject(H))
return EC;		return EC;
if (H->Capacity == 0)		if (H->Capacity == 0)
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	public:
uint32_t capacity() const { return Buckets.size(); }		uint32_t capacity() const { return Buckets.size(); }
uint32_t size() const { return Present.count(); }		uint32_t size() const { return Present.count(); }

iterator begin() const { return iterator(*this); }		iterator begin() const { return iterator(*this); }
iterator end() const { return iterator(*this, 0, true); }		iterator end() const { return iterator(*this, 0, true); }

/// Find the entry whose key has the specified hash value, using the specified		/// Find the entry whose key has the specified hash value, using the specified
/// traits defining hash function and equality.		/// traits defining hash function and equality.
template <typename Key> iterator find_as(const Key &K) const {		template <typename Key, typename TraitsT>
		iterator find_as(const Key &K, TraitsT &Traits) const {
uint32_t H = Traits.hashLookupKey(K) % capacity();		uint32_t H = Traits.hashLookupKey(K) % capacity();
uint32_t I = H;		uint32_t I = H;
Optional<uint32_t> FirstUnused;		Optional<uint32_t> FirstUnused;
do {		do {
if (isPresent(I)) {		if (isPresent(I)) {
if (Traits.storageKeyToLookupKey(Buckets[I].first) == K)		if (Traits.storageKeyToLookupKey(Buckets[I].first) == K)
return iterator(*this, I, false);		return iterator(*this, I, false);
} else {		} else {
Show All 14 Lines	iterator find_as(const Key &K, TraitsT &Traits) const {
// table were Present. But this would violate the load factor constraints		// table were Present. But this would violate the load factor constraints
// that we impose, so it should never happen.		// that we impose, so it should never happen.
assert(FirstUnused);		assert(FirstUnused);
return iterator(this, FirstUnused, true);		return iterator(this, FirstUnused, true);
}		}

/// Set the entry using a key type that the specified Traits can convert		/// Set the entry using a key type that the specified Traits can convert
/// from a real key to an internal key.		/// from a real key to an internal key.
template <typename Key> bool set_as(const Key &K, ValueT V) {		template <typename Key, typename TraitsT>
return set_as_internal(K, std::move(V), None);		bool set_as(const Key &K, ValueT V, TraitsT &Traits) {
		return set_as_internal(K, std::move(V), Traits, None);
}		}

template <typename Key> ValueT get(const Key &K) const {		template <typename Key, typename TraitsT>
auto Iter = find_as(K);		ValueT get(const Key &K, TraitsT &Traits) const {
		auto Iter = find_as(K, Traits);
assert(Iter != end());		assert(Iter != end());
return (*Iter).second;		return (*Iter).second;
}		}

protected:		protected:
bool isPresent(uint32_t K) const { return Present.test(K); }		bool isPresent(uint32_t K) const { return Present.test(K); }
bool isDeleted(uint32_t K) const { return Deleted.test(K); }		bool isDeleted(uint32_t K) const { return Deleted.test(K); }

TraitsT Traits;
BucketList Buckets;		BucketList Buckets;
mutable SparseBitVector<> Present;		mutable SparseBitVector<> Present;
mutable SparseBitVector<> Deleted;		mutable SparseBitVector<> Deleted;

private:		private:
/// Set the entry using a key type that the specified Traits can convert		/// Set the entry using a key type that the specified Traits can convert
/// from a real key to an internal key.		/// from a real key to an internal key.
template <typename Key>		template <typename Key, typename TraitsT>
bool set_as_internal(const Key &K, ValueT V, Optional<uint32_t> InternalKey) {		bool set_as_internal(const Key &K, ValueT V, TraitsT &Traits,
auto Entry = find_as(K);		Optional<uint32_t> InternalKey) {
		auto Entry = find_as(K, Traits);
if (Entry != end()) {		if (Entry != end()) {
assert(isPresent(Entry.index()));		assert(isPresent(Entry.index()));
assert(Traits.storageKeyToLookupKey(Buckets[Entry.index()].first) == K);		assert(Traits.storageKeyToLookupKey(Buckets[Entry.index()].first) == K);
// We're updating, no need to do anything special.		// We're updating, no need to do anything special.
Buckets[Entry.index()].second = V;		Buckets[Entry.index()].second = V;
return false;		return false;
}		}

auto &B = Buckets[Entry.index()];		auto &B = Buckets[Entry.index()];
assert(!isPresent(Entry.index()));		assert(!isPresent(Entry.index()));
assert(Entry.isEnd());		assert(Entry.isEnd());
B.first = InternalKey ? *InternalKey : Traits.lookupKeyToStorageKey(K);		B.first = InternalKey ? *InternalKey : Traits.lookupKeyToStorageKey(K);
B.second = V;		B.second = V;
Present.set(Entry.index());		Present.set(Entry.index());
Deleted.reset(Entry.index());		Deleted.reset(Entry.index());

grow();		grow(Traits);

assert((find_as(K)) != end());		assert((find_as(K, Traits)) != end());
return true;		return true;
}		}

static uint32_t maxLoad(uint32_t capacity) { return capacity * 2 / 3 + 1; }		static uint32_t maxLoad(uint32_t capacity) { return capacity * 2 / 3 + 1; }

void grow() {		template <typename TraitsT>
		void grow(TraitsT &Traits) {
uint32_t S = size();		uint32_t S = size();
uint32_t MaxLoad = maxLoad(capacity());		uint32_t MaxLoad = maxLoad(capacity());
if (S < maxLoad(capacity()))		if (S < maxLoad(capacity()))
return;		return;
assert(capacity() != UINT32_MAX && "Can't grow Hash table!");		assert(capacity() != UINT32_MAX && "Can't grow Hash table!");

uint32_t NewCapacity = (capacity() <= INT32_MAX) ? MaxLoad * 2 : UINT32_MAX;		uint32_t NewCapacity = (capacity() <= INT32_MAX) ? MaxLoad * 2 : UINT32_MAX;

// Growing requires rebuilding the table and re-hashing every item. Make a		// Growing requires rebuilding the table and re-hashing every item. Make a
// copy with a larger capacity, insert everything into the copy, then swap		// copy with a larger capacity, insert everything into the copy, then swap
// it in.		// it in.
HashTable NewMap(NewCapacity, Traits);		HashTable NewMap(NewCapacity);
for (auto I : Present) {		for (auto I : Present) {
auto LookupKey = Traits.storageKeyToLookupKey(Buckets[I].first);		auto LookupKey = Traits.storageKeyToLookupKey(Buckets[I].first);
NewMap.set_as_internal(LookupKey, Buckets[I].second, Buckets[I].first);		NewMap.set_as_internal(LookupKey, Buckets[I].second, Traits,
		Buckets[I].first);
}		}

Buckets.swap(NewMap.Buckets);		Buckets.swap(NewMap.Buckets);
std::swap(Present, NewMap.Present);		std::swap(Present, NewMap.Present);
std::swap(Deleted, NewMap.Deleted);		std::swap(Deleted, NewMap.Deleted);
assert(capacity() == NewCapacity);		assert(capacity() == NewCapacity);
assert(size() == S);		assert(size() == S);
}		}
};		};

} // end namespace pdb		} // end namespace pdb

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_DEBUGINFO_PDB_NATIVE_HASHTABLE_H		#endif // LLVM_DEBUGINFO_PDB_NATIVE_HASHTABLE_H

llvm/include/llvm/DebugInfo/PDB/Native/NamedStreamMap.h

Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	public:
uint32_t hashString(uint32_t Offset) const;		uint32_t hashString(uint32_t Offset) const;

StringMap<uint32_t> entries() const;		StringMap<uint32_t> entries() const;

private:		private:
NamedStreamMapTraits HashTraits;		NamedStreamMapTraits HashTraits;
/// Closed hash table from Offset -> StreamNumber, where Offset is the offset		/// Closed hash table from Offset -> StreamNumber, where Offset is the offset
/// of the stream name in NamesBuffer.		/// of the stream name in NamesBuffer.
HashTable<support::ulittle32_t, NamedStreamMapTraits> OffsetIndexMap;		HashTable<support::ulittle32_t> OffsetIndexMap;

/// Buffer of string data.		/// Buffer of string data.
std::vector<char> NamesBuffer;		std::vector<char> NamesBuffer;
};		};

} // end namespace pdb		} // end namespace pdb

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_DEBUGINFO_PDB_NATIVE_NAMEDSTREAMMAP_H		#endif // LLVM_DEBUGINFO_PDB_NATIVE_NAMEDSTREAMMAP_H

llvm/include/llvm/DebugInfo/PDB/Native/PDBFileBuilder.h

Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	private:
std::unique_ptr<InfoStreamBuilder> Info;		std::unique_ptr<InfoStreamBuilder> Info;
std::unique_ptr<DbiStreamBuilder> Dbi;		std::unique_ptr<DbiStreamBuilder> Dbi;
std::unique_ptr<GSIStreamBuilder> Gsi;		std::unique_ptr<GSIStreamBuilder> Gsi;
std::unique_ptr<TpiStreamBuilder> Tpi;		std::unique_ptr<TpiStreamBuilder> Tpi;
std::unique_ptr<TpiStreamBuilder> Ipi;		std::unique_ptr<TpiStreamBuilder> Ipi;

PDBStringTableBuilder Strings;		PDBStringTableBuilder Strings;
StringTableHashTraits InjectedSourceHashTraits;		StringTableHashTraits InjectedSourceHashTraits;
HashTable<SrcHeaderBlockEntry, StringTableHashTraits> InjectedSourceTable;		HashTable<SrcHeaderBlockEntry> InjectedSourceTable;

SmallVector<InjectedSourceDescriptor, 2> InjectedSources;		SmallVector<InjectedSourceDescriptor, 2> InjectedSources;

NamedStreamMap NamedStreams;		NamedStreamMap NamedStreams;
DenseMap<uint32_t, std::string> NamedStreamData;		DenseMap<uint32_t, std::string> NamedStreamData;
};		};
}		}
}		}

#endif		#endif

llvm/lib/DebugInfo/PDB/Native/NamedStreamMap.cpp

Show All 40 Lines
StringRef NamedStreamMapTraits::storageKeyToLookupKey(uint32_t Offset) const {		StringRef NamedStreamMapTraits::storageKeyToLookupKey(uint32_t Offset) const {
return NS->getString(Offset);		return NS->getString(Offset);
}		}

uint32_t NamedStreamMapTraits::lookupKeyToStorageKey(StringRef S) {		uint32_t NamedStreamMapTraits::lookupKeyToStorageKey(StringRef S) {
return NS->appendStringData(S);		return NS->appendStringData(S);
}		}

NamedStreamMap::NamedStreamMap()		NamedStreamMap::NamedStreamMap() : HashTraits(*this), OffsetIndexMap(1) {}
: HashTraits(*this), OffsetIndexMap(1, HashTraits) {}

Error NamedStreamMap::load(BinaryStreamReader &Stream) {		Error NamedStreamMap::load(BinaryStreamReader &Stream) {
uint32_t StringBufferSize;		uint32_t StringBufferSize;
if (auto EC = Stream.readInteger(StringBufferSize))		if (auto EC = Stream.readInteger(StringBufferSize))
return joinErrors(std::move(EC),		return joinErrors(std::move(EC),
make_error<RawError>(raw_error_code::corrupt_file,		make_error<RawError>(raw_error_code::corrupt_file,
"Expected string buffer size"));		"Expected string buffer size"));

Show All 35 Lines	StringRef NamedStreamMap::getString(uint32_t Offset) const {
return StringRef(NamesBuffer.data() + Offset);		return StringRef(NamesBuffer.data() + Offset);
}		}

uint32_t NamedStreamMap::hashString(uint32_t Offset) const {		uint32_t NamedStreamMap::hashString(uint32_t Offset) const {
return hashStringV1(getString(Offset));		return hashStringV1(getString(Offset));
}		}

bool NamedStreamMap::get(StringRef Stream, uint32_t &StreamNo) const {		bool NamedStreamMap::get(StringRef Stream, uint32_t &StreamNo) const {
auto Iter = OffsetIndexMap.find_as(Stream);		auto Iter = OffsetIndexMap.find_as(Stream, HashTraits);
if (Iter == OffsetIndexMap.end())		if (Iter == OffsetIndexMap.end())
return false;		return false;
StreamNo = (*Iter).second;		StreamNo = (*Iter).second;
return true;		return true;
}		}

StringMap<uint32_t> NamedStreamMap::entries() const {		StringMap<uint32_t> NamedStreamMap::entries() const {
StringMap<uint32_t> Result;		StringMap<uint32_t> Result;
for (const auto &Entry : OffsetIndexMap) {		for (const auto &Entry : OffsetIndexMap) {
StringRef Stream(NamesBuffer.data() + Entry.first);		StringRef Stream(NamesBuffer.data() + Entry.first);
Result.try_emplace(Stream, Entry.second);		Result.try_emplace(Stream, Entry.second);
}		}
return Result;		return Result;
}		}

uint32_t NamedStreamMap::appendStringData(StringRef S) {		uint32_t NamedStreamMap::appendStringData(StringRef S) {
uint32_t Offset = NamesBuffer.size();		uint32_t Offset = NamesBuffer.size();
NamesBuffer.insert(NamesBuffer.end(), S.begin(), S.end());		NamesBuffer.insert(NamesBuffer.end(), S.begin(), S.end());
NamesBuffer.push_back('\0');		NamesBuffer.push_back('\0');
return Offset;		return Offset;
}		}

void NamedStreamMap::set(StringRef Stream, uint32_t StreamNo) {		void NamedStreamMap::set(StringRef Stream, uint32_t StreamNo) {
OffsetIndexMap.set_as(Stream, support::ulittle32_t(StreamNo));		OffsetIndexMap.set_as(Stream, support::ulittle32_t(StreamNo), HashTraits);
}		}

llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp

Show All 28 Lines
using namespace llvm;		using namespace llvm;
using namespace llvm::codeview;		using namespace llvm::codeview;
using namespace llvm::msf;		using namespace llvm::msf;
using namespace llvm::pdb;		using namespace llvm::pdb;
using namespace llvm::support;		using namespace llvm::support;

PDBFileBuilder::PDBFileBuilder(BumpPtrAllocator &Allocator)		PDBFileBuilder::PDBFileBuilder(BumpPtrAllocator &Allocator)
: Allocator(Allocator), InjectedSourceHashTraits(Strings),		: Allocator(Allocator), InjectedSourceHashTraits(Strings),
InjectedSourceTable(2, InjectedSourceHashTraits) {}		InjectedSourceTable(2) {}

PDBFileBuilder::~PDBFileBuilder() {}		PDBFileBuilder::~PDBFileBuilder() {}

Error PDBFileBuilder::initialize(uint32_t BlockSize) {		Error PDBFileBuilder::initialize(uint32_t BlockSize) {
auto ExpectedMsf = MSFBuilder::create(Allocator, BlockSize);		auto ExpectedMsf = MSFBuilder::create(Allocator, BlockSize);
if (!ExpectedMsf)		if (!ExpectedMsf)
return ExpectedMsf.takeError();		return ExpectedMsf.takeError();
Msf = llvm::make_unique<MSFBuilder>(std::move(*ExpectedMsf));		Msf = llvm::make_unique<MSFBuilder>(std::move(*ExpectedMsf));
▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	for (const auto &IS : InjectedSources) {
Entry.FileNI = IS.NameIndex;		Entry.FileNI = IS.NameIndex;
Entry.VFileNI = IS.VNameIndex;		Entry.VFileNI = IS.VNameIndex;
Entry.ObjNI = 1;		Entry.ObjNI = 1;
Entry.IsVirtual = 0;		Entry.IsVirtual = 0;
Entry.Version =		Entry.Version =
static_cast<uint32_t>(PdbRaw_SrcHeaderBlockVer::SrcVerOne);		static_cast<uint32_t>(PdbRaw_SrcHeaderBlockVer::SrcVerOne);
Entry.CRC = CRC.getCRC();		Entry.CRC = CRC.getCRC();
StringRef VName = getStringTableBuilder().getStringForId(IS.VNameIndex);		StringRef VName = getStringTableBuilder().getStringForId(IS.VNameIndex);
InjectedSourceTable.set_as(VName, std::move(Entry));		InjectedSourceTable.set_as(VName, std::move(Entry),
		InjectedSourceHashTraits);
}		}

uint32_t SrcHeaderBlockSize =		uint32_t SrcHeaderBlockSize =
sizeof(SrcHeaderBlockHeader) +		sizeof(SrcHeaderBlockHeader) +
InjectedSourceTable.calculateSerializedLength();		InjectedSourceTable.calculateSerializedLength();
SN = allocateNamedStream("/src/headerblock", SrcHeaderBlockSize);		SN = allocateNamedStream("/src/headerblock", SrcHeaderBlockSize);
if (!SN)		if (!SN)
return SN.takeError();		return SN.takeError();
▲ Show 20 Lines • Show All 157 Lines • Show Last 20 Lines

llvm/unittests/DebugInfo/PDB/HashTableTest.cpp

Show All 21 Lines
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
using namespace llvm::pdb;		using namespace llvm::pdb;
using namespace llvm::support;		using namespace llvm::support;

namespace {		namespace {

class HashTableInternals : public HashTable<uint32_t> {		struct IdentityHashTraits {
		uint32_t hashLookupKey(uint32_t N) const { return N; }
		uint32_t storageKeyToLookupKey(uint32_t N) const { return N; }
		uint32_t lookupKeyToStorageKey(uint32_t N) { return N; }
		};

		template <class T = uint32_t>
		class HashTableInternals : public HashTable<T> {
public:		public:
using HashTable::Buckets;		using HashTable<T>::Buckets;
using HashTable::Present;		using HashTable<T>::Present;
using HashTable::Deleted;		using HashTable<T>::Deleted;
};		};
}		}

TEST(HashTableTest, TestSimple) {		TEST(HashTableTest, TestSimple) {
HashTableInternals Table;		HashTableInternals<> Table;
EXPECT_EQ(0u, Table.size());		EXPECT_EQ(0u, Table.size());
EXPECT_GT(Table.capacity(), 0u);		EXPECT_GT(Table.capacity(), 0u);

Table.set_as(3u, 7);		IdentityHashTraits Traits;
		Table.set_as(3u, 7, Traits);
EXPECT_EQ(1u, Table.size());		EXPECT_EQ(1u, Table.size());
ASSERT_NE(Table.end(), Table.find_as(3u));		ASSERT_NE(Table.end(), Table.find_as(3u, Traits));
EXPECT_EQ(7u, Table.get(3u));		EXPECT_EQ(7u, Table.get(3u, Traits));
}		}

TEST(HashTableTest, TestCollision) {		TEST(HashTableTest, TestCollision) {
HashTableInternals Table;		HashTableInternals<> Table;
EXPECT_EQ(0u, Table.size());		EXPECT_EQ(0u, Table.size());
EXPECT_GT(Table.capacity(), 0u);		EXPECT_GT(Table.capacity(), 0u);

// We use knowledge of the hash table's implementation details to make sure		// We use knowledge of the hash table's implementation details to make sure
// to add another value that is the equivalent to the first value modulo the		// to add another value that is the equivalent to the first value modulo the
// hash table's capacity.		// hash table's capacity.
uint32_t N1 = Table.capacity() + 1;		uint32_t N1 = Table.capacity() + 1;
uint32_t N2 = 2 * N1;		uint32_t N2 = 2 * N1;

Table.set_as(N1, 7);		IdentityHashTraits Traits;
Table.set_as(N2, 12);		Table.set_as(N1, 7, Traits);
		Table.set_as(N2, 12, Traits);
EXPECT_EQ(2u, Table.size());		EXPECT_EQ(2u, Table.size());
ASSERT_NE(Table.end(), Table.find_as(N1));		ASSERT_NE(Table.end(), Table.find_as(N1, Traits));
ASSERT_NE(Table.end(), Table.find_as(N2));		ASSERT_NE(Table.end(), Table.find_as(N2, Traits));

EXPECT_EQ(7u, Table.get(N1));		EXPECT_EQ(7u, Table.get(N1, Traits));
EXPECT_EQ(12u, Table.get(N2));		EXPECT_EQ(12u, Table.get(N2, Traits));
}		}

TEST(HashTableTest, TestRemove) {		TEST(HashTableTest, TestRemove) {
HashTableInternals Table;		HashTableInternals<> Table;
EXPECT_EQ(0u, Table.size());		EXPECT_EQ(0u, Table.size());
EXPECT_GT(Table.capacity(), 0u);		EXPECT_GT(Table.capacity(), 0u);

Table.set_as(1u, 2);		IdentityHashTraits Traits;
Table.set_as(3u, 4);		Table.set_as(1u, 2, Traits);
		Table.set_as(3u, 4, Traits);
EXPECT_EQ(2u, Table.size());		EXPECT_EQ(2u, Table.size());
ASSERT_NE(Table.end(), Table.find_as(1u));		ASSERT_NE(Table.end(), Table.find_as(1u, Traits));
ASSERT_NE(Table.end(), Table.find_as(3u));		ASSERT_NE(Table.end(), Table.find_as(3u, Traits));

EXPECT_EQ(2u, Table.get(1u));		EXPECT_EQ(2u, Table.get(1u, Traits));
EXPECT_EQ(4u, Table.get(3u));		EXPECT_EQ(4u, Table.get(3u, Traits));
}		}

TEST(HashTableTest, TestCollisionAfterMultipleProbes) {		TEST(HashTableTest, TestCollisionAfterMultipleProbes) {
HashTableInternals Table;		HashTableInternals<> Table;
EXPECT_EQ(0u, Table.size());		EXPECT_EQ(0u, Table.size());
EXPECT_GT(Table.capacity(), 0u);		EXPECT_GT(Table.capacity(), 0u);

// Probing looks for the first available slot. A slot may already be filled		// Probing looks for the first available slot. A slot may already be filled
// as a result of an item with a different hash value already being there.		// as a result of an item with a different hash value already being there.
// Test that when this happens, the probe still finds the value.		// Test that when this happens, the probe still finds the value.
uint32_t N1 = Table.capacity() + 1;		uint32_t N1 = Table.capacity() + 1;
uint32_t N2 = N1 + 1;		uint32_t N2 = N1 + 1;
uint32_t N3 = 2 * N1;		uint32_t N3 = 2 * N1;

Table.set_as(N1, 7);		IdentityHashTraits Traits;
Table.set_as(N2, 11);		Table.set_as(N1, 7, Traits);
Table.set_as(N3, 13);		Table.set_as(N2, 11, Traits);
		Table.set_as(N3, 13, Traits);
EXPECT_EQ(3u, Table.size());		EXPECT_EQ(3u, Table.size());
ASSERT_NE(Table.end(), Table.find_as(N1));		ASSERT_NE(Table.end(), Table.find_as(N1, Traits));
ASSERT_NE(Table.end(), Table.find_as(N2));		ASSERT_NE(Table.end(), Table.find_as(N2, Traits));
ASSERT_NE(Table.end(), Table.find_as(N3));		ASSERT_NE(Table.end(), Table.find_as(N3, Traits));

EXPECT_EQ(7u, Table.get(N1));		EXPECT_EQ(7u, Table.get(N1, Traits));
EXPECT_EQ(11u, Table.get(N2));		EXPECT_EQ(11u, Table.get(N2, Traits));
EXPECT_EQ(13u, Table.get(N3));		EXPECT_EQ(13u, Table.get(N3, Traits));
}		}

TEST(HashTableTest, Grow) {		TEST(HashTableTest, Grow) {
// So that we are independent of the load factor, `capacity` items, which is		// So that we are independent of the load factor, `capacity` items, which is
// guaranteed to trigger a grow. Then verify that the size is the same, the		// guaranteed to trigger a grow. Then verify that the size is the same, the
// capacity is larger, and all the original items are still in the table.		// capacity is larger, and all the original items are still in the table.

HashTableInternals Table;		HashTableInternals<> Table;
		IdentityHashTraits Traits;
uint32_t OldCapacity = Table.capacity();		uint32_t OldCapacity = Table.capacity();
for (uint32_t I = 0; I < OldCapacity; ++I) {		for (uint32_t I = 0; I < OldCapacity; ++I) {
Table.set_as(OldCapacity + I * 2 + 1, I * 2 + 3);		Table.set_as(OldCapacity + I * 2 + 1, I * 2 + 3, Traits);
}		}
EXPECT_EQ(OldCapacity, Table.size());		EXPECT_EQ(OldCapacity, Table.size());
EXPECT_GT(Table.capacity(), OldCapacity);		EXPECT_GT(Table.capacity(), OldCapacity);
for (uint32_t I = 0; I < OldCapacity; ++I) {		for (uint32_t I = 0; I < OldCapacity; ++I) {
ASSERT_NE(Table.end(), Table.find_as(OldCapacity + I * 2 + 1));		ASSERT_NE(Table.end(), Table.find_as(OldCapacity + I * 2 + 1, Traits));
EXPECT_EQ(I * 2 + 3, Table.get(OldCapacity + I * 2 + 1));		EXPECT_EQ(I * 2 + 3, Table.get(OldCapacity + I * 2 + 1, Traits));
}		}
}		}

TEST(HashTableTest, Serialization) {		TEST(HashTableTest, Serialization) {
HashTableInternals Table;		HashTableInternals<> Table;
		IdentityHashTraits Traits;
uint32_t Cap = Table.capacity();		uint32_t Cap = Table.capacity();
for (uint32_t I = 0; I < Cap; ++I) {		for (uint32_t I = 0; I < Cap; ++I) {
Table.set_as(Cap + I * 2 + 1, I * 2 + 3);		Table.set_as(Cap + I * 2 + 1, I * 2 + 3, Traits);
}		}

std::vector<uint8_t> Buffer(Table.calculateSerializedLength());		std::vector<uint8_t> Buffer(Table.calculateSerializedLength());
MutableBinaryByteStream Stream(Buffer, little);		MutableBinaryByteStream Stream(Buffer, little);
BinaryStreamWriter Writer(Stream);		BinaryStreamWriter Writer(Stream);
EXPECT_THAT_ERROR(Table.commit(Writer), Succeeded());		EXPECT_THAT_ERROR(Table.commit(Writer), Succeeded());
// We should have written precisely the number of bytes we calculated earlier.		// We should have written precisely the number of bytes we calculated earlier.
EXPECT_EQ(Buffer.size(), Writer.getOffset());		EXPECT_EQ(Buffer.size(), Writer.getOffset());

HashTableInternals Table2;		HashTableInternals<> Table2;
BinaryStreamReader Reader(Stream);		BinaryStreamReader Reader(Stream);
EXPECT_THAT_ERROR(Table2.load(Reader), Succeeded());		EXPECT_THAT_ERROR(Table2.load(Reader), Succeeded());
// We should have read precisely the number of bytes we calculated earlier.		// We should have read precisely the number of bytes we calculated earlier.
EXPECT_EQ(Buffer.size(), Reader.getOffset());		EXPECT_EQ(Buffer.size(), Reader.getOffset());

EXPECT_EQ(Table.size(), Table2.size());		EXPECT_EQ(Table.size(), Table2.size());
EXPECT_EQ(Table.capacity(), Table2.capacity());		EXPECT_EQ(Table.capacity(), Table2.capacity());
EXPECT_EQ(Table.Buckets, Table2.Buckets);		EXPECT_EQ(Table.Buckets, Table2.Buckets);
Show All 36 Lines	do {
EXPECT_TRUE(NSM.get("Six", N));		EXPECT_TRUE(NSM.get("Six", N));
EXPECT_EQ(6U, N);		EXPECT_EQ(6U, N);

EXPECT_TRUE(NSM.get("Seven", N));		EXPECT_TRUE(NSM.get("Seven", N));
EXPECT_EQ(7U, N);		EXPECT_EQ(7U, N);
} while (std::next_permutation(Streams.begin(), Streams.end()));		} while (std::next_permutation(Streams.begin(), Streams.end()));
}		}

namespace {
struct FooBar {		struct FooBar {
uint32_t X;		uint32_t X;
uint32_t Y;		uint32_t Y;
};

} // namespace		bool operator==(const FooBar &RHS) const {
		return X == RHS.X && Y == RHS.Y;
		}
		};

namespace llvm {		struct FooBarHashTraits {
namespace pdb {
template <> struct PdbHashTraits<FooBar> {
std::vector<char> Buffer;		std::vector<char> Buffer;

PdbHashTraits() { Buffer.push_back(0); }		FooBarHashTraits() { Buffer.push_back(0); }

uint32_t hashLookupKey(StringRef S) const {		uint32_t hashLookupKey(StringRef S) const {
return llvm::pdb::hashStringV1(S);		return llvm::pdb::hashStringV1(S);
}		}

StringRef storageKeyToLookupKey(uint32_t N) const {		StringRef storageKeyToLookupKey(uint32_t N) const {
if (N >= Buffer.size())		if (N >= Buffer.size())
return StringRef();		return StringRef();

return StringRef(Buffer.data() + N);		return StringRef(Buffer.data() + N);
}		}

uint32_t lookupKeyToStorageKey(StringRef S) {		uint32_t lookupKeyToStorageKey(StringRef S) {
uint32_t N = Buffer.size();		uint32_t N = Buffer.size();
Buffer.insert(Buffer.end(), S.begin(), S.end());		Buffer.insert(Buffer.end(), S.begin(), S.end());
Buffer.push_back('\0');		Buffer.push_back('\0');
return N;		return N;
}		}
};		};
} // namespace pdb
} // namespace llvm

TEST(HashTableTest, NonTrivialValueType) {		TEST(HashTableTest, NonTrivialValueType) {
HashTable<FooBar> Table;		HashTableInternals<FooBar> Table;
		FooBarHashTraits Traits;
uint32_t Cap = Table.capacity();		uint32_t Cap = Table.capacity();
for (uint32_t I = 0; I < Cap; ++I) {		for (uint32_t I = 0; I < Cap; ++I) {
FooBar F;		FooBar F;
F.X = I;		F.X = I;
F.Y = I + 1;		F.Y = I + 1;
Table.set_as(utostr(I), F);		Table.set_as(utostr(I), F, Traits);
}		}

std::vector<uint8_t> Buffer(Table.calculateSerializedLength());		std::vector<uint8_t> Buffer(Table.calculateSerializedLength());
MutableBinaryByteStream Stream(Buffer, little);		MutableBinaryByteStream Stream(Buffer, little);
BinaryStreamWriter Writer(Stream);		BinaryStreamWriter Writer(Stream);
EXPECT_THAT_ERROR(Table.commit(Writer), Succeeded());		EXPECT_THAT_ERROR(Table.commit(Writer), Succeeded());
// We should have written precisely the number of bytes we calculated earlier.		// We should have written precisely the number of bytes we calculated earlier.
EXPECT_EQ(Buffer.size(), Writer.getOffset());		EXPECT_EQ(Buffer.size(), Writer.getOffset());

HashTable<FooBar> Table2;		HashTableInternals<FooBar> Table2;
BinaryStreamReader Reader(Stream);		BinaryStreamReader Reader(Stream);
EXPECT_THAT_ERROR(Table2.load(Reader), Succeeded());		EXPECT_THAT_ERROR(Table2.load(Reader), Succeeded());
// We should have read precisely the number of bytes we calculated earlier.		// We should have read precisely the number of bytes we calculated earlier.
EXPECT_EQ(Buffer.size(), Reader.getOffset());		EXPECT_EQ(Buffer.size(), Reader.getOffset());

EXPECT_EQ(Table.size(), Table2.size());		EXPECT_EQ(Table.size(), Table2.size());
EXPECT_EQ(Table.capacity(), Table2.capacity());		EXPECT_EQ(Table.capacity(), Table2.capacity());
// EXPECT_EQ(Table.Buckets, Table2.Buckets);		EXPECT_EQ(Table.Buckets, Table2.Buckets);
// EXPECT_EQ(Table.Present, Table2.Present);		EXPECT_EQ(Table.Present, Table2.Present);
// EXPECT_EQ(Table.Deleted, Table2.Deleted);		EXPECT_EQ(Table.Deleted, Table2.Deleted);
}		}

This is an archive of the discontinued LLVM Phabricator instance.

PDB HashTable: Move TraitsT from class parameter to the methods that need itClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 209492

llvm/include/llvm/DebugInfo/PDB/Native/HashTable.h

llvm/include/llvm/DebugInfo/PDB/Native/NamedStreamMap.h

llvm/include/llvm/DebugInfo/PDB/Native/PDBFileBuilder.h

llvm/lib/DebugInfo/PDB/Native/NamedStreamMap.cpp

llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp

llvm/unittests/DebugInfo/PDB/HashTableTest.cpp

PDB HashTable: Move TraitsT from class parameter to the methods that need it
ClosedPublic