This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
include/llvm/DebugInfo/CodeView/
-
llvm/
-
DebugInfo/
-
CodeView/
-
TypeSerializer.h
-
TypeTableBuilder.h
-
TypeTableCollection.h
-
lib/DebugInfo/CodeView/
-
DebugInfo/
-
CodeView/
1
TypeSerializer.cpp
-
TypeTableCollection.cpp
-
tools/llvm-pdbdump/
-
llvm-pdbdump/
-
llvm-pdbdump.cpp

Differential D33428

[PDB] Hash types up front when merging types instead of using StringMap
ClosedPublic

Authored by rnk on May 22 2017, 6:33 PM.

Download Raw Diff

Details

Reviewers

zturner
inglorion
ruiu

Commits

rGded38803c5c5: [PDB] Hash types up front when merging types instead of using StringMap
rL303665: [PDB] Hash types up front when merging types instead of using StringMap

Summary

First, StringMap uses llvm::HashString, which is only good for short
identifiers and really bad for large blobs of binary data like type
records. Moving to DenseMap<StringRef, TypeIndex> with some tricks for
memory allocation fixes that.

Unfortunately, that didn't buy very much performance. Profiling showed
that we spend a long time during DenseMap growth rehashing existing
entries. Also, in general, DenseMap is faster when the keys are small.
This change takes that to the logical conclusion by introducing a small
wrapper value type around a pointer to key data. The key data contains a
precomputed hash, the original record data (pointer and size), and the
type index, which is the "value" of our original map.

This reduces the time to produce llvm-as.exe and llvm-as.pdb from ~15s
on my machine to 3.5s, which is about a 4x improvement.

Diff Detail

Repository: rL LLVM

Event Timeline

rnk created this revision.May 22 2017, 6:33 PM

zturner added inline comments.May 22 2017, 8:04 PM

llvm/lib/DebugInfo/CodeView/TypeSerializer.cpp
22 ↗	(On Diff #99840)	`hash_code`?
23–24 ↗	(On Diff #99840)	`ArrayRef<uint8_t>` or `StringRef`?
25 ↗	(On Diff #99840)	`TypeIndex`?
53–54 ↗	(On Diff #99840)	I haven't looked at the implementation of `DenseMap`, but is this check necessary? I would imagine you could assert that the hashes are different, otherwise why would `DenseMap` be calling this function?
59–60 ↗	(On Diff #99840)	Is this an important consideration? Seems like it sacrifices readability
98–99 ↗	(On Diff #99840)	Why not `xxHash64`? `hash_value` appears to operate on one char at a time.

rnk added a subscriber: chandlerc.May 23 2017, 11:19 AM

rnk added inline comments.

llvm/lib/DebugInfo/CodeView/TypeSerializer.cpp
23–24 ↗	(On Diff #99840)	This saves 4 bytes. We know type records are always shorter than 0xFF00 bytes, so it's safe to go all the way to uint16_t as the comment suggests.
25 ↗	(On Diff #99840)	Yeah, we can do that. I kind of like `unsigned` or `uint32_t` because it doesn't depend on llvm::support::ulittle32_t, which uses all this memcpy craziness.
53–54 ↗	(On Diff #99840)	DenseMap takes the low bits of the hash and uses them to index into its table, so there can be hash collisions. So, if the table has 256 entries and two records hash to 0x100 and 0x200, this comparison will save us from doing a full memcmp.
59–60 ↗	(On Diff #99840)	I really wanted HashedType and HashedTypePtr to be in anonymous namespaces, and this is what I had to do to accomplish that. They really aren't general purpose types that should float around in public headers.
98–99 ↗	(On Diff #99840)	@chandlerc said we'll probably have to delete xxHash64, so I didn't want to use it. hash_value actually hashes 64 bytes at a time if it can. I'm pretty confident we're hitting that overload, but I could do more to check. It also worried me.

zturner accepted this revision.May 23 2017, 11:22 AM

This revision is now accepted and ready to land.May 23 2017, 11:22 AM

Closed by commit rL303665: [PDB] Hash types up front when merging types instead of using StringMap (authored by rnk). · Explain WhyMay 23 2017, 11:24 AM

This revision was automatically updated to reflect the committed changes.

Drive by comment...

llvm/trunk/lib/DebugInfo/CodeView/TypeSerializer.cpp
40	This isn't suitably aligned. You should use the DenseMapInfo for some pointer type instead.

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

DebugInfo/

CodeView/

TypeSerializer.h

23 lines

TypeTableBuilder.h

6 lines

TypeTableCollection.h

4 lines

lib/

DebugInfo/

CodeView/

TypeSerializer.cpp

183 lines

TypeTableCollection.cpp

3 lines

tools/

llvm-pdbdump/

llvm-pdbdump.cpp

14 lines

Diff 99956

llvm/trunk/include/llvm/DebugInfo/CodeView/TypeSerializer.h

	Show All 11 Lines

	#include "llvm/DebugInfo/CodeView/TypeRecordMapping.h"			#include "llvm/DebugInfo/CodeView/TypeRecordMapping.h"
	#include "llvm/DebugInfo/CodeView/TypeVisitorCallbacks.h"			#include "llvm/DebugInfo/CodeView/TypeVisitorCallbacks.h"
	#include "llvm/Support/BinaryByteStream.h"			#include "llvm/Support/BinaryByteStream.h"
	#include "llvm/Support/BinaryStreamWriter.h"			#include "llvm/Support/BinaryStreamWriter.h"

	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/StringMap.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/Support/Allocator.h"			#include "llvm/Support/Allocator.h"
	#include "llvm/Support/Error.h"			#include "llvm/Support/Error.h"

	namespace llvm {			namespace llvm {

	namespace codeview {			namespace codeview {

				class TypeHasher;

	class TypeSerializer : public TypeVisitorCallbacks {			class TypeSerializer : public TypeVisitorCallbacks {
	struct SubRecord {			struct SubRecord {
	SubRecord(TypeLeafKind K, uint32_t S) : Kind(K), Size(S) {}			SubRecord(TypeLeafKind K, uint32_t S) : Kind(K), Size(S) {}

	TypeLeafKind Kind;			TypeLeafKind Kind;
	uint32_t Size = 0;			uint32_t Size = 0;
	};			};
	struct RecordSegment {			struct RecordSegment {
	SmallVector<SubRecord, 16> SubRecords;			SmallVector<SubRecord, 16> SubRecords;

	uint32_t length() const {			uint32_t length() const {
	uint32_t L = sizeof(RecordPrefix);			uint32_t L = sizeof(RecordPrefix);
	for (const auto &R : SubRecords) {			for (const auto &R : SubRecords) {
	L += R.Size;			L += R.Size;
	}			}
	return L;			return L;
	}			}
	};			};

	typedef SmallVector<MutableArrayRef<uint8_t>, 2> RecordList;			typedef SmallVector<MutableArrayRef<uint8_t>, 2> MutableRecordList;

	static constexpr uint8_t ContinuationLength = 8;			static constexpr uint8_t ContinuationLength = 8;
	BumpPtrAllocator &RecordStorage;			BumpPtrAllocator &RecordStorage;
	RecordSegment CurrentSegment;			RecordSegment CurrentSegment;
	RecordList FieldListSegments;			MutableRecordList FieldListSegments;

	TypeIndex LastTypeIndex;
	Optional<TypeLeafKind> TypeKind;			Optional<TypeLeafKind> TypeKind;
	Optional<TypeLeafKind> MemberKind;			Optional<TypeLeafKind> MemberKind;
	std::vector<uint8_t> RecordBuffer;			std::vector<uint8_t> RecordBuffer;
	MutableBinaryByteStream Stream;			MutableBinaryByteStream Stream;
	BinaryStreamWriter Writer;			BinaryStreamWriter Writer;
	TypeRecordMapping Mapping;			TypeRecordMapping Mapping;

	RecordList SeenRecords;			/// Private type record hashing implementation details are handled here.
	StringMap<TypeIndex> HashedRecords;			std::unique_ptr<TypeHasher> Hasher;

	bool isInFieldList() const;			bool isInFieldList() const;
	TypeIndex calcNextTypeIndex() const;
	TypeIndex incrementTypeIndex();
	MutableArrayRef<uint8_t> getCurrentSubRecordData();			MutableArrayRef<uint8_t> getCurrentSubRecordData();
	MutableArrayRef<uint8_t> getCurrentRecordData();			MutableArrayRef<uint8_t> getCurrentRecordData();
	Error writeRecordPrefix(TypeLeafKind Kind);			Error writeRecordPrefix(TypeLeafKind Kind);
	TypeIndex insertRecordBytesPrivate(MutableArrayRef<uint8_t> Record);
	TypeIndex insertRecordBytesWithCopy(CVType &Record,
	MutableArrayRef<uint8_t> Data);

	Expected<MutableArrayRef<uint8_t>>			Expected<MutableArrayRef<uint8_t>>
	addPadding(MutableArrayRef<uint8_t> Record);			addPadding(MutableArrayRef<uint8_t> Record);

	public:			public:
	explicit TypeSerializer(BumpPtrAllocator &Storage);			explicit TypeSerializer(BumpPtrAllocator &Storage);
				~TypeSerializer();

	ArrayRef<MutableArrayRef<uint8_t>> records() const;			ArrayRef<ArrayRef<uint8_t>> records() const;
	TypeIndex getLastTypeIndex() const;			TypeIndex insertRecordBytes(ArrayRef<uint8_t> Record);
	TypeIndex insertRecordBytes(MutableArrayRef<uint8_t> Record);
	Expected<TypeIndex> visitTypeEndGetIndex(CVType &Record);			Expected<TypeIndex> visitTypeEndGetIndex(CVType &Record);

	Error visitTypeBegin(CVType &Record) override;			Error visitTypeBegin(CVType &Record) override;
	Error visitTypeEnd(CVType &Record) override;			Error visitTypeEnd(CVType &Record) override;
	Error visitMemberBegin(CVMemberRecord &Record) override;			Error visitMemberBegin(CVMemberRecord &Record) override;
	Error visitMemberEnd(CVMemberRecord &Record) override;			Error visitMemberEnd(CVMemberRecord &Record) override;

	#define TYPE_RECORD(EnumName, EnumVal, Name) \			#define TYPE_RECORD(EnumName, EnumVal, Name) \
	▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/DebugInfo/CodeView/TypeTableBuilder.h

Show First 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	template <typename T> TypeIndex writeKnownType(T &Record) {

auto ExpectedIndex = Serializer.visitTypeEndGetIndex(Type);		auto ExpectedIndex = Serializer.visitTypeEndGetIndex(Type);
if (!ExpectedIndex)		if (!ExpectedIndex)
return handleError(ExpectedIndex.takeError());		return handleError(ExpectedIndex.takeError());

return *ExpectedIndex;		return *ExpectedIndex;
}		}

TypeIndex writeSerializedRecord(MutableArrayRef<uint8_t> Record) {		TypeIndex writeSerializedRecord(ArrayRef<uint8_t> Record) {
return Serializer.insertRecordBytes(Record);		return Serializer.insertRecordBytes(Record);
}		}

template <typename TFunc> void ForEachRecord(TFunc Func) {		template <typename TFunc> void ForEachRecord(TFunc Func) {
uint32_t Index = TypeIndex::FirstNonSimpleIndex;		uint32_t Index = TypeIndex::FirstNonSimpleIndex;

for (auto Record : Serializer.records()) {		for (auto Record : Serializer.records()) {
Func(TypeIndex(Index), Record);		Func(TypeIndex(Index), Record);
++Index;		++Index;
}		}
}		}

ArrayRef<MutableArrayRef<uint8_t>> records() const {		ArrayRef<ArrayRef<uint8_t>> records() const { return Serializer.records(); }
return Serializer.records();
}
};		};

class FieldListRecordBuilder {		class FieldListRecordBuilder {
TypeTableBuilder &TypeTable;		TypeTableBuilder &TypeTable;
TypeSerializer TempSerializer;		TypeSerializer TempSerializer;
CVType Type;		CVType Type;

public:		public:
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/DebugInfo/CodeView/TypeTableCollection.h

	Show All 12 Lines
	#include "llvm/DebugInfo/CodeView/TypeCollection.h"			#include "llvm/DebugInfo/CodeView/TypeCollection.h"
	#include "llvm/DebugInfo/CodeView/TypeDatabase.h"			#include "llvm/DebugInfo/CodeView/TypeDatabase.h"

	namespace llvm {			namespace llvm {
	namespace codeview {			namespace codeview {

	class TypeTableCollection : public TypeCollection {			class TypeTableCollection : public TypeCollection {
	public:			public:
	explicit TypeTableCollection(ArrayRef<MutableArrayRef<uint8_t>> Records);			explicit TypeTableCollection(ArrayRef<ArrayRef<uint8_t>> Records);

	Optional<TypeIndex> getFirst() override;			Optional<TypeIndex> getFirst() override;
	Optional<TypeIndex> getNext(TypeIndex Prev) override;			Optional<TypeIndex> getNext(TypeIndex Prev) override;

	CVType getType(TypeIndex Index) override;			CVType getType(TypeIndex Index) override;
	StringRef getTypeName(TypeIndex Index) override;			StringRef getTypeName(TypeIndex Index) override;
	bool contains(TypeIndex Index) override;			bool contains(TypeIndex Index) override;
	uint32_t size() override;			uint32_t size() override;
	uint32_t capacity() override;			uint32_t capacity() override;

	private:			private:
	bool hasCapacityFor(TypeIndex Index) const;			bool hasCapacityFor(TypeIndex Index) const;
	void ensureTypeExists(TypeIndex Index);			void ensureTypeExists(TypeIndex Index);

	ArrayRef<MutableArrayRef<uint8_t>> Records;			ArrayRef<ArrayRef<uint8_t>> Records;
	TypeDatabase Database;			TypeDatabase Database;
	};			};
	}			}
	}			}

	#endif			#endif

llvm/trunk/lib/DebugInfo/CodeView/TypeSerializer.cpp

//===- TypeSerialzier.cpp ---------------------------------------- C++ --===//		//===- TypeSerialzier.cpp ---------------------------------------- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/DebugInfo/CodeView/TypeSerializer.h"		#include "llvm/DebugInfo/CodeView/TypeSerializer.h"

		#include "llvm/ADT/DenseSet.h"
#include "llvm/Support/BinaryStreamWriter.h"		#include "llvm/Support/BinaryStreamWriter.h"

#include <string.h>		#include <string.h>

using namespace llvm;		using namespace llvm;
using namespace llvm::codeview;		using namespace llvm::codeview;

bool TypeSerializer::isInFieldList() const {		namespace {
return TypeKind.hasValue() && *TypeKind == TypeLeafKind::LF_FIELDLIST;		struct HashedType {
		uint64_t Hash;
		const uint8_t *Data;
		unsigned Size; // FIXME: Go to uint16_t?
		TypeIndex Index;
		};

		/// Wrapper around a poitner to a HashedType. Hash and equality operations are
		/// based on data in the pointee.
		struct HashedTypePtr {
		HashedTypePtr() = default;
		HashedTypePtr(HashedType *Ptr) : Ptr(Ptr) {}
		HashedType *Ptr = nullptr;
		};
		} // namespace

		template <> struct DenseMapInfo<HashedTypePtr> {
		static inline HashedTypePtr getEmptyKey() { return HashedTypePtr(nullptr); }
		static inline HashedTypePtr getTombstoneKey() {
		return HashedTypePtr(reinterpret_cast<HashedType *>(1));
		chandlercUnsubmitted Not Done Reply Inline Actions This isn't suitably aligned. You should use the DenseMapInfo for some pointer type instead. chandlerc: This isn't suitably aligned. You should use the DenseMapInfo for some pointer type instead.
		}
		static unsigned getHashValue(HashedTypePtr Val) {
		assert(Val.Ptr != getEmptyKey().Ptr && Val.Ptr != getTombstoneKey().Ptr);
		return Val.Ptr->Hash;
		}
		static bool isEqual(HashedTypePtr LHSP, HashedTypePtr RHSP) {
		HashedType *LHS = LHSP.Ptr;
		HashedType *RHS = RHSP.Ptr;
		if (RHS == getEmptyKey().Ptr \|\| RHS == getTombstoneKey().Ptr)
		return LHS == RHS;
		if (LHS->Hash != RHS->Hash \|\| LHS->Size != RHS->Size)
		return false;
		return ::memcmp(LHS->Data, RHS->Data, LHS->Size) == 0;
		}
		};

		/// Private implementation so that we don't leak our DenseMap instantiations to
		/// users.
		class llvm::codeview::TypeHasher {
		private:
		/// Storage for type record provided by the caller. Records will outlive the
		/// hasher object, so they should be allocated here.
		BumpPtrAllocator &RecordStorage;

		/// Storage for hash keys. These only need to live as long as the hashing
		/// operation.
		BumpPtrAllocator KeyStorage;

		/// Hash table. We really want a DenseMap<ArrayRef<uint8_t>, TypeIndex> here,
		/// but DenseMap is inefficient when the keys are long (like type records)
		/// because it recomputes the hash value of every key when it grows. This
		/// value type stores the hash out of line in KeyStorage, so that table
		/// entries are small and easy to rehash.
		DenseSet<HashedTypePtr> HashedRecords;

		SmallVector<ArrayRef<uint8_t>, 2> SeenRecords;

		TypeIndex NextTypeIndex = TypeIndex(TypeIndex::FirstNonSimpleIndex);

		public:
		TypeHasher(BumpPtrAllocator &RecordStorage) : RecordStorage(RecordStorage) {}

		ArrayRef<ArrayRef<uint8_t>> records() const { return SeenRecords; }

		/// Takes the bytes of type record, inserts them into the hash table, saves
		/// them, and returns a pointer to an identical stable type record along with
		/// its type index in the destination stream.
		TypeIndex getOrCreateRecord(ArrayRef<uint8_t> &Record);
		};

		TypeIndex TypeHasher::getOrCreateRecord(ArrayRef<uint8_t> &Record) {
		assert(Record.size() < UINT32_MAX && "Record too big");
		assert(Record.size() % 4 == 0 && "Record is not aligned to 4 bytes!");

		// Compute the hash up front so we can store it in the key.
		HashedType TempHashedType = {hash_value(Record), Record.data(),
		unsigned(Record.size()), NextTypeIndex};

		auto Result = HashedRecords.insert(HashedTypePtr(&TempHashedType));
		HashedType *&Hashed = Result.first->Ptr;

		if (Result.second) {
		// This was a new type record. We need stable storage for both the key and
		// the record. The record should outlive the hashing operation.
		Hashed = KeyStorage.Allocate<HashedType>();
		*Hashed = TempHashedType;

		uint8_t *Stable = RecordStorage.Allocate<uint8_t>(Record.size());
		memcpy(Stable, Record.data(), Record.size());
		Hashed->Data = Stable;
		assert(Hashed->Size == Record.size());

		// This was a new record, so increment our next type index.
		++NextTypeIndex;
		}

		// Update the caller's copy of Record to point a stable copy.
		Record = ArrayRef<uint8_t>(Hashed->Data, Hashed->Size);

		if (Result.second) {
		// FIXME: Can we record these in a more efficient way?
		SeenRecords.push_back(Record);
}		}

TypeIndex TypeSerializer::calcNextTypeIndex() const {		return TypeIndex(Hashed->Index);
if (LastTypeIndex.isNoneType())
return TypeIndex(TypeIndex::FirstNonSimpleIndex);
else
return TypeIndex(LastTypeIndex.getIndex() + 1);
}		}

TypeIndex TypeSerializer::incrementTypeIndex() {		bool TypeSerializer::isInFieldList() const {
TypeIndex Previous = LastTypeIndex;		return TypeKind.hasValue() && *TypeKind == TypeLeafKind::LF_FIELDLIST;
LastTypeIndex = calcNextTypeIndex();
return Previous;
}		}

MutableArrayRef<uint8_t> TypeSerializer::getCurrentSubRecordData() {		MutableArrayRef<uint8_t> TypeSerializer::getCurrentSubRecordData() {
assert(isInFieldList());		assert(isInFieldList());
return getCurrentRecordData().drop_front(CurrentSegment.length());		return getCurrentRecordData().drop_front(CurrentSegment.length());
}		}

MutableArrayRef<uint8_t> TypeSerializer::getCurrentRecordData() {		MutableArrayRef<uint8_t> TypeSerializer::getCurrentRecordData() {
return MutableArrayRef<uint8_t>(RecordBuffer).take_front(Writer.getOffset());		return MutableArrayRef<uint8_t>(RecordBuffer).take_front(Writer.getOffset());
}		}

Error TypeSerializer::writeRecordPrefix(TypeLeafKind Kind) {		Error TypeSerializer::writeRecordPrefix(TypeLeafKind Kind) {
RecordPrefix Prefix;		RecordPrefix Prefix;
Prefix.RecordKind = Kind;		Prefix.RecordKind = Kind;
Prefix.RecordLen = 0;		Prefix.RecordLen = 0;
if (auto EC = Writer.writeObject(Prefix))		if (auto EC = Writer.writeObject(Prefix))
return EC;		return EC;
return Error::success();		return Error::success();
}		}

TypeIndex
TypeSerializer::insertRecordBytesPrivate(MutableArrayRef<uint8_t> Record) {
assert(Record.size() % 4 == 0 && "Record is not aligned to 4 bytes!");

StringRef S(reinterpret_cast<const char *>(Record.data()), Record.size());

TypeIndex NextTypeIndex = calcNextTypeIndex();
auto Result = HashedRecords.try_emplace(S, NextTypeIndex);
if (Result.second) {
LastTypeIndex = NextTypeIndex;
SeenRecords.push_back(Record);
}
return Result.first->getValue();
}

TypeIndex
TypeSerializer::insertRecordBytesWithCopy(CVType &Record,
MutableArrayRef<uint8_t> Data) {
assert(Data.size() % 4 == 0 && "Record is not aligned to 4 bytes!");

StringRef S(reinterpret_cast<const char *>(Data.data()), Data.size());

// Do a two state lookup / insert so that we don't have to allocate unless
// we're going
// to do an insert. This is a big memory savings.
auto Iter = HashedRecords.find(S);
if (Iter != HashedRecords.end())
return Iter->second;

LastTypeIndex = calcNextTypeIndex();
uint8_t *Copy = RecordStorage.Allocate<uint8_t>(Data.size());
::memcpy(Copy, Data.data(), Data.size());
Data = MutableArrayRef<uint8_t>(Copy, Data.size());
S = StringRef(reinterpret_cast<const char *>(Data.data()), Data.size());
HashedRecords.insert(std::make_pair(S, LastTypeIndex));
SeenRecords.push_back(Data);
Record.RecordData = Data;
return LastTypeIndex;
}

Expected<MutableArrayRef<uint8_t>>		Expected<MutableArrayRef<uint8_t>>
TypeSerializer::addPadding(MutableArrayRef<uint8_t> Record) {		TypeSerializer::addPadding(MutableArrayRef<uint8_t> Record) {
uint32_t Align = Record.size() % 4;		uint32_t Align = Record.size() % 4;
if (Align == 0)		if (Align == 0)
return Record;		return Record;

int PaddingBytes = 4 - Align;		int PaddingBytes = 4 - Align;
int N = PaddingBytes;		int N = PaddingBytes;
while (PaddingBytes > 0) {		while (PaddingBytes > 0) {
uint8_t Pad = static_cast<uint8_t>(LF_PAD0 + PaddingBytes);		uint8_t Pad = static_cast<uint8_t>(LF_PAD0 + PaddingBytes);
if (auto EC = Writer.writeInteger(Pad))		if (auto EC = Writer.writeInteger(Pad))
return std::move(EC);		return std::move(EC);
--PaddingBytes;		--PaddingBytes;
}		}
return MutableArrayRef<uint8_t>(Record.data(), Record.size() + N);		return MutableArrayRef<uint8_t>(Record.data(), Record.size() + N);
}		}

TypeSerializer::TypeSerializer(BumpPtrAllocator &Storage)		TypeSerializer::TypeSerializer(BumpPtrAllocator &Storage)
: RecordStorage(Storage), LastTypeIndex(),		: RecordStorage(Storage), RecordBuffer(MaxRecordLength * 2),
RecordBuffer(MaxRecordLength * 2),
Stream(RecordBuffer, llvm::support::little), Writer(Stream),		Stream(RecordBuffer, llvm::support::little), Writer(Stream),
Mapping(Writer) {		Mapping(Writer), Hasher(make_unique<TypeHasher>(Storage)) {
// RecordBuffer needs to be able to hold enough data so that if we are 1		// RecordBuffer needs to be able to hold enough data so that if we are 1
// byte short of MaxRecordLen, and then we try to write MaxRecordLen bytes,		// byte short of MaxRecordLen, and then we try to write MaxRecordLen bytes,
// we won't overflow.		// we won't overflow.
}		}

ArrayRef<MutableArrayRef<uint8_t>> TypeSerializer::records() const {		TypeSerializer::~TypeSerializer() = default;
return SeenRecords;
}

TypeIndex TypeSerializer::getLastTypeIndex() const { return LastTypeIndex; }		ArrayRef<ArrayRef<uint8_t>> TypeSerializer::records() const {
		return Hasher->records();
		}

TypeIndex TypeSerializer::insertRecordBytes(MutableArrayRef<uint8_t> Record) {		TypeIndex TypeSerializer::insertRecordBytes(ArrayRef<uint8_t> Record) {
assert(!TypeKind.hasValue() && "Already in a type mapping!");		assert(!TypeKind.hasValue() && "Already in a type mapping!");
assert(Writer.getOffset() == 0 && "Stream has data already!");		assert(Writer.getOffset() == 0 && "Stream has data already!");

return insertRecordBytesPrivate(Record);		return Hasher->getOrCreateRecord(Record);
}		}

Error TypeSerializer::visitTypeBegin(CVType &Record) {		Error TypeSerializer::visitTypeBegin(CVType &Record) {
assert(!TypeKind.hasValue() && "Already in a type mapping!");		assert(!TypeKind.hasValue() && "Already in a type mapping!");
assert(Writer.getOffset() == 0 && "Stream has data already!");		assert(Writer.getOffset() == 0 && "Stream has data already!");

if (auto EC = writeRecordPrefix(Record.kind()))		if (auto EC = writeRecordPrefix(Record.kind()))
return EC;		return EC;
Show All 18 Lines	if (!ExpectedData)
return ExpectedData.takeError();		return ExpectedData.takeError();
ThisRecordData = *ExpectedData;		ThisRecordData = *ExpectedData;

RecordPrefix *Prefix =		RecordPrefix *Prefix =
reinterpret_cast<RecordPrefix *>(ThisRecordData.data());		reinterpret_cast<RecordPrefix *>(ThisRecordData.data());
Prefix->RecordLen = ThisRecordData.size() - sizeof(uint16_t);		Prefix->RecordLen = ThisRecordData.size() - sizeof(uint16_t);

Record.Type = *TypeKind;		Record.Type = *TypeKind;
TypeIndex InsertedTypeIndex =		Record.RecordData = ThisRecordData;
insertRecordBytesWithCopy(Record, ThisRecordData);		TypeIndex InsertedTypeIndex = Hasher->getOrCreateRecord(Record.RecordData);

// Write out each additional segment in reverse order, and update each		// Write out each additional segment in reverse order, and update each
// record's continuation index to point to the previous one.		// record's continuation index to point to the previous one.
for (auto X : reverse(FieldListSegments)) {		for (auto X : reverse(FieldListSegments)) {
auto CIBytes = X.take_back(sizeof(uint32_t));		auto CIBytes = X.take_back(sizeof(uint32_t));
support::ulittle32_t *CI =		support::ulittle32_t *CI =
reinterpret_cast<support::ulittle32_t *>(CIBytes.data());		reinterpret_cast<support::ulittle32_t *>(CIBytes.data());
assert(*CI == 0xB0C0B0C0 && "Invalid TypeIndex placeholder");		assert(*CI == 0xB0C0B0C0 && "Invalid TypeIndex placeholder");
*CI = InsertedTypeIndex.getIndex();		*CI = InsertedTypeIndex.getIndex();
InsertedTypeIndex = insertRecordBytesPrivate(X);		InsertedTypeIndex = Hasher->getOrCreateRecord(X);
}		}

TypeKind.reset();		TypeKind.reset();
Writer.setOffset(0);		Writer.setOffset(0);
FieldListSegments.clear();		FieldListSegments.clear();
CurrentSegment.SubRecords.clear();		CurrentSegment.SubRecords.clear();

return InsertedTypeIndex;		return InsertedTypeIndex;
▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

llvm/trunk/lib/DebugInfo/CodeView/TypeTableCollection.cpp

	Show All 18 Lines
	using namespace llvm::codeview;			using namespace llvm::codeview;

	static void error(Error &&EC) {			static void error(Error &&EC) {
	assert(!static_cast<bool>(EC));			assert(!static_cast<bool>(EC));
	if (EC)			if (EC)
	consumeError(std::move(EC));			consumeError(std::move(EC));
	}			}

	TypeTableCollection::TypeTableCollection(			TypeTableCollection::TypeTableCollection(ArrayRef<ArrayRef<uint8_t>> Records)
	ArrayRef<MutableArrayRef<uint8_t>> Records)
	: Records(Records), Database(Records.size()) {}			: Records(Records), Database(Records.size()) {}

	Optional<TypeIndex> TypeTableCollection::getFirst() {			Optional<TypeIndex> TypeTableCollection::getFirst() {
	if (empty())			if (empty())
	return None;			return None;
	return TypeIndex::fromArrayIndex(0);			return TypeIndex::fromArrayIndex(0);
	}			}

	▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

llvm/trunk/tools/llvm-pdbdump/llvm-pdbdump.cpp

Show First 20 Lines • Show All 871 Lines • ▼ Show 20 Lines	static void mergePdbs() {
ExitOnErr(Builder.initialize(4096));		ExitOnErr(Builder.initialize(4096));
// Add each of the reserved streams. We might not put any data in them,		// Add each of the reserved streams. We might not put any data in them,
// but at least they have to be present.		// but at least they have to be present.
for (uint32_t I = 0; I < kSpecialStreamCount; ++I)		for (uint32_t I = 0; I < kSpecialStreamCount; ++I)
ExitOnErr(Builder.getMsfBuilder().addStream(0));		ExitOnErr(Builder.getMsfBuilder().addStream(0));

auto &DestTpi = Builder.getTpiBuilder();		auto &DestTpi = Builder.getTpiBuilder();
auto &DestIpi = Builder.getIpiBuilder();		auto &DestIpi = Builder.getIpiBuilder();
MergedTpi.ForEachRecord(		MergedTpi.ForEachRecord([&DestTpi](TypeIndex TI, ArrayRef<uint8_t> Data) {
[&DestTpi](TypeIndex TI, MutableArrayRef<uint8_t> Data) {
DestTpi.addTypeRecord(Data, None);		DestTpi.addTypeRecord(Data, None);
});		});
MergedIpi.ForEachRecord(		MergedIpi.ForEachRecord([&DestIpi](TypeIndex TI, ArrayRef<uint8_t> Data) {
[&DestIpi](TypeIndex TI, MutableArrayRef<uint8_t> Data) {
DestIpi.addTypeRecord(Data, None);		DestIpi.addTypeRecord(Data, None);
});		});

SmallString<64> OutFile(opts::merge::PdbOutputFile);		SmallString<64> OutFile(opts::merge::PdbOutputFile);
if (OutFile.empty()) {		if (OutFile.empty()) {
OutFile = opts::merge::InputFilenames[0];		OutFile = opts::merge::InputFilenames[0];
llvm::sys::path::replace_extension(OutFile, "merged.pdb");		llvm::sys::path::replace_extension(OutFile, "merged.pdb");
}		}
ExitOnErr(Builder.commit(OutFile));		ExitOnErr(Builder.commit(OutFile));
}		}
▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines