This is an archive of the discontinued LLVM Phabricator instance.

Differential D51956

lld-link: Set PDB GUID to hash of PDB contents instead of to a random byte sequence.
ClosedPublic

Authored by thakis on Sep 11 2018, 4:20 PM.

Download Raw Diff

Details

Reviewers

zturner
ruiu

Commits

rG0bd2d304e672: lld-link: Set PDB GUID to hash of PDB contents instead of to a random byte…
rG205ca68b8db3: Give InfoStreamBuilder an opt-in method to write a hash of the PDB as GUID.
rL342334: lld-link: Set PDB GUID to hash of PDB contents instead of to a random byte…
rLLD342334: lld-link: Set PDB GUID to hash of PDB contents instead of to a random byte…
rL342333: Give InfoStreamBuilder an opt-in method to write a hash of the PDB as GUID.

Summary

Previously, lld-link would use a random byte sequence as the PDB GUID. Instead, use a hash of the PDB file contents.

Naively computing the hash after the PDB data has been generated is in practice as fast as other approaches I tried. I also tried online-computing the hash as parts of the PDB were written out (https://reviews.llvm.org/D51887; that's also where all the measuring data is) and computing the hash in parallel (https://reviews.llvm.org/D51957). This approach here is simplest, without being slower.

To not disturb llvm-pdbutil pdb2yaml, make the hash generation an opt-in feature on InfoStreamBuilder and let ldb/COFF/PDB.cpp always set it.

Since writing the PDB computes this ID which also goes in the exe, the PDB writing code now must be called before writeBuildId(). writeBuildId() for that reason is no longer included in the "Code Layout" timer.

Since the PDB GUID is now a function of the PDB contents, the PDB Age is always set to 1. There was a long comment above loadExistingBuildId (now gone) about how not changing the GUID and only incrementing the age was important, but according to the discussion in PR35914 that comment was incorrect.

Diff Detail

Event Timeline

thakis created this revision.Sep 11 2018, 4:20 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptSep 11 2018, 4:20 PM

This replaces https://reviews.llvm.org/D51957.

Actually, this replaces https://reviews.llvm.org/D51887

rebase, minor cleanups

Please take a look!

ruiu added inline comments.Sep 14 2018, 9:48 AM

lld/COFF/PDB.cpp
128–129	It's return type is void. I'd change the comment or change the return type.
llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp
28	As long as you are using a non-crypto hash function, there is a risk of generating the same build id, and the probability is not negligible if you have a lot of executables due to the birthday problem. Is this okay?

thakis added inline comments.Sep 14 2018, 10:27 AM

lld/COFF/PDB.cpp
128–129	Will do.
llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp
28	The 8 byte hash still gives decent hash collision resistance for up to 2**32 different pdb files, and since pdbs are keyed by executable name on the symbol server that's per binary. Projects tend to have far fewer revisions than 4 billion. Does that make sense?

improve comment

ruiu added inline comments.Sep 14 2018, 10:39 AM

llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp
28	Maybe it is safe. But what could happen if two executables have the same hash? Since xxhash is not cryptographically-safe, you could easily generate two executables having the same ID. Is there any security risks or something caused by that possibility? If the probability is small and the result of hash collision is not that bad, xxhash is probably okay.

thakis added inline comments.Sep 14 2018, 11:25 AM

llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp
28	The main use case for this guid is to an executable to its pdb file. The common workflow is that a build server builds an executable and its pdb, then uploads both to a symbol server (under the namespace of the exe, the exe in a subdir containing the exe's pe timestamp and size, and the pdb under the guid). If the executable crashes, it produces a minidump. From the minidump, crash infrastructure can obtain the full executable and the pdb. Since nothing guarantees that the pdb guid is a hash of the pdb data, I can't think of anything where being able to produce a pdb with a given uuid that is an xxhash buys you anything: Since nothing forces the guid to be a hash, you can just produce a pdb and set its guid field to whatever you want anyways.

The symptoms of a collision are just going to be you can’t debug the
program, so not very severe imo, especially since it would almost certainly
be resolved on the next incremental build

msg-23510-146.txt162 BDownload

In D51956#1235313, @llvm-commits wrote:

The symptoms of a collision are just going to be you can’t debug the
program, so not very severe imo, especially since it would almost certainly
be resolved on the next incremental build

Can you explain how it would lead to you not being able to debug the program?

Do you mean for local builds? If so, if two back-to-back builds end up with the same pdb guid in the exe and pdb by chance even though the pdb changes, the debugger should still load the new pdb off disk fine (?)

Do you mean if a build server produces PDBs with the same guid for different builds? If so, that would probably produce an error during pdb upload and make the build fail, not debugging (?)

If you’re uploading to build server, i don’t think it would be an error, it
would either overwrite or not. If it does overwrite, debugging the exe
matching the pdb that was there before wouldn’t work, and if it did not
overwrite debugging the new exe would fail.

That said, my point was mainly that the probability of this being an issue
in practice is negligible

LGTM

It sounds like I worried a bit too much about hash collisions.

This revision is now accepted and ready to land.Sep 14 2018, 1:18 PM

Closed by commit rL342333: Give InfoStreamBuilder an opt-in method to write a hash of the PDB as GUID. (authored by nico). · Explain WhySep 15 2018, 11:37 AM

This revision was automatically updated to reflect the committed changes.

https://reviews.llvm.org/rLLD342334 too. Thanks!

thakis mentioned this in D89418: [lld-macho] Implement LC_UUID.Nov 16 2020, 3:52 PM

Revision Contents

Path

Size

lld/

COFF/

PDB.h

2 lines

PDB.cpp

37 lines

Writer.cpp

88 lines

test/

COFF/

rsds.test

51 lines

llvm/

include/

llvm/

DebugInfo/

PDB/

Native/

InfoStreamBuilder.h

11 lines

PDBFileBuilder.h

4 lines

lib/

DebugInfo/

PDB/

Native/

GSIStreamBuilder.cpp

5 lines

InfoStreamBuilder.cpp

11 lines

PDBFileBuilder.cpp

29 lines

tools/

llvm-pdbutil/

llvm-pdbutil.cpp

7 lines

Diff 164994

lld/COFF/PDB.h

	Show All 22 Lines
	namespace coff {			namespace coff {
	class OutputSection;			class OutputSection;
	class SectionChunk;			class SectionChunk;
	class SymbolTable;			class SymbolTable;

	void createPDB(SymbolTable *Symtab,			void createPDB(SymbolTable *Symtab,
	llvm::ArrayRef<OutputSection *> OutputSections,			llvm::ArrayRef<OutputSection *> OutputSections,
	llvm::ArrayRef<uint8_t> SectionTable,			llvm::ArrayRef<uint8_t> SectionTable,
	const llvm::codeview::DebugInfo &BuildId);			llvm::codeview::DebugInfo *BuildId);

	std::pair<llvm::StringRef, uint32_t> getFileLine(const SectionChunk *C,			std::pair<llvm::StringRef, uint32_t> getFileLine(const SectionChunk *C,
	uint32_t Addr);			uint32_t Addr);
	}			}
	}			}

	#endif			#endif

lld/COFF/PDB.cpp

Show First 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	PDBLinker(SymbolTable *Symtab)
IDTable(Alloc), GlobalTypeTable(Alloc), GlobalIDTable(Alloc) {		IDTable(Alloc), GlobalTypeTable(Alloc), GlobalIDTable(Alloc) {
// This isn't strictly necessary, but link.exe usually puts an empty string		// This isn't strictly necessary, but link.exe usually puts an empty string
// as the first "valid" string in the string table, so we do the same in		// as the first "valid" string in the string table, so we do the same in
// order to maintain as much byte-for-byte compatibility as possible.		// order to maintain as much byte-for-byte compatibility as possible.
PDBStrTab.insert("");		PDBStrTab.insert("");
}		}

/// Emit the basic PDB structure: initial streams, headers, etc.		/// Emit the basic PDB structure: initial streams, headers, etc.
void initialize(const llvm::codeview::DebugInfo &BuildId);		void initialize(llvm::codeview::DebugInfo *BuildId);

/// Add natvis files specified on the command line.		/// Add natvis files specified on the command line.
void addNatvisFiles();		void addNatvisFiles();

/// Link CodeView from each object file in the symbol table into the PDB.		/// Link CodeView from each object file in the symbol table into the PDB.
void addObjectsToPDB();		void addObjectsToPDB();

/// Link CodeView from a single object file into the PDB.		/// Link CodeView from a single object file into the PDB.
Show All 15 Lines	public:

Expected<const CVIndexMap&> maybeMergeTypeServerPDB(ObjFile *File,		Expected<const CVIndexMap&> maybeMergeTypeServerPDB(ObjFile *File,
TypeServer2Record &TS);		TypeServer2Record &TS);

/// Add the section map and section contributions to the PDB.		/// Add the section map and section contributions to the PDB.
void addSections(ArrayRef<OutputSection *> OutputSections,		void addSections(ArrayRef<OutputSection *> OutputSections,
ArrayRef<uint8_t> SectionTable);		ArrayRef<uint8_t> SectionTable);

/// Write the PDB to disk.		/// Write the PDB to disk and return the Guid generated for it.
void commit();		void commit(codeview::GUID *Guid);
		ruiuUnsubmitted Done Reply Inline Actions It's return type is void. I'd change the comment or change the return type. ruiu: It's return type is void. I'd change the comment or change the return type.
		thakisAuthorUnsubmitted Not Done Reply Inline Actions Will do. thakis: Will do.

private:		private:
BumpPtrAllocator Alloc;		BumpPtrAllocator Alloc;

SymbolTable *Symtab;		SymbolTable *Symtab;

pdb::PDBFileBuilder Builder;		pdb::PDBFileBuilder Builder;

▲ Show 20 Lines • Show All 188 Lines • ▼ Show 20 Lines	tryToLoadPDB(const GUID &GuidFromObj, StringRef TSPath) {
// PDB file doesn't mean it matches. For it to match the InfoStream's GUID		// PDB file doesn't mean it matches. For it to match the InfoStream's GUID
// must match the GUID specified in the TypeServer2 record.		// must match the GUID specified in the TypeServer2 record.
if (ExpectedInfo->getGuid() != GuidFromObj)		if (ExpectedInfo->getGuid() != GuidFromObj)
return make_error<pdb::PDBError>(pdb::pdb_error_code::signature_out_of_date);		return make_error<pdb::PDBError>(pdb::pdb_error_code::signature_out_of_date);

return std::move(NS);		return std::move(NS);
}		}

Expected<const CVIndexMap&> PDBLinker::maybeMergeTypeServerPDB(ObjFile *File,		Expected<const CVIndexMap &>
TypeServer2Record &TS) {		PDBLinker::maybeMergeTypeServerPDB(ObjFile *File, TypeServer2Record &TS) {
const GUID &TSId = TS.getGuid();		const GUID &TSId = TS.getGuid();
StringRef TSPath = TS.getName();		StringRef TSPath = TS.getName();

// First, check if the PDB has previously failed to load.		// First, check if the PDB has previously failed to load.
auto PrevErr = MissingTypeServerPDBs.find(TSId);		auto PrevErr = MissingTypeServerPDBs.find(TSId);
if (PrevErr != MissingTypeServerPDBs.end())		if (PrevErr != MissingTypeServerPDBs.end())
return createFileError(		return createFileError(
TSPath,		TSPath,
▲ Show 20 Lines • Show All 603 Lines • ▼ Show 20 Lines	void PDBLinker::addObjFile(ObjFile *File) {
}		}

// Make a new file checksum table that refers to offsets in the PDB-wide		// Make a new file checksum table that refers to offsets in the PDB-wide
// string table. Generally the string table subsection appears after the		// string table. Generally the string table subsection appears after the
// checksum table, so we have to do this after looping over all the		// checksum table, so we have to do this after looping over all the
// subsections.		// subsections.
auto NewChecksums = make_unique<DebugChecksumsSubsection>(PDBStrTab);		auto NewChecksums = make_unique<DebugChecksumsSubsection>(PDBStrTab);
for (FileChecksumEntry &FC : Checksums) {		for (FileChecksumEntry &FC : Checksums) {
SmallString<128> FileName = ExitOnErr(CVStrTab.getString(FC.FileNameOffset));		SmallString<128> FileName =
		ExitOnErr(CVStrTab.getString(FC.FileNameOffset));
if (!sys::path::is_absolute(FileName) &&		if (!sys::path::is_absolute(FileName) &&
!Config->PDBSourcePath.empty()) {		!Config->PDBSourcePath.empty()) {
SmallString<128> AbsoluteFileName = Config->PDBSourcePath;		SmallString<128> AbsoluteFileName = Config->PDBSourcePath;
sys::path::append(AbsoluteFileName, FileName);		sys::path::append(AbsoluteFileName, FileName);
sys::path::native(AbsoluteFileName);		sys::path::native(AbsoluteFileName);
sys::path::remove_dots(AbsoluteFileName, /remove_dot_dots=/true);		sys::path::remove_dots(AbsoluteFileName, /remove_dot_dots=/true);
FileName = std::move(AbsoluteFileName);		FileName = std::move(AbsoluteFileName);
}		}
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	static void addLinkerModuleSectionSymbol(pdb::DbiModuleDescriptorBuilder &Mod,
Mod.addSymbol(codeview::SymbolSerializer::writeOneSymbol(		Mod.addSymbol(codeview::SymbolSerializer::writeOneSymbol(
Sym, Allocator, CodeViewContainer::Pdb));		Sym, Allocator, CodeViewContainer::Pdb));
}		}

// Creates a PDB file.		// Creates a PDB file.
void coff::createPDB(SymbolTable *Symtab,		void coff::createPDB(SymbolTable *Symtab,
ArrayRef<OutputSection *> OutputSections,		ArrayRef<OutputSection *> OutputSections,
ArrayRef<uint8_t> SectionTable,		ArrayRef<uint8_t> SectionTable,
const llvm::codeview::DebugInfo &BuildId) {		llvm::codeview::DebugInfo *BuildId) {
ScopedTimer T1(TotalPdbLinkTimer);		ScopedTimer T1(TotalPdbLinkTimer);
PDBLinker PDB(Symtab);		PDBLinker PDB(Symtab);

PDB.initialize(BuildId);		PDB.initialize(BuildId);
PDB.addObjectsToPDB();		PDB.addObjectsToPDB();
PDB.addSections(OutputSections, SectionTable);		PDB.addSections(OutputSections, SectionTable);
PDB.addNatvisFiles();		PDB.addNatvisFiles();

ScopedTimer T2(DiskCommitTimer);		ScopedTimer T2(DiskCommitTimer);
PDB.commit();		codeview::GUID Guid;
		PDB.commit(&Guid);
		memcpy(&BuildId->PDB70.Signature, &Guid, 16);
}		}

void PDBLinker::initialize(const llvm::codeview::DebugInfo &BuildId) {		void PDBLinker::initialize(llvm::codeview::DebugInfo *BuildId) {
ExitOnErr(Builder.initialize(4096)); // 4096 is blocksize		ExitOnErr(Builder.initialize(4096)); // 4096 is blocksize

		BuildId->Signature.CVSignature = OMF::Signature::PDB70;
		// Signature is set to a hash of the PDB contents when the PDB is done.
		memset(BuildId->PDB70.Signature, 0, 16);
		BuildId->PDB70.Age = 1;

// Create streams in MSF for predefined streams, namely		// Create streams in MSF for predefined streams, namely
// PDB, TPI, DBI and IPI.		// PDB, TPI, DBI and IPI.
for (int I = 0; I < (int)pdb::kSpecialStreamCount; ++I)		for (int I = 0; I < (int)pdb::kSpecialStreamCount; ++I)
ExitOnErr(Builder.getMsfBuilder().addStream(0));		ExitOnErr(Builder.getMsfBuilder().addStream(0));

// Add an Info stream.		// Add an Info stream.
auto &InfoBuilder = Builder.getInfoBuilder();		auto &InfoBuilder = Builder.getInfoBuilder();
GUID uuid;
memcpy(&uuid, &BuildId.PDB70.Signature, sizeof(uuid));
InfoBuilder.setAge(BuildId.PDB70.Age);
InfoBuilder.setGuid(uuid);
InfoBuilder.setVersion(pdb::PdbRaw_ImplVer::PdbImplVC70);		InfoBuilder.setVersion(pdb::PdbRaw_ImplVer::PdbImplVC70);
		InfoBuilder.setHashPDBContentsToGUID(true);

// Add an empty DBI stream.		// Add an empty DBI stream.
pdb::DbiStreamBuilder &DbiBuilder = Builder.getDbiBuilder();		pdb::DbiStreamBuilder &DbiBuilder = Builder.getDbiBuilder();
DbiBuilder.setAge(BuildId.PDB70.Age);		DbiBuilder.setAge(BuildId->PDB70.Age);
DbiBuilder.setVersionHeader(pdb::PdbDbiV70);		DbiBuilder.setVersionHeader(pdb::PdbDbiV70);
DbiBuilder.setMachineType(Config->Machine);		DbiBuilder.setMachineType(Config->Machine);
// Technically we are not link.exe 14.11, but there are known cases where		// Technically we are not link.exe 14.11, but there are known cases where
// debugging tools on Windows expect Microsoft-specific version numbers or		// debugging tools on Windows expect Microsoft-specific version numbers or
// they fail to work at all. Since we know we produce PDBs that are		// they fail to work at all. Since we know we produce PDBs that are
// compatible with LINK 14.11, we set that version number here.		// compatible with LINK 14.11, we set that version number here.
DbiBuilder.setBuildNumber(14, 11);		DbiBuilder.setBuildNumber(14, 11);
}		}
Show All 27 Lines	void PDBLinker::addSections(ArrayRef<OutputSection *> OutputSections,
SectionMap = pdb::DbiStreamBuilder::createSectionMap(Sections);		SectionMap = pdb::DbiStreamBuilder::createSectionMap(Sections);
DbiBuilder.setSectionMap(SectionMap);		DbiBuilder.setSectionMap(SectionMap);

// Add COFF section header stream.		// Add COFF section header stream.
ExitOnErr(		ExitOnErr(
DbiBuilder.addDbgStream(pdb::DbgHeaderType::SectionHdr, SectionTable));		DbiBuilder.addDbgStream(pdb::DbgHeaderType::SectionHdr, SectionTable));
}		}

void PDBLinker::commit() {		void PDBLinker::commit(codeview::GUID *Guid) {
// Write to a file.		// Write to a file.
ExitOnErr(Builder.commit(Config->PDBPath));		ExitOnErr(Builder.commit(Config->PDBPath, Guid));
}		}

static Expected<StringRef>		static Expected<StringRef>
getFileName(const DebugStringTableSubsectionRef &Strings,		getFileName(const DebugStringTableSubsectionRef &Strings,
const DebugChecksumsSubsectionRef &Checksums, uint32_t FileID) {		const DebugChecksumsSubsectionRef &Checksums, uint32_t FileID) {
auto Iter = Checksums.getArray().at(FileID);		auto Iter = Checksums.getArray().at(FileID);
if (Iter == Checksums.getArray().end())		if (Iter == Checksums.getArray().end())
return make_error<CodeViewError>(cv_error_code::no_records);		return make_error<CodeViewError>(cv_error_code::no_records);
▲ Show 20 Lines • Show All 135 Lines • Show Last 20 Lines

lld/COFF/Writer.cpp

Show First 20 Lines • Show All 203 Lines • ▼ Show 20 Lines	private:
IdataContents Idata;		IdataContents Idata;
DelayLoadContents DelayIdata;		DelayLoadContents DelayIdata;
EdataContents Edata;		EdataContents Edata;
bool SetNoSEHCharacteristic = false;		bool SetNoSEHCharacteristic = false;

DebugDirectoryChunk *DebugDirectory = nullptr;		DebugDirectoryChunk *DebugDirectory = nullptr;
std::vector<Chunk *> DebugRecords;		std::vector<Chunk *> DebugRecords;
CVDebugRecordChunk *BuildId = nullptr;		CVDebugRecordChunk *BuildId = nullptr;
Optional<codeview::DebugInfo> PreviousBuildId;
ArrayRef<uint8_t> SectionTable;		ArrayRef<uint8_t> SectionTable;

uint64_t FileSize;		uint64_t FileSize;
uint32_t PointerToSymbolTable = 0;		uint32_t PointerToSymbolTable = 0;
uint64_t SizeOfImage;		uint64_t SizeOfImage;
uint64_t SizeOfHeaders;		uint64_t SizeOfHeaders;

OutputSection *TextSec;		OutputSection *TextSec;
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	if (StringTableOff) {
strncpy(Hdr->Name, Name.data(),		strncpy(Hdr->Name, Name.data(),
std::min(Name.size(), (size_t)COFF::NameSize));		std::min(Name.size(), (size_t)COFF::NameSize));
}		}
}		}

} // namespace coff		} // namespace coff
} // namespace lld		} // namespace lld

// PDBs are matched against executables using a build id which consists of three
// components:
// 1. A 16-bit GUID
// 2. An age
// 3. A time stamp.
//
// Debuggers and symbol servers match executables against debug info by checking
// each of these components of the EXE/DLL against the corresponding value in
// the PDB and failing a match if any of the components differ. In the case of
// symbol servers, symbols are cached in a folder that is a function of the
// GUID. As a result, in order to avoid symbol cache pollution where every
// incremental build copies a new PDB to the symbol cache, we must try to re-use
// the existing GUID if one exists, but bump the age. This way the match will
// fail, so the symbol cache knows to use the new PDB, but the GUID matches, so
// it overwrites the existing item in the symbol cache rather than making a new
// one.
static Optional<codeview::DebugInfo> loadExistingBuildId(StringRef Path) {
// We don't need to incrementally update a previous build id if we're not
// writing codeview debug info.
if (!Config->Debug)
return None;

auto ExpectedBinary = llvm::object::createBinary(Path);
if (!ExpectedBinary) {
consumeError(ExpectedBinary.takeError());
return None;
}

auto Binary = std::move(*ExpectedBinary);
if (!Binary.getBinary()->isCOFF())
return None;

std::error_code EC;
COFFObjectFile File(Binary.getBinary()->getMemoryBufferRef(), EC);
if (EC)
return None;

// If the machine of the binary we're outputting doesn't match the machine
// of the existing binary, don't try to re-use the build id.
if (File.is64() != Config->is64() \|\| File.getMachine() != Config->Machine)
return None;

for (const auto &DebugDir : File.debug_directories()) {
if (DebugDir.Type != IMAGE_DEBUG_TYPE_CODEVIEW)
continue;

const codeview::DebugInfo *ExistingDI = nullptr;
StringRef PDBFileName;
if (auto EC = File.getDebugPDBInfo(ExistingDI, PDBFileName)) {
(void)EC;
return None;
}
// We only support writing PDBs in v70 format. So if this is not a build
// id that we recognize / support, ignore it.
if (ExistingDI->Signature.CVSignature != OMF::Signature::PDB70)
return None;
return *ExistingDI;
}
return None;
}

// The main function of the writer.		// The main function of the writer.
void Writer::run() {		void Writer::run() {
ScopedTimer T1(CodeLayoutTimer);		ScopedTimer T1(CodeLayoutTimer);

createSections();		createSections();
createMiscChunks();		createMiscChunks();
createImportTables();		createImportTables();
createExportTable();		createExportTable();
mergeSections();		mergeSections();
assignAddresses();		assignAddresses();
removeEmptySections();		removeEmptySections();
setSectionPermissions();		setSectionPermissions();
createSymbolAndStringTable();		createSymbolAndStringTable();

if (FileSize > UINT32_MAX)		if (FileSize > UINT32_MAX)
fatal("image size (" + Twine(FileSize) + ") " +		fatal("image size (" + Twine(FileSize) + ") " +
"exceeds maximum allowable size (" + Twine(UINT32_MAX) + ")");		"exceeds maximum allowable size (" + Twine(UINT32_MAX) + ")");

// We must do this before opening the output file, as it depends on being able
// to read the contents of the existing output file.
PreviousBuildId = loadExistingBuildId(Config->OutputFile);
openFile(Config->OutputFile);		openFile(Config->OutputFile);
if (Config->is64()) {		if (Config->is64()) {
writeHeader<pe32plus_header>();		writeHeader<pe32plus_header>();
} else {		} else {
writeHeader<pe32_header>();		writeHeader<pe32_header>();
}		}
writeSections();		writeSections();
sortExceptionTable();		sortExceptionTable();
writeBuildId();

T1.stop();		T1.stop();

if (!Config->PDBPath.empty() && Config->Debug) {		if (!Config->PDBPath.empty() && Config->Debug) {
assert(BuildId);		assert(BuildId);
createPDB(Symtab, OutputSections, SectionTable, *BuildId->BuildId);		createPDB(Symtab, OutputSections, SectionTable, BuildId->BuildId);
}		}
		writeBuildId();

writeMapFile(OutputSections);		writeMapFile(OutputSections);

ScopedTimer T2(DiskCommitTimer);		ScopedTimer T2(DiskCommitTimer);
if (auto E = Buffer->commit())		if (auto E = Buffer->commit())
fatal("failed to write the output file: " + toString(std::move(E)));		fatal("failed to write the output file: " + toString(std::move(E)));
}		}

▲ Show 20 Lines • Show All 826 Lines • ▼ Show 20 Lines
}		}

void Writer::writeBuildId() {		void Writer::writeBuildId() {
// There are two important parts to the build ID.		// There are two important parts to the build ID.
// 1) If building with debug info, the COFF debug directory contains a		// 1) If building with debug info, the COFF debug directory contains a
// timestamp as well as a Guid and Age of the PDB.		// timestamp as well as a Guid and Age of the PDB.
// 2) In all cases, the PE COFF file header also contains a timestamp.		// 2) In all cases, the PE COFF file header also contains a timestamp.
// For reproducibility, instead of a timestamp we want to use a hash of the		// For reproducibility, instead of a timestamp we want to use a hash of the
// binary, however when building with debug info the hash needs to take into		// PE contents.
// account the debug info, since it's possible to add blank lines to a file
// which causes the debug info to change but not the generated code.
//
// To handle this, we first set the Guid and Age in the debug directory (but
// only if we're doing a debug build). Then, we hash the binary (thus causing
// the hash to change if only the debug info changes, since the Age will be
// different). Finally, we write that hash into the debug directory (if
// present) as well as the COFF file header (always).
if (Config->Debug) {		if (Config->Debug) {
assert(BuildId && "BuildId is not set!");		assert(BuildId && "BuildId is not set!");
if (PreviousBuildId.hasValue()) {		// BuildId->BuildId was filled in when the PDB was written.
BuildId->BuildId = PreviousBuildId;
BuildId->BuildId->PDB70.Age = BuildId->BuildId->PDB70.Age + 1;
} else {
BuildId->BuildId->Signature.CVSignature = OMF::Signature::PDB70;
BuildId->BuildId->PDB70.Age = 1;
llvm::getRandomBytes(BuildId->BuildId->PDB70.Signature, 16);
}
}		}

// At this point the only fields in the COFF file which remain unset are the		// At this point the only fields in the COFF file which remain unset are the
// "timestamp" in the COFF file header, and the ones in the coff debug		// "timestamp" in the COFF file header, and the ones in the coff debug
// directory. Now we can hash the file and write that hash to the various		// directory. Now we can hash the file and write that hash to the various
// timestamp fields in the file.		// timestamp fields in the file.
StringRef OutputFileData(		StringRef OutputFileData(
reinterpret_cast<const char *>(Buffer->getBufferStart()),		reinterpret_cast<const char *>(Buffer->getBufferStart()),
▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lld/test/COFF/rsds.test

	# RUN: yaml2obj %s > %t.obj			# RUN: yaml2obj %s > %t.obj

	# RUN: rm -f %t.dll %t.pdb			# RUN: rm -f %t.dll %t.pdb
	# RUN: lld-link /debug /pdbaltpath:test1.pdb /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /debug /pdbaltpath:test.pdb /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.1.txt			# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.1.txt
	# RUN: lld-link /debug /pdbaltpath:test2.pdb /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /debug /pdbaltpath:test.pdb /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.2.txt			# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.2.txt
	# RUN: cat %t.1.txt %t.2.txt \| FileCheck %s			# RUN: cat %t.1.txt %t.2.txt \| FileCheck %s

	# RUN: rm -f %t.dll %t.pdb			# RUN: rm -f %t.dll %t.pdb
	# RUN: lld-link /debug /pdb:%t1.pdb /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /debug /pdb:%t1.pdb /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.3.txt			# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.3.txt
	# RUN: lld-link /debug /pdb:%t2.pdb /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /debug /pdb:%t2.pdb /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.4.txt			# RUN: llvm-readobj -coff-debug-directory %t.dll > %t.4.txt
	# RUN: cat %t.3.txt %t.4.txt \| FileCheck %s			# RUN: cat %t.3.txt %t.4.txt \| FileCheck --check-prefix TWOPDBS %s

	# RUN: rm -f %t.dll %t.pdb			# RUN: rm -f %t.dll %t.pdb
	# RUN: lld-link /Brepro /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /Brepro /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll \| FileCheck --check-prefix REPRO %s			# RUN: llvm-readobj -coff-debug-directory %t.dll \| FileCheck --check-prefix REPRO %s

	# RUN: rm -f %t.dll %t.pdb			# RUN: rm -f %t.dll %t.pdb
	# RUN: lld-link /Brepro /debug /dll /out:%t.dll /entry:DllMain %t.obj			# RUN: lld-link /Brepro /debug /dll /out:%t.dll /entry:DllMain %t.obj
	# RUN: llvm-readobj -coff-debug-directory %t.dll \| FileCheck --check-prefix REPRODEBUG %s			# RUN: llvm-readobj -coff-debug-directory %t.dll \| FileCheck --check-prefix REPRODEBUG %s

	# CHECK: File: [[FILE:.*]].dll			# CHECK: File: [[FILE:.*]].dll
	# CHECK: DebugDirectory [			# CHECK: DebugDirectory [
	# CHECK: DebugEntry {			# CHECK: DebugEntry {
	# CHECK: Characteristics: 0x0			# CHECK: Characteristics: 0x0
	# CHECK: TimeDateStamp:			# CHECK: TimeDateStamp:
	# CHECK: MajorVersion: 0x0			# CHECK: MajorVersion: 0x0
	# CHECK: MinorVersion: 0x0			# CHECK: MinorVersion: 0x0
	# CHECK: Type: CodeView (0x2)			# CHECK: Type: CodeView (0x2)
	# CHECK: SizeOfData: 0x{{[^0]}}			# CHECK: SizeOfData: 0x{{[^0]}}
	# CHECK: AddressOfRawData: 0x{{[^0]}}			# CHECK: AddressOfRawData: 0x{{[^0]}}
	# CHECK: PointerToRawData: 0x{{[^0]}}			# CHECK: PointerToRawData: 0x{{[^0]}}
	# CHECK: PDBInfo {			# CHECK: PDBInfo {
	# CHECK: PDBSignature: 0x53445352			# CHECK: PDBSignature: 0x53445352
	# CHECK: PDBGUID: [[GUID:\(([A-Za-z0-9]{2} ?){16}\)]]			# CHECK: PDBGUID: [[GUID:\(([A-Za-z0-9]{2} ?){16}\)]]
	# CHECK: PDBAge: 1			# CHECK: PDBAge: 1
	# CHECK: PDBFileName: {{.*}}1.pdb			# CHECK: PDBFileName: {{.*}}.pdb
	# CHECK: }			# CHECK: }
	# CHECK: }			# CHECK: }
	# CHECK: ]			# CHECK: ]
	# CHECK: File: [[FILE]].dll			# CHECK: File: [[FILE]].dll
	# CHECK: DebugDirectory [			# CHECK: DebugDirectory [
	# CHECK: DebugEntry {			# CHECK: DebugEntry {
	# CHECK: Characteristics: 0x0			# CHECK: Characteristics: 0x0
	# CHECK: TimeDateStamp:			# CHECK: TimeDateStamp:
	# CHECK: MajorVersion: 0x0			# CHECK: MajorVersion: 0x0
	# CHECK: MinorVersion: 0x0			# CHECK: MinorVersion: 0x0
	# CHECK: Type: CodeView (0x2)			# CHECK: Type: CodeView (0x2)
	# CHECK: SizeOfData: 0x{{[^0]}}			# CHECK: SizeOfData: 0x{{[^0]}}
	# CHECK: AddressOfRawData: 0x{{[^0]}}			# CHECK: AddressOfRawData: 0x{{[^0]}}
	# CHECK: PointerToRawData: 0x{{[^0]}}			# CHECK: PointerToRawData: 0x{{[^0]}}
	# CHECK: PDBInfo {			# CHECK: PDBInfo {
	# CHECK: PDBSignature: 0x53445352			# CHECK: PDBSignature: 0x53445352
	# CHECK: PDBGUID: [[GUID]]			# CHECK: PDBGUID: [[GUID]]
	# CHECK: PDBAge: 2			# CHECK: PDBAge: 1
	# CHECK: PDBFileName: {{.*}}2.pdb			# CHECK: PDBFileName: {{.*}}.pdb
	# CHECK: }			# CHECK: }
	# CHECK: }			# CHECK: }
	# CHECK: ]			# CHECK: ]

				# TWOPDBS: File: [[FILE:.*]].dll
				# TWOPDBS: DebugDirectory [
				# TWOPDBS: DebugEntry {
				# TWOPDBS: Characteristics: 0x0
				# TWOPDBS: TimeDateStamp:
				# TWOPDBS: MajorVersion: 0x0
				# TWOPDBS: MinorVersion: 0x0
				# TWOPDBS: Type: CodeView (0x2)
				# TWOPDBS: SizeOfData: 0x{{[^0]}}
				# TWOPDBS: AddressOfRawData: 0x{{[^0]}}
				# TWOPDBS: PointerToRawData: 0x{{[^0]}}
				# TWOPDBS: PDBInfo {
				# TWOPDBS: PDBSignature: 0x53445352
				# TWOPDBS: PDBGUID: [[GUID:\(([A-Za-z0-9]{2} ?){16}\)]]
				# TWOPDBS: PDBAge: 1
				# TWOPDBS: PDBFileName: {{.*}}.pdb
				# TWOPDBS: }
				# TWOPDBS: }
				# TWOPDBS: ]
				# TWOPDBS: File: [[FILE]].dll
				# TWOPDBS: DebugDirectory [
				# TWOPDBS: DebugEntry {
				# TWOPDBS: Characteristics: 0x0
				# TWOPDBS: TimeDateStamp:
				# TWOPDBS: MajorVersion: 0x0
				# TWOPDBS: MinorVersion: 0x0
				# TWOPDBS: Type: CodeView (0x2)
				# TWOPDBS: SizeOfData: 0x{{[^0]}}
				# TWOPDBS: AddressOfRawData: 0x{{[^0]}}
				# TWOPDBS: PointerToRawData: 0x{{[^0]}}
				# TWOPDBS: PDBInfo {
				# TWOPDBS: PDBSignature: 0x53445352
				# TWOPDBS-NOT: PDBGUID: [[GUID]]
				# TWOPDBS: PDBAge: 1
				# TWOPDBS: PDBFileName: {{.*}}.pdb
				# TWOPDBS: }
				# TWOPDBS: }
				# TWOPDBS: ]

	# REPRO: File: {{.*}}.dll			# REPRO: File: {{.*}}.dll
	# REPRO: DebugDirectory [			# REPRO: DebugDirectory [
	# REPRO: DebugEntry {			# REPRO: DebugEntry {
	# REPRO: Characteristics: 0x0			# REPRO: Characteristics: 0x0
	# REPRO: TimeDateStamp:			# REPRO: TimeDateStamp:
	# REPRO: MajorVersion: 0x0			# REPRO: MajorVersion: 0x0
	# REPRO: MinorVersion: 0x0			# REPRO: MinorVersion: 0x0
	# REPRO: Type: Repro (0x10)			# REPRO: Type: Repro (0x10)
	▲ Show 20 Lines • Show All 102 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/PDB/Native/InfoStreamBuilder.h

	Show All 29 Lines

	class InfoStreamBuilder {			class InfoStreamBuilder {
	public:			public:
	InfoStreamBuilder(msf::MSFBuilder &Msf, NamedStreamMap &NamedStreams);			InfoStreamBuilder(msf::MSFBuilder &Msf, NamedStreamMap &NamedStreams);
	InfoStreamBuilder(const InfoStreamBuilder &) = delete;			InfoStreamBuilder(const InfoStreamBuilder &) = delete;
	InfoStreamBuilder &operator=(const InfoStreamBuilder &) = delete;			InfoStreamBuilder &operator=(const InfoStreamBuilder &) = delete;

	void setVersion(PdbRaw_ImplVer V);			void setVersion(PdbRaw_ImplVer V);
				void addFeature(PdbRaw_FeatureSig Sig);

				// If this is true, the PDB contents are hashed and this hash is used as
				// PDB GUID and as Signature. The age is always 1.
				void setHashPDBContentsToGUID(bool B);

				// These only have an effect if hashPDBContentsToGUID() is false.
	void setSignature(uint32_t S);			void setSignature(uint32_t S);
	void setAge(uint32_t A);			void setAge(uint32_t A);
	void setGuid(codeview::GUID G);			void setGuid(codeview::GUID G);
	void addFeature(PdbRaw_FeatureSig Sig);

				bool hashPDBContentsToGUID() const { return HashPDBContentsToGUID; }
	uint32_t getAge() const { return Age; }			uint32_t getAge() const { return Age; }
	codeview::GUID getGuid() const { return Guid; }			codeview::GUID getGuid() const { return Guid; }
	Optional<uint32_t> getSignature() const { return Signature; }			Optional<uint32_t> getSignature() const { return Signature; }

	uint32_t finalize();			uint32_t finalize();

	Error finalizeMsfLayout();			Error finalizeMsfLayout();

	Error commit(const msf::MSFLayout &Layout,			Error commit(const msf::MSFLayout &Layout,
	WritableBinaryStreamRef Buffer) const;			WritableBinaryStreamRef Buffer) const;

	private:			private:
	msf::MSFBuilder &Msf;			msf::MSFBuilder &Msf;

	std::vector<PdbRaw_FeatureSig> Features;			std::vector<PdbRaw_FeatureSig> Features;
	PdbRaw_ImplVer Ver;			PdbRaw_ImplVer Ver;
	uint32_t Age;			uint32_t Age;
	Optional<uint32_t> Signature;			Optional<uint32_t> Signature;
	codeview::GUID Guid;			codeview::GUID Guid;

				bool HashPDBContentsToGUID = false;

	NamedStreamMap &NamedStreams;			NamedStreamMap &NamedStreams;
	};			};
	}			}
	}			}

	#endif			#endif

llvm/include/llvm/DebugInfo/PDB/Native/PDBFileBuilder.h

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	public:
msf::MSFBuilder &getMsfBuilder();		msf::MSFBuilder &getMsfBuilder();
InfoStreamBuilder &getInfoBuilder();		InfoStreamBuilder &getInfoBuilder();
DbiStreamBuilder &getDbiBuilder();		DbiStreamBuilder &getDbiBuilder();
TpiStreamBuilder &getTpiBuilder();		TpiStreamBuilder &getTpiBuilder();
TpiStreamBuilder &getIpiBuilder();		TpiStreamBuilder &getIpiBuilder();
PDBStringTableBuilder &getStringTableBuilder();		PDBStringTableBuilder &getStringTableBuilder();
GSIStreamBuilder &getGsiBuilder();		GSIStreamBuilder &getGsiBuilder();

Error commit(StringRef Filename);		// If HashPDBContentsToGUID is true on the InfoStreamBuilder, Guid is filled
		// with the computed PDB GUID on return.
		Error commit(StringRef Filename, codeview::GUID *Guid);

Expected<uint32_t> getNamedStreamIndex(StringRef Name) const;		Expected<uint32_t> getNamedStreamIndex(StringRef Name) const;
Error addNamedStream(StringRef Name, StringRef Data);		Error addNamedStream(StringRef Name, StringRef Data);
void addInjectedSource(StringRef Name, std::unique_ptr<MemoryBuffer> Buffer);		void addInjectedSource(StringRef Name, std::unique_ptr<MemoryBuffer> Buffer);

private:		private:
struct InjectedSourceDescriptor {		struct InjectedSourceDescriptor {
// The full name of the stream that contains the contents of this injected		// The full name of the stream that contains the contents of this injected
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/lib/DebugInfo/PDB/Native/GSIStreamBuilder.cpp

	Show First 20 Lines • Show All 304 Lines • ▼ Show 20 Lines
	}			}

	Error GSIStreamBuilder::commitPublicsHashStream(			Error GSIStreamBuilder::commitPublicsHashStream(
	WritableBinaryStreamRef Stream) {			WritableBinaryStreamRef Stream) {
	BinaryStreamWriter Writer(Stream);			BinaryStreamWriter Writer(Stream);
	PublicsStreamHeader Header;			PublicsStreamHeader Header;

	// FIXME: Fill these in. They are for incremental linking.			// FIXME: Fill these in. They are for incremental linking.
				Header.SymHash = PSH->calculateSerializedLength();
				Header.AddrMap = PSH->Records.size() * 4;
	Header.NumThunks = 0;			Header.NumThunks = 0;
	Header.SizeOfThunk = 0;			Header.SizeOfThunk = 0;
	Header.ISectThunkTable = 0;			Header.ISectThunkTable = 0;
				memset(Header.Padding, 0, sizeof(Header.Padding));
	Header.OffThunkTable = 0;			Header.OffThunkTable = 0;
	Header.NumSections = 0;			Header.NumSections = 0;
	Header.SymHash = PSH->calculateSerializedLength();
	Header.AddrMap = PSH->Records.size() * 4;
	if (auto EC = Writer.writeObject(Header))			if (auto EC = Writer.writeObject(Header))
	return EC;			return EC;

	if (auto EC = PSH->commit(Writer))			if (auto EC = PSH->commit(Writer))
	return EC;			return EC;

	std::vector<ulittle32_t> AddrMap = computeAddrMap(PSH->Records);			std::vector<ulittle32_t> AddrMap = computeAddrMap(PSH->Records);
	if (auto EC = Writer.writeArray(makeArrayRef(AddrMap)))			if (auto EC = Writer.writeArray(makeArrayRef(AddrMap)))
	Show All 28 Lines

llvm/lib/DebugInfo/PDB/Native/InfoStreamBuilder.cpp

Show All 26 Lines	InfoStreamBuilder::InfoStreamBuilder(msf::MSFBuilder &Msf,
NamedStreamMap &NamedStreams)		NamedStreamMap &NamedStreams)
: Msf(Msf), Ver(PdbRaw_ImplVer::PdbImplVC70), Age(0),		: Msf(Msf), Ver(PdbRaw_ImplVer::PdbImplVC70), Age(0),
NamedStreams(NamedStreams) {		NamedStreams(NamedStreams) {
::memset(&Guid, 0, sizeof(Guid));		::memset(&Guid, 0, sizeof(Guid));
}		}

void InfoStreamBuilder::setVersion(PdbRaw_ImplVer V) { Ver = V; }		void InfoStreamBuilder::setVersion(PdbRaw_ImplVer V) { Ver = V; }

		void InfoStreamBuilder::addFeature(PdbRaw_FeatureSig Sig) {
		Features.push_back(Sig);
		}

		void InfoStreamBuilder::setHashPDBContentsToGUID(bool B) {
		HashPDBContentsToGUID = B;
		}

void InfoStreamBuilder::setAge(uint32_t A) { Age = A; }		void InfoStreamBuilder::setAge(uint32_t A) { Age = A; }

void InfoStreamBuilder::setSignature(uint32_t S) { Signature = S; }		void InfoStreamBuilder::setSignature(uint32_t S) { Signature = S; }

void InfoStreamBuilder::setGuid(GUID G) { Guid = G; }		void InfoStreamBuilder::setGuid(GUID G) { Guid = G; }

void InfoStreamBuilder::addFeature(PdbRaw_FeatureSig Sig) {
Features.push_back(Sig);
}

Error InfoStreamBuilder::finalizeMsfLayout() {		Error InfoStreamBuilder::finalizeMsfLayout() {
uint32_t Length = sizeof(InfoStreamHeader) +		uint32_t Length = sizeof(InfoStreamHeader) +
NamedStreams.calculateSerializedLength() +		NamedStreams.calculateSerializedLength() +
(Features.size() + 1) * sizeof(uint32_t);		(Features.size() + 1) * sizeof(uint32_t);
if (auto EC = Msf.setStreamSize(StreamPDB, Length))		if (auto EC = Msf.setStreamSize(StreamPDB, Length))
return EC;		return EC;
return Error::success();		return Error::success();
Show All 27 Lines

llvm/lib/DebugInfo/PDB/Native/PDBFileBuilder.cpp

Show All 19 Lines
#include "llvm/DebugInfo/PDB/Native/PDBStringTableBuilder.h"		#include "llvm/DebugInfo/PDB/Native/PDBStringTableBuilder.h"
#include "llvm/DebugInfo/PDB/Native/RawError.h"		#include "llvm/DebugInfo/PDB/Native/RawError.h"
#include "llvm/DebugInfo/PDB/Native/TpiStream.h"		#include "llvm/DebugInfo/PDB/Native/TpiStream.h"
#include "llvm/DebugInfo/PDB/Native/TpiStreamBuilder.h"		#include "llvm/DebugInfo/PDB/Native/TpiStreamBuilder.h"
#include "llvm/Support/BinaryStream.h"		#include "llvm/Support/BinaryStream.h"
#include "llvm/Support/BinaryStreamWriter.h"		#include "llvm/Support/BinaryStreamWriter.h"
#include "llvm/Support/JamCRC.h"		#include "llvm/Support/JamCRC.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
		#include "llvm/Support/xxhash.h"
		ruiuUnsubmitted Not Done Reply Inline Actions As long as you are using a non-crypto hash function, there is a risk of generating the same build id, and the probability is not negligible if you have a lot of executables due to the birthday problem. Is this okay? ruiu: As long as you are using a non-crypto hash function, there is a risk of generating the same…
		thakisAuthorUnsubmitted Not Done Reply Inline Actions The 8 byte hash still gives decent hash collision resistance for up to 232 different pdb files, and since pdbs are keyed by executable name on the symbol server that's per binary. Projects tend to have far fewer revisions than 4 billion. Does that make sense? thakis: The 8 byte hash still gives decent hash collision resistance for up to 232 different pdb…
		ruiuUnsubmitted Not Done Reply Inline Actions Maybe it is safe. But what could happen if two executables have the same hash? Since xxhash is not cryptographically-safe, you could easily generate two executables having the same ID. Is there any security risks or something caused by that possibility? If the probability is small and the result of hash collision is not that bad, xxhash is probably okay. ruiu: Maybe it is safe. But what could happen if two executables have the same hash? Since xxhash is…
		thakisAuthorUnsubmitted Not Done Reply Inline Actions The main use case for this guid is to an executable to its pdb file. The common workflow is that a build server builds an executable and its pdb, then uploads both to a symbol server (under the namespace of the exe, the exe in a subdir containing the exe's pe timestamp and size, and the pdb under the guid). If the executable crashes, it produces a minidump. From the minidump, crash infrastructure can obtain the full executable and the pdb. Since nothing guarantees that the pdb guid is a hash of the pdb data, I can't think of anything where being able to produce a pdb with a given uuid that is an xxhash buys you anything: Since nothing forces the guid to be a hash, you can just produce a pdb and set its guid field to whatever you want anyways. thakis: The main use case for this guid is to an executable to its pdb file. The common workflow is…

using namespace llvm;		using namespace llvm;
using namespace llvm::codeview;		using namespace llvm::codeview;
using namespace llvm::msf;		using namespace llvm::msf;
using namespace llvm::pdb;		using namespace llvm::pdb;
using namespace llvm::support;		using namespace llvm::support;

PDBFileBuilder::PDBFileBuilder(BumpPtrAllocator &Allocator)		PDBFileBuilder::PDBFileBuilder(BumpPtrAllocator &Allocator)
▲ Show 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	auto SourceStream = WritableMappedBlockStream::createIndexedStream(
Layout, MsfBuffer, SN, Allocator);		Layout, MsfBuffer, SN, Allocator);
BinaryStreamWriter SourceWriter(*SourceStream);		BinaryStreamWriter SourceWriter(*SourceStream);
assert(SourceWriter.bytesRemaining() == IS.Content->getBufferSize());		assert(SourceWriter.bytesRemaining() == IS.Content->getBufferSize());
cantFail(SourceWriter.writeBytes(		cantFail(SourceWriter.writeBytes(
arrayRefFromStringRef(IS.Content->getBuffer())));		arrayRefFromStringRef(IS.Content->getBuffer())));
}		}
}		}

Error PDBFileBuilder::commit(StringRef Filename) {		Error PDBFileBuilder::commit(StringRef Filename, codeview::GUID *Guid) {
assert(!Filename.empty());		assert(!Filename.empty());
if (auto EC = finalizeMsfLayout())		if (auto EC = finalizeMsfLayout())
return EC;		return EC;

MSFLayout Layout;		MSFLayout Layout;
auto ExpectedMsfBuffer = Msf->commit(Filename, Layout);		Expected<FileBufferByteStream> ExpectedMsfBuffer =
		Msf->commit(Filename, Layout);
if (!ExpectedMsfBuffer)		if (!ExpectedMsfBuffer)
return ExpectedMsfBuffer.takeError();		return ExpectedMsfBuffer.takeError();
FileBufferByteStream Buffer = std::move(*ExpectedMsfBuffer);		FileBufferByteStream Buffer = std::move(*ExpectedMsfBuffer);

auto ExpectedSN = getNamedStreamIndex("/names");		auto ExpectedSN = getNamedStreamIndex("/names");
if (!ExpectedSN)		if (!ExpectedSN)
return ExpectedSN.takeError();		return ExpectedSN.takeError();

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	uint64_t InfoStreamFileOffset =
blockToOffset(InfoStreamBlocks.front(), Layout.SB->BlockSize);		blockToOffset(InfoStreamBlocks.front(), Layout.SB->BlockSize);
InfoStreamHeader H = reinterpret_cast<InfoStreamHeader >(		InfoStreamHeader H = reinterpret_cast<InfoStreamHeader >(
Buffer.getBufferStart() + InfoStreamFileOffset);		Buffer.getBufferStart() + InfoStreamFileOffset);

commitInjectedSources(Buffer, Layout);		commitInjectedSources(Buffer, Layout);

// Set the build id at the very end, after every other byte of the PDB		// Set the build id at the very end, after every other byte of the PDB
// has been written.		// has been written.
// FIXME: Use a hash of the PDB rather than time(nullptr) for the signature.		if (Info->hashPDBContentsToGUID()) {
		// Compute a hash of all sections of the output file.
		uint8_t *Start = Buffer.getBufferStart();
		uint8_t *End = Buffer.getBufferEnd();
		ArrayRef<uint8_t> D{Start, End};
		uint64_t Digest = xxHash64(D);

		H->Age = 1;

		memcpy(H->Guid.Guid, &Digest, 8);
		// xxhash only gives us 8 bytes, so put some fixed data in the other half.
		memcpy(H->Guid.Guid + 8, "LLD PDB.", 8);

		// Return GUID to caller.
		memcpy(Guid, H->Guid.Guid, 16);
		} else {
H->Age = Info->getAge();		H->Age = Info->getAge();
H->Guid = Info->getGuid();		H->Guid = Info->getGuid();
		}

		// FIXME: Use a hash of the PDB rather than time(nullptr) for the signature.
		// XXX: change this too
Optional<uint32_t> Sig = Info->getSignature();		Optional<uint32_t> Sig = Info->getSignature();
H->Signature = Sig.hasValue() ? *Sig : time(nullptr);		H->Signature = Sig.hasValue() ? *Sig : time(nullptr);

return Buffer.commit();		return Buffer.commit();
}		}

llvm/tools/llvm-pdbutil/llvm-pdbutil.cpp

Show First 20 Lines • Show All 794 Lines • ▼ Show 20 Lines	static void yamlToPdb(StringRef Path) {
IpiBuilder.setVersionHeader(Ipi.Version);		IpiBuilder.setVersionHeader(Ipi.Version);
for (const auto &R : Ipi.Records) {		for (const auto &R : Ipi.Records) {
CVType Type = R.toCodeViewRecord(TS);		CVType Type = R.toCodeViewRecord(TS);
IpiBuilder.addTypeRecord(Type.RecordData, None);		IpiBuilder.addTypeRecord(Type.RecordData, None);
}		}

Builder.getStringTableBuilder().setStrings(*Strings.strings());		Builder.getStringTableBuilder().setStrings(*Strings.strings());

ExitOnErr(Builder.commit(opts::yaml2pdb::YamlPdbOutputFile));		codeview::GUID IgnoredOutGuid;
		ExitOnErr(Builder.commit(opts::yaml2pdb::YamlPdbOutputFile, &IgnoredOutGuid));
}		}

static PDBFile &loadPDB(StringRef Path, std::unique_ptr<IPDBSession> &Session) {		static PDBFile &loadPDB(StringRef Path, std::unique_ptr<IPDBSession> &Session) {
ExitOnErr(loadDataForPDB(PDB_ReaderType::Native, Path, Session));		ExitOnErr(loadDataForPDB(PDB_ReaderType::Native, Path, Session));

NativeSession NS = static_cast<NativeSession >(Session.get());		NativeSession NS = static_cast<NativeSession >(Session.get());
return NS->getPDBFile();		return NS->getPDBFile();
}		}
▲ Show 20 Lines • Show All 443 Lines • ▼ Show 20 Lines	static void mergePdbs() {
});		});
Builder.getInfoBuilder().addFeature(PdbRaw_FeatureSig::VC140);		Builder.getInfoBuilder().addFeature(PdbRaw_FeatureSig::VC140);

SmallString<64> OutFile(opts::merge::PdbOutputFile);		SmallString<64> OutFile(opts::merge::PdbOutputFile);
if (OutFile.empty()) {		if (OutFile.empty()) {
OutFile = opts::merge::InputFilenames[0];		OutFile = opts::merge::InputFilenames[0];
llvm::sys::path::replace_extension(OutFile, "merged.pdb");		llvm::sys::path::replace_extension(OutFile, "merged.pdb");
}		}
ExitOnErr(Builder.commit(OutFile));
		codeview::GUID IgnoredOutGuid;
		ExitOnErr(Builder.commit(OutFile, &IgnoredOutGuid));
}		}

static void explain() {		static void explain() {
std::unique_ptr<IPDBSession> Session;		std::unique_ptr<IPDBSession> Session;
InputFile IF =		InputFile IF =
ExitOnErr(InputFile::open(opts::explain::InputFilename.front(), true));		ExitOnErr(InputFile::open(opts::explain::InputFilename.front(), true));

for (uint64_t Off : opts::explain::Offsets) {		for (uint64_t Off : opts::explain::Offsets) {
▲ Show 20 Lines • Show All 221 Lines • Show Last 20 Lines