This is an archive of the discontinued LLVM Phabricator instance.

Differential D49493

[DebugInfo] Reduce debug_str_offsets section size
ClosedPublic

Authored by labath on Jul 18 2018, 9:27 AM.

Details

Reviewers

probinson
dblaikie
JDevlieghere

Commits

rG2f0881160c21: [DebugInfo] Reduce debug_str_offsets section size
rL339122: [DebugInfo] Reduce debug_str_offsets section size

Summary

The accelerator tables use the debug_str section to store their strings.
However, they do not support the indirect method of access that is
available for the debug_info section (DW_FORM_strx et al.).

Currently our code is assuming that all strings can/will be referenced
indirectly, and puts all of them into the debug_str_offsets section.
This is generally true for regular (unsplit) dwarf, but in the DWO case,
most of the strings in the debug_str section will only be used from the
accelerator tables. Therefore the contents of the debug_str_offsets
section will be largely unused and bloating the main executable.

This patch rectifies this by teaching the DwarfStringPool to
differentiate between strings accessed directly and indirectly. When a
user inserts a string into the pool it has to declare whether that
string will be referenced directly or not. If at least one user requests
indirect access, that string will be assigned an index ID and put into
the debug_str_offsets table. Otherwise, the offset table is skipped.
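
As a rough illustration of the scheme described above, here is a minimal sketch of a pool with separate direct and indexed lookups. The names `Entry` and `SimpleStringPool` are hypothetical stand-ins built on the standard library, not LLVM's `DwarfStringPool`; the sketch only assumes that every string gets an offset on first insertion and that an index is assigned lazily, the first time someone asks for indexed access.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Hypothetical, simplified entry: every string has an offset, but only
// strings requested through getIndexedEntry() receive an index.
struct Entry {
  static constexpr unsigned NotIndexed = ~0u;
  unsigned Offset = 0;         // byte offset inside the string section
  unsigned Index = NotIndexed; // slot in the offsets table, if any
  bool isIndexed() const { return Index != NotIndexed; }
};

class SimpleStringPool {
  std::unordered_map<std::string, Entry> Pool;
  unsigned NumBytes = 0;
  unsigned NumIndexedStrings = 0;

  // Insert the string if it is new and assign its offset; never assigns an index.
  Entry &getEntryImpl(const std::string &Str) {
    auto Result = Pool.emplace(Str, Entry());
    Entry &E = Result.first->second;
    if (Result.second) {
      E.Offset = NumBytes;
      NumBytes += static_cast<unsigned>(Str.size()) + 1; // include the NUL
    }
    return E;
  }

public:
  // Direct access: the caller will refer to the string by offset only.
  const Entry &getEntry(const std::string &Str) { return getEntryImpl(Str); }

  // Indirect access: the string also gets a slot in the offsets table.
  const Entry &getIndexedEntry(const std::string &Str) {
    Entry &E = getEntryImpl(Str);
    if (!E.isIndexed())
      E.Index = NumIndexedStrings++;
    return E;
  }

  unsigned getNumIndexedStrings() const { return NumIndexedStrings; }
  unsigned size() const { return static_cast<unsigned>(Pool.size()); }
};
```

With this shape, a string that is only ever requested through `getEntry` occupies space in the string section but contributes nothing to the offsets table.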

This approach reduces the overall binary size (when compiled with
-gdwarf-5 -gsplit-dwarf) in my tests by about 2% (debug_str_offsets is
shrunk by 99%).

Diff Detail

Repository: rL LLVM

Event Timeline

labath created this revision. Jul 18 2018, 9:27 AM

Herald added subscribers: mgrang, aprantl. Jul 18 2018, 9:27 AM

Harbormaster completed remote builds in B20474: Diff 156095. Jul 18 2018, 9:27 AM

mgrang added inline comments. Jul 18 2018, 10:37 AM

lib/CodeGen/AsmPrinter/DwarfStringPool.cpp
59 ↗	(On Diff #156095)	Please use llvm::sort instead of std::sort. See https://llvm.org/docs/CodingStandards.html#beware-of-non-deterministic-sorting-order-of-equal-elements.

A few typos.

I would like a test that did the following: add string "A" not indexed; add string "B" indexed; add string "A" indexed.
Then show that the string section has "A" followed by "B", and the offsets table is correct (entry 0 points to "B", entry 1 points to "A").
Is that feasible?
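
The requested scenario can be worked through by hand with a small model. The sketch below is not LLVM code: `PoolEntry`, `Layout` and `layOut` are hypothetical names, and it assumes that offsets are assigned in first-insertion order and indexes in the order indexed access is first requested.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

struct PoolEntry {
  unsigned Offset;
  int Index; // -1 means "not indexed"
};

struct Layout {
  std::vector<std::string> Strings;   // string section, in offset order
  std::vector<unsigned> OffsetsTable; // slot i holds the offset of string #i
};

// Requests are (string, wantsIndex) pairs in insertion order.
Layout layOut(const std::vector<std::pair<std::string, bool>> &Requests) {
  std::map<std::string, PoolEntry> Pool;
  unsigned NumBytes = 0;
  int NumIndexed = 0;
  for (const auto &R : Requests) {
    auto Ins = Pool.emplace(R.first, PoolEntry{NumBytes, -1});
    if (Ins.second)
      NumBytes += static_cast<unsigned>(R.first.size()) + 1;
    PoolEntry &E = Ins.first->second;
    if (R.second && E.Index < 0)
      E.Index = NumIndexed++;
  }

  Layout L;
  // The string section is ordered by offset.
  std::vector<std::pair<unsigned, std::string>> ByOffset;
  for (const auto &KV : Pool)
    ByOffset.push_back({KV.second.Offset, KV.first});
  std::sort(ByOffset.begin(), ByOffset.end());
  for (const auto &P : ByOffset)
    L.Strings.push_back(P.second);

  // The offsets table is ordered by index and skips unindexed strings.
  L.OffsetsTable.resize(static_cast<std::size_t>(NumIndexed));
  for (const auto &KV : Pool)
    if (KV.second.Index >= 0)
      L.OffsetsTable[static_cast<std::size_t>(KV.second.Index)] = KV.second.Offset;
  return L;
}
```

For the sequence "A" not indexed, "B" indexed, "A" indexed, this model puts "A" at offset 0 and "B" at offset 2, while the offsets table holds 2 (for "B") in entry 0 and 0 (for "A") in entry 1.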

lib/CodeGen/AsmPrinter/DwarfStringPool.cpp
59 ↗	(On Diff #156095)	`llvm::sort`
lib/CodeGen/AsmPrinter/DwarfStringPool.h
52 ↗	(On Diff #156095)	s/it's/its/
test/DebugInfo/X86/string-offsets-table.ll
97 ↗	(On Diff #156095)	... offsets of strings ...

JDevlieghere added inline comments. Jul 19 2018, 2:39 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2407 ↗	(On Diff #156095)	Would it make sense to make false the default?

In D49493#1166860, @probinson wrote:

I would like a test that did the following: add string "A" not indexed; add string "B" indexed; add string "A" indexed.
Then show that the string section has "A" followed by "B", and the offsets table is correct (entry 0 points to "B", entry 1 points to "A").
Is that feasible?

It's feasible, but just barely. In a non-dwo build the accel table (the only source of non-indexed strings) will not contain any new strings, as everything will be referenced (and indexed) from .debug_info. OTOH, in a dwo build the .debug_str section contains almost exclusively (non-indexed) accelerator table entries. AFAICT, the only exceptions are the DW_AT_GNU_dwo_name and DW_AT_comp_dir attributes of the skeleton units. I was able to tickle this by having three compile units in a single .ll file and having the compilation directories of some units match the variable names of others, but I fear the resulting test might be a bit brittle.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2407 ↗	(On Diff #156095)	I wouldn't recommend it as I think it increases the chance of getting things wrong. (e.g. when writing this patch I definitely wanted to make the arg explicit so I could audit each usage for the correct value of the argument). If you're worried about verbosity, one way I've considered addressing this would be to have an `AccelTable::addName` overload which accepts a StringPool and calls getEntry itself. This would have the extra advantage that the decision of whether to create an Indexed entry or not is moved closer to the code which does the actual emission (the part that knows whether it needs the index or not).

Fix typos and add the extra test.

Harbormaster completed remote builds in B20505: Diff 156228. Jul 19 2018, 3:52 AM

JDevlieghere added inline comments. Jul 19 2018, 4:13 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2407 ↗	(On Diff #156095)	That's what I figured, but I agree a strong interface is worth being a little more verbose. The overload sounds like a good idea.

labath mentioned this in D49542: DwarfDebug: Reduce duplication in addAccel*** methods. Jul 19 2018, 5:49 AM

labath added inline comments. Jul 19 2018, 5:53 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2407 ↗	(On Diff #156095)	I tried that out, but I wasn't too happy with the result. I've had to move `DwarfStringPool.h` from `lib/` to the `include/` folder and add an extra `AsmPrinter` (in addition to the `StringPool`) argument to the `addName` method. Instead I propose to do something like https://reviews.llvm.org/D49542, which is to keep the StringPool code in DwarfDebug, but collapse all of these `getEntry` calls into one.

I agree the ordering test looks fragile. The only other option that comes to mind is trying to exercise the APIs directly from a unittest, but then you need enough scaffolding around it to capture at least the assembler output and verify it. Probably best to leave the test as it is.

LGTM but I'll let @JDevlieghere have the last word. It looks like he still has one open question in DwarfDebug.cpp.

I'm happy with this, LGTM!

This revision is now accepted and ready to land. Jul 19 2018, 9:56 AM

Anyone thought about whether it'd be worth making the indexed vs. unindexed
case work implicitly? If we were willing to add another pointer to
DwarfStringPoolEntryRef (doubling its size, admittedly) - back to the
string pool itself - then we could index any string where the
DwarfStringPoolEntryRef was asked for an index, and otherwise skip that?
Rather than having to declare up-front whether an index was required.

(if we wanted to get super-fancy, maybe we could emit indexes as assembly
expressions ((str_label/str_begin_label)/word_size) then you wouldn't have
to increase the size of DwarfStringPoolEntryRef, because the absolute index
wouldn't have to be known ahead of time - it'd be computed by the assembler
instead - which is just moving the work around, of course)

Couple of other possible ideas:

splitting the indexed and unindexed functions into separate, named functions for clarity (alternatively - 'named boolean' (so they don't have to be commented) by adding an enum class for legibility).

Also, it's possible that there could be assert/runtime checking to ensure that a caller who requests an unindexed string never asks for its index: could use a PointerIntPair & squirrel a boolean in the low bit to say "this DwarfStringPoolEntryRef can not provide an index"? (That way there's no chance of weird race conditions that would hide bugs: ask for an unindexed string, then later ask for that same string indexed, and the first user can request the index without a failure, until later on the second user goes away/changes/etc.)

dblaikie added inline comments. Jul 19 2018, 1:26 PM

test/DebugInfo/X86/string-offsets-table-order.ll
28–31 ↗	(On Diff #156228)	Rather than hardcoding these particular byte offset prefixes (honestly, it's probably not necessary for the dumper to dump these except maybe in the most verbose modes), use CHECK-NEXT to ensure these are the only 4 strings, and CHECK something at the end to ensure that's the end of the list. I realize the "contribution size" validates that, but it's probably easier not to check for that specific value: the test is easier to update with new strings if it only checks for the specific elements in a list, without checking offsets, sizes, etc.

labath mentioned this in rL337562: DwarfDebug: Reduce duplication in addAccel*** methods. Jul 20 2018, 8:29 AM

Rebase on top of D49542

Make the API more explicit. Of the options proposed by David, I chose to make the string pool api have two methods (getEntry/getIndexedEntry), and also I've stashed a bit into DwarfStringPoolEntryRef to store the original mode in which the reference was obtained. Now it will assert if someone obtains a non-indexed reference, but then later asks for its index.
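
The "stashed bit" can be pictured with a hand-rolled tagged pointer. This is only a sketch of the general technique: the patch itself uses `llvm::PointerIntPair`, whereas `PoolEntry` and `EntryRef` below are hypothetical names, and the code assumes the pointee is at least 2-byte aligned so that the low pointer bit is free.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical pool entry; alignment of 4 leaves the low pointer bits free.
struct PoolEntry {
  unsigned Offset;
  unsigned Index;
};

class EntryRef {
  std::uintptr_t Bits = 0; // pointer to the entry, with the mode in bit 0

public:
  EntryRef() = default;
  EntryRef(const PoolEntry &E, bool Indexed)
      : Bits(reinterpret_cast<std::uintptr_t>(&E) | (Indexed ? 1u : 0u)) {
    static_assert(alignof(PoolEntry) >= 2, "need a spare low bit");
  }

  const PoolEntry *get() const {
    return reinterpret_cast<const PoolEntry *>(Bits & ~std::uintptr_t(1));
  }

  // True only if this particular reference was obtained in indexed mode.
  bool isIndexed() const { return (Bits & 1) != 0; }

  unsigned getOffset() const { return get()->Offset; }

  unsigned getIndex() const {
    // Fails even if some other user indexed the same string later on.
    assert(isIndexed() && "reference was obtained without an index");
    return get()->Index;
  }
};
```

Because the mode lives in the reference rather than in the shared entry, two references to the same string can disagree about whether an index may be requested, which is what makes the misuse detectable.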

Emitting the indexes does not seem feasible, as we currently have code which assumes they are known statically (so it can compute the smallest DW_FORM_strxN in which they fit).
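
To see why the index has to be known statically, consider how the form is picked. The sketch below mirrors the size thresholds visible in the DwarfUnit.cpp hunk of this revision, but `StrxForm` and `smallestStrxForm` are hypothetical names, not LLVM's API.

```cpp
#include <cassert>
#include <cstdint>

// Stand-ins for DW_FORM_strx1..DW_FORM_strx4, which encode the string index
// in 1, 2, 3 and 4 bytes respectively.
enum class StrxForm { Strx1, Strx2, Strx3, Strx4 };

// Pick the narrowest form whose operand can hold the given index.
StrxForm smallestStrxForm(std::uint32_t Index) {
  if (Index > 0xffffff)
    return StrxForm::Strx4;
  if (Index > 0xffff)
    return StrxForm::Strx3;
  if (Index > 0xff)
    return StrxForm::Strx2;
  return StrxForm::Strx1;
}
```

If the index were an assembler expression resolved later, this choice could not be made at DIE construction time.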

Making the assignment of indexes automatic would be possible, but it seemed like a pessimization that is not worth it. We currently call get(Indexed)Entry in three places and in each of them it is quite obvious how the resulting string will be used.

Harbormaster completed remote builds in B20550: Diff 156511. Jul 20 2018, 9:12 AM

labath added inline comments. Jul 20 2018, 9:12 AM

test/DebugInfo/X86/string-offsets-table-order.ll
28–31 ↗	(On Diff #156228)	I've updated this a bit, though I'm not sure I did exactly what you wanted. PTAL.

labath mentioned this in D49670: dwarfgen: Add support for generating the debug_str_offsets section. Jul 23 2018, 7:53 AM

rebase the patch on top of D49670
add a unittest-style test of the scenario Paul wanted (so far I've kept both tests, let me know which one looks better)
fix a bug in dsymutil which was exposed when turning assertions on. The issue was that dsymutil is using the same string pool (it seems to me) both for emitting strings into the debug info, and as a general string pool for interning strings. It tells the difference by marking the symbol's index as -1, which is the same value I used for non-indexed strings. This caused an assertion to fire when we were sorting DwarfStringPoolEntryRef for emission. Fortunately, the fix is simple: instead of sorting the entries which will never be emitted, I just never put them in the emission list in the first place (NonRelocatableStringpool::getEntries). To make it extra explicit that the list returned by getEntries does not contain all strings in the pool, I rename the function to getEntriesForEmission.
fix a bug where we would needlessly emit a .debug_str_offsets header even though the string pool contained zero indexed strings.
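
The dsymutil fix amounts to filtering before sorting. The following is a loose sketch of that idea only: `InternedString`, `NotEmitted` and this free-standing `getEntriesForEmission` are hypothetical and do not reproduce dsymutil's real types, and sorting by offset is an assumption made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Sentinel index marking strings that are interned but never emitted.
constexpr unsigned NotEmitted = ~0u;

// Hypothetical stand-in for a pooled string.
struct InternedString {
  std::string Text;
  unsigned Offset;
  unsigned Index;
};

// Collect only the strings destined for the output, then order them.
// Interned-only strings never reach the sort.
std::vector<const InternedString *>
getEntriesForEmission(const std::vector<InternedString> &Pool) {
  std::vector<const InternedString *> Result;
  for (const InternedString &S : Pool)
    if (S.Index != NotEmitted)
      Result.push_back(&S);
  std::sort(Result.begin(), Result.end(),
            [](const InternedString *A, const InternedString *B) {
              return A->Offset < B->Offset;
            });
  return Result;
}
```

Filtering first means the comparator never has to look at entries that carry the sentinel index.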

Harbormaster completed remote builds in B20606: Diff 156795. Jul 23 2018, 8:23 AM

labath mentioned this in rL337910: dwarfgen: Add support for generating the debug_str_offsets section. Jul 25 2018, 4:56 AM

labath mentioned this in rL337933: dwarfgen: Add support for generating the debug_str_offsets section, take 2. Jul 25 2018, 8:33 AM

labath mentioned this in rL338031: dwarfgen: Add support for generating the debug_str_offsets section, take 3. Jul 26 2018, 7:36 AM

I nearly forgot I still have this patch pending. I think I addressed all issues raised, and it is in the accepted state, so I am going to commit it today. There are two minor issues that I haven't received final feedback on, but these can be easily tweaked post-commit too:

testing: there are now two types of tests (unit test and lit) for the scenario of mixing indexed and unindexed entries. One of them could be removed if needed.
is the additional safety embedded into DwarfStringPoolEntryRef sufficient?

Closed by commit rL339122: [DebugInfo] Reduce debug_str_offsets section size (authored by labath). Aug 7 2018, 2:55 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

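Path	Size
llvm/trunk/include/llvm/CodeGen/DwarfStringPoolEntry.h	47 lines
llvm/trunk/lib/CodeGen/AsmPrinter/DwarfDebug.cpp	3 lines
llvm/trunk/lib/CodeGen/AsmPrinter/DwarfStringPool.h	10 lines
llvm/trunk/lib/CodeGen/AsmPrinter/DwarfStringPool.cpp	50 lines
llvm/trunk/lib/CodeGen/AsmPrinter/DwarfUnit.cpp	7 lines
llvm/trunk/test/DebugInfo/X86/string-offsets-table-order.ll	79 lines
llvm/trunk/test/DebugInfo/X86/string-offsets-table.ll	26 lines
llvm/trunk/tools/dsymutil/DwarfStreamer.cpp	4 lines
llvm/trunk/tools/dsymutil/MachOUtils.cpp	5 lines
llvm/trunk/tools/dsymutil/NonRelocatableStringpool.h	4 lines
llvm/trunk/tools/dsymutil/NonRelocatableStringpool.cpp	11 lines
llvm/trunk/unittests/CodeGen/DIEHashTest.cpp	4 lines
llvm/trunk/unittests/DebugInfo/DWARF/DWARFDebugInfoTest.cpp	93 lines
llvm/trunk/unittests/DebugInfo/DWARF/DwarfGenerator.cpp	11 lines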

Diff 159473

llvm/trunk/include/llvm/CodeGen/DwarfStringPoolEntry.h

	//===- llvm/CodeGen/DwarfStringPoolEntry.h - String pool entry --- C++ --===//			//===- llvm/CodeGen/DwarfStringPoolEntry.h - String pool entry --- C++ --===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CODEGEN_DWARFSTRINGPOOLENTRY_H			#ifndef LLVM_CODEGEN_DWARFSTRINGPOOLENTRY_H
	#define LLVM_CODEGEN_DWARFSTRINGPOOLENTRY_H			#define LLVM_CODEGEN_DWARFSTRINGPOOLENTRY_H

				#include "llvm/ADT/PointerIntPair.h"
	#include "llvm/ADT/StringMap.h"			#include "llvm/ADT/StringMap.h"

	namespace llvm {			namespace llvm {

	class MCSymbol;			class MCSymbol;

	/// Data for a string pool entry.			/// Data for a string pool entry.
	struct DwarfStringPoolEntry {			struct DwarfStringPoolEntry {
				static constexpr unsigned NotIndexed = -1;

	MCSymbol *Symbol;			MCSymbol *Symbol;
	unsigned Offset;			unsigned Offset;
	unsigned Index;			unsigned Index;

				bool isIndexed() const { return Index != NotIndexed; }
	};			};

	/// String pool entry reference.			/// String pool entry reference.
	struct DwarfStringPoolEntryRef {			class DwarfStringPoolEntryRef {
	const StringMapEntry<DwarfStringPoolEntry> *I = nullptr;			PointerIntPair<const StringMapEntry<DwarfStringPoolEntry> *, 1, bool>
				MapEntryAndIndexed;

				const StringMapEntry<DwarfStringPoolEntry> *getMapEntry() const {
				return MapEntryAndIndexed.getPointer();
				}

	public:			public:
	DwarfStringPoolEntryRef() = default;			DwarfStringPoolEntryRef() = default;
	explicit DwarfStringPoolEntryRef(			DwarfStringPoolEntryRef(const StringMapEntry<DwarfStringPoolEntry> &Entry,
	const StringMapEntry<DwarfStringPoolEntry> &I)			bool Indexed)
	: I(&I) {}			: MapEntryAndIndexed(&Entry, Indexed) {}

	explicit operator bool() const { return I; }			explicit operator bool() const { return getMapEntry(); }
	MCSymbol *getSymbol() const {			MCSymbol *getSymbol() const {
	assert(I->second.Symbol && "No symbol available!");			assert(getMapEntry()->second.Symbol && "No symbol available!");
	return I->second.Symbol;			return getMapEntry()->second.Symbol;
	}			}
	unsigned getOffset() const { return I->second.Offset; }			unsigned getOffset() const { return getMapEntry()->second.Offset; }
	unsigned getIndex() const { return I->second.Index; }			bool isIndexed() const { return MapEntryAndIndexed.getInt(); }
	StringRef getString() const { return I->first(); }			unsigned getIndex() const {
				assert(isIndexed());
				assert(getMapEntry()->getValue().isIndexed());
				return getMapEntry()->second.Index;
				}
				StringRef getString() const { return getMapEntry()->first(); }
	/// Return the entire string pool entry for convenience.			/// Return the entire string pool entry for convenience.
	DwarfStringPoolEntry getEntry() const { return I->getValue(); }			DwarfStringPoolEntry getEntry() const { return getMapEntry()->getValue(); }

	bool operator==(const DwarfStringPoolEntryRef &X) const { return I == X.I; }			bool operator==(const DwarfStringPoolEntryRef &X) const {
	bool operator!=(const DwarfStringPoolEntryRef &X) const { return I != X.I; }			return getMapEntry() == X.getMapEntry();
				}
				bool operator!=(const DwarfStringPoolEntryRef &X) const {
				return getMapEntry() != X.getMapEntry();
				}
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif			#endif

llvm/trunk/lib/CodeGen/AsmPrinter/DwarfDebug.cpp

	// accelerator tables are disabled, this function does nothing.			// accelerator tables are disabled, this function does nothing.
	template <typename DataT>			template <typename DataT>
	void DwarfDebug::addAccelNameImpl(AccelTable<DataT> &AppleAccel, StringRef Name,			void DwarfDebug::addAccelNameImpl(AccelTable<DataT> &AppleAccel, StringRef Name,
	const DIE &Die) {			const DIE &Die) {
	if (getAccelTableKind() == AccelTableKind::None)			if (getAccelTableKind() == AccelTableKind::None)
	return;			return;

	DwarfFile &Holder = useSplitDwarf() ? SkeletonHolder : InfoHolder;			DwarfFile &Holder = useSplitDwarf() ? SkeletonHolder : InfoHolder;
	DwarfStringPoolEntryRef Ref =			DwarfStringPoolEntryRef Ref = Holder.getStringPool().getEntry(*Asm, Name);
	Holder.getStringPool().getEntry(*Asm, Name);

	switch (getAccelTableKind()) {			switch (getAccelTableKind()) {
	case AccelTableKind::Apple:			case AccelTableKind::Apple:
	AppleAccel.addName(Ref, Die);			AppleAccel.addName(Ref, Die);
	break;			break;
	case AccelTableKind::Dwarf:			case AccelTableKind::Dwarf:
	AccelDebugNames.addName(Ref, Die);			AccelDebugNames.addName(Ref, Die);
	break;			break;

llvm/trunk/lib/CodeGen/AsmPrinter/DwarfStringPool.h

	// A String->Symbol mapping of strings used by indirect			// A String->Symbol mapping of strings used by indirect
	// references.			// references.
	class DwarfStringPool {			class DwarfStringPool {
	using EntryTy = DwarfStringPoolEntry;			using EntryTy = DwarfStringPoolEntry;

	StringMap<EntryTy, BumpPtrAllocator &> Pool;			StringMap<EntryTy, BumpPtrAllocator &> Pool;
	StringRef Prefix;			StringRef Prefix;
	unsigned NumBytes = 0;			unsigned NumBytes = 0;
				unsigned NumIndexedStrings = 0;
	bool ShouldCreateSymbols;			bool ShouldCreateSymbols;

				StringMapEntry<EntryTy> &getEntryImpl(AsmPrinter &Asm, StringRef Str);

	public:			public:
	using EntryRef = DwarfStringPoolEntryRef;			using EntryRef = DwarfStringPoolEntryRef;

	DwarfStringPool(BumpPtrAllocator &A, AsmPrinter &Asm, StringRef Prefix);			DwarfStringPool(BumpPtrAllocator &A, AsmPrinter &Asm, StringRef Prefix);

	void emitStringOffsetsTableHeader(AsmPrinter &Asm, MCSection *OffsetSection,			void emitStringOffsetsTableHeader(AsmPrinter &Asm, MCSection *OffsetSection,
	MCSymbol *StartSym);			MCSymbol *StartSym);

	void emit(AsmPrinter &Asm, MCSection *StrSection,			void emit(AsmPrinter &Asm, MCSection *StrSection,
	MCSection *OffsetSection = nullptr,			MCSection *OffsetSection = nullptr,
	bool UseRelativeOffsets = false);			bool UseRelativeOffsets = false);

	bool empty() const { return Pool.empty(); }			bool empty() const { return Pool.empty(); }

	unsigned size() const { return Pool.size(); }			unsigned size() const { return Pool.size(); }

				unsigned getNumIndexedStrings() const { return NumIndexedStrings; }

	/// Get a reference to an entry in the string pool.			/// Get a reference to an entry in the string pool.
	EntryRef getEntry(AsmPrinter &Asm, StringRef Str);			EntryRef getEntry(AsmPrinter &Asm, StringRef Str);

				/// Same as getEntry, except that you can use EntryRef::getIndex to obtain a
				/// unique ID of this entry (e.g., for use in indexed forms like
				/// DW_FORM_strx).
				EntryRef getIndexedEntry(AsmPrinter &Asm, StringRef Str);
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFSTRINGPOOL_H			#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFSTRINGPOOL_H

llvm/trunk/lib/CodeGen/AsmPrinter/DwarfStringPool.cpp


	using namespace llvm;			using namespace llvm;

	DwarfStringPool::DwarfStringPool(BumpPtrAllocator &A, AsmPrinter &Asm,			DwarfStringPool::DwarfStringPool(BumpPtrAllocator &A, AsmPrinter &Asm,
	StringRef Prefix)			StringRef Prefix)
	: Pool(A), Prefix(Prefix),			: Pool(A), Prefix(Prefix),
	ShouldCreateSymbols(Asm.MAI->doesDwarfUseRelocationsAcrossSections()) {}			ShouldCreateSymbols(Asm.MAI->doesDwarfUseRelocationsAcrossSections()) {}

	DwarfStringPool::EntryRef DwarfStringPool::getEntry(AsmPrinter &Asm,			StringMapEntry<DwarfStringPool::EntryTy> &
	StringRef Str) {			DwarfStringPool::getEntryImpl(AsmPrinter &Asm, StringRef Str) {
	auto I = Pool.insert(std::make_pair(Str, EntryTy()));			auto I = Pool.insert(std::make_pair(Str, EntryTy()));
	if (I.second) {
	auto &Entry = I.first->second;			auto &Entry = I.first->second;
	Entry.Index = Pool.size() - 1;			if (I.second) {
				Entry.Index = EntryTy::NotIndexed;
	Entry.Offset = NumBytes;			Entry.Offset = NumBytes;
	Entry.Symbol = ShouldCreateSymbols ? Asm.createTempSymbol(Prefix) : nullptr;			Entry.Symbol = ShouldCreateSymbols ? Asm.createTempSymbol(Prefix) : nullptr;

	NumBytes += Str.size() + 1;			NumBytes += Str.size() + 1;
	assert(NumBytes > Entry.Offset && "Unexpected overflow");			assert(NumBytes > Entry.Offset && "Unexpected overflow");
	}			}
	return EntryRef(*I.first);			return *I.first;
				}

				DwarfStringPool::EntryRef DwarfStringPool::getEntry(AsmPrinter &Asm,
				StringRef Str) {
				auto &MapEntry = getEntryImpl(Asm, Str);
				return EntryRef(MapEntry, false);
				}

				DwarfStringPool::EntryRef DwarfStringPool::getIndexedEntry(AsmPrinter &Asm,
				StringRef Str) {
				auto &MapEntry = getEntryImpl(Asm, Str);
				if (!MapEntry.getValue().isIndexed())
				MapEntry.getValue().Index = NumIndexedStrings++;
				return EntryRef(MapEntry, true);
	}			}

	void DwarfStringPool::emitStringOffsetsTableHeader(AsmPrinter &Asm,			void DwarfStringPool::emitStringOffsetsTableHeader(AsmPrinter &Asm,
	MCSection *Section,			MCSection *Section,
	MCSymbol *StartSym) {			MCSymbol *StartSym) {
	if (empty())			if (getNumIndexedStrings() == 0)
	return;			return;
	Asm.OutStreamer->SwitchSection(Section);			Asm.OutStreamer->SwitchSection(Section);
	unsigned EntrySize = 4;			unsigned EntrySize = 4;
	// FIXME: DWARF64			// FIXME: DWARF64
	// We are emitting the header for a contribution to the string offsets			// We are emitting the header for a contribution to the string offsets
	// table. The header consists of an entry with the contribution's			// table. The header consists of an entry with the contribution's
	// size (not including the size of the length field), the DWARF version and			// size (not including the size of the length field), the DWARF version and
	// 2 bytes of padding.			// 2 bytes of padding.
	Asm.emitInt32(size() * EntrySize + 4);			Asm.emitInt32(getNumIndexedStrings() * EntrySize + 4);
	Asm.emitInt16(Asm.getDwarfVersion());			Asm.emitInt16(Asm.getDwarfVersion());
	Asm.emitInt16(0);			Asm.emitInt16(0);
	// Define the symbol that marks the start of the contribution. It is			// Define the symbol that marks the start of the contribution. It is
	// referenced by most unit headers via DW_AT_str_offsets_base.			// referenced by most unit headers via DW_AT_str_offsets_base.
	// Split units do not use the attribute.			// Split units do not use the attribute.
	if (StartSym)			if (StartSym)
	Asm.OutStreamer->EmitLabel(StartSym);			Asm.OutStreamer->EmitLabel(StartSym);
	}			}

	void DwarfStringPool::emit(AsmPrinter &Asm, MCSection *StrSection,			void DwarfStringPool::emit(AsmPrinter &Asm, MCSection *StrSection,
	MCSection *OffsetSection, bool UseRelativeOffsets) {			MCSection *OffsetSection, bool UseRelativeOffsets) {
	if (Pool.empty())			if (Pool.empty())
	return;			return;

	// Start the dwarf str section.			// Start the dwarf str section.
	Asm.OutStreamer->SwitchSection(StrSection);			Asm.OutStreamer->SwitchSection(StrSection);

	// Get all of the string pool entries and put them in an array by their ID so			// Get all of the string pool entries and sort them by their offset.
	// we can sort them.			SmallVector<const StringMapEntry<EntryTy> *, 64> Entries;
	SmallVector<const StringMapEntry<EntryTy> *, 64> Entries(Pool.size());			Entries.reserve(Pool.size());

	for (const auto &E : Pool)			for (const auto &E : Pool)
	Entries[E.getValue().Index] = &E;			Entries.push_back(&E);

				llvm::sort(
				Entries.begin(), Entries.end(),
				[](const StringMapEntry<EntryTy> *A, const StringMapEntry<EntryTy> *B) {
				return A->getValue().Offset < B->getValue().Offset;
				});

	for (const auto &Entry : Entries) {			for (const auto &Entry : Entries) {
	assert(ShouldCreateSymbols == static_cast<bool>(Entry->getValue().Symbol) &&			assert(ShouldCreateSymbols == static_cast<bool>(Entry->getValue().Symbol) &&
	"Mismatch between setting and entry");			"Mismatch between setting and entry");

	// Emit a label for reference from debug information entries.			// Emit a label for reference from debug information entries.
	if (ShouldCreateSymbols)			if (ShouldCreateSymbols)
	Asm.OutStreamer->EmitLabel(Entry->getValue().Symbol);			Asm.OutStreamer->EmitLabel(Entry->getValue().Symbol);

	// Emit the string itself with a terminating null byte.			// Emit the string itself with a terminating null byte.
	Asm.OutStreamer->AddComment("string offset=" +			Asm.OutStreamer->AddComment("string offset=" +
	Twine(Entry->getValue().Offset));			Twine(Entry->getValue().Offset));
	Asm.OutStreamer->EmitBytes(			Asm.OutStreamer->EmitBytes(
	StringRef(Entry->getKeyData(), Entry->getKeyLength() + 1));			StringRef(Entry->getKeyData(), Entry->getKeyLength() + 1));
	}			}

	// If we've got an offset section go ahead and emit that now as well.			// If we've got an offset section go ahead and emit that now as well.
	if (OffsetSection) {			if (OffsetSection) {
				// Now only take the indexed entries and put them in an array by their ID so
				// we can emit them in order.
				Entries.resize(NumIndexedStrings);
				for (const auto &Entry : Pool) {
				if (Entry.getValue().isIndexed())
				Entries[Entry.getValue().Index] = &Entry;
				}

	Asm.OutStreamer->SwitchSection(OffsetSection);			Asm.OutStreamer->SwitchSection(OffsetSection);
	unsigned size = 4; // FIXME: DWARF64 is 8.			unsigned size = 4; // FIXME: DWARF64 is 8.
	for (const auto &Entry : Entries)			for (const auto &Entry : Entries)
	if (UseRelativeOffsets)			if (UseRelativeOffsets)
	Asm.emitDwarfStringOffset(Entry->getValue());			Asm.emitDwarfStringOffset(Entry->getValue());
	else			else
	Asm.OutStreamer->EmitIntValue(Entry->getValue().Offset, size);			Asm.OutStreamer->EmitIntValue(Entry->getValue().Offset, size);
	}			}
	}			}

llvm/trunk/lib/CodeGen/AsmPrinter/DwarfUnit.cpp

if (CUNode->isDebugDirectivesOnly())		if (CUNode->isDebugDirectivesOnly())
return;		return;

if (DD->useInlineStrings()) {		if (DD->useInlineStrings()) {
Die.addValue(DIEValueAllocator, Attribute, dwarf::DW_FORM_string,		Die.addValue(DIEValueAllocator, Attribute, dwarf::DW_FORM_string,
new (DIEValueAllocator)		new (DIEValueAllocator)
DIEInlineString(String, DIEValueAllocator));		DIEInlineString(String, DIEValueAllocator));
return;		return;
}		}
auto StringPoolEntry = DU->getStringPool().getEntry(*Asm, String);
dwarf::Form IxForm =		dwarf::Form IxForm =
isDwoUnit() ? dwarf::DW_FORM_GNU_str_index : dwarf::DW_FORM_strp;		isDwoUnit() ? dwarf::DW_FORM_GNU_str_index : dwarf::DW_FORM_strp;

		auto StringPoolEntry =
		useSegmentedStringOffsetsTable() || IxForm == dwarf::DW_FORM_GNU_str_index
		? DU->getStringPool().getIndexedEntry(*Asm, String)
		: DU->getStringPool().getEntry(*Asm, String);

// For DWARF v5 and beyond, use the smallest strx? form possible.		// For DWARF v5 and beyond, use the smallest strx? form possible.
if (useSegmentedStringOffsetsTable()) {		if (useSegmentedStringOffsetsTable()) {
IxForm = dwarf::DW_FORM_strx1;		IxForm = dwarf::DW_FORM_strx1;
unsigned Index = StringPoolEntry.getIndex();		unsigned Index = StringPoolEntry.getIndex();
if (Index > 0xffffff)		if (Index > 0xffffff)
IxForm = dwarf::DW_FORM_strx4;		IxForm = dwarf::DW_FORM_strx4;
else if (Index > 0xffff)		else if (Index > 0xffff)
IxForm = dwarf::DW_FORM_strx3;		IxForm = dwarf::DW_FORM_strx3;

llvm/trunk/test/DebugInfo/X86/string-offsets-table-order.ll

				; REQUIRES: object-emission
				; RUN: llc -mtriple=x86_64-unknown-linux-gnu -split-dwarf-file=foo.dwo -filetype=obj < %s \
				; RUN: | llvm-dwarfdump -v - | FileCheck %s

				; This triggers a situation where the order of entries in the .debug_str and
				; .debug_str_offsets sections does not match and makes sure that all entries are
				; still wired up correctly.

				; Produced with "clang -S -emit-llvm -gdwarf-5" from source "int X;", copied
				; three times and modified by hand.

				; CHECK: .debug_info contents:
				; CHECK: DW_TAG_compile_unit
				; CHECK: DW_AT_comp_dir [DW_FORM_strx1] ( indexed (00000001) string = "X3")
				; CHECK: DW_TAG_compile_unit
				; CHECK: DW_AT_comp_dir [DW_FORM_strx1] ( indexed (00000002) string = "X2")
				; CHECK: DW_TAG_compile_unit
				; CHECK: DW_AT_comp_dir [DW_FORM_strx1] ( indexed (00000003) string = "X1")
				; CHECK: .debug_info.dwo contents:

				; CHECK: .debug_str contents:
				; CHECK: 0x[[X3:[0-9a-f]*]]: "X3"
				; CHECK: 0x[[X1:[0-9a-f]*]]: "X1"
				; CHECK: 0x[[X2:[0-9a-f]*]]: "X2"

				; CHECK: .debug_str_offsets contents:
				; CHECK: Format = DWARF32, Version = 5
				; CHECK-NEXT: 00000000 "foo.dwo"
				; CHECK-NEXT: [[X3]] "X3"
				; CHECK-NEXT: [[X2]] "X2"
				; CHECK-NEXT: [[X1]] "X1"
				; CHECK-EMPTY:



				!llvm.dbg.cu = !{!10, !20, !30}
				!llvm.module.flags = !{!0, !1, !2}
				!llvm.ident = !{!3}

				!0 = !{i32 2, !"Dwarf Version", i32 5}
				!1 = !{i32 2, !"Debug Info Version", i32 3}
				!2 = !{i32 1, !"wchar_size", i32 4}
				!3 = !{!"clang version 7.0.0 (trunk 337353) (llvm/trunk 337361)"}


				@X1 = dso_local global i32 0, align 4, !dbg !11

				!10 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !13, producer: "clang version 7.0.0 (trunk 337353) (llvm/trunk 337361)", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !14, globals: !15)
				!11 = !DIGlobalVariableExpression(var: !12, expr: !DIExpression())
				!12 = distinct !DIGlobalVariable(name: "X1", scope: !10, file: !16, line: 1, type: !17, isLocal: false, isDefinition: true)
				!13 = !DIFile(filename: "-", directory: "X3", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!14 = !{}
				!15 = !{!11}
				!16 = !DIFile(filename: "<stdin>", directory: "X3", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!17 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)


				@X2 = dso_local global i32 0, align 4, !dbg !21

				!20 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !23, producer: "clang version 7.0.0 (trunk 337353) (llvm/trunk 337361)", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !24, globals: !25)
				!21 = !DIGlobalVariableExpression(var: !22, expr: !DIExpression())
				!22 = distinct !DIGlobalVariable(name: "X2", scope: !20, file: !26, line: 1, type: !27, isLocal: false, isDefinition: true)
				!23 = !DIFile(filename: "-", directory: "X2", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!24 = !{}
				!25 = !{!21}
				!26 = !DIFile(filename: "<stdin>", directory: "X2", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!27 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)


				@X3 = dso_local global i32 0, align 4, !dbg !31

				!30 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !33, producer: "clang version 7.0.0 (trunk 337353) (llvm/trunk 337361)", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !34, globals: !35)
				!31 = !DIGlobalVariableExpression(var: !32, expr: !DIExpression())
				!32 = distinct !DIGlobalVariable(name: "X3", scope: !30, file: !36, line: 1, type: !37, isLocal: false, isDefinition: true)
				!33 = !DIFile(filename: "-", directory: "X1", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!34 = !{}
				!35 = !{!31}
				!36 = !DIFile(filename: "<stdin>", directory: "X1", checksumkind: CSK_MD5, checksum: "f2e6e10e303927a308f1645fbf6f710e")
				!37 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)

llvm/trunk/test/DebugInfo/X86/string-offsets-table.ll

	; REQUIRES: object-emission			; REQUIRES: object-emission
	; RUN: llc -mtriple=x86_64-unknown-linux-gnu -filetype=obj < %s | llvm-dwarfdump -v - \			; RUN: llc -mtriple=x86_64-unknown-linux-gnu -filetype=obj < %s | llvm-dwarfdump -v - \
	; RUN: | FileCheck --check-prefix=MONOLITHIC %s			; RUN: | FileCheck --check-prefix=MONOLITHIC %s
	; RUN: llc -mtriple=x86_64-unknown-linux-gnu -split-dwarf-file=%t.dwo -filetype=obj < %s \			; RUN: llc -mtriple=x86_64-unknown-linux-gnu -split-dwarf-file=foo.dwo -filetype=obj < %s \
	; RUN: | llvm-dwarfdump -v - | FileCheck --check-prefix=SPLIT %s			; RUN: | llvm-dwarfdump -v - | FileCheck --check-prefix=SPLIT %s

	; This basic test checks the emission of a DWARF v5 string offsets table in			; This basic test checks the emission of a DWARF v5 string offsets table in
	; the split and non-split (monolithic) scenario.			; the split and non-split (monolithic) scenario.
	;			;
	; Constructed from the following source with			; Constructed from the following source with
	; clang -S -emit-llvm -gdwarf-5			; clang -S -emit-llvm -gdwarf-5
	;			;
	[...]
	; attribute and that it has the right value.			; attribute and that it has the right value.
	;			;
	; SPLIT: .debug_info contents:			; SPLIT: .debug_info contents:
	; SPLIT-NEXT: 0x00000000: Compile Unit:{{.*}}DW_UT_skeleton			; SPLIT-NEXT: 0x00000000: Compile Unit:{{.*}}DW_UT_skeleton
	; SPLIT-NOT: contents:			; SPLIT-NOT: contents:
	; SPLIT: DW_TAG_compile_unit			; SPLIT: DW_TAG_compile_unit
	; SPLIT-NOT: {{DW_TAG|contents:}}			; SPLIT-NOT: {{DW_TAG|contents:}}
	; SPLIT: DW_AT_str_offsets_base [DW_FORM_sec_offset] (0x00000008)			; SPLIT: DW_AT_str_offsets_base [DW_FORM_sec_offset] (0x00000008)
				; SPLIT: DW_AT_GNU_dwo_name [DW_FORM_strx1] ( indexed (00000000) string = "foo.dwo")
				; SPLIT: DW_AT_comp_dir [DW_FORM_strx1] ( indexed (00000001) string = "/home/test")

	; Check for the split CU in .debug_info.dwo.			; Check for the split CU in .debug_info.dwo.
	; SPLIT: .debug_info.dwo contents:			; SPLIT: .debug_info.dwo contents:
	; SPLIT-NEXT: 0x00000000: Compile Unit:{{.*}}DW_UT_split_compile			; SPLIT-NEXT: 0x00000000: Compile Unit:{{.*}}DW_UT_split_compile
	; SPLIT-NOT: contents:			; SPLIT-NOT: contents:
	; SPLIT: DW_TAG_compile_unit			; SPLIT: DW_TAG_compile_unit
	;			;
	; Check that a couple of indexed strings are displayed correctly and that			; Check that a couple of indexed strings are displayed correctly and that
	; they have the right format (DW_FORM_strx1).			; they have the right format (DW_FORM_strx1).
	; SPLIT-NOT: contents:			; SPLIT-NOT: contents:
	; SPLIT: DW_TAG_enumerator			; SPLIT: DW_TAG_enumerator
	; SPLIT-NOT: {{DW_TAG|NULL}}			; SPLIT-NOT: {{DW_TAG|NULL}}
	; SPLIT: DW_AT_name [DW_FORM_strx1] ( indexed (00000004) string = "a")			; SPLIT: DW_AT_name [DW_FORM_strx1] ( indexed (00000004) string = "a")
	; SPLIT-NOT: contents:			; SPLIT-NOT: contents:
	; SPLIT: DW_TAG_enumerator			; SPLIT: DW_TAG_enumerator
	; SPLIT-NOT: {{DW_TAG|NULL}}			; SPLIT-NOT: {{DW_TAG|NULL}}
	; SPLIT: DW_AT_name [DW_FORM_strx1] ( indexed (00000005) string = "b")			; SPLIT: DW_AT_name [DW_FORM_strx1] ( indexed (00000005) string = "b")
	;			;
	; Extract the string offsets referenced in the main file by the skeleton unit.			; Extract the string offsets referenced in the main file by the skeleton unit.
	; SPLIT: .debug_str contents:			; SPLIT: .debug_str contents:
	; SPLIT-NEXT: 0x00000000:{{.*}}			; SPLIT-NEXT: 0x00000000: "foo.dwo"
	; SPLIT-NEXT: 0x[[STRING2SPLIT:[0-9a-f]*]]{{.*}}			; SPLIT-NEXT: 0x[[STRING2SPLIT:[0-9a-f]*]]: "/home/test"
	; SPLIT-NEXT: 0x[[STRING3SPLIT:[0-9a-f]*]]{{.*}}			; SPLIT-NEXT: 0x[[STRING3SPLIT:[0-9a-f]*]]: "E"
	; SPLIT-NEXT: 0x[[STRING4SPLIT:[0-9a-f]*]]{{.*}}			; SPLIT-NEXT: 0x[[STRING4SPLIT:[0-9a-f]*]]: "glob"
	;			;
	; Extract the string offsets referenced in the .dwo file by the split unit.			; Extract the string offsets referenced in the .dwo file by the split unit.
	; SPLIT: .debug_str.dwo contents:			; SPLIT: .debug_str.dwo contents:
	; SPLIT-NEXT: 0x00000000:{{.*}}			; SPLIT-NEXT: 0x00000000:{{.*}}
	; SPLIT-NEXT: 0x[[STRING2DWO:[0-9a-f]*]]{{.*}}			; SPLIT-NEXT: 0x[[STRING2DWO:[0-9a-f]*]]{{.*}}
	; SPLIT-NEXT: 0x[[STRING3DWO:[0-9a-f]*]]{{.*}}			; SPLIT-NEXT: 0x[[STRING3DWO:[0-9a-f]*]]{{.*}}
	;			;
	; Check the string offsets sections in both the main and the .dwo files and			; Check the string offsets sections in both the main and the .dwo files and
	; verify that the extracted string offsets are referenced correctly.			; verify that the extracted string offsets are referenced correctly. The
				; sections should contain only the offsets of strings that are actually
				; referenced by the debug info.
	; SPLIT: .debug_str_offsets contents:			; SPLIT: .debug_str_offsets contents:
	; SPLIT-NEXT: 0x00000000: Contribution size = 20, Format = DWARF32, Version = 5			; SPLIT-NEXT: 0x00000000: Contribution size = 12, Format = DWARF32, Version = 5
	; SPLIT-NEXT: 0x00000008: 00000000{{.*}}			; SPLIT-NEXT: 0x00000008: 00000000 "foo.dwo"
	; SPLIT-NEXT: 0x0000000c: [[STRING2SPLIT]]			; SPLIT-NEXT: 0x0000000c: [[STRING2SPLIT]] "/home/test"
	; SPLIT-NEXT: 0x00000010: [[STRING3SPLIT]]			; SPLIT-EMPTY:
	; SPLIT-NEXT: 0x00000014: [[STRING4SPLIT]]
	; SPLIT: .debug_str_offsets.dwo contents:			; SPLIT: .debug_str_offsets.dwo contents:
	; SPLIT-NEXT: 0x00000000: Contribution size = 36, Format = DWARF32, Version = 5			; SPLIT-NEXT: 0x00000000: Contribution size = 36, Format = DWARF32, Version = 5
	; SPLIT-NEXT: 0x00000008: 00000000{{.*}}			; SPLIT-NEXT: 0x00000008: 00000000{{.*}}
	; SPLIT-NEXT: 0x0000000c: [[STRING2DWO]]{{.*}}			; SPLIT-NEXT: 0x0000000c: [[STRING2DWO]]{{.*}}
	; SPLIT-NEXT: 0x00000010: [[STRING3DWO]]			; SPLIT-NEXT: 0x00000010: [[STRING3DWO]]

	@glob = global i32 0, align 4, !dbg !0			@glob = global i32 0, align 4, !dbg !0

	[...]
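The change in the expected "Contribution size" (20 before, 12 after) follows from the DWARF32 version 5 layout of a string offsets contribution. The sketch below is only an illustration of that arithmetic. It assumes the size printed by llvm-dwarfdump excludes the 4-byte length field itself, which matches the first offset being shown at 0x00000008. The helper name `contributionSize` is hypothetical and is not part of LLVM.

```cpp
#include <cstdint>

// Illustrative helper (not an LLVM API). In DWARF32 version 5 the
// contribution consists of a 2-byte version, 2 bytes of padding and one
// 4-byte offset per indexed string. The 4-byte unit_length that precedes it
// is assumed not to be counted in the printed size.
constexpr uint32_t contributionSize(uint32_t NumIndexedStrings) {
  return 2 /*version*/ + 2 /*padding*/ + 4 * NumIndexedStrings;
}

// Before the patch the skeleton's table held four offsets, after it only
// the two strings referenced via DW_FORM_strx1 remain.
static_assert(contributionSize(4) == 20, "four offsets");
static_assert(contributionSize(2) == 12, "two offsets");
```

By the same arithmetic, the unchanged size of 36 in .debug_str_offsets.dwo corresponds to eight offsets.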

llvm/trunk/tools/dsymutil/DwarfStreamer.cpp

	[...]
	void DwarfStreamer::emitDIE(DIE &Die) {			void DwarfStreamer::emitDIE(DIE &Die) {
	MS->SwitchSection(MOFI->getDwarfInfoSection());			MS->SwitchSection(MOFI->getDwarfInfoSection());
	Asm->emitDwarfDIE(Die);			Asm->emitDwarfDIE(Die);
	}			}

	/// Emit the debug_str section stored in \p Pool.			/// Emit the debug_str section stored in \p Pool.
	void DwarfStreamer::emitStrings(const NonRelocatableStringpool &Pool) {			void DwarfStreamer::emitStrings(const NonRelocatableStringpool &Pool) {
	Asm->OutStreamer->SwitchSection(MOFI->getDwarfStrSection());			Asm->OutStreamer->SwitchSection(MOFI->getDwarfStrSection());
	std::vector<DwarfStringPoolEntryRef> Entries = Pool.getEntries();			std::vector<DwarfStringPoolEntryRef> Entries = Pool.getEntriesForEmission();
	for (auto Entry : Entries) {			for (auto Entry : Entries) {
	if (Entry.getIndex() == -1U)
	break;
	// Emit the string itself.			// Emit the string itself.
	Asm->OutStreamer->EmitBytes(Entry.getString());			Asm->OutStreamer->EmitBytes(Entry.getString());
	// Emit a null terminator.			// Emit a null terminator.
	Asm->emitInt8(0);			Asm->emitInt8(0);
	}			}
	}			}

	void DwarfStreamer::emitDebugNames(			void DwarfStreamer::emitDebugNames(
	[...]

llvm/trunk/tools/dsymutil/MachOUtils.cpp

[...]	if (ShouldEmitSymtab) {
assert(OutFile.tell() == StringStart);		assert(OutFile.tell() == StringStart);

// Transfer string table.		// Transfer string table.
// FIXME: The NonRelocatableStringpool starts with an empty string, but		// FIXME: The NonRelocatableStringpool starts with an empty string, but
// dsymutil-classic starts the reconstructed string table with 2 of these.		// dsymutil-classic starts the reconstructed string table with 2 of these.
// Reproduce that behavior for now (there is corresponding code in		// Reproduce that behavior for now (there is corresponding code in
// transferSymbol).		// transferSymbol).
OutFile << '\0';		OutFile << '\0';
std::vector<DwarfStringPoolEntryRef> Strings = NewStrings.getEntries();		std::vector<DwarfStringPoolEntryRef> Strings =
		NewStrings.getEntriesForEmission();
for (auto EntryRef : Strings) {		for (auto EntryRef : Strings) {
if (EntryRef.getIndex() == -1U)
break;
OutFile.write(EntryRef.getString().data(),		OutFile.write(EntryRef.getString().data(),
EntryRef.getString().size() + 1);		EntryRef.getString().size() + 1);
}		}
}		}

assert(OutFile.tell() == StringStart + NewStringsSize);		assert(OutFile.tell() == StringStart + NewStringsSize);

// Pad till the Dwarf segment start.		// Pad till the Dwarf segment start.
[...]

llvm/trunk/tools/dsymutil/NonRelocatableStringpool.h

[...]	public:
/// will chain it though.		/// will chain it though.
///		///
/// \returns The StringRef that points to permanent storage to use		/// \returns The StringRef that points to permanent storage to use
/// in place of \p S.		/// in place of \p S.
StringRef internString(StringRef S);		StringRef internString(StringRef S);

uint64_t getSize() { return CurrentEndOffset; }		uint64_t getSize() { return CurrentEndOffset; }

std::vector<DwarfStringPoolEntryRef> getEntries() const;		/// Return the list of strings to be emitted. This does not contain the
		/// strings which were added via internString only.
		std::vector<DwarfStringPoolEntryRef> getEntriesForEmission() const;

private:		private:
MapTy Strings;		MapTy Strings;
uint32_t CurrentEndOffset = 0;		uint32_t CurrentEndOffset = 0;
unsigned NumEntries = 0;		unsigned NumEntries = 0;
DwarfStringPoolEntryRef EmptyString;		DwarfStringPoolEntryRef EmptyString;
};		};

[...]

llvm/trunk/tools/dsymutil/NonRelocatableStringpool.cpp

	[...]
	namespace dsymutil {			namespace dsymutil {

	DwarfStringPoolEntryRef NonRelocatableStringpool::getEntry(StringRef S) {			DwarfStringPoolEntryRef NonRelocatableStringpool::getEntry(StringRef S) {
	if (S.empty() && !Strings.empty())			if (S.empty() && !Strings.empty())
	return EmptyString;			return EmptyString;

	auto I = Strings.insert({S, DwarfStringPoolEntry()});			auto I = Strings.insert({S, DwarfStringPoolEntry()});
	auto &Entry = I.first->second;			auto &Entry = I.first->second;
	if (I.second || Entry.Index == -1U) {			if (I.second || !Entry.isIndexed()) {
	Entry.Index = NumEntries++;			Entry.Index = NumEntries++;
	Entry.Offset = CurrentEndOffset;			Entry.Offset = CurrentEndOffset;
	Entry.Symbol = nullptr;			Entry.Symbol = nullptr;
	CurrentEndOffset += S.size() + 1;			CurrentEndOffset += S.size() + 1;
	}			}
	return DwarfStringPoolEntryRef(*I.first);			return DwarfStringPoolEntryRef(*I.first, true);
	}			}

	StringRef NonRelocatableStringpool::internString(StringRef S) {			StringRef NonRelocatableStringpool::internString(StringRef S) {
	DwarfStringPoolEntry Entry{nullptr, 0, -1U};			DwarfStringPoolEntry Entry{nullptr, 0, DwarfStringPoolEntry::NotIndexed};
	auto InsertResult = Strings.insert({S, Entry});			auto InsertResult = Strings.insert({S, Entry});
	return InsertResult.first->getKey();			return InsertResult.first->getKey();
	}			}

	std::vector<DwarfStringPoolEntryRef>			std::vector<DwarfStringPoolEntryRef>
	NonRelocatableStringpool::getEntries() const {			NonRelocatableStringpool::getEntriesForEmission() const {
	std::vector<DwarfStringPoolEntryRef> Result;			std::vector<DwarfStringPoolEntryRef> Result;
	Result.reserve(Strings.size());			Result.reserve(Strings.size());
	for (const auto &E : Strings)			for (const auto &E : Strings)
	Result.emplace_back(E);			if (E.getValue().isIndexed())
				Result.emplace_back(E, true);
	llvm::sort(			llvm::sort(
	Result.begin(), Result.end(),			Result.begin(), Result.end(),
	[](const DwarfStringPoolEntryRef A, const DwarfStringPoolEntryRef B) {			[](const DwarfStringPoolEntryRef A, const DwarfStringPoolEntryRef B) {
	return A.getIndex() < B.getIndex();			return A.getIndex() < B.getIndex();
	});			});
	return Result;			return Result;
	}			}

	} // namespace dsymutil			} // namespace dsymutil
	} // namespace llvm			} // namespace llvm
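The interplay of internString, getEntry and getEntriesForEmission can be sketched in isolation. The following is a simplified model using only the standard library, not the dsymutil implementation. `SketchPool` and `SketchEntry` are hypothetical names, the special handling of the empty string is left out, and the emission list is returned as plain strings instead of DwarfStringPoolEntryRef.

```cpp
#include <algorithm>
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for DwarfStringPoolEntry.
struct SketchEntry {
  static constexpr unsigned NotIndexed = ~0u;
  uint64_t Offset = 0;
  unsigned Index = NotIndexed;
  bool isIndexed() const { return Index != NotIndexed; }
};

class SketchPool {
  std::map<std::string, SketchEntry> Strings;
  uint64_t CurrentEndOffset = 0;
  unsigned NumEntries = 0;

public:
  // Only gives the string permanent storage; no offset or index is assigned.
  const std::string &internString(const std::string &S) {
    return Strings.insert({S, SketchEntry()}).first->first;
  }

  // Assigns an offset and an index the first time the string is requested,
  // even if it had been interned earlier.
  const SketchEntry &getEntry(const std::string &S) {
    SketchEntry &E = Strings.insert({S, SketchEntry()}).first->second;
    if (!E.isIndexed()) {
      E.Index = NumEntries++;
      E.Offset = CurrentEndOffset;
      CurrentEndOffset += S.size() + 1; // string plus null terminator
    }
    return E;
  }

  uint64_t getSize() const { return CurrentEndOffset; }

  // Strings that were requested via getEntry, ordered by index. Strings that
  // were only interned are skipped.
  std::vector<std::string> getEntriesForEmission() const {
    std::vector<std::pair<unsigned, std::string>> Indexed;
    for (const auto &KV : Strings)
      if (KV.second.isIndexed())
        Indexed.push_back({KV.second.Index, KV.first});
    std::sort(Indexed.begin(), Indexed.end());
    std::vector<std::string> Result;
    for (const auto &P : Indexed)
      Result.push_back(P.second);
    return Result;
  }
};
```

Because the filtering happens when the list is built, callers such as emitStrings no longer need the early `break` on an index of -1U that the old code relied on.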

llvm/trunk/unittests/CodeGen/DIEHashTest.cpp

[...]	public:
BumpPtrAllocator Alloc;		BumpPtrAllocator Alloc;

private:		private:
StringMap<DwarfStringPoolEntry> Pool;		StringMap<DwarfStringPoolEntry> Pool;

public:		public:
DIEString getString(StringRef S) {		DIEString getString(StringRef S) {
DwarfStringPoolEntry Entry = {nullptr, 1, 1};		DwarfStringPoolEntry Entry = {nullptr, 1, 1};
return DIEString(		return DIEString(DwarfStringPoolEntryRef(
DwarfStringPoolEntryRef(*Pool.insert(std::make_pair(S, Entry)).first));		*Pool.insert(std::make_pair(S, Entry)).first, Entry.isIndexed()));
}		}
};		};

TEST_F(DIEHashTest, Data1) {		TEST_F(DIEHashTest, Data1) {
DIEHash Hash;		DIEHash Hash;
DIE &Die = *DIE::get(Alloc, dwarf::DW_TAG_base_type);		DIE &Die = *DIE::get(Alloc, dwarf::DW_TAG_base_type);
DIEInteger Size(4);		DIEInteger Size(4);
Die.addValue(Alloc, dwarf::DW_AT_byte_size, dwarf::DW_FORM_data1, Size);		Die.addValue(Alloc, dwarf::DW_AT_byte_size, dwarf::DW_FORM_data1, Size);
[...]

llvm/trunk/unittests/DebugInfo/DWARF/DWARFDebugInfoTest.cpp

	[...]

	TEST(DWARFDebugInfo, TestDWARF32Version4Addr8Addresses) {			TEST(DWARFDebugInfo, TestDWARF32Version4Addr8Addresses) {
	// Test that we can decode address values in DWARF32, version 4, with 8 byte			// Test that we can decode address values in DWARF32, version 4, with 8 byte
	// addresses.			// addresses.
	typedef uint64_t AddrType;			typedef uint64_t AddrType;
	TestAddresses<4, AddrType>();			TestAddresses<4, AddrType>();
	}			}

				TEST(DWARFDebugInfo, TestStringOffsets) {
				Triple Triple = getHostTripleForAddrSize(sizeof(void *));
				if (!isConfigurationSupported(Triple))
				return;

				const char *String1 = "Hello";
				const char *String2 = "World";

				auto ExpectedDG = dwarfgen::Generator::create(Triple, 5);
				ASSERT_THAT_EXPECTED(ExpectedDG, Succeeded());
				dwarfgen::Generator *DG = ExpectedDG.get().get();
				dwarfgen::CompileUnit &CU = DG->addCompileUnit();
				dwarfgen::DIE CUDie = CU.getUnitDIE();

				CUDie.addStrOffsetsBaseAttribute();

				uint16_t Attr = DW_AT_lo_user;

				// Create our strings. First we create a non-indexed reference to String1,
				// followed by an indexed String2. Finally, we add an indexed reference to
				// String1.
				const auto Attr1 = static_cast<dwarf::Attribute>(Attr++);
				CUDie.addAttribute(Attr1, DW_FORM_strp, String1);

				const auto Attr2 = static_cast<dwarf::Attribute>(Attr++);
				CUDie.addAttribute(Attr2, DW_FORM_strx, String2);

				const auto Attr3 = static_cast<dwarf::Attribute>(Attr++);
				CUDie.addAttribute(Attr3, DW_FORM_strx, String1);

				// Generate the DWARF
				StringRef FileBytes = DG->generate();
				MemoryBufferRef FileBuffer(FileBytes, "dwarf");
				auto Obj = object::ObjectFile::createObjectFile(FileBuffer);
				ASSERT_TRUE((bool)Obj);
				std::unique_ptr<DWARFContext> DwarfContext = DWARFContext::create(**Obj);
				uint32_t NumCUs = DwarfContext->getNumCompileUnits();
				ASSERT_EQ(NumCUs, 1u);
				DWARFUnit *U = DwarfContext->getUnitAtIndex(0);
				auto DieDG = U->getUnitDIE(false);
				ASSERT_TRUE(DieDG.isValid());

				// Now make sure the string offsets came out properly. Attr2 should have index
				// 0 (because it was the first indexed string) even though the string itself
				// was added earlier.
				auto Extracted1 = toString(DieDG.find(Attr1));
				ASSERT_TRUE((bool)Extracted1);
				EXPECT_STREQ(String1, *Extracted1);

				Optional<DWARFFormValue> Form2 = DieDG.find(Attr2);
				ASSERT_TRUE((bool)Form2);
				EXPECT_EQ(0u, Form2->getRawUValue());
				auto Extracted2 = toString(Form2);
				ASSERT_TRUE((bool)Extracted2);
				EXPECT_STREQ(String2, *Extracted2);

				Optional<DWARFFormValue> Form3 = DieDG.find(Attr3);
				ASSERT_TRUE((bool)Form3);
				EXPECT_EQ(1u, Form3->getRawUValue());
				auto Extracted3 = toString(Form3);
				ASSERT_TRUE((bool)Extracted3);
				EXPECT_STREQ(String1, *Extracted3);
				}

				TEST(DWARFDebugInfo, TestEmptyStringOffsets) {
				Triple Triple = getHostTripleForAddrSize(sizeof(void *));
				if (!isConfigurationSupported(Triple))
				return;

				const char *String1 = "Hello";

				auto ExpectedDG = dwarfgen::Generator::create(Triple, 5);
				ASSERT_THAT_EXPECTED(ExpectedDG, Succeeded());
				dwarfgen::Generator *DG = ExpectedDG.get().get();
				dwarfgen::CompileUnit &CU = DG->addCompileUnit();
				dwarfgen::DIE CUDie = CU.getUnitDIE();

				uint16_t Attr = DW_AT_lo_user;

				// We shall insert only one string. It will be referenced directly.
				const auto Attr1 = static_cast<dwarf::Attribute>(Attr++);
				CUDie.addAttribute(Attr1, DW_FORM_strp, String1);

				// Generate the DWARF
				StringRef FileBytes = DG->generate();
				MemoryBufferRef FileBuffer(FileBytes, "dwarf");
				auto Obj = object::ObjectFile::createObjectFile(FileBuffer);
				ASSERT_TRUE((bool)Obj);
				std::unique_ptr<DWARFContext> DwarfContext = DWARFContext::create(**Obj);
				EXPECT_TRUE(
				DwarfContext->getDWARFObj().getStringOffsetSection().Data.empty());
				}

	TEST(DWARFDebugInfo, TestRelations) {			TEST(DWARFDebugInfo, TestRelations) {
	Triple Triple = getHostTripleForAddrSize(sizeof(void *));			Triple Triple = getHostTripleForAddrSize(sizeof(void *));
	if (!isConfigurationSupported(Triple))			if (!isConfigurationSupported(Triple))
	return;			return;

	// Test the DWARF APIs related to accessing the DW_AT_low_pc and			// Test the DWARF APIs related to accessing the DW_AT_low_pc and
	// DW_AT_high_pc.			// DW_AT_high_pc.
	uint16_t Version = 4;			uint16_t Version = 4;
	[...]
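The behaviour that TestStringOffsets and TestEmptyStringOffsets check can be modelled with a small standalone pool. This is a sketch under assumptions, not the DwarfStringPool code. `TwoModePool` and `offsetsTable` are hypothetical names, and the sketch assumes a string receives its .debug_str offset on first insertion while an index is handed out only on the first indexed request.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

class TwoModePool {
  static constexpr unsigned NotIndexed = ~0u;
  struct Entry {
    uint64_t Offset;
    unsigned Index;
  };
  std::unordered_map<std::string, Entry> Pool;
  uint64_t NumBytes = 0;
  unsigned NumIndexed = 0;

  // Adds the string on first use; the offset is fixed at that point.
  Entry &insert(const std::string &S) {
    auto I = Pool.insert({S, Entry{NumBytes, NotIndexed}});
    if (I.second)
      NumBytes += S.size() + 1; // string plus null terminator
    return I.first->second;
  }

public:
  // Direct reference (as for DW_FORM_strp): returns the string's offset.
  uint64_t getEntry(const std::string &S) { return insert(S).Offset; }

  // Indirect reference (as for DW_FORM_strx): returns the table index,
  // assigning the next free one on the first indexed request.
  unsigned getIndexedEntry(const std::string &S) {
    Entry &E = insert(S);
    if (E.Index == NotIndexed)
      E.Index = NumIndexed++;
    return E.Index;
  }

  // Offsets in index order; empty when no string was requested indirectly.
  std::vector<uint64_t> offsetsTable() const {
    std::vector<uint64_t> Table(NumIndexed);
    for (const auto &KV : Pool)
      if (KV.second.Index != NotIndexed)
        Table[KV.second.Index] = KV.second.Offset;
    return Table;
  }
};
```

Requesting "Hello" directly, then "World" and "Hello" indirectly, gives "World" index 0 and "Hello" index 1 in this model, which is the ordering the first test expects. A pool that only ever sees direct requests produces an empty table, as in the second test.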

llvm/trunk/unittests/DebugInfo/DWARF/DwarfGenerator.cpp

[...]	void dwarfgen::DIE::addAttribute(uint16_t A, dwarf::Form Form,
switch (Form) {		switch (Form) {
case DW_FORM_string:		case DW_FORM_string:
Die->addValue(DG.getAllocator(), static_cast<dwarf::Attribute>(A), Form,		Die->addValue(DG.getAllocator(), static_cast<dwarf::Attribute>(A), Form,
new (DG.getAllocator())		new (DG.getAllocator())
DIEInlineString(String, DG.getAllocator()));		DIEInlineString(String, DG.getAllocator()));
break;		break;

case DW_FORM_strp:		case DW_FORM_strp:
		Die->addValue(
		DG.getAllocator(), static_cast<dwarf::Attribute>(A), Form,
		DIEString(DG.getStringPool().getEntry(*DG.getAsmPrinter(), String)));
		break;

case DW_FORM_GNU_str_index:		case DW_FORM_GNU_str_index:
case DW_FORM_strx:		case DW_FORM_strx:
case DW_FORM_strx1:		case DW_FORM_strx1:
case DW_FORM_strx2:		case DW_FORM_strx2:
case DW_FORM_strx3:		case DW_FORM_strx3:
case DW_FORM_strx4:		case DW_FORM_strx4:
Die->addValue(		Die->addValue(DG.getAllocator(), static_cast<dwarf::Attribute>(A), Form,
DG.getAllocator(), static_cast<dwarf::Attribute>(A), Form,		DIEString(DG.getStringPool().getIndexedEntry(
DIEString(DG.getStringPool().getEntry(*DG.getAsmPrinter(), String)));		*DG.getAsmPrinter(), String)));
break;		break;

default:		default:
llvm_unreachable("Unhandled form!");		llvm_unreachable("Unhandled form!");
}		}
}		}

void dwarfgen::DIE::addAttribute(uint16_t A, dwarf::Form Form,		void dwarfgen::DIE::addAttribute(uint16_t A, dwarf::Form Form,
[...]
