This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/lldb/Core/
-
lldb/
-
Core/
1/1
FileSpecList.h
-
RangeMap.h
-
lit/SymbolFile/Breakpad/
-
SymbolFile/
-
Breakpad/
-
Inputs/
-
line-table-discontinuous-file-ids.syms
-
line-table-edgecases.syms
-
line-table-missing-file.syms
-
line-table.syms
-
line-table-discontinuous-file-ids.test
-
line-table-edgecases.test
-
line-table-missing-file.test
-
line-table.test
-
source/
-
Core/
-
FileSpecList.cpp
-
Plugins/SymbolFile/Breakpad/
-
SymbolFile/
-
Breakpad/
4/8
SymbolFileBreakpad.h
9/16
SymbolFileBreakpad.cpp

Differential D56595

SymbolFileBreakpad: Add line table support
ClosedPublic

Authored by labath on Jan 11 2019, 5:08 AM.

Download Raw Diff

Details

Reviewers

clayborg
zturner
lemo
markmentovai

Commits

rG3f35ab8b3000: SymbolFileBreakpad: Add line table support
rLLDB353404: SymbolFileBreakpad: Add line table support
rL353404: SymbolFileBreakpad: Add line table support

Summary

This patch teaches SymbolFileBreakpad to parse the line information in
breakpad files and present it to lldb.

The trickiest question here was what kind of "compile units" to present
to lldb, as there really isn't enough information in breakpad files to
correctly reconstruct those.

A couple of options were considered

have the entire file be one compile unit
have one compile unit for each FILE record
have one compile unit for each FUNC record

The main drawback of the first approach is that all of the files would
be considered "headers" by lldb, and so they wouldn't be searched if
target.inline-breakpoint-strategy=never. The single compile unit would
also be huge, and there isn't a good way to name it.

The second approach will create mostly correct compile units for cpp
files, but it will still be wrong for headers. However, the biggest
drawback here seemed to be the fact that this can cause a compile unit
to change mid-function (for example when a function from another file is
inlined or another file is #included into a function). While I don't
know of any specific thing that would break in this case, it does sound
like a thing that we should avoid.

In the end, we chose the third option, as it didn't seem to have any
major disadvantages, though it was not ideal either. One disadvantage
here is that this generates a large number of compile units, and there
is still a question on how to name it. We chose to simply name it after
the first line record in that function. This should be correct 99.99% of
the time, though it can produce somewhat strange results if the very
first line record comes from an #included file.

Diff Detail

Build Status

Buildable 27822
Build 27821: arc lint + arc unit

Event Timeline

labath created this revision.Jan 11 2019, 5:08 AM

Harbormaster completed remote builds in B26695: Diff 181252.Jan 11 2019, 5:08 AM

labath marked an inline comment as done.Jan 11 2019, 5:13 AM

labath added inline comments.

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp
245–246	Note that here i set `file=column=line=0` for the terminal entry, which isn't consistent with the dwarf plugin for instance (it puts there whatever falls out of the state automaton, which most likely means the values from the previous entry). AFAICT, this shouldn't be a problem, because the terminal entry is there to just determine the range of the last real entry.

So LLDB treats compile units special depending on the inline strategy. Because of this, I outlined some examples of how and why we should create a compile unit per "FUNC" token. Let me know if anything above was unclear

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp
193	I would vote to make 1 compile unit for each "FUNC". The hard part will be to select the right source file for each function. I would just start by selecting the first line entry for the function. Compile units are expected to give a list of support files, and in this case, you would make a set of files that are in the line entries only. So if you had: MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out INFO CODE_ID E05015768680393360A9074A9CE2895C FILE 0 /tmp/a.c FILE 1 /tmp/b.c FILE 2 /usr/include/foo.h FUNC b0 10 0 func1 b0 1 1 0 b1 1 2 0 b2 1 2 1 b4 1 3 1 FUNC b0 10 0 func2 b0 1 1 1 b1 1 2 1 b2 1 2 2 We would have a compile unit for "func1" with a cu for "/tmp/a.c" and with support files: support_file[0] = "/tmp/a.c" support_file[1] = "/usr/include/foo.h" We would have a compile unit for "func2" with a cu for "/tmp/b.c" and with support files: support_file[0] = "/tmp/b.c" support_file[1] = "/usr/include/foo.h" The main reason for make individual compile units, is LLDB treats a compile unit specially when settings breakpoints depending on if we ask for inline functions to be set. If we set the following setting: (lldb) settings set target.inline-breakpoint-strategy never Then we will check the name of the compile unit to ensure it matches. So if we did: (lldb) b a.c:12 This would only work if the actual lldb_private::CompileUnit has a FileSpec that matches "a.c".
194–220	return the count of the number of "FUNC" objects
195–197	parse a compile unit per function. We might want to cache all "FILE" entries in a list inside the SymbolFileBreakpad so we can easily pull out the FileSpecs when creating each compile unit. Also, each compile unit's ID can be the line number in the breakpad file to the "FUNC" entry. This allows easy access to each "FUNC" entry in the breakpad file when we are asked to parse more information about it (get compile unit support files, or any parsing of info for a compile unit.
253–274	It would be nice to put this parsing code into the lldb_private::breakpad::Line" class I talked about in the other patch? It would be great if this code looked like: lldb_private::breakpad::Line bp_line(line); switch (bp_line.GetToken()) { case Token::Func: // Create the compile unit and store into cu_sp cu_sp = bp_line.CreateCompileUnit(); break; case Token::Line: { addr_t address; size_t size; uint32_t line_num, file_num; if (bp_line.ParseLineEntry(address, size, line_num, file_num)) { // Discontiguous entries. Finish off the previous sequence and reset. if (next_addr && *next_addr != address) finish_sequence(); line_table_up->AppendLineEntryToSequence( line_seq_up.get(), address, line_num, 0, file_num, true, false, false, false, false); }
257	We need to search each compile unit to see which compile unit contains the address now.
294	This function will need to populate support_files for a given FUNC as mentioned above
source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h
98	Remove whitespace only change

I think I understand what you mean. I'll try to refactor this to create a compile unit for each function.

It took me a while to get back to this, but I believe this version now does what
we want. It creates a compile unit for each function. The compile units are
created as soon as the symbol file is initialized (it's needed to get the number
of compile units). The global list of support files is also parsed at this time
(needed to get the compile unit name). The list of support files and line tables
is created lazily for those compile units that need them.

A new element here is the "bookmark" class, which allows us to efficiently go
back the to place where the line tables for a given function/compile unit are
defined and parse those.

I added a couple of new tests to exercise handling of odd or broken minidump
files.

Let me know what you think.

Harbormaster completed remote builds in B27446: Diff 184103.Jan 29 2019, 9:20 AM

Greg, what do you think about the new approach in this patch?

I like the way you did the compile units and the line tables and support file list. It would be nice to change this to do things more lazily. Right now we are parsing all compile unit data into CompUnitData structures and then passing their info along to the lldb_private::CompileUnit when we need to. We can be more lazy, see inlined comments.

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp
193–194	This seems like where we would do the heavy parsing code that are in the initialize object function. .Get the character file offset from m_compile_units and just parse what we need here? It will cause less work for us in the initialize object call then and we can be lazier
218	Do we need to iterate over the file multiple times here? We do it once here, and then once on line 260.
220–239	Seems like we should just populate the m_compile_units data with address range to character file offset here? When we are asked to create a compile unit, we do this work by going to the "lldb_private::CompileUnit::GetID()" which will return the file offset and we just head there and start parsing?
229–245	From the compile unit, if GetID() returns the character file offset to the FUNC or first LINE, then we don't need the preparsed CompUnitData? We can just parse the line table here if and only if we need to
235	Ditto above
source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h
150–153	Why do we need more than just a file character offset here?
181–187	Seems like if we just pick the file character offset of the function or the function's line entries as the lldb_private::CompileUnit user ID (lldb::user_id_t) then we don't really need this class? We just create the compile unit as a lldb_private::CompileUnit and our symbol file parser will fill in the rest? Any reason we need this CompUnitData class?
211	Could this just be: using CompUnitMap = RangeDataVector<lldb::addr_t, lldb::addr_t, lldb::offset_t>; Where offset_t is the character fie offset for the first line of the FUNC entry? Any reason to use CompUnitData instead of just creating lldb_private::CompileUnit objects?
213	Use FileSpecList?

Thanks for the review Greg. See my responses inline. I'm going to try incorporating the changes tomorrow.

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp
218	The two loops iterate over different parts of the file. The first one goes through the FILE records, and this one does the FUNC records. So the iteration here is efficient because we already know where different kinds of records are located in the file. (of course, to figure out where these records are located, we've had to go through it once already (in ObjectFileBreakpad), so we still have to make two passes over this data in general. However, that is pretty much unavoidable if we want to do lazy (i.e. random access) into the file as it doesn't have any kind of index to start with.)
220–239	I think I could avoid creating the CompileUnit object here. However, I will still need to do the parsing here, as I will need to figure out the number of compile units first (best I might be able to achieve is to delay this until GetNumCompileUnits() time).
229–245	That is pretty much what happens here. CompUnitData construct the line table (almost) lazily. It doesn't preparse. The reason I have this indirection, is that the creation of line tables is coupled with the creation of the support file list: in order to build the line table, I (obviously) need to go through the LINE records however, I also need to go through the LINE records in order to build the CU file list, because I need to know what files are actually used in this "CU" It seemed like a good idea to me to avoid parsing the LINE records twice. So what I've done is that on the first call to (ReleaseLineTable\|ReleaseSupportFiles), CompUnitData will parse both things. Then, the second call will return the already parsed data. That seems like a good tradeoff to me as these two items are generally used together (one is fairly useless without the other).
source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h
150–153	That's because ObjectFileBreakpad breaks down the file into sections (so all FILE records would go into one section, PUBLIC records into another, etc.). This means that we don't need any "bookmarks" when we want to jump straight to the PUBLIC records for instance, but it does mean we need two coordinates (section, offset) when we want to jump to a specific record within a section.
181–187	I'd like the keep to enable the CompUnitData for conjugated parsing of line tables and support files. I think I can get rid of the compile unit field inside it, but that would mean relying on the SymbolVendor to conjure up the CompUnitSP when I need it, which is a bit of an odd dependency. (I need to conjure up the pointer from somewhere in the ResolveSymbolContext functions).
213	FileSpecList doesn't have the `resize` method. I use that to implement parsing of the FILE records, since the file records theoretically don't have to come in order, so I just resize the vector to fit the largest number I've encountered.

The latest version looks good to me. Please update the description (it still says it uses the one CU per symbols file)

include/lldb/Core/FileSpecList.h
68	nit: if you have move assignment-operator also add the move constructor

This revision is now accepted and ready to land.Feb 5 2019, 11:57 AM

Tried to make parsing as lazy as possible. GetNumSections() will count the
number of FUNC records, but will not create CompileUnit objects. FILE records
will be parsed when we create the first compile unit. The support files and line
tables will be parsed when the first of them is requested for a given CU.

I've removed the CU shared pointer from the CompUnitData structure. Instead, I go through the symbol vendor to fetch the CU SP.

Added the move constructor for FileSpecList.

Please take another look.

Harbormaster completed remote builds in B27814: Diff 185533.Feb 6 2019, 5:38 AM

labath edited the summary of this revision. (Show Details)Feb 6 2019, 5:59 AM

Just bounds check "index" in parse compile unit and this is good to go

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp
196	Validate "index" first? We will crash if someone iterates over too many CUs?

Add a bounds check to the GetCompileUnitAtIndex method

Harbormaster completed remote builds in B27822: Diff 185557.Feb 6 2019, 7:41 AM

clayborg accepted this revision.Feb 6 2019, 8:00 AM

Closed by commit rL353404: SymbolFileBreakpad: Add line table support (authored by labath). · Explain WhyFeb 7 2019, 5:42 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptFeb 7 2019, 5:42 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Revision Contents

Path

Size

include/

lldb/

Core/

FileSpecList.h

20 lines

RangeMap.h

1 line

lit/

SymbolFile/

Breakpad/

Inputs/

line-table-discontinuous-file-ids.syms

8 lines

line-table-edgecases.syms

7 lines

line-table-missing-file.syms

7 lines

line-table.syms

17 lines

line-table-discontinuous-file-ids.test

13 lines

line-table-edgecases.test

21 lines

line-table-missing-file.test

17 lines

line-table.test

45 lines

source/

Core/

FileSpecList.cpp

11 lines

Plugins/

SymbolFile/

Breakpad/

SymbolFileBreakpad.h

72 lines

SymbolFileBreakpad.cpp

320 lines

Diff 185557

include/lldb/Core/FileSpecList.h

	Show All 31 Lines
	public:			public:
	//------------------------------------------------------------------			//------------------------------------------------------------------
	/// Default constructor.			/// Default constructor.
	///			///
	/// Initialize this object with an empty file list.			/// Initialize this object with an empty file list.
	//------------------------------------------------------------------			//------------------------------------------------------------------
	FileSpecList();			FileSpecList();

	//------------------------------------------------------------------
	/// Copy constructor.			/// Copy constructor.
	///			FileSpecList(const FileSpecList &rhs) = default;
	/// Initialize this object with a copy of the file list from \a rhs.
	///			/// Move constructor
	/// @param[in] rhs			FileSpecList(FileSpecList &&rhs) = default;
	/// A const reference to another file list object.
	//------------------------------------------------------------------			/// Initialize this object from a vector of FileSpecs
	FileSpecList(const FileSpecList &rhs);			FileSpecList(std::vector<FileSpec> &&rhs) : m_files(std::move(rhs)) {}

	//------------------------------------------------------------------			//------------------------------------------------------------------
	/// Destructor.			/// Destructor.
	//------------------------------------------------------------------			//------------------------------------------------------------------
	~FileSpecList();			~FileSpecList();

	//------------------------------------------------------------------			//------------------------------------------------------------------
	/// Assignment operator.			/// Assignment operator.
	///			///
	/// Replace the file list in this object with the file list from \a rhs.			/// Replace the file list in this object with the file list from \a rhs.
	///			///
	/// @param[in] rhs			/// @param[in] rhs
	/// A file list object to copy.			/// A file list object to copy.
	///			///
	/// @return			/// @return
	/// A const reference to this object.			/// A const reference to this object.
	//------------------------------------------------------------------			//------------------------------------------------------------------
	const FileSpecList &operator=(const FileSpecList &rhs);			FileSpecList &operator=(const FileSpecList &rhs) = default;

				/// Move-assignment operator.
				FileSpecList &operator=(FileSpecList &&rhs) = default;
				lemoUnsubmitted Done Reply Inline Actions nit: if you have move assignment-operator also add the move constructor lemo: nit: if you have move assignment-operator also add the move constructor

	//------------------------------------------------------------------			//------------------------------------------------------------------
	/// Append a FileSpec object to the list.			/// Append a FileSpec object to the list.
	///			///
	/// Appends \a file to the end of the file list.			/// Appends \a file to the end of the file list.
	///			///
	/// @param[in] file			/// @param[in] file
	/// A new file to append to this file list.			/// A new file to append to this file list.
	▲ Show 20 Lines • Show All 147 Lines • Show Last 20 Lines

include/lldb/Core/RangeMap.h

Show First 20 Lines • Show All 709 Lines • ▼ Show 20 Lines	#endif
}		}

Entry *GetMutableEntryAtIndex(size_t i) {		Entry *GetMutableEntryAtIndex(size_t i) {
return ((i < m_entries.size()) ? &m_entries[i] : nullptr);		return ((i < m_entries.size()) ? &m_entries[i] : nullptr);
}		}

// Clients must ensure that "i" is a valid index prior to calling this		// Clients must ensure that "i" is a valid index prior to calling this
// function		// function
		Entry &GetEntryRef(size_t i) { return m_entries[i]; }
const Entry &GetEntryRef(size_t i) const { return m_entries[i]; }		const Entry &GetEntryRef(size_t i) const { return m_entries[i]; }

static bool BaseLessThan(const Entry &lhs, const Entry &rhs) {		static bool BaseLessThan(const Entry &lhs, const Entry &rhs) {
return lhs.GetRangeBase() < rhs.GetRangeBase();		return lhs.GetRangeBase() < rhs.GetRangeBase();
}		}

uint32_t FindEntryIndexThatContains(B addr) const {		uint32_t FindEntryIndexThatContains(B addr) const {
const Entry *entry = FindEntryThatContains(addr);		const Entry *entry = FindEntryThatContains(addr);
▲ Show 20 Lines • Show All 224 Lines • Show Last 20 Lines

lit/SymbolFile/Breakpad/Inputs/line-table-discontinuous-file-ids.syms

This file was added.

				MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out
				INFO CODE_ID E05015768680393360A9074A9CE2895C
				FILE 1 /tmp/a.c
				FILE 3 /tmp/c.c
				FUNC b0 10 0 func
				b0 1 1 1
				b1 1 2 1
				b2 1 2 3

lit/SymbolFile/Breakpad/Inputs/line-table-edgecases.syms

This file was added.

				MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out
				INFO CODE_ID E05015768680393360A9074A9CE2895C
				FILE 0 /tmp/a.c
				a0 1 1 0
				FUNC b0 10 0 func
				FUNC c0 10 0 func2
				c0 2 2 0

lit/SymbolFile/Breakpad/Inputs/line-table-missing-file.syms

This file was added.

				MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out
				INFO CODE_ID E05015768680393360A9074A9CE2895C
				FILE 0 /tmp/a.c
				FUNC b0 10 0 func
				b0 1 1 0
				b1 1 2 0
				b2 1 2 1

lit/SymbolFile/Breakpad/Inputs/line-table.syms

This file was added.

				MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out
				INFO CODE_ID E05015768680393360A9074A9CE2895C
				FILE 0 /tmp/a.c
				FILE 1 /tmp/c.c
				FILE 2 /tmp/d.c
				FUNC b0 10 0 func
				b0 1 1 0
				b1 1 2 0
				b2 1 2 1
				b4 1 3 1
				FUNC c0 10 0 func2
				c0 2 1 1
				c2 2 2 0
				FUNC d0 10 0 func3
				d0 2 1 2
				FUNC e0 10 0 func4
				e0 2 2 2

lit/SymbolFile/Breakpad/line-table-discontinuous-file-ids.test

This file was added.

				# Test that we handle files which has gaps in the FILE record IDs.

				# RUN: yaml2obj %S/Inputs/basic-elf.yaml > %T/line-table-discontinuous-file-ids.out
				# RUN: %lldb %T/line-table-discontinuous-file-ids.out \
				# RUN: -o "target symbols add -s line-table-discontinuous-file-ids.out %S/Inputs/line-table-discontinuous-file-ids.syms" \
				# RUN: -s %s -o exit \| FileCheck %s

				image dump line-table a.c
				# CHECK-LABEL: Line table for /tmp/a.c
				# CHECK-NEXT: 0x00000000004000b0: /tmp/a.c:1
				# CHECK-NEXT: 0x00000000004000b1: /tmp/a.c:2
				# CHECK-NEXT: 0x00000000004000b2: /tmp/c.c:2
				# CHECK-NEXT: 0x00000000004000b3:

lit/SymbolFile/Breakpad/line-table-edgecases.test

This file was added.

				# Test handling of breakpad files with some unusual or erroneous constructs. The
				# input contains a LINE record which does not belong to any function as well as
				# a FUNC record without any LINE records.

				# RUN: yaml2obj %S/Inputs/basic-elf.yaml > %T/line-table-edgecases.out
				# RUN: %lldb %T/line-table-edgecases.out \
				# RUN: -o "target symbols add -s line-table-edgecases.out %S/Inputs/line-table-edgecases.syms" \
				# RUN: -s %s -o exit \| FileCheck %s

				# Test that line table for func2 was parsed properly:
				image dump line-table a.c
				# CHECK-LABEL: Line table for /tmp/a.c
				# CHECK-NEXT: 0x00000000004000c0: /tmp/a.c:2
				# CHECK-NEXT: 0x00000000004000c2:
				# CHECK-EMPTY:

				# Looking up an address inside func should still work even if it does not result
				# in a line entry.
				image lookup -a 0x4000b2 -v
				# CHECK-LABEL: image lookup -a 0x4000b2 -v
				# CHECK: Summary: line-table-edgecases.out`func + 2

lit/SymbolFile/Breakpad/line-table-missing-file.test

This file was added.

				# Test that we do something reasonable if a LINE record references a
				# non-existing FILE record.
				# Right now, "something reasonable" means creating a line entry with an empty
				# file.

				# RUN: yaml2obj %S/Inputs/basic-elf.yaml > %T/line-table-missing-file.out
				# RUN: %lldb %T/line-table-missing-file.out \
				# RUN: -o "target symbols add -s line-table-missing-file.out %S/Inputs/line-table-missing-file.syms" \
				# RUN: -s %s -o exit \| FileCheck %s

				image dump line-table a.c
				# CHECK-LABEL: Line table for /tmp/a.c
				# CHECK-NEXT: 0x00000000004000b0: /tmp/a.c:1
				# CHECK-NEXT: 0x00000000004000b1: /tmp/a.c:2
				# CHECK-NEXT: 0x00000000004000b2: :2
				# CHECK-NEXT: 0x00000000004000b3:
				# CHECK-EMPTY:

lit/SymbolFile/Breakpad/line-table.test

This file was added.

				# RUN: yaml2obj %S/Inputs/basic-elf.yaml > %T/line-table.out
				# RUN: %lldb %T/line-table.out -o "target symbols add -s line-table.out %S/Inputs/line-table.syms" \
				# RUN: -s %s -o exit \| FileCheck %s

				# We create a compile unit for each function. The compile unit name is the first
				# line table entry in that function.
				# This symbol file contains a single function in the "compile unit" a.c. This
				# function has two line table sequences.
				image dump line-table a.c
				# CHECK-LABEL: Line table for /tmp/a.c
				# CHECK-NEXT: 0x00000000004000b0: /tmp/a.c:1
				# CHECK-NEXT: 0x00000000004000b1: /tmp/a.c:2
				# CHECK-NEXT: 0x00000000004000b2: /tmp/c.c:2
				# CHECK-NEXT: 0x00000000004000b3:
				# CHECK-EMPTY:
				# CHECK-NEXT: 0x00000000004000b4: /tmp/c.c:3
				# CHECK-NEXT: 0x00000000004000b5:
				# CHECK-EMPTY:

				# Single compile unit for c.c with a single line sequence.
				image dump line-table c.c
				# CHECK-LABEL: Line table for /tmp/c.c
				# CHECK-NEXT: 0x00000000004000c0: /tmp/c.c:1
				# CHECK-NEXT: 0x00000000004000c2: /tmp/a.c:2
				# CHECK-NEXT: 0x00000000004000c4:
				# CHECK-EMPTY:

				# There are two compile units called "d.c". Hence, two line tables.
				image dump line-table d.c
				# CHECK-LABEL: Line table for /tmp/d.c
				# CHECK-NEXT: 0x00000000004000d0: /tmp/d.c:1
				# CHECK-NEXT: 0x00000000004000d2:
				# CHECK-EMPTY:
				# CHECK-LABEL: Line table for /tmp/d.c
				# CHECK-NEXT: 0x00000000004000e0: /tmp/d.c:2
				# CHECK-NEXT: 0x00000000004000e2:
				# CHECK-EMPTY:

				image lookup -a 0x4000b2 -v
				# CHECK-LABEL: image lookup -a 0x4000b2 -v
				# CHECK: Summary: line-table.out`func + 2

				breakpoint set -f c.c -l 2
				# CHECK-LABEL: breakpoint set -f c.c -l 2
				# CHECK: Breakpoint 1: where = line-table.out`func + 2, address = 0x00000000004000b2

source/Core/FileSpecList.cpp

	Show All 14 Lines

	#include <stdint.h>			#include <stdint.h>

	using namespace lldb_private;			using namespace lldb_private;
	using namespace std;			using namespace std;

	FileSpecList::FileSpecList() : m_files() {}			FileSpecList::FileSpecList() : m_files() {}

	FileSpecList::FileSpecList(const FileSpecList &rhs) = default;

	FileSpecList::~FileSpecList() = default;			FileSpecList::~FileSpecList() = default;

	//------------------------------------------------------------------			//------------------------------------------------------------------
	// Assignment operator
	//------------------------------------------------------------------
	const FileSpecList &FileSpecList::operator=(const FileSpecList &rhs) {
	if (this != &rhs)
	m_files = rhs.m_files;
	return *this;
	}

	//------------------------------------------------------------------
	// Append the "file_spec" to the end of the file spec list.			// Append the "file_spec" to the end of the file spec list.
	//------------------------------------------------------------------			//------------------------------------------------------------------
	void FileSpecList::Append(const FileSpec &file_spec) {			void FileSpecList::Append(const FileSpec &file_spec) {
	m_files.push_back(file_spec);			m_files.push_back(file_spec);
	}			}

	//------------------------------------------------------------------			//------------------------------------------------------------------
	// Only append the "file_spec" if this list doesn't already contain it.			// Only append the "file_spec" if this list doesn't already contain it.
	▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h

//===-- SymbolFileBreakpad.h ------------------------------------- C++ --===//		//===-- SymbolFileBreakpad.h ------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLDB_PLUGINS_SYMBOLFILE_BREAKPAD_SYMBOLFILEBREAKPAD_H		#ifndef LLDB_PLUGINS_SYMBOLFILE_BREAKPAD_SYMBOLFILEBREAKPAD_H
#define LLDB_PLUGINS_SYMBOLFILE_BREAKPAD_SYMBOLFILEBREAKPAD_H		#define LLDB_PLUGINS_SYMBOLFILE_BREAKPAD_SYMBOLFILEBREAKPAD_H

		#include "Plugins/ObjectFile/Breakpad/BreakpadRecords.h"
		#include "lldb/Core/FileSpecList.h"
		#include "lldb/Symbol/LineTable.h"
#include "lldb/Symbol/SymbolFile.h"		#include "lldb/Symbol/SymbolFile.h"

namespace lldb_private {		namespace lldb_private {

namespace breakpad {		namespace breakpad {

class SymbolFileBreakpad : public SymbolFile {		class SymbolFileBreakpad : public SymbolFile {
public:		public:
Show All 38 Lines	public:

size_t ParseFunctions(CompileUnit &comp_unit) override;		size_t ParseFunctions(CompileUnit &comp_unit) override;

bool ParseLineTable(CompileUnit &comp_unit) override;		bool ParseLineTable(CompileUnit &comp_unit) override;

bool ParseDebugMacros(CompileUnit &comp_unit) override { return false; }		bool ParseDebugMacros(CompileUnit &comp_unit) override { return false; }

bool ParseSupportFiles(CompileUnit &comp_unit,		bool ParseSupportFiles(CompileUnit &comp_unit,
FileSpecList &support_files) override {		FileSpecList &support_files) override;
return false;
}
size_t ParseTypes(CompileUnit &cu) override { return 0; }		size_t ParseTypes(CompileUnit &cu) override { return 0; }

bool		bool
ParseImportedModules(const SymbolContext &sc,		ParseImportedModules(const SymbolContext &sc,
std::vector<ConstString> &imported_modules) override {		std::vector<ConstString> &imported_modules) override {
return false;		return false;
}		}

Show All 12 Lines	public:
Type *ResolveTypeUID(lldb::user_id_t type_uid) override { return nullptr; }		Type *ResolveTypeUID(lldb::user_id_t type_uid) override { return nullptr; }
llvm::Optional<ArrayInfo> GetDynamicArrayInfoForUID(		llvm::Optional<ArrayInfo> GetDynamicArrayInfoForUID(
lldb::user_id_t type_uid,		lldb::user_id_t type_uid,
const lldb_private::ExecutionContext *exe_ctx) override {		const lldb_private::ExecutionContext *exe_ctx) override {
return llvm::None;		return llvm::None;
}		}

bool CompleteType(CompilerType &compiler_type) override { return false; }		bool CompleteType(CompilerType &compiler_type) override { return false; }
uint32_t ResolveSymbolContext(const Address &so_addr,		uint32_t ResolveSymbolContext(const Address &so_addr,
		clayborgUnsubmitted Done Reply Inline Actions Remove whitespace only change clayborg: Remove whitespace only change
lldb::SymbolContextItem resolve_scope,		lldb::SymbolContextItem resolve_scope,
SymbolContext &sc) override;		SymbolContext &sc) override;

		uint32_t ResolveSymbolContext(const FileSpec &file_spec, uint32_t line,
		bool check_inlines,
		lldb::SymbolContextItem resolve_scope,
		SymbolContextList &sc_list) override;

size_t GetTypes(SymbolContextScope *sc_scope, lldb::TypeClass type_mask,		size_t GetTypes(SymbolContextScope *sc_scope, lldb::TypeClass type_mask,
TypeList &type_list) override {		TypeList &type_list) override {
return 0;		return 0;
}		}

uint32_t FindFunctions(const ConstString &name,		uint32_t FindFunctions(const ConstString &name,
const CompilerDeclContext *parent_decl_ctx,		const CompilerDeclContext *parent_decl_ctx,
lldb::FunctionNameType name_type_mask,		lldb::FunctionNameType name_type_mask,
Show All 23 Lines	public:
}		}

void AddSymbols(Symtab &symtab) override;		void AddSymbols(Symtab &symtab) override;

ConstString GetPluginName() override { return GetPluginNameStatic(); }		ConstString GetPluginName() override { return GetPluginNameStatic(); }
uint32_t GetPluginVersion() override { return 1; }		uint32_t GetPluginVersion() override { return 1; }

private:		private:
		// A class representing a position in the breakpad file. Useful for
		// remembering the position so we can go back to it later and parse more data.
		// Can be converted to/from a LineIterator, but it has a much smaller memory
		// footprint.
		struct Bookmark {
		uint32_t section;
		size_t offset;
		};
		clayborgUnsubmitted Not Done Reply Inline Actions Why do we need more than just a file character offset here? clayborg: Why do we need more than just a file character offset here?
		labathAuthorUnsubmitted Done Reply Inline Actions That's because ObjectFileBreakpad breaks down the file into sections (so all FILE records would go into one section, PUBLIC records into another, etc.). This means that we don't need any "bookmarks" when we want to jump straight to the PUBLIC records for instance, but it does mean we need two coordinates (section, offset) when we want to jump to a specific record within a section. labath: That's because ObjectFileBreakpad breaks down the file into sections (so all FILE records would…

		// At iterator class for simplifying algorithms reading data from the breakpad
		// file. It iterates over all records (lines) in the sections of a given type.
		// It also supports saving a specific position (via the GetBookmark() method)
		// and then resuming from it afterwards.
		class LineIterator;

		// Return an iterator range for all records in the given object file of the
		// given type.
		llvm::iterator_range<LineIterator> lines(Record::Kind section_type);

		// Breakpad files do not contain sufficient information to correctly
		// reconstruct compile units. The approach chosen here is to treat each
		// function as a compile unit. The compile unit name is the name if the first
		// line entry belonging to this function.
		// This class is our internal representation of a compile unit. It stores the
		// CompileUnit object and a bookmark pointing to the FUNC record of the
		// compile unit function. It also lazily construct the list of support files
		// and line table entries for the compile unit, when these are needed.
		class CompUnitData {
		public:
		CompUnitData(Bookmark bookmark) : bookmark(bookmark) {}

		CompUnitData() = default;
		CompUnitData(const CompUnitData &rhs) : bookmark(rhs.bookmark) {}
		CompUnitData &operator=(const CompUnitData &rhs) {
		bookmark = rhs.bookmark;
		support_files.reset();
		line_table_up.reset();
		return *this;
		}
		friend bool operator<(const CompUnitData &lhs, const CompUnitData &rhs) {
		return std::tie(lhs.bookmark.section, lhs.bookmark.offset) <
		std::tie(rhs.bookmark.section, rhs.bookmark.offset);
		clayborgUnsubmitted Not Done Reply Inline Actions Seems like if we just pick the file character offset of the function or the function's line entries as the lldb_private::CompileUnit user ID (lldb::user_id_t) then we don't really need this class? We just create the compile unit as a lldb_private::CompileUnit and our symbol file parser will fill in the rest? Any reason we need this CompUnitData class? clayborg: Seems like if we just pick the file character offset of the function or the function's line…
		labathAuthorUnsubmitted Done Reply Inline Actions I'd like the keep to enable the CompUnitData for conjugated parsing of line tables and support files. I think I can get rid of the compile unit field inside it, but that would mean relying on the SymbolVendor to conjure up the CompUnitSP when I need it, which is a bit of an odd dependency. (I need to conjure up the pointer from somewhere in the ResolveSymbolContext functions). labath: I'd like the keep to enable the CompUnitData for conjugated parsing of line tables and support…
		}

		Bookmark bookmark;
		llvm::Optional<FileSpecList> support_files;
		std::unique_ptr<LineTable> line_table_up;

		};

		SymbolVendor &GetSymbolVendor();
		lldb::addr_t GetBaseFileAddress();
		void ParseFileRecords();
		void ParseCUData();
		void ParseLineTableAndSupportFiles(CompileUnit &cu, CompUnitData &data);

		using CompUnitMap = RangeDataVector<lldb::addr_t, lldb::addr_t, CompUnitData>;

		llvm::Optional<std::vector<FileSpec>> m_files;
		llvm::Optional<CompUnitMap> m_cu_data;
};		};

} // namespace breakpad		} // namespace breakpad
} // namespace lldb_private		} // namespace lldb_private

#endif		#endif
		clayborgUnsubmitted Not Done Reply Inline Actions Could this just be: using CompUnitMap = RangeDataVector<lldb::addr_t, lldb::addr_t, lldb::offset_t>; Where offset_t is the character fie offset for the first line of the FUNC entry? Any reason to use CompUnitData instead of just creating lldb_private::CompileUnit objects? clayborg: Could this just be: ``` using CompUnitMap = RangeDataVector<lldb::addr_t, lldb::addr_t, lldb…
		clayborgUnsubmitted Not Done Reply Inline Actions Use FileSpecList? clayborg: Use FileSpecList?
		labathAuthorUnsubmitted Done Reply Inline Actions FileSpecList doesn't have the `resize` method. I use that to implement parsing of the FILE records, since the file records theoretically don't have to come in order, so I just resize the vector to fit the largest number I've encountered. labath: FileSpecList doesn't have the `resize` method. I use that to implement parsing of the FILE…

source/Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.cpp

//===-- SymbolFileBreakpad.cpp ----------------------------------- C++ --===//		//===-- SymbolFileBreakpad.cpp ----------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h"		#include "Plugins/SymbolFile/Breakpad/SymbolFileBreakpad.h"
#include "Plugins/ObjectFile/Breakpad/BreakpadRecords.h"		#include "Plugins/ObjectFile/Breakpad/BreakpadRecords.h"
#include "Plugins/ObjectFile/Breakpad/ObjectFileBreakpad.h"		#include "Plugins/ObjectFile/Breakpad/ObjectFileBreakpad.h"
#include "lldb/Core/Module.h"		#include "lldb/Core/Module.h"
#include "lldb/Core/PluginManager.h"		#include "lldb/Core/PluginManager.h"
#include "lldb/Core/Section.h"		#include "lldb/Core/Section.h"
#include "lldb/Host/FileSystem.h"		#include "lldb/Host/FileSystem.h"
		#include "lldb/Symbol/CompileUnit.h"
#include "lldb/Symbol/ObjectFile.h"		#include "lldb/Symbol/ObjectFile.h"
		#include "lldb/Symbol/SymbolVendor.h"
#include "lldb/Symbol/TypeMap.h"		#include "lldb/Symbol/TypeMap.h"
#include "lldb/Utility/Log.h"		#include "lldb/Utility/Log.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"

using namespace lldb;		using namespace lldb;
using namespace lldb_private;		using namespace lldb_private;
using namespace lldb_private::breakpad;		using namespace lldb_private::breakpad;

namespace {		class SymbolFileBreakpad::LineIterator {
class LineIterator {
public:		public:
// begin iterator for sections of given type		// begin iterator for sections of given type
LineIterator(ObjectFile &obj, Record::Kind section_type)		LineIterator(ObjectFile &obj, Record::Kind section_type)
: m_obj(&obj), m_section_type(toString(section_type)),		: m_obj(&obj), m_section_type(toString(section_type)),
m_next_section_idx(0) {		m_next_section_idx(0), m_next_line(llvm::StringRef::npos) {
++*this;		++*this;
}		}

		// An iterator starting at the position given by the bookmark.
		LineIterator(ObjectFile &obj, Record::Kind section_type, Bookmark bookmark);

// end iterator		// end iterator
explicit LineIterator(ObjectFile &obj)		explicit LineIterator(ObjectFile &obj)
: m_obj(&obj),		: m_obj(&obj),
m_next_section_idx(m_obj->GetSectionList()->GetNumSections(0)) {}		m_next_section_idx(m_obj->GetSectionList()->GetNumSections(0)),
		m_current_line(llvm::StringRef::npos),
		m_next_line(llvm::StringRef::npos) {}

friend bool operator!=(const LineIterator &lhs, const LineIterator &rhs) {		friend bool operator!=(const LineIterator &lhs, const LineIterator &rhs) {
assert(lhs.m_obj == rhs.m_obj);		assert(lhs.m_obj == rhs.m_obj);
if (lhs.m_next_section_idx != rhs.m_next_section_idx)		if (lhs.m_next_section_idx != rhs.m_next_section_idx)
return true;		return true;
if (lhs.m_next_text.data() != rhs.m_next_text.data())		if (lhs.m_current_line != rhs.m_current_line)
return true;		return true;
assert(lhs.m_current_text == rhs.m_current_text);		assert(lhs.m_next_line == rhs.m_next_line);
assert(rhs.m_next_text == rhs.m_next_text);
return false;		return false;
}		}

const LineIterator &operator++();		const LineIterator &operator++();
llvm::StringRef operator*() const { return m_current_text; }		llvm::StringRef operator*() const {
		return m_section_text.slice(m_current_line, m_next_line);
		}

		Bookmark GetBookmark() const {
		return Bookmark{m_next_section_idx, m_current_line};
		}

private:		private:
ObjectFile *m_obj;		ObjectFile *m_obj;
ConstString m_section_type;		ConstString m_section_type;
uint32_t m_next_section_idx;		uint32_t m_next_section_idx;
llvm::StringRef m_current_text;		llvm::StringRef m_section_text;
llvm::StringRef m_next_text;		size_t m_current_line;
		size_t m_next_line;

		void FindNextLine() {
		m_next_line = m_section_text.find('\n', m_current_line);
		if (m_next_line != llvm::StringRef::npos) {
		++m_next_line;
		if (m_next_line >= m_section_text.size())
		m_next_line = llvm::StringRef::npos;
		}
		}
};		};
} // namespace

const LineIterator &LineIterator::operator++() {		SymbolFileBreakpad::LineIterator::LineIterator(ObjectFile &obj,
		Record::Kind section_type,
		Bookmark bookmark)
		: m_obj(&obj), m_section_type(toString(section_type)),
		m_next_section_idx(bookmark.section), m_current_line(bookmark.offset) {
		Section &sect =
		*obj.GetSectionList()->GetSectionAtIndex(m_next_section_idx - 1);
		assert(sect.GetName() == m_section_type);

		DataExtractor data;
		obj.ReadSectionData(&sect, data);
		m_section_text = toStringRef(data.GetData());

		assert(m_current_line < m_section_text.size());
		FindNextLine();
		}

		const SymbolFileBreakpad::LineIterator &
		SymbolFileBreakpad::LineIterator::operator++() {
const SectionList &list = *m_obj->GetSectionList();		const SectionList &list = *m_obj->GetSectionList();
size_t num_sections = list.GetNumSections(0);		size_t num_sections = list.GetNumSections(0);
while (m_next_text.empty() && m_next_section_idx < num_sections) {		while (m_next_line != llvm::StringRef::npos \|\|
		m_next_section_idx < num_sections) {
		if (m_next_line != llvm::StringRef::npos) {
		m_current_line = m_next_line;
		FindNextLine();
		return *this;
		}

Section &sect = *list.GetSectionAtIndex(m_next_section_idx++);		Section &sect = *list.GetSectionAtIndex(m_next_section_idx++);
if (sect.GetName() != m_section_type)		if (sect.GetName() != m_section_type)
continue;		continue;
DataExtractor data;		DataExtractor data;
m_obj->ReadSectionData(&sect, data);		m_obj->ReadSectionData(&sect, data);
m_next_text =		m_section_text = toStringRef(data.GetData());
llvm::StringRef(reinterpret_cast<const char *>(data.GetDataStart()),		m_next_line = 0;
data.GetByteSize());
}		}
std::tie(m_current_text, m_next_text) = m_next_text.split('\n');		// We've reached the end.
		m_current_line = m_next_line;
return *this;		return *this;
}		}

static llvm::iterator_range<LineIterator> lines(ObjectFile &obj,		llvm::iterator_range<SymbolFileBreakpad::LineIterator>
Record::Kind section_type) {		SymbolFileBreakpad::lines(Record::Kind section_type) {
return llvm::make_range(LineIterator(obj, section_type), LineIterator(obj));		return llvm::make_range(LineIterator(*m_obj_file, section_type),
		LineIterator(*m_obj_file));
		}

		namespace {
		// A helper class for constructing the list of support files for a given compile
		// unit.
		class SupportFileMap {
		public:
		// Given a breakpad file ID, return a file ID to be used in the support files
		// for this compile unit.
		size_t operator[](size_t file) {
		return m_map.try_emplace(file, m_map.size() + 1).first->second;
		}

		// Construct a FileSpecList containing only the support files relevant for
		// this compile unit (in the correct order).
		FileSpecList translate(const FileSpec &cu_spec,
		llvm::ArrayRef<FileSpec> all_files);

		private:
		llvm::DenseMap<size_t, size_t> m_map;
		};
		} // namespace

		FileSpecList SupportFileMap::translate(const FileSpec &cu_spec,
		llvm::ArrayRef<FileSpec> all_files) {
		std::vector<FileSpec> result;
		result.resize(m_map.size() + 1);
		result[0] = cu_spec;
		for (const auto &KV : m_map) {
		if (KV.first < all_files.size())
		result[KV.second] = all_files[KV.first];
		}
		return FileSpecList(std::move(result));
}		}

void SymbolFileBreakpad::Initialize() {		void SymbolFileBreakpad::Initialize() {
PluginManager::RegisterPlugin(GetPluginNameStatic(),		PluginManager::RegisterPlugin(GetPluginNameStatic(),
GetPluginDescriptionStatic(), CreateInstance,		GetPluginDescriptionStatic(), CreateInstance,
DebuggerInitialize);		DebuggerInitialize);
}		}

void SymbolFileBreakpad::Terminate() {		void SymbolFileBreakpad::Terminate() {
PluginManager::UnregisterPlugin(CreateInstance);		PluginManager::UnregisterPlugin(CreateInstance);
}		}

ConstString SymbolFileBreakpad::GetPluginNameStatic() {		ConstString SymbolFileBreakpad::GetPluginNameStatic() {
static ConstString g_name("breakpad");		static ConstString g_name("breakpad");
return g_name;		return g_name;
}		}

uint32_t SymbolFileBreakpad::CalculateAbilities() {		uint32_t SymbolFileBreakpad::CalculateAbilities() {
if (!m_obj_file)		if (!m_obj_file)
return 0;		return 0;
if (m_obj_file->GetPluginName() != ObjectFileBreakpad::GetPluginNameStatic())		if (m_obj_file->GetPluginName() != ObjectFileBreakpad::GetPluginNameStatic())
return 0;		return 0;

return CompileUnits \| Functions;		return CompileUnits \| Functions \| LineTables;
}		}

uint32_t SymbolFileBreakpad::GetNumCompileUnits() {		uint32_t SymbolFileBreakpad::GetNumCompileUnits() {
// TODO		ParseCUData();
return 0;		return m_cu_data->GetSize();
}		}

CompUnitSP SymbolFileBreakpad::ParseCompileUnitAtIndex(uint32_t index) {		CompUnitSP SymbolFileBreakpad::ParseCompileUnitAtIndex(uint32_t index) {
		clayborgUnsubmitted Done Reply Inline Actions I would vote to make 1 compile unit for each "FUNC". The hard part will be to select the right source file for each function. I would just start by selecting the first line entry for the function. Compile units are expected to give a list of support files, and in this case, you would make a set of files that are in the line entries only. So if you had: MODULE Linux x86_64 761550E08086333960A9074A9CE2895C0 a.out INFO CODE_ID E05015768680393360A9074A9CE2895C FILE 0 /tmp/a.c FILE 1 /tmp/b.c FILE 2 /usr/include/foo.h FUNC b0 10 0 func1 b0 1 1 0 b1 1 2 0 b2 1 2 1 b4 1 3 1 FUNC b0 10 0 func2 b0 1 1 1 b1 1 2 1 b2 1 2 2 We would have a compile unit for "func1" with a cu for "/tmp/a.c" and with support files: support_file[0] = "/tmp/a.c" support_file[1] = "/usr/include/foo.h" We would have a compile unit for "func2" with a cu for "/tmp/b.c" and with support files: support_file[0] = "/tmp/b.c" support_file[1] = "/usr/include/foo.h" The main reason for make individual compile units, is LLDB treats a compile unit specially when settings breakpoints depending on if we ask for inline functions to be set. If we set the following setting: (lldb) settings set target.inline-breakpoint-strategy never Then we will check the name of the compile unit to ensure it matches. So if we did: (lldb) b a.c:12 This would only work if the actual lldb_private::CompileUnit has a FileSpec that matches "a.c". clayborg: I would vote to make 1 compile unit for each "FUNC". The hard part will be to select the right…
// TODO		if (index >= m_cu_data->GetSize())
		clayborgUnsubmitted Not Done Reply Inline Actions This seems like where we would do the heavy parsing code that are in the initialize object function. .Get the character file offset from m_compile_units and just parse what we need here? It will cause less work for us in the initialize object call then and we can be lazier clayborg: This seems like where we would do the heavy parsing code that are in the initialize object…
return nullptr;		return nullptr;

		clayborgUnsubmitted Not Done Reply Inline Actions Validate "index" first? We will crash if someone iterates over too many CUs? clayborg: Validate "index" first? We will crash if someone iterates over too many CUs?
		CompUnitData &data = m_cu_data->GetEntryRef(index).data;
		clayborgUnsubmitted Done Reply Inline Actions parse a compile unit per function. We might want to cache all "FILE" entries in a list inside the SymbolFileBreakpad so we can easily pull out the FileSpecs when creating each compile unit. Also, each compile unit's ID can be the line number in the breakpad file to the "FUNC" entry. This allows easy access to each "FUNC" entry in the breakpad file when we are asked to parse more information about it (get compile unit support files, or any parsing of info for a compile unit. clayborg: parse a compile unit per function. We might want to cache all "FILE" entries in a list inside…

		ParseFileRecords();

		FileSpec spec;

		// The FileSpec of the compile unit will be the file corresponding to the
		// first LINE record.
		LineIterator It(m_obj_file, Record::Func, data.bookmark), End(m_obj_file);
		assert(Record::classify(*It) == Record::Func);
		++It; // Skip FUNC record.
		if (It != End) {
		auto record = LineRecord::parse(*It);
		if (record && record->FileNum < m_files->size())
		spec = (*m_files)[record->FileNum];
		}

		auto cu_sp = std::make_shared<CompileUnit>(m_obj_file->GetModule(),
		/user_data/ nullptr, spec, index,
		eLanguageTypeUnknown,
		/is_optimized/ eLazyBoolNo);

		clayborgUnsubmitted Not Done Reply Inline Actions Do we need to iterate over the file multiple times here? We do it once here, and then once on line 260. clayborg: Do we need to iterate over the file multiple times here? We do it once here, and then once on…
		labathAuthorUnsubmitted Done Reply Inline Actions The two loops iterate over different parts of the file. The first one goes through the FILE records, and this one does the FUNC records. So the iteration here is efficient because we already know where different kinds of records are located in the file. (of course, to figure out where these records are located, we've had to go through it once already (in ObjectFileBreakpad), so we still have to make two passes over this data in general. However, that is pretty much unavoidable if we want to do lazy (i.e. random access) into the file as it doesn't have any kind of index to start with.) labath: The two loops iterate over different parts of the file. The first one goes through the FILE…
		GetSymbolVendor().SetCompileUnitAtIndex(index, cu_sp);
		return cu_sp;
		clayborgUnsubmitted Done Reply Inline Actions return the count of the number of "FUNC" objects clayborg: return the count of the number of "FUNC" objects
}		}

size_t SymbolFileBreakpad::ParseFunctions(CompileUnit &comp_unit) {		size_t SymbolFileBreakpad::ParseFunctions(CompileUnit &comp_unit) {
// TODO		// TODO
return 0;		return 0;
}		}

bool SymbolFileBreakpad::ParseLineTable(CompileUnit &comp_unit) {		bool SymbolFileBreakpad::ParseLineTable(CompileUnit &comp_unit) {
// TODO		CompUnitData &data = m_cu_data->GetEntryRef(comp_unit.GetID()).data;
return 0;
		if (!data.line_table_up)
		ParseLineTableAndSupportFiles(comp_unit, data);

		comp_unit.SetLineTable(data.line_table_up.release());
		return true;
		clayborgUnsubmitted Not Done Reply Inline Actions Ditto above clayborg: Ditto above
		}

		bool SymbolFileBreakpad::ParseSupportFiles(CompileUnit &comp_unit,
		FileSpecList &support_files) {
		clayborgUnsubmitted Not Done Reply Inline Actions Seems like we should just populate the m_compile_units data with address range to character file offset here? When we are asked to create a compile unit, we do this work by going to the "lldb_private::CompileUnit::GetID()" which will return the file offset and we just head there and start parsing? clayborg: Seems like we should just populate the m_compile_units data with address range to character…
		labathAuthorUnsubmitted Done Reply Inline Actions I think I could avoid creating the CompileUnit object here. However, I will still need to do the parsing here, as I will need to figure out the number of compile units first (best I might be able to achieve is to delay this until GetNumCompileUnits() time). labath: I think I could avoid creating the CompileUnit object here. However, I will still need to do…
		CompUnitData &data = m_cu_data->GetEntryRef(comp_unit.GetID()).data;
		if (!data.support_files)
		ParseLineTableAndSupportFiles(comp_unit, data);

		support_files = std::move(*data.support_files);
		return true;
		clayborgUnsubmitted Not Done Reply Inline Actions From the compile unit, if GetID() returns the character file offset to the FUNC or first LINE, then we don't need the preparsed CompUnitData? We can just parse the line table here if and only if we need to clayborg: From the compile unit, if GetID() returns the character file offset to the FUNC or first LINE…
		labathAuthorUnsubmitted Done Reply Inline Actions That is pretty much what happens here. CompUnitData construct the line table (almost) lazily. It doesn't preparse. The reason I have this indirection, is that the creation of line tables is coupled with the creation of the support file list: in order to build the line table, I (obviously) need to go through the LINE records however, I also need to go through the LINE records in order to build the CU file list, because I need to know what files are actually used in this "CU" It seemed like a good idea to me to avoid parsing the LINE records twice. So what I've done is that on the first call to (ReleaseLineTable\|ReleaseSupportFiles), CompUnitData will parse both things. Then, the second call will return the already parsed data. That seems like a good tradeoff to me as these two items are generally used together (one is fairly useless without the other). labath: That is pretty much what happens here. CompUnitData construct the line table (almost) lazily.
}		}
		labathAuthorUnsubmitted Done Reply Inline Actions Note that here i set `file=column=line=0` for the terminal entry, which isn't consistent with the dwarf plugin for instance (it puts there whatever falls out of the state automaton, which most likely means the values from the previous entry). AFAICT, this shouldn't be a problem, because the terminal entry is there to just determine the range of the last real entry. labath: Note that here i set `file=column=line=0` for the terminal entry, which isn't consistent with…

uint32_t		uint32_t
SymbolFileBreakpad::ResolveSymbolContext(const Address &so_addr,		SymbolFileBreakpad::ResolveSymbolContext(const Address &so_addr,
SymbolContextItem resolve_scope,		SymbolContextItem resolve_scope,
SymbolContext &sc) {		SymbolContext &sc) {
// TODO		if (!(resolve_scope & (eSymbolContextCompUnit \| eSymbolContextLineEntry)))
return 0;		return 0;

		ParseCUData();
		uint32_t idx =
		m_cu_data->FindEntryIndexThatContains(so_addr.GetFileAddress());
		clayborgUnsubmitted Not Done Reply Inline Actions We need to search each compile unit to see which compile unit contains the address now. clayborg: We need to search each compile unit to see which compile unit contains the address now.
		if (idx == UINT32_MAX)
		return 0;

		sc.comp_unit = GetSymbolVendor().GetCompileUnitAtIndex(idx).get();
		SymbolContextItem result = eSymbolContextCompUnit;
		if (resolve_scope & eSymbolContextLineEntry) {
		if (sc.comp_unit->GetLineTable()->FindLineEntryByAddress(so_addr,
		sc.line_entry)) {
		result \|= eSymbolContextLineEntry;
		}
		}

		return result;
		}

		uint32_t SymbolFileBreakpad::ResolveSymbolContext(
		const FileSpec &file_spec, uint32_t line, bool check_inlines,
		clayborgUnsubmitted Done Reply Inline Actions It would be nice to put this parsing code into the lldb_private::breakpad::Line" class I talked about in the other patch? It would be great if this code looked like: lldb_private::breakpad::Line bp_line(line); switch (bp_line.GetToken()) { case Token::Func: // Create the compile unit and store into cu_sp cu_sp = bp_line.CreateCompileUnit(); break; case Token::Line: { addr_t address; size_t size; uint32_t line_num, file_num; if (bp_line.ParseLineEntry(address, size, line_num, file_num)) { // Discontiguous entries. Finish off the previous sequence and reset. if (next_addr && next_addr != address) finish_sequence(); line_table_up->AppendLineEntryToSequence( line_seq_up.get(), address, line_num, 0, file_num, true, false, false, false, false); } clayborg:* It would be nice to put this parsing code into the lldb_private::breakpad::Line" class I talked…
		lldb::SymbolContextItem resolve_scope, SymbolContextList &sc_list) {
		if (!(resolve_scope & eSymbolContextCompUnit))
		return 0;

		uint32_t old_size = sc_list.GetSize();
		for (size_t i = 0, size = GetNumCompileUnits(); i < size; ++i) {
		CompileUnit &cu = *GetSymbolVendor().GetCompileUnitAtIndex(i);
		cu.ResolveSymbolContext(file_spec, line, check_inlines,
		/exact/ false, resolve_scope, sc_list);
		}
		return sc_list.GetSize() - old_size;
}		}

uint32_t SymbolFileBreakpad::FindFunctions(		uint32_t SymbolFileBreakpad::FindFunctions(
const ConstString &name, const CompilerDeclContext *parent_decl_ctx,		const ConstString &name, const CompilerDeclContext *parent_decl_ctx,
FunctionNameType name_type_mask, bool include_inlines, bool append,		FunctionNameType name_type_mask, bool include_inlines, bool append,
SymbolContextList &sc_list) {		SymbolContextList &sc_list) {
// TODO		// TODO
if (!append)		if (!append)
sc_list.Clear();		sc_list.Clear();
		clayborgUnsubmitted Done Reply Inline Actions This function will need to populate support_files for a given FUNC as mentioned above clayborg: This function will need to populate support_files for a given FUNC as mentioned above
return sc_list.GetSize();		return sc_list.GetSize();
}		}

uint32_t SymbolFileBreakpad::FindFunctions(const RegularExpression &regex,		uint32_t SymbolFileBreakpad::FindFunctions(const RegularExpression &regex,
bool include_inlines, bool append,		bool include_inlines, bool append,
SymbolContextList &sc_list) {		SymbolContextList &sc_list) {
// TODO		// TODO
if (!append)		if (!append)
Show All 16 Lines	SymbolFileBreakpad::FindTypes(const std::vector<CompilerContext> &context,
if (!append)		if (!append)
types.Clear();		types.Clear();
return types.GetSize();		return types.GetSize();
}		}

void SymbolFileBreakpad::AddSymbols(Symtab &symtab) {		void SymbolFileBreakpad::AddSymbols(Symtab &symtab) {
Log *log = GetLogIfAllCategoriesSet(LIBLLDB_LOG_SYMBOLS);		Log *log = GetLogIfAllCategoriesSet(LIBLLDB_LOG_SYMBOLS);
Module &module = *m_obj_file->GetModule();		Module &module = *m_obj_file->GetModule();
addr_t base = module.GetObjectFile()->GetBaseAddress().GetFileAddress();		addr_t base = GetBaseFileAddress();
if (base == LLDB_INVALID_ADDRESS) {		if (base == LLDB_INVALID_ADDRESS) {
LLDB_LOG(log, "Unable to fetch the base address of object file. Skipping "		LLDB_LOG(log, "Unable to fetch the base address of object file. Skipping "
"symtab population.");		"symtab population.");
return;		return;
}		}

const SectionList &list = *module.GetSectionList();		const SectionList &list = *module.GetSectionList();
llvm::DenseMap<addr_t, Symbol> symbols;		llvm::DenseMap<addr_t, Symbol> symbols;
Show All 12 Lines	symbols.try_emplace(
address, /symID/ 0, Mangled(name, /is_mangled/ false),		address, /symID/ 0, Mangled(name, /is_mangled/ false),
eSymbolTypeCode, /is_global/ true, /is_debug/ false,		eSymbolTypeCode, /is_global/ true, /is_debug/ false,
/is_trampoline/ false, /is_artificial/ false,		/is_trampoline/ false, /is_artificial/ false,
AddressRange(section_sp, address - section_sp->GetFileAddress(),		AddressRange(section_sp, address - section_sp->GetFileAddress(),
size.getValueOr(0)),		size.getValueOr(0)),
size.hasValue(), /contains_linker_annotations/ false, /flags/ 0);		size.hasValue(), /contains_linker_annotations/ false, /flags/ 0);
};		};

for (llvm::StringRef line : lines(*m_obj_file, Record::Func)) {		for (llvm::StringRef line : lines(Record::Func)) {
if (auto record = FuncRecord::parse(line))		if (auto record = FuncRecord::parse(line))
add_symbol(record->Address, record->Size, record->Name);		add_symbol(record->Address, record->Size, record->Name);
}		}

for (llvm::StringRef line : lines(*m_obj_file, Record::Public)) {		for (llvm::StringRef line : lines(Record::Public)) {
if (auto record = PublicRecord::parse(line))		if (auto record = PublicRecord::parse(line))
add_symbol(record->Address, llvm::None, record->Name);		add_symbol(record->Address, llvm::None, record->Name);
else		else
LLDB_LOG(log, "Failed to parse: {0}. Skipping record.", line);		LLDB_LOG(log, "Failed to parse: {0}. Skipping record.", line);
}		}

for (auto &KV : symbols)		for (auto &KV : symbols)
symtab.AddSymbol(std::move(KV.second));		symtab.AddSymbol(std::move(KV.second));
symtab.CalculateSymbolSizes();		symtab.CalculateSymbolSizes();
}		}

		SymbolVendor &SymbolFileBreakpad::GetSymbolVendor() {
		return *m_obj_file->GetModule()->GetSymbolVendor();
		}

		addr_t SymbolFileBreakpad::GetBaseFileAddress() {
		return m_obj_file->GetModule()
		->GetObjectFile()
		->GetBaseAddress()
		.GetFileAddress();
		}

		// Parse out all the FILE records from the breakpad file. These will be needed
		// when constructing the support file lists for individual compile units.
		void SymbolFileBreakpad::ParseFileRecords() {
		if (m_files)
		return;
		m_files.emplace();

		Log *log = GetLogIfAllCategoriesSet(LIBLLDB_LOG_SYMBOLS);
		for (llvm::StringRef line : lines(Record::File)) {
		auto record = FileRecord::parse(line);
		if (!record) {
		LLDB_LOG(log, "Failed to parse: {0}. Skipping record.", line);
		continue;
		}

		if (record->Number >= m_files->size())
		m_files->resize(record->Number + 1);
		(*m_files)[record->Number] = FileSpec(record->Name);
		}
		}

		void SymbolFileBreakpad::ParseCUData() {
		if (m_cu_data)
		return;

		m_cu_data.emplace();
		Log *log = GetLogIfAllCategoriesSet(LIBLLDB_LOG_SYMBOLS);
		addr_t base = GetBaseFileAddress();
		if (base == LLDB_INVALID_ADDRESS) {
		LLDB_LOG(log, "SymbolFile parsing failed: Unable to fetch the base address "
		"of object file.");
		}

		// We shall create one compile unit for each FUNC record. So, count the number
		// of FUNC records, and store them in m_cu_data, together with their ranges.
		for (LineIterator It(m_obj_file, Record::Func), End(m_obj_file); It != End;
		++It) {
		if (auto record = FuncRecord::parse(*It)) {
		m_cu_data->Append(CompUnitMap::Entry(base + record->Address, record->Size,
		CompUnitData(It.GetBookmark())));
		} else
		LLDB_LOG(log, "Failed to parse: {0}. Skipping record.", *It);
		}
		m_cu_data->Sort();
		}

		// Construct the list of support files and line table entries for the given
		// compile unit.
		void SymbolFileBreakpad::ParseLineTableAndSupportFiles(CompileUnit &cu,
		CompUnitData &data) {
		addr_t base = GetBaseFileAddress();
		assert(base != LLDB_INVALID_ADDRESS &&
		"How did we create compile units without a base address?");

		SupportFileMap map;
		data.line_table_up = llvm::make_unique<LineTable>(&cu);
		std::unique_ptr<LineSequence> line_seq_up(
		data.line_table_up->CreateLineSequenceContainer());
		llvm::Optional<addr_t> next_addr;
		auto finish_sequence = [&]() {
		data.line_table_up->AppendLineEntryToSequence(
		line_seq_up.get(), next_addr, /line/ 0, /column*/ 0,
		/file_idx/ 0, /is_start_of_statement/ false,
		/is_start_of_basic_block/ false, /is_prologue_end/ false,
		/is_epilogue_begin/ false, /is_terminal_entry/ true);
		data.line_table_up->InsertSequence(line_seq_up.get());
		line_seq_up->Clear();
		};

		LineIterator It(m_obj_file, Record::Func, data.bookmark), End(m_obj_file);
		assert(Record::classify(*It) == Record::Func);
		for (++It; It != End; ++It) {
		auto record = LineRecord::parse(*It);
		if (!record)
		break;

		record->Address += base;

		if (next_addr && *next_addr != record->Address) {
		// Discontiguous entries. Finish off the previous sequence and reset.
		finish_sequence();
		}
		data.line_table_up->AppendLineEntryToSequence(
		line_seq_up.get(), record->Address, record->LineNum, /column/ 0,
		map[record->FileNum], /is_start_of_statement/ true,
		/is_start_of_basic_block/ false, /is_prologue_end/ false,
		/is_epilogue_begin/ false, /is_terminal_entry/ false);
		next_addr = record->Address + record->Size;
		}
		if (next_addr)
		finish_sequence();
		data.support_files = map.translate(cu, *m_files);
		}