This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
-
CommandGuide/
-
llvm-symbolizer.rst
1/2
ReleaseNotes.rst
-
include/llvm/DebugInfo/Symbolize/
-
llvm/
-
DebugInfo/
-
Symbolize/
1/1
DIPrinter.h
-
SymbolizableModule.h
-
SymbolizableObjectFile.h
1/1
Symbolize.h
-
lib/DebugInfo/Symbolize/
-
DebugInfo/
-
Symbolize/
2/3
DIPrinter.cpp
2/3
SymbolizableObjectFile.cpp
5/9
Symbolize.cpp
-
test/tools/llvm-symbolizer/
-
tools/
-
llvm-symbolizer/
-
Inputs/
-
addr.inp
-
discrim.inp
-
debuginfod.test
1
flag-grouping.test
-
flush-output.s
-
invalid-input-address.test
1/2
output-style-empty-line.test
1/2
output-style-json-code.test
-
sym-verbose.test
1
sym.test
6/25
symbol-search.test
-
tools/llvm-symbolizer/
-
llvm-symbolizer/
7/7
llvm-symbolizer.cpp
-
unittests/ProfileData/
-
ProfileData/
-
MemProfTest.cpp

Differential D149759

[symbolizer] Support symbol lookup
ClosedPublic

Authored by sepavloff on May 3 2023, 9:13 AM.

Download Raw Diff

Details

Reviewers

jhenderson
dblaikie
mysterymath
MaskRay
ikudrin
dvyukov

Commits

rGe144ae54dcb9: [symbolizer] Support symbol lookup
rG2b27948783e4: [symbolizer] Support symbol lookup

Summary

Recent versions of GNU binutils starting from 2.39 support symbol+offset
lookup in addition to the usual numeric address lookup. This change adds
symbol lookup to llvm-symbolize and llvm-addr2line.

Now llvm-symbolize behaves closer to GNU addr2line, - if the value specified
as address in command line or input stream is not a number, it is treated as
a symbol name. For example:

llvm-symbolize --obj=abc.so func_22
llvm-symbolize --obj=abc.so "CODE func_22"

This lookup is now supported only for functions. Specification with
offset is not supported yet.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision.May 3 2023, 9:13 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 3 2023, 9:13 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

sepavloff requested review of this revision.May 3 2023, 9:13 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 3 2023, 9:13 AM

sepavloff mentioned this in D139859: [symbolizer] Support symbol+offset lookup.May 3 2023, 9:20 AM

Harbormaster completed remote builds in B229712: Diff 519106.May 3 2023, 9:47 AM

Ping.

I believe it would be much better not to add the new mode (SYMBOL), but to support the new way of specifying an address for CODE.

Please, don't forget to reflected the changes in llvm/docs/CommandGuide/llvm-symbolizer.rst and probably llvm/docs/CommandGuide/llvm-addr2line.rst.

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
225–227	I don't think we need a new point of divergence here. It's unlikely that anyone would rely on the tool to generate an error for such an input.

Completely agree with what @ikudrin said. We don't need to maintain backwards compatibility between versions of our tools if the old behaviour didn't make much sense (i.e. treating non-numbers as symbols is a good thing, and the old behaviour of just echoing the input wasn't particularly useful). I also don't think we need a new SYMBOL directive (assuming GNU addr2line doesn't support it anyway), unless there's no practical way to make symbols work in the CODE directive (but I don't see why there wouldn't be).

Address reviewers' notes

Removed command SYMBOL,
Support symbol lookup in llvm-symbolizer as well.

Harbormaster completed remote builds in B243123: Diff 537212.Jul 4 2023, 9:39 PM

jhenderson added inline comments.Jul 5 2023, 1:45 AM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
69	I'm thinking this might be better named `IsGNUStyle`, since llvm-symbolizer has an `--output-style` option, and this variable controls how the output is formatted.
llvm/lib/DebugInfo/Symbolize/SymbolizableObjectFile.cpp
357	Nit: This should probably have braces, as it is a non-trivial statement (even if it is only a single statement technically). See https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements for more details.
llvm/test/tools/llvm-symbolizer/Inputs/addr2.inp
1 ↗	(On Diff #537212)	Please don't add a separate input file that is only used by one test. The llvm-symbolizer tests could do with a bit of cleaning-up. Using a separate input file for text input is one such example.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
467	This should be controlled by the output-style command-line option (see my comment elsewhere).

jhenderson added inline comments.Jul 5 2023, 1:45 AM

llvm/test/tools/llvm-symbolizer/output-style-empty-line.test
16–19	I don't think these check-suffixes are particularly understandable any more, given the changed behaviour. I'd suggest instead renaming them all and moving the check patterns to be immediately after the group that uses them. Something like: RUN: llvm-symbolizer -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-LLVM RUN: llvm-symbolizer --output-style=LLVM -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-LLVM SYMB-LLVM: x.c:14:0 SYMB-LLVM-EMPTY: SYMB-LLVM-NEXT: some text2 RUN: llvm-symbolizer --output-style=GNU -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-GNU SYMB-GNU: x.c:14 SYMB-GNU-NEXT: some text2 RUN: llvm-addr2line -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-GNU RUN: llvm-addr2line --output-style=GNU -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-GNU ADDR-GNU: x.c:14 ADDR-GNU-NEXT: ??:0 RUN: llvm-addr2line --output-style=LLVM -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-LLVM ADDR-LLVM: x.c:14:0 ADDR-LLVM-EMPTY: ADDR-LLVM-NEXT: ??:0
llvm/test/tools/llvm-symbolizer/output-style-json-code.test
50	I think you could make this a little more self descriptive, by changing your invalid input to something like "0 not a symbol name or number". That being said, the intent of this test, I think, was to use the exact same set of inputs as one that doesn't use JSON style.
llvm/test/tools/llvm-symbolizer/symbol-search.test
2	You're using a weird mix of comment markers in this file. The standard rules in newer tools tests are `##` for actual comments; `#` for RUN and CHECK lines; all comment markers should have a space between them and the rest of the line (e.g. `# CHECK:` or `## This is a comment`). optionally, if the test consists solely of things prefixed with a comment marker, you can drop one `#` from each of the above, but it's more common to have them than not. In addition, this test should have a comment explaining what the test is testing, probably before this comment.
5	I can't tell what "PRFUNC" is supposed to stand for. It might be useful to add a comment before each test case explaining what that specific case is testing.
10	Why is this test case epeated twice?
18	It would be better for both "nonexistent" cases to have a suffix, rather than just the llvm-symbolizer one.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
204
212	This whole area of code could do with some blank lines to aid readability. I would suggest that you have them between each group of related lines, where "related" means a comment, one if statement, and any lines strongly related to that if statement (e.g. variable declarations). For example, I'd add a line before the comment about 0x prefixes, and another one after the if's closing brace.
225–227	+1: I don't think we need to handle invalid input differently between llvm-symbolizer and llvm-addr2line specifically, as long as the updated behaviour makes sense.

ikudrin added inline comments.Jul 6 2023, 5:55 PM

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
266–269	The conditions with fewer negations are much easier to understand.

Address reviewers' notes

Harbormaster completed remote builds in B243752: Diff 538109.Jul 7 2023, 6:01 AM

sepavloff marked 9 inline comments as done.Jul 7 2023, 6:10 AM

sepavloff added inline comments.

llvm/test/tools/llvm-symbolizer/output-style-empty-line.test
16–19	Yes, it looks better. Thank you!
llvm/test/tools/llvm-symbolizer/symbol-search.test
2	Thank you for eplanations. I modernized also sym.test, it didn't follow these rules.
10	It should be `llvm-addr2line` and `llvm-symbolizer` variants. Fixed.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
467	Initialization of `Config.IsGNUStyle` is now below in this function.

Make response of llvm-symbolizer on invalid input with spaces identical to llvm-addr2line

sepavloff marked 2 inline comments as done.Jul 7 2023, 10:20 AM

Harbormaster completed remote builds in B243807: Diff 538185.Jul 7 2023, 10:21 AM

Ping.

sepavloff edited the summary of this revision. (Show Details)Jul 18 2023, 11:52 AM

Sorry for the delay - I was away last week, and am still catching up on reviews.

llvm/test/tools/llvm-symbolizer/flag-grouping.test
6	Let's match the full line in all of these cases where the text has changed.
llvm/test/tools/llvm-symbolizer/symbol-search.test
2	Can I ask you to spin off sym.test's updates into a new review? I've got no objection to the update (sym.test itself is a little bit archaic), but it's not really related to this patch. Also "when address" -> "when an address"
5	The individual sub cases within this test file could still do with some comments to make their purpose clearer. I think I follow it better now with the recent improvements, but more clarity would help.
9	I forgot to mention that I prefer `-` to `_` in prefix names. It avoids weird awkardness like `CODE_CMD-NEXT` for example, and `-` is easier to type on an English keyboard :D
19	`ADDR` is probably not a good abbreviation for `addr2line` since it could be confused with `address` which is a a very relevant term to these tests. Perhaps `A2L`? Apologies for the churn.

Address reviewer's notes

Harbormaster completed remote builds in B247260: Diff 542971.Jul 21 2023, 9:41 AM

jhenderson added inline comments.Jul 25 2023, 12:42 AM

llvm/test/tools/llvm-symbolizer/sym.test
3–1	I generally would avoid double blank lines. My personal rule is similar to what I follow in C++ code: No blank line if the comment is tightly linked to the immediately following block. One blank line if the comment is more general (e.g. it applies to a wider area of code/isn't really targeted at a specific block etc).
llvm/test/tools/llvm-symbolizer/symbol-search.test
8	Prefer a slightly longer form that says something like "Show that the "CODE" command supports search by symbol name." Same goes for below comments.
9	See my above comment: as this comment is tightly tied to the test case that immediately follows, it doesn't need a blank line (and regardless, blank lines generally don't have a comment marker, unless they are separating paragraphs within a longer comment).
14–15	I don't think you should reference GNU here - that's a motivation for the behaviour, but not for the test case for that behaviour. Rather, what this comment could say is "Show that llvm-addr2line and llvm-symbolizer accept symbol names on the command-line."
21
32	I think I might be getting confused between different patches, but looking at this again, my feeling is that we don't really want this divergence in behaviour between GNU and LLVM mode, unless there's a strong motivation for the LLVM behaviour style. I might be inclined to create a different precursor patch that unifies the two, so that they do what GNU addr2line does.

ikudrin mentioned this in D149757: Test data for symbol lookup. NFC.Jul 31 2023, 7:21 PM

ikudrin added inline comments.Jul 31 2023, 10:05 PM

llvm/docs/ReleaseNotes.rst
326	Please remove the spaces in the blank line.

Rebase the patch

@sepavloff, what's the situation with this patch? Are you planning on addressing review comments or similar, or waiting for an upstream patch to land etc etc?

In D149759#4636578, @jhenderson wrote:

@sepavloff, what's the situation with this patch? Are you planning on addressing review comments or similar, or waiting for an upstream patch to land etc etc?

All necessary prerequisites are landed, the patch is updated and is ready for review.

jhenderson added inline comments.Sep 5 2023, 1:03 AM

llvm/docs/ReleaseNotes.rst
326	Not addresssed?
llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h
110	This is a bit of a nit-pick, but it is throwing me off a bit having both `const std::string &` and `StringRef` arguments for this overload of `findSymbol`. I see why you've done it (similarity with other overloaded functions above and below), but I feel like it should probably still use the correct and consistent style (`StringRef`) for both.
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
392	It's possible I've missed it, but I don't see any test that shows this array has appropriate contents (I see a few where it is empty, but there needs to be one or more test cases that cover it having contents, at least one of which should cover it having more than one entry).
llvm/lib/DebugInfo/Symbolize/SymbolizableObjectFile.cpp
357	I'm a little concerned about the performance of this code. It might be a premature concern, but looping through a list of symbols to find one, for every input symbol, sounds like it could get expensive quite rapidly - if you were naively to use llvm-symbolizer to symbolize an entire object by using its listed symbol names, you'd end up with an n^2 operation, as every search has to go through the entire list. The performance of the symbolizer code is considered important, hence why I'm bringing this up. I wonder if it's worth considering changing the `std::vector` used for `Symbols` into another container with more efficient searching performance?
llvm/lib/DebugInfo/Symbolize/Symbolize.cpp
237–238	Is there testing covering this failure for this specific case?
245–246	Ditto.
253–254	This needs a test case to show that if `Opts.Demangle` is false, the name isn't demangled.
llvm/test/tools/llvm-symbolizer/symbol-search.test
9	Not addressed. For a concrete example, I personally recommend the following format: # Comment RUN: ... RUN: ... CHECK: ... CHEC-NEXT: # Next test case comment RUN: ...
14–15	Not addressed.
25
37
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
212	`StartsWithHexPrefix` appears to be unused?

Update patch

There look like there are a few of my previous comments that haven't been addressed yet?

llvm/test/tools/llvm-symbolizer/output-style-json-code.test
66
llvm/test/tools/llvm-symbolizer/symbol-search.test
13	You don't specify things "in" a command line, you specify them "on the command-line."
18	I'm not sure this comment needed changing. The previous version was good: "If symbol has a space in its name, ignore everything after it." though I might change "after it" to "from the space onwards." You could add "Check that" to the start too, so the final thing might look like: "Check that if a symbol has a space in its name, ignore everything from the space onwards."
27	"Show that if a symbol ..."

sepavloff marked 5 inline comments as done.Sep 7 2023, 3:06 AM

sepavloff added inline comments.

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
392	At the end of `output-style-json-code.test` a test is added that checks the case of more than one entry.
llvm/lib/DebugInfo/Symbolize/SymbolizableObjectFile.cpp
357	This is a very important aspect of this solution. It however requires substantial code changes, so it is not practical to implement the performance enhancements in this patch. There are few users of this feature right now and GNU `addr2line` also uses linear search, so it is unlikely that performance problems would be noticed immediately. Some possible applications need `llvm-symbolizer` for large files and many symbols, performance is critical for such cases, so a solution will be elaborated. It requires more efforts than just changing the implementation of `Symbol`, as it is already sorted by address. There are other points where the speed or memory consumption can be improved, they could be treated together.
llvm/lib/DebugInfo/Symbolize/Symbolize.cpp
237–238	It is unlikely that this code fails, because `getOrCreateModuleInfo` is called early to check existence and validity of binary file.
245–246	Actually this code was copied from `symbolize*Common`, it will fail exactly in the same cases when fail those functions.
253–254	Such tests are added at the end of `symbol-search.test`.

Fix comments

Add missing tests

sepavloff added inline comments.Sep 7 2023, 12:16 PM

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp
237–238	I was wrong. If binary file name was not specified via `--obj` option, it will be extracted from command line arguments. In this case `getOrCreateModuleInfo` is called just in this code and may fail. The relevant test is added to `symbol-search.test`.
245–246	It seems this case (no error from `getOrCreateModuleInfo` but zero pointer to `SymbolizableModule`) cannot be realized.

Harbormaster completed remote builds in B256821: Diff 556193.Sep 7 2023, 3:24 PM

Sorry for the delay - Phabricator issues and being busy meant I only just got back to this.

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp
245–246	Okay, thanks for looking into it.
llvm/test/tools/llvm-symbolizer/symbol-search.test
13	I think canoncially it should be "command-line" not "command line" (this is based on what someone from our docs team once told me).
56	Strictly speaking, using the `--obj` option to provide the binary file is also specifying it on the command-line. I think this comment should be a little more specific (assuming it matters). I'd also use "input file" rather than "binary file" to clarify the file's purpose. Something like: "Show that both the symbol and input file can be specified in the search string on the command-line." or something like that. Similar comments apply below.
62	Nit: here and below, add a space after `>2` to make it stand out more.

Update the patch

Fix comments in symbol-search.test,
Merge D149757,
Rebase.

Harbormaster completed remote builds in B257567: Diff 557294.Sep 25 2023, 2:04 AM

LGTM, thanks, but please give @ikudrin/@MaskRay/... a few days to make any other comments.

This revision is now accepted and ready to land.Sep 26 2023, 12:00 AM

Thanks!

This revision was landed with ongoing or failed builds.Oct 2 2023, 7:39 AM

Closed by commit rG2b27948783e4: [symbolizer] Support symbol lookup (authored by sepavloff). · Explain Why

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rG2b27948783e4: [symbolizer] Support symbol lookup.

sepavloff added a reverting change: rG39fec5457c09: Revert "[symbolizer] Support symbol lookup".Oct 2 2023, 8:22 AM

The patch was reverted because the test llvm/test/Support/interrupts.test started failing. The fail is observed only on Windows. It looks like this is the test problem, it works incorrectly, if stderr is not empty. The fix is provided in https://github.com/llvm/llvm-project/pull/68556.

GitHub <noreply@github.com> mentioned this in rG18f036d01055: [test] Align behavior of interrupts.test on different platforms (#68556).Oct 30 2023, 9:32 PM

sepavloff added a commit: rGe144ae54dcb9: [symbolizer] Support symbol lookup.Nov 1 2023, 12:42 AM

Revision Contents

Path

Size

llvm/

docs/

CommandGuide/

llvm-symbolizer.rst

13 lines

ReleaseNotes.rst

2 lines

include/

llvm/

DebugInfo/

Symbolize/

DIPrinter.h

8 lines

SymbolizableModule.h

3 lines

SymbolizableObjectFile.h

2 lines

Symbolize.h

11 lines

lib/

DebugInfo/

Symbolize/

DIPrinter.cpp

29 lines

SymbolizableObjectFile.cpp

13 lines

Symbolize.cpp

44 lines

test/

tools/

llvm-symbolizer/

Inputs/

4 lines

2 lines

2 lines

4 lines

2 lines

invalid-input-address.test

19 lines

output-style-empty-line.test

31 lines

output-style-json-code.test

23 lines

sym-verbose.test

4 lines

sym.test

153 lines

symbol-search.test

33 lines

tools/

llvm-symbolizer/

llvm-symbolizer.cpp

62 lines

unittests/

ProfileData/

MemProfTest.cpp

4 lines

Diff 538185

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm-symbolizer - convert addresses into source code locations		llvm-symbolizer - convert addresses into source code locations
==============================================================		==============================================================

.. program:: llvm-symbolizer		.. program:: llvm-symbolizer

SYNOPSIS		SYNOPSIS
--------		--------

:program:`llvm-symbolizer` [options] [addresses...]		:program:`llvm-symbolizer` [options] [addresses...]

DESCRIPTION		DESCRIPTION
-----------		-----------

:program:`llvm-symbolizer` reads input names and addresses from the command-line		:program:`llvm-symbolizer` reads input names and addresses from the command-line
and prints corresponding source code locations to standard output. It can also		and prints corresponding source code locations to standard output. It can also
symbolize logs containing :doc:`Symbolizer Markup </SymbolizerMarkupFormat>` via		symbolize logs containing :doc:`Symbolizer Markup </SymbolizerMarkupFormat>` via
:option:`--filter-markup`.		:option:`--filter-markup`. Addresses may be specified as numbers or symbol names.

If no address is specified on the command-line, it reads the addresses from		If no address is specified on the command-line, it reads the addresses from
standard input. If no input name is specified on the command-line, but addresses		standard input. If no input name is specified on the command-line, but addresses
are, or if at any time an input value is not recognized, the input is simply		are, or if at any time an input value is not recognized, the input is simply
echoed to the output.		echoed to the output.

Input names can be specified together with the addresses either on standard		Input names can be specified together with the addresses either on standard
input or as positional arguments on the command-line. By default, input names		input or as positional arguments on the command-line. By default, input names
▲ Show 20 Lines • Show All 165 Lines • ▼ Show 20 Lines	.. code-block:: console
/tmp/foo/test.cpp:15:0		/tmp/foo/test.cpp:15:0
$ llvm-symbolizer --obj=test.elf 0x4004a0 --basenames		$ llvm-symbolizer --obj=test.elf 0x4004a0 --basenames
main		main
test.cpp:15:0		test.cpp:15:0
$ llvm-symbolizer --obj=test.elf 0x4004a0 --relativenames		$ llvm-symbolizer --obj=test.elf 0x4004a0 --relativenames
main		main
foo/test.cpp:15:0		foo/test.cpp:15:0

		Example 7 - Addresses as symbol names:

		.. code-block:: console

		$ llvm-symbolizer --obj=test.elf main
		main
		/tmp/test.cpp:14:0
		$ llvm-symbolizer --obj=test.elf "CODE foz"
		foz
		/tmp/test.h:1:0

OPTIONS		OPTIONS
-------		-------

.. option:: --adjust-vma <offset>		.. option:: --adjust-vma <offset>

Add the specified offset to object file addresses when performing lookups.		Add the specified offset to object file addresses when performing lookups.
This can be used to perform lookups as if the object were relocated by the		This can be used to perform lookups as if the object were relocated by the
offset.		offset.
▲ Show 20 Lines • Show All 321 Lines • Show Last 20 Lines

llvm/docs/ReleaseNotes.rst

Show First 20 Lines • Show All 317 Lines • ▼ Show 20 Lines	* When a template class annotated with the ``[[clang::preferred_name]]`` attribute
(`D145803 <https://reviews.llvm.org/D145803>`_)		(`D145803 <https://reviews.llvm.org/D145803>`_)

Changes to the LLVM tools		Changes to the LLVM tools
---------------------------------		---------------------------------
* llvm-lib now supports the /def option for generating a Windows import library from a definition file.		* llvm-lib now supports the /def option for generating a Windows import library from a definition file.

* Made significant changes to JSON output format of `llvm-readobj`/`llvm-readelf`		* Made significant changes to JSON output format of `llvm-readobj`/`llvm-readelf`
to improve correctness and clarity.		to improve correctness and clarity.

		ikudrinUnsubmitted Not Done Reply Inline Actions Please remove the spaces in the blank line. ikudrin: Please remove the spaces in the blank line.
		jhendersonUnsubmitted Done Reply Inline Actions Not addresssed? jhenderson: Not addresssed?
		* llvm-symbolizer and llvm-addr2line now support addresses specified as symbol names.

Changes to LLDB		Changes to LLDB
---------------------------------		---------------------------------

* In the results of commands such as ``expr`` and ``frame var``, type summaries will now		* In the results of commands such as ``expr`` and ``frame var``, type summaries will now
omit defaulted template parameters. The full template parameter list can still be		omit defaulted template parameters. The full template parameter list can still be
viewed with ``expr --raw-output``/``frame var --raw-output``. (`D141828 <https://reviews.llvm.org/D141828>`_)		viewed with ``expr --raw-output``/``frame var --raw-output``. (`D141828 <https://reviews.llvm.org/D141828>`_)

* LLDB is now able to show the subtype of signals found in a core file. For example		* LLDB is now able to show the subtype of signals found in a core file. For example
▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h

Show All 28 Lines

namespace symbolize {		namespace symbolize {

class SourceCode;		class SourceCode;

struct Request {		struct Request {
StringRef ModuleName;		StringRef ModuleName;
std::optional<uint64_t> Address;		std::optional<uint64_t> Address;
		StringRef Symbol;
};		};

class DIPrinter {		class DIPrinter {
public:		public:
DIPrinter() = default;		DIPrinter() = default;
virtual ~DIPrinter() = default;		virtual ~DIPrinter() = default;

virtual void print(const Request &Request, const DILineInfo &Info) = 0;		virtual void print(const Request &Request, const DILineInfo &Info) = 0;
virtual void print(const Request &Request, const DIInliningInfo &Info) = 0;		virtual void print(const Request &Request, const DIInliningInfo &Info) = 0;
virtual void print(const Request &Request, const DIGlobal &Global) = 0;		virtual void print(const Request &Request, const DIGlobal &Global) = 0;
virtual void print(const Request &Request,		virtual void print(const Request &Request,
const std::vector<DILocal> &Locals) = 0;		const std::vector<DILocal> &Locals) = 0;
		virtual void print(const Request &Request,
		const std::vector<DILineInfo> &Locations) = 0;

virtual void printInvalidCommand(const Request &Request,		virtual void printInvalidCommand(const Request &Request,
StringRef Command) = 0;		StringRef Command) = 0;

virtual bool printError(const Request &Request,		virtual bool printError(const Request &Request,
const ErrorInfoBase &ErrorInfo) = 0;		const ErrorInfoBase &ErrorInfo) = 0;

virtual void listBegin() = 0;		virtual void listBegin() = 0;
virtual void listEnd() = 0;		virtual void listEnd() = 0;
};		};

struct PrinterConfig {		struct PrinterConfig {
bool PrintAddress;		bool PrintAddress;
bool PrintFunctions;		bool PrintFunctions;
bool Pretty;		bool Pretty;
bool Verbose;		bool Verbose;
int SourceContextLines;		int SourceContextLines;
		bool IsGNUStyle;
		jhendersonUnsubmitted Done Reply Inline Actions I'm thinking this might be better named `IsGNUStyle`, since llvm-symbolizer has an `--output-style` option, and this variable controls how the output is formatted. jhenderson: I'm thinking this might be better named `IsGNUStyle`, since llvm-symbolizer has an `--output…
};		};

using ErrorHandler = function_ref<void(const ErrorInfoBase &, StringRef)>;		using ErrorHandler = function_ref<void(const ErrorInfoBase &, StringRef)>;

class PlainPrinterBase : public DIPrinter {		class PlainPrinterBase : public DIPrinter {
protected:		protected:
raw_ostream &OS;		raw_ostream &OS;
ErrorHandler ErrHandler;		ErrorHandler ErrHandler;
Show All 15 Lines	public:
PlainPrinterBase(raw_ostream &OS, ErrorHandler EH, PrinterConfig &Config)		PlainPrinterBase(raw_ostream &OS, ErrorHandler EH, PrinterConfig &Config)
: OS(OS), ErrHandler(EH), Config(Config) {}		: OS(OS), ErrHandler(EH), Config(Config) {}

void print(const Request &Request, const DILineInfo &Info) override;		void print(const Request &Request, const DILineInfo &Info) override;
void print(const Request &Request, const DIInliningInfo &Info) override;		void print(const Request &Request, const DIInliningInfo &Info) override;
void print(const Request &Request, const DIGlobal &Global) override;		void print(const Request &Request, const DIGlobal &Global) override;
void print(const Request &Request,		void print(const Request &Request,
const std::vector<DILocal> &Locals) override;		const std::vector<DILocal> &Locals) override;
		void print(const Request &Request,
		const std::vector<DILineInfo> &Locations) override;

void printInvalidCommand(const Request &Request, StringRef Command) override;		void printInvalidCommand(const Request &Request, StringRef Command) override;

bool printError(const Request &Request,		bool printError(const Request &Request,
const ErrorInfoBase &ErrorInfo) override;		const ErrorInfoBase &ErrorInfo) override;

void listBegin() override {}		void listBegin() override {}
void listEnd() override {}		void listEnd() override {}
Show All 36 Lines	public:
JSONPrinter(raw_ostream &OS, PrinterConfig &Config)		JSONPrinter(raw_ostream &OS, PrinterConfig &Config)
: OS(OS), Config(Config) {}		: OS(OS), Config(Config) {}

void print(const Request &Request, const DILineInfo &Info) override;		void print(const Request &Request, const DILineInfo &Info) override;
void print(const Request &Request, const DIInliningInfo &Info) override;		void print(const Request &Request, const DIInliningInfo &Info) override;
void print(const Request &Request, const DIGlobal &Global) override;		void print(const Request &Request, const DIGlobal &Global) override;
void print(const Request &Request,		void print(const Request &Request,
const std::vector<DILocal> &Locals) override;		const std::vector<DILocal> &Locals) override;
		void print(const Request &Request,
		const std::vector<DILineInfo> &Locations) override;

void printInvalidCommand(const Request &Request, StringRef Command) override;		void printInvalidCommand(const Request &Request, StringRef Command) override;

bool printError(const Request &Request,		bool printError(const Request &Request,
const ErrorInfoBase &ErrorInfo) override;		const ErrorInfoBase &ErrorInfo) override;

void listBegin() override;		void listBegin() override;
void listEnd() override;		void listEnd() override;
};		};
} // namespace symbolize		} // namespace symbolize
} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/include/llvm/DebugInfo/Symbolize/SymbolizableModule.h

Show All 30 Lines	public:
symbolizeInlinedCode(object::SectionedAddress ModuleOffset,		symbolizeInlinedCode(object::SectionedAddress ModuleOffset,
DILineInfoSpecifier LineInfoSpecifier,		DILineInfoSpecifier LineInfoSpecifier,
bool UseSymbolTable) const = 0;		bool UseSymbolTable) const = 0;
virtual DIGlobal		virtual DIGlobal
symbolizeData(object::SectionedAddress ModuleOffset) const = 0;		symbolizeData(object::SectionedAddress ModuleOffset) const = 0;
virtual std::vector<DILocal>		virtual std::vector<DILocal>
symbolizeFrame(object::SectionedAddress ModuleOffset) const = 0;		symbolizeFrame(object::SectionedAddress ModuleOffset) const = 0;

		virtual std::vector<object::SectionedAddress>
		findSymbol(StringRef Symbol) const = 0;

// Return true if this is a 32-bit x86 PE COFF module.		// Return true if this is a 32-bit x86 PE COFF module.
virtual bool isWin32Module() const = 0;		virtual bool isWin32Module() const = 0;

// Returns the preferred base of the module, i.e. where the loader would place		// Returns the preferred base of the module, i.e. where the loader would place
// it in memory assuming there were no conflicts.		// it in memory assuming there were no conflicts.
virtual uint64_t getModulePreferredBase() const = 0;		virtual uint64_t getModulePreferredBase() const = 0;
};		};

} // end namespace symbolize		} // end namespace symbolize
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZABLEMODULE_H		#endif // LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZABLEMODULE_H

llvm/include/llvm/DebugInfo/Symbolize/SymbolizableObjectFile.h

Show All 37 Lines	DILineInfo symbolizeCode(object::SectionedAddress ModuleOffset,
DILineInfoSpecifier LineInfoSpecifier,		DILineInfoSpecifier LineInfoSpecifier,
bool UseSymbolTable) const override;		bool UseSymbolTable) const override;
DIInliningInfo symbolizeInlinedCode(object::SectionedAddress ModuleOffset,		DIInliningInfo symbolizeInlinedCode(object::SectionedAddress ModuleOffset,
DILineInfoSpecifier LineInfoSpecifier,		DILineInfoSpecifier LineInfoSpecifier,
bool UseSymbolTable) const override;		bool UseSymbolTable) const override;
DIGlobal symbolizeData(object::SectionedAddress ModuleOffset) const override;		DIGlobal symbolizeData(object::SectionedAddress ModuleOffset) const override;
std::vector<DILocal>		std::vector<DILocal>
symbolizeFrame(object::SectionedAddress ModuleOffset) const override;		symbolizeFrame(object::SectionedAddress ModuleOffset) const override;
		std::vector<object::SectionedAddress>
		findSymbol(StringRef Symbol) const override;

// Return true if this is a 32-bit x86 PE COFF module.		// Return true if this is a 32-bit x86 PE COFF module.
bool isWin32Module() const override;		bool isWin32Module() const override;

// Returns the preferred base of the module, i.e. where the loader would place		// Returns the preferred base of the module, i.e. where the loader would place
// it in memory assuming there were no conflicts.		// it in memory assuming there were no conflicts.
uint64_t getModulePreferredBase() const override;		uint64_t getModulePreferredBase() const override;

▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h

Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	public:
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrame(const ObjectFile &Obj, object::SectionedAddress ModuleOffset);		symbolizeFrame(const ObjectFile &Obj, object::SectionedAddress ModuleOffset);
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrame(const std::string &ModuleName,		symbolizeFrame(const std::string &ModuleName,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrame(ArrayRef<uint8_t> BuildID,		symbolizeFrame(ArrayRef<uint8_t> BuildID,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);

		Expected<std::vector<DILineInfo>> findSymbol(const ObjectFile &Obj,
		StringRef Symbol);
		Expected<std::vector<DILineInfo>> findSymbol(const std::string &ModuleName,
		jhendersonUnsubmitted Done Reply Inline Actions This is a bit of a nit-pick, but it is throwing me off a bit having both `const std::string &` and `StringRef` arguments for this overload of `findSymbol`. I see why you've done it (similarity with other overloaded functions above and below), but I feel like it should probably still use the correct and consistent style (`StringRef`) for both. jhenderson: This is a bit of a nit-pick, but it is throwing me off a bit having both `const std::string &`…
		StringRef Symbol);
		Expected<std::vector<DILineInfo>> findSymbol(ArrayRef<uint8_t> BuildID,
		StringRef Symbol);

void flush();		void flush();

// Evict entries from the binary cache until it is under the maximum size		// Evict entries from the binary cache until it is under the maximum size
// given in the options. Calling this invalidates references in the DI...		// given in the options. Calling this invalidates references in the DI...
// objects returned by the methods above.		// objects returned by the methods above.
void pruneCache();		void pruneCache();

static std::string		static std::string
Show All 26 Lines	symbolizeInlinedCodeCommon(const T &ModuleSpecifier,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
template <typename T>		template <typename T>
Expected<DIGlobal> symbolizeDataCommon(const T &ModuleSpecifier,		Expected<DIGlobal> symbolizeDataCommon(const T &ModuleSpecifier,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
template <typename T>		template <typename T>
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrameCommon(const T &ModuleSpecifier,		symbolizeFrameCommon(const T &ModuleSpecifier,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
		template <typename T>
		Expected<std::vector<DILineInfo>> findSymbolCommon(const T &ModuleSpecifier,
		StringRef Symbol);

Expected<SymbolizableModule *> getOrCreateModuleInfo(const ObjectFile &Obj);		Expected<SymbolizableModule *> getOrCreateModuleInfo(const ObjectFile &Obj);

/// Returns a SymbolizableModule or an error if loading debug info failed.		/// Returns a SymbolizableModule or an error if loading debug info failed.
/// Unlike the above, errors are reported each time, since they are more		/// Unlike the above, errors are reported each time, since they are more
/// likely to be transient.		/// likely to be transient.
Expected<SymbolizableModule *>		Expected<SymbolizableModule *>
getOrCreateModuleInfo(ArrayRef<uint8_t> BuildID);		getOrCreateModuleInfo(ArrayRef<uint8_t> BuildID);
▲ Show 20 Lines • Show All 94 Lines • Show Last 20 Lines

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp

Show First 20 Lines • Show All 254 Lines • ▼ Show 20 Lines for (const DILocal &L : Locals) {

OS << *L.TagOffset; OS << *L.TagOffset;

else else

OS << DILineInfo::Addr2LineBadString; OS << DILineInfo::Addr2LineBadString;

OS << '\n'; OS << '\n';

} }

printFooter(); printFooter();

} }

void PlainPrinterBase::print(const Request &Request,

const std::vector<DILineInfo> &Locations) {

if (Locations.empty()) {

if (Config.IsGNUStyle || Request.Symbol.empty())

OS << DILineInfo::Addr2LineBadString << ":0\n";

else

OS << Request.Symbol;

ikudrinUnsubmitted

Done

if (Locations.empty()) {

- if (!Config.IsAddr2Line && !Request.Symbol.empty())

- OS << Request.Symbol << '\n';

- else

+ if (Config.IsAddr2Line || Request.Symbol.empty())

OS << DILineInfo::Addr2LineBadString << ":0\n";

+ else

+ OS << Request.Symbol << '\n';

} else {

The conditions with fewer negations are much easier to understand.

ikudrin: The conditions with fewer negations are much easier to understand.

} else {

for (const DILineInfo &L : Locations)

print(L, false);

}

printFooter();

}

void PlainPrinterBase::printInvalidCommand(const Request &Request, void PlainPrinterBase::printInvalidCommand(const Request &Request,

StringRef Command) { StringRef Command) {

OS << Command << '\n'; OS << Command << '\n';

} }

bool PlainPrinterBase::printError(const Request &Request, bool PlainPrinterBase::printError(const Request &Request,

const ErrorInfoBase &ErrorInfo) { const ErrorInfoBase &ErrorInfo) {

ErrHandler(ErrorInfo, Request.ModuleName); ErrHandler(ErrorInfo, Request.ModuleName);

// Print an empty struct too. // Print an empty struct too.

return true; return true;

} }

static std::string toHex(uint64_t V) { static std::string toHex(uint64_t V) {

return ("0x" + Twine::utohexstr(V)).str(); return ("0x" + Twine::utohexstr(V)).str();

} }

static json::Object toJSON(const Request &Request, StringRef ErrorMsg = "") { static json::Object toJSON(const Request &Request, StringRef ErrorMsg = "") {

json::Object Json({{"ModuleName", Request.ModuleName.str()}}); json::Object Json({{"ModuleName", Request.ModuleName.str()}});

if (!Request.Symbol.empty())

Json["SymName"] = Request.Symbol.str();

if (Request.Address) if (Request.Address)

Json["Address"] = toHex(*Request.Address); Json["Address"] = toHex(*Request.Address);

if (!ErrorMsg.empty()) if (!ErrorMsg.empty())

Json["Error"] = json::Object({{"Message", ErrorMsg.str()}}); Json["Error"] = json::Object({{"Message", ErrorMsg.str()}});

return Json; return Json;

} }

static json::Object toJSON(const DILineInfo &LineInfo) { static json::Object toJSON(const DILineInfo &LineInfo) {

▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines void JSONPrinter::print(const Request &Request,

json::Object Json = toJSON(Request); json::Object Json = toJSON(Request);

Json["Frame"] = std::move(Frame); Json["Frame"] = std::move(Frame);

if (ObjectList) if (ObjectList)

ObjectList->push_back(std::move(Json)); ObjectList->push_back(std::move(Json));

else else

printJSON(std::move(Json)); printJSON(std::move(Json));

} }

void JSONPrinter::print(const Request &Request,

const std::vector<DILineInfo> &Locations) {

json::Array Definitions;

for (const DILineInfo &L : Locations)

Definitions.push_back(toJSON(L));

json::Object Json = toJSON(Request);

Json["Loc"] = std::move(Definitions);

jhendersonUnsubmitted

Not Done

It's possible I've missed it, but I don't see any test that shows this array has appropriate contents (I see a few where it is empty, but there needs to be one or more test cases that cover it having contents, at least one of which should cover it having more than one entry).

jhenderson: It's possible I've missed it, but I don't see any test that shows this array has appropriate…

sepavloffAuthorUnsubmitted

Done

At the end of output-style-json-code.test a test is added that checks the case of more than one entry.

sepavloff: At the end of `output-style-json-code.test` a test is added that checks the case of more than…

if (ObjectList)

ObjectList->push_back(std::move(Json));

else

printJSON(std::move(Json));

}

void JSONPrinter::printInvalidCommand(const Request &Request, void JSONPrinter::printInvalidCommand(const Request &Request,

StringRef Command) { StringRef Command) {

printError(Request, printError(Request,

StringError("unable to parse arguments: " + Command, StringError("unable to parse arguments: " + Command,

std::make_error_code(std::errc::invalid_argument))); std::make_error_code(std::errc::invalid_argument)));

} }

bool JSONPrinter::printError(const Request &Request, bool JSONPrinter::printError(const Request &Request,

Show All 22 Lines

llvm/lib/DebugInfo/Symbolize/SymbolizableObjectFile.cpp

	Show First 20 Lines • Show All 345 Lines • ▼ Show 20 Lines
	std::vector<DILocal> SymbolizableObjectFile::symbolizeFrame(			std::vector<DILocal> SymbolizableObjectFile::symbolizeFrame(
	object::SectionedAddress ModuleOffset) const {			object::SectionedAddress ModuleOffset) const {
	if (ModuleOffset.SectionIndex == object::SectionedAddress::UndefSection)			if (ModuleOffset.SectionIndex == object::SectionedAddress::UndefSection)
	ModuleOffset.SectionIndex =			ModuleOffset.SectionIndex =
	getModuleSectionIndexForAddress(ModuleOffset.Address);			getModuleSectionIndexForAddress(ModuleOffset.Address);
	return DebugInfoContext->getLocalsForAddress(ModuleOffset);			return DebugInfoContext->getLocalsForAddress(ModuleOffset);
	}			}

				std::vector<object::SectionedAddress>
				SymbolizableObjectFile::findSymbol(StringRef Symbol) const {
				std::vector<object::SectionedAddress> Result;
				for (const SymbolDesc &Sym : Symbols) {
				jhendersonUnsubmitted Done Reply Inline Actions Nit: This should probably have braces, as it is a non-trivial statement (even if it is only a single statement technically). See https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements for more details. jhenderson: Nit: This should probably have braces, as it is a non-trivial statement (even if it is only a…
				jhendersonUnsubmitted Not Done Reply Inline Actions I'm a little concerned about the performance of this code. It might be a premature concern, but looping through a list of symbols to find one, for every input symbol, sounds like it could get expensive quite rapidly - if you were naively to use llvm-symbolizer to symbolize an entire object by using its listed symbol names, you'd end up with an n^2 operation, as every search has to go through the entire list. The performance of the symbolizer code is considered important, hence why I'm bringing this up. I wonder if it's worth considering changing the `std::vector` used for `Symbols` into another container with more efficient searching performance? jhenderson: I'm a little concerned about the performance of this code. It might be a premature concern, but…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions This is a very important aspect of this solution. It however requires substantial code changes, so it is not practical to implement the performance enhancements in this patch. There are few users of this feature right now and GNU `addr2line` also uses linear search, so it is unlikely that performance problems would be noticed immediately. Some possible applications need `llvm-symbolizer` for large files and many symbols, performance is critical for such cases, so a solution will be elaborated. It requires more efforts than just changing the implementation of `Symbol`, as it is already sorted by address. There are other points where the speed or memory consumption can be improved, they could be treated together. sepavloff: This is a very important aspect of this solution. It however requires substantial code changes…
				if (Sym.Name.equals(Symbol)) {
				object::SectionedAddress A{Sym.Addr,
				getModuleSectionIndexForAddress(Sym.Addr)};
				Result.push_back(A);
				}
				}
				return Result;
				}

	/// Search for the first occurence of specified Address in ObjectFile.			/// Search for the first occurence of specified Address in ObjectFile.
	uint64_t SymbolizableObjectFile::getModuleSectionIndexForAddress(			uint64_t SymbolizableObjectFile::getModuleSectionIndexForAddress(
	uint64_t Address) const {			uint64_t Address) const {

	for (SectionRef Sec : Module->sections()) {			for (SectionRef Sec : Module->sections()) {
	if (!Sec.isText() \|\| Sec.isVirtual())			if (!Sec.isText() \|\| Sec.isVirtual())
	continue;			continue;

	if (Address >= Sec.getAddress() &&			if (Address >= Sec.getAddress() &&
	Address < Sec.getAddress() + Sec.getSize())			Address < Sec.getAddress() + Sec.getSize())
	return Sec.getIndex();			return Sec.getIndex();
	}			}

	return object::SectionedAddress::UndefSection;			return object::SectionedAddress::UndefSection;
	}			}

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp

	Show First 20 Lines • Show All 224 Lines • ▼ Show 20 Lines
	}			}

	Expected<std::vector<DILocal>>			Expected<std::vector<DILocal>>
	LLVMSymbolizer::symbolizeFrame(ArrayRef<uint8_t> BuildID,			LLVMSymbolizer::symbolizeFrame(ArrayRef<uint8_t> BuildID,
	object::SectionedAddress ModuleOffset) {			object::SectionedAddress ModuleOffset) {
	return symbolizeFrameCommon(BuildID, ModuleOffset);			return symbolizeFrameCommon(BuildID, ModuleOffset);
	}			}

				template <typename T>
				Expected<std::vector<DILineInfo>>
				LLVMSymbolizer::findSymbolCommon(const T &ModuleSpecifier, StringRef Symbol) {
				auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier);
				if (!InfoOrErr)
				return InfoOrErr.takeError();
				jhendersonUnsubmitted Not Done Reply Inline Actions Is there testing covering this failure for this specific case? jhenderson: Is there testing covering this failure for this specific case?
				sepavloffAuthorUnsubmitted Done Reply Inline Actions It is unlikely that this code fails, because `getOrCreateModuleInfo` is called early to check existence and validity of binary file. sepavloff: It is unlikely that this code fails, because `getOrCreateModuleInfo` is called early to check…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions I was wrong. If binary file name was not specified via `--obj` option, it will be extracted from command line arguments. In this case `getOrCreateModuleInfo` is called just in this code and may fail. The relevant test is added to `symbol-search.test`. sepavloff: I was wrong. If binary file name was not specified via `--obj` option, it will be extracted…

				SymbolizableModule Info = InfoOrErr;
				std::vector<DILineInfo> Result;

				// A null module means an error has already been reported. Return an empty
				// result.
				if (!Info)
				return Result;
				jhendersonUnsubmitted Not Done Reply Inline Actions Ditto. jhenderson: Ditto.
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Actually this code was copied from `symbolizeCommon`, it will fail exactly in the same cases when fail those functions. sepavloff:* Actually this code was copied from `symbolize*Common`, it will fail exactly in the same cases…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions It seems this case (no error from `getOrCreateModuleInfo` but zero pointer to `SymbolizableModule`) cannot be realized. sepavloff: It seems this case (no error from `getOrCreateModuleInfo` but zero pointer to…
				jhendersonUnsubmitted Not Done Reply Inline Actions Okay, thanks for looking into it. jhenderson: Okay, thanks for looking into it.

				for (object::SectionedAddress A : Info->findSymbol(Symbol)) {
				DILineInfo LineInfo = Info->symbolizeCode(
				A, DILineInfoSpecifier(Opts.PathStyle, Opts.PrintFunctions),
				Opts.UseSymbolTable);
				if (LineInfo.FileName != DILineInfo::BadString) {
				if (Opts.Demangle)
				LineInfo.FunctionName = DemangleName(LineInfo.FunctionName, Info);
				jhendersonUnsubmitted Not Done Reply Inline Actions This needs a test case to show that if `Opts.Demangle` is false, the name isn't demangled. jhenderson: This needs a test case to show that if `Opts.Demangle` is false, the name isn't demangled.
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Such tests are added at the end of `symbol-search.test`. sepavloff: Such tests are added at the end of `symbol-search.test`.
				Result.push_back(LineInfo);
				}
				}

				return Result;
				}

				Expected<std::vector<DILineInfo>>
				LLVMSymbolizer::findSymbol(const ObjectFile &Obj, StringRef Symbol) {
				return findSymbolCommon(Obj, Symbol);
				}

				Expected<std::vector<DILineInfo>>
				LLVMSymbolizer::findSymbol(const std::string &ModuleName, StringRef Symbol) {
				return findSymbolCommon(ModuleName, Symbol);
				}

				Expected<std::vector<DILineInfo>>
				LLVMSymbolizer::findSymbol(ArrayRef<uint8_t> BuildID, StringRef Symbol) {
				return findSymbolCommon(BuildID, Symbol);
				}

	void LLVMSymbolizer::flush() {			void LLVMSymbolizer::flush() {
	ObjectForUBPathAndArch.clear();			ObjectForUBPathAndArch.clear();
	LRUBinaries.clear();			LRUBinaries.clear();
	CacheSize = 0;			CacheSize = 0;
	BinaryForPath.clear();			BinaryForPath.clear();
	ObjectPairForPathArch.clear();			ObjectPairForPathArch.clear();
	Modules.clear();			Modules.clear();
	BuildIDPaths.clear();			BuildIDPaths.clear();
	▲ Show 20 Lines • Show All 501 Lines • Show Last 20 Lines

llvm/test/tools/llvm-symbolizer/Inputs/addr.inp

	some text			something not a valid address
	0x40054d			0x40054d
	some text2			some text possibly a symbol

llvm/test/tools/llvm-symbolizer/Inputs/discrim.inp

	some text			some text
	0x400590			0x400590
	0x4005a5			0x4005a5
	0x4005ad			0x4005ad
	0x4005b9			0x4005b9
	0x4005ce			0x4005ce
	0x4005d4			0x4005d4
	some more text			another text

llvm/test/tools/llvm-symbolizer/debuginfod.test

	Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
	NOTHINGFOUND: ??			NOTHINGFOUND: ??
	NOTHINGFOUND-NEXT: ??:0:0			NOTHINGFOUND-NEXT: ??:0:0

	# BUILDID shouldn't be parsed if --obj is given, just like regular filenames.			# BUILDID shouldn't be parsed if --obj is given, just like regular filenames.
	RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \			RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
	RUN: --obj=%t/addr.exe \			RUN: --obj=%t/addr.exe \
	RUN: "BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \			RUN: "BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \
	RUN: FileCheck %s --check-prefix=BUILDIDIGNORED			RUN: FileCheck %s --check-prefix=BUILDIDIGNORED
	BUILDIDIGNORED: BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d			BUILDIDIGNORED: BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4

	# Providing both BUILDID and FILE is a syntax error.			# Providing both BUILDID and FILE is a syntax error.
	RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \			RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
	RUN: "BUILDID:FILE:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \			RUN: "BUILDID:FILE:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \
	RUN: FileCheck %s --check-prefix=BUILDIDFILE			RUN: FileCheck %s --check-prefix=BUILDIDFILE
	BUILDIDFILE: BUILDID:FILE:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d			BUILDIDFILE: BUILDID:FILE:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d
	RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \			RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
	RUN: "FILE:BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \			RUN: "FILE:BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" \| \
	RUN: FileCheck %s --check-prefix=FILEBUILDID			RUN: FileCheck %s --check-prefix=FILEBUILDID
	FILEBUILDID: FILE:BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d			FILEBUILDID: FILE:BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d

llvm/test/tools/llvm-symbolizer/flag-grouping.test

	RUN: llvm-symbolizer -apCi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s			RUN: llvm-symbolizer -apCi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer -apCie %p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s			RUN: llvm-symbolizer -apCie %p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer -apCie=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s			RUN: llvm-symbolizer -apCie=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer -apCie%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s			RUN: llvm-symbolizer -apCie%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s

	CHECK: some text			CHECK: something
				jhendersonUnsubmitted Not Done Reply Inline Actions Let's match the full line in all of these cases where the text has changed. jhenderson: Let's match the full line in all of these cases where the text has changed.
	CHECK: 0x40054d: inctwo			CHECK: 0x40054d: inctwo
	CHECK: (inlined by) inc			CHECK: (inlined by) inc
	CHECK (inlined by) main			CHECK (inlined by) main
	CHECK: some text2			CHECK: some

llvm/test/tools/llvm-symbolizer/flush-output.s

	# REQUIRES: x86-registered-target			# REQUIRES: x86-registered-target

	## If a process spawns llvm-symbolizer, and wishes to feed it addresses one at a			## If a process spawns llvm-symbolizer, and wishes to feed it addresses one at a
	## time, llvm-symbolizer needs to flush its output after each input has been			## time, llvm-symbolizer needs to flush its output after each input has been
	## processed or the parent process will not be able to read the output and may			## processed or the parent process will not be able to read the output and may
	## deadlock. This test runs a script that simulates this situation for both a			## deadlock. This test runs a script that simulates this situation for both a
	## a good and bad input.			## a good and bad input.

	foo:			foo:
	nop			nop

	# RUN: llvm-mc -filetype=obj -triple=x86_64-pc-linux %s -o %t.o -g			# RUN: llvm-mc -filetype=obj -triple=x86_64-pc-linux %s -o %t.o -g
	# RUN: %python %p/Inputs/flush-output.py llvm-symbolizer %t.o \			# RUN: %python %p/Inputs/flush-output.py llvm-symbolizer %t.o \
	# RUN: \| FileCheck %s			# RUN: \| FileCheck %s

	# CHECK: flush-output.s:10			# CHECK: flush-output.s:10
	# CHECK: bad			# CHECK: ??:0

llvm/test/tools/llvm-symbolizer/invalid-input-address.test

	# Use address that can't fit in a 64-bit number. Show that llvm-symbolizer			# Use address that can't fit in a 64-bit number. Show that llvm-symbolizer
	# simply echoes it as per other malformed input addresses.			# simply echoes it as per other malformed input addresses.
	RUN: llvm-symbolizer --obj=addr.exe 0x10000000000000000 \| FileCheck --check-prefix=LARGE-ADDR %s			RUN: llvm-symbolizer --obj=addr.exe 0x10000000000000000 \| FileCheck --check-prefix=LARGE-ADDR %s

	LARGE-ADDR-NOT: {{.}}			LARGE-ADDR-NOT: {{.}}
	LARGE-ADDR: 0x10000000000000000			LARGE-ADDR: 0x10000000000000000
	LARGE-ADDR-NOT: {{.}}			LARGE-ADDR-NOT: {{.}}

	RUN: echo '"some text"' '"some text2"' > %t.rsp			RUN: echo '"some text"' '"another text"' > %t.rsp
	RUN: echo -e 'some text\nsome text2\n' > %t.inp			RUN: echo -e 'some text\nanother text\n' > %t.inp

	# Test bad input address values, via stdin, command line and response file.			# Test bad input address values, via stdin, command line and response file.
	RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe < %t.inp \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe < %t.inp \| FileCheck --check-prefix=BAD-INPUT %s
	RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe "some text" "some text2" \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe "some text" "another text" \| FileCheck --check-prefix=BAD-INPUT %s
	RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe @%t.rsp \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-symbolizer --obj=%p/Inputs/addr.exe @%t.rsp \| FileCheck --check-prefix=BAD-INPUT %s

	# Test bad input address values for the GNU-compatible version.			# Test bad input address values for the GNU-compatible version.
	RUN: llvm-addr2line --obj=%p/Inputs/addr.exe < %t.inp \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-addr2line --obj=%p/Inputs/addr.exe < %t.inp \| FileCheck --check-prefix=BAD-INPUT-GNU %s
	RUN: llvm-addr2line --obj=%p/Inputs/addr.exe "some text" "some text2" \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-addr2line --obj=%p/Inputs/addr.exe "some text" "another text" \| FileCheck --check-prefix=BAD-INPUT-GNU %s
	RUN: llvm-addr2line --obj=%p/Inputs/addr.exe @%t.rsp \| FileCheck --check-prefix=BAD-INPUT %s			RUN: llvm-addr2line --obj=%p/Inputs/addr.exe @%t.rsp \| FileCheck --check-prefix=BAD-INPUT-GNU %s

	BAD-INPUT: some text			BAD-INPUT: some
	BAD-INPUT-NEXT: some text2			BAD-INPUT-NEXT: another

				BAD-INPUT-GNU: ??:0
				BAD-INPUT-GNU-NEXT: ??:0

llvm/test/tools/llvm-symbolizer/output-style-empty-line.test

	This test checks that with --output-style=GNU the tool does not print an empty			This test checks that with --output-style=GNU the tool does not print an empty
	line after the report for an address. The current behavior is preserved for			line after the report for an address. The current behavior is preserved for
	--output-style=LLVM or if the option is omitted.			--output-style=LLVM or if the option is omitted.

	RUN: llvm-symbolizer -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			RUN: llvm-symbolizer -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	RUN: \| FileCheck %s --check-prefix=LLVM			RUN: \| FileCheck %s --check-prefix=SYMB-LLVM

	RUN: llvm-symbolizer --output-style=LLVM -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			RUN: llvm-symbolizer --output-style=LLVM -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	RUN: \| FileCheck %s --check-prefix=LLVM			RUN: \| FileCheck %s --check-prefix=SYMB-LLVM

				SYMB-LLVM: x.c:14:0
				SYMB-LLVM-EMPTY:
				SYMB-LLVM-NEXT: some

	RUN: llvm-symbolizer --output-style=GNU -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			RUN: llvm-symbolizer --output-style=GNU -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	RUN: \| FileCheck %s --check-prefix=GNU			RUN: \| FileCheck %s --check-prefix=SYMB-GNU

				SYMB-GNU: x.c:14
				SYMB-GNU-NEXT: ??:0
				jhendersonUnsubmitted Not Done Reply Inline Actions I don't think these check-suffixes are particularly understandable any more, given the changed behaviour. I'd suggest instead renaming them all and moving the check patterns to be immediately after the group that uses them. Something like: RUN: llvm-symbolizer -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-LLVM RUN: llvm-symbolizer --output-style=LLVM -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-LLVM SYMB-LLVM: x.c:14:0 SYMB-LLVM-EMPTY: SYMB-LLVM-NEXT: some text2 RUN: llvm-symbolizer --output-style=GNU -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=SYMB-GNU SYMB-GNU: x.c:14 SYMB-GNU-NEXT: some text2 RUN: llvm-addr2line -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-GNU RUN: llvm-addr2line --output-style=GNU -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-GNU ADDR-GNU: x.c:14 ADDR-GNU-NEXT: ??:0 RUN: llvm-addr2line --output-style=LLVM -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \ RUN: \| FileCheck %s --check-prefix=ADDR-LLVM ADDR-LLVM: x.c:14:0 ADDR-LLVM-EMPTY: ADDR-LLVM-NEXT: ??:0 jhenderson: I don't think these check-suffixes are particularly understandable any more, given the changed…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Yes, it looks better. Thank you! sepavloff: Yes, it looks better. Thank you!

	RUN: llvm-addr2line -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			RUN: llvm-addr2line -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	RUN: \| FileCheck %s --check-prefix=GNU			RUN: \| FileCheck %s --check-prefix=ADDR-GNU

	RUN: llvm-addr2line --output-style=GNU -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			RUN: llvm-addr2line --output-style=GNU -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	RUN: \| FileCheck %s --check-prefix=GNU			RUN: \| FileCheck %s --check-prefix=ADDR-GNU

	RUN: llvm-addr2line --output-style=LLVM -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \			ADDR-GNU: x.c:14
	RUN: \| FileCheck %s --check-prefix=LLVM			ADDR-GNU-NEXT: ??:0

	LLVM: x.c:14:0			RUN: llvm-addr2line --output-style=LLVM -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp \
	LLVM-EMPTY:			RUN: \| FileCheck %s --check-prefix=ADDR-LLVM
	LLVM-NEXT: some text2

	GNU: x.c:14			ADDR-LLVM: x.c:14:0
	GNU-NEXT: some text2			ADDR-LLVM-EMPTY:
				ADDR-LLVM-NEXT: some

llvm/test/tools/llvm-symbolizer/output-style-json-code.test

Show All 19 Lines

# DISCRIM:[{"Address":"0x400575","ModuleName":"{{.*}}/Inputs/discrim","Symbol":[{"Column":17,"Discriminator":2,"FileName":"/tmp{{/|\\\\}}discrim.c","FunctionName":"foo","Line":5,"StartAddress":"0x400560","StartFileName":"/tmp{{/|\\\\}}discrim.c","StartLine":4}]}]

## In case of stdin input the output will contain a single JSON object for each input string.

## This test case is testing stdin input, with the --no-inlines option.

# RUN: llvm-symbolizer --output-style=JSON --no-inlines -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

# RUN: FileCheck %s --check-prefix=NO-INLINES --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one.

# NO-INLINES:{"Error":{"Message":"unable to parse arguments: some text"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# NO-INLINES:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"something"}

## Resolve valid address.

# NO-INLINES-NEXT:{"Address":"0x40054d","ModuleName":"{{.*}}/Inputs/addr.exe","Symbol":[{"Column":3,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"main","Line":3,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2}]}

## Invalid argument after a valid one.

# NO-INLINES-NEXT:{"Error":{"Message":"unable to parse arguments: some text2"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# NO-INLINES-NEXT:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"some"}

## This test case is testing stdin input, inlines by default.

# RUN: llvm-symbolizer --output-style=JSON -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

# RUN: FileCheck %s --check-prefix=INLINE --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one.

# INLINE:{"Error":{"Message":"unable to parse arguments: some text"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# INLINE:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"something"}

## Resolve valid address.

# INLINE-NEXT:{"Address":"0x40054d","ModuleName":"{{.*}}/Inputs/addr.exe","Symbol":[{"Column":3,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"inctwo","Line":3,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"inc","Line":7,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":6},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"main","Line":14,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":12}]}

## Invalid argument after a valid one.

# INLINE-NEXT:{"Error":{"Message":"unable to parse arguments: some text2"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# INLINE-NEXT:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"some"}

## Also check the last test case with llvm-adr2line.

## The expected result is the same with -f -i.

# RUN: llvm-addr2line --output-style=JSON -f -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

# RUN: FileCheck %s --check-prefix=INLINE-A2L --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one.

# INLINE-A2L:{"Error":{"Message":"unable to parse arguments: some text"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# INLINE-A2L:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"something"}

jhendersonUnsubmitted

Done

I think you could make this a little more self descriptive, by changing your invalid input to something like "0 not a symbol name or number". That being said, the intent of this test, I think, was to use the exact same set of inputs as one that doesn't use JSON style.

jhenderson: I think you could make this a little more self descriptive, by changing your invalid input to…

## Resolve valid address.

# INLINE-A2L-NEXT:{"Address":"0x40054d","ModuleName":"{{.*}}/Inputs/addr.exe","Symbol":[{"Column":3,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"inctwo","Line":3,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"inc","Line":7,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":6},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"main","Line":14,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":12}]}

## Invalid argument after a valid one.

# INLINE-A2L-NEXT:{"Error":{"Message":"unable to parse arguments: some text2"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# INLINE-A2L:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"some"}

## Note llvm-addr2line without -f does not print the function name in JSON too.

# RUN: llvm-addr2line --output-style=JSON -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

# RUN: FileCheck %s --check-prefix=NO-FUNC-A2L --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one.

# NO-FUNC-A2L:{"Error":{"Message":"unable to parse arguments: some text"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# NO-FUNC-A2L:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"something"}

## Resolve valid address.

# NO-FUNC-A2L-NEXT:{"Address":"0x40054d","ModuleName":"{{.*}}/Inputs/addr.exe","Symbol":[{"Column":3,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"","Line":3,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"","Line":7,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":6},{"Column":0,"Discriminator":0,"FileName":"/tmp{{/|\\\\}}x.c","FunctionName":"","Line":14,"StartAddress":"0x400540","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":12}]}

## Invalid argument after a valid one.

# NO-FUNC-A2L-NEXT:{"Error":{"Message":"unable to parse arguments: some text2"},"ModuleName":"{{.*}}/Inputs/addr.exe"}

# NO-FUNC-A2L-NEXT:{"Address":"0x0","Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"some"}

jhendersonUnsubmitted

Not Done

# NO-FUNC-A2L-NEXT:{"Loc":[],"ModuleName":"{{.*}}/Inputs/addr.exe","SymName":"some"}

- ## When a module offset is specified by a symbol, more than one source locations can be found.

+ ## When a module offset is specified by a symbol, more than one source location can be found.

# RUN: llvm-symbolizer --output-style=JSON --no-inlines -e %p/Inputs/symbols.so "static_func" | \

jhenderson:

llvm/test/tools/llvm-symbolizer/sym-verbose.test

	#static volatile int do_mul;			#static volatile int do_mul;
	#static volatile int x, v;			#static volatile int x, v;
	#			#
	#int foo () {			#int foo () {
	# if (do_mul) x *= v; else x /= v;			# if (do_mul) x *= v; else x /= v;
	# return x;			# return x;
	#}			#}
	#			#
	#int main() {			#int main() {
	# return foo() + foo();			# return foo() + foo();
	#}			#}
	#Build as : clang -gmlt -fdebug-info-for-profiling -O2 discrim.c -o discrim			#Build as : clang -gmlt -fdebug-info-for-profiling -O2 discrim.c -o discrim

	RUN: llvm-symbolizer --verbose --print-address --obj=%p/Inputs/discrim < %p/Inputs/discrim.inp \| FileCheck %s			RUN: llvm-symbolizer --verbose --print-address --obj=%p/Inputs/discrim < %p/Inputs/discrim.inp \| FileCheck %s

	#CHECK: some text			#CHECK: some

	#CHECK: 0x400590			#CHECK: 0x400590
	#CHECK-NEXT: foo			#CHECK-NEXT: foo
	#CHECK-NEXT: Filename: /tmp{{[\\/]}}discrim.c			#CHECK-NEXT: Filename: /tmp{{[\\/]}}discrim.c
	#CHECK-NEXT: Function start filename: /tmp{{[\\/]}}discrim.c			#CHECK-NEXT: Function start filename: /tmp{{[\\/]}}discrim.c
	#CHECK-NEXT: Function start line: 4			#CHECK-NEXT: Function start line: 4
	#CHECK-NEXT: Function start address: 0x400590			#CHECK-NEXT: Function start address: 0x400590
	#CHECK-NEXT: Line: 5			#CHECK-NEXT: Line: 5
	▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines
	#CHECK-NEXT: Filename: /tmp{{[\\/]}}discrim.c			#CHECK-NEXT: Filename: /tmp{{[\\/]}}discrim.c
	#CHECK-NEXT: Function start filename: /tmp{{[\\/]}}discrim.c			#CHECK-NEXT: Function start filename: /tmp{{[\\/]}}discrim.c
	#CHECK-NEXT: Function start line: 9			#CHECK-NEXT: Function start line: 9
	#CHECK-NEXT: Function start address: 0x400590			#CHECK-NEXT: Function start address: 0x400590
	#CHECK-NEXT: Line: 10			#CHECK-NEXT: Line: 10
	#CHECK-NEXT: Column: 0			#CHECK-NEXT: Column: 0
	#CHECK-NEXT: Discriminator: 2			#CHECK-NEXT: Discriminator: 2

	#CHECK: some more text			#CHECK: another

llvm/test/tools/llvm-symbolizer/sym.test

	#Source:			## Source:
				jhendersonUnsubmitted Not Done Reply Inline Actions I generally would avoid double blank lines. My personal rule is similar to what I follow in C++ code: No blank line if the comment is tightly linked to the immediately following block. One blank line if the comment is more general (e.g. it applies to a wider area of code/isn't really targeted at a specific block etc). jhenderson: I generally would avoid double blank lines. My personal rule is similar to what I follow in C++…
	##include <stdio.h>			## #include <stdio.h>
	#static inline int inctwo (int *a) {			## static inline int inctwo (int *a) {
	# printf ("%d\n",(*a)++);			## printf ("%d\n",(*a)++);
	# return (*a)++;			## return (*a)++;
	#}			## }
	#static inline int inc (int *a) {			## static inline int inc (int *a) {
	# printf ("%d\n",inctwo(a));			## printf ("%d\n",inctwo(a));
	# return (*a)++;			## return (*a)++;
	#}			## }
	#			##
	#			##
	#int main () {			## int main () {
	# int x = 1;			## int x = 1;
	# return inc(&x);			## return inc(&x);
	#}			## }
				##
				## Build as : clang -g -O2 addr.c

				# RUN: llvm-symbolizer --print-address --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
				# RUN: llvm-symbolizer --addresses --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
				# RUN: llvm-symbolizer -a --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	#			#
	#Build as : clang -g -O2 addr.c			# CHECK: something

	RUN: llvm-symbolizer --print-address --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer --addresses --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer -a --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck %s
	RUN: llvm-symbolizer --inlining --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	RUN: llvm-symbolizer --inlining --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	RUN: llvm-symbolizer --inlines --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	RUN: llvm-symbolizer --inlines --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	RUN: llvm-symbolizer -i --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	RUN: llvm-symbolizer -i --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	## Before 2020-08-04, asan_symbolize.py passed --inlining=true.
	## Support this compatibility alias for a while.
	RUN: llvm-symbolizer --inlining=true --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s

	RUN: echo "0x1" > %t.input
	RUN: llvm-symbolizer --obj=%p/Inputs/zero < %t.input \| FileCheck -check-prefix="ZERO" %s

	RUN: llvm-addr2line --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix=A2L %s
	RUN: llvm-addr2line -a --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_A %s
	RUN: llvm-addr2line -f --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_F %s
	RUN: llvm-addr2line -i --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_I %s
	RUN: llvm-addr2line -fi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_F,A2L_I,A2L_FI %s

	RUN: llvm-addr2line -pa --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_A %s
	RUN: llvm-addr2line -pf --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_F %s
	RUN: llvm-addr2line -paf --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_AF %s
	RUN: llvm-addr2line -pai --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_A,A2LP_I %s
	RUN: llvm-addr2line -pfi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_F,A2LP_FI %s
	RUN: llvm-addr2line -pafi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_AF,A2LP_FI %s

	# CHECK: some text
	# CHECK-NEXT: 0x40054d			# CHECK-NEXT: 0x40054d
	# CHECK-NEXT: inctwo			# CHECK-NEXT: inctwo
	# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:3:3			# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:3:3
	# CHECK-NEXT: inc			# CHECK-NEXT: inc
	# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7:0			# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7:0
	# CHECK-NEXT: main			# CHECK-NEXT: main
	# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14:0			# CHECK-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14:0
	# CHECK-EMPTY:			# CHECK-EMPTY:
	# CHECK-NEXT: some text2			# CHECK-NEXT: some

				## Before 2020-08-04, asan_symbolize.py passed --inlining=true.
				## Support this compatibility alias for a while.
				# RUN: llvm-symbolizer --inlining=true --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s

				# RUN: llvm-symbolizer --inlining --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
				# RUN: llvm-symbolizer --inlining --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
				# RUN: llvm-symbolizer --inlines --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
				# RUN: llvm-symbolizer --inlines --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
				# RUN: llvm-symbolizer -i --print-address --pretty-print --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
				# RUN: llvm-symbolizer -i --print-address -p --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix="PRETTY" %s
	#			#
	#PRETTY: some text			# PRETTY: something
	#PRETTY: {{[0x]+}}40054d: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3:3			# PRETTY: {{[0x]+}}40054d: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3:3
	#PRETTY: (inlined by) inc at {{[/\]+}}tmp{{[/\]+}}x.c:7:0			# PRETTY: (inlined by) inc at {{[/\]+}}tmp{{[/\]+}}x.c:7:0
	#PRETTY: (inlined by) main at {{[/\]+}}tmp{{[/\]+}}x.c:14:0			# PRETTY: (inlined by) main at {{[/\]+}}tmp{{[/\]+}}x.c:14:0
	#PRETTY: some text2			# PRETTY: some

				# RUN: echo "0x1" > %t.input
				# RUN: llvm-symbolizer --obj=%p/Inputs/zero < %t.input \| FileCheck -check-prefix="ZERO" %s
	#			#
	#ZERO: ??			# ZERO: ??
	#ZERO: ??:0:0			# ZERO: ??:0:0

				# RUN: llvm-addr2line --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefix=A2L %s
				# RUN: llvm-addr2line -a --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_A %s
				# RUN: llvm-addr2line -f --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_F %s
				# RUN: llvm-addr2line -i --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_I %s
				# RUN: llvm-addr2line -fi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2L,A2L_F,A2L_I,A2L_FI %s
				#
				# RUN: llvm-addr2line -pa --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_A %s
				# RUN: llvm-addr2line -pf --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_F %s
				# RUN: llvm-addr2line -paf --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_AF %s
				# RUN: llvm-addr2line -pai --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_A,A2LP_I %s
				# RUN: llvm-addr2line -pfi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_F,A2LP_FI %s
				# RUN: llvm-addr2line -pafi --obj=%p/Inputs/addr.exe < %p/Inputs/addr.inp \| FileCheck -check-prefixes=A2LP,A2LP_AF,A2LP_FI %s
	#			#
	#A2L: some text			# A2L: ??:0
	#A2L_A-NEXT: 0x40054d			# A2L_A-NEXT: 0x40054d
	#A2L_F-NEXT: inctwo			# A2L_F-NEXT: inctwo
	#A2L-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}			# A2L-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}
	#A2L_FI-NEXT: inc{{$}}			# A2L_FI-NEXT: inc{{$}}
	#A2L_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}			# A2L_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}
	#A2L_FI-NEXT: main			# A2L_FI-NEXT: main
	#A2L_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}			# A2L_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}
	#A2L-NEXT: some text2			# A2L-NEXT: ??:0
				#
	#A2LP: some text			# A2LP: ??:0
	#A2LP_A-NEXT: 0x40054d: {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}			# A2LP_A-NEXT: 0x40054d: {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}
	#A2LP_F-NEXT: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}			# A2LP_F-NEXT: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}
	#A2LP_AF-NEXT: 0x40054d: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}			# A2LP_AF-NEXT: 0x40054d: inctwo at {{[/\]+}}tmp{{[/\]+}}x.c:3{{$}}
	#A2LP_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}			# A2LP_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}
	#A2LP_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}			# A2LP_I-NEXT: {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}
	#A2LP_FI-NEXT: (inlined by) inc at {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}			# A2LP_FI-NEXT: (inlined by) inc at {{[/\]+}}tmp{{[/\]+}}x.c:7{{$}}
	#A2LP_FI-NEXT: (inlined by) main at {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}			# A2LP_FI-NEXT: (inlined by) main at {{[/\]+}}tmp{{[/\]+}}x.c:14{{$}}
	#A2LP-NEXT: some text2			# A2LP-NEXT: ??:0

				No newline at end of file

llvm/test/tools/llvm-symbolizer/symbol-search.test

This file was added.

## This test checks the case when address is specified by a symbol name rather

## than a number.

jhendersonUnsubmitted

Not Done

You're using a weird mix of comment markers in this file. The standard rules in newer tools tests are

## for actual comments;
# for RUN and CHECK lines;
all comment markers should have a space between them and the rest of the line (e.g. # CHECK: or ## This is a comment).
optionally, if the test consists solely of things prefixed with a comment marker, you can drop one # from each of the above, but it's more common to have them than not.

In addition, this test should have a comment explaining what the test is testing, probably before this comment.

jhenderson: You're using a weird mix of comment markers in this file. The standard rules in newer tools…

sepavloffAuthorUnsubmitted

Done

Thank you for eplanations. I modernized also sym.test, it didn't follow these rules.

sepavloff: Thank you for eplanations. I modernized also sym.test, it didn't follow these rules.

jhendersonUnsubmitted

Not Done

Can I ask you to spin off sym.test's updates into a new review? I've got no objection to the update (sym.test itself is a little bit archaic), but it's not really related to this patch.

Also "when address" -> "when an address"

jhenderson: Can I ask you to spin off sym.test's updates into a new review? I've got no objection to the…

## It uses ELF shared object `Inputs/symbols.so` built for x86_64 using

## the instructions from `Inputs/symbols.h`.

jhendersonUnsubmitted

Done

I can't tell what "PRFUNC" is supposed to stand for. It might be useful to add a comment before each test case explaining what that specific case is testing.

jhenderson: I can't tell what "PRFUNC" is supposed to stand for. It might be useful to add a comment before…

jhendersonUnsubmitted

Not Done

The individual sub cases within this test file could still do with some comments to make their purpose clearer. I think I follow it better now with the recent improvements, but more clarity would help.

jhenderson: The individual sub cases within this test file could still do with some comments to make their…

# RUN: llvm-addr2line --obj=%p/Inputs/symbols.so "CODE func_01" | FileCheck --check-prefix=CODE_CMD %s

# RUN: llvm-symbolizer --obj=%p/Inputs/symbols.so "CODE func_01" | FileCheck --check-prefix=CODE_CMD %s

jhendersonUnsubmitted

Not Done

Prefer a slightly longer form that says something like "Show that the "CODE" command supports search by symbol name." Same goes for below comments.

jhenderson: Prefer a slightly longer form that says something like "Show that the "CODE" command supports…

# CODE_CMD: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:12

jhendersonUnsubmitted

Not Done

I forgot to mention that I prefer - to _ in prefix names. It avoids weird awkardness like CODE_CMD-NEXT for example, and - is easier to type on an English keyboard :D

jhenderson: I forgot to mention that I prefer `-` to `_` in prefix names. It avoids weird awkardness like…

jhendersonUnsubmitted

Not Done

See my above comment: as this comment is tightly tied to the test case that immediately follows, it doesn't need a blank line (and regardless, blank lines generally don't have a comment marker, unless they are separating paragraphs within a longer comment).

jhenderson: See my above comment: as this comment is tightly tied to the test case that immediately follows…

jhendersonUnsubmitted

Done

Not addressed. For a concrete example, I personally recommend the following format:

# Comment
RUN: ...
RUN: ...

CHECK: ...
CHEC-NEXT:

# Next test case comment
RUN: ...

jhenderson: Not addressed. For a concrete example, I personally recommend the following format: ``` #…

jhendersonUnsubmitted

Not Done

Why is this test case epeated twice?

jhenderson: Why is this test case epeated twice?

sepavloffAuthorUnsubmitted

Done

It should be llvm-addr2line and llvm-symbolizer variants. Fixed.

sepavloff: It should be `llvm-addr2line` and `llvm-symbolizer` variants. Fixed.

# RUN: llvm-addr2line -e %p/Inputs/symbols.so func_01 | FileCheck --check-prefix=SYMB %s

# RUN: llvm-symbolizer -e %p/Inputs/symbols.so func_01 | FileCheck --check-prefix=SYMB %s

# SYMB: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:12

jhendersonUnsubmitted

Not Done

CODE-CMD: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:12

- # Check if a symbol name can be specified in command line.

+ # Check if a symbol name can be specified on the command-line.

RUN: llvm-addr2line -e %p/Inputs/symbols.so func_01 | FileCheck --check-prefix=SYMB %s

You don't specify things "in" a command line, you specify them "on the command-line."

jhenderson: You don't specify things "in" a command line, you specify them "on the command-line."

jhendersonUnsubmitted

Not Done

I think canoncially it should be "command-line" not "command line" (this is based on what someone from our docs team once told me).

jhenderson: I think canoncially it should be "command-line" not "command line" (this is based on what…

# RUN: llvm-addr2line -e %p/Inputs/symbols.so static_func | FileCheck --check-prefix=SYMB_MULTI %s

jhendersonUnsubmitted

Not Done

I don't think you should reference GNU here - that's a motivation for the behaviour, but not for the test case for that behaviour. Rather, what this comment could say is "Show that llvm-addr2line and llvm-symbolizer accept symbol names on the command-line."

jhenderson: I don't think you should reference GNU here - that's a motivation for the behaviour, but not…

jhendersonUnsubmitted

Done

Not addressed.

jhenderson: Not addressed.

# SYMB_MULTI: /tmp/dbginfo{{[/\]+}}symbols.part3.c:4

# SYMB_MULTI-NEXT: /tmp/dbginfo{{[/\]+}}symbols.part4.c:4

jhendersonUnsubmitted

Done

It would be better for both "nonexistent" cases to have a suffix, rather than just the llvm-symbolizer one.

jhenderson: It would be better for both "nonexistent" cases to have a suffix, rather than just the llvm…

jhendersonUnsubmitted

Not Done

I'm not sure this comment needed changing. The previous version was good: "If symbol has a space in its name, ignore everything after it." though I might change "after it" to "from the space onwards."

You could add "Check that" to the start too, so the final thing might look like:
"Check that if a symbol has a space in its name, ignore everything from the space onwards."

jhenderson: I'm not sure this comment needed changing. The previous version was good: "If symbol has a…

# RUN: llvm-addr2line --obj=%p/Inputs/symbols.so func_666 | FileCheck --check-prefix=NONEXISTENT_ADDR %s

jhendersonUnsubmitted

Not Done

ADDR is probably not a good abbreviation for addr2line since it could be confused with address which is a a very relevant term to these tests. Perhaps A2L? Apologies for the churn.

jhenderson: `ADDR` is probably not a good abbreviation for `addr2line` since it could be confused with…

# NONEXISTENT_ADDR: ??

jhendersonUnsubmitted

Not Done

# SYMB: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:12

- ## A symbol name may be resolved into more than one location.

+ ## Show that a symbol name may be resolved to more than one location.

# RUN: llvm-addr2line -e %p/Inputs/symbols.so static_func | FileCheck --check-prefix=SYMB-MULTI %s

jhenderson:

# RUN: llvm-symbolizer --obj=%p/Inputs/symbols.so func_666 | FileCheck --check-prefix=NONEXISTENT_LLVM %s

# NONEXISTENT_LLVM: func_666

# RUN: llvm-addr2line --obj=%p/Inputs/symbols.so func_01 func_02 | FileCheck --check-prefix=FUNCS %s

jhendersonUnsubmitted

Not Done

RUN: llvm-symbolizer -e %p/Inputs/symbols.so "func_01 ignored text" | FileCheck --check-prefix=SYMB %s

- # A symbol name may be resolved into more than one location.

+ # A symbol name may be resolved to more than one location.

RUN: llvm-addr2line -e %p/Inputs/symbols.so static_func | FileCheck --check-prefix=SYMB-MULTI %s

jhenderson:

# RUN: llvm-symbolizer --obj=%p/Inputs/symbols.so func_01 func_02 | FileCheck --check-prefix=FUNCS %s

# FUNCS: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:12

jhendersonUnsubmitted

Not Done

"Show that if a symbol ..."

jhenderson: "Show that if a symbol ..."

# FUNCS: /tmp/dbginfo{{[/\]+}}symbols.part2.cpp:10

# RUN: llvm-addr2line --obj=%p/Inputs/symbols.so _ZL14static_func_01i | FileCheck --check-prefix=MULTI_CXX %s

# RUN: llvm-symbolizer --obj=%p/Inputs/symbols.so _ZL14static_func_01i | FileCheck --check-prefix=MULTI_CXX %s

# MULTI_CXX: /tmp/dbginfo{{[/\]+}}symbols.part1.cpp:7

jhendersonUnsubmitted

Not Done

I think I might be getting confused between different patches, but looking at this again, my feeling is that we don't really want this divergence in behaviour between GNU and LLVM mode, unless there's a strong motivation for the LLVM behaviour style. I might be inclined to create a different precursor patch that unifies the two, so that they do what GNU addr2line does.

jhenderson: I think I might be getting confused between different patches, but looking at this again, my…

# MULTI_CXX: /tmp/dbginfo{{[/\]+}}symbols.part2.cpp:5

jhendersonUnsubmitted

Not Done

NONEXISTENT: ??

- # More than one symbols may be specified.

+ # More than one symbol may be specified.

RUN: llvm-addr2line --obj=%p/Inputs/symbols.so func_01 func_02 | FileCheck --check-prefix=FUNCS %s

jhenderson:

jhendersonUnsubmitted

Not Done

Strictly speaking, using the --obj option to provide the binary file is also specifying it on the command-line. I think this comment should be a little more specific (assuming it matters). I'd also use "input file" rather than "binary file" to clarify the file's purpose. Something like: "Show that both the symbol and input file can be specified in the search string on the command-line." or something like that.

Similar comments apply below.

jhenderson: Strictly speaking, using the `--obj` option to provide the binary file is also specifying it on…

jhendersonUnsubmitted

Not Done

Nit: here and below, add a space after >2 to make it stand out more.

jhenderson: Nit: here and below, add a space after `>2` to make it stand out more.

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

Show First 20 Lines • Show All 132 Lines • ▼ Show 20 Lines Symbolizer.setBuildIDFetcher(std::make_unique<DebuginfodFetcher>(

Args.getAllArgValues(OPT_debug_file_directory_EQ))); Args.getAllArgValues(OPT_debug_file_directory_EQ)));

// The HTTPClient must be initialized for use by the debuginfod client. // The HTTPClient must be initialized for use by the debuginfod client.

HTTPClient::initialize(); HTTPClient::initialize();

} }

static bool parseCommand(StringRef BinaryName, bool IsAddr2Line, static bool parseCommand(StringRef BinaryName, bool IsAddr2Line,

StringRef InputString, Command &Cmd, StringRef InputString, Command &Cmd,

std::string &ModuleName, object::BuildID &BuildID, std::string &ModuleName, object::BuildID &BuildID,

uint64_t &ModuleOffset) { StringRef &Symbol, uint64_t &ModuleOffset) {

const char kDelimiters[] = " \n\r"; const char kDelimiters[] = " \n\r";

ModuleName = ""; ModuleName = "";

if (InputString.consume_front("CODE ")) { if (InputString.consume_front("CODE ")) {

Cmd = Command::Code; Cmd = Command::Code;

} else if (InputString.consume_front("DATA ")) { } else if (InputString.consume_front("DATA ")) {

Cmd = Command::Data; Cmd = Command::Data;

} else if (InputString.consume_front("FRAME ")) { } else if (InputString.consume_front("FRAME ")) {

Cmd = Command::Frame; Cmd = Command::Frame;

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines if (HasBuildIDPrefix) {

if (BuildID.empty()) if (BuildID.empty())

return false; return false;

ModuleName.clear(); ModuleName.clear();

} }

} else { } else {

Pos = InputString.data(); Pos = InputString.data();

ModuleName = BinaryName.str(); ModuleName = BinaryName.str();

} }

// Skip delimiters and parse module offset.

// Parse address, which can be specified as a module offset or as a

jhendersonUnsubmitted

Done

ModuleName = BinaryName.str();

}

- // Parse address, which can be specified as an offset in module or as a

+ // Parse address, which can be specified as a module offset or as a

// symbol.

jhenderson:

// symbol.

Pos += strspn(Pos, kDelimiters); Pos += strspn(Pos, kDelimiters);

int OffsetLength = strcspn(Pos, kDelimiters); int OffsetLength = strcspn(Pos, kDelimiters);

StringRef Offset(Pos, OffsetLength); StringRef Offset(Pos, OffsetLength);

// GNU addr2line assumes the offset is hexadecimal and allows a redundant // GNU addr2line assumes the offset is hexadecimal and allows a redundant

// "0x" or "0X" prefix; do the same for compatibility. // "0x" or "0X" prefix; do the same for compatibility.

bool StartsWithHexPrefix = false;

jhendersonUnsubmitted

Done

This whole area of code could do with some blank lines to aid readability. I would suggest that you have them between each group of related lines, where "related" means a comment, one if statement, and any lines strongly related to that if statement (e.g. variable declarations). For example, I'd add a line before the comment about 0x prefixes, and another one after the if's closing brace.

jhenderson: This whole area of code could do with some blank lines to aid readability. I would suggest that…

jhendersonUnsubmitted

Done

StartsWithHexPrefix appears to be unused?

jhenderson: `StartsWithHexPrefix` appears to be unused?

if (IsAddr2Line) if (IsAddr2Line)

StartsWithHexPrefix =

Offset.consume_front("0x") || Offset.consume_front("0X"); Offset.consume_front("0x") || Offset.consume_front("0X");

return !Offset.getAsInteger(IsAddr2Line ? 16 : 0, ModuleOffset); if (!Offset.getAsInteger(IsAddr2Line ? 16 : 0, ModuleOffset)) {

// Address specification is a module offset.

Symbol = StringRef();

return true;

}

// Recognize the cases when address specification is absent or invalid.

if (Offset.empty() || StartsWithHexPrefix || std::isdigit(Offset.front()))

return false;

// Address in executable code may be specified as a symbol name.

if (Cmd != Command::Code)

ikudrinUnsubmitted

Done

I don't think we need a new point of divergence here. It's unlikely that anyone would rely on the tool to generate an error for such an input.

ikudrin: I don't think we need a new point of divergence here. It's unlikely that anyone would rely on…

jhendersonUnsubmitted

Done

+1: I don't think we need to handle invalid input differently between llvm-symbolizer and llvm-addr2line specifically, as long as the updated behaviour makes sense.

jhenderson: +1: I don't think we need to handle invalid input differently between llvm-symbolizer and llvm…

return false;

// Address specification is a symbol if addr2line compatibility mode in on.

// Otherwise treat it as an error for compatibility with previous versions of

// llvm-symbolizer.

Symbol = Offset;

ModuleOffset = 0;

return true;

} }

template <typename T> template <typename T>

void executeCommand(StringRef ModuleName, const T &ModuleSpec, Command Cmd, void executeCommand(StringRef ModuleName, const T &ModuleSpec, Command Cmd,

uint64_t Offset, uint64_t AdjustVMA, bool ShouldInline, StringRef Symbol, uint64_t Offset, uint64_t AdjustVMA,

OutputStyle Style, LLVMSymbolizer &Symbolizer, bool ShouldInline, OutputStyle Style,

DIPrinter &Printer) { LLVMSymbolizer &Symbolizer, DIPrinter &Printer) {

uint64_t AdjustedOffset = Offset - AdjustVMA; uint64_t AdjustedOffset = Offset - AdjustVMA;

object::SectionedAddress Address = {AdjustedOffset, object::SectionedAddress Address = {AdjustedOffset,

object::SectionedAddress::UndefSection}; object::SectionedAddress::UndefSection};

Request SymRequest = {ModuleName, Offset}; Request SymRequest = {ModuleName, Offset, Symbol};

if (Cmd == Command::Data) { if (Cmd == Command::Data) {

Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData(ModuleSpec, Address); Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData(ModuleSpec, Address);

print(SymRequest, ResOrErr, Printer); print(SymRequest, ResOrErr, Printer);

} else if (Cmd == Command::Frame) { } else if (Cmd == Command::Frame) {

Expected<std::vector<DILocal>> ResOrErr = Expected<std::vector<DILocal>> ResOrErr =

Symbolizer.symbolizeFrame(ModuleSpec, Address); Symbolizer.symbolizeFrame(ModuleSpec, Address);

print(SymRequest, ResOrErr, Printer); print(SymRequest, ResOrErr, Printer);

} else if (!Symbol.empty()) {

Expected<std::vector<DILineInfo>> ResOrErr =

Symbolizer.findSymbol(ModuleSpec, Symbol);

print(SymRequest, ResOrErr, Printer);

} else if (ShouldInline) { } else if (ShouldInline) {

Expected<DIInliningInfo> ResOrErr = Expected<DIInliningInfo> ResOrErr =

Symbolizer.symbolizeInlinedCode(ModuleSpec, Address); Symbolizer.symbolizeInlinedCode(ModuleSpec, Address);

print(SymRequest, ResOrErr, Printer); print(SymRequest, ResOrErr, Printer);

} else if (Style == OutputStyle::GNU) { } else if (Style == OutputStyle::GNU) {

// With PrintFunctions == FunctionNameKind::LinkageName (default) // With PrintFunctions == FunctionNameKind::LinkageName (default)

// and UseSymbolTable == true (also default), Symbolizer.symbolizeCode() // and UseSymbolTable == true (also default), Symbolizer.symbolizeCode()

// may override the name of an inlined function with the name of the topmost // may override the name of an inlined function with the name of the topmost

Show All 20 Lines static void symbolizeInput(const opt::InputArgList &Args,

object::BuildIDRef IncomingBuildID, object::BuildIDRef IncomingBuildID,

uint64_t AdjustVMA, bool IsAddr2Line, uint64_t AdjustVMA, bool IsAddr2Line,

OutputStyle Style, StringRef InputString, OutputStyle Style, StringRef InputString,

LLVMSymbolizer &Symbolizer, DIPrinter &Printer) { LLVMSymbolizer &Symbolizer, DIPrinter &Printer) {

Command Cmd; Command Cmd;

std::string ModuleName; std::string ModuleName;

object::BuildID BuildID(IncomingBuildID.begin(), IncomingBuildID.end()); object::BuildID BuildID(IncomingBuildID.begin(), IncomingBuildID.end());

uint64_t Offset = 0; uint64_t Offset = 0;

StringRef Symbol;

if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line, if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line,

StringRef(InputString), Cmd, ModuleName, BuildID, Offset)) { StringRef(InputString), Cmd, ModuleName, BuildID, Symbol,

Printer.printInvalidCommand({ModuleName, std::nullopt}, InputString); Offset)) {

Printer.printInvalidCommand({ModuleName, std::nullopt, Symbol},

InputString);

return; return;

} }

bool ShouldInline = Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line); bool ShouldInline = Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line);

if (!BuildID.empty()) { if (!BuildID.empty()) {

assert(ModuleName.empty()); assert(ModuleName.empty());

if (!Args.hasArg(OPT_no_debuginfod)) if (!Args.hasArg(OPT_no_debuginfod))

enableDebuginfod(Symbolizer, Args); enableDebuginfod(Symbolizer, Args);

std::string BuildIDStr = toHex(BuildID); std::string BuildIDStr = toHex(BuildID);

executeCommand(BuildIDStr, BuildID, Cmd, Offset, AdjustVMA, ShouldInline, executeCommand(BuildIDStr, BuildID, Cmd, Symbol, Offset, AdjustVMA,

Style, Symbolizer, Printer); ShouldInline, Style, Symbolizer, Printer);

} else { } else {

executeCommand(ModuleName, ModuleName, Cmd, Offset, AdjustVMA, ShouldInline, executeCommand(ModuleName, ModuleName, Cmd, Symbol, Offset, AdjustVMA,

Style, Symbolizer, Printer); ShouldInline, Style, Symbolizer, Printer);

} }

static void printHelp(StringRef ToolName, const SymbolizerOptTable &Tbl, static void printHelp(StringRef ToolName, const SymbolizerOptTable &Tbl,

raw_ostream &OS) { raw_ostream &OS) {

const char HelpText[] = " [options] addresses..."; const char HelpText[] = " [options] addresses...";

Tbl.printHelp(OS, (ToolName + HelpText).str().c_str(), Tbl.printHelp(OS, (ToolName + HelpText).str().c_str(),

ToolName.str().c_str()); ToolName.str().c_str());

▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines

#endif #endif

Opts.UseSymbolTable = true; Opts.UseSymbolTable = true;

if (Args.hasArg(OPT_cache_size_EQ)) if (Args.hasArg(OPT_cache_size_EQ))

parseIntArg(Args, OPT_cache_size_EQ, Opts.MaxCacheSize); parseIntArg(Args, OPT_cache_size_EQ, Opts.MaxCacheSize);

Config.PrintAddress = Args.hasArg(OPT_addresses); Config.PrintAddress = Args.hasArg(OPT_addresses);

Config.PrintFunctions = Opts.PrintFunctions != FunctionNameKind::None; Config.PrintFunctions = Opts.PrintFunctions != FunctionNameKind::None;

Config.Pretty = Args.hasArg(OPT_pretty_print); Config.Pretty = Args.hasArg(OPT_pretty_print);

Config.Verbose = Args.hasArg(OPT_verbose); Config.Verbose = Args.hasArg(OPT_verbose);

jhendersonUnsubmitted

Done

This should be controlled by the output-style command-line option (see my comment elsewhere).

jhenderson: This should be controlled by the output-style command-line option (see my comment elsewhere).

sepavloffAuthorUnsubmitted

Done

Initialization of Config.IsGNUStyle is now below in this function.

sepavloff: Initialization of `Config.IsGNUStyle` is now below in this function.

for (const opt::Arg *A : Args.filtered(OPT_dsym_hint_EQ)) { for (const opt::Arg *A : Args.filtered(OPT_dsym_hint_EQ)) {

StringRef Hint(A->getValue()); StringRef Hint(A->getValue());

if (sys::path::extension(Hint) == ".dSYM") { if (sys::path::extension(Hint) == ".dSYM") {

Opts.DsymHints.emplace_back(Hint); Opts.DsymHints.emplace_back(Hint);

} else { } else {

errs() << "Warning: invalid dSYM hint: \"" << Hint errs() << "Warning: invalid dSYM hint: \"" << Hint

<< "\" (must have the '.dSYM' extension).\n"; << "\" (must have the '.dSYM' extension).\n";

} }

Show All 13 Lines #endif

if (const opt::Arg *A = Args.getLastArg(OPT_output_style_EQ)) { if (const opt::Arg *A = Args.getLastArg(OPT_output_style_EQ)) {

if (strcmp(A->getValue(), "GNU") == 0) if (strcmp(A->getValue(), "GNU") == 0)

Style = OutputStyle::GNU; Style = OutputStyle::GNU;

else if (strcmp(A->getValue(), "JSON") == 0) else if (strcmp(A->getValue(), "JSON") == 0)

Style = OutputStyle::JSON; Style = OutputStyle::JSON;

else else

Style = OutputStyle::LLVM; Style = OutputStyle::LLVM;

} }

Config.IsGNUStyle = Style == OutputStyle::GNU;

if (Args.hasArg(OPT_build_id_EQ) && Args.hasArg(OPT_obj_EQ)) { if (Args.hasArg(OPT_build_id_EQ) && Args.hasArg(OPT_obj_EQ)) {

errs() << "error: cannot specify both --build-id and --obj\n"; errs() << "error: cannot specify both --build-id and --obj\n";

return EXIT_FAILURE; return EXIT_FAILURE;

} }

object::BuildID BuildID = parseBuildIDArg(Args, OPT_build_id_EQ); object::BuildID BuildID = parseBuildIDArg(Args, OPT_build_id_EQ);

std::unique_ptr<DIPrinter> Printer; std::unique_ptr<DIPrinter> Printer;

if (Style == OutputStyle::GNU) if (Style == OutputStyle::GNU)

Printer = std::make_unique<GNUPrinter>(outs(), printError, Config); Printer = std::make_unique<GNUPrinter>(outs(), printError, Config);

else if (Style == OutputStyle::JSON) else if (Style == OutputStyle::JSON)

Printer = std::make_unique<JSONPrinter>(outs(), Config); Printer = std::make_unique<JSONPrinter>(outs(), Config);

else else

Printer = std::make_unique<LLVMPrinter>(outs(), printError, Config); Printer = std::make_unique<LLVMPrinter>(outs(), printError, Config);

// When an input file is specified, exit immediately if the file cannot be // When an input file is specified, exit immediately if the file cannot be

// read. If getOrCreateModuleInfo succeeds, symbolizeInput will reuse the // read. If getOrCreateModuleInfo succeeds, symbolizeInput will reuse the

// cached file handle. // cached file handle.

if (auto *Arg = Args.getLastArg(OPT_obj_EQ); Arg && IsAddr2Line) { if (auto *Arg = Args.getLastArg(OPT_obj_EQ); Arg && IsAddr2Line) {

auto Status = Symbolizer.getOrCreateModuleInfo(Arg->getValue()); auto Status = Symbolizer.getOrCreateModuleInfo(Arg->getValue());

if (!Status) { if (!Status) {

Request SymRequest = {Arg->getValue(), 0}; Request SymRequest = {Arg->getValue(), 0, StringRef()};

handleAllErrors(Status.takeError(), [&](const ErrorInfoBase &EI) { handleAllErrors(Status.takeError(), [&](const ErrorInfoBase &EI) {

Printer->printError(SymRequest, EI); Printer->printError(SymRequest, EI);

}); });

return EXIT_FAILURE; return EXIT_FAILURE;

} }

std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT); std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT);

Show All 23 Lines

llvm/unittests/ProfileData/MemProfTest.cpp

Show All 18 Lines

namespace {		namespace {

using ::llvm::DIGlobal;		using ::llvm::DIGlobal;
using ::llvm::DIInliningInfo;		using ::llvm::DIInliningInfo;
using ::llvm::DILineInfo;		using ::llvm::DILineInfo;
using ::llvm::DILineInfoSpecifier;		using ::llvm::DILineInfoSpecifier;
using ::llvm::DILocal;		using ::llvm::DILocal;
		using ::llvm::StringRef;
using ::llvm::memprof::CallStackMap;		using ::llvm::memprof::CallStackMap;
using ::llvm::memprof::Frame;		using ::llvm::memprof::Frame;
using ::llvm::memprof::FrameId;		using ::llvm::memprof::FrameId;
using ::llvm::memprof::IndexedMemProfRecord;		using ::llvm::memprof::IndexedMemProfRecord;
using ::llvm::memprof::MemInfoBlock;		using ::llvm::memprof::MemInfoBlock;
using ::llvm::memprof::MemProfRecord;		using ::llvm::memprof::MemProfRecord;
using ::llvm::memprof::MemProfSchema;		using ::llvm::memprof::MemProfSchema;
using ::llvm::memprof::Meta;		using ::llvm::memprof::Meta;
Show All 16 Lines	virtual DILineInfo symbolizeCode(SectionedAddress, DILineInfoSpecifier,
llvm_unreachable("unused");		llvm_unreachable("unused");
}		}
virtual DIGlobal symbolizeData(SectionedAddress) const {		virtual DIGlobal symbolizeData(SectionedAddress) const {
llvm_unreachable("unused");		llvm_unreachable("unused");
}		}
virtual std::vector<DILocal> symbolizeFrame(SectionedAddress) const {		virtual std::vector<DILocal> symbolizeFrame(SectionedAddress) const {
llvm_unreachable("unused");		llvm_unreachable("unused");
}		}
		virtual std::vector<SectionedAddress> findSymbol(StringRef Symbol) const {
		llvm_unreachable("unused");
		}
virtual bool isWin32Module() const { llvm_unreachable("unused"); }		virtual bool isWin32Module() const { llvm_unreachable("unused"); }
virtual uint64_t getModulePreferredBase() const {		virtual uint64_t getModulePreferredBase() const {
llvm_unreachable("unused");		llvm_unreachable("unused");
}		}
};		};

struct MockInfo {		struct MockInfo {
std::string FunctionName;		std::string FunctionName;
▲ Show 20 Lines • Show All 295 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[symbolizer] Support symbol lookupClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 538185

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm/docs/ReleaseNotes.rst

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h

llvm/include/llvm/DebugInfo/Symbolize/SymbolizableModule.h

llvm/include/llvm/DebugInfo/Symbolize/SymbolizableObjectFile.h

llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp

llvm/lib/DebugInfo/Symbolize/SymbolizableObjectFile.cpp

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp

llvm/test/tools/llvm-symbolizer/Inputs/addr.inp

llvm/test/tools/llvm-symbolizer/Inputs/discrim.inp

llvm/test/tools/llvm-symbolizer/debuginfod.test

llvm/test/tools/llvm-symbolizer/flag-grouping.test

llvm/test/tools/llvm-symbolizer/flush-output.s

llvm/test/tools/llvm-symbolizer/invalid-input-address.test

llvm/test/tools/llvm-symbolizer/output-style-empty-line.test

llvm/test/tools/llvm-symbolizer/output-style-json-code.test

llvm/test/tools/llvm-symbolizer/sym-verbose.test

llvm/test/tools/llvm-symbolizer/sym.test

llvm/test/tools/llvm-symbolizer/symbol-search.test

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

llvm/unittests/ProfileData/MemProfTest.cpp

[symbolizer] Support symbol lookup
ClosedPublic