This is an archive of the discontinued LLVM Phabricator instance.

Differential D119901

[Debuginfod] Add BUILD_ID syntax to llvm-symbolizer.
Closed, Public

Authored by mysterymath on Feb 15 2022, 3:28 PM.


Details

Reviewers

phosek
mcgrathr
jhenderson

Commits

rG565add5a628b: [Debuginfod] Add BUILD_ID syntax to llvm-symbolizer.

Summary

This adds a BUILD_ID prefix to the llvm-symbolizer stdin and argument
syntax. The prefix causes the given binary name to be interpreted as a
build ID instead of an object file path. The semantics are analogous to
the behavior of --obj and --build-id.
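
A minimal standalone sketch of the idea, written against the C++ standard library rather than LLVM's ADT: a name is treated as a path unless it carries a build ID prefix, in which case the remainder is decoded from hex. The `FILE:` and `BUILDID:` spellings follow the documentation later in this revision. `InputName`, `hexValue`, `decodeHex`, `consumeFront`, and `classifyInputName` are hypothetical names used only for illustration, and padding odd-length hex with a leading zero is an assumption of this sketch rather than a statement about the patch.

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Hypothetical result type for this sketch; not an llvm-symbolizer type.
struct InputName {
  bool IsBuildID = false;
  std::string Path;              // Filled in when IsBuildID is false.
  std::vector<uint8_t> BuildID;  // Filled in when IsBuildID is true.
};

// Returns the value of one hex digit, or -1 if C is not a hex digit.
static int hexValue(char C) {
  if (C >= '0' && C <= '9') return C - '0';
  if (C >= 'a' && C <= 'f') return C - 'a' + 10;
  if (C >= 'A' && C <= 'F') return C - 'A' + 10;
  return -1;
}

// Decodes a hex string into bytes. An odd-length string is treated as if it
// had a leading '0' (an assumption of this sketch). Returns std::nullopt for
// empty input or any non-hex character.
static std::optional<std::vector<uint8_t>> decodeHex(std::string_view Hex) {
  if (Hex.empty())
    return std::nullopt;
  std::vector<uint8_t> Bytes;
  std::size_t I = 0;
  if (Hex.size() % 2 == 1) {
    int Lo = hexValue(Hex[0]);
    if (Lo < 0)
      return std::nullopt;
    Bytes.push_back(static_cast<uint8_t>(Lo));
    I = 1;
  }
  for (; I < Hex.size(); I += 2) {
    int Hi = hexValue(Hex[I]);
    int Lo = hexValue(Hex[I + 1]);
    if (Hi < 0 || Lo < 0)
      return std::nullopt;
    Bytes.push_back(static_cast<uint8_t>(Hi * 16 + Lo));
  }
  return Bytes;
}

// Strips Prefix from the front of S if present; reports whether it did.
static bool consumeFront(std::string_view &S, std::string_view Prefix) {
  if (S.substr(0, Prefix.size()) != Prefix)
    return false;
  S.remove_prefix(Prefix.size());
  return true;
}

// Classifies a single input name. "BUILDID:" switches to build ID
// interpretation, "FILE:" is accepted and dropped, and anything else is
// taken as a path unchanged.
static std::optional<InputName> classifyInputName(std::string_view Name) {
  InputName Result;
  if (consumeFront(Name, "BUILDID:")) {
    auto Bytes = decodeHex(Name);
    if (!Bytes)
      return std::nullopt;
    Result.IsBuildID = true;
    Result.BuildID = std::move(*Bytes);
    return Result;
  }
  consumeFront(Name, "FILE:");
  Result.Path = std::string(Name);
  return Result;
}
```

With this sketch, `classifyInputName("FILE:inlined.elf")` yields the path `inlined.elf`, while `classifyInputName("BUILDID:xyz")` yields no result because `xyz` is not valid hex.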

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	60,090 ms	x64 debian > AddressSanitizer-x86_64-linux.TestCases::scariness_score_test.cpp
	60,140 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics::vloxseg.c
	60,160 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics::vluxseg.c
	60,170 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-overloaded::vloxseg.c
	60,160 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-overloaded::vluxseg.c

Event Timeline

mysterymath created this revision. Feb 15 2022, 3:28 PM

Herald added a reviewer: jhenderson. Feb 15 2022, 3:28 PM

Herald added a subscriber: rupprecht.

mysterymath requested review of this revision. Feb 15 2022, 3:28 PM

Herald added a project: Restricted Project. Feb 15 2022, 3:28 PM

Herald added subscribers: llvm-commits, MaskRay.

Harbormaster completed remote builds in B149849: Diff 409079. Feb 15 2022, 6:09 PM

I'm not at all familiar with the debuginfod stuff, so this probably needs a second pair of eyes from someone with more knowledge in this area.

Do you need a test-case where BUILD_ID is used and --obj is specified?

llvm/docs/CommandGuide/llvm-symbolizer.rst
34	I have a marginal preference for `BUILDID` (without the underscore), as I dislike typing underscores. I don't feel strongly about this though, so if you prefer with the underscore, that's fine. The use of the term "object file" seems a bit unintuitive here though, since a build ID is neither an object file itself, nor a path to one. Perhaps it would be better to rephrase the references to "object file" earlier in this section with "input name" or similar, then here, I'd start this paragraph with something like "By default, input names are interpreted as object file paths. However, prefixing the command with ...". Finally, I'd then put this paragraph second or third in order in this section.
179	It would be good to have a test case that tests the interaction of `BUILD_ID` and `DATA` and/or `CODE`.
llvm/test/tools/llvm-symbolizer/debuginfod.test
54	Add some indentation to make the output line up, as if it were on the command-line.

phosek added inline comments. Feb 16 2022, 10:23 AM

llvm/docs/CommandGuide/llvm-symbolizer.rst
34	Could we reverse the order and instead do something like:

	[CODE] [FILE:]<file> <address>
	DATA [FILE:]<file> <address>
	[CODE] BUILDID:<build ID> <address>
	DATA BUILDID:<build ID> <address>

	This would avoid the confusion between file and build ID.
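
To make the proposed line syntax concrete, here is a rough standalone sketch that splits one input line into an optional `CODE`/`DATA` command, an input name with an optional `FILE:` or `BUILDID:` prefix, and an address. It is not the parser from the patch: `Cmd`, `NameKind`, `ParsedLine`, and `parseLine` are hypothetical names, tokens are split on whitespace only (so quoted names containing spaces are not handled), and the address is kept as text.

```cpp
#include <cstddef>
#include <optional>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical types for this sketch.
enum class Cmd { Code, Data };
enum class NameKind { File, BuildID };

struct ParsedLine {
  Cmd Command = Cmd::Code;         // CODE is assumed when no command is given.
  NameKind Kind = NameKind::File;  // A bare name is an object file path.
  std::string Name;                // Path or hex build ID, prefix removed.
  std::string Address;             // Left as text; numeric parsing is omitted.
};

static std::optional<ParsedLine> parseLine(const std::string &Line) {
  // Split the line on whitespace.
  std::istringstream In(Line);
  std::vector<std::string> Tokens;
  for (std::string T; In >> T;)
    Tokens.push_back(T);

  ParsedLine Result;
  std::size_t I = 0;
  if (I < Tokens.size() && (Tokens[I] == "CODE" || Tokens[I] == "DATA")) {
    Result.Command = Tokens[I] == "DATA" ? Cmd::Data : Cmd::Code;
    ++I;
  }
  // Exactly two tokens must remain: the input name and the address.
  if (Tokens.size() - I != 2)
    return std::nullopt;

  std::string Name = Tokens[I];
  const std::string FilePrefix = "FILE:";
  const std::string BuildIDPrefix = "BUILDID:";
  if (Name.compare(0, BuildIDPrefix.size(), BuildIDPrefix) == 0) {
    Result.Kind = NameKind::BuildID;
    Name.erase(0, BuildIDPrefix.size());
  } else if (Name.compare(0, FilePrefix.size(), FilePrefix) == 0) {
    Name.erase(0, FilePrefix.size());
  }
  if (Name.empty())
    return std::nullopt;

  Result.Name = Name;
  Result.Address = Tokens[I + 1];
  return Result;
}
```

Putting the command first and the name prefix directly on the name keeps the two concerns separate: the command says how to symbolize the address, and the prefix says how to find the binary.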

Addressed review feedback.

Harbormaster completed remote builds in B150088: Diff 409421. Feb 16 2022, 4:33 PM

jhenderson added inline comments. Feb 17 2022, 1:10 AM

llvm/docs/CommandGuide/llvm-symbolizer.rst
113–121	Rather than add `FILE:` prefixes here, I'd instead modify Example 4 to say "BUILDID and FILE prefixes" or similar, like Example 5's "CODE and DATA prefixes".
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
149–150	This piece of code makes me wonder whether a test case with something like `FILE:BUILDID:<hex>` and/or `BUILDID:FILE:<path>` or similar might be useful. Not sure either way.

Update docs.
The semantics of mixing multiple input name prefixes aren't obvious, so detect all combinations of more than one prefix and reject the line as a syntax error.
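
One way such a check could look, as a standalone sketch rather than the code that was committed (the updated diff is not shown on this page): count how many known prefixes appear back to back at the start of the name and treat more than one as an error. `kNamePrefixes`, `startsWith`, and `hasMultiplePrefixes` are hypothetical names.

```cpp
#include <array>
#include <string_view>

// Prefix spellings taken from the documentation in this revision.
constexpr std::array<std::string_view, 2> kNamePrefixes = {"FILE:", "BUILDID:"};

static bool startsWith(std::string_view S, std::string_view Prefix) {
  return S.substr(0, Prefix.size()) == Prefix;
}

// Hypothetical helper: returns true if Name begins with two or more
// input-name prefixes in a row, e.g. "FILE:BUILDID:abcd".
static bool hasMultiplePrefixes(std::string_view Name) {
  int Count = 0;
  bool Consumed = true;
  while (Consumed) {
    Consumed = false;
    for (std::string_view P : kNamePrefixes) {
      if (startsWith(Name, P)) {
        Name.remove_prefix(P.size());
        ++Count;
        Consumed = true;
        break;
      }
    }
  }
  return Count > 1;
}
```

A caller would reject the whole input line when `hasMultiplePrefixes` returns true. One consequence of this approach is that a file whose name literally starts with `FILE:` or `BUILDID:` cannot be given with an explicit `FILE:` prefix.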

mysterymath marked 2 inline comments as done. Feb 17 2022, 2:17 PM

Harbormaster completed remote builds in B150332: Diff 409792. Feb 17 2022, 3:27 PM

LGTM!

This revision is now accepted and ready to land. Feb 18 2022, 12:47 AM

This revision was landed with ongoing or failed builds. Feb 24 2022, 4:39 PM

Closed by commit rG565add5a628b: [Debuginfod] Add BUILD_ID syntax to llvm-symbolizer. (authored by mysterymath).

This revision was automatically updated to reflect the committed changes.

mysterymath added a commit: rG565add5a628b: [Debuginfod] Add BUILD_ID syntax to llvm-symbolizer.

Revision Contents

	Path	Size
	llvm/docs/CommandGuide/llvm-symbolizer.rst	55 lines
	llvm/test/tools/llvm-symbolizer/debuginfod.test	27 lines
	llvm/test/tools/llvm-symbolizer/file-prefix.test	4 lines
	llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp	113 lines

Diff 409421

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm-symbolizer - convert addresses into source code locations		llvm-symbolizer - convert addresses into source code locations
==============================================================		==============================================================

.. program:: llvm-symbolizer		.. program:: llvm-symbolizer

SYNOPSIS		SYNOPSIS
--------		--------

:program:`llvm-symbolizer` [options] [addresses...]		:program:`llvm-symbolizer` [options] [addresses...]

DESCRIPTION		DESCRIPTION
-----------		-----------

:program:`llvm-symbolizer` reads object file names and addresses from the		:program:`llvm-symbolizer` reads input names and addresses from the command-line
command-line and prints corresponding source code locations to standard output.		and prints corresponding source code locations to standard output.

If no address is specified on the command-line, it reads the addresses from		If no address is specified on the command-line, it reads the addresses from
standard input. If no object file is specified on the command-line, but		standard input. If no input name is specified on the command-line, but addresses
addresses are, or if at any time an input value is not recognized, the input is		are, or if at any time an input value is not recognized, the input is simply
simply echoed to the output.		echoed to the output.

		Input names can be specified together with the addresses either on standard
		input or as positional arguments on the command-line. By default, input names
		are interpreted as object file paths. However, prefixing a name with
		``BUILDID:`` states that it is a hex build ID rather than a path. This will look
		up the corresponding debug binary. For consistency, prefixing a name with
		``FILE:`` explicitly states that it is an object file path (the default).

A positional argument or standard input value can be preceded by "DATA" or		A positional argument or standard input value can be preceded by "DATA" or
"CODE" to indicate that the address should be symbolized as data or executable		"CODE" to indicate that the address should be symbolized as data or executable
code respectively. If neither is specified, "CODE" is assumed. DATA is		code respectively. If neither is specified, "CODE" is assumed. DATA is
symbolized as address and symbol size rather than line number.		symbolized as address and symbol size rather than line number.

Object files can be specified together with the addresses either on standard
input or as positional arguments on the command-line, following any "DATA" or
"CODE" prefix.

:program:`llvm-symbolizer` parses options from the environment variable		:program:`llvm-symbolizer` parses options from the environment variable
``LLVM_SYMBOLIZER_OPTS`` after parsing options from the command line.		``LLVM_SYMBOLIZER_OPTS`` after parsing options from the command line.
``LLVM_SYMBOLIZER_OPTS`` is primarily useful for supplementing the command-line		``LLVM_SYMBOLIZER_OPTS`` is primarily useful for supplementing the command-line
options when :program:`llvm-symbolizer` is invoked by another program or		options when :program:`llvm-symbolizer` is invoked by another program or
runtime.		runtime.

EXAMPLES		EXAMPLES
--------		--------

[62 lines not shown]	.. code-block:: console

foz		foz
/tmp/./test.h:1:0		/tmp/./test.h:1:0

Example 3 - object specified with address:		Example 3 - object specified with address:

.. code-block:: console		.. code-block:: console

$ llvm-symbolizer "test.elf 0x400490" "inlined.elf 0x400480"		$ llvm-symbolizer "test.elf 0x400490" "FILE:inlined.elf 0x400480"
baz()		baz()
/tmp/test.cpp:11:0		/tmp/test.cpp:11:0

foo()		foo()
/tmp/test.cpp:8:10		/tmp/test.cpp:8:10

$ cat addr2.txt		$ cat addr2.txt
test.elf 0x4004a0		FILE:test.elf 0x4004a0
inlined.elf 0x400480		inlined.elf 0x400480

$ llvm-symbolizer < addr2.txt		$ llvm-symbolizer < addr2.txt
main		main
/tmp/test.cpp:15:0		/tmp/test.cpp:15:0

foo()		foo()
/tmp/test.cpp:8:10		/tmp/test.cpp:8:10

Example 4 - CODE and DATA prefixes:		Example 4 - build ID specified with address:

.. code-block:: console		.. code-block:: console

$ llvm-symbolizer --obj=test.elf "CODE 0x400490" "DATA 0x601028"		$ llvm-symbolizer "BUILDID:123456789abcdef 0x400490" "DATA BUILDID:123456789abcdef 0x601028"
baz()		baz()
/tmp/test.cpp:11:0		/tmp/test.cpp:11:0

bar		bar
6295592 4		6295592 4

$ cat addr3.txt		$ cat addr3.txt
		BUILDID:123456789abcdef 0x400490
		DATA BUILDID:123456789abcdef 0x601028

		$ llvm-symbolizer < addr3.txt
		baz()
		/tmp/test.cpp:11:0

		bar
		6295592 4

		Example 5 - CODE and DATA prefixes:

		.. code-block:: console

		$ llvm-symbolizer --obj=test.elf "CODE 0x400490" "DATA 0x601028"
		baz()
		/tmp/test.cpp:11:0

		bar
		6295592 4

		$ cat addr4.txt
CODE test.elf 0x4004a0		CODE test.elf 0x4004a0
DATA inlined.elf 0x601028		DATA inlined.elf 0x601028

$ llvm-symbolizer < addr3.txt		$ llvm-symbolizer < addr4.txt
main		main
/tmp/test.cpp:15:0		/tmp/test.cpp:15:0

bar		bar
6295592 4		6295592 4

Example 5 - path-style options:		Example 6 - path-style options:

This example uses the same source file as above, but the source file's		This example uses the same source file as above, but the source file's
full path is /tmp/foo/test.cpp and is compiled as follows. The first case		full path is /tmp/foo/test.cpp and is compiled as follows. The first case
shows the default absolute path, the second --basenames, and the third		shows the default absolute path, the second --basenames, and the third
shows --relativenames.		shows --relativenames.

.. code-block:: console		.. code-block:: console

$ pwd		$ pwd
/tmp		/tmp
$ clang -g foo/test.cpp -o test.elf		$ clang -g foo/test.cpp -o test.elf
$ llvm-symbolizer --obj=test.elf 0x4004a0		$ llvm-symbolizer --obj=test.elf 0x4004a0
[303 lines not shown]

llvm/test/tools/llvm-symbolizer/debuginfod.test

[21 lines not shown]

# The symbolizer should call the debuginfod client library, which finds the
# debuginfo placed in the cache, enabling symbolization of the address.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: --obj=%t/addr.exe 0x40054d --debuginfod | \
RUN: FileCheck %s --check-prefix=FOUND
FOUND: {{[/\\]+}}tmp{{[/\\]+}}x.c:14:0

# This should also work if the build ID is provided.
# This should also work if the build ID is provided via flag.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: --build-id=127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d | \
RUN: FileCheck %s --check-prefix=FOUND

# This should also work if the build ID is provided via stdin.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: "BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" | \
RUN: FileCheck %s --check-prefix=FOUND

# CODE should work preceding build ID.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: "CODE BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" | \
RUN: FileCheck %s --check-prefix=FOUND

# The symbolizer shouldn't call the debuginfod library by default with no URLs.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer --print-address \
RUN: --obj=%t/addr.exe 0x40054d | FileCheck %s --check-prefix=NOTFOUND

# The symbolizer shouldn't call the debuginfod library if explicitly disabled.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: --no-debuginfod \
RUN: "BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" | \
RUN: FileCheck %s --check-prefix=NOTHINGFOUND
NOTHINGFOUND: ??
NOTHINGFOUND-NEXT: ??:0:0

# The build ID flag shouldn't be parsed if --obj is given, just like regular filenames.
RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
RUN: --obj=%t/addr.exe \
RUN: "BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d" | \
RUN: FileCheck %s --check-prefix=BUILDIDIGNORED
BUILDIDIGNORED: BUILDID:127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d

llvm/test/tools/llvm-symbolizer/file-prefix.test

This file was added.

				# The FILE prefix acts as a no-op, but it provides consistency with BUILDID.
				RUN: llvm-symbolizer "CODE FILE:%p/Inputs/addr.exe 0x40054d" | \
				RUN: FileCheck %s --check-prefix=FOUND
				FOUND: {{[/\\]+}}tmp{{[/\\]+}}x.c:14:0

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

[99 lines not shown]
enum class OutputStyle { LLVM, GNU, JSON };		enum class OutputStyle { LLVM, GNU, JSON };

enum class Command {		enum class Command {
Code,		Code,
Data,		Data,
Frame,		Frame,
};		};

static bool parseCommand(StringRef BinaryName, ArrayRef<uint8_t> BuildID,		static void enableDebuginfod(LLVMSymbolizer &Symbolizer) {
bool IsAddr2Line, StringRef InputString, Command &Cmd,		static bool IsEnabled = false;
std::string &ModuleName, uint64_t &ModuleOffset) {		if (IsEnabled)
		return;
		IsEnabled = true;
		// Look up symbols using the debuginfod client.
		Symbolizer.addDIFetcher(std::make_unique<DebuginfodDIFetcher>());
		// The HTTPClient must be initialized for use by the debuginfod client.
		HTTPClient::initialize();
		}

		static SmallVector<uint8_t> parseBuildID(StringRef Str) {
		std::string Bytes;
		if (!tryGetFromHex(Str, Bytes))
		return {};
		ArrayRef<uint8_t> BuildID(reinterpret_cast<const uint8_t *>(Bytes.data()),
		Bytes.size());
		return SmallVector<uint8_t>(BuildID.begin(), BuildID.end());
		}

		static bool parseCommand(StringRef BinaryName, bool IsAddr2Line,
		StringRef InputString, Command &Cmd,
		std::string &ModuleName,
		SmallVectorImpl<uint8_t> &BuildID,
		uint64_t &ModuleOffset) {
const char kDelimiters[] = " \n\r";		const char kDelimiters[] = " \n\r";
ModuleName = "";		ModuleName = "";
if (InputString.consume_front("CODE ")) {		if (InputString.consume_front("CODE ")) {
Cmd = Command::Code;		Cmd = Command::Code;
} else if (InputString.consume_front("DATA ")) {		} else if (InputString.consume_front("DATA ")) {
Cmd = Command::Data;		Cmd = Command::Data;
} else if (InputString.consume_front("FRAME ")) {		} else if (InputString.consume_front("FRAME ")) {
Cmd = Command::Frame;		Cmd = Command::Frame;
} else {		} else {
// If no cmd, assume it's CODE.		// If no cmd, assume it's CODE.
Cmd = Command::Code;		Cmd = Command::Code;
}		}
const char *Pos = InputString.data();
		const char *Pos;
// Skip delimiters and parse input filename (if needed).		// Skip delimiters and parse input filename (if needed).
if (BinaryName.empty() && BuildID.empty()) {		if (BinaryName.empty() && BuildID.empty()) {
		bool NameIsBuildID = !InputString.consume_front("FILE:") &&
		InputString.consume_front("BUILDID:");
		Pos = InputString.data();
Pos += strspn(Pos, kDelimiters);		Pos += strspn(Pos, kDelimiters);
if (*Pos == '"' || *Pos == '\'') {		if (*Pos == '"' || *Pos == '\'') {
char Quote = *Pos;		char Quote = *Pos;
Pos++;		Pos++;
const char *End = strchr(Pos, Quote);		const char *End = strchr(Pos, Quote);
if (!End)		if (!End)
return false;		return false;
ModuleName = std::string(Pos, End - Pos);		ModuleName = std::string(Pos, End - Pos);
Pos = End + 1;		Pos = End + 1;
} else {		} else {
int NameLength = strcspn(Pos, kDelimiters);		int NameLength = strcspn(Pos, kDelimiters);
ModuleName = std::string(Pos, NameLength);		ModuleName = std::string(Pos, NameLength);
Pos += NameLength;		Pos += NameLength;
}		}
		if (NameIsBuildID) {
		BuildID = parseBuildID(ModuleName);
		if (BuildID.empty())
		return false;
		ModuleName.clear();
		}
} else {		} else {
		Pos = InputString.data();
ModuleName = BinaryName.str();		ModuleName = BinaryName.str();
}		}
// Skip delimiters and parse module offset.		// Skip delimiters and parse module offset.
Pos += strspn(Pos, kDelimiters);		Pos += strspn(Pos, kDelimiters);
int OffsetLength = strcspn(Pos, kDelimiters);		int OffsetLength = strcspn(Pos, kDelimiters);
StringRef Offset(Pos, OffsetLength);		StringRef Offset(Pos, OffsetLength);
// GNU addr2line assumes the offset is hexadecimal and allows a redundant		// GNU addr2line assumes the offset is hexadecimal and allows a redundant
// "0x" or "0X" prefix; do the same for compatibility.		// "0x" or "0X" prefix; do the same for compatibility.
[39 lines not shown]	void executeCommand(StringRef ModuleName, const T &ModuleSpec, Command Cmd,
} else {		} else {
Expected<DILineInfo> ResOrErr =		Expected<DILineInfo> ResOrErr =
Symbolizer.symbolizeCode(ModuleSpec, Address);		Symbolizer.symbolizeCode(ModuleSpec, Address);
print({ModuleName, Offset}, ResOrErr, Printer);		print({ModuleName, Offset}, ResOrErr, Printer);
}		}
}		}

static void symbolizeInput(const opt::InputArgList &Args,		static void symbolizeInput(const opt::InputArgList &Args,
ArrayRef<uint8_t> BuildID, uint64_t AdjustVMA,		ArrayRef<uint8_t> IncomingBuildID,
bool IsAddr2Line, OutputStyle Style,		uint64_t AdjustVMA, bool IsAddr2Line,
StringRef InputString, LLVMSymbolizer &Symbolizer,		OutputStyle Style, StringRef InputString,
DIPrinter &Printer) {		LLVMSymbolizer &Symbolizer, DIPrinter &Printer) {
Command Cmd;		Command Cmd;
std::string ModuleName;		std::string ModuleName;
		SmallVector<uint8_t> BuildID(IncomingBuildID.begin(), IncomingBuildID.end());
uint64_t Offset = 0;		uint64_t Offset = 0;
if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), BuildID, IsAddr2Line,		if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line,
StringRef(InputString), Cmd, ModuleName, Offset)) {		StringRef(InputString), Cmd, ModuleName, BuildID, Offset)) {
Printer.printInvalidCommand({ModuleName, None}, InputString);		Printer.printInvalidCommand({ModuleName, None}, InputString);
return;		return;
}		}
bool ShouldInline = Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line);		bool ShouldInline = Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line);
if (!BuildID.empty()) {		if (!BuildID.empty()) {
assert(ModuleName.empty());		assert(ModuleName.empty());
		if (!Args.hasArg(OPT_no_debuginfod))
		enableDebuginfod(Symbolizer);
std::string BuildIDStr = toHex(BuildID);		std::string BuildIDStr = toHex(BuildID);
executeCommand(BuildIDStr, BuildID, Cmd, Offset, AdjustVMA, ShouldInline,		executeCommand(BuildIDStr, BuildID, Cmd, Offset, AdjustVMA, ShouldInline,
Style, Symbolizer, Printer);		Style, Symbolizer, Printer);
} else {		} else {
executeCommand(ModuleName, ModuleName, Cmd, Offset, AdjustVMA, ShouldInline,		executeCommand(ModuleName, ModuleName, Cmd, Offset, AdjustVMA, ShouldInline,
Style, Symbolizer, Printer);		Style, Symbolizer, Printer);
}		}
}		}
[57 lines not shown]	static FunctionNameKind decideHowToPrintFunctions(const opt::InputArgList &Args,
if (const opt::Arg *A = Args.getLastArg(OPT_functions_EQ))		if (const opt::Arg *A = Args.getLastArg(OPT_functions_EQ))
return StringSwitch<FunctionNameKind>(A->getValue())		return StringSwitch<FunctionNameKind>(A->getValue())
.Case("none", FunctionNameKind::None)		.Case("none", FunctionNameKind::None)
.Case("short", FunctionNameKind::ShortName)		.Case("short", FunctionNameKind::ShortName)
.Default(FunctionNameKind::LinkageName);		.Default(FunctionNameKind::LinkageName);
return IsAddr2Line ? FunctionNameKind::None : FunctionNameKind::LinkageName;		return IsAddr2Line ? FunctionNameKind::None : FunctionNameKind::LinkageName;
}		}

SmallVector<uint8_t> parseBuildIDArg(const opt::InputArgList &Args, int ID) {		static SmallVector<uint8_t> parseBuildIDArg(const opt::InputArgList &Args,
if (const opt::Arg *A = Args.getLastArg(ID)) {		int ID) {
		const opt::Arg *A = Args.getLastArg(ID);
		if (!A)
		return {};

StringRef V(A->getValue());		StringRef V(A->getValue());
std::string Bytes;		SmallVector<uint8_t> BuildID = parseBuildID(V);
if (!tryGetFromHex(V, Bytes)) {		if (BuildID.empty()) {
errs() << A->getSpelling() + ": expected a build ID, but got '" + V +		errs() << A->getSpelling() + ": expected a build ID, but got '" + V + "'\n";
"'\n";
exit(1);		exit(1);
}		}
ArrayRef<uint8_t> BuildID(reinterpret_cast<const uint8_t *>(Bytes.data()),		return BuildID;
Bytes.size());
return SmallVector<uint8_t>(BuildID.begin(), BuildID.end());
}
return {};
}		}

ExitOnError ExitOnErr;		ExitOnError ExitOnErr;

static bool shouldUseDebuginfodByDefault(ArrayRef<uint8_t> BuildID) {
// If the user explicitly specified a build ID, the usual way to find it is
// debuginfod.
if (!BuildID.empty())
return true;

// A debuginfod lookup could succeed if a HTTP client is available and at
// least one backing URL is configured.
if (HTTPClient::isAvailable() &&
!ExitOnErr(getDefaultDebuginfodUrls()).empty())
return true;

// A debuginfod lookup could also succeed if something were present in the
// cache directory, but it would be surprising to enable debuginfod on this
// basis alone. To use existing caches in an "offline" fashion, the debuginfod
// flag must be set.
return false;
}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
InitLLVM X(argc, argv);		InitLLVM X(argc, argv);
sys::InitializeCOMRAII COM(sys::COMThreadingMode::MultiThreaded);		sys::InitializeCOMRAII COM(sys::COMThreadingMode::MultiThreaded);

bool IsAddr2Line = sys::path::stem(argv[0]).contains("addr2line");		bool IsAddr2Line = sys::path::stem(argv[0]).contains("addr2line");
BumpPtrAllocator A;		BumpPtrAllocator A;
StringSaver Saver(A);		StringSaver Saver(A);
SymbolizerOptTable Tbl;		SymbolizerOptTable Tbl;
[59 lines not shown]	#endif
if (Args.hasArg(OPT_build_id_EQ) && Args.hasArg(OPT_obj_EQ)) {		if (Args.hasArg(OPT_build_id_EQ) && Args.hasArg(OPT_obj_EQ)) {
errs() << "error: cannot specify both --build-id and --obj\n";		errs() << "error: cannot specify both --build-id and --obj\n";
return EXIT_FAILURE;		return EXIT_FAILURE;
}		}
SmallVector<uint8_t> BuildID = parseBuildIDArg(Args, OPT_build_id_EQ);		SmallVector<uint8_t> BuildID = parseBuildIDArg(Args, OPT_build_id_EQ);

LLVMSymbolizer Symbolizer(Opts);		LLVMSymbolizer Symbolizer(Opts);

		// A debuginfod lookup could succeed if a HTTP client is available and at
		// least one backing URL is configured.
		bool ShouldUseDebuginfodByDefault =
		HTTPClient::isAvailable() &&
		!ExitOnErr(getDefaultDebuginfodUrls()).empty();
if (Args.hasFlag(OPT_debuginfod, OPT_no_debuginfod,		if (Args.hasFlag(OPT_debuginfod, OPT_no_debuginfod,
shouldUseDebuginfodByDefault(BuildID))) {		ShouldUseDebuginfodByDefault))
// Look up symbols using the debuginfod client.		enableDebuginfod(Symbolizer);
Symbolizer.addDIFetcher(std::make_unique<DebuginfodDIFetcher>());
// The HTTPClient must be initialized for use by the debuginfod client.
HTTPClient::initialize();
}

std::unique_ptr<DIPrinter> Printer;		std::unique_ptr<DIPrinter> Printer;
if (Style == OutputStyle::GNU)		if (Style == OutputStyle::GNU)
Printer = std::make_unique<GNUPrinter>(outs(), errs(), Config);		Printer = std::make_unique<GNUPrinter>(outs(), errs(), Config);
else if (Style == OutputStyle::JSON)		else if (Style == OutputStyle::JSON)
Printer = std::make_unique<JSONPrinter>(outs(), Config);		Printer = std::make_unique<JSONPrinter>(outs(), Config);
else		else
Printer = std::make_unique<LLVMPrinter>(outs(), errs(), Config);		Printer = std::make_unique<LLVMPrinter>(outs(), errs(), Config);
[25 lines not shown]
