Download Raw Diff

Details

Reviewers

ikudrin
rupprecht
jhenderson

Commits

rG9a709dd2bb45: llvm-addr2line: assume addresses on the command line are hexadecimal rather…

Summary

This matches the behavior of GNU addr2line. We previously treated
hexadecimal addresses as binary if they started with 0b, otherwise as
octal if they started with 0, otherwise as decimal.

This only affects llvm-addr2line; the behavior of llvm-symbolize is
unaffected.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rsmith created this revision.Jan 23 2020, 4:32 PM

Herald added a reviewer: jhenderson. · View Herald TranscriptJan 23 2020, 4:32 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B44783: Diff 240046.Jan 23 2020, 4:40 PM

I wonder if it's worth adding a note the llvm-symbolizer user guide describing the format of the input address? At the moment, it doesn't mention what format the numbers must be in. Probably should be a separate change though, I guess.

llvm/docs/CommandGuide/llvm-addr2line.rst
21	I don't think referring to it as C literals is necessarily fair, since the style is more widely spread than just C. How about saying "instead of deriving their base from their prefix (if present)"?
llvm/test/lit.cfg.py
148	I think this needs reflowing (looks like we're sticking to the 80-column width in this python script).
llvm/test/tools/llvm-symbolizer/input-base.test
2	Could you add a leading '#' character here and for the other comments, to clearly delineate them from the lit and FileCheck directives, please. It makes it easier to read.
10	You need a test-case for '0X' prefixes as well as '0x'.
14	What gets printed instead here? I thought llvm-addr2line's behaviour was the same a llvm-symbolizer and that it prints the input value as-is if the value is not a valid number, so I'm surprised to see this check passes. Does reading from /dev/null work on Windows? (I can try it out if you need me to).
16–17	I think you need to add checks for the "name not found pattern" (off the top of my head I want to say '??'), to show that this is actually a look-up and not simply echoing a rejection of a malformed input address.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
219–226	I do not like the mixed variable naming styles in this function! Given we use both upper and lower-case variable names, let's make this one follow the standard LLVM style (i.e. `Offset`).

LGTM after James' comments too. Thanks for spotting this inconsistency.

llvm/test/tools/llvm-symbolizer/input-base.test
9	Tiny nit: my first time reading through this I read "assumes hex" as "guesses it's hex but falls back in other cases" (e.g. if the `0b` prefix is provided). Something like "requires hex" (like you have in the docs) might be more straightforward? Same with the comment in the code.
14	Does reading from /dev/null work on Windows? (I can try it out if you need me to). Looks like lit handles that: https://github.com/llvm/llvm-project/blob/master/llvm/utils/lit/lit/TestRunner.py#L37

Switch this function to the LLVM variable naming convention, to match the rest of the file.

Herald added a subscriber: MaskRay. · View Herald TranscriptMar 30 2020, 8:20 PM

rsmith marked 2 inline comments as done.Mar 30 2020, 8:21 PM

rsmith added inline comments.

llvm/docs/CommandGuide/llvm-addr2line.rst
21	The major difference I was trying to get across is that unprefixed numbers default to decimal in `llvm-symbolizer` (or octal if there's a `0` prefix) but to hexadecimal in `llvm-addr2line`, and I don't think your alternative really captures that. I've taken another pass over this. (I also changed the bullets to include a noun, since it wasn't really clear whether these bullets were talking about the behavior of `addr2line` or `symbolizer`.)
llvm/test/tools/llvm-symbolizer/input-base.test
10	Added both cases for all the tests.
16–17	Note the "x". That wasn't there in the input address.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
219–226	Every local variable in this function uses `lower_camel_case`, so I followed that. I'll just change them all to follow LLVM convention (as a separate commit).

Harbormaster failed remote builds in B51082: Diff 253775!Mar 30 2020, 9:18 PM

jhenderson added inline comments.Mar 31 2020, 12:47 AM

llvm/docs/CommandGuide/llvm-addr2line.rst
28–38	The rewording changes look good to me, but please put them in a separate commit (feel free to push that bit with no further review).
llvm/test/tools/llvm-symbolizer/input-base.test
20	This line working is surprising to me. FileCheck is supposed to be case sensitive, so it looks to me like llvm-addr2line is consuming `0O1234` as an actual number or something and not treating it as an invalid address.
22	Some of my comments have moved around too much in Phabricator's view to easily see the context, so apologies if this feels like a bit of repetition. This CHECK on its own is insufficient for the cases where the input is the string `0x1234`. In those cases, this CHECK will not distinguish between llvm-addr2line recognising the string as a valid hexadecimal address (and thus being able to use it to do the look up) and not (and thus just echoing it to the output). I think the only ways to tell are to either turn off `-a` (which would cause valid addresses to not appear and invalid values to be printed), or to check for the not-found pattern (to prove that a lookup was performed). I'd recommend the latter.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
219–226	Could you do the variable renaming in a separate pre-requisite patch, please, and then rebase this patch on top of that? That'll make it easier to spot the real differences.

Explicitly check for failed lookup in tests to differentiate between success and errors.
Fix test expectations for 0O prefix, which is not supported by llvm-symbolizer nor llvm-addr2line.

Harbormaster completed remote builds in B51178: Diff 253973.Mar 31 2020, 1:44 PM

I think all the changes are good, but I'd like to see this patch independently of the variable renamings and document rewording before giving final approval to be sure. Please could you create separate commits for those and then rebase this patch on top, showing the diff of this patch only then.

Remove separately-committed cleanup commits.

rsmith added inline comments.Apr 7 2020, 3:58 PM

llvm/docs/CommandGuide/llvm-addr2line.rst
28–38	They are already in a separate commit; sorry for not mentioning that. (FYI, you can see the commits in the "Commits" tab in phabricator.) I went ahead and committed that.
llvm/test/tools/llvm-symbolizer/input-base.test
20	Nice catch =) Sorry, I made this change at the last minute before uploading and forgot to rerun the test, which does indeed fail as you expected. And it turns out that `llvm-symbolizer` doesn't accept the `0O` (zero, capital o) prefix either. Test updated.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
219–226	This was already split out into a separate commit, but... it doesn't look like Phabricator lets you look at the contents of the individual commits in a review. That seems to make the ability to upload a series of commits to Phabricator as a single review... kind of useless. I've gone ahead and submitted the renaming since I think it's obvious and uncontroversial.

Harbormaster completed remote builds in B52257: Diff 255843.Apr 7 2020, 4:55 PM

This was already split out into a separate commit, but... it doesn't look like Phabricator lets you look at the contents of the individual commits in a review. That seems to make the ability to upload a series of commits to Phabricator as a single review... kind of useless. I've gone ahead and submitted the renaming since I think it's obvious and uncontroversial.

So I think the design for Phabricator is intended to be one review per commit. I vaguely recall someone saying this was also the preferred style for reviews in LLVM in general, in one of the Github PRs versus Phabricator threads. Certainly, I find it easiest to work in that way. I don't know if arcanist has any way of linking things like this (I don't use it), but if you put "Depends on DXXXX" in your patch description, it will create a commit "Stack" tab which shows the commit thread (see D77308 for an example of the Stack tab). You can also do it manually using the "Edit Related Revisions" UI option.

In D73306#1968815, @jhenderson wrote:

So I think the design for Phabricator is intended to be one review per commit. I vaguely recall someone saying this was also the preferred style for reviews in LLVM in general, in one of the Github PRs versus Phabricator threads. Certainly, I find it easiest to work in that way. I don't know if arcanist has any way of linking things like this (I don't use it), but if you put "Depends on DXXXX" in your patch description, it will create a commit "Stack" tab which shows the commit thread (see D77308 for an example of the Stack tab). You can also do it manually using the "Edit Related Revisions" UI option.

I found an article describing the workflow I want, and some folks at Mozilla are working on automating it. But yeah, it seems like doing the phabricator review management yourself manually is the only way at the moment :-( Ah well, and thanks for the information. In any case, I think this is the patch you wanted to review, with the other bits already committed.

LGTM. Sorry, should have posted that on the last comment!

This revision is now accepted and ready to land.Apr 9 2020, 12:24 AM

Closed by commit rG9a709dd2bb45: llvm-addr2line: assume addresses on the command line are hexadecimal rather… (authored by rsmith). · Explain WhyApr 16 2020, 4:44 PM

This revision was automatically updated to reflect the committed changes.

Diff 258203

llvm/docs/CommandGuide/llvm-addr2line.rst

	Show All 11 Lines
	-----------			-----------

	:program:`llvm-addr2line` is an alias for the :manpage:`llvm-symbolizer(1)`			:program:`llvm-addr2line` is an alias for the :manpage:`llvm-symbolizer(1)`
	tool with different defaults. The goal is to make it a drop-in replacement for			tool with different defaults. The goal is to make it a drop-in replacement for
	GNU's :program:`addr2line`.			GNU's :program:`addr2line`.

	Here are some of those differences:			Here are some of those differences:

				- ``llvm-addr2line`` interprets all addresses as hexadecimal and ignores an
				optional ``0x`` prefix, whereas ``llvm-symbolizer`` attempts to determine
				jhendersonUnsubmitted Done Reply Inline Actions I don't think referring to it as C literals is necessarily fair, since the style is more widely spread than just C. How about saying "instead of deriving their base from their prefix (if present)"? jhenderson: I don't think referring to it as C literals is necessarily fair, since the style is more widely…
				rsmithAuthorUnsubmitted Done Reply Inline Actions The major difference I was trying to get across is that unprefixed numbers default to decimal in `llvm-symbolizer` (or octal if there's a `0` prefix) but to hexadecimal in `llvm-addr2line`, and I don't think your alternative really captures that. I've taken another pass over this. (I also changed the bullets to include a noun, since it wasn't really clear whether these bullets were talking about the behavior of `addr2line` or `symbolizer`.) rsmith: The major difference I was trying to get across is that unprefixed numbers default to decimal…
				the base from the literal's prefix and defaults to decimal if there is no
				prefix.

	- ``llvm-addr2line`` defaults not to print function names. Use `-f`_ to enable			- ``llvm-addr2line`` defaults not to print function names. Use `-f`_ to enable
	that.			that.

	- ``llvm-addr2line`` defaults not to demangle function names. Use `-C`_ to			- ``llvm-addr2line`` defaults not to demangle function names. Use `-C`_ to
	switch the demangling on.			switch the demangling on.

	- ``llvm-addr2line`` defaults not to print inlined frames. Use `-i`_ to show			- ``llvm-addr2line`` defaults not to print inlined frames. Use `-i`_ to show
	inlined frames for a source code location in an inlined function.			inlined frames for a source code location in an inlined function.

	- ``llvm-addr2line`` uses `--output-style=GNU`_ by default.			- ``llvm-addr2line`` uses `--output-style=GNU`_ by default.

	- ``llvm-addr2line`` parses options from the environment variable			- ``llvm-addr2line`` parses options from the environment variable
	``LLVM_ADDR2LINE_OPTS`` instead of from ``LLVM_SYMBOLIZER_OPTS``.			``LLVM_ADDR2LINE_OPTS`` instead of from ``LLVM_SYMBOLIZER_OPTS``.

				jhendersonUnsubmitted Done Reply Inline Actions The rewording changes look good to me, but please put them in a separate commit (feel free to push that bit with no further review). jhenderson: The rewording changes look good to me, but please put them in a separate commit (feel free to…
				rsmithAuthorUnsubmitted Done Reply Inline Actions They are already in a separate commit; sorry for not mentioning that. (FYI, you can see the commits in the "Commits" tab in phabricator.) I went ahead and committed that. rsmith: They are already in a separate commit; sorry for not mentioning that. (FYI, you can see the…
	SEE ALSO			SEE ALSO
	--------			--------

	:manpage:`llvm-symbolizer(1)`			:manpage:`llvm-symbolizer(1)`

	.. _-f: llvm-symbolizer.html#llvm-symbolizer-opt-f			.. _-f: llvm-symbolizer.html#llvm-symbolizer-opt-f
	.. _-C: llvm-symbolizer.html#llvm-symbolizer-opt-c			.. _-C: llvm-symbolizer.html#llvm-symbolizer-opt-c
	.. _-i: llvm-symbolizer.html#llvm-symbolizer-opt-i			.. _-i: llvm-symbolizer.html#llvm-symbolizer-opt-i
	.. _--output-style=GNU: llvm-symbolizer.html#llvm-symbolizer-opt-output-style			.. _--output-style=GNU: llvm-symbolizer.html#llvm-symbolizer-opt-output-style

llvm/test/lit.cfg.py

Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	tools = [
ToolSubst('%opt-viewer', opt_viewer_cmd),		ToolSubst('%opt-viewer', opt_viewer_cmd),
ToolSubst('%llvm-objcopy', FindTool('llvm-objcopy')),		ToolSubst('%llvm-objcopy', FindTool('llvm-objcopy')),
ToolSubst('%llvm-strip', FindTool('llvm-strip')),		ToolSubst('%llvm-strip', FindTool('llvm-strip')),
ToolSubst('%llvm-install-name-tool', FindTool('llvm-install-name-tool')),		ToolSubst('%llvm-install-name-tool', FindTool('llvm-install-name-tool')),
]		]

# FIXME: Why do we have both `lli` and `%lli` that do slightly different things?		# FIXME: Why do we have both `lli` and `%lli` that do slightly different things?
tools.extend([		tools.extend([
'dsymutil', 'lli', 'lli-child-target', 'llvm-ar', 'llvm-as',		'dsymutil', 'lli', 'lli-child-target', 'llvm-ar', 'llvm-as',
		jhendersonUnsubmitted Done Reply Inline Actions I think this needs reflowing (looks like we're sticking to the 80-column width in this python script). jhenderson: I think this needs reflowing (looks like we're sticking to the 80-column width in this python…
'llvm-bcanalyzer', 'llvm-config', 'llvm-cov', 'llvm-cxxdump', 'llvm-cvtres',		'llvm-addr2line', 'llvm-bcanalyzer', 'llvm-config', 'llvm-cov',
'llvm-diff', 'llvm-dis', 'llvm-dwarfdump', 'llvm-exegesis', 'llvm-extract',		'llvm-cxxdump', 'llvm-cvtres', 'llvm-diff', 'llvm-dis', 'llvm-dwarfdump',
'llvm-isel-fuzzer', 'llvm-ifs', 'llvm-install-name-tool',		'llvm-exegesis', 'llvm-extract', 'llvm-isel-fuzzer', 'llvm-ifs',
'llvm-jitlink', 'llvm-opt-fuzzer', 'llvm-lib',		'llvm-install-name-tool', 'llvm-jitlink', 'llvm-opt-fuzzer', 'llvm-lib',
'llvm-link', 'llvm-lto', 'llvm-lto2', 'llvm-mc', 'llvm-mca',		'llvm-link', 'llvm-lto', 'llvm-lto2', 'llvm-mc', 'llvm-mca',
'llvm-modextract', 'llvm-nm', 'llvm-objcopy', 'llvm-objdump',		'llvm-modextract', 'llvm-nm', 'llvm-objcopy', 'llvm-objdump',
'llvm-pdbutil', 'llvm-profdata', 'llvm-ranlib', 'llvm-rc', 'llvm-readelf',		'llvm-pdbutil', 'llvm-profdata', 'llvm-ranlib', 'llvm-rc', 'llvm-readelf',
'llvm-readobj', 'llvm-rtdyld', 'llvm-size', 'llvm-split', 'llvm-strings',		'llvm-readobj', 'llvm-rtdyld', 'llvm-size', 'llvm-split', 'llvm-strings',
'llvm-strip', 'llvm-tblgen', 'llvm-undname', 'llvm-c-test', 'llvm-cxxfilt',		'llvm-strip', 'llvm-tblgen', 'llvm-undname', 'llvm-c-test', 'llvm-cxxfilt',
'llvm-xray', 'yaml2obj', 'obj2yaml', 'yaml-bench', 'verify-uselistorder',		'llvm-xray', 'yaml2obj', 'obj2yaml', 'yaml-bench', 'verify-uselistorder',
'bugpoint', 'llc', 'llvm-symbolizer', 'opt', 'sancov', 'sanstats'])		'bugpoint', 'llc', 'llvm-symbolizer', 'opt', 'sancov', 'sanstats'])

▲ Show 20 Lines • Show All 196 Lines • Show Last 20 Lines

llvm/test/tools/llvm-symbolizer/input-base.test

This file was added.

				# llvm-symbolizer infers the number base from the form of the address.
				RUN: llvm-symbolizer -e /dev/null -a 0x1234 \| FileCheck %s
				jhendersonUnsubmitted Done Reply Inline Actions Could you add a leading '#' character here and for the other comments, to clearly delineate them from the lit and FileCheck directives, please. It makes it easier to read. jhenderson: Could you add a leading '#' character here and for the other comments, to clearly delineate…
				RUN: llvm-symbolizer -e /dev/null -a 0X1234 \| FileCheck %s
				RUN: llvm-symbolizer -e /dev/null -a 4660 \| FileCheck %s
				RUN: llvm-symbolizer -e /dev/null -a 011064 \| FileCheck %s
				RUN: llvm-symbolizer -e /dev/null -a 0b1001000110100 \| FileCheck %s
				RUN: llvm-symbolizer -e /dev/null -a 0B1001000110100 \| FileCheck %s
				RUN: llvm-symbolizer -e /dev/null -a 0o11064 \| FileCheck %s

				rupprechtUnsubmitted Done Reply Inline Actions Tiny nit: my first time reading through this I read "assumes hex" as "guesses it's hex but falls back in other cases" (e.g. if the `0b` prefix is provided). Something like "requires hex" (like you have in the docs) might be more straightforward? Same with the comment in the code. rupprecht: Tiny nit: my first time reading through this I read "assumes hex" as "guesses it's hex but…
				# llvm-symbolizer / StringRef::getAsInteger only accepts the 0o prefix in lowercase.
				jhendersonUnsubmitted Done Reply Inline Actions You need a test-case for '0X' prefixes as well as '0x'. jhenderson: You need a test-case for '0X' prefixes as well as '0x'.
				rsmithAuthorUnsubmitted Done Reply Inline Actions Added both cases for all the tests. rsmith: Added both cases for all the tests.
				RUN: llvm-symbolizer -e /dev/null -a 0O1234 \| FileCheck %s --check-prefix=INVALID-NOT-OCTAL-UPPER

				# llvm-addr2line always requires hexadecimal, but accepts an optional 0x prefix.
				RUN: llvm-addr2line -e /dev/null -a 0x1234 \| FileCheck %s
				jhendersonUnsubmitted Done Reply Inline Actions What gets printed instead here? I thought llvm-addr2line's behaviour was the same a llvm-symbolizer and that it prints the input value as-is if the value is not a valid number, so I'm surprised to see this check passes. Does reading from /dev/null work on Windows? (I can try it out if you need me to). jhenderson: What gets printed instead here? I thought llvm-addr2line's behaviour was the same a llvm…
				rupprechtUnsubmitted Done Reply Inline Actions Does reading from /dev/null work on Windows? (I can try it out if you need me to). Looks like lit handles that: https://github.com/llvm/llvm-project/blob/master/llvm/utils/lit/lit/TestRunner.py#L37 rupprecht: > Does reading from /dev/null work on Windows? (I can try it out if you need me to). Looks…
				RUN: llvm-addr2line -e /dev/null -a 0X1234 \| FileCheck %s
				RUN: llvm-addr2line -e /dev/null -a 1234 \| FileCheck %s
				RUN: llvm-addr2line -e /dev/null -a 01234 \| FileCheck %s
				jhendersonUnsubmitted Done Reply Inline Actions I think you need to add checks for the "name not found pattern" (off the top of my head I want to say '??'), to show that this is actually a look-up and not simply echoing a rejection of a malformed input address. jhenderson: I think you need to add checks for the "name not found pattern" (off the top of my head I want…
				rsmithAuthorUnsubmitted Done Reply Inline Actions Note the "x". That wasn't there in the input address. rsmith: Note the "x". That wasn't there in the input address.
				RUN: llvm-addr2line -e /dev/null -a 0b1010 \| FileCheck %s --check-prefix=HEXADECIMAL-NOT-BINARY
				RUN: llvm-addr2line -e /dev/null -a 0B1010 \| FileCheck %s --check-prefix=HEXADECIMAL-NOT-BINARY
				RUN: llvm-addr2line -e /dev/null -a 0o1234 \| FileCheck %s --check-prefix=INVALID-NOT-OCTAL-LOWER
				jhendersonUnsubmitted Done Reply Inline Actions This line working is surprising to me. FileCheck is supposed to be case sensitive, so it looks to me like llvm-addr2line is consuming `0O1234` as an actual number or something and not treating it as an invalid address. jhenderson: This line working is surprising to me. FileCheck is supposed to be case sensitive, so it looks…
				rsmithAuthorUnsubmitted Done Reply Inline Actions Nice catch =) Sorry, I made this change at the last minute before uploading and forgot to rerun the test, which does indeed fail as you expected. And it turns out that `llvm-symbolizer` doesn't accept the `0O` (zero, capital o) prefix either. Test updated. rsmith: Nice catch =) Sorry, I made this change at the last minute before uploading and forgot to rerun…
				RUN: llvm-addr2line -e /dev/null -a 0O1234 \| FileCheck %s --check-prefix=INVALID-NOT-OCTAL-UPPER

				jhendersonUnsubmitted Done Reply Inline Actions Some of my comments have moved around too much in Phabricator's view to easily see the context, so apologies if this feels like a bit of repetition. This CHECK on its own is insufficient for the cases where the input is the string `0x1234`. In those cases, this CHECK will not distinguish between llvm-addr2line recognising the string as a valid hexadecimal address (and thus being able to use it to do the look up) and not (and thus just echoing it to the output). I think the only ways to tell are to either turn off `-a` (which would cause valid addresses to not appear and invalid values to be printed), or to check for the not-found pattern (to prove that a lookup was performed). I'd recommend the latter. jhenderson: Some of my comments have moved around too much in Phabricator's view to easily see the context…
				CHECK: 0x1234
				CHECK-NEXT: ??

				HEXADECIMAL-NOT-BINARY: 0xb1010
				HEXADECIMAL-NOT-BINARY: ??

				INVALID-NOT-OCTAL-LOWER: 0o1234
				INVALID-NOT-OCTAL-LOWER-NOT: ??

				INVALID-NOT-OCTAL-UPPER: 0O1234
				INVALID-NOT-OCTAL-UPPER-NOT: ??

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

Show First 20 Lines • Show All 175 Lines • ▼ Show 20 Lines
}		}

enum class Command {		enum class Command {
Code,		Code,
Data,		Data,
Frame,		Frame,
};		};

static bool parseCommand(StringRef InputString, Command &Cmd,		static bool parseCommand(bool IsAddr2Line, StringRef InputString, Command &Cmd,
std::string &ModuleName, uint64_t &ModuleOffset) {		std::string &ModuleName, uint64_t &ModuleOffset) {
const char kDelimiters[] = " \n\r";		const char kDelimiters[] = " \n\r";
ModuleName = "";		ModuleName = "";
if (InputString.consume_front("CODE ")) {		if (InputString.consume_front("CODE ")) {
Cmd = Command::Code;		Cmd = Command::Code;
} else if (InputString.consume_front("DATA ")) {		} else if (InputString.consume_front("DATA ")) {
Cmd = Command::Data;		Cmd = Command::Data;
} else if (InputString.consume_front("FRAME ")) {		} else if (InputString.consume_front("FRAME ")) {
Show All 18 Lines	if (Pos == '"' \|\| Pos == '\'') {
int NameLength = strcspn(Pos, kDelimiters);		int NameLength = strcspn(Pos, kDelimiters);
ModuleName = std::string(Pos, NameLength);		ModuleName = std::string(Pos, NameLength);
Pos += NameLength;		Pos += NameLength;
}		}
} else {		} else {
ModuleName = ClBinaryName;		ModuleName = ClBinaryName;
}		}
// Skip delimiters and parse module offset.		// Skip delimiters and parse module offset.
Pos += strspn(Pos, kDelimiters);		Pos += strspn(Pos, kDelimiters);
int OffsetLength = strcspn(Pos, kDelimiters);		int OffsetLength = strcspn(Pos, kDelimiters);
return !StringRef(Pos, OffsetLength).getAsInteger(0, ModuleOffset);		StringRef Offset(Pos, OffsetLength);
		// GNU addr2line assumes the offset is hexadecimal and allows a redundant
		// "0x" or "0X" prefix; do the same for compatibility.
		if (IsAddr2Line)
		Offset.consume_front("0x") \|\| Offset.consume_front("0X");
		return !Offset.getAsInteger(IsAddr2Line ? 16 : 0, ModuleOffset);
		jhendersonUnsubmitted Done Reply Inline Actions I do not like the mixed variable naming styles in this function! Given we use both upper and lower-case variable names, let's make this one follow the standard LLVM style (i.e. `Offset`). jhenderson: I do not like the mixed variable naming styles in this function! Given we use both upper and…
		rsmithAuthorUnsubmitted Done Reply Inline Actions Every local variable in this function uses `lower_camel_case`, so I followed that. I'll just change them all to follow LLVM convention (as a separate commit). rsmith: Every local variable in this function uses `lower_camel_case`, so I followed that. I'll just…
		jhendersonUnsubmitted Done Reply Inline Actions Could you do the variable renaming in a separate pre-requisite patch, please, and then rebase this patch on top of that? That'll make it easier to spot the real differences. jhenderson: Could you do the variable renaming in a separate pre-requisite patch, please, and then rebase…
		rsmithAuthorUnsubmitted Done Reply Inline Actions This was already split out into a separate commit, but... it doesn't look like Phabricator lets you look at the contents of the individual commits in a review. That seems to make the ability to upload a series of commits to Phabricator as a single review... kind of useless. I've gone ahead and submitted the renaming since I think it's obvious and uncontroversial. rsmith: This was already split out into a separate commit, but... it doesn't look like Phabricator lets…
}		}

static void symbolizeInput(StringRef InputString, LLVMSymbolizer &Symbolizer,		static void symbolizeInput(bool IsAddr2Line, StringRef InputString,
DIPrinter &Printer) {		LLVMSymbolizer &Symbolizer, DIPrinter &Printer) {
Command Cmd;		Command Cmd;
std::string ModuleName;		std::string ModuleName;
uint64_t Offset = 0;		uint64_t Offset = 0;
if (!parseCommand(StringRef(InputString), Cmd, ModuleName, Offset)) {		if (!parseCommand(IsAddr2Line, StringRef(InputString), Cmd, ModuleName,
		Offset)) {
outs() << InputString << "\n";		outs() << InputString << "\n";
return;		return;
}		}

if (ClPrintAddress) {		if (ClPrintAddress) {
outs() << "0x";		outs() << "0x";
outs().write_hex(Offset);		outs().write_hex(Offset);
StringRef Delimiter = ClPrettyPrint ? ": " : "\n";		StringRef Delimiter = ClPrettyPrint ? ": " : "\n";
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	if (ClInputAddresses.empty()) {

while (fgets(InputString, sizeof(InputString), stdin)) {		while (fgets(InputString, sizeof(InputString), stdin)) {
// Strip newline characters.		// Strip newline characters.
std::string StrippedInputString(InputString);		std::string StrippedInputString(InputString);
StrippedInputString.erase(		StrippedInputString.erase(
std::remove_if(StrippedInputString.begin(), StrippedInputString.end(),		std::remove_if(StrippedInputString.begin(), StrippedInputString.end(),
[](char c) { return c == '\r' \|\| c == '\n'; }),		[](char c) { return c == '\r' \|\| c == '\n'; }),
StrippedInputString.end());		StrippedInputString.end());
symbolizeInput(StrippedInputString, Symbolizer, Printer);		symbolizeInput(IsAddr2Line, StrippedInputString, Symbolizer, Printer);
outs().flush();		outs().flush();
}		}
} else {		} else {
for (StringRef Address : ClInputAddresses)		for (StringRef Address : ClInputAddresses)
symbolizeInput(Address, Symbolizer, Printer);		symbolizeInput(IsAddr2Line, Address, Symbolizer, Printer);
}		}

return 0;		return 0;
}		}

This is an archive of the discontinued LLVM Phabricator instance.

llvm-addr2line: assume addresses on the command line are hexadecimal rather than attempting to guess the base based on the form of the number.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 258203

llvm/docs/CommandGuide/llvm-addr2line.rst

llvm/test/lit.cfg.py

llvm/test/tools/llvm-symbolizer/input-base.test

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

This is an archive of the discontinued LLVM Phabricator instance.

llvm-addr2line: assume addresses on the command line are hexadecimal rather than attempting to guess the base based on the form of the number.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 258203

llvm/docs/CommandGuide/llvm-addr2line.rst

llvm/test/lit.cfg.py

llvm/test/tools/llvm-symbolizer/input-base.test

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

llvm-addr2line: assume addresses on the command line are hexadecimal rather than attempting to guess the base based on the form of the number.
ClosedPublic