This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/CommandGuide/
-
CommandGuide/
8/9
llvm-symbolizer.rst
-
include/llvm/DebugInfo/Symbolize/
-
llvm/
-
DebugInfo/
-
Symbolize/
6/18
DIPrinter.h
-
lib/DebugInfo/Symbolize/
-
DebugInfo/
-
Symbolize/
38/58
DIPrinter.cpp
-
test/tools/llvm-symbolizer/
-
tools/
-
llvm-symbolizer/
19/27
output-style-json-code.test
5/5
output-style-json-data.test
8/14
output-style-json-frame.test
-
tools/llvm-symbolizer/
-
llvm-symbolizer/
-
Opts.td
11/17
llvm-symbolizer.cpp

Differential D96883

Add support for JSON output style to llvm-symbolizer
ClosedPublic

Authored by aorlov on Feb 17 2021, 10:58 AM.

Download Raw Diff

Details

Reviewers

MaskRay
dblaikie
grimar
jhenderson
jdoerfert

Commits

rG05d1ae4e18fa: * Add support for JSON output style to llvm-symbolizer

Summary

This patch adds JSON output style to llvm-symbolizer to better support CLI automation by providing a machine readable output.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	30 ms	x64 debian > MLIR.Dialect/Linalg::tile-and-distribute.mlir
	50 ms	x64 windows > MLIR.Dialect/Linalg::tile-and-distribute.mlir

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

aorlov marked 2 inline comments as done.Mar 18 2021, 4:09 AM

aorlov added inline comments.

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
113–117	I'm using the exact type that symbolizeFrame() returns.
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
161	Please note JSON allows 53-bits numbers. So we print huge numbers as strings "0x...".
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
161	We need the error code for the clear logic while error handling. The error message may be localized, etc. DIPrinter must only print the data but do not handle errors. It is impossible to get the error code from the Error without handling. Initially I have used ErrorInfoBase instead to pass the error information to DIPrinter. Now DICommon contains ErrorCode (it can be used as "success" flag too). The error message is stored in DICommon<std::string>.Result only in case of error.

aorlov updated this revision to Diff 331515.Mar 18 2021, 4:14 AM

aorlov marked an inline comment as done.

Harbormaster completed remote builds in B94424: Diff 331509.Mar 18 2021, 4:29 AM

Harbormaster completed remote builds in B94429: Diff 331515.Mar 18 2021, 4:53 AM

aorlov updated this revision to Diff 331542.Mar 18 2021, 6:33 AM

Harbormaster completed remote builds in B94446: Diff 331542.Mar 18 2021, 7:18 AM

The refactoring is done.
Thanks for reviewing!

To separate concerns, it would be a good idea to put the DIPrinter refactoring in a separate prerequisite patch, that the JSON implementation patch depends on.

I'll probably have loads more comments once that is done, but it'll be easier to sift through them once it is.

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
31	This isn't a good type name. What is a "Common"? Would it make more sense to pass the common data as a separate argument to the non-common `Result`?
54	Perhaps `printError` is good, because the type signature in its current form doesn't make it clear that this is for error reporting.
57	We should probably have `DIPrinterLLVM` and `DIPrinterGNU`. The two could largely share functionality under the hood, but I think it is a cleaner interface than one implementation of DIPrinter controlling all its output style, and the other having to look at another variable to choose between styles.
119	Make sure to run clang-format on all your new code.
135–136	Use `StringRef`, not `const std::string &`.
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
203	Delete this. We don't use arbitrary comment markers to divide up files in LLVM coding style.
207	You seem to have ignored my earlier comments re. printing everything?
219

Please review the separate DIPrinter refactoring patch here https://reviews.llvm.org/D98994

Thanks for the other patch. I'll take a look at it shortly. Could you rebase this patch on that one, please, so that it's easier to see what the adding of a new output style will entail, once the refactoring has been done?

Also, if you add "Depends on DXXXXXX" (where XXXXXX is the number from the patch URL) to this patch description, it will automatically do some Phabricator magic to allow people reviewing this to see that there are other prerequisite patches.

aorlov updated this revision to Diff 333102.Mar 24 2021, 1:08 PM

aorlov edited the summary of this revision. (Show Details)

aorlov added a parent revision: D98994: NFC. Refactored DIPrinter for better support of new print styles..

Harbormaster completed remote builds in B95566: Diff 333102.Mar 24 2021, 9:22 PM

aorlov updated this revision to Diff 335282.Apr 5 2021, 9:50 AM

aorlov edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B97143: Diff 335282.Apr 5 2021, 10:34 AM

aorlov updated this revision to Diff 335455.Apr 6 2021, 3:22 AM

Harbormaster completed remote builds in B97263: Diff 335455.Apr 6 2021, 4:10 AM

Ping.

Note the build status is red because of a problem not related to this patch.

aorlov updated this revision to Diff 336228.Apr 8 2021, 2:12 PM

Harbormaster completed remote builds in B97818: Diff 336228.Apr 8 2021, 2:16 PM

aorlov updated this revision to Diff 336260.Apr 8 2021, 4:24 PM

Harbormaster completed remote builds in B97842: Diff 336260.Apr 8 2021, 4:24 PM

aorlov updated this revision to Diff 336271.Apr 8 2021, 4:48 PM

Harbormaster completed remote builds in B97847: Diff 336271.Apr 8 2021, 5:38 PM

aorlov updated this revision to Diff 336539.Apr 9 2021, 12:06 PM

Harbormaster completed remote builds in B98051: Diff 336539.Apr 9 2021, 1:35 PM

Ping.

One more ping.

Apologies for disappearing @aorlov - I was off work for over two weeks, and only started back today. I'm working through a big backlog, and will hopefully get to look at this patch in the next day or two.

I've not looked at the testing yet. Here are comments on the code to keep you going though.

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
226	`static`
228	`static`
233	I suspect from a user's perspective they won't expect to see an `Error` attribute if there's no error.
234	What is `ErrorCode` supposed to represent? How will a user benefit from it beyond the information provided in the message?
252	This is intended to be machine readable. "Noise" in the output isn't an issue. In fact, as previously mentioned, not having these and other attributes is actively harmful to the user experience, as it makes it harder to write a parser that consumes this JSON. In the case where you have a BadString output, I'd just print an empty string. Example: { "Source" : "", "FunctionName" : "", ... , "Line" : 0, "Column" : 0, "Discriminator" : 0 }
274	What's the point of the new line? Same goes elsewhere where you are adding new lines. If the output is intended to be machine readable, there is no need for the new lines.
325	Same as above - just print everything.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
158–162	I missed this in the other review. We don't need to make an `ErrorInfo` here at all. Just pass the `InputString` in instead. That will simplify both the call-site and the `printInvalidCommand` function.
361–364	I would make it always a list, not just for multiple input addresses. Also, shouldn't this be using the json stream arrayBegin/End methods?
367	There's no need for this outer if.
368–371	How about: StringRef Sep = ""; for (StringRef Address : InputAddresses) { outs() << Sep; Sep = ","; symbolizeInput(Args, AdjustVMA, IsAddr2Line, Style, Address, Symbolizer, *Printer); } This may be moot however, if you are using the JSON stream to use the proper JSON array methods.

aorlov updated this revision to Diff 339053.Apr 20 2021, 4:53 PM

I do not think we should always include everything and anything into JSON. There is nothing wrong with skipping parameters with unknown values, not applicable data and such.

For example DILocal contains Optional<int64_t> FrameOffset. In JS it would be declared as FrameOffset?: number; and handled natively.

If FrameOffset is not specified we cannot print out the value, any number is a valid offset, and a non-number string would confuse the parses much more than just the optional field which is skipped when N/A.

Besides, our customers are happy with the proposed JSON and do not have any problem parsing optional data.

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
233	Error: {Code: 0} – is a standard way to say `success`. And it is a bad idea to omit Error if it should be checked first.
234	The `ErrorCode` is more important than a message when automating the error handling. The error message may be localized, depend on OS, etc.
252	Ok, but I still omit empty Error Message and FrameOffset.
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
368–371	Did you forget about different commas in GNU/LLVM output style? I do not want any functional change in that area as a part of this patch. I have added groupBegin() and groupEnd() to DIPrinter interface and moved all logic to JSONPrinter implementation.

Harbormaster completed remote builds in B99856: Diff 339053.Apr 20 2021, 7:06 PM

jhenderson added inline comments.Apr 21 2021, 1:28 AM

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
233	Right, but basically the behaviour of checking whether "Error" exists is no more complicated in most languages than checking whether it has value zero. I would kind of think there would be two possible JSON objects produced per query (possibly three if we handle invalid commands differently to other errors for JSON output): {"Error":"some message/code/whatever"} {"Source":"some path",...} Where the response had an error, it is unlikely the other parameters can be relied upon in any meaningful way, so it's probably better to omit them than potentially cause confusion. The pseudo-python logic for this might look something like: response = json.load(output) if 'Error' in response: handleError(response) else: handleNormalResponse(response)
234	The error codes contained in `llvm::Error` and `llvm::Expected` are quite often somewhat arbitrary, and inconsistent. It won't be possible to safely rely on these codes for any useful automated processing. Furthermore, some Errors could contain `inconvertibleErrorCode` which will cause a problem if these end up back here. What's the use-case for handling different error kinds differently? I'm not saying there isn't a motivation for that, I'm just trying to understand how you plan to use it. If you have no such plan, it doesn't make sense to add additional logic to distinguish errors by code as well as message.
252	That seems reasonable, thanks for the explanation.
268	I didn't think of this earlier, but it makes a lot of sense to have the pretty printing form be more human readable, with indentation and new lines as appropriate. Thanks!
284	Did you consider having a single `json::OStream` as a member of the JSONPrinter class, so that it only needs constructing (with the `Pretty` check) in one place?
llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp
368–371	Thanks, yes, I'd forgotten this was the higher-level printer area. "group" doesn't obviously mean anything to me. Could you consider renaming it to something like "listBegin" etc? I think that more clearly indicates what you're doing.

aorlov updated this revision to Diff 339258.Apr 21 2021, 8:50 AM

aorlov marked an inline comment as done.Apr 21 2021, 8:57 AM

aorlov added inline comments.

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
284	Note there is no way to reset `json::OStream` context, so we cannot reuse the same instance to print 2 or more objects while processing stdin. I have moved `json::OStream` to printJSON(json::Value).

Harbormaster completed remote builds in B100003: Diff 339258.Apr 21 2021, 9:57 AM

I disagree and still think the ErrorCode is needed for the proper error handling, even if it is arbitrary and inconsistent, which is the usual case everywhere anyway.
I see your point, though, and did what you suggested to unblock this patch and get it committed.

Harbormaster completed remote builds in B100045: Diff 339311.Apr 21 2021, 12:06 PM

I've reviewed the test cases more thoroughly today. I think you need test cases where the addresses are specified on the command-line rather than via stdin, because there's a behaviour difference (objects are in list or not as the case may be).

llvm/docs/CommandGuide/llvm-symbolizer.rst
256	Perhaps worth briefly mentioning the behaviour changes for the following cases: Stdin versus addresses on command-line (i.e. separate individual objects versus list of objects) `--pretty-print` versus no `--pretty-print`. Optionally, this could be under the `--pretty-print` option discussion. I don't think you need concrete examples of the differences. Just a one/two sentence description.
279	Maybe use the `--pretty-print` option to make this example a bit more readable?
llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
124	I guess it might be helpful to know what this list is of (e.g. `ObjectList` or whatever).
llvm/test/tools/llvm-symbolizer/output-style-json-code.test
4–10	My understanding is that the `--no-inlines` option has no impact on the output here, so I think you can drop one of these cases?
12	Rather than use `{{.*}}/no-file.exe` here and similar situations for paths below, I'd recommend using FileCheck's `-D` option to ensure the correct path is printed, i.e: # RUN: FileCheck %s ... -DFILE=%p/no-file.exe # NO-FILE: ... "ModuleName":"[[FILE]]" ...
16	Maybe rather than `NOT-FOUND-1` and `NOT-FOUND-2`, it would be more self-descriptive using `NOT-FOUND-NOINLINES` and `NOT-FOUND-INLINES` or something like that?
18	Just thinking about usability - what is the canonical way for users to know that their address hasn't been found in JSON output? It might be worth documenting this somewhere.
26	Consider a comment for this case, introducing the following set of test cases. In particular, it's important to note that this is using a stdin argument. You could also move all the references to "no-inlines" to one place. I.e. something like "this test case is testing stdin input, with the --no-inlines option" would do the trick. Same sort of thing goes below.
29	Nit: missing full stop.
30	I find the `"Address":"0x0"` bit here somewhat confusing, given this is an error case. I'd consider omitting it entirely. Alternatively, simply print what the input address was specified as (in this case `"Address":"some text"`).
33	It would be nice to have one or more test cases with a non-zero discriminator value. Also, where the "Source" parameter is non-empty. You can simplify `{{/\|\\\\}}` to this: `{{[\\/]}}`. I think it's slightly more readable due to avoiding the quadruple backslash. Same goes elsewhere below.
50	I guess the obvious question is: why do we have a difference in behaviour surrounding the `FunctionName` attribute? The motivation for different symbolizer/addr2line output is because llvm-addr2line needs to be compatible with GNU addr2line, but that principle doesn't apply for JSON output (which AFAIK is not a GNU addr2line supported feature).
54	Nit: missing full stop.
llvm/test/tools/llvm-symbolizer/output-style-json-data.test
33	I'd make one of these addresses non-zero, as that will show that the "Address" and "Start" parameters are not just always 0.
llvm/test/tools/llvm-symbolizer/output-style-json-frame.test
30	I think you need a test case with a non-empty "TagOffset".
42	It looks like me like this hasn't been addressed?

aorlov updated this revision to Diff 339790.Apr 22 2021, 3:02 PM

aorlov marked 11 inline comments as done.Apr 22 2021, 3:13 PM

aorlov added inline comments.

llvm/test/tools/llvm-symbolizer/output-style-json-code.test
12	Unfortunately -D would not work in this case, as JSON has own rules for escaping special symbols in paths.
16	It is not applicable anymore.
18	Yes. This is the Symbolize library design issue. As far as I can tell there is no reliable way to get a distinct result for the symbol not found case. For now I just keep it transparent by serializing whatever the library returns, as addressing the library issue is out of the scope of this patch.
33	I have added the test for a non-zero discriminator. Note the address is hardcoded. It is more correct to use /Inputs/discrim.inp via stdin, but I just copied the address from discriminator.test. It seems currently we have no binaries in llvm/test/tools/llvm-symbolizer/Inputs with a valid Source info. You can simplify No, it does not work because JSON has own rules for escaping special symbols in paths.

aorlov added inline comments.Apr 22 2021, 3:13 PM

llvm/test/tools/llvm-symbolizer/output-style-json-code.test
50	This behavior is a part of llvm-symbolizer and does not depend on the output styles and does not belong to the printer (look at decideHowToPrintFunctions()). Not sure if I understand what you are saying. Changing the behavior is out of the scope of this patch. I agree that it does not make much sense in testing that, but one of the reviewers requested these tests.
llvm/test/tools/llvm-symbolizer/output-style-json-frame.test
42	Intention was to break a dependency on DWARF generation in clang. But I have changed it build from the C source to keep it simple for reading.

Harbormaster completed remote builds in B100389: Diff 339790.Apr 22 2021, 5:08 PM

aorlov updated this revision to Diff 339949.Apr 23 2021, 2:01 AM

Harbormaster completed remote builds in B100514: Diff 339949.Apr 23 2021, 4:01 AM

Ping.

Hi @aorlov,

The community norm is for a week between updates and a ping/between consecutive pings. I haven't had a chance to come back to this just yet due to my workload. I hope to get to it in the next day or two.

Out of time for today. Will come back to this another time.

llvm/docs/CommandGuide/llvm-symbolizer.rst
256–258	Make this all one paragraph, and reflow to 80 character limit. I'd actually rephrase it slightly too: "If addresses are supplied via stdin, the output JSON will be a series of individual objects. Otherwise, all results will be contained in a single array."
279	Sorry, use `-p`, not `--pretty-print` (I just realised that's what's used in the previous example).
294–296	I'd rephrase as in the inline comment (please make sure to reflow if necessary).
llvm/test/tools/llvm-symbolizer/output-style-json-code.test
4–5	Please reflow this comment to 80-character limit.
7
12–13	As before, do we need both `--no-inlines` and `--inlines` cases?
33	It seems currently we have no binaries in llvm/test/tools/llvm-symbolizer/Inputs with a valid Source info. Consider generating one at test time using assembly or yaml2obj. You can simplify No, it does not work because JSON has own rules for escaping special symbols in paths. Okay - I missed the double backslash.
50	Ah, I think we've hit on a fundamental question regarding JSON output - what should the options that change what information is printed do to JSON output? I'm thinking here specifically both `--functions` and `--addresses`, but it may apply to others too.

aorlov updated this revision to Diff 340882.Apr 27 2021, 9:40 AM

aorlov marked 7 inline comments as done.Apr 27 2021, 9:49 AM

aorlov added inline comments.

llvm/test/tools/llvm-symbolizer/output-style-json-code.test
7	But note the library name is `symbolize`, not `symbolizer`.
33	I have added output-style-json-code-source.c.

Used --print-source-context-lines to control Source printout in JSON.

Harbormaster completed remote builds in B101202: Diff 340882.Apr 27 2021, 11:18 AM

Harbormaster completed remote builds in B101219: Diff 340908.Apr 27 2021, 1:37 PM

aorlov updated this revision to Diff 341070.Apr 27 2021, 9:50 PM

Harbormaster completed remote builds in B101324: Diff 341070.Apr 27 2021, 10:35 PM

aorlov updated this revision to Diff 342279.May 2 2021, 2:34 PM

Harbormaster completed remote builds in B102208: Diff 342279.May 2 2021, 3:21 PM

Ping.

jhenderson added inline comments.May 5 2021, 2:04 AM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
94–95	Is this clang-formatted properly? Slightly surprised there's no space after `override`.
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
230–231	`ErrorCode` is unused now. Also, no need for the `const` on `ErrorMsg`.
232	Usually we use `StringRef.str()` to convert to a `std::string`. Same applies below in the error message line.
247–248	Don't abbeviate these names unnecessarily like this. `Array` and `FrameCount` would both be acceptable, for example. In general, avoid single letter variable names except for loop counters. Same goes throughout this patch. See https://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly for details: We cannot stress enough how important it is to use descriptive names. Pick names that match the semantics and role of the underlying entities, within reason. Avoid abbreviations unless they are well known. I think you can inline `N` into the initialising part of the `for` loop? I don't see it being used outside the loop.
251	`Object` maybe?
261	I think adding source context can be a later patch. Let's try to avoid adding more than we have to in this one patch.
llvm/test/tools/llvm-symbolizer/output-style-json-code-source.c
3 ↗	(On Diff #342279)	This won't work. clang isn't available in llvm-symbolizer tests, as the latter are part of the LLVM layer. Clang depends on the LLVM layer, but not vice-versa. You might be able to use `llvm-mc -g` to compile some assembly to be able to test this though.
llvm/test/tools/llvm-symbolizer/output-style-json-code.test
4
7	Fair point. How about simply `## Show how library errors are reported in the output.`
llvm/test/tools/llvm-symbolizer/output-style-json-data.test
6	Same comment as the code test.
28	Same comment as above.
llvm/test/tools/llvm-symbolizer/output-style-json-frame.c
24 ↗	(On Diff #342279)	As above. This won't work. You should use llvm-mc to build from assembly, as in other test cases.

aorlov updated this revision to Diff 343189.May 5 2021, 2:15 PM

I think adding source context can be a later patch. Let's try to avoid adding more than we have to in this one patch.

Nice.
Note you asked for a new test for Source a week ago.
Ok, I have removed Source from the JSON output and removed the new test for Source.

I have restored output-style-json-frame.test instead of output-style-json-frame.c
Note output-style-json-frame.test is huge because it is machine generated and you asked for a non zero and non empty TagOffset.
I don't like an idea to clean up or optimize it anyway.

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
94–95	It is generated by clang-format and validated by clang-tidy.
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
251	It is not necessary anymore because `Source` has been removed.

dblaikie added inline comments.May 5 2021, 2:33 PM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
94–95	The extraneous trailing semicolon is probably confusing clang-format, so maybe remove those and see how it gets formatted?

Harbormaster completed remote builds in B102847: Diff 343189.May 5 2021, 3:28 PM

aorlov updated this revision to Diff 343427.May 6 2021, 8:56 AM

aorlov marked an inline comment as done.

Harbormaster completed remote builds in B103011: Diff 343427.May 6 2021, 9:36 AM

jhenderson added inline comments.May 7 2021, 12:38 AM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
94–95	Ah, good spot!
llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp
231	There's still an unfortunate single-letter variable name here. I'd suggest `Json` or `Object` for the variable name.
240	Same comment as above. How about `InliningInfo`?
248	`LI` is a similar abbreviation which doesn't clearly indicate its name. What's wrong with `LineInfo`?
260	`J` -> `Json` or `Object`
273	`Object` or `Json`.
284	`Local`
285	Maybe use `FrameObject` here to disambiguate from the `toJson` retiurn below.
296	`Json` or `RequestJson` or similar.
315	`Json` or `Object`
llvm/test/tools/llvm-symbolizer/output-style-json-code.test
47	If I follow this correctly, this test case is showing that llvm-addr2line with -f results in the function name being printed? Assuming that's correct, I think you need to show that llvm-addr2line without -f results in the function name not being included in the JSON output too.
llvm/test/tools/llvm-symbolizer/output-style-json-frame.test
39	Please include the version of clang explicitly here, not just in the ident string below. The reason is that someone might come along and trim off the superfluous parts of the below assembly to minimise the test case, but it would still be helpful to know how to generate the unmodified part. I would also use a final release version of clang, so that a future user can easily get the exact version of clang well into the future.

aorlov updated this revision to Diff 343649.May 7 2021, 5:14 AM

aorlov marked 11 inline comments as done.May 7 2021, 5:18 AM

Harbormaster completed remote builds in B103181: Diff 343649.May 7 2021, 6:01 AM

dblaikie added inline comments.May 7 2021, 12:55 PM

llvm/test/tools/llvm-symbolizer/output-style-json-frame.test
39	This seems like a fairly non-trivial build - might be worth an explanation about what's interesting about this case that's not exercised by simpler cases (such as ones without sanitizers, maybe also x86 plain (not a necessity, but curious if it's something ARM specific here), etc)

I have replaced output-style-json-frame.test with output-style-json-frame.ll, which is much friendlier but still contains non-zero TagOffset.

Harbormaster completed remote builds in B103396: Diff 343923.May 9 2021, 12:17 PM

jhenderson added inline comments.May 10 2021, 12:25 AM

llvm/test/tools/llvm-symbolizer/output-style-json-frame.ll
1 ↗	(On Diff #343923)	Does this need a `REQUIRES: aarch64-registered-target` or equivalent?
20 ↗	(On Diff #343923)	Perhaps highlight that you are testing both 0 and non-zero frame offsets with a comment.
25 ↗	(On Diff #343923)	This code could probably do with some comments highlighting what the key elements are, so that future changes don't lose them.

aorlov updated this revision to Diff 344128.May 10 2021, 11:33 AM

aorlov added inline comments.May 10 2021, 11:35 AM

llvm/test/tools/llvm-symbolizer/output-style-json-frame.ll
1 ↗	(On Diff #343923)	It does not require any ARM target. Note this test is passed on x64 Windows and x64 Debian.
20 ↗	(On Diff #343923)	I have updated output-style-json-frame.ll. Now it contains 0, non-zero and empty (missing) TagOffset to cover all possible cases. I added a comment too.
25 ↗	(On Diff #343923)	There are no any key elements. I just declared 3 variables with different type, size and all possible TagOffset per your requests. I personally think it is overkill, and does not really belong to the JSON patch, as there is nothing JSON specific there. It seems a test for the symbolize library and the symbolizer itself.

Harbormaster completed remote builds in B103542: Diff 344128.May 10 2021, 11:46 AM

aorlov updated this revision to Diff 344183.May 10 2021, 1:29 PM

I have added the aarch64 requirements, and a few comments in that test for target offsets. Hope this should address it.

Harbormaster completed remote builds in B103578: Diff 344183.May 10 2021, 2:02 PM

LGTM, with one nit.

llvm/test/tools/llvm-symbolizer/output-style-json-frame.ll
1 ↗	(On Diff #344183)	Nit: similar to the `##` in the other tests, use `;;` for comments in this test to help the comments stand out from lit and FileCheck directives.

This revision is now accepted and ready to land.May 11 2021, 12:48 AM

This revision was landed with ongoing or failed builds.May 11 2021, 2:11 AM

Closed by commit rG05d1ae4e18fa: * Add support for JSON output style to llvm-symbolizer (authored by aorlov). · Explain Why

This revision was automatically updated to reflect the committed changes.

aorlov added a commit: rG05d1ae4e18fa: * Add support for JSON output style to llvm-symbolizer.

Please make another commit to fix the unnecessary introduction of semi-colons in the places I've highlighted.

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
40–49	Why have these superfluous semi-colons reappeared?
94–95	Ditto.

MaskRay added inline comments.May 11 2021, 12:36 PM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
49	`virtual ~DIPrinter(){};` => `virtual ~DIPrinter() {}` Does clang-format complain on `virtual ~DIPrinter(){};` ?

dblaikie added inline comments.May 11 2021, 1:04 PM

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h
49	clang-format doesn't have warnings/errors really - it does it's best to guess at what's going on and format it. The extra ';' confuse clang-format and so it formats poorly. @jhenderson also asked @aorlov to remove the unnecessary semicolons earlier: Please make another commit to fix the unnecessary introduction of semi-colons in the places I've highlighted. @aorlov - could you take a look at this & ensure the semicolons have been removed?

In D96883#2750037, @jhenderson wrote:

Please make another commit to fix the unnecessary introduction of semi-colons in the places I've highlighted.

Done.
Sorry for that. Something went wrong when switching branches.

vitalybuka mentioned this in rG85a96d82ca76: [symbolizer] Fix leak after D96883.May 11 2021, 10:52 PM

simon.giesecke mentioned this in D102224: Add option to llvm-gsymutil to read addresses from stdin..May 12 2021, 1:37 AM

Revision Contents

Path

Size

llvm/

docs/

CommandGuide/

llvm-symbolizer.rst

15 lines

include/

llvm/

DebugInfo/

Symbolize/

DIPrinter.h

17 lines

lib/

DebugInfo/

Symbolize/

DIPrinter.cpp

146 lines

test/

tools/

llvm-symbolizer/

output-style-json-code.test

67 lines

output-style-json-data.test

44 lines

output-style-json-frame.test

213 lines

tools/

llvm-symbolizer/

Opts.td

4 lines

llvm-symbolizer.cpp

29 lines

Diff 333102

llvm/docs/CommandGuide/llvm-symbolizer.rst

Show First 20 Lines • Show All 230 Lines • ▼ Show 20 Lines

.. option:: --obj <path>, --exe, -e .. option:: --obj <path>, --exe, -e

Path to object file to be symbolized. If ``-`` is specified, read the object Path to object file to be symbolized. If ``-`` is specified, read the object

directly from the standard input stream. directly from the standard input stream.

.. _llvm-symbolizer-opt-output-style: .. _llvm-symbolizer-opt-output-style:

.. option:: --output-style <LLVM|GNU> .. option:: --output-style <LLVM|GNU|JSON>

Specify the preferred output style. Defaults to ``LLVM``. When the output Specify the preferred output style. Defaults to ``LLVM``. When the output

style is set to ``GNU``, the tool follows the style of GNU's **addr2line**. style is set to ``GNU``, the tool follows the style of GNU's **addr2line**.

The differences from the ``LLVM`` style are: The differences from the ``LLVM`` style are:

* Does not print the column of a source code location. * Does not print the column of a source code location.

* Does not add an empty line after the report for an address. * Does not add an empty line after the report for an address.

* Does not replace the name of an inlined function with the name of the * Does not replace the name of an inlined function with the name of the

topmost caller when inlined frames are not shown and :option:`--use-symbol-table` topmost caller when inlined frames are not shown and :option:`--use-symbol-table`

is on. is on.

* Prints an address's debug-data discriminator when it is non-zero. One way to * Prints an address's debug-data discriminator when it is non-zero. One way to

produce discriminators is to compile with clang's -fdebug-info-for-profiling. produce discriminators is to compile with clang's -fdebug-info-for-profiling.

``JSON`` style provides a machine readable output in JSON.

jhendersonUnsubmitted

Done

produce discriminators is to compile with clang's -fdebug-info-for-profiling.

- ``JSON`` style provides a machine readable output.

+ ``JSON`` style provides a machine readable output in JSON.

.. code-block:: console

jhenderson:

jhendersonUnsubmitted

Done

Perhaps worth briefly mentioning the behaviour changes for the following cases:

Stdin versus addresses on command-line (i.e. separate individual objects versus list of objects)
--pretty-print versus no --pretty-print. Optionally, this could be under the --pretty-print option discussion.

I don't think you need concrete examples of the differences. Just a one/two sentence description.

jhenderson: Perhaps worth briefly mentioning the behaviour changes for the following cases: # Stdin…

.. code-block:: console .. code-block:: console

jhendersonUnsubmitted

Done

Make this all one paragraph, and reflow to 80 character limit. I'd actually rephrase it slightly too:

"If addresses are supplied via stdin, the output JSON will be a series of individual objects. Otherwise, all results will be contained in a single array."

jhenderson: Make this all one paragraph, and reflow to 80 character limit. I'd actually rephrase it…

$ llvm-symbolizer --obj=inlined.elf 0x4004be 0x400486 -p $ llvm-symbolizer --obj=inlined.elf 0x4004be 0x400486 -p

baz() at /tmp/test.cpp:11:18 baz() at /tmp/test.cpp:11:18

(inlined by) main at /tmp/test.cpp:15:0 (inlined by) main at /tmp/test.cpp:15:0

foo() at /tmp/test.cpp:6:3 foo() at /tmp/test.cpp:6:3

$ llvm-symbolizer --output-style=LLVM --obj=inlined.elf 0x4004be 0x400486 -p --no-inlines $ llvm-symbolizer --output-style=LLVM --obj=inlined.elf 0x4004be 0x400486 -p --no-inlines

main at /tmp/test.cpp:11:18 main at /tmp/test.cpp:11:18

foo() at /tmp/test.cpp:6:3 foo() at /tmp/test.cpp:6:3

$ llvm-symbolizer --output-style=GNU --obj=inlined.elf 0x4004be 0x400486 -p --no-inlines $ llvm-symbolizer --output-style=GNU --obj=inlined.elf 0x4004be 0x400486 -p --no-inlines

baz() at /tmp/test.cpp:11 baz() at /tmp/test.cpp:11

foo() at /tmp/test.cpp:6 foo() at /tmp/test.cpp:6

$ clang -g -fdebug-info-for-profiling test.cpp -o profiling.elf $ clang -g -fdebug-info-for-profiling test.cpp -o profiling.elf

grimarUnsubmitted

Done

Note, my previous comment says:
"I am also not sure it is useful to have a version without --no-inlines here: GNU and LLVM samples doesn't have it."

I.e. the JSON sample is slightly inconsistent now with GNU/LLVM. At the same time, as I've mentioned,
it is not a documentation for --no-inlines, so it is perhaps fine with me. May be other people have a more strong opinion
(I am not sure, why --no-inlines was used for GNU/LLVM first of all).

grimar: Note, my previous comment says: "I am also not sure it is useful to have a version **without**…

jhendersonUnsubmitted

Not Done

@grimar, see the output for the first of these examples (line 260), which is the same, but without --no-inlines. The aim of these examples is to highlight the differences in the GNU and LLVM output, one of which is to do with the --no-inlines option. Hence there is also an example highlighting the discriminator difference. I don't think we need to highlight this distinction specifically for JSON output, because the output format is completely different anyway.

@aorlov, please move this example to below the second GNU style example immediately below. I'd also drop the --no-inlines option too (and update the text to match). See my above comment for why.

jhenderson: @grimar, see the output for the first of these examples (line 260), which is the same, but…

$ llvm-symbolizer --output-style=GNU --obj=profiling.elf 0x401167 -p --no-inlines $ llvm-symbolizer --output-style=GNU --obj=profiling.elf 0x401167 -p --no-inlines

main at /tmp/test.cpp:15 (discriminator 2) main at /tmp/test.cpp:15 (discriminator 2)

$ llvm-symbolizer --output-style=JSON --obj=inlined.elf 0x4004be 0x400486

grimarUnsubmitted

Done

The new block is not on the right place?
I think it should go right after similars blocks for GNU/LLVM styles above (after foo() at /tmp/test.cpp:6 at line 273).

I am also not sure it is useful to have a version without --no-inlines here: GNU and LLVM samples doesn't have it.
It is a documentation for --output-style option and not a test case for --no-inlines, so I perhaps see no reason to have it here.
Am I missing some intention?

grimar: The new block is not on the right place? I think it should go right after similars blocks for…

jhendersonUnsubmitted

Done

Maybe use the --pretty-print option to make this example a bit more readable?

jhenderson: Maybe use the `--pretty-print` option to make this example a bit more readable?

jhendersonUnsubmitted

Done

Sorry, use -p, not --pretty-print (I just realised that's what's used in the previous example).

jhenderson: Sorry, use `-p`, not `--pretty-print` (I just realised that's what's used in the previous…

[{"ModuleName":"inlined.elf","Address":"0x4004be","Error":{"Code":0},"DIInliningInfo":{"Frames":[

{"FunctionName":"baz()","StartFileName":"/tmp/test.cpp","StartLine":9,"FileName":"/tmp/test.cpp","Line":11,"Column":18},

{"FunctionName":"main","StartFileName":"/tmp/test.cpp","StartLine":14,"FileName":"/tmp/test.cpp","Line":15}]}},

{"ModuleName":"inlined.elf","Address":"0x400486","Error":{"Code":0},"DIInliningInfo":{"Frames":[

{"FunctionName":"foo()","StartFileName":"/tmp/test.cpp","StartLine":5,"FileName":"/tmp/test.cpp","Line": 6,"Column": 3}]}}]

$ llvm-symbolizer --output-style=JSON --obj=inlined.elf 0x4004be --no-inlines

{"ModuleName":"inlined.elf","Address":"0x4004be","Error":{"Code":0},"DILineInfo":

{"FunctionName":"main","StartFileName":"/tmp/test.cpp","StartLine":14,"FileName":"/tmp/test.cpp","Line":11,"Column":18}}

.. option:: --pretty-print, -p .. option:: --pretty-print, -p

Print human readable output. If :option:`--inlining` is specified, the Print human readable output. If :option:`--inlining` is specified, the

enclosing scope is prefixed by (inlined by). enclosing scope is prefixed by (inlined by).

.. code-block:: console .. code-block:: console

jhendersonUnsubmitted

Done

enclosing scope is prefixed by (inlined by).

- You can use the - pretty-print option to get a nicely formatted JSON

- when using the corresponding output style, otherwise it is intended for

- machine parsing and has a compact form.

+ For JSON output, the option will cause JSON to be indented and

+ split over new lines. Otherwise, the JSON output will be printed

+ in a compact form.

.. code-block:: console

I'd rephrase as in the inline comment (please make sure to reflow if necessary).

jhenderson: I'd rephrase as in the inline comment (please make sure to reflow if necessary).

$ llvm-symbolizer --obj=inlined.elf 0x4004be --inlining --pretty-print $ llvm-symbolizer --obj=inlined.elf 0x4004be --inlining --pretty-print

baz() at /tmp/test.cpp:11:18 baz() at /tmp/test.cpp:11:18

(inlined by) main at /tmp/test.cpp:15:0 (inlined by) main at /tmp/test.cpp:15:0

.. option:: --print-address, --addresses, -a .. option:: --print-address, --addresses, -a

Print address before the source code location. Defaults to false. Print address before the source code location. Defaults to false.

▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h

Show All 22 Lines
class DIInliningInfo;		class DIInliningInfo;
struct DIGlobal;		struct DIGlobal;
struct DILocal;		struct DILocal;
class ErrorInfoBase;		class ErrorInfoBase;
class raw_ostream;		class raw_ostream;

namespace symbolize {		namespace symbolize {

struct Request {		struct Request {
		jhendersonUnsubmitted Not Done Reply Inline Actions This isn't a good type name. What is a "Common"? Would it make more sense to pass the common data as a separate argument to the non-common `Result`? jhenderson: This isn't a good type name. What is a "Common"? Would it make more sense to pass the common…
StringRef ModuleName;		StringRef ModuleName;
uint64_t Address = 0;		uint64_t Address = 0;
Request(const StringRef ModuleName, uint64_t Address)		Request(const StringRef ModuleName, uint64_t Address)
: ModuleName(ModuleName), Address(Address){};		: ModuleName(ModuleName), Address(Address){};
};		};

class DIPrinter {		class DIPrinter {
public:		public:
enum class OutputStyle { LLVM, GNU };		enum class OutputStyle { LLVM, GNU, JSON };

protected:		protected:
raw_ostream &OutputStream;		raw_ostream &OutputStream;
raw_ostream &ErrorStream;		raw_ostream &ErrorStream;

public:		public:
DIPrinter(raw_ostream &OS, raw_ostream &ES)		DIPrinter(raw_ostream &OS, raw_ostream &ES)
: OutputStream(OS), ErrorStream(ES) {}		: OutputStream(OS), ErrorStream(ES) {}
virtual ~DIPrinter(){};		virtual ~DIPrinter(){};
		jhendersonUnsubmitted Not Done Reply Inline Actions Why have these superfluous semi-colons reappeared? jhenderson: Why have these superfluous semi-colons reappeared?
		MaskRayUnsubmitted Not Done Reply Inline Actions `virtual ~DIPrinter(){};` => `virtual ~DIPrinter() {}` Does clang-format complain on `virtual ~DIPrinter(){};` ? MaskRay: `virtual ~DIPrinter(){};` => `virtual ~DIPrinter() {}` Does clang-format complain on `virtual…
		dblaikieUnsubmitted Not Done Reply Inline Actions clang-format doesn't have warnings/errors really - it does it's best to guess at what's going on and format it. The extra ';' confuse clang-format and so it formats poorly. @jhenderson also asked @aorlov to remove the unnecessary semicolons earlier: Please make another commit to fix the unnecessary introduction of semi-colons in the places I've highlighted. @aorlov - could you take a look at this & ensure the semicolons have been removed? dblaikie: clang-format doesn't have warnings/errors really - it does it's best to guess at what's going…

virtual void print(const Request &Request, const DILineInfo &Info) = 0;		virtual void print(const Request &Request, const DILineInfo &Info) = 0;
virtual void print(const Request &Request, const DIInliningInfo &Info) = 0;		virtual void print(const Request &Request, const DIInliningInfo &Info) = 0;
virtual void print(const Request &Request, const DIGlobal &Global) = 0;		virtual void print(const Request &Request, const DIGlobal &Global) = 0;
virtual void print(const Request &Request,		virtual void print(const Request &Request,
		jhendersonUnsubmitted Not Done Reply Inline Actions Perhaps `printError` is good, because the type signature in its current form doesn't make it clear that this is for error reporting. jhenderson: Perhaps `printError` is good, because the type signature in its current form doesn't make it…
const std::vector<DILocal> &Locals) = 0;		const std::vector<DILocal> &Locals) = 0;

virtual bool printError(const Request &Request,		virtual bool printError(const Request &Request,
		jhendersonUnsubmitted Not Done Reply Inline Actions We should probably have `DIPrinterLLVM` and `DIPrinterGNU`. The two could largely share functionality under the hood, but I think it is a cleaner interface than one implementation of DIPrinter controlling all its output style, and the other having to look at another variable to choose between styles. jhenderson: We should probably have `DIPrinterLLVM` and `DIPrinterGNU`. The two could largely share…
const ErrorInfoBase &ErrorInfo,		const ErrorInfoBase &ErrorInfo,
const StringRef ErrorBanner = "") = 0;		const StringRef ErrorBanner = "") = 0;
};		};

class PlainPrinterBase : public DIPrinter {		class PlainPrinterBase : public DIPrinter {
protected:		protected:
bool PrintAddress;		bool PrintAddress;
bool PrintFunctionNames;		bool PrintFunctionNames;
Show All 20 Lines	public:
void print(const Request &Request, const DIInliningInfo &Info) override;		void print(const Request &Request, const DIInliningInfo &Info) override;
void print(const Request &Request, const DIGlobal &Global) override;		void print(const Request &Request, const DIGlobal &Global) override;
void print(const Request &Request,		void print(const Request &Request,
const std::vector<DILocal> &Locals) override;		const std::vector<DILocal> &Locals) override;

bool printError(const Request &Request, const ErrorInfoBase &ErrorInfo,		bool printError(const Request &Request, const ErrorInfoBase &ErrorInfo,
const StringRef ErrorBanner = "") override;		const StringRef ErrorBanner = "") override;
};		};

class LLVMPrinter : public PlainPrinterBase {		class LLVMPrinter : public PlainPrinterBase {
		jhendersonUnsubmitted Not Done Reply Inline Actions Is this clang-formatted properly? Slightly surprised there's no space after `override`. jhenderson: Is this clang-formatted properly? Slightly surprised there's no space after `override`.
		aorlovAuthorUnsubmitted Done Reply Inline Actions It is generated by clang-format and validated by clang-tidy. aorlov: It is generated by clang-format and validated by clang-tidy.
		dblaikieUnsubmitted Done Reply Inline Actions The extraneous trailing semicolon is probably confusing clang-format, so maybe remove those and see how it gets formatted? dblaikie: The extraneous trailing semicolon is probably confusing clang-format, so maybe remove those and…
		jhendersonUnsubmitted Done Reply Inline Actions Ah, good spot! jhenderson: Ah, good spot!
		jhendersonUnsubmitted Not Done Reply Inline Actions Ditto. jhenderson: Ditto.
private:		private:
void print(const DILineInfo &Info, bool Inlined) override;		void print(const DILineInfo &Info, bool Inlined) override;
void printFooter() override;		void printFooter() override;

public:		public:
LLVMPrinter(raw_ostream &OS, raw_ostream &ES, bool PrintAddress = false,		LLVMPrinter(raw_ostream &OS, raw_ostream &ES, bool PrintAddress = false,
bool PrintFunctionNames = true, bool PrintPretty = false,		bool PrintFunctionNames = true, bool PrintPretty = false,
int PrintSourceContext = 0, bool Verbose = false)		int PrintSourceContext = 0, bool Verbose = false)
: PlainPrinterBase(OS, ES, PrintAddress, PrintFunctionNames, PrintPretty,		: PlainPrinterBase(OS, ES, PrintAddress, PrintFunctionNames, PrintPretty,
PrintSourceContext, Verbose) {}		PrintSourceContext, Verbose) {}
};		};

class GNUPrinter : public PlainPrinterBase {		class GNUPrinter : public PlainPrinterBase {
private:		private:
void print(const DILineInfo &Info, bool Inlined) override;		void print(const DILineInfo &Info, bool Inlined) override;

public:		public:
GNUPrinter(raw_ostream &OS, raw_ostream &ES, bool PrintAddress = false,		GNUPrinter(raw_ostream &OS, raw_ostream &ES, bool PrintAddress = false,
bool PrintFunctionNames = true, bool PrintPretty = false,		bool PrintFunctionNames = true, bool PrintPretty = false,
int PrintSourceContext = 0, bool Verbose = false)		int PrintSourceContext = 0, bool Verbose = false)
		jhendersonUnsubmitted Not Done Reply Inline Actions This doesn't feel like the right design choice here. I think a better thing to do would be to have a function like `DIPrinter &operator<<(Error E);` which can be used for reporting errors for all output formats. jhenderson: This doesn't feel like the right design choice here. I think a better thing to do would be to…
		grimarUnsubmitted Not Done Reply Inline Actions I'd like to explain it. It was initially done in that way few diffs ago. I've suggested to add the `printErrorJSON` because it didn't feel right to me to have `operator<<(Error E)`, because having an error it is not a normal regular output case. It feels to me that having a named function, rather than overloading a regular `operator<<` looks cleaner in this case: when used it emphasizes that the error handling is performed. It is a bit subtle: having an error for JSON output is kind of normal, because we have a special output. LLVM/GNU cases are different, we print the following currently: `LLVMSymbolizer: error reading file: <reason>` I.e. for the latter case we have a caller that reports an error (caller even could just call `exit(1)` there, if it wanted to, and I would assume it could be normal). For the JSON case we have a special printer logic, and the caller should no nothing. I've mentioned earlier that the error code doesn't seem to be very useful to have? If so we could have something like `virtual bool onError(const std::string &Msg); { return false; }` for all output formats. Then the logic could be like: if (onError(toString(E)) { // Handled by JSON. return; } error(...); // GNU,LLVM (Note: I believe that reporting `LLVMSymbolizer: error reading file: <reason>` from LLVM/GNU subclasses of DIPrinter is not a good thing. Because for them an error output is not a part of their output style, it is just an error. That is why I was assuming that the caller should report errors for them, like it does now). And, of course, such logic from above only can only work if we do the refactoring you've suggested first ("to have a separate class per output format (JSON, GNU, LLVM), which share a common interface"). It wasn't clear to me that we might want to add more output styles in the future (I don't know a reason for that currently). So I've suggested the simplest less intrusive approach: having a special `printErrorJSON` method. grimar: I'd like to explain it. It was initially done in that way few diffs ago. I've suggested to add…
: PlainPrinterBase(OS, ES, PrintAddress, PrintFunctionNames, PrintPretty,		: PlainPrinterBase(OS, ES, PrintAddress, PrintFunctionNames, PrintPretty,
PrintSourceContext, Verbose) {}		PrintSourceContext, Verbose) {}
		jhendersonUnsubmitted Not Done Reply Inline Actions Why not `ArrayRef` for `Locals`? jhenderson: Why not `ArrayRef` for `Locals`?
		aorlovAuthorUnsubmitted Done Reply Inline Actions I'm using the exact type that symbolizeFrame() returns. aorlov: I'm using the exact type that symbolizeFrame() returns.
};		};

		jhendersonUnsubmitted Not Done Reply Inline Actions Make sure to run clang-format on all your new code. jhenderson: Make sure to run clang-format on all your new code.
		class JSONPrinter : public DIPrinter {
		public:
		JSONPrinter(raw_ostream &OS, raw_ostream &ES) : DIPrinter(OS, ES) {}

		void print(const Request &Request, const DILineInfo &Info) override;
		jhendersonUnsubmitted Done Reply Inline Actions I guess it might be helpful to know what this list is of (e.g. `ObjectList` or whatever). jhenderson: I guess it might be helpful to know what this list is of (e.g. `ObjectList` or whatever).
		void print(const Request &Request, const DIInliningInfo &Info) override;
		void print(const Request &Request, const DIGlobal &Global) override;
		void print(const Request &Request,
		const std::vector<DILocal> &Locals) override;

		bool printError(const Request &Request, const ErrorInfoBase &ErrorInfo,
		const StringRef ErrorBanner = "") override;
		};

} // namespace symbolize		} // namespace symbolize
} // namespace llvm		} // namespace llvm

		jhendersonUnsubmitted Done Reply Inline Actions Use `StringRef`, not `const std::string &`. jhenderson: Use `StringRef`, not `const std::string &`.
#endif		#endif

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp

Show All 10 Lines

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "llvm/DebugInfo/Symbolize/DIPrinter.h" #include "llvm/DebugInfo/Symbolize/DIPrinter.h"

#include "llvm/ADT/StringRef.h" #include "llvm/ADT/StringRef.h"

#include "llvm/DebugInfo/DIContext.h" #include "llvm/DebugInfo/DIContext.h"

#include "llvm/Support/ErrorOr.h" #include "llvm/Support/ErrorOr.h"

#include "llvm/Support/Format.h" #include "llvm/Support/Format.h"

#include "llvm/Support/JSON.h"

#include "llvm/Support/LineIterator.h" #include "llvm/Support/LineIterator.h"

#include "llvm/Support/MemoryBuffer.h" #include "llvm/Support/MemoryBuffer.h"

#include "llvm/Support/Path.h" #include "llvm/Support/Path.h"

#include "llvm/Support/raw_ostream.h" #include "llvm/Support/raw_ostream.h"

#include <algorithm> #include <algorithm>

#include <cmath> #include <cmath>

#include <cstddef> #include <cstddef>

#include <cstdint> #include <cstdint>

#include <memory> #include <memory>

#include <string> #include <string>

namespace llvm { namespace llvm {

namespace symbolize { namespace symbolize {

// Prints source code around in the FileName the Line. // Prints source code around in the FileName the Line.

jhendersonUnsubmitted

Not Done

fix and opt are not good function names. Fix what? What does "opt" stand for ("optimize", "optional", "opt-out", ...). Please pick more self-explanatory names.

jhenderson: `fix` and `opt` are not good function names. Fix what? What does "opt" stand for ("optimize"…

grimarUnsubmitted

Done

I am not sure I'd introduce this helper: it is common to just inline the code like "0x" + Twine::utohexstr(V).
Though if you want it, probably the better name is toHex then.

grimar: I am not sure I'd introduce this helper: it is common to just inline the code like `"0x" +…

void PlainPrinterBase::printContext(const StringRef FileName, int64_t Line) { void PlainPrinterBase::printContext(const StringRef FileName, int64_t Line) {

if (PrintSourceContext <= 0) if (PrintSourceContext <= 0)

grimarUnsubmitted

Done

J.objectBegin();

- if (Info.Source.hasValue())

+ if (Info.Source)

J.attribute("Source", Info.Source.getValue());

Source is Optional<>, which has operator bool, so you can omit hasValue here and in other places below.

grimar: `Source` is `Optional<>`, which has `operator bool`, so you can omit `hasValue` here and in…

jhendersonUnsubmitted

Done

Should this function be static?

jhenderson: Should this function be `static`?

return; return;

grimarUnsubmitted

Done

It is common not to use Optional::getValue in favor of dereference:

J.attribute("Source", *Info.Source);

grimar: It is common not to use `Optional::getValue` in favor of dereference: ``` J.attribute("Source"…

ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrErr = ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrErr =

MemoryBuffer::getFile(FileName); MemoryBuffer::getFile(FileName);

jhendersonUnsubmitted

Not Done

Why llvm::StringRef and not simply StringRef? Also the V argument should be StringRef.

jhenderson: Why `llvm::StringRef` and not simply `StringRef`? Also the `V` argument should be `StringRef`.

if (!BufOrErr) if (!BufOrErr)

return; return;

std::unique_ptr<MemoryBuffer> Buf = std::move(BufOrErr.get()); std::unique_ptr<MemoryBuffer> Buf = std::move(BufOrErr.get());

int64_t FirstLine = int64_t FirstLine =

std::max(static_cast<int64_t>(1), Line - PrintSourceContext / 2); std::max(static_cast<int64_t>(1), Line - PrintSourceContext / 2);

int64_t LastLine = FirstLine + PrintSourceContext; int64_t LastLine = FirstLine + PrintSourceContext;

size_t MaxLineNumberWidth = std::ceil(std::log10(LastLine)); size_t MaxLineNumberWidth = std::ceil(std::log10(LastLine));

grimarUnsubmitted

Done

So, the Column key is only emitted when a column is not 0 (Info.Column is uint32_t). Is it expected behavior?
Is the intention is to omit the key to reduce the noise in the output? This needs a comment if so.

grimar: So, the `Column` key is only emitted when a column is not `0` (`Info.Column` is `uint32_t`). Is…

grimarUnsubmitted

Done

Thanks for adding the comment. You should probably also add an explicit test case to test/document the output when the Column value is 0 and the case when it is not.

grimar: Thanks for adding the comment. You should probably also add an explicit test case to…

aorlovAuthorUnsubmitted

Done

Done. Covered by the "Test JSON output style of empty DILineInfo" test along with the case of missing line information.

aorlov: Done. Covered by the "Test JSON output style of empty DILineInfo" test along with the case of…

grimarUnsubmitted

Not Done

Missing a full stop after output.

grimar: Missing a full stop after `output`.

jhendersonUnsubmitted

Done

J.attribute("FileName", Info.FileName);

- // Print only valid line and column to reduce a noise in the output

+ // Print only valid line and column to reduce noise in the output.

if (Info.Line)

I don't think we should be omitting Line and Column if they are 0. Probably the same goes for Discriminator, and maybe even the other fields. The problem is that by omitting them, the parser will have to handle the chance of the elements being missing.

jhenderson: I don't think we should be omitting `Line` and `Column` if they are 0. Probably the same goes…

for (line_iterator I = line_iterator(*Buf, false); for (line_iterator I = line_iterator(*Buf, false);

!I.is_at_eof() && I.line_number() <= LastLine; ++I) { !I.is_at_eof() && I.line_number() <= LastLine; ++I) {

int64_t L = I.line_number(); int64_t L = I.line_number();

if (L >= FirstLine && L <= LastLine) { if (L >= FirstLine && L <= LastLine) {

OutputStream << format_decimal(L, MaxLineNumberWidth); OutputStream << format_decimal(L, MaxLineNumberWidth);

if (L == Line) if (L == Line)

OutputStream << " >: "; OutputStream << " >: ";

▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines

} }

void PlainPrinterBase::print(const Request &Request, void PlainPrinterBase::print(const Request &Request,

const DIInliningInfo &Info) { const DIInliningInfo &Info) {

printHeader(Request.Address); printHeader(Request.Address);

uint32_t FramesNum = Info.getNumberOfFrames(); uint32_t FramesNum = Info.getNumberOfFrames();

if (FramesNum == 0) if (FramesNum == 0)

print(DILineInfo(), false); print(DILineInfo(), false);

else else

grimarUnsubmitted

Done

This change is unrelated to what this patch does? You should't mix cosmetic changes with functional when they are independent.

grimar: This change is unrelated to what this patch does? You should't mix cosmetic changes with…

for (uint32_t I = 0; I < FramesNum; ++I) for (uint32_t I = 0; I < FramesNum; ++I)

print(Info.getFrame(I), I > 0); print(Info.getFrame(I), I > 0);

printFooter(); printFooter();

} }

grimarUnsubmitted

Done

J.arrayBegin();

- for (uint32_t I = 0; I < FramesNum; I++) {

+ for (uint32_t I = 0; I < FramesNum; ++I) {

OS << '\n';

It is preffered to use the pre-increment form where possible.

grimar: It is preffered to use the pre-increment form where possible.

void PlainPrinterBase::print(const Request &Request, const DIGlobal &Global) { void PlainPrinterBase::print(const Request &Request, const DIGlobal &Global) {

printHeader(Request.Address); printHeader(Request.Address);

StringRef Name = Global.Name; StringRef Name = Global.Name;

if (Name == DILineInfo::BadString) if (Name == DILineInfo::BadString)

Name = DILineInfo::Addr2LineBadString; Name = DILineInfo::Addr2LineBadString;

OutputStream << Name << "\n"; OutputStream << Name << "\n";

OutputStream << Global.Start << " " << Global.Size << "\n"; OutputStream << Global.Start << " " << Global.Size << "\n";

grimarUnsubmitted

Not Done

Why do you convert uint64_t to int64_t here and in many places below?

grimar: Why do you convert `uint64_t` to `int64_t` here and in many places below?

aorlovAuthorUnsubmitted

Done

There is no json::OStream::attribute() version for uint32_t and uint64_t.
I can use an implicit conversion for uint32_t, but I need to convert uint64_t to int64_t explicitly.

aorlov: There is no json::OStream::attribute() version for uint32_t and uint64_t. I can use an…

grimarUnsubmitted

Done

But this is simply wrong and might print a wrong result isn't?

E.g. imagine you do:

uint64_t XXX = 0xffffffffeeeeeeee;
J.attribute("Start", int64_t(XXX));

The output is:

{"Name":"foo","Start":-286331154

I think Start shouldn't have a negative value.
Seems you need to update the json::OStream implementation to fix it.
You also need to add a test for such situation(s).

Also. should Start/Size be printed as hex? I think it is very common to use hex for addresses/sizes.

grimar: But this is simply wrong and might print a wrong result isn't? E.g. imagine you do: ```…

jhendersonUnsubmitted

Not Done

There is no json::OStream::attribute() version for uint32_t and uint64_t.

You could always add them!

jhenderson: > There is no json::OStream::attribute() version for uint32_t and uint64_t. You could always…

aorlovAuthorUnsubmitted

Done

Please note JSON allows 53-bits numbers. So we print huge numbers as strings "0x...".

aorlov: Please note JSON allows 53-bits numbers. So we print huge numbers as strings "0x...".

printFooter(); printFooter();

} }

void PlainPrinterBase::print(const Request &Request, void PlainPrinterBase::print(const Request &Request,

const std::vector<DILocal> &Locals) { const std::vector<DILocal> &Locals) {

printHeader(Request.Address); printHeader(Request.Address);

if (Locals.empty()) if (Locals.empty())

OutputStream << DILineInfo::Addr2LineBadString << '\n'; OutputStream << DILineInfo::Addr2LineBadString << '\n';

Show All 22 Lines for (const DILocal &L : Locals) {

OutputStream << *L.FrameOffset; OutputStream << *L.FrameOffset;

else else

OutputStream << DILineInfo::Addr2LineBadString; OutputStream << DILineInfo::Addr2LineBadString;

OutputStream << ' '; OutputStream << ' ';

if (L.Size) if (L.Size)

OutputStream << *L.Size; OutputStream << *L.Size;

else else

OutputStream << DILineInfo::Addr2LineBadString; OutputStream << DILineInfo::Addr2LineBadString;

grimarUnsubmitted

Done

I'd suggest just for (const DILocal& L : Locals)

grimar: I'd suggest just `for (const DILocal& L : Locals)`

OutputStream << ' '; OutputStream << ' ';

if (L.TagOffset) if (L.TagOffset)

jhendersonUnsubmitted

Not Done

Delete this. We don't use arbitrary comment markers to divide up files in LLVM coding style.

jhenderson: Delete this. We don't use arbitrary comment markers to divide up files in LLVM coding style.

OutputStream << *L.TagOffset; OutputStream << *L.TagOffset;

else else

OutputStream << DILineInfo::Addr2LineBadString; OutputStream << DILineInfo::Addr2LineBadString;

OutputStream << '\n'; OutputStream << '\n';

jhendersonUnsubmitted

Not Done

You seem to have ignored my earlier comments re. printing everything?

jhenderson: You seem to have ignored my earlier comments re. printing everything?

} }

printFooter(); printFooter();

} }

bool PlainPrinterBase::printError(const Request &Request, bool PlainPrinterBase::printError(const Request &Request,

const ErrorInfoBase &ErrorInfo, const ErrorInfoBase &ErrorInfo,

const StringRef ErrorBanner) { const StringRef ErrorBanner) {

if (!ErrorBanner.empty()) { if (!ErrorBanner.empty()) {

ErrorStream << ErrorBanner; ErrorStream << ErrorBanner;

grimarUnsubmitted

Done

The same comments as I've added for void toJSON(...) above applies here.

grimar: The same comments as I've added for `void toJSON(...)` above applies here.

ErrorInfo.log(ErrorStream); ErrorInfo.log(ErrorStream);

ErrorStream << '\n'; ErrorStream << '\n';

// Print an empty struct too. // Print an empty struct too.

jhendersonUnsubmitted

Not Done

J.attribute("FileName", Info.FileName);

- // Print only valid line and column to reduce noise in the output

+ // Print only valid line and column to reduce noise in the output.

if (Info.Line)

jhenderson:

return true; return true;

} }

OutputStream << ErrorInfo.message() << '\n'; OutputStream << ErrorInfo.message() << '\n';

return false; return false;

} }

std::string toHex(uint64_t V) { return ("0x" + Twine::utohexstr(V)).str(); }

jhendersonUnsubmitted

Done

static

jhenderson: `static`

jhendersonUnsubmitted

Not Done

As noted above, I think this would be better if it took an Error. The implementation would then "handle" the error by extracting the message and storing it in the Message field of the JSON output. The error code is probably not useful for anything, so I'd omit it entirely.

jhenderson: As noted above, I think this would be better if it took an `Error`. The implementation would…

static void toJSON(json::OStream &J, const DILineInfo &Info) {

jhendersonUnsubmitted

Done

static

jhenderson: `static`

J.objectBegin();

if (Info.Source)

J.attribute("Source", *Info.Source);

grimarUnsubmitted

Done

You don't need the cast here, because value() is int?

grimar: You don't need the cast here, because `value()` is `int`?

jhendersonUnsubmitted

Done

ErrorCode is unused now. Also, no need for the const on ErrorMsg.

jhenderson: `ErrorCode` is unused now. Also, no need for the `const` on `ErrorMsg`.

jhendersonUnsubmitted

Done

There's still an unfortunate single-letter variable name here. I'd suggest Json or Object for the variable name.

jhenderson: There's still an unfortunate single-letter variable name here. I'd suggest `Json` or `Object`…

if (Info.FunctionName != DILineInfo::BadString)

jhendersonUnsubmitted

Done

Usually we use StringRef.str() to convert to a std::string. Same applies below in the error message line.

jhenderson: Usually we use `StringRef.str()` to convert to a `std::string`. Same applies below in the error…

J.attribute("FunctionName", Info.FunctionName);

jhendersonUnsubmitted

Not Done

I suspect from a user's perspective they won't expect to see an Error attribute if there's no error.

jhenderson: I suspect from a user's perspective they won't expect to see an `Error` attribute if there's no…

aorlovAuthorUnsubmitted

Done

Error: {Code: 0} – is a standard way to say success. And it is a bad idea to omit Error if it should be checked first.

aorlov: Error: {Code: 0} – is a standard way to say `success`. And it is a bad idea to omit Error if it…

jhendersonUnsubmitted

Not Done

Right, but basically the behaviour of checking whether "Error" exists is no more complicated in most languages than checking whether it has value zero.

I would kind of think there would be two possible JSON objects produced per query (possibly three if we handle invalid commands differently to other errors for JSON output):

{"Error":"some message/code/whatever"}
{"Source":"some path",...}

Where the response had an error, it is unlikely the other parameters can be relied upon in any meaningful way, so it's probably better to omit them than potentially cause confusion.
The pseudo-python logic for this might look something like:

response = json.load(output)
if 'Error' in response:
  handleError(response)
else:
  handleNormalResponse(response)

jhenderson: Right, but basically the behaviour of checking whether "Error" exists is no more complicated in…

if (Info.StartFileName != DILineInfo::BadString)

jhendersonUnsubmitted

Not Done

What is ErrorCode supposed to represent? How will a user benefit from it beyond the information provided in the message?

jhenderson: What is `ErrorCode` supposed to represent? How will a user benefit from it beyond the…

aorlovAuthorUnsubmitted

Done

The ErrorCode is more important than a message when automating the error handling. The error message may be localized, depend on OS, etc.

aorlov: The `ErrorCode` is more important than a message when automating the error handling. The error…

jhendersonUnsubmitted

Not Done

The error codes contained in llvm::Error and llvm::Expected are quite often somewhat arbitrary, and inconsistent. It won't be possible to safely rely on these codes for any useful automated processing. Furthermore, some Errors could contain inconvertibleErrorCode which will cause a problem if these end up back here.

What's the use-case for handling different error kinds differently? I'm not saying there isn't a motivation for that, I'm just trying to understand how you plan to use it. If you have no such plan, it doesn't make sense to add additional logic to distinguish errors by code as well as message.

jhenderson: The error codes contained in `llvm::Error` and `llvm::Expected` are quite often somewhat…

J.attribute("StartFileName", Info.StartFileName);

if (Info.StartLine)

J.attribute("StartLine", Info.StartLine);

if (Info.FileName != DILineInfo::BadString)

J.attribute("FileName", Info.FileName);

// Print only valid line and column to reduce noise in the output.

jhendersonUnsubmitted

Done

Same comment as above. How about InliningInfo?

jhenderson: Same comment as above. How about `InliningInfo`?

if (Info.Line)

J.attribute("Line", Info.Line);

if (Info.Column)

J.attribute("Column", Info.Column);

if (Info.Discriminator)

J.attribute("Discriminator", Info.Discriminator);

J.objectEnd();

}

jhendersonUnsubmitted

Done

Don't abbeviate these names unnecessarily like this. Array and FrameCount would both be acceptable, for example. In general, avoid single letter variable names except for loop counters. Same goes throughout this patch. See https://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly for details:

We cannot stress enough how important it is to use descriptive names. Pick names that match the semantics and role of the underlying entities, within reason. Avoid abbreviations unless they are well known.

I think you can inline N into the initialising part of the for loop? I don't see it being used outside the loop.

jhenderson: Don't abbeviate these names unnecessarily like this. `Array` and `FrameCount` would both be…

jhendersonUnsubmitted

Done

LI is a similar abbreviation which doesn't clearly indicate its name. What's wrong with LineInfo?

jhenderson: `LI` is a similar abbreviation which doesn't clearly indicate its name. What's wrong with…

void JSONPrinter::print(const Request &Request, const DILineInfo &Info) {

{

jhendersonUnsubmitted

Not Done

Object maybe?

jhenderson: `Object` maybe?

aorlovAuthorUnsubmitted

Done

It is not necessary anymore because Source has been removed.

aorlov: It is not necessary anymore because `Source` has been removed.

json::OStream J(OutputStream);

jhendersonUnsubmitted

Not Done

This is intended to be machine readable. "Noise" in the output isn't an issue. In fact, as previously mentioned, not having these and other attributes is actively harmful to the user experience, as it makes it harder to write a parser that consumes this JSON.

In the case where you have a BadString output, I'd just print an empty string. Example:

{ "Source" : "", "FunctionName" : "", ... , "Line" : 0, "Column" : 0, "Discriminator" : 0 }

jhenderson: This is intended to be machine readable. "Noise" in the output isn't an issue. In fact, as…

aorlovAuthorUnsubmitted

Done

Ok, but I still omit empty Error Message and FrameOffset.

aorlov: Ok, but I still omit empty Error Message and FrameOffset.

jhendersonUnsubmitted

Not Done

That seems reasonable, thanks for the explanation.

jhenderson: That seems reasonable, thanks for the explanation.

J.objectBegin();

if (!Request.ModuleName.empty())

J.attribute("ModuleName", Request.ModuleName);

J.attribute("Address", toHex(Request.Address));

J.attributeObject("Error", [&] { J.attribute("Code", 0); });

J.attributeBegin("DILineInfo");

toJSON(J, Info);

J.attributeEnd();

jhendersonUnsubmitted

Done

J -> Json or Object

jhenderson: `J` -> `Json` or `Object`

J.objectEnd();

jhendersonUnsubmitted

Not Done

I think adding source context can be a later patch. Let's try to avoid adding more than we have to in this one patch.

jhenderson: I think adding source context can be a later patch. Let's try to avoid adding more than we have…

}

OutputStream << '\n';

}

void JSONPrinter::print(const Request &Request, const DIInliningInfo &Info) {

{

json::OStream J(OutputStream);

jhendersonUnsubmitted

Not Done

I didn't think of this earlier, but it makes a lot of sense to have the pretty printing form be more human readable, with indentation and new lines as appropriate. Thanks!

jhenderson: I didn't think of this earlier, but it makes a lot of sense to have the pretty printing form be…

J.objectBegin();

if (!Request.ModuleName.empty())

J.attribute("ModuleName", Request.ModuleName);

J.attribute("Address", toHex(Request.Address));

J.attributeObject("Error", [&] { J.attribute("Code", 0); });

jhendersonUnsubmitted

Done

Object or Json.

jhenderson: `Object` or `Json`.

J.attributeBegin("DIInliningInfo");

jhendersonUnsubmitted

Done

What's the point of the new line? Same goes elsewhere where you are adding new lines. If the output is intended to be machine readable, there is no need for the new lines.

jhenderson: What's the point of the new line? Same goes elsewhere where you are adding new lines. If the…

J.objectBegin();

grimarUnsubmitted

Done

Can DIPrinter::operator<<(const ErrorInfoBase &EI) be called for non-JSON outputs?
If no, then having assert is not what you want here. you should use llvm-unreachable for the code that can't be reached.

At the same time, do you need this new method? I don't think that it is expected to be updated in future, probably.
And it feels that it is a caller job probably to handle errors properly. So, can it's logic be inlined to printResOrErr?

grimar: Can `DIPrinter::operator<<(const ErrorInfoBase &EI)` be called for non-JSON outputs? If no…

grimarUnsubmitted

Done

At the same time, do you need this new method?
...
And it feels that it is a caller job probably to handle errors properly.

I've debugged it and now I think that creating the JSON with the error in DIPrinter is fine,
but I'd not implement it as operator<<, because it is not a regular output operation,
and will be only useful for JSON it seems.

So I'd suggest something like:

void DIPrinter::printErrorJSON(const Twine& Msg, std::error_code EC) {
  json::OStream J(OS);
  J.objectBegin();
  J.attributeObject("Error", [&] {
    J.attribute("Code", int64_t(EC.value()));
    J.attribute("Message", Msg.str());
  });
  J.objectEnd();
  OS << '\n';
}

grimar: > At the same time, do you need this new method? > ... > And it feels that it is a caller job…

J.attributeBegin("Frames");

J.arrayBegin();

uint32_t FramesNum = Info.getNumberOfFrames();

for (uint32_t I = 0; I < FramesNum; ++I) {

OutputStream << '\n';

toJSON(J, Info.getFrame(I));

}

J.arrayEnd();

J.attributeEnd(); // Frames

jhendersonUnsubmitted

Not Done

Did you consider having a single json::OStream as a member of the JSONPrinter class, so that it only needs constructing (with the Pretty check) in one place?

jhenderson: Did you consider having a single `json::OStream` as a member of the JSONPrinter class, so that…

aorlovAuthorUnsubmitted

Done

Note there is no way to reset json::OStream context, so we cannot reuse the same instance to print 2 or more objects while processing stdin.
I have moved json::OStream to printJSON(json::Value).

aorlov: Note there is no way to reset `json::OStream` context, so we cannot reuse the same instance to…

jhendersonUnsubmitted

Done

Local

jhenderson: `Local`

J.objectEnd();

jhendersonUnsubmitted

Done

Maybe use FrameObject here to disambiguate from the toJson retiurn below.

jhenderson: Maybe use `FrameObject` here to disambiguate from the `toJson` retiurn below.

J.attributeEnd(); // DIInliningInfo

J.objectEnd();

}

OutputStream << '\n';

}

void JSONPrinter::print(const Request &Request, const DIGlobal &Global) {

{

json::OStream J(OutputStream);

J.objectBegin();

if (!Request.ModuleName.empty())

jhendersonUnsubmitted

Done

Json or RequestJson or similar.

jhenderson: `Json` or `RequestJson` or similar.

J.attribute("ModuleName", Request.ModuleName);

J.attribute("Address", toHex(Request.Address));

J.attributeObject("Error", [&] { J.attribute("Code", 0); });

J.attributeBegin("DIGlobal");

J.objectBegin();

if (Global.Name != DILineInfo::BadString)

J.attribute("Name", Global.Name);

J.attribute("Start", toHex(Global.Start));

J.attribute("Size", toHex(Global.Size));

J.objectEnd();

J.attributeEnd();

J.objectEnd();

}

OutputStream << '\n';

}

void JSONPrinter::print(const Request &Request,

const std::vector<DILocal> &Locals) {

{

jhendersonUnsubmitted

Done

Json or Object

jhenderson: `Json` or `Object`

json::OStream J(OutputStream);

J.objectBegin();

if (!Request.ModuleName.empty())

J.attribute("ModuleName", Request.ModuleName);

J.attribute("Address", toHex(Request.Address));

J.attributeObject("Error", [&] { J.attribute("Code", 0); });

J.attributeBegin("vector_DILocal");

J.arrayBegin();

for (const DILocal &L : Locals) {

OutputStream << '\n';

jhendersonUnsubmitted

Not Done

Same as above - just print everything.

jhenderson: Same as above - just print everything.

J.objectBegin();

if (!L.FunctionName.empty())

J.attribute("FunctionName", L.FunctionName);

if (!L.Name.empty())

J.attribute("Name", L.Name);

if (!L.DeclFile.empty())

J.attribute("DeclFile", L.DeclFile);

J.attribute("DeclLine", int64_t(L.DeclLine));

if (L.FrameOffset)

J.attribute("FrameOffset", *L.FrameOffset);

if (L.Size)

J.attribute("Size", toHex(*L.Size));

if (L.TagOffset)

J.attribute("TagOffset", toHex(*L.TagOffset));

J.objectEnd();

}

J.arrayEnd();

J.attributeEnd();

J.objectEnd();

}

OutputStream << '\n';

}

bool JSONPrinter::printError(const Request &Request,

const ErrorInfoBase &ErrorInfo,

const StringRef ErrorBanner) {

{

json::OStream J(OutputStream);

J.objectBegin();

if (!Request.ModuleName.empty())

J.attribute("ModuleName", Request.ModuleName);

J.attribute("Address", toHex(Request.Address));

J.attributeObject("Error", [&] {

J.attribute("Code", ErrorInfo.convertToErrorCode().value());

std::string Banner;

if (ErrorBanner.empty())

Banner = "unable to parse arguments: ";

J.attribute("Message", Banner + ErrorInfo.message());

});

J.objectEnd();

}

OutputStream << '\n';

return false;

}

} // end namespace symbolize } // end namespace symbolize

} // end namespace llvm } // end namespace llvm

llvm/test/tools/llvm-symbolizer/output-style-json-code.test

This file was added.

## This test checks JSON output for CODE.

## Handle symbolize library error - file does not exist, no-inlines.

# RUN: llvm-symbolizer --output-style=JSON --no-inlines -e %p/no-file.exe 0 | \

jhendersonUnsubmitted

Done

## This test checks JSON output for CODE.

- ## If the addresses are specified in the command line, the output JSON will

+ ## If the addresses are specified on the command-line, the output JSON will

## contain an array of the results for all of the given addresses.

jhenderson:

# RUN: FileCheck %s -DMSG=%errc_ENOENT --check-prefix=NO-FILE --strict-whitespace --match-full-lines --implicit-check-not={{.}}

jhendersonUnsubmitted

Done

Please reflow this comment to 80-character limit.

jhenderson: Please reflow this comment to 80-character limit.

## Handle symbolize library error - file does not exist, inline.

jhendersonUnsubmitted

Done

## the output JSON will contain an array of the results for all of the given addresses.

- ## Handle symbolize library error - file does not exist.

+ ## Handle symbolizer library error - file does not exist.

# RUN: llvm-symbolizer --output-style=JSON -e %p/no-file.exe 0 | \

jhenderson:

aorlovAuthorUnsubmitted

Done

But note the library name is symbolize, not symbolizer.

aorlov: But note the library name is `symbolize`, not `symbolizer`.

jhendersonUnsubmitted

Done

Fair point. How about simply ## Show how library errors are reported in the output.

jhenderson: Fair point. How about simply `## Show how library errors are reported in the output.`

# RUN: llvm-symbolizer --output-style=JSON -e %p/no-file.exe 0 | \

# RUN: FileCheck %s -DMSG=%errc_ENOENT --check-prefix=NO-FILE --strict-whitespace --match-full-lines --implicit-check-not={{.}}

jhendersonUnsubmitted

Done

My understanding is that the --no-inlines option has no impact on the output here, so I think you can drop one of these cases?

jhenderson: My understanding is that the `--no-inlines` option has no impact on the output here, so I think…

# NO-FILE:{"ModuleName":"{{.*}}/no-file.exe","Address":"0x0","Error":{"Code":2,"Message":"[[MSG]]"}}

jhendersonUnsubmitted

Not Done

Rather than use {{.*}}/no-file.exe here and similar situations for paths below, I'd recommend using FileCheck's -D option to ensure the correct path is printed, i.e:

# RUN:   FileCheck %s ... -DFILE=%p/no-file.exe

# NO-FILE: ... "ModuleName":"[[FILE]]" ...

jhenderson: Rather than use `{{.*}}/no-file.exe` here and similar situations for paths below, I'd recommend…

aorlovAuthorUnsubmitted

Done

Unfortunately -D would not work in this case, as JSON has own rules for escaping special symbols in paths.

aorlov: Unfortunately -D would not work in this case, as JSON has own rules for escaping special…

## Resolve out of range address, no-inlines. Expected empty object, as all the default values are omitted.

jhendersonUnsubmitted

Done

As before, do we need both --no-inlines and --inlines cases?

jhenderson: As before, do we need both `--no-inlines` and `--inlines` cases?

# RUN: llvm-symbolizer --output-style=JSON --no-inlines -e %p/Inputs/addr.exe 1000000000 | \

# RUN: FileCheck %s --check-prefix=NOT-FOUND-1 --strict-whitespace --match-full-lines --implicit-check-not={{.}}

jhendersonUnsubmitted

Not Done

Maybe rather than NOT-FOUND-1 and NOT-FOUND-2, it would be more self-descriptive using NOT-FOUND-NOINLINES and NOT-FOUND-INLINES or something like that?

jhenderson: Maybe rather than `NOT-FOUND-1` and `NOT-FOUND-2`, it would be more self-descriptive using `NOT…

aorlovAuthorUnsubmitted

Done

It is not applicable anymore.

aorlov: It is not applicable anymore.

# NOT-FOUND-1:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x3b9aca00","Error":{"Code":0},"DILineInfo":{}}

jhendersonUnsubmitted

Not Done

Just thinking about usability - what is the canonical way for users to know that their address hasn't been found in JSON output? It might be worth documenting this somewhere.

jhenderson: Just thinking about usability - what is the canonical way for users to know that their address…

aorlovAuthorUnsubmitted

Done

Yes. This is the Symbolize library design issue. As far as I can tell there is no reliable way to get a distinct result for the symbol not found case.
For now I just keep it transparent by serializing whatever the library returns, as addressing the library issue is out of the scope of this patch.

aorlov: Yes. This is the Symbolize library design issue. As far as I can tell there is no reliable way…

## Resolve out of range address, inlines. Expected a Frames list with one empty object, as all the default values are omitted.

# RUN: llvm-symbolizer --output-style=JSON -e %p/Inputs/addr.exe 1000000000 | \

# RUN: FileCheck %s --check-prefix=NOT-FOUND-2 --strict-whitespace --match-full-lines --implicit-check-not={{.}}

# NOT-FOUND-2:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x3b9aca00","Error":{"Code":0},"DIInliningInfo":{"Frames":[

# NOT-FOUND-2-NEXT:{}]}}

# RUN: llvm-symbolizer --output-style=JSON --no-inlines -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

jhendersonUnsubmitted

Done

Consider a comment for this case, introducing the following set of test cases. In particular, it's important to note that this is using a stdin argument. You could also move all the references to "no-inlines" to one place. I.e. something like "this test case is testing stdin input, with the --no-inlines option" would do the trick.

Same sort of thing goes below.

jhenderson: Consider a comment for this case, introducing the following set of test cases. In particular…

# RUN: FileCheck %s --check-prefix=NO-INLINES --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one, no-inlines

jhendersonUnsubmitted

Done

Nit: missing full stop.

jhenderson: Nit: missing full stop.

# NO-INLINES:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text"}}

grimarUnsubmitted

Not Done

MIssing full stop.

grimar: MIssing full stop.

jhendersonUnsubmitted

Done

I find the "Address":"0x0" bit here somewhat confusing, given this is an error case. I'd consider omitting it entirely. Alternatively, simply print what the input address was specified as (in this case "Address":"some text").

jhenderson: I find the `"Address":"0x0"` bit here somewhat confusing, given this is an error case. I'd…

## Resolve valid address, no-inlines.

# NO-INLINES-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x40054d","Error":{"Code":0},"DILineInfo":{"FunctionName":"main","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2,"FileName":"/tmp{{/|\\\\}}x.c","Line":3,"Column":3}}

jhendersonUnsubmitted

Not Done

It would be nice to have one or more test cases with a non-zero discriminator value. Also, where the "Source" parameter is non-empty.

You can simplify {{/|\\\\}} to this: {{[\\/]}}. I think it's slightly more readable due to avoiding the quadruple backslash. Same goes elsewhere below.

jhenderson: It would be nice to have one or more test cases with a non-zero discriminator value. Also…

aorlovAuthorUnsubmitted

Done

I have added the test for a non-zero discriminator. Note the address is hardcoded. It is more correct to use /Inputs/discrim.inp via stdin, but I just copied the address from discriminator.test.
It seems currently we have no binaries in llvm/test/tools/llvm-symbolizer/Inputs with a valid Source info.

You can simplify

No, it does not work because JSON has own rules for escaping special symbols in paths.

aorlov: I have added the test for a non-zero discriminator. Note the address is hardcoded. It is more…

jhendersonUnsubmitted

Not Done

It seems currently we have no binaries in llvm/test/tools/llvm-symbolizer/Inputs with a valid Source info.

Consider generating one at test time using assembly or yaml2obj.

You can simplify

No, it does not work because JSON has own rules for escaping special symbols in paths.

Okay - I missed the double backslash.

jhenderson: > It seems currently we have no binaries in llvm/test/tools/llvm-symbolizer/Inputs with a valid…

aorlovAuthorUnsubmitted

Done

I have added output-style-json-code-source.c.

aorlov: I have added output-style-json-code-source.c.

## Invalid argument after a valid one, no-inlines.

# NO-INLINES-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text2"}}

# RUN: llvm-symbolizer --output-style=JSON -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

# RUN: FileCheck %s --check-prefix=INLINE --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one, inlines

# INLINE:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text"}}

grimarUnsubmitted

Not Done

Missing full stop.

grimar: Missing full stop.

## Resolve valid address, inlines.

# INLINE-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x40054d","Error":{"Code":0},"DIInliningInfo":{"Frames":[

# INLINE-NEXT:{"FunctionName":"inctwo","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2,"FileName":"/tmp{{/|\\\\}}x.c","Line":3,"Column":3}

# INLINE-NEXT:,{"FunctionName":"inc","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":6,"FileName":"/tmp{{/|\\\\}}x.c","Line":7}

jhendersonUnsubmitted

Done

If I follow this correctly, this test case is showing that llvm-addr2line with -f results in the function name being printed? Assuming that's correct, I think you need to show that llvm-addr2line without -f results in the function name not being included in the JSON output too.

jhenderson: If I follow this correctly, this test case is showing that llvm-addr2line with -f results in…

# INLINE-NEXT:,{"FunctionName":"main","StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":12,"FileName":"/tmp{{/|\\\\}}x.c","Line":14}]}}

## Invalid argument after a valid one, inlines.

jhendersonUnsubmitted

Not Done

I guess the obvious question is: why do we have a difference in behaviour surrounding the FunctionName attribute? The motivation for different symbolizer/addr2line output is because llvm-addr2line needs to be compatible with GNU addr2line, but that principle doesn't apply for JSON output (which AFAIK is not a GNU addr2line supported feature).

jhenderson: I guess the obvious question is: why do we have a difference in behaviour surrounding the…

aorlovAuthorUnsubmitted

Done

This behavior is a part of llvm-symbolizer and does not depend on the output styles and does not belong to the printer (look at decideHowToPrintFunctions()).
Not sure if I understand what you are saying. Changing the behavior is out of the scope of this patch.
I agree that it does not make much sense in testing that, but one of the reviewers requested these tests.

aorlov: This behavior is a part of llvm-symbolizer and does not depend on the output styles and does…

jhendersonUnsubmitted

Done

Ah, I think we've hit on a fundamental question regarding JSON output - what should the options that change what information is printed do to JSON output? I'm thinking here specifically both --functions and --addresses, but it may apply to others too.

jhenderson: Ah, I think we've hit on a fundamental question regarding JSON output - what should the options…

# INLINE-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text2"}}

## Also check the last 3 test cases with llvm-adr2line. The expected result is the same but missing the FunctionName.

# RUN: llvm-addr2line --output-style=JSON -i -e %p/Inputs/addr.exe < %p/Inputs/addr.inp | \

jhendersonUnsubmitted

Done

Nit: missing full stop.

jhenderson: Nit: missing full stop.

# RUN: FileCheck %s --check-prefix=INLINE-A2L --strict-whitespace --match-full-lines --implicit-check-not={{.}}

## Invalid first argument before any valid one, inlines

# INLINE-A2L:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text"}}

## Resolve valid address, inlines.

# INLINE-A2L-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x40054d","Error":{"Code":0},"DIInliningInfo":{"Frames":[

# INLINE-A2L-NEXT:{"StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":2,"FileName":"/tmp{{/|\\\\}}x.c","Line":3,"Column":3}

# INLINE-A2L-NEXT:,{"StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":6,"FileName":"/tmp{{/|\\\\}}x.c","Line":7}

# INLINE-A2L-NEXT:,{"StartFileName":"/tmp{{/|\\\\}}x.c","StartLine":12,"FileName":"/tmp{{/|\\\\}}x.c","Line":14}]}}

## Invalid argument after a valid one, inlines.

# INLINE-A2L-NEXT:{"ModuleName":"{{.*}}/Inputs/addr.exe","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: some text2"}}

llvm/test/tools/llvm-symbolizer/output-style-json-data.test

This file was added.

				## This test checks JSON output for DATA.

				# REQUIRES: x86-registered-target

				## Handle symbolize library error - file does not exist.
				# RUN: llvm-symbolizer "DATA %t-no-file.o 0" --output-style=JSON \| \
				jhendersonUnsubmitted Done Reply Inline Actions Same comment as the code test. jhenderson: Same comment as the code test.
				# RUN: FileCheck %s -DMSG=%errc_ENOENT --check-prefix=NO-FILE --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# NO-FILE:{"ModuleName":"{{.*}}no-file.o","Address":"0x0","Error":{"Code":2,"Message":"[[MSG]]"}}

				## Handle invalid argument.
				# RUN: llvm-symbolizer "DATA tmp.o Z" --output-style=JSON \| \
				grimarUnsubmitted Done Reply Inline Actions Note that you already have exactly the same comment at line 5. Currently it is not very clear from comments what tests are doing. I think it is better to not to refer to `DIGlobal` and other class names in comments. They are a part of internal implementation and doesn't explain well what this test does for a reader who are not familar with the code. You can just explain what is exactly validated. E.g. this one comment could be like: "Test that we print a valid error message to JSON when the input file is not present" This applies to all test cases as far I can see. grimar: Note that you already have exactly the same comment at line 5. Currently it is not very clear…
				# RUN: FileCheck %s --check-prefix=INVARG --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# INVARG:{"ModuleName":"tmp.o","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: DATA tmp.o Z"}}

				# RUN: llvm-mc -filetype=obj -triple=x86_64-pc-linux %s -o %t.o

				grimarUnsubmitted Done Reply Inline Actions This error message is probably should be tested differently: You should use `FileCheck -DMSG=%errc_ENOENT` I think (see, e.g D95246). grimar: This error message is probably should be tested differently: You should use `FileCheck…
				## Resolve out of range address. Only Start and Size is expected.
				# RUN: llvm-symbolizer "DATA %t.o 1000000000" --output-style=JSON \| \
				# RUN: FileCheck %s --check-prefix=NOT-FOUND --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# NOT-FOUND:{"ModuleName":"{{.*}}.o","Address":"0x3b9aca00","Error":{"Code":0},"DIGlobal":{"Start":"0x0","Size":"0x0"}}

				## Resolve valid address.
				# RUN: llvm-symbolizer "DATA %t.o 0" --output-style=JSON \| \
				# RUN: FileCheck %s --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				jhendersonUnsubmitted Done Reply Inline Actions Same comment as above. jhenderson: Same comment as above.
				# CHECK:{"ModuleName":"{{.*}}.o","Address":"0x0","Error":{"Code":0},"DIGlobal":{"Name":"foo","Start":"0x0","Size":"0x4"}}

				## Test multiple addresses in the command line.
				# RUN: llvm-symbolizer -e=%t.o "DATA 0" "DATA 0" --output-style=JSON \| \
				# RUN: FileCheck %s --check-prefix=MULTI --strict-whitespace --match-full-lines --implicit-check-not={{.}}
				jhendersonUnsubmitted Done Reply Inline Actions I'd make one of these addresses non-zero, as that will show that the "Address" and "Start" parameters are not just always 0. jhenderson: I'd make one of these addresses non-zero, as that will show that the "Address" and "Start"…

				# MULTI:[{"ModuleName":"{{.*}}.o","Address":"0x0","Error":{"Code":0},"DIGlobal":{"Name":"foo","Start":"0x0","Size":"0x4"}}
				# MULTI-NEXT:,{"ModuleName":"{{.*}}.o","Address":"0x0","Error":{"Code":0},"DIGlobal":{"Name":"foo","Start":"0x0","Size":"0x4"}}
				# MULTI-NEXT:]

				.data
				.globl foo
				.type foo, @object
				.size foo, 4
				foo = . + 0x1100000000000000
				.4byte 1

llvm/test/tools/llvm-symbolizer/output-style-json-frame.test

This file was added.

				## This test checks JSON output for FRAME.

				# REQUIRES: x86-registered-target

				## Handle symbolize library error - file does not exist.
				# RUN: llvm-symbolizer "FRAME %t-no-file.o 0" --output-style=JSON \| \
				# RUN: FileCheck %s -DMSG=%errc_ENOENT --check-prefix=NO-FILE --strict-whitespace --match-full-lines --implicit-check-not={{.}}
				MaskRayUnsubmitted Not Done Reply Inline Actions Nit: place `\|` in the end instead of the beginning of the continuation line. This is a more common style in binary utilities. The idea is that without `\` it is still clear the line needs continaution. MaskRay: Nit: place ` \| ` in the end instead of the beginning of the continuation line. This is a more…

				# NO-FILE:{"ModuleName":"{{.*}}no-file.o","Address":"0x0","Error":{"Code":2,"Message":"[[MSG]]"}}

				## Handle invalid argument.
				# RUN: llvm-symbolizer "FRAME tmp.o Z" --output-style=JSON \| \
				# RUN: FileCheck %s --check-prefix=INVARG --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# INVARG:{"ModuleName":"tmp.o","Address":"0x0","Error":{"Code":22,"Message":"unable to parse arguments: FRAME tmp.o Z"}}

				# RUN: llvm-mc -filetype=obj -triple=i386-linux-gnu -o %t.o %s

				jhendersonUnsubmitted Not Done Reply Inline Actions Same as other comments elsewhere - a "null" value doesn't make much sense to me to be printed. I'd omit it entirely. If it can ever be set, you ened specific testing for that too. jhenderson: Same as other comments elsewhere - a "null" value doesn't make much sense to me to be printed.
				## Resolve out of range address. Expected an empty array.
				# RUN: llvm-symbolizer "FRAME %t.o 1000000000" --output-style=JSON \| \
				grimarUnsubmitted Done Reply Inline Actions `empy` -> `empty`. grimar: `empy` -> `empty`.
				# RUN: FileCheck %s --check-prefix=NOT-FOUND --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# NOT-FOUND:{"ModuleName":"{{.*}}.o","Address":"0x3b9aca00","Error":{"Code":0},"vector_DILocal":[]}

				## Resolve valid address.
				# RUN: llvm-symbolizer "FRAME %t.o 0" --output-style=JSON \| \
				# RUN: FileCheck %s --strict-whitespace --match-full-lines --implicit-check-not={{.}}

				# CHECK:{"ModuleName":"{{.*}}.o","Address":"0x0","Error":{"Code":0},"vector_DILocal":[
				# CHECK-NEXT:{"FunctionName":"f","Name":"a","DeclFile":"/tmp/test{{/\|\\\\}}frame.cpp","DeclLine":2,"FrameOffset":-1,"Size":"0x1"}
				jhendersonUnsubmitted Done Reply Inline Actions I think you need a test case with a non-empty "TagOffset". jhenderson: I think you need a test case with a non-empty "TagOffset".
				# CHECK-NEXT:,{"FunctionName":"f","Name":"b","DeclFile":"/tmp/test{{/\|\\\\}}frame.cpp","DeclLine":3,"FrameOffset":-8,"Size":"0x4"}]}

				## Generated from:
				##
				## void f() {
				## char a;
				## char *b;
				## }
				##
				jhendersonUnsubmitted Done Reply Inline Actions Please include the version of clang explicitly here, not just in the ident string below. The reason is that someone might come along and trim off the superfluous parts of the below assembly to minimise the test case, but it would still be helpful to know how to generate the unmodified part. I would also use a final release version of clang, so that a future user can easily get the exact version of clang well into the future. jhenderson: Please include the version of clang explicitly here, not just in the ident string below. The…
				dblaikieUnsubmitted Not Done Reply Inline Actions This seems like a fairly non-trivial build - might be worth an explanation about what's interesting about this case that's not exercised by simpler cases (such as ones without sanitizers, maybe also x86 plain (not a necessity, but curious if it's something ARM specific here), etc) dblaikie: This seems like a fairly non-trivial build - might be worth an explanation about what's…
				## clang++ --target=i386-linux-gnu frame.cpp -g -std=c++11 -S -o frame.s

				.text
				grimarUnsubmitted Done Reply Inline Actions Why do you need to have so many locals? The only visible difference in the output is the value of "FrameOffset". Can we have only two/three? This would reduce the size of the code below significantly I guess. grimar: Why do you need to have so many locals? The only visible difference in the output is the value…
				grimarUnsubmitted Not Done Reply Inline Actions Adding an exact clang version used might be helpfull. grimar: Adding an exact clang version used might be helpfull.
				aorlovAuthorUnsubmitted Done Reply Inline Actions You can see the exact clang version 13.0.0 at the end of the generated code. aorlov: You can see the exact clang version 13.0.0 at the end of the generated code.
				grimarUnsubmitted Not Done Reply Inline Actions You mean the `info_string0`? .Linfo_string0: .asciz "clang version 13.0.0" # string offset=0 It is actually a string that is unimprortant for the test. I.e. it is a piece of input asm that can be just removed like you did for other unused sections already and nothing should change. It is a bit strange to force user to read an assembly to find out how it was generated instead of reading the comment that has an intention to descibe it. grimar: You mean the `info_string0`? ``` .Linfo_string0: .asciz "clang version 13.0.0" # string…
				jhendersonUnsubmitted Not Done Reply Inline Actions It looks like me like this hasn't been addressed? jhenderson: It looks like me like this hasn't been addressed?
				aorlovAuthorUnsubmitted Done Reply Inline Actions Intention was to break a dependency on DWARF generation in clang. But I have changed it build from the C source to keep it simple for reading. aorlov: Intention was to break a dependency on DWARF generation in clang. But I have changed it build…
				.file "frame.cpp"
				.globl _Z1fv # -- Begin function _Z1fv
				.p2align 4, 0x90
				.type _Z1fv,@function
				_Z1fv: # @_Z1fv
				.Lfunc_begin0:
				.file 1 "/tmp/test" "frame.cpp"
				.loc 1 1 0 # frame.cpp:1:0
				.cfi_sections .debug_frame
				.cfi_startproc
				# %bb.0: # %entry
				pushl %ebp
				.cfi_def_cfa_offset 8
				.cfi_offset %ebp, -8
				movl %esp, %ebp
				.cfi_def_cfa_register %ebp
				.Ltmp0:
				.loc 1 4 1 prologue_end # frame.cpp:4:1
				popl %ebp
				.cfi_def_cfa %esp, 4
				retl
				.Ltmp1:
				.Lfunc_end0:
				.size _Z1fv, .Lfunc_end0-_Z1fv
				.cfi_endproc
				# -- End function
				.section .debug_abbrev,"",@progbits
				.byte 1 # Abbreviation Code
				.byte 17 # DW_TAG_compile_unit
				.byte 1 # DW_CHILDREN_yes
				.byte 37 # DW_AT_producer
				.byte 14 # DW_FORM_strp
				.byte 19 # DW_AT_language
				.byte 5 # DW_FORM_data2
				.byte 3 # DW_AT_name
				.byte 14 # DW_FORM_strp
				.byte 16 # DW_AT_stmt_list
				.byte 23 # DW_FORM_sec_offset
				.byte 27 # DW_AT_comp_dir
				.byte 14 # DW_FORM_strp
				.byte 17 # DW_AT_low_pc
				.byte 1 # DW_FORM_addr
				.byte 18 # DW_AT_high_pc
				.byte 6 # DW_FORM_data4
				.byte 0 # EOM(1)
				.byte 0 # EOM(2)
				.byte 2 # Abbreviation Code
				.byte 46 # DW_TAG_subprogram
				.byte 1 # DW_CHILDREN_yes
				.byte 17 # DW_AT_low_pc
				.byte 1 # DW_FORM_addr
				.byte 18 # DW_AT_high_pc
				.byte 6 # DW_FORM_data4
				.byte 64 # DW_AT_frame_base
				.byte 24 # DW_FORM_exprloc
				.byte 110 # DW_AT_linkage_name
				.byte 14 # DW_FORM_strp
				.byte 3 # DW_AT_name
				.byte 14 # DW_FORM_strp
				.byte 58 # DW_AT_decl_file
				.byte 11 # DW_FORM_data1
				.byte 59 # DW_AT_decl_line
				.byte 11 # DW_FORM_data1
				.byte 63 # DW_AT_external
				.byte 25 # DW_FORM_flag_present
				.byte 0 # EOM(1)
				.byte 0 # EOM(2)
				.byte 3 # Abbreviation Code
				.byte 52 # DW_TAG_variable
				.byte 0 # DW_CHILDREN_no
				.byte 2 # DW_AT_location
				.byte 24 # DW_FORM_exprloc
				.byte 3 # DW_AT_name
				.byte 14 # DW_FORM_strp
				.byte 58 # DW_AT_decl_file
				.byte 11 # DW_FORM_data1
				.byte 59 # DW_AT_decl_line
				.byte 11 # DW_FORM_data1
				.byte 73 # DW_AT_type
				.byte 19 # DW_FORM_ref4
				.byte 0 # EOM(1)
				.byte 0 # EOM(2)
				.byte 4 # Abbreviation Code
				.byte 36 # DW_TAG_base_type
				.byte 0 # DW_CHILDREN_no
				.byte 3 # DW_AT_name
				.byte 14 # DW_FORM_strp
				.byte 62 # DW_AT_encoding
				.byte 11 # DW_FORM_data1
				.byte 11 # DW_AT_byte_size
				.byte 11 # DW_FORM_data1
				.byte 0 # EOM(1)
				.byte 0 # EOM(2)
				.byte 5 # Abbreviation Code
				.byte 15 # DW_TAG_pointer_type
				.byte 0 # DW_CHILDREN_no
				.byte 73 # DW_AT_type
				.byte 19 # DW_FORM_ref4
				.byte 0 # EOM(1)
				.byte 0 # EOM(2)
				.byte 0 # EOM(3)
				.section .debug_info,"",@progbits
				.Lcu_begin0:
				.long .Ldebug_info_end0-.Ldebug_info_start0 # Length of Unit
				.Ldebug_info_start0:
				.short 4 # DWARF version number
				.long .debug_abbrev # Offset Into Abbrev. Section
				.byte 4 # Address Size (in bytes)
				.byte 1 # Abbrev [1] 0xb:0x5a DW_TAG_compile_unit
				.long .Linfo_string0 # DW_AT_producer
				.short 26 # DW_AT_language
				.long .Linfo_string1 # DW_AT_name
				.long .Lline_table_start0 # DW_AT_stmt_list
				.long .Linfo_string2 # DW_AT_comp_dir
				.long .Lfunc_begin0 # DW_AT_low_pc
				.long .Lfunc_end0-.Lfunc_begin0 # DW_AT_high_pc
				.byte 2 # Abbrev [2] 0x26:0x32 DW_TAG_subprogram
				.long .Lfunc_begin0 # DW_AT_low_pc
				.long .Lfunc_end0-.Lfunc_begin0 # DW_AT_high_pc
				.byte 1 # DW_AT_frame_base
				.byte 85
				.long .Linfo_string3 # DW_AT_linkage_name
				.long .Linfo_string4 # DW_AT_name
				.byte 1 # DW_AT_decl_file
				.byte 1 # DW_AT_decl_line
				# DW_AT_external
				.byte 3 # Abbrev [3] 0x3b:0xe DW_TAG_variable
				.byte 2 # DW_AT_location
				.byte 145
				.byte 127
				.long .Linfo_string5 # DW_AT_name
				.byte 1 # DW_AT_decl_file
				.byte 2 # DW_AT_decl_line
				.long 88 # DW_AT_type
				.byte 3 # Abbrev [3] 0x49:0xe DW_TAG_variable
				.byte 2 # DW_AT_location
				.byte 145
				.byte 120
				.long .Linfo_string7 # DW_AT_name
				.byte 1 # DW_AT_decl_file
				.byte 3 # DW_AT_decl_line
				.long 95 # DW_AT_type
				.byte 0 # End Of Children Mark
				.byte 4 # Abbrev [4] 0x58:0x7 DW_TAG_base_type
				.long .Linfo_string6 # DW_AT_name
				.byte 6 # DW_AT_encoding
				.byte 1 # DW_AT_byte_size
				.byte 5 # Abbrev [5] 0x5f:0x5 DW_TAG_pointer_type
				.long 88 # DW_AT_type
				.byte 0 # End Of Children Mark
				.Ldebug_info_end0:
				.section .debug_str,"MS",@progbits,1
				.Linfo_string0:
				.asciz "clang version 13.0.0" # string offset=0
				.Linfo_string1:
				.asciz "frame.cpp" # string offset=105
				.Linfo_string2:
				.asciz "/tmp/test" # string offset=115
				.Linfo_string3:
				.asciz "_Z1fv" # string offset=140
				.Linfo_string4:
				.asciz "f" # string offset=146
				.Linfo_string5:
				.asciz "a" # string offset=148
				.Linfo_string6:
				.asciz "char" # string offset=150
				.Linfo_string7:
				.asciz "b" # string offset=155
				.addrsig
				.section .debug_line,"",@progbits
				.Lline_table_start0:
				grimarUnsubmitted Done Reply Inline Actions You don't need .ident "clang version 9.0.0 " .section ".note.GNU-stack","",@progbits for this test, so it can be removed. Please try to avoid having excessive pieces in test cases, when it is easy not to have them. grimar: You don't need ``` .ident "clang version 9.0.0 " .section ".note.GNU-stack","",@progbits…
				grimarUnsubmitted Done Reply Inline Actions You don't need this section. grimar: You don't need this section.

llvm/tools/llvm-symbolizer/Opts.td

	Show All 27 Lines
	defm dsym_hint : Eq<"dsym-hint", "Path to .dSYM bundles to search for debug info for the object files">, MetaVarName<"<dir>">;			defm dsym_hint : Eq<"dsym-hint", "Path to .dSYM bundles to search for debug info for the object files">, MetaVarName<"<dir>">;
	defm fallback_debug_path : Eq<"fallback-debug-path", "Fallback path for debug binaries">, MetaVarName<"<dir>">;			defm fallback_debug_path : Eq<"fallback-debug-path", "Fallback path for debug binaries">, MetaVarName<"<dir>">;
	defm inlines : B<"inlines", "Print all inlined frames for a given address",			defm inlines : B<"inlines", "Print all inlined frames for a given address",
	"Do not print inlined frames">;			"Do not print inlined frames">;
	defm obj			defm obj
	: Eq<"obj", "Path to object file to be symbolized (if not provided, "			: Eq<"obj", "Path to object file to be symbolized (if not provided, "
	"object file should be specified for each input line)">, MetaVarName<"<file>">;			"object file should be specified for each input line)">, MetaVarName<"<file>">;
	defm output_style			defm output_style
	: Eq<"output-style", "Specify print style. Supported styles: LLVM, GNU">,			: Eq<"output-style", "Specify print style. Supported styles: LLVM, GNU, JSON">,
	MetaVarName<"style">,			MetaVarName<"style">,
	Values<"LLVM,GNU">;			Values<"LLVM,GNU,JSON">;
	def pretty_print : F<"pretty-print", "Make the output more human friendly">;			def pretty_print : F<"pretty-print", "Make the output more human friendly">;
	defm print_source_context_lines : Eq<"print-source-context-lines", "Print N lines of source file context">;			defm print_source_context_lines : Eq<"print-source-context-lines", "Print N lines of source file context">;
	def relative_address : F<"relative-address", "Interpret addresses as addresses relative to the image base">;			def relative_address : F<"relative-address", "Interpret addresses as addresses relative to the image base">;
	def relativenames : F<"relativenames", "Strip the compilation directory from paths">;			def relativenames : F<"relativenames", "Strip the compilation directory from paths">;
	defm untag_addresses : B<"untag-addresses", "", "Remove memory tags from addresses before symbolization">;			defm untag_addresses : B<"untag-addresses", "", "Remove memory tags from addresses before symbolization">;
	def use_dia: F<"dia", "Use the DIA library to access symbols (Windows only)">;			def use_dia: F<"dia", "Use the DIA library to access symbols (Windows only)">;
	def verbose : F<"verbose", "Print verbose line info">;			def verbose : F<"verbose", "Print verbose line info">;
	def version : F<"version", "Display the version">;			def version : F<"version", "Display the version">;
	Show All 25 Lines

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	static void print(const StringRef ModuleName, uint64_t Address,

if (PrintEmpty)		if (PrintEmpty)
Printer.print(Request(ModuleName, Address), T());		Printer.print(Request(ModuleName, Address), T());
}		}

enum class Command {		enum class Command {
Code,		Code,
Data,		Data,
Frame,		Frame,
		grimarUnsubmitted Not Done Reply Inline Actions The meaning of `DefIfErr` name is not very clear, also it is only used for non-JSON case, what is a bit consusing I think, because the argument is just ignored for the `JSON` case. It makes the signature to bit a bit dirty. Perhaps, instead of having this function, I'd intoduce a helper like the following in `symbolizeInput`: auto Print = [&](Expected<T> &ResOrErr){ if (ResOrErr) { Printer << ResOrErr; return; } if (OutputStyle == DIPrinter::OutputStyle::JSON) { handleAllErrors(std::move(ResOrErr.takeError()), [&](const ErrorInfoBase &EI) { Printer << EI; }); return; } error(ResOrErr); // Nice and helpful comment about why the Command::Frame is exception.... if (Cmd == Command::Frame) Printer << T(); }; Will it work? grimar:* The meaning of `DefIfErr` name is not very clear, also it is only used for non-JSON case, what…
		aorlovAuthorUnsubmitted Done Reply Inline Actions No, because of template <typename T>. I have renamed DefIfErr to BackwardCompatibleErr. I have no idea why there is no printout of the empty struct in case of Cmd == Command::Frame, probably it is a bug. But it is unrelated to this patch. I just keep the current behavior for other OutputStyle. aorlov: No, because of template <typename T>. I have renamed DefIfErr to BackwardCompatibleErr. I have…
		grimarUnsubmitted Not Done Reply Inline Actions I have no idea why there is no printout of the empty struct in case of Cmd == Command::Frame, probably it is a bug. But it is unrelated to this patch. I just keep the current behavior for other OutputStyle. It is true, but probably we shouldn't introduce a new strangely named `BackwardCompatibleErr` variable in this case. We know that `Command::Frame`, one of commands, has a different specific behavior. I'd write the code to make it more obvious with a `Cmd` argument. Also, perhaps would add few early returns. Something like: template <typename T> static void printResultOrError(Expected<T> &ResOrErr, DIPrinter::OutputStyle OutputStyle, DIPrinter &Printer, Command Cmd) { if (ResOrErr) { Printer << ResOrErr.get(); return; } if (OutputStyle == DIPrinter::OutputStyle::JSON) { Printer.printErrorJSON(toString(ResOrErr.takeError()), std::make_error_code(std::errc::invalid_argument)); return; } if (Cmd != Command::Frame) Printer << T(); } grimar: > I have no idea why there is no printout of the empty struct in case of Cmd == Command::Frame…
		aorlovAuthorUnsubmitted Done Reply Inline Actions Finally I have used std::is_same<T, std::vector<DILocal>>::value for compile time optimization. aorlov: Finally I have used std::is_same<T, std::vector<DILocal>>::value for compile time optimization.
};		};

static bool parseCommand(StringRef BinaryName, bool IsAddr2Line,		static bool parseCommand(StringRef BinaryName, bool IsAddr2Line,
StringRef InputString, Command &Cmd,		StringRef InputString, Command &Cmd,
std::string &ModuleName, uint64_t &ModuleOffset) {		std::string &ModuleName, uint64_t &ModuleOffset) {
const char kDelimiters[] = " \n\r";		const char kDelimiters[] = " \n\r";
ModuleName = "";		ModuleName = "";
if (InputString.consume_front("CODE ")) {		if (InputString.consume_front("CODE ")) {
Show All 40 Lines
static void symbolizeInput(const opt::InputArgList &Args, uint64_t AdjustVMA,		static void symbolizeInput(const opt::InputArgList &Args, uint64_t AdjustVMA,
bool IsAddr2Line, DIPrinter::OutputStyle OutputStyle,		bool IsAddr2Line, DIPrinter::OutputStyle OutputStyle,
StringRef InputString, LLVMSymbolizer &Symbolizer,		StringRef InputString, LLVMSymbolizer &Symbolizer,
DIPrinter &Printer) {		DIPrinter &Printer) {
Command Cmd;		Command Cmd;
std::string ModuleName;		std::string ModuleName;
uint64_t Offset = 0;		uint64_t Offset = 0;
if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line,		if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line,
StringRef(InputString), Cmd, ModuleName, Offset)) {		StringRef(InputString), Cmd, ModuleName, Offset)) {
Printer.printError(		Printer.printError(
Request(ModuleName, Offset),		Request(ModuleName, Offset),
StringError(InputString,		StringError(InputString,
		grimarUnsubmitted Not Done Reply Inline Actions Can you use `createStringError` from `Error.h`? grimar: Can you use `createStringError` from `Error.h`?
		aorlovAuthorUnsubmitted Done Reply Inline Actions No, because it is hard to use Error to get the error code and message just for printout. I'm using ErrorInfoBase as a holder instead. aorlov: No, because it is hard to use Error to get the error code and message just for printout. I'm…
		grimarUnsubmitted Done Reply Inline Actions If we introduce the `printErrorJSON` that I suggested in a different comment, then here it will be possible to write: Printer.printErrorJSON("unable to parse arguments: " + InputString, std::make_error_code(std::errc::invalid_argument)); What is better, because allows to provide more context and more customizable error messages for JSON output. What do you think? grimar: If we introduce the `printErrorJSON` that I suggested in a different comment, then here it will…
		jhendersonUnsubmitted Not Done Reply Inline Actions No, because it is hard to use Error to get the error code and message just for printout. It really isn't that hard to do this. See either `toString` in Error.h (converts an `Error` to a `std::string`) or `handleAllErrors` (which allows you to handle `Error` in various ways, such as by getting its message. But I see you already know how to do this in `printResOrErr`. Why not just pass the Error directly to the function?? jhenderson: > No, because it is hard to use Error to get the error code and message just for printout. It…
		aorlovAuthorUnsubmitted Done Reply Inline Actions We need the error code for the clear logic while error handling. The error message may be localized, etc. DIPrinter must only print the data but do not handle errors. It is impossible to get the error code from the Error without handling. Initially I have used ErrorInfoBase instead to pass the error information to DIPrinter. Now DICommon contains ErrorCode (it can be used as "success" flag too). The error message is stored in DICommon<std::string>.Result only in case of error. aorlov: We need the error code for the clear logic while error handling. The error message may be…
std::make_error_code(std::errc::invalid_argument)));		std::make_error_code(std::errc::invalid_argument)));
		jhendersonUnsubmitted Done Reply Inline Actions I missed this in the other review. We don't need to make an `ErrorInfo` here at all. Just pass the `InputString` in instead. That will simplify both the call-site and the `printInvalidCommand` function. jhenderson: I missed this in the other review. We don't need to make an `ErrorInfo` here at all. Just pass…
return;		return;
}		}

uint64_t AdjustedOffset = Offset - AdjustVMA;		uint64_t AdjustedOffset = Offset - AdjustVMA;
if (Cmd == Command::Data) {		if (Cmd == Command::Data) {
Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData(		Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData(
		grimarUnsubmitted Done Reply Inline Actions Consider replacing the `auto` to actual types in this method, because you are touching the related neighbouring lines, which are hard to read, because it is not clear what type `ResOrErr` has. grimar: Consider replacing the `auto` to actual types in this method, because you are touching the…
ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection});		ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection});
print(ModuleName, Offset, ResOrErr, Printer);		print(ModuleName, Offset, ResOrErr, Printer);
} else if (Cmd == Command::Frame) {		} else if (Cmd == Command::Frame) {
Expected<std::vector<DILocal>> ResOrErr = Symbolizer.symbolizeFrame(		Expected<std::vector<DILocal>> ResOrErr = Symbolizer.symbolizeFrame(
ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection});		ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection});
print(ModuleName, Offset, ResOrErr, Printer);		print(ModuleName, Offset, ResOrErr, Printer);
} else if (Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line)) {		} else if (Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line)) {
Expected<DIInliningInfo> ResOrErr = Symbolizer.symbolizeInlinedCode(		Expected<DIInliningInfo> ResOrErr = Symbolizer.symbolizeInlinedCode(
▲ Show 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	if (sys::path::extension(Hint) == ".dSYM") {
errs() << "Warning: invalid dSYM hint: \"" << Hint		errs() << "Warning: invalid dSYM hint: \"" << Hint
<< "\" (must have the '.dSYM' extension).\n";		<< "\" (must have the '.dSYM' extension).\n";
}		}
}		}

auto OutputStyle =		auto OutputStyle =
IsAddr2Line ? DIPrinter::OutputStyle::GNU : DIPrinter::OutputStyle::LLVM;		IsAddr2Line ? DIPrinter::OutputStyle::GNU : DIPrinter::OutputStyle::LLVM;
if (const opt::Arg *A = Args.getLastArg(OPT_output_style_EQ)) {		if (const opt::Arg *A = Args.getLastArg(OPT_output_style_EQ)) {
OutputStyle = strcmp(A->getValue(), "GNU") == 0		if (strcmp(A->getValue(), "GNU") == 0)
? DIPrinter::OutputStyle::GNU		OutputStyle = DIPrinter::OutputStyle::GNU;
: DIPrinter::OutputStyle::LLVM;		else if (strcmp(A->getValue(), "JSON") == 0)
		OutputStyle = DIPrinter::OutputStyle::JSON;
		else
		OutputStyle = DIPrinter::OutputStyle::LLVM;
}		}
		grimarUnsubmitted Done Reply Inline Actions You don't need to use curly bracers for single lines (see https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements). grimar: You don't need to use curly bracers for single lines (see https://llvm.org/docs/CodingStandards.

LLVMSymbolizer Symbolizer(Opts);		LLVMSymbolizer Symbolizer(Opts);
std::unique_ptr<DIPrinter> Printer;		std::unique_ptr<DIPrinter> Printer;
if (OutputStyle == DIPrinter::OutputStyle::GNU)		if (OutputStyle == DIPrinter::OutputStyle::GNU)
Printer.reset(new GNUPrinter(outs(), errs(), Args.hasArg(OPT_addresses),		Printer.reset(new GNUPrinter(outs(), errs(), Args.hasArg(OPT_addresses),
Opts.PrintFunctions != FunctionNameKind::None,		Opts.PrintFunctions != FunctionNameKind::None,
Args.hasArg(OPT_pretty_print),		Args.hasArg(OPT_pretty_print),
SourceContextLines, Args.hasArg(OPT_verbose)));		SourceContextLines, Args.hasArg(OPT_verbose)));
		else if (OutputStyle == DIPrinter::OutputStyle::JSON)
		Printer.reset(new JSONPrinter(outs(), errs()));
else		else
Printer.reset(new LLVMPrinter(outs(), errs(), Args.hasArg(OPT_addresses),		Printer.reset(new LLVMPrinter(outs(), errs(), Args.hasArg(OPT_addresses),
Opts.PrintFunctions != FunctionNameKind::None,		Opts.PrintFunctions != FunctionNameKind::None,
Args.hasArg(OPT_pretty_print),		Args.hasArg(OPT_pretty_print),
SourceContextLines,		SourceContextLines,
Args.hasArg(OPT_verbose)));		Args.hasArg(OPT_verbose)));

std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT);		std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT);
if (InputAddresses.empty()) {		if (InputAddresses.empty()) {
const int kMaxInputStringLength = 1024;		const int kMaxInputStringLength = 1024;
char InputString[kMaxInputStringLength];		char InputString[kMaxInputStringLength];

while (fgets(InputString, sizeof(InputString), stdin)) {		while (fgets(InputString, sizeof(InputString), stdin)) {
// Strip newline characters.		// Strip newline characters.
std::string StrippedInputString(InputString);		std::string StrippedInputString(InputString);
llvm::erase_if(StrippedInputString,		llvm::erase_if(StrippedInputString,
[](char c) { return c == '\r' \|\| c == '\n'; });		[](char c) { return c == '\r' \|\| c == '\n'; });
symbolizeInput(Args, AdjustVMA, IsAddr2Line, OutputStyle,		symbolizeInput(Args, AdjustVMA, IsAddr2Line, OutputStyle,
StrippedInputString, Symbolizer, *Printer);		StrippedInputString, Symbolizer, *Printer);
outs().flush();		outs().flush();
}		}
} else {		} else {
for (StringRef Address : InputAddresses)		bool ArrayJSON = false;
		if (OutputStyle == DIPrinter::OutputStyle::JSON &&
		InputAddresses.size() > 1) {
		ArrayJSON = true;
		outs() << "[";
		jhendersonUnsubmitted Done Reply Inline Actions I would make it always a list, not just for multiple input addresses. Also, shouldn't this be using the json stream arrayBegin/End methods? jhenderson: I would make it always a list, not just for multiple input addresses. Also, shouldn't this be…
		}
		bool ArrayDelimJSON = false;
		for (StringRef Address : InputAddresses) {
		jhendersonUnsubmitted Not Done Reply Inline Actions There's no need for this outer if. jhenderson: There's no need for this outer if.
		if (ArrayJSON) {
		if (ArrayDelimJSON)
		outs() << ",";
		else
		jhendersonUnsubmitted Not Done Reply Inline Actions How about: StringRef Sep = ""; for (StringRef Address : InputAddresses) { outs() << Sep; Sep = ","; symbolizeInput(Args, AdjustVMA, IsAddr2Line, Style, Address, Symbolizer, Printer); } This may be moot however, if you are using the JSON stream to use the proper JSON array methods. jhenderson:* How about: ``` StringRef Sep = ""; for (StringRef Address : InputAddresses) { outs() << Sep…
		aorlovAuthorUnsubmitted Done Reply Inline Actions Did you forget about different commas in GNU/LLVM output style? I do not want any functional change in that area as a part of this patch. I have added groupBegin() and groupEnd() to DIPrinter interface and moved all logic to JSONPrinter implementation. aorlov: Did you forget about different commas in GNU/LLVM output style? I do not want any functional…
		jhendersonUnsubmitted Done Reply Inline Actions Thanks, yes, I'd forgotten this was the higher-level printer area. "group" doesn't obviously mean anything to me. Could you consider renaming it to something like "listBegin" etc? I think that more clearly indicates what you're doing. jhenderson: Thanks, yes, I'd forgotten this was the higher-level printer area. "group" doesn't obviously…
		ArrayDelimJSON = true;
		}
symbolizeInput(Args, AdjustVMA, IsAddr2Line, OutputStyle, Address,		symbolizeInput(Args, AdjustVMA, IsAddr2Line, OutputStyle, Address,
Symbolizer, *Printer);		Symbolizer, *Printer);
}		}
		if (ArrayJSON)
		outs() << "]\n";
		}

return 0;		return 0;
}		}

This is an archive of the discontinued LLVM Phabricator instance.

Add support for JSON output style to llvm-symbolizerClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 333102

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm/include/llvm/DebugInfo/Symbolize/DIPrinter.h

llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp

llvm/test/tools/llvm-symbolizer/output-style-json-code.test

llvm/test/tools/llvm-symbolizer/output-style-json-data.test

llvm/test/tools/llvm-symbolizer/output-style-json-frame.test

llvm/tools/llvm-symbolizer/Opts.td

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

Add support for JSON output style to llvm-symbolizer
ClosedPublic