This is an archive of the discontinued LLVM Phabricator instance.

I think JSONScopedPrinter (and possibly ScopedPrinter too, if it doesn't already have them) could do with gtest unit tests. It should be fairly straightforward to write them, and would help give confidence in the methods, especially as we may not use all of them up-front. The tests would also help with documenting what the code actually does, which will aid in future reviewing.

Will do! Do we want to test each method in isolation? or possibly in conjunction with each other to test things like the history stack? or possibly both?

Cross-posting this comment, since it's now applicable to this patch specifically: I think we probably need to do a bit of both: each method should be tested in isolation. I think we should ensure we have tests for the history stack, where it's applicable only. Basically, test the existing possible code paths, plus maybe one or two more interesting cases that test interaction between functions, if it seems sensible.

llvm/include/llvm/Support/ScopedPrinter.h
515	Nits: I think we are normally explicit about public/private, so I'd add the explicit `private:` here, even though it's not strictly required. I don't think we normally have blank lines at the start of class definitions.
519–531	I'd probably add some blank lines between these things.
535	I believe the explicit `llvm` is unnecessary.
541	How come we need to stringify the value here? That seems surprising to me, and may lead to undesirable behaviour in my mind (e.g. a field for an ELF struct might be a number for the 32-bit ELF format, but a string in the 64-bit ELF format, and not all numbers will be strings in the latter case, where they are not 64 bits wide). I have this vague memory that the LLVM JSON format doesn't support 64-bit numbers. If so, I think this is a mistake and should be fixed.
573–575	Similar point to the above, but slightly different: I think we should compare this to `std::numeric_limits<uint64_t>::max()` and print as a number, if possible, rather than as a string.
588	Similar to my comments in D114223, I'd recommend adding helper functions to avoid some of the logic duplication in the following methods.
687	I was thinking about this, and I think we shouldn't actually print as hex in JSON, and should fall back to the standard number printing. Here's the reasoning: most of the time, we use `printHex` to print some number that represents an offset, or other value in a fixed-sized field, that is easier to read as hex. However, JSON format is not intended for human consumption, at least not in our use-case for this code. As such, readability is of lesser concern than parseability. If we were to store hex numbers as strings, rather than converting them to their decimal counterparts, we'd end up with some numeric values as strings, and others as integers, in a seemingly arbitrary manner to the end-user. I don't think that this is desirable.
697	I would actually print this as two separate attributes: a symbol name, and a numeric offset. That'll be easier for consumers to parse. Perhaps these should be in a separate object, to avoid surprise name clashes. Rather than:
    "Label":"Symbol+0x100"
or
    "Label":"Symbol", "LabelOffset":256
do
    "Label":{ "SymName":"Symbol", "Offset":256 }
703	You should check through the existing usages, but at least in llvm-readelf, I think most labels use UpperCamelCase style, so printing as `MyValueRaw` would look a little cleaner. Also, see my above comments re. symbol offset about a nested object/naming clash risk, i.e. I'd recommend: "Label":{ "Value":1234, "RawValue":4321 }
708	Why's this being left as a todo?
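
The nested-object layout suggested for line 697 can be illustrated with a minimal, self-contained sketch. `symbolRefToJSON` is a hypothetical helper written with the standard library only; it is not part of `ScopedPrinter` or the LLVM JSON library, and it assumes that the label and the symbol name need no JSON escaping.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical helper (not an LLVM API): renders a symbol reference as a
// nested JSON object so that the symbol name and the numeric offset stay in
// separate, unambiguous attributes.
// Assumption: Label and SymName contain no characters that need escaping.
std::string symbolRefToJSON(const std::string &Label,
                            const std::string &SymName, uint64_t Offset) {
  assert(!Label.empty() && "an attribute needs a name");
  return "\"" + Label + "\":{\"SymName\":\"" + SymName +
         "\",\"Offset\":" + std::to_string(Offset) + "}";
}
```

A consumer can then read `Offset` as a number directly, without splitting a string such as `Symbol+0x100`.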

Add printBinaryImpl implementation.
Remove duplicated logic for json scope logic.
Update printHex to print numbers.
Clean up formatting.

Harbormaster completed remote builds in B135338: Diff 388795.Nov 21 2021, 11:02 PM

In D114224#3142126, @jhenderson wrote:

Cross-posting this comment, since it's now applicable to this patch specifically: I think we probably need to do a bit of both: each method should be tested in isolation. I think we should ensure we have tests for the history stack, where it's applicable only. Basically, test the existing possible code paths, plus maybe one or two more interesting cases that test interaction between functions, if it seems sensible.

That seems reasonable to me! Will update with these tests soon.

llvm/include/llvm/Support/ScopedPrinter.h
541	I had believed LLVM JSON originally didn't support `uint64_t` types but since then I see that support was added here: https://github.com/llvm/llvm-project/commit/8c3adce81dc36306ba30cda0cdf458cfcf7d076c so I think I can just remove the `to_string(...)`
573–575	I updated this to just print the full `APSInt` unconditionally. Would it be preferable to cap printing numbers at `std::numeric_limits<uint64_t>::max()` and then fall back on strings otherwise? JSON doesn't have a maximum numerical value, but arbitrarily long values may not always be supported. https://stackoverflow.com/a/39681707
687	That seems reasonable to me! Updated to output numeric values over hex strings.
708	My mistake, this was meant to be implemented before being put up for review.
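
To make the concern about very large numbers concrete: some JSON consumers store every number as an IEEE-754 double, which cannot represent all 64-bit integers exactly. The sketch below is an illustration under that assumption only; it says nothing about how `llvm::json` stores values. `survivesDoubleRoundTrip` is a hypothetical helper that checks whether a `uint64_t` comes back unchanged after a conversion to `double`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: returns true if V is unchanged after being converted
// to a double and back, i.e. a double-based JSON consumer would see the
// exact value.
bool survivesDoubleRoundTrip(uint64_t V) {
  const double D = static_cast<double>(V);
  // 2^64 is representable as a double but not as a uint64_t, so reject it
  // before converting back (that conversion would be undefined behaviour).
  if (D >= 18446744073709551616.0)
    return false;
  return static_cast<uint64_t>(D) == V;
}
```

Values up to 2^53 always survive; above that, only some do, which is one reason a consumer might prefer strings for arbitrarily large integers.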

jhenderson added inline comments.Nov 22 2021, 1:26 AM

llvm/include/llvm/Support/ScopedPrinter.h
573–575	Let's leave it without the cap, if we have a way of sensibly handling it. I think changing the type based on the value could get very confusing, and since as you say JSON has no firm limitations, printing out arbitrarily large numbers seems sensible. An alternative (and probably better) approach would be to add support for this type to the JSON library directly (in a precursor patch).
685	Here and in the other loop below, I'd avoid the use of `auto`: it doesn't improve readability, and we tend to avoid the "always use auto" approach in LLVM, when the type isn't obvious from that line.
687	Perhaps worth a comment here elaborating on the reasoning, so that future users understand why.
706–710

Add JSONScopedPrinter tests.
Update some ScopedPrinter methods.
Fix some formatting.

Herald added a subscriber: mgorny. · View Herald TranscriptNov 23 2021, 11:48 PM

Harbormaster completed remote builds in B135783: Diff 389400.Nov 23 2021, 11:48 PM

Jaysonyan marked 4 inline comments as done.Nov 23 2021, 11:51 PM

Jaysonyan added inline comments.

llvm/include/llvm/Support/ScopedPrinter.h
687	I've added a comment at the top of the first method which outputs Hex. I wasn't sure if it was desirable to add a comment for every location that outputs Hex or not. Let me know if there's a preference of one over the other.

I've only skimmed the new tests, but as there are some structural issues there, I suggest you address those and then I'll confirm the test coverage and logic looks good.

llvm/include/llvm/Support/ScopedPrinter.h
191	Seems like the change in this and the `printObject` function below should be part of the `ScopedPrinter` virtualisation patch?
438	This check seems to be a behaviour change? Seems like we should keep that out of this patch, to keep this patch purely NFC for the regular `ScopedPrinter`
631–634	If I'm not mistaken, this is going to lead to an array of mixed types (some strings and some integers). I'm not convinced that's desirable, since it will mean needing to switch on type to determine how to handle the field. Perhaps we should do what we do elsewhere and have an array of objects, where the object has name (possibly optional) and value fields? Alternatively, maybe the actual name isn't even needed, and the interesting thing is just the individual flag values. I guess that depends on how human readable we want this JSON output to be.
643	Same below.
644	I'm confused by this additional logic. It seems to me you're passing in a list of strings, but those strings might actually be numbers? That doesn't seem right to me with the calling code and/or our interface, rather than this function. For example, imagine our list happens to be a list of strings that are purely made up of digits, but the fact that they are digits is purely chance, and they should remain as strings. This function will convert to a list of integers. Further, a future new element in the same list without an all-digit input would cause the type of ALL the members of the list to change back to strings.
656	I'm inclined to say we should have the comment by each function. Perhaps better though, at the cost of a little more code than would otherwise be necessary, would be to have a small helper function `hexNumberToInt` or similar, which is then called in all places you currently do `HexNumber.Value`. You could then put the comment with that function, in one place.
699	Slightly surprised to see the new if here and a little further below. Could you explain them, please? Are you sure it's better to have attributes missing rather than as empty strings (I'm not entirely sure either way, and it probably depends on the context to some extent, but I'd focus on ease of parsing)? Aside: reminder that for single line `if` and loop statements, remove the braces, i.e. here and in the for loop below.
llvm/unittests/Support/ScopedPrinterTest.cpp
16	I wouldn't bother with the anonymous namespace. There'd only be a problem if someone else started writing ScopedPrinterTests in another file, which I think we'd want to spot and possibly stop.
18	For new unit tests that are not to do with the JSONScopedPrinter class, I'd put them in their own patch, as they are independently useful.
34	This string isn't really scoped itself. It's more the buffer used by the stream, so I'd just call it `Buffer` or `StreamBuffer`.
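
The `hexNumberToInt` idea from the comment on line 656 can be sketched as follows. This is a standalone illustration, not the patch's code: `HexNumber` here is a stand-in struct for LLVM's type of the same name, and `toHexText` and `toJSONAttribute` are hypothetical helpers that contrast the human-readable output with the JSON output.

```cpp
#include <cassert>
#include <cstdint>
#include <sstream>
#include <string>

// Stand-in for llvm::HexNumber: a number that the text printer shows in hex.
struct HexNumber {
  uint64_t Value;
};

// Text-style output: hex is easier for humans to read.
std::string toHexText(HexNumber N) {
  std::ostringstream OS;
  OS << "0x" << std::hex << std::uppercase << N.Value;
  return OS.str();
}

// JSON output is meant for machine consumption, so the value is emitted as a
// plain base-10 number rather than as a hex string. Keeping the conversion
// in one helper gives this comment a single home.
uint64_t hexNumberToInt(HexNumber N) { return N.Value; }

// Hypothetical helper; assumes Label needs no JSON escaping.
std::string toJSONAttribute(const std::string &Label, HexNumber N) {
  return "\"" + Label + "\":" + std::to_string(hexNumberToInt(N));
}
```

With this split, every numeric attribute in the JSON output has the same type regardless of how the text printer would have displayed it.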

jhenderson added inline comments.Nov 25 2021, 1:02 AM

llvm/unittests/Support/ScopedPrinterTest.cpp
19	What's the point of the lambda? It seems like it's adding unnecessary complexity to the test, when you could just do:
    std::string ScopedString;
    llvm::raw_string_ostream OS(ScopedString);
    ScopedPrinter Writer(OS);
    // <current body of the lambda>
    const char *Out = /*...*/;
    EXPECT_EQ(Out, ScopedString);
35
92–94	This setup logic might benefit from being pulled into a test fixture, to reduce the duplication between tests. The JSON stuff could go in the same fixture.
503	Looking at this test seems to make it clear to me that this is not the optimal format of this output. I think you probably want it to be an array of numbers, rather than an object, with individual bytes labelled.

Add JSONScopedPrinter specific tests.
Update printBinaryImpl structure to include index.
Add hexNumberToInt method.
Revert JSONScopedPrinter::printListImpl to only print strings.
Add printRawFlagsImpl method to JSONScopedPrinter.

llvm/include/llvm/Support/ScopedPrinter.h
191	Moved to virtualization patch!
438	This was meant to accommodate the change I made when making the extra `printFlags` method virtual. I was hoping both public `printFlags(...)` methods could utilize the same `printFlagsImpl` method but you're right this could be a change in behaviour. I've updated the virtualisation patch to keep this implementation the same and added an explicit `printRawFlagsImpl` method.
631–634	This was an attempt similar to the change in the ScopedPrinter implementation of `printFlagsImpl` to handle 2 types of public `printFlags` methods. I've added a `printRawFlagsImpl` to handle printing `Flag.Value` and updated this method to printing both `Flag.Name` and `Flag.Value`. This seems to best match the information that the `ScopedPrinter` implementation provides.
644	Yea this makes sense. The problem I was trying to solve was that the public method `printList(...)` takes a list of template type. But to pass to the `printListImpl(...)` method currently we are using the `to_string` method. So passing a list of numbers to `printList(...)` would result in a list of strings for the `JSONScopedPrinter`. I've reverted this change to just unconditionally print as strings for now but maybe it would be preferable to replace the template param `printList(...)` with multiple overloaded `printList(...)` methods that can delegate to `printStringListImpl(...)`/`printNumberListImpl(...)`/`printBooleanListImpl(...)` methods. Interested if you have any thoughts on this.
699	This was to match closer with the public methods available. Some public methods of `printBinary(...)` don't provide a `Str` param and so it's just passed into `printBinaryImpl` as `StringRef()` and the existing `ScopedPrinter::printBinaryImpl` does a similar check for `Str.empty()`. Similarly, there are public `printBinary(...)` and `printBinaryBlock(...)` methods where the latter prints the accompanying characters and the former does not. I'm not opposed to unconditionally printing these values but since these checks are used to differentiate different public methods inside `ScopedPrinter::printBinaryImpl` I thought it made sense to do the same for `JSONScopedPrinter::printBinaryImpl`. My mistake about the `if` statements, I'll try to be more careful about it in the future.
llvm/unittests/Support/ScopedPrinterTest.cpp
18	Moved `ScopedPrinter`-related tests to D114684 and rebased off that patch.
19	I haven't removed the lambda but it's now using a test fixture which takes a lambda. It doesn't need to use the test fixture since we're not testing the `JSONScopedPrinter` but I'd imagine it might be nice for the sake of consistency with other tests. Let me know if you feel otherwise.
503	I've updated the output to have the following format: "Bytes": [ { "Index": 0, "Value": 70, "Character": "F" }, { "Index": 2, "Value": 111, "Character": "o" }, ... ] (Character attribute will be omitted for non `printBinaryBlock(...)` methods for now)
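
The flags discussion above (lines 631–634) settles on giving every array element the same shape: named flags become objects carrying both a name and a value, while raw flags are plain numbers. A minimal sketch of that idea, using only the standard library, might look like this. `FlagEntry` is a stand-in struct, `flagsToJSON` and `rawFlagsToJSON` are hypothetical helpers, and the attribute names `Name` and `Value` are illustrative assumptions rather than the patch's confirmed output.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Stand-in for the printer's flag entry: a flag name plus its numeric value.
struct FlagEntry {
  std::string Name;
  uint64_t Value;
};

// Named flags: every element is an object with the same two attributes, so a
// consumer never has to switch on the element type.
// Assumption: flag names need no JSON escaping.
std::string flagsToJSON(const std::vector<FlagEntry> &Flags) {
  std::string Out = "[";
  for (std::size_t I = 0; I < Flags.size(); ++I) {
    if (I != 0)
      Out += ",";
    Out += "{\"Name\":\"" + Flags[I].Name +
           "\",\"Value\":" + std::to_string(Flags[I].Value) + "}";
  }
  return Out + "]";
}

// Raw flags: there are no names, so every element is a number.
std::string rawFlagsToJSON(const std::vector<uint64_t> &Values) {
  std::string Out = "[";
  for (std::size_t I = 0; I < Values.size(); ++I) {
    if (I != 0)
      Out += ",";
    Out += std::to_string(Values[I]);
  }
  return Out + "]";
}
```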

Harbormaster completed remote builds in B136370: Diff 390248.Nov 28 2021, 9:17 PM

Could you rebase this patch on top of the other commits in the series, please, so that it's easier to see the current state?

llvm/include/llvm/Support/ScopedPrinter.h
625	I'd use the type name explicitly here. The concept of a "hex" number is purely a representation thing, so doesn't really make sense to refer to a number as a "hex number" until it's been written out somewhere. Numbers are just numbers. I also suggest referring to the base explicitly in this comment for how they are outputted.
644	I think multiple functions is the way forward, although I'd just call them all `printListImpl` and let the ArrayRef type determine the version to call. Any particular reason we'd need multiple `printList` functions too? Seems like it's something that could be determined without it. Some rough pseudo code:
    template <typename T> void printList(StringRef Label, const ArrayRef<T> List) {
      printListImpl(Label, List);
    }
    virtual void printListImpl(StringRef, ArrayRef<int> List) { /*...*/ }
    virtual void printListImpl(StringRef, ArrayRef<bool> List) { /*...*/ }
    virtual void printListImpl(StringRef Label, ArrayRef<std::string> List) { /*...*/ }
Actually, now that I type that out, I wonder if you could just get rid of `printListImpl` entirely, and make `printList` a set of virtual non-templated functions with multiple overloads for the different supported types? (Potentially you might still want a non-virtual `printListImpl` (that could be templated) to avoid code duplication of course).
699	Let's leave it as-is for now. We can always add attributes should we desire to at a future point. That being said, I'm wondering if the `Character` block is actually useful for JSON output, since the character is just a human-readable version of the byte. If we're assuming people won't be directly reading JSON output and instead will be processing it themselves, the Character output wouldn't be useful at all, I think (they could reproduce it from the Bytes data, if needed). Thoughts?
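
The overload-based approach discussed for line 644 can be sketched without any LLVM types. In the standalone illustration below, `listToJSON` is a hypothetical free function with one overload per element type, and `std::vector` stands in for `ArrayRef`. The point it demonstrates is that the JSON element type follows from the static type of the list, never from inspecting the contents, so a list of all-digit strings stays a list of strings.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Shared helper: joins already-rendered JSON elements into an array literal.
static std::string joinArray(const std::vector<std::string> &Elems) {
  std::string Out = "[";
  for (std::size_t I = 0; I < Elems.size(); ++I) {
    if (I != 0)
      Out += ",";
    Out += Elems[I];
  }
  return Out + "]";
}

// Strings are always quoted, even if they happen to consist of digits.
// Assumption: the strings need no JSON escaping.
std::string listToJSON(const std::vector<std::string> &List) {
  std::vector<std::string> Elems;
  for (const std::string &S : List)
    Elems.push_back("\"" + S + "\"");
  return joinArray(Elems);
}

// Numbers are emitted as JSON numbers.
std::string listToJSON(const std::vector<uint64_t> &List) {
  std::vector<std::string> Elems;
  for (uint64_t N : List)
    Elems.push_back(std::to_string(N));
  return joinArray(Elems);
}

// Booleans are emitted as JSON true/false.
std::string listToJSON(const std::vector<bool> &List) {
  std::vector<std::string> Elems;
  for (bool B : List)
    Elems.push_back(B ? "true" : "false");
  return joinArray(Elems);
}
```

Adding a new non-numeric element to a string list therefore cannot change the type of the existing elements, which was the failure mode raised earlier in the review.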

Add JSON implementation for new printList(...) overloads.
Update comment wording.

Harbormaster completed remote builds in B137068: Diff 391220.Dec 1 2021, 11:10 PM

Remove Character block from JSONScopedPrinter::printBinaryImpl(...)

Harbormaster completed remote builds in B137069: Diff 391221.Dec 1 2021, 11:17 PM

Jaysonyan added inline comments.Dec 1 2021, 11:21 PM

llvm/include/llvm/Support/ScopedPrinter.h
644	I've added overloaded methods for `printList(...)` inside the virtualization diff and updated the ScopedPrinter test diff and this diff accordingly. The implementation roughly follows the ideas you've laid out but the major difference is that the overloaded methods are public `printList(...)` methods rather than private `printListImpl(...)`. This was needed because we still need to maintain a template `printList(...)` method to fall-back on for lists not comprised of strings, ints, or booleans.
699	You're right, adding the characters doesn't provide much value if we're analyzing the output through automated scripts. I think it makes sense to remove this block.

Jaysonyan added a child revision: D114225: Add JSONScopedPrinter to llvm-readelf.Dec 2 2021, 12:53 AM

I've not yet looked at the JSONScopedPrinter tests again, but the body of the code looks pretty good.

llvm/include/llvm/Support/ScopedPrinter.h
665	This comment applies to all HexNumber instances, not just the one passed in, so it should be plural.
750	I'm now looking at this `Index` field and wondering if it's useful at this position? We're in an array already, so the index should match the array position. It doesn't (so should be called Offset), but then why have an entry per byte, when one per array would be less verbose, and equally as informative? It would also allow the values to be written directly in the array rather than in a nested object. I.e. { "Value" : <Str>, "Offset" : <StartOffset>, "Bytes" : [Value[0], Value[1], ...] }
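
The layout proposed for line 750 (one object per binary blob, with a start offset and a flat array of byte values) can be sketched as below. `binaryToJSON` is a hypothetical standalone function, not the patch's `printBinaryImpl`; omitting `Value` when the string is empty mirrors the `Str.empty()` check mentioned earlier in the thread, and treating it that way here is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper: renders a binary blob as
//   {"Value":<Str>,"Offset":<StartOffset>,"Bytes":[b0,b1,...]}
// Assumption: Str needs no JSON escaping; "Value" is omitted when Str is
// empty.
std::string binaryToJSON(const std::string &Str,
                         const std::vector<uint8_t> &Data,
                         uint32_t StartOffset) {
  std::string Out = "{";
  if (!Str.empty())
    Out += "\"Value\":\"" + Str + "\",";
  Out += "\"Offset\":" + std::to_string(StartOffset) + ",\"Bytes\":[";
  for (std::size_t I = 0; I < Data.size(); ++I) {
    if (I != 0)
      Out += ",";
    // Cast so the byte is printed as a number rather than as a character.
    Out += std::to_string(static_cast<unsigned>(Data[I]));
  }
  return Out + "]}";
}
```

The position of a byte in the file is then `Offset` plus its array index, so no per-byte index attribute is needed.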

Update JSONScopedPrinter::printBinaryImpl to print initial offset rather than including a byte index in each byte entry.
Small comment change.

Harbormaster completed remote builds in B137216: Diff 391435.Dec 2 2021, 12:50 PM

Jaysonyan marked 10 inline comments as done.Dec 2 2021, 12:52 PM

Jaysonyan added inline comments.

llvm/include/llvm/Support/ScopedPrinter.h
750	This makes a lot of sense and seems like a better format to me. Updated to match this format.

You should add tests (for both classes) for classof and getKind functions, since they're part of the public API.
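
For context, `getKind` and `classof` follow the LLVM-style RTTI pattern: the base class stores a kind tag, and each class reports whether a given pointer has its kind. The sketch below reproduces the pattern with hypothetical `Printer` and `JSONPrinter` classes so that it compiles without LLVM headers; in LLVM proper these `classof` functions are what `isa<>` and `dyn_cast<>` consult.

```cpp
#include <cassert>

// Hypothetical stand-in for ScopedPrinter.
class Printer {
public:
  enum class PrinterKind { Base, JSON };

  explicit Printer(PrinterKind Kind = PrinterKind::Base) : Kind(Kind) {}
  virtual ~Printer() = default;

  PrinterKind getKind() const { return Kind; }

  // True only for objects constructed as the plain base printer.
  static bool classof(const Printer *P) {
    assert(P && "classof needs a non-null pointer");
    return P->getKind() == PrinterKind::Base;
  }

private:
  PrinterKind Kind;
};

// Hypothetical stand-in for JSONScopedPrinter.
class JSONPrinter : public Printer {
public:
  JSONPrinter() : Printer(PrinterKind::JSON) {}

  static bool classof(const Printer *P) {
    assert(P && "classof needs a non-null pointer");
    return P->getKind() == PrinterKind::JSON;
  }
};
```

A unit test for these functions would check both directions: each class accepts its own instances and rejects instances of the other.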

One thought that's occurred to me: should the default JSON output really be formatted so verbosely (i.e. with all the new lines and spacing)? As the prime motivation of this is for machine readability, would it not make more sense for it to be compact, at least by default (with a possible option for pretty printing)? The amount of whitespace that's in the output could make the output data size be significantly larger than it needs to be, slowing down the parsing of said output.
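
The size difference between compact and pretty-printed output can be seen in a small sketch. `renderObject` is a hypothetical function that renders the same attributes either way; the two-space indent and the `": "` separator in pretty mode are illustrative choices, not a description of what `llvm::json::OStream` emits.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

using Attribute = std::pair<std::string, uint64_t>;

// Hypothetical renderer: the same data, with or without whitespace.
// Assumption: attribute names need no JSON escaping.
std::string renderObject(const std::vector<Attribute> &Attrs,
                         bool PrettyPrint) {
  if (Attrs.empty())
    return "{}";
  const std::string NewLine = PrettyPrint ? "\n" : "";
  const std::string Indent = PrettyPrint ? "  " : "";
  const std::string Colon = PrettyPrint ? ": " : ":";
  std::string Out = "{" + NewLine;
  for (std::size_t I = 0; I < Attrs.size(); ++I) {
    Out += Indent + "\"" + Attrs[I].first + "\"" + Colon +
           std::to_string(Attrs[I].second);
    if (I + 1 != Attrs.size())
      Out += ",";
    Out += NewLine;
  }
  return Out + "}";
}
```

Both forms parse to the same value; the compact one is simply shorter, which matters once the output covers many sections of a large object file.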

llvm/include/llvm/Support/ScopedPrinter.h
104–117	All the new enums probably want to be `enum class`.
105	I'd avoid the use of the name `Standard`, as it could be confused with the C++ standard. I'd use `Base`, `Plain` or `Basic` for the enum value.
522	Similar comment to the above: don't use the name "Standard" for an enum value. Actually, in this case, without looking at the code, I have no clue what "Standard" even means. It should be renamed to reflect what it is used for.
534	At one point, there was some effort to set good defaults for SmallVector, allowing you to omit the second argument. It's less useful here, because we have a good guess about the amount of nesting. I think it's more likely to be a max of about 4 or 5, so 16 is probably way too high. Perhaps worth doing an audit of the usage within the LLVM code and add a few more points above the current maximum scoping (yes, I know JSONScopedPrinter isn't used really yet, but you can imagine how it would look, based on how ScopedPrinter is used already).
llvm/unittests/Support/ScopedPrinterTest.cpp
1054	I think we need a JSONScopedPrinter version of this test, since we override `startLine` in that class.
1067	Ditto.

Update enum naming.
Add APSInt implementation and test.
Add classof and kindOf test.

Harbormaster completed remote builds in B137452: Diff 391758.Dec 3 2021, 3:14 PM

Jaysonyan edited the summary of this revision. (Show Details)Dec 3 2021, 3:14 PM

Jaysonyan added a parent revision: D114684: Add ScopedPrinter unit tests.

Update ScopeHistory SmallVector size to 8

llvm/include/llvm/Support/ScopedPrinter.h
105	Updated to use Base.
522	Updated to `NoAttribute`.
534	Looking through the usage of `ScopedPrinter`, the largest number of nested scopes I came across was 6. This was a tie between a few methods in `llvm-readobj` (`LLVMElfDumper::printVersionDependencySection`, `LLVMElfDumper::printBBAddrMaps`, and `COFFDumper::printCodeViewSymbolSection`). Inside each of these methods there are 4 nested scopes and paired with the 2 scopes that would be added by `JSONScopedPrinter` (highest level `[]` and `{}` for each file) so I think the most nesting is 6. I'll update the `SmallVector` size to be 8.
llvm/unittests/Support/ScopedPrinterTest.cpp
1054	My mistake, I actually meant to remove the overridden implementations of both `startLine` and `getOStream`. For the `JSONScopedPrinter` to provide these methods, it relies on `json::OStream::rawValueBegin()` which can only be used where values are used (elements of arrays or values to attributes). So if `startLine` or `getOStream` are called anywhere outside these contexts (which is most of the time) then assertions inside `json::OStream` fail. So I think it might be more desirable to just rely on the `ScopedPrinter` implementation of both these methods.

Harbormaster completed remote builds in B137453: Diff 391760.Dec 3 2021, 4:09 PM

jhenderson added inline comments.Dec 6 2021, 12:39 AM

llvm/include/llvm/Support/ScopedPrinter.h
104–117	This is marked as done but I don't see it here?
630–631	I'd pull this into a little helper function, shared by `printNumber`. It just helps avoid a (small) amount of duplication, but also helps label what the code does. (Ideally, I'd actually suggest enhancing the JSON output stream interface, but up to you whether you want to go that far)
llvm/unittests/Support/ScopedPrinterTest.cpp
107	This and the classof test don't rely on the fixture, so I'd a) change them to not use the fixture, and b) move them above it. Alternatively, see my comment below.
116	Either "nothing is" or "nothing's". Same below. You might want to consider enhancing the current fixture, as an alternative to making these two tests not use it. In this case, I'd add the `std::string`, `raw_string_ostream`, `ScopedPrinter` and `JSONScopedPrinter` local variables used here and in the verify* functions into the base class, and then use the `TearDown` method to ensure JSONScopedPrinter has that empty string written. Aside: it seems to me that this assertion is bogus - it's not that unreasonable to create a printer, but write nothing to it, to get an empty output.
1054	Okay. I haven't looked into this, so I'll trust your judgement.

Update missing enum to enum class
Update test fixture to handle printing outer {} and handle nothing printed at top level.

llvm/include/llvm/Support/ScopedPrinter.h
104–117	My mistake, looks like I updated some but not all of the enums. Updated them all now.
630–631	I've added a helper function for now. I don't imagine it would take too much to enhance the JSON class to handle APSInt but I don't have the capacity to investigate right now.
llvm/unittests/Support/ScopedPrinterTest.cpp
116	Updated the test fixture to handle the teardown. I needed to add a check to ensure we're only printing an empty string if there hasn't been a call to `verifyJSONScopedPrinter` since `printString` can only be called under specific contexts. Alternatively I could call something like `DictScope(W, "Label")` which can be called under all contexts (since it uses the history stack) but I held off because calling `printString("")` with the empty string felt more appropriate. Let me know if you have any opinions on this.

Harbormaster completed remote builds in B137824: Diff 392284.Dec 6 2021, 10:35 PM

jhenderson added inline comments.Dec 7 2021, 3:44 AM

llvm/include/llvm/Support/ScopedPrinter.h
541–542	I'm not a fan of having to have an enum for the scoping kind, having looked at this. I'd much rather a class that does RAII-style management of the printer be passed in and ownership taken by this printer. I was going to suggest the following interface, but I see that the DelimitedScope takes a ScopedPrinter, leading to a chicken and egg problem.
    JSONScopedPrinter(raw_ostream &OS, bool PrettyPrint = false,
                      std::unique_ptr<DelimitedScope> &&Scope = std::unique_ptr<DelimitedScope>());
The first argument is unchanged. The second argument means control of the details of indentation is left to the class, rather than needing to know what "2" or "0" means (especially as "0" is special, as it means no new lines either). The third argument allows a user to provide the appropriate scoping mechanism for their use-case, but leaves the class to take and manage ownership; if it's defaulted, then no scoping will be used. NB: the functionality of the second and third arguments would need testing (both default and explicit values), if they aren't already. I wonder if the way to resolve this circular issue is to have the DelimitedScope subclasses have a default constructor, which delays assigning the ScopedPrinter to a separate function. The scope opening would then be performed when the class is assigned its printer (and closing would only happen if a printer has been assigned). What do you think of this?
llvm/unittests/Support/ScopedPrinterTest.cpp
116	The only other idea I had was to make `JSONScopedPrinter` an `Optional` in the fixture, initialised in the corresponding verify method (and optionally in other tests, if needed), but I don't think that works. I'm happy going with your suggestion.
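
The default-constructed scope idea from the comment on lines 541–542 can be sketched in isolation. All names below (`Scope`, `ListScope`, `Printer`, `write`) are hypothetical stand-ins that only model the ownership and ordering; the real `DelimitedScope` and `JSONScopedPrinter` interfaces may differ. The scope is created without a printer, opens when the printer adopts it, and closes only if it was ever bound.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

class Printer;

// RAII scope that can exist before its printer does.
class Scope {
public:
  Scope() = default;
  virtual ~Scope() = default;
  // Binds the scope to a printer and emits the opening delimiter.
  virtual void setPrinter(Printer &P) = 0;

protected:
  Printer *W = nullptr; // Stays null until setPrinter is called.
};

class Printer {
public:
  explicit Printer(std::string &Out,
                   std::unique_ptr<Scope> OuterScope = nullptr)
      : Out(Out), Outer(std::move(OuterScope)) {
    if (Outer)
      Outer->setPrinter(*this); // The scope opens here, not at construction.
  }
  // Close the outer scope explicitly while the printer is still usable.
  ~Printer() { Outer.reset(); }

  void write(const std::string &Text) { Out += Text; }

private:
  std::string &Out;
  std::unique_ptr<Scope> Outer;
};

class ListScope : public Scope {
public:
  void setPrinter(Printer &P) override {
    W = &P;
    W->write("[");
  }
  ~ListScope() override {
    if (W) // Never bound means never opened, so nothing to close.
      W->write("]");
  }
};
```

Because the printer owns the scope, the closing delimiter is written when the printer is destroyed, and a printer constructed without a scope writes no delimiters at all.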

Jaysonyan mentioned this in D114684: Add ScopedPrinter unit tests.Dec 8 2021, 12:57 AM

Add verifyAll method to test fixture.
Allow JSONScopedPrinter to be constructed with DelimitedScope.
Add DelimitedScope and pretty-print unit tests.

Herald added subscribers: rupprecht, MaskRay, hiraditya. · View Herald TranscriptDec 8 2021, 1:25 AM

Jaysonyan added inline comments.Dec 8 2021, 1:32 AM

llvm/include/llvm/Support/ScopedPrinter.h
541–542	I agree that constructing the `JSONScopedPrinter` with an enum is less preferable than constructing with a `DelimitedScope`. I've implemented your suggestion to allow for `DelimitedScope` subclasses to be default constructed and later add the `ScopedPrinter`. This allowed us to construct `JSONScopedPrinter` with a `DelimitedScope` rather than using an enum. Making this change also pointed out a weird piece of code which accesses the `ScopedPrinter` through the `DelimitedScope` member but it had access to the original ScopedPrinter. So I've updated that piece of code to directly access the ScopedPrinter. I've also added tests for pretty printing and this delimitedScope ctor param. Since they require specific ctor calls they don't use the test fixture.
llvm/unittests/Support/ScopedPrinterTest.cpp
116	I'll leave it as is for now, possibly in the future that assertion will be removed and this extra work can just be deleted.

Fix DelimitedScopeCtor test.

Harbormaster completed remote builds in B138103: Diff 392679.Dec 8 2021, 2:17 AM

jhenderson added inline comments.Dec 9 2021, 6:31 AM

llvm/lib/Support/ScopedPrinter.cpp
50	Consider adding a comment, as suggested inline, to "name" the pretty print/indentation parameter. The name should match the parameter's name.
52	I don't think you need the `.get()`?
llvm/unittests/Support/ScopedPrinterTest.cpp
52	For symmetry, I might be inclined to print a string in each of the three cases, not just the "no scope" case.
71	I'd add a comment as suggested inline, to explain the boolean.

Add param comments.
Update pretty-print tests.

Jaysonyan marked an inline comment as done.Dec 10 2021, 12:37 AM

Harbormaster completed remote builds in B138602: Diff 393394.Dec 10 2021, 1:28 AM

Looks good to me!

This revision is now accepted and ready to land.Dec 10 2021, 2:28 AM

This revision was landed with ongoing or failed builds.Dec 10 2021, 10:58 AM

Closed by commit rG928d17254ba2: [llvm] Add JSONScopedPrinter class (authored by Jaysonyan). · Explain Why

This revision was automatically updated to reflect the committed changes.

Jaysonyan added a commit: rG928d17254ba2: [llvm] Add JSONScopedPrinter class.

Revision Contents

Path	Size
llvm/include/llvm/Support/ScopedPrinter.h	324 lines
llvm/lib/Support/ScopedPrinter.cpp	10 lines
llvm/tools/llvm-readobj/ARMEHABIPrinter.h	2 lines
llvm/unittests/Support/ScopedPrinterTest.cpp	565 lines

Diff 393543

llvm/include/llvm/Support/ScopedPrinter.h

Show All 10 Lines

#include "llvm/ADT/APSInt.h" #include "llvm/ADT/APSInt.h"

#include "llvm/ADT/ArrayRef.h" #include "llvm/ADT/ArrayRef.h"

#include "llvm/ADT/SmallVector.h" #include "llvm/ADT/SmallVector.h"

#include "llvm/ADT/StringExtras.h" #include "llvm/ADT/StringExtras.h"

#include "llvm/ADT/StringRef.h" #include "llvm/ADT/StringRef.h"

#include "llvm/Support/DataTypes.h" #include "llvm/Support/DataTypes.h"

#include "llvm/Support/Endian.h" #include "llvm/Support/Endian.h"

#include "llvm/Support/JSON.h"

#include "llvm/Support/raw_ostream.h" #include "llvm/Support/raw_ostream.h"

#include <algorithm> #include <algorithm>

namespace llvm { namespace llvm {

template <typename T> struct EnumEntry { template <typename T> struct EnumEntry {

StringRef Name; StringRef Name;

// While Name suffices in most of the cases, in certain cases // While Name suffices in most of the cases, in certain cases

▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines std::string enumToString(T Value, ArrayRef<EnumEntry<TEnum>> EnumValues) {

for (const EnumEntry<TEnum> &EnumItem : EnumValues) for (const EnumEntry<TEnum> &EnumItem : EnumValues)

if (EnumItem.Value == Value) if (EnumItem.Value == Value)

return std::string(EnumItem.AltName); return std::string(EnumItem.AltName);

return to_hexString(Value, false); return to_hexString(Value, false);

} }

class ScopedPrinter { class ScopedPrinter {

public: public:

ScopedPrinter(raw_ostream &OS) : OS(OS), IndentLevel(0) {} enum class ScopedPrinterKind {

Base,

jhendersonUnsubmitted

Done

I'd avoid the use of the name Standard, as it could be confused with the C++ standard. I'd use Base, Plain or Basic for the enum value.

jhenderson: I'd avoid the use of the name `Standard`, as it could be confused with the C++ standard. I'd…

JaysonyanAuthorUnsubmitted

Done

Updated to use Base.

Jaysonyan: Updated to use Base.

JSON,

};

ScopedPrinter(raw_ostream &OS,

ScopedPrinterKind Kind = ScopedPrinterKind::Base)

: OS(OS), IndentLevel(0), Kind(Kind) {}

ScopedPrinterKind getKind() const { return Kind; }

static bool classof(const ScopedPrinter *SP) {

return SP->getKind() == ScopedPrinterKind::Base;

}

jhendersonUnsubmitted

Done

All the new enums probably want to be enum class.

jhenderson: All the new enums probably want to be `enum class`.

jhendersonUnsubmitted

Done

This is marked as done but I don't see it here?

jhenderson: This is marked as done but I don't see it here?

JaysonyanAuthorUnsubmitted

Done

My mistake, looks like I updated some but not all of the enums. Updated them all now.

Jaysonyan: My mistake, looks like I updated some but not all of the enums. Updated them all now.

virtual ~ScopedPrinter() {}
void flush() { OS.flush(); }
void indent(int Levels = 1) { IndentLevel += Levels; }
void unindent(int Levels = 1) {
[57 lines not shown; context: `for (const auto &Flag : Flags) {`]
}
llvm::sort(SetFlags, &flagName);
printFlagsImpl(Label, hex(Value), SetFlags);
}
template <typename T> void printFlags(StringRef Label, T Value) {
SmallVector<HexNumber, 10> SetFlags;

jhenderson: Seems like the change in this and the `printObject` function below should be part of the ScopedPrinter virtualisation patch?

Jaysonyan (author): Moved to virtualization patch!

uint64_t Flag = 1;
uint64_t Curr = Value;
while (Curr > 0) {
if (Curr & 1)
SetFlags.emplace_back(Flag);
Curr >>= 1;
Flag <<= 1;
}
[230 lines not shown; context: `virtual void printBinaryImpl(StringRef Label, StringRef Str,`]
uint32_t StartOffset = 0);
virtual void printFlagsImpl(StringRef Label, HexNumber Value,
ArrayRef<FlagEntry> Flags) {
startLine() << Label << " [ (" << Value << ")\n";
for (const auto &Flag : Flags)
startLine() << " " << Flag.Name << " (" << hex(Flag.Value) << ")\n";
startLine() << "]\n";
}

jhenderson: This check seems to be a behaviour change? Seems like we should keep that out of this patch, to keep this patch purely NFC for the regular `ScopedPrinter`.

Jaysonyan (author): This was meant to accommodate the change I made to make the extra `printFlags` method virtual. I was hoping both public `printFlags(...)` methods could use the same `printFlagsImpl` method, but you're right that this could be a change in behaviour. I've updated the virtualisation patch to keep this implementation the same and added an explicit `printRawFlagsImpl` method.

virtual void printFlagsImpl(StringRef Label, HexNumber Value,
ArrayRef<HexNumber> Flags) {
startLine() << Label << " [ (" << Value << ")\n";
for (const auto &Flag : Flags)
startLine() << " " << Flag << '\n';
startLine() << "]\n";
}
[49 lines not shown; context: `private:`]
void scopedEnd(char Symbol) {
unindent();
startLine() << Symbol << '\n';
}
raw_ostream &OS;
int IndentLevel;
StringRef Prefix;
+ ScopedPrinterKind Kind;
};
template <>
inline void
ScopedPrinter::printHex<support::ulittle16_t>(StringRef Label,
support::ulittle16_t Value) {
startLine() << Label << ": " << hex(Value) << "\n";
}

struct DelimitedScope;

jhenderson: Nits:

1) I think we are normally explicit about public/private, so I'd add the explicit `private:` here, even though it's not strictly required.
2) I don't think we normally have blank lines at the start of class definitions.

class JSONScopedPrinter : public ScopedPrinter {

private:

enum class Scope {

Array,

Object,

};

jhenderson: Similar comment to the above: don't use the name "Standard" for an enum value. Actually, in this case, without looking at the code, I have no clue what "Standard" even means. It should be renamed to reflect what it is used for.

Jaysonyan (author): Updated to `NoAttribute`.

enum class ScopeKind {

NoAttribute,

Attribute,

NestedAttribute,

};

struct ScopeContext {

Scope Context;

ScopeKind Kind;

jhenderson:

Object,
};
enum ScopeKind {
Standard,
Attribute,
NestedAttribute,
};
struct ScopeContext {
Scope Context;
ScopeKind Kind;
ScopeContext(Scope Context, ScopeKind Kind = ScopeKind::Standard)
: Context(Context), Kind(Kind) {}
};
SmallVector<ScopeContext, 16> ScopeHistory;
json::OStream JOS;

I'd probably add some blank lines between these things.

ScopeContext(Scope Context, ScopeKind Kind = ScopeKind::NoAttribute)

: Context(Context), Kind(Kind) {}

};

jhenderson: At one point, there was some effort to set good defaults for `SmallVector`, allowing you to omit the second argument. It's less useful here, because we have a good guess about the amount of nesting. I think it's more likely to be a max of about 4 or 5, so 16 is probably way too high. Perhaps worth doing an audit of the usage within the LLVM code and adding a few more points above the current maximum scoping (yes, I know `JSONScopedPrinter` isn't really used yet, but you can imagine how it would look, based on how `ScopedPrinter` is used already).

Jaysonyan (author): Looking through the usage of `ScopedPrinter`, the deepest nesting of scopes I came across was 6. This was a tie between a few methods in llvm-readobj (`LLVMElfDumper::printVersionDependencySection`, `LLVMElfDumper::printBBAddrMaps`, and `COFFDumper::printCodeViewSymbolSection`). Inside each of these methods there are 4 nested scopes; paired with the 2 scopes that would be added by `JSONScopedPrinter` (the highest-level `[]` and a `{}` for each file), I think the most nesting is 6. I'll update the `SmallVector` size to be 8.

jhenderson:

public:
- JSONScopedPrinter(llvm::raw_ostream &OS)
+ JSONScopedPrinter(raw_ostream &OS)
: ScopedPrinter(OS, ScopedPrinter::ScopedPrinterKind::JSON), JOS(OS, 2) {}

I believe the explicit `llvm` is unnecessary.

SmallVector<ScopeContext, 8> ScopeHistory;

json::OStream JOS;

std::unique_ptr<DelimitedScope> OuterScope;

public:

JSONScopedPrinter(raw_ostream &OS, bool PrettyPrint = false,

jhenderson: How come we need to stringify the value here? That seems surprising to me, and may lead to undesirable behaviour in my mind (e.g. a field for an ELF struct might be a number for the 32-bit ELF format, but a string in the 64-bit ELF format (and not all numbers will be strings in the latter case, where they are not 64 bits)).

I have this vague memory that the LLVM JSON format doesn't support 64-bit numbers. If so, I think this is a mistake and should be fixed.

Jaysonyan (author): I had originally believed that LLVM JSON didn't support `uint64_t` types, but I see that support has since been added here: https://github.com/llvm/llvm-project/commit/8c3adce81dc36306ba30cda0cdf458cfcf7d076c so I think I can just remove the `to_string(...)`.

std::unique_ptr<DelimitedScope> &&OuterScope =

jhenderson: I'm not a fan of having to have an enum for the scoping kind, having looked at this. I'd much rather a class that does RAII-style management of the printer be passed in and ownership taken by this printer. I was going to suggest the following interface, but I see that the `DelimitedScope` takes a `ScopedPrinter`, leading to a chicken-and-egg problem.

JSONScopedPrinter(raw_ostream &OS, bool PrettyPrint = false, std::unique_ptr<DelimitedScope> &&Scope = std::unique_ptr<DelimitedScope>());

The first argument is unchanged. The second argument means control of the details of indentation is left to the class, rather than needing to know what "2" or "0" means (especially as "0" is special, as it means no new lines either). The third argument allows a user to provide the appropriate scoping mechanism for their use-case, but leaves the class to take and manage ownership; if it's defaulted, then no scoping will be used.

NB: the functionality of the second and third arguments would need testing (both default and explicit values), if they aren't already.

I wonder if the way to resolve this circular issue is to have the `DelimitedScope` subclasses have a default constructor, which delays assigning the `ScopedPrinter` to a separate function. The scope opening would then be performed when the class is assigned its printer (and closing would only happen if a printer has been assigned). What do you think of this?

Jaysonyan (author): I agree that constructing the `JSONScopedPrinter` with an enum is less preferable than constructing it with a `DelimitedScope`. I've implemented your suggestion to allow `DelimitedScope` subclasses to be default constructed and have the `ScopedPrinter` added later. This allowed us to construct `JSONScopedPrinter` with a `DelimitedScope` rather than using an enum.

Making this change also pointed out a weird piece of code which accessed the `ScopedPrinter` through the `DelimitedScope` member even though it had access to the original `ScopedPrinter`. So I've updated that piece of code to access the `ScopedPrinter` directly.

I've also added tests for pretty printing and for the `DelimitedScope` ctor param. Since they require specific ctor calls they don't use the test fixture.
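The deferred-assignment idea discussed in this thread can be sketched in isolation. The code below is a simplified illustration under stated assumptions, not the patch itself: `Printer`, `Scope` and `ListScope` are hypothetical stand-ins for `ScopedPrinter`, `DelimitedScope` and `ListScope`, and output goes to a plain `std::string` rather than a `raw_ostream`.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

struct Printer; // Hypothetical stand-in for ScopedPrinter.

// Stand-in for DelimitedScope: can be created before its printer exists.
struct Scope {
  Scope() = default;
  virtual ~Scope() = default;

  // Attaches the printer and opens the scope.
  virtual void setPrinter(Printer &P) = 0;

protected:
  Printer *W = nullptr; // Stays null until a printer is assigned.
};

struct Printer {
  std::string &Out;             // Output sink owned by the caller.
  std::unique_ptr<Scope> Outer; // Optional outermost scope, owned here.

  explicit Printer(std::string &Out,
                   std::unique_ptr<Scope> OuterScope = nullptr)
      : Out(Out), Outer(std::move(OuterScope)) {
    if (Outer)
      Outer->setPrinter(*this); // The scope opens once the printer exists.
  }
};

// Stand-in for ListScope.
struct ListScope : Scope {
  ListScope() = default;
  explicit ListScope(Printer &P) { setPrinter(P); }

  void setPrinter(Printer &P) override {
    assert(!W && "printer already assigned");
    W = &P;
    P.Out += '[';
  }

  // Only close a scope that was actually opened.
  ~ListScope() override {
    if (W)
      W->Out += ']';
  }
};
```

Because the printer owns the outer scope, the closing delimiter is written when the printer is destroyed; a default-constructed scope that never receives a printer writes nothing.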

std::unique_ptr<DelimitedScope>{});

static bool classof(const ScopedPrinter *SP) {

return SP->getKind() == ScopedPrinter::ScopedPrinterKind::JSON;

}

void printNumber(StringRef Label, uint64_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, uint32_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, uint16_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, uint8_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, int64_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, int32_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, int16_t Value) override {

JOS.attribute(Label, Value);

}

jhenderson: Similar point to the above, but slightly different: I think we should compare this to `std::numeric_limits<uint64_t>::max()` and print as a number, if possible, rather than as a string.

Jaysonyan (author): I updated this to just print the full `APSInt` unconditionally. Would it be preferable to cap printing numbers at `std::numeric_limits<uint64_t>::max()` and then fall back on strings otherwise? JSON doesn't have a max numerical value, but arbitrarily long values may not always be supported: https://stackoverflow.com/a/39681707

jhenderson: Let's leave it without the cap, if we have a way of sensibly handling it. I think changing the type based on the value could get very confusing, and since, as you say, JSON has no firm limitations, printing out arbitrarily large numbers seems sensible.

An alternative (and probably better) approach would be to add support for this type to the JSON library directly (in a precursor patch).

void printNumber(StringRef Label, int8_t Value) override {

JOS.attribute(Label, Value);

}

void printNumber(StringRef Label, const APSInt &Value) override {

JOS.attributeBegin(Label);

printAPSInt(Value);

JOS.attributeEnd();

}

void printBoolean(StringRef Label, bool Value) override {

JOS.attribute(Label, Value);

jhenderson: Similar to my comments in D114223, I'd recommend adding helper functions to avoid some of the logic duplication in the following methods.

}

void printList(StringRef Label, const ArrayRef<bool> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<std::string> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<uint64_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<uint32_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<uint16_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<uint8_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<int64_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<int32_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<int16_t> List) override {

printListImpl(Label, List);

}

jhenderson:

private:
- // Output hex values as JSON numbers so that they're easier to parse.
+ // Output HexNumbers as decimal so that they're easier to parse.
uint64_t hexNumberToInt(HexNumber Hex) { return Hex.Value; }

I'd use the type name explicitly here. The concept of a "hex" number is purely a representation thing, so it doesn't really make sense to refer to a number as a "hex number" until it's been written out somewhere. Numbers are just numbers.

I also suggest referring to the base explicitly in this comment for how they are output.

void printList(StringRef Label, const ArrayRef<int8_t> List) override {

printListImpl(Label, List);

}

void printList(StringRef Label, const ArrayRef<APSInt> List) override {

jhenderson: I'd pull this into a little helper function, shared by `printNumber`. It just helps avoid a (small) amount of duplication, but also helps label what the code does. (Ideally, I'd actually suggest enhancing the JSON output stream interface, but up to you whether you want to go that far.)

Jaysonyan (author): I've added a helper function for now. I don't imagine it would take too much to enhance the JSON class to handle `APSInt` but I don't have the capacity to investigate right now.

JOS.attributeArray(Label, [&]() {

for (const APSInt &Item : List) {

printAPSInt(Item);

jhenderson: If I'm not mistaken, this is going to lead to an array of mixed types (some strings and some integers). I'm not convinced that's desirable, since it will mean needing to switch on type to determine how to handle the field. Perhaps we should do what we do elsewhere and have an array of objects, where the object has name (possibly optional) and value fields?

Alternatively, maybe the actual name isn't even needed, and the interesting thing is just the individual flag values. I guess that depends on how human readable we want this JSON output to be.

Jaysonyan (author): This was an attempt, similar to the change in the `ScopedPrinter` implementation of `printFlagsImpl`, to handle the 2 types of public `printFlags` methods. I've added a `printRawFlagsImpl` to handle printing `Flag.Value` and updated this method to print both `Flag.Name` and `Flag.Value`. This seems to best match the information that the `ScopedPrinter` implementation provides.

}

});

}

void printString(StringRef Value) override { JOS.value(Value); }

void printString(StringRef Label, StringRef Value) override {

JOS.attribute(Label, Value);

}

jhenderson:

bool IsNumList = true;
- for (const auto &Item : List) {
+ for (StringRef Item : List) {
if (!(!Item.empty() && std::all_of(Item.begin(), Item.end(), ::isdigit)))

Same below.

jhenderson: I'm confused by this additional logic. It seems to me you're passing in a list of strings, but those strings might actually be numbers? That suggests to me that something isn't right with the calling code and/or our interface, rather than with this function. For example, imagine our list happens to be a list of strings that are purely made up of digits, but the fact that they are digits is purely chance, and they should remain as strings. This function will convert them to a list of integers. Further, a future new element in the same list without an all-digit input would cause the type of ALL the members of the list to change back to strings.

Jaysonyan (author): Yea this makes sense. The problem I was trying to solve was that the public method `printList(...)` takes a list of a templated type, but to pass it to the `printListImpl(...)` method we currently use the `to_string` method. So passing a list of numbers to `printList(...)` would result in a list of strings for the `JSONScopedPrinter`.

I've reverted this change to just unconditionally print as strings for now, but maybe it would be preferable to replace the template param `printList(...)` with multiple overloaded `printList(...)` methods that can delegate to `printStringListImpl(...)`/`printNumberListImpl(...)`/`printBooleanListImpl(...)` methods. Interested if you have any thoughts on this.

jhenderson: I think multiple functions is the way forward, although I'd just call them all `printListImpl` and let the `ArrayRef` type determine the version to call. Any particular reason we'd need multiple `printList` functions too? Seems like it's something that could be determined without it.

Some rough pseudo code:

template <typename T>
void printList(StringRef Label, const ArrayRef<T> List) {
  printListImpl(Label, List);
}

virtual void printListImpl(StringRef, ArrayRef<int> List) { /*...*/ }
virtual void printListImpl(StringRef, ArrayRef<bool> List) { /*...*/ }
virtual void printListImpl(StringRef Label, ArrayRef<std::string> List) { /*...*/ }

Actually, now that I type that out, I wonder if you could just get rid of `printListImpl` entirely, and make `printList` a set of virtual non-templated functions with multiple overloads for the different supported types? (Potentially you might still want a non-virtual `printListImpl` (that could be templated) to avoid code duplication, of course.)

Jaysonyan (author): I've added overloaded methods for `printList(...)` inside the virtualization diff and updated the ScopedPrinter test diff and this diff accordingly.

The implementation roughly follows the ideas you've laid out, but the major difference is that the overloaded methods are public `printList(...)` methods rather than private `printListImpl(...)`. This was needed because we still need to maintain a template `printList(...)` method to fall back on for lists not comprised of strings, ints, or booleans.
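The design the thread settles on (virtual overloads for the supported element types plus a template fallback) can be sketched roughly as follows. This is an assumption-laden reduction rather than the patch's code: `ListPrinter` and `join` are hypothetical names, `std::vector` stands in for `ArrayRef`, only two overloads are shown, and the result is returned as text instead of being written to a stream.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical reduction of the printList design.
class ListPrinter {
public:
  virtual ~ListPrinter() = default;

  // Virtual overloads for directly supported element types. Overload
  // resolution prefers these over the template when the type matches exactly.
  virtual std::string printList(const std::vector<std::string> &List) {
    return join(List, /*Quote=*/true);
  }
  virtual std::string printList(const std::vector<int64_t> &List) {
    return join(List, /*Quote=*/false);
  }

  // Non-virtual template fallback for any other streamable element type.
  // Items are stringified, so they are emitted quoted.
  template <typename T> std::string printList(const std::vector<T> &List) {
    return join(List, /*Quote=*/true);
  }

private:
  // Shared helper that avoids duplicating the loop in every overload.
  template <typename T>
  static std::string join(const std::vector<T> &List, bool Quote) {
    std::ostringstream OS;
    OS << '[';
    for (std::size_t I = 0; I < List.size(); ++I) {
      if (I != 0)
        OS << ',';
      if (Quote)
        OS << '"' << List[I] << '"';
      else
        OS << List[I];
    }
    OS << ']';
    return OS.str();
  }
};
```

With this shape, the element type alone decides whether a list is emitted as numbers or strings, so a list's JSON type never depends on what its values happen to look like.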

void objectBegin() override {

scopedBegin({Scope::Object, ScopeKind::NoAttribute});

}

void objectBegin(StringRef Label) override {

scopedBegin(Label, Scope::Object);

}

void objectEnd() override { scopedEnd(); }

void arrayBegin() override {

scopedBegin({Scope::Array, ScopeKind::NoAttribute});

jhenderson:

});
}
- // Output hex values as JSON number so it's easier to parse
+ // Output hex values as JSON numbers so that they're easier to parse.
void printHexListImpl(StringRef Label,

I'm inclined to say we should have the comment by each function. Perhaps better though, at the cost of a little more code than would otherwise be necessary, would be to have a small helper function `hexNumberToInt` or similar, which is then called in all places you currently do `HexNumber.Value`. You could then put the comment with that function, in one place.

}

void arrayBegin(StringRef Label) override {

scopedBegin(Label, Scope::Array);

}

void arrayEnd() override { scopedEnd(); }

private:

jhenderson:

private:
- // Output HexNumber as decimal so that they're easier to parse.
+ // Output HexNumbers as decimal so that they're easier to parse.
uint64_t hexNumberToInt(HexNumber Hex) { return Hex.Value; }

This comment applies to all `HexNumber` instances, not just the one passed in, so it should be plural.

// Output HexNumbers as decimals so that they're easier to parse.

uint64_t hexNumberToInt(HexNumber Hex) { return Hex.Value; }

void printAPSInt(const APSInt &Value) {

JOS.rawValueBegin() << Value;

JOS.rawValueEnd();

}

void printFlagsImpl(StringRef Label, HexNumber Value,

ArrayRef<FlagEntry> Flags) override {

JOS.attributeObject(Label, [&]() {

JOS.attribute("RawFlags", hexNumberToInt(Value));

JOS.attributeArray("Flags", [&]() {

for (const FlagEntry &Flag : Flags) {

JOS.objectBegin();

JOS.attribute("Name", Flag.Name);

JOS.attribute("Value", Flag.Value);

JOS.objectEnd();

}

});

jhenderson: Here and in the other loop below, I'd avoid the use of `auto`: it doesn't improve readability, and we tend to avoid the "always use auto" approach in LLVM, when the type isn't obvious from that line.

});

}

jhenderson: I was thinking about this, and I think we shouldn't actually print as hex in JSON, and should fall back to the standard number printing. Here's the reasoning: most of the time, we use `printHex` to print some number that represents an offset, or other value in a fixed-size field, that is easier to read as hex. However, JSON format is not intended for human consumption, at least not in our use-case for this code. As such, readability is of lesser concern than parseability. If we were to store hex numbers as strings, rather than converting them to their decimal counterparts, we'd end up with some numeric values as strings, and others as integers, in a seemingly arbitrary manner to the end-user. I don't think that this is desirable.

Jaysonyan (author): That seems reasonable to me! Updated to output numeric values over hex strings.

jhenderson: Perhaps worth a comment here elaborating on the reasoning, so that future users understand why.

Jaysonyan (author): I've added a comment at the top of the first method which outputs hex. I wasn't sure whether it was desirable to add a comment at every location that outputs hex. Let me know if there's a preference for one over the other.

void printFlagsImpl(StringRef Label, HexNumber Value,

ArrayRef<HexNumber> Flags) override {

JOS.attributeObject(Label, [&]() {

JOS.attribute("RawFlags", hexNumberToInt(Value));

JOS.attributeArray("Flags", [&]() {

for (const HexNumber &Flag : Flags) {

JOS.value(Flag.Value);

}

});

jhenderson: I would actually print this as two separate attributes: a symbol name, and a numeric offset. That'll be easier for consumers to parse. Perhaps these should be in a separate object, to avoid surprise name clashes. Rather than:

"Label":"Symbol+0x100"

print either:

"Label":"Symbol",
"LabelOffset":256

or:

"Label":{
  "SymName":"Symbol",
  "Offset":256
}
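The nested-object form suggested in this comment can be illustrated with a small standalone sketch. `symbolOffsetToJSON` is a hypothetical helper, not part of the patch or of LLVM's JSON library, and it skips JSON string escaping for brevity, so it assumes the label and symbol contain no characters that need escaping.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical helper: renders a symbol and offset as a nested object, with
// the offset emitted as a decimal JSON number rather than a hex string.
std::string symbolOffsetToJSON(const std::string &Label,
                               const std::string &Symbol, uint64_t Offset) {
  return "\"" + Label + "\":{\"SymName\":\"" + Symbol +
         "\",\"Offset\":" + std::to_string(Offset) + "}";
}
```

A consumer can then read the symbol name and the offset as separate, consistently typed fields instead of splitting a `Symbol+0x100` string.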

});

}

jhenderson: Slightly surprised to see the new `if` here and a little further below. Could you explain them, please? Are you sure it's better to have attributes missing rather than as empty strings (I'm not entirely sure either way, and it probably depends on the context to some extent, but I'd focus on ease of parsing)?

Aside: reminder that for single-line `if` and loop statements, remove the braces, i.e. here and in the `for` loop below.

Jaysonyan (author): This was to match more closely the public methods available. Some public methods of `printBinary(...)` don't provide a `Str` param, so it's just passed into `printBinaryImpl` as `StringRef()`, and the existing `ScopedPrinter::printBinaryImpl` does a similar check for `Str.empty()`. Similarly, there are public `printBinary(...)` and `printBinaryBlock(...)` methods, where the latter prints the accompanying characters and the former does not. I'm not opposed to unconditionally printing these values, but since these checks are used to differentiate different public methods inside `ScopedPrinter::printBinaryImpl`, I thought it made sense to do the same for `JSONScopedPrinter::printBinaryImpl`.

My mistake about the `if` statements, I'll try to be more careful about it in the future.

jhenderson: Let's leave it as-is for now. We can always add attributes should we desire to at a future point.

That being said, I'm wondering if the Character block is actually useful for JSON output, since the character is just a human-readable version of the byte. If we're assuming people won't be directly reading JSON output and instead will be processing it themselves, the Character output wouldn't be useful at all, I think (they could reproduce it from the Bytes data, if needed). Thoughts?

Jaysonyan (author): You're right, adding the characters doesn't provide much value if we're analyzing the output through automated scripts. I think it makes sense to remove this block.

template <typename T> void printListImpl(StringRef Label, const T &List) {

JOS.attributeArray(Label, [&]() {

for (const auto &Item : List)

jhenderson: You should check through the existing usages, but at least in llvm-readelf, I think most labels use UpperCamelCase style, so printing as `MyValueRaw` would look a little cleaner. Also, see my above comments re. symbol offset about a nested object/naming clash risk, i.e. I'd recommend:

"Label":{
  "Value":1234,
  "RawValue":4321
}

JOS.value(Item);

});

}

void printHexListImpl(StringRef Label,

jhenderson: Why's this being left as a todo?

Jaysonyan (author): My mistake, this was meant to be implemented before being put up for review.

const ArrayRef<HexNumber> List) override {

JOS.attributeArray(Label, [&]() {

jhenderson:

void scopedBegin(ScopeContext ScopeCtx) {
- if (ScopeCtx.Context == Scope::Object) {
+ if (ScopeCtx.Context == Scope::Object)
JOS.objectBegin();
- } else if (ScopeCtx.Context == Scope::Array) {
+ else if (ScopeCtx.Context == Scope::Array)
JOS.arrayBegin();
- }
ScopeHistory.push_back(ScopeCtx);

for (const HexNumber &Item : List) {

JOS.value(hexNumberToInt(Item));

}

});

}

void printHexImpl(StringRef Label, HexNumber Value) override {

JOS.attribute(Label, hexNumberToInt(Value));

}

void printHexImpl(StringRef Label, StringRef Str, HexNumber Value) override {

JOS.attributeObject(Label, [&]() {

JOS.attribute("Value", Str);

JOS.attribute("RawValue", hexNumberToInt(Value));

});

}

void printSymbolOffsetImpl(StringRef Label, StringRef Symbol,

HexNumber Value) override {

JOS.attributeObject(Label, [&]() {

JOS.attribute("SymName", Symbol);

JOS.attribute("Offset", hexNumberToInt(Value));

});

}

void printNumberImpl(StringRef Label, StringRef Str,

StringRef Value) override {

JOS.attributeObject(Label, [&]() {

JOS.attribute("Value", Str);

JOS.attributeBegin("RawValue");

JOS.rawValueBegin() << Value;

JOS.rawValueEnd();

JOS.attributeEnd();

});

}

void printBinaryImpl(StringRef Label, StringRef Str, ArrayRef<uint8_t> Value,

bool Block, uint32_t StartOffset = 0) override {

JOS.attributeObject(Label, [&]() {

if (!Str.empty())

jhenderson: I'm now looking at this `Index` field and wondering if it's useful at this position? We're in an array already, so the index should match the array position. It doesn't (so should be called `Offset`), but then why have an entry per byte, when one per array would be less verbose, and equally as informative? It would also allow the values to be written directly in the array rather than in a nested object. I.e.

{
  "Value" : <Str>,
  "Offset" : <StartOffset>,
  "Bytes" : [Value[0], Value[1], ...]
}

Jaysonyan (author): This makes a lot of sense and seems like a better format to me. Updated to match this format.
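To make the agreed layout concrete, here is a standalone sketch that renders one `Value`/`Offset`/`Bytes` object. `binaryToJSON` is a hypothetical helper, not the patch's `printBinaryImpl`: it builds a string instead of writing through `json::OStream`, skips string escaping, and omits `Value` when the string is empty, mirroring the `Str.empty()` check discussed earlier in the review.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper: one object per binary blob, with a single start offset
// and the bytes written directly into an array as decimal numbers.
std::string binaryToJSON(const std::string &Str, uint32_t StartOffset,
                         const std::vector<uint8_t> &Bytes) {
  std::string Out = "{";
  if (!Str.empty())
    Out += "\"Value\":\"" + Str + "\",";
  Out += "\"Offset\":" + std::to_string(StartOffset) + ",\"Bytes\":[";
  for (std::size_t I = 0; I < Bytes.size(); ++I) {
    if (I != 0)
      Out += ',';
    // Cast so the byte is formatted as a number, not as a character.
    Out += std::to_string(static_cast<unsigned>(Bytes[I]));
  }
  Out += "]}";
  return Out;
}
```

Each byte's position is recoverable as `Offset` plus its array index, which is why a per-byte index field is redundant in this layout.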

JOS.attribute("Value", Str);

JOS.attribute("Offset", StartOffset);

JOS.attributeArray("Bytes", [&]() {

for (uint8_t Val : Value)

JOS.value(Val);

});

}

void scopedBegin(ScopeContext ScopeCtx) {

if (ScopeCtx.Context == Scope::Object)

JOS.objectBegin();

else if (ScopeCtx.Context == Scope::Array)

JOS.arrayBegin();

ScopeHistory.push_back(ScopeCtx);

}

void scopedBegin(StringRef Label, Scope Ctx) {

ScopeKind Kind = ScopeKind::Attribute;

if (ScopeHistory.empty() || ScopeHistory.back().Context != Scope::Object) {

JOS.objectBegin();

Kind = ScopeKind::NestedAttribute;

}

JOS.attributeBegin(Label);

scopedBegin({Ctx, Kind});

}

void scopedEnd() {

ScopeContext ScopeCtx = ScopeHistory.back();

if (ScopeCtx.Context == Scope::Object)

JOS.objectEnd();

else if (ScopeCtx.Context == Scope::Array)

JOS.arrayEnd();

if (ScopeCtx.Kind == ScopeKind::Attribute ||

ScopeCtx.Kind == ScopeKind::NestedAttribute)

JOS.attributeEnd();

if (ScopeCtx.Kind == ScopeKind::NestedAttribute)

JOS.objectEnd();

ScopeHistory.pop_back();

}

};

struct DelimitedScope { struct DelimitedScope {

DelimitedScope(ScopedPrinter &W) : W(W) {} DelimitedScope(ScopedPrinter &W) : W(&W) {}

DelimitedScope() : W(nullptr) {}

virtual ~DelimitedScope(){}; virtual ~DelimitedScope(){};

ScopedPrinter &W; virtual void setPrinter(ScopedPrinter &W) = 0;

ScopedPrinter *W;

}; };

struct DictScope : DelimitedScope { struct DictScope : DelimitedScope {

explicit DictScope() : DelimitedScope() {}

explicit DictScope(ScopedPrinter &W) : DelimitedScope(W) { W.objectBegin(); } explicit DictScope(ScopedPrinter &W) : DelimitedScope(W) { W.objectBegin(); }

DictScope(ScopedPrinter &W, StringRef N) : DelimitedScope(W) { DictScope(ScopedPrinter &W, StringRef N) : DelimitedScope(W) {

W.objectBegin(N); W.objectBegin(N);

} }

~DictScope() { W.objectEnd(); } void setPrinter(ScopedPrinter &W) override {

this->W = &W;

W.objectBegin();

}

~DictScope() {

if (W)

W->objectEnd();

}

}; };

struct ListScope : DelimitedScope { struct ListScope : DelimitedScope {

explicit ListScope() : DelimitedScope() {}

explicit ListScope(ScopedPrinter &W) : DelimitedScope(W) { W.arrayBegin(); } explicit ListScope(ScopedPrinter &W) : DelimitedScope(W) { W.arrayBegin(); }

ListScope(ScopedPrinter &W, StringRef N) : DelimitedScope(W) { ListScope(ScopedPrinter &W, StringRef N) : DelimitedScope(W) {

W.arrayBegin(N); W.arrayBegin(N);

} }

~ListScope() { W.arrayEnd(); } void setPrinter(ScopedPrinter &W) override {

this->W = &W;

W.arrayBegin();

}

~ListScope() {

if (W)

W->arrayEnd();

}

}; };

} // namespace llvm } // namespace llvm

#endif #endif

llvm/lib/Support/ScopedPrinter.cpp

Show All 37 Lines void ScopedPrinter::printBinaryImpl(StringRef Label, StringRef Str,

} else { } else {

startLine() << Label << ":"; startLine() << Label << ":";

if (!Str.empty()) if (!Str.empty())

OS << " " << Str; OS << " " << Str;

OS << " (" << format_bytes(Data, None, Data.size(), 1, 0, true) << ")\n"; OS << " (" << format_bytes(Data, None, Data.size(), 1, 0, true) << ")\n";

} }

JSONScopedPrinter::JSONScopedPrinter(

raw_ostream &OS, bool PrettyPrint,

std::unique_ptr<DelimitedScope> &&OuterScope)

: ScopedPrinter(OS, ScopedPrinter::ScopedPrinterKind::JSON),

JOS(OS, /*Indent=*/PrettyPrint ? 2 : 0),

jhendersonUnsubmitted

Done

: ScopedPrinter(OS, ScopedPrinter::ScopedPrinterKind::JSON),

- JOS(OS, PrettyPrint ? 2 : 0), OuterScope(std::move(OuterScope)) {

+ JOS(OS, /*Indent=*/PrettyPrint ? 2 : 0), OuterScope(std::move(OuterScope)) {

if (this->OuterScope)

Consider adding a comment, as suggested inline, to "name" the pretty print/indentation parameter. The name should match the parameter's name.


OuterScope(std::move(OuterScope)) {

if (this->OuterScope)

jhendersonUnsubmitted

Done

if (this->OuterScope)

- this->OuterScope.get()->setPrinter(*this);

+ this->OuterScope->setPrinter(*this);

}

} // namespace llvm

I don't think you need the .get()?
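As a small illustration of why the `.get()` is redundant: `std::unique_ptr` provides `operator->`, which reaches the same object as calling through the raw pointer. The `Scope` type below is a hypothetical stand-in for `DelimitedScope`, used only to keep the sketch self-contained.

```cpp
#include <cassert>
#include <memory>

// Hypothetical stand-in for DelimitedScope; counts setPrinter() calls.
struct Scope {
  int Calls = 0;
  void setPrinter() { ++Calls; }
};

// Calls setPrinter() once through .get() and once through operator->,
// then returns how many calls reached the object.
int callBothWays() {
  std::unique_ptr<Scope> Outer = std::make_unique<Scope>();
  assert(Outer && "pointer must be non-null before dereferencing");
  Outer.get()->setPrinter(); // explicit raw pointer, as in the old line
  Outer->setPrinter();       // operator-> reaches the same object
  return Outer->Calls;
}
```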


this->OuterScope->setPrinter(*this);

}

} // namespace llvm } // namespace llvm

llvm/tools/llvm-readobj/ARMEHABIPrinter.h

Show First 20 Lines • Show All 506 Lines • ▼ Show 20 Lines	if (ErrorOr<StringRef> Name = FunctionAtAddress(Address, SecIndex))
SW.printString("PersonalityRoutineName", *Name);		SW.printString("PersonalityRoutineName", *Name);
}		}
}		}

template <typename ET>		template <typename ET>
void PrinterContext<ET>::PrintOpcodes(const uint8_t *Entry,		void PrinterContext<ET>::PrintOpcodes(const uint8_t *Entry,
size_t Length, off_t Offset) const {		size_t Length, off_t Offset) const {
ListScope OCC(SW, "Opcodes");		ListScope OCC(SW, "Opcodes");
OpcodeDecoder(OCC.W).Decode(Entry, Offset, Length);		OpcodeDecoder(SW).Decode(Entry, Offset, Length);
}		}

template <typename ET>		template <typename ET>
void PrinterContext<ET>::PrintIndexTable(unsigned SectionIndex,		void PrinterContext<ET>::PrintIndexTable(unsigned SectionIndex,
const Elf_Shdr *IT) const {		const Elf_Shdr *IT) const {
// TODO: handle failure.		// TODO: handle failure.
Expected<ArrayRef<uint8_t>> Contents = ELF.getSectionContents(*IT);		Expected<ArrayRef<uint8_t>> Contents = ELF.getSectionContents(*IT);
if (!Contents)		if (!Contents)
▲ Show 20 Lines • Show All 124 Lines • Show Last 20 Lines

llvm/unittests/Support/ScopedPrinterTest.cpp

//===- llvm/unittest/Support/ScopedPrinterTest.cpp - ScopedPrinter tests --===// //===- llvm/unittest/Support/ScopedPrinterTest.cpp - ScopedPrinter tests --===//

// //

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information. // See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "llvm/Support/ScopedPrinter.h" #include "llvm/Support/ScopedPrinter.h"

#include "llvm/ADT/APSInt.h" #include "llvm/ADT/APSInt.h"

#include "gtest/gtest.h" #include "gtest/gtest.h"

#include <vector> #include <vector>

using namespace llvm; using namespace llvm;

TEST(JSONScopedPrinterTest, PrettyPrintCtor) {

jhendersonUnsubmitted

Done

I wouldn't bother with the anonymous namespace. There'd only be a problem if someone else started writing ScopedPrinterTests in another file, which I think we'd want to spot and possibly stop.


auto PrintFunc = [](ScopedPrinter &W) {

DictScope D(W);

jhendersonUnsubmitted

Done

For new unit tests that are not to do with the JSONScopedPrinter class, I'd put them in their own patch, as they are independently useful.


JaysonyanAuthorUnsubmitted

Done

Moved ScopedPrinter-related tests to D114684 and rebased off that patch.


W.printString("Key", "Value");

jhendersonUnsubmitted

Done

What's the point of the lambda? It seems like it's adding unnecessary complexity to the test, when you could just do:

std::string ScopedString;
llvm::raw_string_ostream OS(ScopedString);
ScopedPrinter Writer(OS);

// <current body of the lambda>

const char *Out = /*...*/;
EXPECT_EQ(Out, ScopedString);


JaysonyanAuthorUnsubmitted

Done

I haven't removed the lambda but it's now using a test fixture which takes a lambda. It doesn't need to use the test fixture since we're not testing the JSONScopedPrinter but I'd imagine it might be nice for the sake of consistency with other tests. Let me know if you feel otherwise.


};

std::string StreamBuffer;

raw_string_ostream OS(StreamBuffer);

JSONScopedPrinter PrettyPrintWriter(OS, /*PrettyPrint=*/true);

JSONScopedPrinter NoPrettyPrintWriter(OS, /*PrettyPrint=*/false);

const char *PrettyPrintOut = R"({

"Key": "Value"

})";

const char *NoPrettyPrintOut = R"({"Key":"Value"})";

PrintFunc(PrettyPrintWriter);

EXPECT_EQ(PrettyPrintOut, OS.str());

StreamBuffer.clear();

PrintFunc(NoPrettyPrintWriter);

EXPECT_EQ(NoPrettyPrintOut, OS.str());

jhendersonUnsubmitted

Done

This string isn't really scoped itself. It's more the buffer used by the stream, so I'd just call it Buffer or StreamBuffer.


}

jhendersonUnsubmitted

Done

std::string ScopedString;

- llvm::raw_string_ostream OS(ScopedString);

+ raw_string_ostream OS(ScopedString);

ScopedPrinter Writer(OS);


TEST(JSONScopedPrinterTest, DelimitedScopeCtor) {

std::string StreamBuffer;

raw_string_ostream OS(StreamBuffer);

{

JSONScopedPrinter DictScopeWriter(OS, /*PrettyPrint=*/false,

std::make_unique<DictScope>());

DictScopeWriter.printString("Label", "DictScope");

}

EXPECT_EQ(R"({"Label":"DictScope"})", OS.str());

StreamBuffer.clear();

{

JSONScopedPrinter ListScopeWriter(OS, /*PrettyPrint=*/false,

std::make_unique<ListScope>());

ListScopeWriter.printString("ListScope");

}

EXPECT_EQ(R"(["ListScope"])", OS.str());

jhendersonUnsubmitted

Done

For symmetry, I might be inclined to print a string in each of the three cases, not just the "no scope" case.


StreamBuffer.clear();

{

JSONScopedPrinter NoScopeWriter(OS, /*PrettyPrint=*/false);

NoScopeWriter.printString("NoScope");

}

EXPECT_EQ(R"("NoScope")", OS.str());

}

class ScopedPrinterTest : public ::testing::Test { class ScopedPrinterTest : public ::testing::Test {

protected: protected:

std::string StreamBuffer; std::string StreamBuffer;

raw_string_ostream OS; raw_string_ostream OS;

ScopedPrinter Writer; ScopedPrinter Writer;

JSONScopedPrinter JSONWriter;

bool HasPrintedToJSON;

ScopedPrinterTest() : OS(StreamBuffer), Writer(OS) {} ScopedPrinterTest()

: OS(StreamBuffer), Writer(OS), JSONWriter(OS, /*PrettyPrint=*/true),

jhendersonUnsubmitted

Done

ScopedPrinterTest()

- : OS(StreamBuffer), Writer(OS), JSONWriter(OS, true),

+ : OS(StreamBuffer), Writer(OS), JSONWriter(OS, /*PrettyPrint=*/true),

HasPrintedToJSON(false) {}

I'd add a comment as suggested inline, to explain the boolean.


HasPrintedToJSON(false) {}

using PrintFunc = function_ref<void(ScopedPrinter &)>; using PrintFunc = function_ref<void(ScopedPrinter &)>;

void verifyScopedPrinter(StringRef Expected, PrintFunc Func) { void verifyScopedPrinter(StringRef Expected, PrintFunc Func) {

Func(Writer); Func(Writer);

Writer.flush(); Writer.flush();

EXPECT_EQ(Expected.str(), OS.str()); EXPECT_EQ(Expected.str(), OS.str());

StreamBuffer.clear();

}

void verifyJSONScopedPrinter(StringRef Expected, PrintFunc Func) {

{

DictScope D(JSONWriter);

Func(JSONWriter);

}

JSONWriter.flush();

EXPECT_EQ(Expected.str(), OS.str());

StreamBuffer.clear();

HasPrintedToJSON = true;

}

void verifyAll(StringRef ExpectedOut, StringRef JSONExpectedOut,

jhendersonUnsubmitted

Done

This setup logic might benefit from being pulled into a test fixture, to reduce the duplication between tests.

The JSON stuff could go in the same fixture.


PrintFunc Func) {

verifyScopedPrinter(ExpectedOut, Func);

verifyJSONScopedPrinter(JSONExpectedOut, Func);

}

void TearDown() {

// JSONScopedPrinter fails an assert if nothing's been printed.

if (!HasPrintedToJSON)

JSONWriter.printString("");

} }

}; };

TEST_F(ScopedPrinterTest, GetKind) {

jhendersonUnsubmitted

Done

This and the classof test don't rely on the fixture, so I'd a) change them to not use the fixture, and b) move them above it. Alternatively, see my comment below.


EXPECT_EQ(ScopedPrinter::ScopedPrinterKind::Base, Writer.getKind());

EXPECT_EQ(ScopedPrinter::ScopedPrinterKind::JSON, JSONWriter.getKind());

}

TEST_F(ScopedPrinterTest, ClassOf) {

EXPECT_TRUE(ScopedPrinter::classof(&Writer));

EXPECT_TRUE(JSONScopedPrinter::classof(&JSONWriter));

EXPECT_FALSE(ScopedPrinter::classof(&JSONWriter));

EXPECT_FALSE(JSONScopedPrinter::classof(&Writer));

jhendersonUnsubmitted

Done

Either "nothing is" or "nothing's". Same below.

You might want to consider enhancing the current fixture, as an alternative to making these two tests not use it. In this case, I'd add the std::string, raw_string_ostream, ScopedPrinter and JSONScopedPrinter local variables used here and in the verify* functions into the base class, and then use the TearDown method to ensure JSONScopedPrinter has that empty string written.

Aside: it seems to me that this assertion is bogus - it's not that unreasonable to create a printer, but write nothing to it, to get an empty output.


JaysonyanAuthorUnsubmitted

Done

Updated the test fixture to handle the teardown. I needed to add a check to ensure we're only printing an empty string if there hasn't been a call to verifyJSONScopedPrinter since printString can only be called under specific contexts. Alternatively I could call something like DictScope(W, "Label") which can be called under all contexts (since it uses the history stack) but I held off because calling printString("") with the empty string felt more appropriate. Let me know if you have any opinions on this.


jhendersonUnsubmitted

Done

The only other idea I had was to make JSONScopedPrinter an Optional in the fixture, initialised in the corresponding verify method (and optionally in other tests, if needed), but I don't think that works.

I'm happy going with your suggestion.


JaysonyanAuthorUnsubmitted

Done

I'll leave it as is for now, possibly in the future that assertion will be removed and this extra work can just be deleted.
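The guard discussed in this thread can be sketched without gtest or LLVM. Everything below is a hypothetical model: `StrictWriter` stands in for a printer that must receive at least one value, and `Fixture` mirrors the `HasPrintedToJSON` flag so that teardown writes an empty string only when no verify call has already produced output.

```cpp
#include <cassert>
#include <functional>
#include <string>

// Hypothetical stand-in for a printer that must receive at least one value.
struct StrictWriter {
  std::string Out;
  int Writes = 0;
  void printString(const std::string &S) {
    Out += S;
    ++Writes;
  }
};

// Hypothetical fixture mirroring the HasPrintedToJSON guard.
struct Fixture {
  StrictWriter JSONWriter;
  bool HasPrinted = false;

  void verify(const std::string &Expected,
              const std::function<void(StrictWriter &)> &Func) {
    Func(JSONWriter);
    assert(JSONWriter.Out == Expected && "unexpected output");
    JSONWriter.Out.clear();
    HasPrinted = true; // Teardown no longer needs to write anything.
  }

  void tearDown() {
    // Only pad with an empty string if the test printed nothing itself.
    if (!HasPrinted)
      JSONWriter.printString("");
  }
};

// Runs one fixture lifetime and reports how many writes the printer saw.
int writesAfterTearDown(bool CallVerify) {
  Fixture F;
  if (CallVerify)
    F.verify("Value", [](StrictWriter &W) { W.printString("Value"); });
  F.tearDown();
  return F.JSONWriter.Writes;
}
```

With or without a verify call, the printer sees exactly one write by the end of teardown, which is the property the flag exists to guarantee.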


}

TEST_F(ScopedPrinterTest, Indent) { TEST_F(ScopedPrinterTest, Indent) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.printString("|"); W.printString("|");

W.indent(); W.indent();

W.printString("|"); W.printString("|");

W.indent(2); W.indent(2);

W.printString("|"); W.printString("|");

}; };

▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines auto PrintFunc = [](ScopedPrinter &W) {

EnumEntry<int> OtherEnum{"Name5", "AltName5", 5}; EnumEntry<int> OtherEnum{"Name5", "AltName5", 5};

W.printEnum("Exists", EnumList[1].Value, makeArrayRef(EnumList)); W.printEnum("Exists", EnumList[1].Value, makeArrayRef(EnumList));

W.printEnum("DoesNotExist", OtherEnum.Value, makeArrayRef(EnumList)); W.printEnum("DoesNotExist", OtherEnum.Value, makeArrayRef(EnumList));

}; };

const char *ExpectedOut = R"(Exists: Name2 (0x2) const char *ExpectedOut = R"(Exists: Name2 (0x2)

DoesNotExist: 0x5 DoesNotExist: 0x5

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({
  "Exists": {
    "Value": "Name2",
    "RawValue": 2
  },
  "DoesNotExist": 5
})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintFlag) { TEST_F(ScopedPrinterTest, PrintFlag) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

const EnumEntry<uint16_t> SingleBitFlags[] = { const EnumEntry<uint16_t> SingleBitFlags[] = {

{"Name0", "AltName0", 0}, {"Name0", "AltName0", 0},

{"Name1", "AltName1", 1}, {"Name1", "AltName1", 1},

{"Name2", "AltName2", 1 << 1}, {"Name2", "AltName2", 1 << 1},

▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines

FirstSecondByteMask [ (0xFF) FirstSecondByteMask [ (0xFF)

] ]

FirstSecondThirdByteMask [ (0x333) FirstSecondThirdByteMask [ (0x333)

FirstByte3 (0x3) FirstByte3 (0x3)

SecondByte3 (0x30) SecondByte3 (0x30)

ThirdByte3 (0x300) ThirdByte3 (0x300)

] ]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"ZeroFlag": {

"RawFlags": 0,

"Flags": []

"NoFlag": {

"RawFlags": 8,

"Flags": []

"Flag1": {

"RawFlags": 1,

"Flags": [

{

"Name": "Name1",

"Value": 1

}

]

"Flag1&3": {

"RawFlags": 5,

"Flags": [

{

"Name": "Name1",

"Value": 1

{

"Name": "Name3",

"Value": 4

}

]

"ZeroFlagRaw": {

"RawFlags": 0,

"Flags": []

"NoFlagRaw": {

"RawFlags": 8,

"Flags": [

]

"Flag1Raw": {

"RawFlags": 1,

"Flags": [

]

"Flag1&3Raw": {

"RawFlags": 5,

"Flags": [

]

"FlagSorted": {

"RawFlags": 7,

"Flags": [

{

"Name": "A",

"Value": 4

{

"Name": "B",

"Value": 2

{

"Name": "C",

"Value": 1

}

]

"NoBitMask": {

"RawFlags": 4095,

"Flags": [

{

"Name": "FirstByte1",

"Value": 1

{

"Name": "FirstByte2",

"Value": 2

{

"Name": "FirstByte3",

"Value": 3

{

"Name": "SecondByte1",

"Value": 16

{

"Name": "SecondByte2",

"Value": 32

{

"Name": "SecondByte3",

"Value": 48

{

"Name": "ThirdByte1",

"Value": 256

{

"Name": "ThirdByte2",

"Value": 512

{

"Name": "ThirdByte3",

"Value": 768

}

]

"FirstByteMask": {

"RawFlags": 3,

"Flags": [

{

"Name": "FirstByte3",

"Value": 3

}

]

"SecondByteMask": {

"RawFlags": 48,

"Flags": [

{

"Name": "SecondByte3",

"Value": 48

}

]

"ValueOutsideMask": {

"RawFlags": 1,

"Flags": [

{

"Name": "FirstByte1",

"Value": 1

}

]

"FirstSecondByteMask": {

"RawFlags": 255,

"Flags": []

"FirstSecondThirdByteMask": {

"RawFlags": 819,

"Flags": [

{

"Name": "FirstByte3",

"Value": 3

{

"Name": "SecondByte3",

"Value": 48

{

jhendersonUnsubmitted

Done

Looking at this test seems to make it clear to me that this is not the optimal format of this output. I think you probably want it to be an array of numbers, rather than an object, with individual bytes labelled.

JaysonyanAuthorUnsubmitted

Done

I've updated the output to have the following format:

"Bytes": [
  {
    "Index": 0,
    "Value": 70,
    "Character": "F"
  },
  {
    "Index": 2,
    "Value": 111,
    "Character": "o"
  },
  ...
]

(Character attribute will be omitted for non printBinaryBlock(...) methods for now)


"Name": "ThirdByte3",

"Value": 768

}

]

}

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintNumber) { TEST_F(ScopedPrinterTest, PrintNumber) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

uint64_t Unsigned64Max = std::numeric_limits<uint64_t>::max(); uint64_t Unsigned64Max = std::numeric_limits<uint64_t>::max();

uint64_t Unsigned64Min = std::numeric_limits<uint64_t>::min(); uint64_t Unsigned64Min = std::numeric_limits<uint64_t>::min();

W.printNumber("uint64_t-max", Unsigned64Max); W.printNumber("uint64_t-max", Unsigned64Max);

W.printNumber("uint64_t-min", Unsigned64Min); W.printNumber("uint64_t-min", Unsigned64Min);

▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines

int32_t-min: -2147483648 int32_t-min: -2147483648

int16_t-max: 32767 int16_t-max: 32767

int16_t-min: -32768 int16_t-min: -32768

int8_t-max: 127 int8_t-max: 127

int8_t-min: -128 int8_t-min: -128

apsint: 9999999999999999999999 apsint: 9999999999999999999999

label: value (0) label: value (0)

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"uint64_t-max": 18446744073709551615,

"uint64_t-min": 0,

"uint32_t-max": 4294967295,

"uint32_t-min": 0,

"uint16_t-max": 65535,

"uint16_t-min": 0,

"uint8_t-max": 255,

"uint8_t-min": 0,

"int64_t-max": 9223372036854775807,

"int64_t-min": -9223372036854775808,

"int32_t-max": 2147483647,

"int32_t-min": -2147483648,

"int16_t-max": 32767,

"int16_t-min": -32768,

"int8_t-max": 127,

"int8_t-min": -128,

"apsint": 9999999999999999999999,

"label": {

"Value": "value",

"RawValue": 0

}

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintBoolean) { TEST_F(ScopedPrinterTest, PrintBoolean) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.printBoolean("True", true); W.printBoolean("True", true);

W.printBoolean("False", false); W.printBoolean("False", false);

}; };

const char *ExpectedOut = R"(True: Yes const char *ExpectedOut = R"(True: Yes

False: No False: No

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"True": true,

"False": false

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintVersion) { TEST_F(ScopedPrinterTest, PrintVersion) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.printVersion("Version", "123", "456", "789"); W.printVersion("Version", "123", "456", "789");

}; };

const char *ExpectedOut = R"(Version: 123.456.789 const char *ExpectedOut = R"(Version: 123.456.789

)"; )";

▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

uint16List: [65535, 0] uint16List: [65535, 0]

uint8List: [255, 0] uint8List: [255, 0]

int64List: [9223372036854775807, -9223372036854775808] int64List: [9223372036854775807, -9223372036854775808]

int32List: [2147483647, -2147483648] int32List: [2147483647, -2147483648]

int16List: [32767, -32768] int16List: [32767, -32768]

int8List: [127, -128] int8List: [127, -128]

APSIntList: [9999999999999999999999, -9999999999999999999999] APSIntList: [9999999999999999999999, -9999999999999999999999]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({
  "EmptyList": [],
  "StringList": [
    "foo",
    "bar",
    "baz"
  ],
  "BoolList": [
    true,
    false
  ],
  "uint64List": [
    18446744073709551615,
    0
  ],
  "uint32List": [
    4294967295,
    0
  ],
  "uint16List": [
    65535,
    0
  ],
  "uint8List": [
    255,
    0
  ],
  "int64List": [
    9223372036854775807,
    -9223372036854775808
  ],
  "int32List": [
    2147483647,
    -2147483648
  ],
  "int16List": [
    32767,
    -32768
  ],
  "int8List": [
    127,
    -128
  ],
  "APSIntList": [
    9999999999999999999999,
    -9999999999999999999999
  ]
})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintListPrinter) { TEST_F(ScopedPrinterTest, PrintListPrinter) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

const std::string StringList[] = {"a", "ab", "abc"}; const std::string StringList[] = {"a", "ab", "abc"};

W.printList("StringSizeList", StringList, W.printList("StringSizeList", StringList,

[](raw_ostream &OS, StringRef Item) { OS << Item.size(); }); [](raw_ostream &OS, StringRef Item) { OS << Item.size(); });

}; };

const char *ExpectedOut = R"(StringSizeList: [1, 2, 3] const char *ExpectedOut = R"(StringSizeList: [1, 2, 3]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc); verifyScopedPrinter(ExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintHex) { TEST_F(ScopedPrinterTest, PrintHex) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.printHex("HexNumber", 0x10); W.printHex("HexNumber", 0x10);

W.printHex("HexLabel", "Name", 0x10); W.printHex("HexLabel", "Name", 0x10);

}; };

const char *ExpectedOut = R"(HexNumber: 0x10 const char *ExpectedOut = R"(HexNumber: 0x10

HexLabel: Name (0x10) HexLabel: Name (0x10)

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"HexNumber": 16,

"HexLabel": {

"Value": "Name",

"RawValue": 16

}

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintHexList) { TEST_F(ScopedPrinterTest, PrintHexList) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

const uint64_t HexList[] = {0x1, 0x10, 0x100}; const uint64_t HexList[] = {0x1, 0x10, 0x100};

W.printHexList("HexList", HexList); W.printHexList("HexList", HexList);

}; };

const char *ExpectedOut = R"(HexList: [0x1, 0x10, 0x100] const char *ExpectedOut = R"(HexList: [0x1, 0x10, 0x100]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({
  "HexList": [
    1,
    16,
    256
  ]
})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintSymbolOffset) { TEST_F(ScopedPrinterTest, PrintSymbolOffset) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.printSymbolOffset("SymbolOffset", "SymbolName", 0x10); W.printSymbolOffset("SymbolOffset", "SymbolName", 0x10);

W.printSymbolOffset("NoSymbolOffset", "SymbolName", 0); W.printSymbolOffset("NoSymbolOffset", "SymbolName", 0);

}; };

const char *ExpectedOut = R"(SymbolOffset: SymbolName+0x10 const char *ExpectedOut = R"(SymbolOffset: SymbolName+0x10

NoSymbolOffset: SymbolName+0x0 NoSymbolOffset: SymbolName+0x0

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({
  "SymbolOffset": {
    "SymName": "SymbolName",
    "Offset": 16
  },
  "NoSymbolOffset": {
    "SymName": "SymbolName",
    "Offset": 0
  }
})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintString) { TEST_F(ScopedPrinterTest, PrintString) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

const StringRef StringRefValue("Value"); const StringRef StringRefValue("Value");

const std::string StringValue = "Value"; const std::string StringValue = "Value";

const char *CharArrayValue = "Value"; const char *CharArrayValue = "Value";

W.printString("StringRef", StringRefValue); W.printString("StringRef", StringRefValue);

W.printString("String", StringValue); W.printString("String", StringValue);

W.printString("CharArray", CharArrayValue); W.printString("CharArray", CharArrayValue);

ListScope L(W, "StringList"); ListScope L(W, "StringList");

W.printString(StringRefValue); W.printString(StringRefValue);

}; };

const char *ExpectedOut = R"(StringRef: Value const char *ExpectedOut = R"(StringRef: Value

String: Value String: Value

CharArray: Value CharArray: Value

StringList [ StringList [

Value Value

] ]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"StringRef": "Value",

"String": "Value",

"CharArray": "Value",

"StringList": [

"Value"

]

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintBinary) { TEST_F(ScopedPrinterTest, PrintBinary) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

std::vector<uint8_t> IntArray = {70, 111, 111, 66, 97, 114}; std::vector<uint8_t> IntArray = {70, 111, 111, 66, 97, 114};

std::vector<char> CharArray = {'F', 'o', 'o', 'B', 'a', 'r'}; std::vector<char> CharArray = {'F', 'o', 'o', 'B', 'a', 'r'};

std::vector<uint8_t> InvalidChars = {255, 255}; std::vector<uint8_t> InvalidChars = {255, 255};

W.printBinary("Binary1", "FooBar", IntArray); W.printBinary("Binary1", "FooBar", IntArray);

Show All 30 Lines

Binary10 ( Binary10 (

0000: 4D756C74 69706C65 204C696E 6520466F |Multiple Line Fo| 0000: 4D756C74 69706C65 204C696E 6520466F |Multiple Line Fo|

0010: 6F426172 |oBar| 0010: 6F426172 |oBar|

) )

Binary11 ( Binary11 (

0000: FFFF |..| 0000: FFFF |..|

) )

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"Binary1": {

"Value": "FooBar",

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary2": {

"Value": "FooBar",

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary3": {

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary4": {

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary5": {

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary6": {

"Offset": 0,

"Bytes": [

77,

117,

108,

116,

105,

112,

108,

101,

32,

76,

105,

110,

101,

32,

70,

111,

66,

97,

114

]

"Binary7": {

"Offset": 20,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary8": {

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary9": {

"Offset": 0,

"Bytes": [

70,

111,

66,

97,

114

]

"Binary10": {

"Offset": 0,

"Bytes": [

77,

117,

108,

116,

105,

112,

108,

101,

32,

76,

105,

110,

101,

32,

70,

111,

66,

97,

114

]

"Binary11": {

"Offset": 0,

"Bytes": [

255,

255

]

}

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintObject) { TEST_F(ScopedPrinterTest, PrintObject) {

auto PrintFunc = [](ScopedPrinter &W) { W.printObject("Object", "Value"); }; auto PrintFunc = [](ScopedPrinter &W) { W.printObject("Object", "Value"); };

const char *ExpectedOut = R"(Object: Value const char *ExpectedOut = R"(Object: Value

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({

"Object": "Value"

})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, StartLine) { TEST_F(ScopedPrinterTest, StartLine) {

jhendersonUnsubmitted

Done

I think we need a JSONScopedPrinter version of this test, since we override startLine in that class.


JaysonyanAuthorUnsubmitted

Done

My mistake, I actually meant to remove the overridden implementations of both startLine and getOStream. For the JSONScopedPrinter to provide these methods, it relies on json::OStream::rawValueBegin(), which can only be used where values are expected (elements of arrays or values of attributes). So if startLine or getOStream is called anywhere outside these contexts (which is most of the time), assertions inside json::OStream fail. So I think it might be more desirable to just rely on the ScopedPrinter implementation of both these methods.

jhendersonUnsubmitted

Done

Okay. I haven't looked into this, so I'll trust your judgement.
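The constraint described in this thread can be pictured with a small context stack. This is a hypothetical model, not `json::OStream` itself: `MiniStream`, `Ctx` and `rawValueAllowed` are illustrative names, and the sketch follows the comment above in treating only array elements and attribute values as valid value positions (the top-level case is not modelled).

```cpp
#include <cassert>
#include <vector>

// Hypothetical model of where a raw value may be written.
enum class Ctx { Object, Array, AttributeValue };

class MiniStream {
  std::vector<Ctx> Stack;

public:
  void objectBegin() { Stack.push_back(Ctx::Object); }
  void arrayBegin() { Stack.push_back(Ctx::Array); }
  void attributeBegin() {
    assert(!Stack.empty() && Stack.back() == Ctx::Object &&
           "attributes only exist inside objects");
    Stack.push_back(Ctx::AttributeValue);
  }
  // Closes whichever context was opened most recently.
  void end() {
    assert(!Stack.empty() && "nothing to close");
    Stack.pop_back();
  }
  // True only where a value is expected: an array element or the value
  // slot of an attribute. Directly inside an object, a key must come first.
  bool rawValueAllowed() const {
    return !Stack.empty() && (Stack.back() == Ctx::Array ||
                              Stack.back() == Ctx::AttributeValue);
  }
};
```

Under this model, a `startLine()`-style call made directly inside an object has no valid position to write to, which matches the reason given for dropping the overrides.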


auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

W.startLine() << "|"; W.startLine() << "|";

W.indent(2); W.indent(2);

W.startLine() << "|"; W.startLine() << "|";

W.unindent(); W.unindent();

W.startLine() << "|"; W.startLine() << "|";

}; };

verifyScopedPrinter(ExpectedOut, PrintFunc); verifyScopedPrinter(ExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, GetOStream) { TEST_F(ScopedPrinterTest, GetOStream) {

jhendersonUnsubmitted

Done

Ditto.


auto PrintFunc = [](ScopedPrinter &W) { W.getOStream() << "Test"; }; auto PrintFunc = [](ScopedPrinter &W) { W.getOStream() << "Test"; };

const char *ExpectedOut = "Test"; const char *ExpectedOut = "Test";

verifyScopedPrinter(ExpectedOut, PrintFunc); verifyScopedPrinter(ExpectedOut, PrintFunc);

} }

TEST_F(ScopedPrinterTest, PrintScope) { TEST_F(ScopedPrinterTest, PrintScope) {

auto PrintFunc = [](ScopedPrinter &W) { auto PrintFunc = [](ScopedPrinter &W) {

Show All 17 Lines

} }

List [ List [

ObjectInList { ObjectInList {

} }

ListInList [ ListInList [

] ]

)"; )";

verifyScopedPrinter(ExpectedOut, PrintFunc);

const char *JSONExpectedOut = R"({
  "Object": {
    "ObjectInObject": {},
    "ListInObject": []
  },
  "List": [
    {
      "ObjectInList": {}
    },
    {
      "ListInList": []
    }
  ]
})";

verifyAll(ExpectedOut, JSONExpectedOut, PrintFunc);

} }


Add JSONScopedPrinter class (Closed, Public)


Diff 393543

llvm/include/llvm/Support/ScopedPrinter.h

llvm/lib/Support/ScopedPrinter.cpp

llvm/tools/llvm-readobj/ARMEHABIPrinter.h

llvm/unittests/Support/ScopedPrinterTest.cpp
