This is an archive of the discontinued LLVM Phabricator instance.

Differential D137088

[llvm-readobj] Standardize JSON output for `Other` field
ClosedPublic

Authored by paulkirth on Oct 31 2022, 10:26 AM.

Download Raw Diff

Details

Reviewers

jhenderson

Commits

rG8e1746faa357: [llvm-readobj] Standardize JSON output for `Other` field

Summary

Today, the LLVM output uses special handling when the Other field is 0.
This output makes sense for a command line utility that a human will
read, but JSON is a machine readable format, so being consistent is more
important. Prior to this change, any consumer of the JSON output would
need to handle the Other field specially, since the structure of the
JSON would no longer be consistent.

Changes to JSON output when Other flag == 0:

"Other": 0,   ->   "Other": {
                      "RawFlags": 0,
                       "Flags": []
                    },

There are no changes to when Other flag != 0:

"Other": {        ->   "Other": {
  "RawFlags": 1,          "RawFlags": 1,
  "Flags": [              "Flags": [
      ...                     ...
  ]                       ]
},                     },

This patch adds a overload for the JSONELFDumper's printSymbol() method,
that uses consistent output formatting, regardless of the value of the
Other field.

Depends on D137092

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

paulkirth created this revision.Oct 31 2022, 10:26 AM

Herald added a reviewer: jhenderson. · View Herald TranscriptOct 31 2022, 10:26 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: s.egerton, simoncook. · View Herald Transcript

paulkirth requested review of this revision.Oct 31 2022, 10:26 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 31 2022, 10:26 AM

Herald added subscribers: llvm-commits, • pcwang-thead, MaskRay. · View Herald Transcript

paulkirth added a child revision: D137089: [llvm-readobj] Fix JSON output for Relocations.Oct 31 2022, 10:27 AM

FYI @jhenderson This whole stack is in need of tests, and I plan to address that. I think that if we find this stack preferable to D135419, then we can just abandon that approach.

Harbormaster completed remote builds in B195297: Diff 472056.Oct 31 2022, 11:13 AM

Rather than duplicating, then refactoring, do the refactor first and then use it, so that the code is reasonably clean in every landed commit.

Also, please provide before and after examples, and test cases for the fix. Also, please fix the title to use "llvm-readobj" in the commit message/patch description, not simply "readobj".

In D137088#3897022, @paulkirth wrote:

FYI @jhenderson This whole stack is in need of tests, and I plan to address that. I think that if we find this stack preferable to D135419, then we can just abandon that approach.

Sorry, only just saw this comment. Either way, the before and after examples are needed for me to visualise the problems. One of the aims with the JSON output being built on top of the LLVM output was that they should have largely been the same structurally, just with some formatting details changing.

This revision now requires changes to proceed.Nov 1 2022, 1:36 AM

Sorry, I typed this up back in early November, when we were initially looking at this and forgot to hit the submit button. I've had a few other items take up my attention, but hope to spend a bit more time on this before the end of the year.

The point of this change is to make Other a consistent field, and avoid having to special case how its handled when Other is 0.

Other: 0,   ->   "Other": {
                   "RawFlags": 0,
                    "Flags": []
                 },

With more context the JSON looks like this:

Before

... 
 "Symbols": [
    {
      "Symbol": {
        "Name": {
          "Value": "foo",
          "RawValue": 1
        },
        "Value": 0,
        "Size": 0,
        "Binding": {
          "Value": "Local",
          "RawValue": 0
        },
        "Type": {
          "Value": "None",
          "RawValue": 0
        },
        "Other": 0,
        "Section": {
          "Value": ".text",
          "RawValue": 1
        }
      }
    }
  ]
  ...

After

...
"Symbols": [
   {
     "Symbol": {
       "Name": {
         "Value": "foo",
         "RawValue": 1
       },
       "Value": 0,
       "Size": 0,
       "Binding": {
         "Value": "Local",
         "RawValue": 0
       },
       "Type": {
         "Value": "None",
         "RawValue": 0
       },
       "Other": {
         "RawFlags": 0,
         "Flags": []
       },
       "Section": {
         "Value": ".text",
         "RawValue": 1
       }
     }
   }
 ]
 ...

It isn't significantly different than before, but now it's consistent, regardless of the flag's value. For tools that focuses on a human reader, the specializing the output to be concise makes sense. But JSON is a machine readable format, so it's far more important that the output always have the same structure than it is to be concise. That's really the point of this change.

Also, part of the reason I set these patches aside for a while was that there is little testing for the JSON output, so I was trying to find a middle ground where we could only test that the formatting is correct without duplicating many of the LLVM formating tests. I didn't really find a nice way to do that. Do you have any recommendations as to how we could achieve that?

Rebase patch on top of D137092.

Add some tests for new JSON output of the Other field.
Update the summary with an example of the change to the output.

Herald added subscribers: frasercrmck, luismarques, apazos and 18 others. · View Herald TranscriptDec 7 2022, 4:48 PM

Harbormaster completed remote builds in B201849: Diff 481110.Dec 8 2022, 3:37 AM

As a heads-up, I am off work after today for 6 weeks, so won't be able to review further after today. Sorry about that.

llvm/tools/llvm-readobj/ELFDumper.cpp
7679	Could we avoid the mass code-duplication by just introducing another function "printSymbolOtherField" (or something like that) that is implemented differently for the two dumper types? Maybe even "printZeroSymbolOtherField" so that only the very niche bit that needs to differ is distinguished. This function could then just delegate to the regular print symbol st_other code, to avoid needing to duplicate that in any way. With this change, I think you could minimise the additional testing to just demonstrate basic behaviour difference for zero/non-zero st_other values, without needing to have additional cases for the various other aspects of this field printing.

In D137088#3979964, @paulkirth wrote:

Also, part of the reason I set these patches aside for a while was that there is little testing for the JSON output, so I was trying to find a middle ground where we could only test that the formatting is correct without duplicating many of the LLVM formating tests. I didn't really find a nice way to do that. Do you have any recommendations as to how we could achieve that?

I didn't respond to this directly, but just noting that as far as I'm concerned, if both LLVM and JSON styles use the same code, you only need to test one or the other. You only need to add additional testing where behaviour follows a different code path.

Also, although I'm marked as Requesting Changes, as long as my most recent comments are addressed, I'm happy for this patch to be landed if someone else involved with llvm-readobj approves it (e.g. @MaskRay).

@jhenderson Thanks for the feedback, and the notification re: time of work. I'm looking into your suggested approach now. The duplication also bothered me, but I also saw several other places where the output should diverge. I'll take a closer look at this, now that I've sketched out how to fix the JSON formatting for most of the problems I've found, and see if there is a nicer way to modify the output only when they diverge.

In D137088#3984007, @jhenderson wrote:

I didn't respond to this directly, but just noting that as far as I'm concerned, if both LLVM and JSON styles use the same code, you only need to test one or the other. You only need to add additional testing where behaviour follows a different code path.

That is good to know. Thank you for clarifying. I'll try to keep that in mind as this stack evolves.

Test only the differences in output and implementation between the LLVM and JSON formats

Harbormaster completed remote builds in B202877: Diff 482519.Dec 13 2022, 11:24 AM

Fix Formatting

Rebase + format

Rebase

Remove unnecesary test checks.

Harbormaster completed remote builds in B204052: Diff 484123.Dec 19 2022, 7:02 PM

Fix error in test

Harbormaster completed remote builds in B204248: Diff 484389.Dec 20 2022, 4:41 PM

Rebase

Harbormaster completed remote builds in B210000: Diff 492285.Jan 25 2023, 5:51 PM

Rebase.

Harbormaster completed remote builds in B212867: Diff 496200.Feb 9 2023, 1:17 PM

jhenderson added inline comments.Feb 24 2023, 12:59 AM

llvm/test/tools/llvm-readobj/ELF/llvm-vs-json-format.test
1 ↗	(On Diff #496200)	Having looked at this again, rather than a separate test showing the differences, I think I'd prefer the test cases being added to existing tests that test LLVM and GNU output styles. For the st_other case, that would be https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/llvm-readobj/ELF/symbol-visibility.test (which is essentially the "main" st_other test) and potentially both https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/llvm-readobj/ELF/aarch64-symbols-stother.test and https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/llvm-readobj/ELF/mips-symbols-stother.test. I was hoping we'd be able to share CHECK patterns, but of course, for JSON output, the keys are quoted, so that doesn't really work.

paulkirth added inline comments.Feb 24 2023, 10:38 AM

llvm/test/tools/llvm-readobj/ELF/llvm-vs-json-format.test
1 ↗	(On Diff #496200)	I'm a bit concerned that the number of tests and test cases will start to rapidly grow as we progress through this stack, since there is basically zero JSON testing as it stands, but let's see how this goes. I'd also like to keep this test since it documents how the two formats differ in a single place, and subsequent patches build on top of it.

Rebase and address comments

Update existing tests
Update summary

paulkirth marked an inline comment as done.Feb 24 2023, 4:48 PM

Rebase.

Harbormaster completed remote builds in B215859: Diff 500335.Feb 25 2023, 2:15 AM

jhenderson added inline comments.Mar 1 2023, 12:43 AM

llvm/test/tools/llvm-readobj/ELF/llvm-vs-json-format.test
1 ↗	(On Diff #496200)	It's just showing what the other test cases show though, for example, the aarch test you've modified clearly shows the behaviour difference between the two for st_other fields. If you think highlighting the difference in behaviour between the two is worthwhile (I'm not convinced that it is), it probably should be in the documentation, rather than duplicating test coverage. Regarding the rapid test case growth, that's rather inevitable, I'd have thought: you've got a new(ish) file format, that hasn't been rigorously tested before. It clearly needs to be better tested, so it's rather unsurprising. Keeping this file around will just exacerbate the test growth, since it will become one giant soup of test cases, be hard to maintain, and be somewhat hard to read, I suspect going forwards.
llvm/test/tools/llvm-readobj/ELF/mips-symbols-stother.test
27	Nit: no need for extra blank line.

Remove redunant test code.

Harbormaster completed remote builds in B217210: Diff 502187.Mar 3 2023, 12:58 PM

Rebase.

paulkirth mentioned this in D135419: [readobj] Make JSON output consistent for Other flags.Mar 3 2023, 4:04 PM

Harbormaster completed remote builds in B217297: Diff 502289.Mar 3 2023, 5:21 PM

Looks good, but one suggestion.

llvm/tools/llvm-readobj/ELFDumper.cpp
7679	It might be worth a comment explaining why for JSON it's desirable to do things differently here.

This revision is now accepted and ready to land.Mar 17 2023, 1:27 AM

Add comment about the rationale for how we handle JSON output.

This revision was landed with ongoing or failed builds.Mar 17 2023, 4:38 PM

Closed by commit rG8e1746faa357: [llvm-readobj] Standardize JSON output for `Other` field (authored by paulkirth). · Explain Why

This revision was automatically updated to reflect the committed changes.

paulkirth added a commit: rG8e1746faa357: [llvm-readobj] Standardize JSON output for `Other` field.

Harbormaster completed remote builds in B220164: Diff 506223.Mar 17 2023, 5:14 PM

Revision Contents

Path

Size

llvm/

test/

tools/

llvm-readobj/

ELF/

aarch64-symbols-stother.test

49 lines

mips-symbols-stother.test

34 lines

symbol-visibility.test

40 lines

tools/

llvm-readobj/

ELFDumper.cpp

42 lines

Diff 506224

llvm/test/tools/llvm-readobj/ELF/aarch64-symbols-stother.test

	## Check that we are able to dump AArch64 STO_* flags correctly when dumping symbols.			## Check that we are able to dump AArch64 STO_* flags correctly when dumping symbols.

	# RUN: yaml2obj %s -o %t.o			# RUN: yaml2obj %s -o %t.o
	# RUN: llvm-readobj --symbols %t.o \| FileCheck %s --check-prefix=LLVM			# RUN: llvm-readobj --symbols %t.o \| FileCheck %s --check-prefix=LLVM
				# RUN: llvm-readobj --symbols %t.o --elf-output-style=JSON --pretty-print \| FileCheck %s --check-prefix=JSON
	# RUN: llvm-readelf --symbols %t.o \| FileCheck %s --check-prefix=GNU			# RUN: llvm-readelf --symbols %t.o \| FileCheck %s --check-prefix=GNU

	# LLVM: Name: foo1			# LLVM: Name: foo1
	# LLVM: Other [ (0x80)			# LLVM: Other [ (0x80)
	# LLVM-NEXT: STO_AARCH64_VARIANT_PCS (0x80)			# LLVM-NEXT: STO_AARCH64_VARIANT_PCS (0x80)
	# LLVM-NEXT: ]			# LLVM-NEXT: ]
	# LLVM: Name: foo2			# LLVM: Name: foo2
	# LLVM: Other [ (0xC0)			# LLVM: Other [ (0xC0)
	Show All 10 Lines
	# LLVM-NEXT: ]			# LLVM-NEXT: ]

	# GNU: Symbol table '.symtab' contains 5 entries:			# GNU: Symbol table '.symtab' contains 5 entries:
	# GNU: 1: 0000000000000000 0 NOTYPE LOCAL DEFAULT [VARIANT_PCS] UND foo1			# GNU: 1: 0000000000000000 0 NOTYPE LOCAL DEFAULT [VARIANT_PCS] UND foo1
	# GNU-NEXT: 2: 0000000000000000 0 NOTYPE LOCAL DEFAULT [VARIANT_PCS \| 40] UND foo2			# GNU-NEXT: 2: 0000000000000000 0 NOTYPE LOCAL DEFAULT [VARIANT_PCS \| 40] UND foo2
	# GNU-NEXT: 3: 0000000000000000 0 NOTYPE LOCAL PROTECTED [VARIANT_PCS] UND foo3			# GNU-NEXT: 3: 0000000000000000 0 NOTYPE LOCAL PROTECTED [VARIANT_PCS] UND foo3
	# GNU-NEXT: 4: 0000000000000000 0 NOTYPE LOCAL PROTECTED UND foo4			# GNU-NEXT: 4: 0000000000000000 0 NOTYPE LOCAL PROTECTED UND foo4

				# JSON: "Value": "foo1",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 128,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STO_AARCH64_VARIANT_PCS",
				# JSON-NEXT: "Value": 128
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT:},

				# JSON: "Value": "foo2",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 192,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STO_AARCH64_VARIANT_PCS",
				# JSON-NEXT: "Value": 128
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT:},

				# JSON: "Value": "foo3",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 131,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STO_AARCH64_VARIANT_PCS",
				# JSON-NEXT: "Value": 128
				# JSON-NEXT: },
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STV_PROTECTED",
				# JSON-NEXT: "Value": 3
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT:},

				# JSON: "Value": "foo4",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 3,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STV_PROTECTED",
				# JSON-NEXT: "Value": 3
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT:},

	--- !ELF			--- !ELF
	FileHeader:			FileHeader:
	Class: ELFCLASS64			Class: ELFCLASS64
	Data: ELFDATA2LSB			Data: ELFDATA2LSB
	Type: ET_REL			Type: ET_REL
	Machine: EM_AARCH64			Machine: EM_AARCH64
	Symbols:			Symbols:
	- Name: foo1			- Name: foo1
	Other: [ STO_AARCH64_VARIANT_PCS ]			Other: [ STO_AARCH64_VARIANT_PCS ]
	- Name: foo2			- Name: foo2
	Other: [ STO_AARCH64_VARIANT_PCS, 0x40 ]			Other: [ STO_AARCH64_VARIANT_PCS, 0x40 ]
	- Name: foo3			- Name: foo3
	Other: [ STO_AARCH64_VARIANT_PCS, STV_PROTECTED ]			Other: [ STO_AARCH64_VARIANT_PCS, STV_PROTECTED ]
	- Name: foo4			- Name: foo4
	Other: [ STV_PROTECTED ]			Other: [ STV_PROTECTED ]

llvm/test/tools/llvm-readobj/ELF/mips-symbols-stother.test

	## Check that we are able to dump MIPS STO_* flags correctly when dumping symbols.			## Check that we are able to dump MIPS STO_* flags correctly when dumping symbols.

	# RUN: yaml2obj %s -o %t.o			# RUN: yaml2obj %s -o %t.o
	# RUN: llvm-readobj --symbols %t.o \| FileCheck %s --strict-whitespace --check-prefix=MIPS-LLVM			# RUN: llvm-readobj --symbols %t.o \| FileCheck %s --strict-whitespace --check-prefix=MIPS-LLVM
				# RUN: llvm-readobj --symbols %t.o --elf-output-style=JSON --pretty-print \| FileCheck %s --check-prefix=MIPS-JSON
	# RUN: llvm-readelf --symbols %t.o \| FileCheck %s --strict-whitespace --check-prefix=MIPS-GNU			# RUN: llvm-readelf --symbols %t.o \| FileCheck %s --strict-whitespace --check-prefix=MIPS-GNU

	# MIPS-LLVM:Name: foo			# MIPS-LLVM:Name: foo
	# MIPS-LLVM:Other [			# MIPS-LLVM:Other [
	# MIPS-LLVM-NEXT: STO_MIPS_MICROMIPS (0x80)			# MIPS-LLVM-NEXT: STO_MIPS_MICROMIPS (0x80)
	# MIPS-LLVM-NEXT: STO_MIPS_OPTIONAL (0x4)			# MIPS-LLVM-NEXT: STO_MIPS_OPTIONAL (0x4)
	# MIPS-LLVM-NEXT: STO_MIPS_PIC (0x20)			# MIPS-LLVM-NEXT: STO_MIPS_PIC (0x20)
	# MIPS-LLVM-NEXT: STO_MIPS_PLT (0x8)			# MIPS-LLVM-NEXT: STO_MIPS_PLT (0x8)
	# MIPS-LLVM-NEXT:]			# MIPS-LLVM-NEXT:]

	# MIPS-LLVM:Name: bar			# MIPS-LLVM:Name: bar
	# MIPS-LLVM:Other [			# MIPS-LLVM:Other [
	# MIPS-LLVM-NEXT: STO_MIPS_MIPS16 (0xF0)			# MIPS-LLVM-NEXT: STO_MIPS_MIPS16 (0xF0)
	# MIPS-LLVM-NEXT:]			# MIPS-LLVM-NEXT:]

	# MIPS-GNU:Symbol table '.symtab' contains 3 entries:			# MIPS-GNU:Symbol table '.symtab' contains 3 entries:
	# MIPS-GNU-NEXT: Num: Value Size Type Bind Vis Ndx Name			# MIPS-GNU-NEXT: Num: Value Size Type Bind Vis Ndx Name
	# MIPS-GNU-NEXT: 0: 00000000 0 NOTYPE LOCAL DEFAULT UND			# MIPS-GNU-NEXT: 0: 00000000 0 NOTYPE LOCAL DEFAULT UND
	# MIPS-GNU-NEXT: 1: 00000000 0 NOTYPE LOCAL DEFAULT [<other: 0xac>] UND foo			# MIPS-GNU-NEXT: 1: 00000000 0 NOTYPE LOCAL DEFAULT [<other: 0xac>] UND foo
	# MIPS-GNU-NEXT: 2: 00000000 0 NOTYPE LOCAL DEFAULT [<other: 0xf0>] UND bar			# MIPS-GNU-NEXT: 2: 00000000 0 NOTYPE LOCAL DEFAULT [<other: 0xf0>] UND bar

				# MIPS-JSON: "Value": "foo",
				jhendersonUnsubmitted Done Reply Inline Actions Nit: no need for extra blank line. jhenderson: Nit: no need for extra blank line.
				# MIPS-JSON: "Other": {
				# MIPS-JSON-NEXT: "RawFlags": 172,
				# MIPS-JSON-NEXT: "Flags": [
				# MIPS-JSON-NEXT: {
				# MIPS-JSON-NEXT: "Name": "STO_MIPS_MICROMIPS",
				# MIPS-JSON-NEXT: "Value": 128
				# MIPS-JSON-NEXT: },
				# MIPS-JSON-NEXT: {
				# MIPS-JSON-NEXT: "Name": "STO_MIPS_OPTIONAL",
				# MIPS-JSON-NEXT: "Value": 4
				# MIPS-JSON-NEXT: },
				# MIPS-JSON-NEXT: {
				# MIPS-JSON-NEXT: "Name": "STO_MIPS_PIC",
				# MIPS-JSON-NEXT: "Value": 32
				# MIPS-JSON-NEXT: },
				# MIPS-JSON-NEXT: {
				# MIPS-JSON-NEXT: "Name": "STO_MIPS_PLT",
				# MIPS-JSON-NEXT: "Value": 8
				# MIPS-JSON-NEXT: }
				# MIPS-JSON-NEXT: ]
				# MIPS-JSON-NEXT: },
				# MIPS-JSON: "Value": "bar",
				# MIPS-JSON: "Other": {
				# MIPS-JSON-NEXT: "RawFlags": 240,
				# MIPS-JSON-NEXT: "Flags": [
				# MIPS-JSON-NEXT: {
				# MIPS-JSON-NEXT: "Name": "STO_MIPS_MIPS16",
				# MIPS-JSON-NEXT: "Value": 240
				# MIPS-JSON-NEXT: }
				# MIPS-JSON-NEXT: ]
				# MIPS-JSON-NEXT: },

	--- !ELF			--- !ELF
	FileHeader:			FileHeader:
	Class: ELFCLASS32			Class: ELFCLASS32
	Data: ELFDATA2LSB			Data: ELFDATA2LSB
	Type: ET_REL			Type: ET_REL
	Machine: EM_MIPS			Machine: EM_MIPS
	Symbols:			Symbols:
	- Name: foo			- Name: foo
	Other: [ STO_MIPS_MICROMIPS, STO_MIPS_PIC,			Other: [ STO_MIPS_MICROMIPS, STO_MIPS_PIC,
	STO_MIPS_PLT, STO_MIPS_OPTIONAL]			STO_MIPS_PLT, STO_MIPS_OPTIONAL]
	## Use a different symbol for STO_MIPS_MIPS16 (0xf0) as it interferes			## Use a different symbol for STO_MIPS_MIPS16 (0xf0) as it interferes
	## with STO_MIPS_PIC (0x20) and STO_MIPS_MICROMIPS (0x80).			## with STO_MIPS_PIC (0x20) and STO_MIPS_MICROMIPS (0x80).
	- Name: bar			- Name: bar
	Other: [ STO_MIPS_MIPS16 ]			Other: [ STO_MIPS_MIPS16 ]

llvm/test/tools/llvm-readobj/ELF/symbol-visibility.test

	## Show that llvm-readobj prints the symbol visibility where recognised, or			## Show that llvm-readobj prints the symbol visibility where recognised, or
	## something sensible when not, for both GNU and LLVM output.			## something sensible when not, for both GNU and LLVM output.

	## Check how we dump symbols when they have only STV_* bits set for st_other.			## Check how we dump symbols when they have only STV_* bits set for st_other.
	## (This is the most common case).			## (This is the most common case).

	# RUN: yaml2obj --docnum=1 %s -o %t1.o			# RUN: yaml2obj --docnum=1 %s -o %t1.o
	# RUN: llvm-readobj --symbols %t1.o \| FileCheck %s --check-prefix=LLVM			# RUN: llvm-readobj --symbols %t1.o \| FileCheck %s --check-prefix=LLVM
	# RUN: llvm-readelf --symbols %t1.o \| FileCheck %s --strict-whitespace --check-prefix=GNU			# RUN: llvm-readelf --symbols %t1.o \| FileCheck %s --strict-whitespace --check-prefix=GNU
				# RUN: llvm-readobj --symbols --pretty-print --elf-output-style=JSON %t1.o \| FileCheck %s --check-prefix=JSON

	# LLVM: Name: default			# LLVM: Name: default
	# LLVM: Other: 0			# LLVM: Other: 0
	# LLVM: Name: internal			# LLVM: Name: internal
	# LLVM: Other [ (0x1)			# LLVM: Other [ (0x1)
	# LLVM-NEXT: STV_INTERNAL (0x1)			# LLVM-NEXT: STV_INTERNAL (0x1)
	# LLVM-NEXT: ]			# LLVM-NEXT: ]
	# LLVM: Name: hidden			# LLVM: Name: hidden
	# LLVM: Other [ (0x2)			# LLVM: Other [ (0x2)
	# LLVM-NEXT: STV_HIDDEN (0x2)			# LLVM-NEXT: STV_HIDDEN (0x2)
	# LLVM-NEXT: ]			# LLVM-NEXT: ]
	# LLVM: Name: protected			# LLVM: Name: protected
	# LLVM: Other [ (0x3)			# LLVM: Other [ (0x3)
	# LLVM-NEXT: STV_PROTECTED (0x3)			# LLVM-NEXT: STV_PROTECTED (0x3)
	# LLVM-NEXT: ]			# LLVM-NEXT: ]

	# GNU: Vis Ndx Name			# GNU: Vis Ndx Name
	# GNU-NEXT: DEFAULT UND			# GNU-NEXT: DEFAULT UND
	# GNU-NEXT: DEFAULT UND default			# GNU-NEXT: DEFAULT UND default
	# GNU-NEXT: INTERNAL UND internal			# GNU-NEXT: INTERNAL UND internal
	# GNU-NEXT: HIDDEN UND hidden			# GNU-NEXT: HIDDEN UND hidden
	# GNU-NEXT: PROTECTED UND protected			# GNU-NEXT: PROTECTED UND protected

				# JSON: "Value": "default",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 0,
				# JSON-NEXT: "Flags": []
				# JSON-NEXT: },

				# JSON: "Value": "internal",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 1,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STV_INTERNAL",
				# JSON-NEXT: "Value": 1
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT: },

				# JSON: "Value": "hidden",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 2,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STV_HIDDEN",
				# JSON-NEXT: "Value": 2
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT: },

				# JSON: "Value": "protected",
				# JSON: "Other": {
				# JSON-NEXT: "RawFlags": 3,
				# JSON-NEXT: "Flags": [
				# JSON-NEXT: {
				# JSON-NEXT: "Name": "STV_PROTECTED",
				# JSON-NEXT: "Value": 3
				# JSON-NEXT: }
				# JSON-NEXT: ]
				# JSON-NEXT: },

	--- !ELF			--- !ELF
	FileHeader:			FileHeader:
	Class: ELFCLASS32			Class: ELFCLASS32
	Data: ELFDATA2LSB			Data: ELFDATA2LSB
	Type: ET_REL			Type: ET_REL
	Symbols:			Symbols:
	- Name: default			- Name: default
	Other: [ STV_DEFAULT ]			Other: [ STV_DEFAULT ]
	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/tools/llvm-readobj/ELFDumper.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 683 Lines • ▼ Show 20 Lines	public:
void printBBAddrMaps() override;		void printBBAddrMaps() override;
void printAddrsig() override;		void printAddrsig() override;
void printNotes() override;		void printNotes() override;
void printELFLinkerOptions() override;		void printELFLinkerOptions() override;
void printStackSizes() override;		void printStackSizes() override;
void printMemtag(		void printMemtag(
const ArrayRef<std::pair<std::string, std::string>> DynamicEntries,		const ArrayRef<std::pair<std::string, std::string>> DynamicEntries,
const ArrayRef<uint8_t> AndroidNoteDesc) override;		const ArrayRef<uint8_t> AndroidNoteDesc) override;
		void printSymbolSection(const Elf_Sym &Symbol, unsigned SymIndex,
		DataRegion<Elf_Word> ShndxTable) const;

private:		private:
void printRelrReloc(const Elf_Relr &R) override;		void printRelrReloc(const Elf_Relr &R) override;
void printRelRelaReloc(const Relocation<ELFT> &R,		void printRelRelaReloc(const Relocation<ELFT> &R,
const RelSymbol<ELFT> &RelSym) override;		const RelSymbol<ELFT> &RelSym) override;

void printSymbolSection(const Elf_Sym &Symbol, unsigned SymIndex,
DataRegion<Elf_Word> ShndxTable) const;
void printSymbol(const Elf_Sym &Symbol, unsigned SymIndex,		void printSymbol(const Elf_Sym &Symbol, unsigned SymIndex,
DataRegion<Elf_Word> ShndxTable,		DataRegion<Elf_Word> ShndxTable,
std::optional<StringRef> StrTable, bool IsDynamic,		std::optional<StringRef> StrTable, bool IsDynamic,
bool /NonVisibilityBitsUsed/) const override;		bool /NonVisibilityBitsUsed/) const override;
void printProgramHeaders() override;		void printProgramHeaders() override;
void printSectionMapping() override {}		void printSectionMapping() override {}
void printStackSizeEntry(uint64_t Size,		void printStackSizeEntry(uint64_t Size,
ArrayRef<std::string> FuncNames) override;		ArrayRef<std::string> FuncNames) override;

void printMipsGOT(const MipsGOTParser<ELFT> &Parser) override;		void printMipsGOT(const MipsGOTParser<ELFT> &Parser) override;
void printMipsPLT(const MipsGOTParser<ELFT> &Parser) override;		void printMipsPLT(const MipsGOTParser<ELFT> &Parser) override;
void printMipsABIFlags() override;		void printMipsABIFlags() override;
		virtual void printZeroSymbolOtherField(const Elf_Sym &Symbol) const;

protected:		protected:
		void printSymbolOtherField(const Elf_Sym &Symbol) const;
ScopedPrinter &W;		ScopedPrinter &W;
};		};

// JSONELFDumper shares most of the same implementation as LLVMELFDumper except		// JSONELFDumper shares most of the same implementation as LLVMELFDumper except
// it uses a JSONScopedPrinter.		// it uses a JSONScopedPrinter.
template <typename ELFT> class JSONELFDumper : public LLVMELFDumper<ELFT> {		template <typename ELFT> class JSONELFDumper : public LLVMELFDumper<ELFT> {
public:		public:
LLVM_ELF_IMPORT_TYPES_ELFT(ELFT)		LLVM_ELF_IMPORT_TYPES_ELFT(ELFT)

JSONELFDumper(const object::ELFObjectFile<ELFT> &ObjF, ScopedPrinter &Writer)		JSONELFDumper(const object::ELFObjectFile<ELFT> &ObjF, ScopedPrinter &Writer)
: LLVMELFDumper<ELFT>(ObjF, Writer) {}		: LLVMELFDumper<ELFT>(ObjF, Writer) {}

void printFileSummary(StringRef FileStr, ObjectFile &Obj,		void printFileSummary(StringRef FileStr, ObjectFile &Obj,
ArrayRef<std::string> InputFilenames,		ArrayRef<std::string> InputFilenames,
const Archive *A) override;		const Archive *A) override;
		virtual void printZeroSymbolOtherField(const Elf_Sym &Symbol) const override;

private:		private:
std::unique_ptr<DictScope> FileScope;		std::unique_ptr<DictScope> FileScope;
};		};

} // end anonymous namespace		} // end anonymous namespace

namespace llvm {		namespace llvm {
▲ Show 20 Lines • Show All 6,134 Lines • ▼ Show 20 Lines	else
consumeError(SectionName.takeError());		consumeError(SectionName.takeError());
W.printHex("Section", "<?>", *SectionIndex);		W.printHex("Section", "<?>", *SectionIndex);
} else {		} else {
W.printHex("Section", SectionName, SectionIndex);		W.printHex("Section", SectionName, SectionIndex);
}		}
}		}

template <class ELFT>		template <class ELFT>
		void LLVMELFDumper<ELFT>::printSymbolOtherField(const Elf_Sym &Symbol) const {
		std::vector<EnumEntry<unsigned>> SymOtherFlags =
		this->getOtherFlagsFromSymbol(this->Obj.getHeader(), Symbol);
		W.printFlags("Other", Symbol.st_other, ArrayRef(SymOtherFlags), 0x3u);
		}

		template <class ELFT>
		void LLVMELFDumper<ELFT>::printZeroSymbolOtherField(
		const Elf_Sym &Symbol) const {
		assert(Symbol.st_other == 0 && "non-zero Other Field");
		// Usually st_other flag is zero. Do not pollute the output
		// by flags enumeration in that case.
		W.printNumber("Other", 0);
		}

		template <class ELFT>
void LLVMELFDumper<ELFT>::printSymbol(const Elf_Sym &Symbol, unsigned SymIndex,		void LLVMELFDumper<ELFT>::printSymbol(const Elf_Sym &Symbol, unsigned SymIndex,
DataRegion<Elf_Word> ShndxTable,		DataRegion<Elf_Word> ShndxTable,
std::optional<StringRef> StrTable,		std::optional<StringRef> StrTable,
bool IsDynamic,		bool IsDynamic,
bool /NonVisibilityBitsUsed/) const {		bool /NonVisibilityBitsUsed/) const {
std::string FullSymbolName = this->getFullSymbolName(		std::string FullSymbolName = this->getFullSymbolName(
Symbol, SymIndex, ShndxTable, StrTable, IsDynamic);		Symbol, SymIndex, ShndxTable, StrTable, IsDynamic);
unsigned char SymbolType = Symbol.getType();		unsigned char SymbolType = Symbol.getType();

DictScope D(W, "Symbol");		DictScope D(W, "Symbol");
W.printNumber("Name", FullSymbolName, Symbol.st_name);		W.printNumber("Name", FullSymbolName, Symbol.st_name);
W.printHex("Value", Symbol.st_value);		W.printHex("Value", Symbol.st_value);
W.printNumber("Size", Symbol.st_size);		W.printNumber("Size", Symbol.st_size);
W.printEnum("Binding", Symbol.getBinding(), ArrayRef(ElfSymbolBindings));		W.printEnum("Binding", Symbol.getBinding(), ArrayRef(ElfSymbolBindings));
if (this->Obj.getHeader().e_machine == ELF::EM_AMDGPU &&		if (this->Obj.getHeader().e_machine == ELF::EM_AMDGPU &&
SymbolType >= ELF::STT_LOOS && SymbolType < ELF::STT_HIOS)		SymbolType >= ELF::STT_LOOS && SymbolType < ELF::STT_HIOS)
W.printEnum("Type", SymbolType, ArrayRef(AMDGPUSymbolTypes));		W.printEnum("Type", SymbolType, ArrayRef(AMDGPUSymbolTypes));
else		else
W.printEnum("Type", SymbolType, ArrayRef(ElfSymbolTypes));		W.printEnum("Type", SymbolType, ArrayRef(ElfSymbolTypes));
if (Symbol.st_other == 0)		if (Symbol.st_other == 0)
// Usually st_other flag is zero. Do not pollute the output		printZeroSymbolOtherField(Symbol);
// by flags enumeration in that case.		else
W.printNumber("Other", 0);		printSymbolOtherField(Symbol);
else {
std::vector<EnumEntry<unsigned>> SymOtherFlags =
this->getOtherFlagsFromSymbol(this->Obj.getHeader(), Symbol);
W.printFlags("Other", Symbol.st_other, ArrayRef(SymOtherFlags), 0x3u);
}
printSymbolSection(Symbol, SymIndex, ShndxTable);		printSymbolSection(Symbol, SymIndex, ShndxTable);
}		}

template <class ELFT>		template <class ELFT>
void LLVMELFDumper<ELFT>::printSymbols(bool PrintSymbols,		void LLVMELFDumper<ELFT>::printSymbols(bool PrintSymbols,
bool PrintDynamicSymbols) {		bool PrintDynamicSymbols) {
if (PrintSymbols) {		if (PrintSymbols) {
ListScope Group(W, "Symbols");		ListScope Group(W, "Symbols");
▲ Show 20 Lines • Show All 740 Lines • ▼ Show 20 Lines	void JSONELFDumper<ELFT>::printFileSummary(StringRef FileStr, ObjectFile &Obj,
this->W.printString("File", FileStr);		this->W.printString("File", FileStr);
this->W.printString("Format", Obj.getFileFormatName());		this->W.printString("Format", Obj.getFileFormatName());
this->W.printString("Arch", Triple::getArchTypeName(Obj.getArch()));		this->W.printString("Arch", Triple::getArchTypeName(Obj.getArch()));
this->W.printString(		this->W.printString(
"AddressSize",		"AddressSize",
std::string(formatv("{0}bit", 8 * Obj.getBytesInAddress())));		std::string(formatv("{0}bit", 8 * Obj.getBytesInAddress())));
this->printLoadName();		this->printLoadName();
}		}

		template <class ELFT>
		void JSONELFDumper<ELFT>::printZeroSymbolOtherField(
		jhendersonUnsubmitted Done Reply Inline Actions Could we avoid the mass code-duplication by just introducing another function "printSymbolOtherField" (or something like that) that is implemented differently for the two dumper types? Maybe even "printZeroSymbolOtherField" so that only the very niche bit that needs to differ is distinguished. This function could then just delegate to the regular print symbol st_other code, to avoid needing to duplicate that in any way. With this change, I think you could minimise the additional testing to just demonstrate basic behaviour difference for zero/non-zero st_other values, without needing to have additional cases for the various other aspects of this field printing. jhenderson: Could we avoid the mass code-duplication by just introducing another function…
		jhendersonUnsubmitted Done Reply Inline Actions It might be worth a comment explaining why for JSON it's desirable to do things differently here. jhenderson: It might be worth a comment explaining why for JSON it's desirable to do things differently…
		const Elf_Sym &Symbol) const {
		// We want the JSON format to be uniform, since it is machine readable, so
		// always print the `Other` field the same way.
		this->printSymbolOtherField(Symbol);
		}

This is an archive of the discontinued LLVM Phabricator instance.

[llvm-readobj] Standardize JSON output for `Other` fieldClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 506224

llvm/test/tools/llvm-readobj/ELF/aarch64-symbols-stother.test

llvm/test/tools/llvm-readobj/ELF/mips-symbols-stother.test

llvm/test/tools/llvm-readobj/ELF/symbol-visibility.test

llvm/tools/llvm-readobj/ELFDumper.cpp

[llvm-readobj] Standardize JSON output for `Other` field
ClosedPublic