This is an archive of the discontinued LLVM Phabricator instance.

llvm/tools/llvm-readobj/XCOFFDumper.cpp
145	On the example I am running this just prints "Symbol Index: )", although I checked the individual SymIdx and SymName variables and they have the right values.

DiggerLin updated this revision to Diff 457061.Aug 31 2022, 12:38 PM

fixed Paul Scoropan's report bug. thanks for Paul

DiggerLin marked an inline comment as done.Aug 31 2022, 12:38 PM

Harbormaster completed remote builds in B184429: Diff 457061.Aug 31 2022, 1:24 PM

I did wonder whether this should really just be reusing the existing --unwind option, but as I understand it, the XCOFF exception section isn't really about unwinding the stack or anything along those lines. Is that correct?

Rather than canned binaries, I think it wouldn't be too hard to add yaml2obj support for exception sections, so that you can create the input at test time.

Your commit message should be a grammatically correct sentence, with leading capital letter (i.e. "Add a new ..."), much like comments.

I've not really looked at the XCOFFObjectFile code changes or the testing just yet. Please don't forget to test error/warning paths, to show that the code can handle malformed inputs.

llvm/docs/CommandGuide/llvm-readobj.rst
335	I think you can simplify to the inline suggestion.
llvm/lib/Object/XCOFFObjectFile.cpp
94	This file hasn't been clang-formatted. Please fix.
1050–1053	I recommend folding this `if` into the assertions, possibly like the inline suggestion. Otherwise you have an empty if without assertions, which seems messy (although the optimizer should get rid of it in this case).
llvm/test/tools/llvm-readobj/XCOFF/exception-section.test
10	I think it would be more similar to other dumping formats to print this row as: `Symbol: .bar (12)`.
llvm/tools/llvm-readobj/XCOFFDumper.cpp
143	`unwrapOrError` should be considered deprecated in llvm-readobj, as it stops llvm-readobj from continuing, which is not useful for dumping tools, especially given that the error in this case won't prevent other files or sections from being dumped. Instead, prefer reporting problems as warnings (via `reportUniqueWarning` or equivalent). See the ELF dumper for good examples.
152	Ditto: don't use `unwrapOrError`.
llvm/tools/llvm-readobj/llvm-readobj.cpp
510–511	Is there a reasson you're not putting this up with the other XCOFF-specific dumping option?

DiggerLin edited the summary of this revision. (Show Details)Sep 1 2022, 6:32 AM

In D133030#3763317, @jhenderson wrote:

I did wonder whether this should really just be reusing the existing --unwind option, but as I understand it, the XCOFF exception section isn't really about unwinding the stack or anything along those lines. Is that correct?

yes, you are correct, the exception section is not for unwind. so we can not use the --unwind for it.

Rather than canned binaries, I think it wouldn't be too hard to add yaml2obj support for exception sections, so that you can create the input at test time.

yes, if I do the yaml2obj first , I need a tools to decode the object file generated by yaml2obj when I add a test for the yaml2obj, but there is not now. My propose is three steps:

using canned binaries in this patch,
write a second patch to yaml2obj support for exception sections(using the llvm-readobj --exception-section to test the second patch).
have another patch to modify the llvm/test/tools/llvm-readobj/XCOFF/exception-section.test which use the yaml2obj to generate xcoff object with exception sections and replace the canned binaries in the test case.

Your commit message should be a grammatically correct sentence, with leading capital letter (i.e. "Add a new ..."), much like comments.

I've not really looked at the XCOFFObjectFile code changes or the testing just yet. Please don't forget to test error/warning paths, to show that the code can handle malformed inputs.

address James's comment.

llvm/lib/Object/XCOFFObjectFile.cpp
1050–1053	thanks for explain. fixed
llvm/test/tools/llvm-readobj/XCOFF/exception-section.test
10	according to https://www.ibm.com/docs/en/aix/7.2?topic=formats-xcoff-object-file-format#XCOFF__iua3i23ajbau e_addr.e_symndx+ Symbol table index for function the value of the field is for symbol index, so putting symbol index at first and putting symbol name in brackets are reasonable.
llvm/tools/llvm-readobj/llvm-readobj.cpp
510–511	If there are several options in the command line at the same time. I want to keep the output as order of content xcoff object file as much as possible. If you do not agree with this, I can put the all the XCOFF-specific dumping option together.

Harbormaster completed remote builds in B184633: Diff 457337.Sep 1 2022, 2:02 PM

In D133030#3763938, @DiggerLin wrote:

In D133030#3763317, @jhenderson wrote:

Rather than canned binaries, I think it wouldn't be too hard to add yaml2obj support for exception sections, so that you can create the input at test time.

yes, if I do the yaml2obj first , I need a tools to decode the object file generated by yaml2obj when I add a test for the yaml2obj, but there is not now. My propose is three steps:

using canned binaries in this patch,

write a second patch to yaml2obj support for exception sections(using the llvm-readobj --exception-section to test the second patch).

have another patch to modify the llvm/test/tools/llvm-readobj/XCOFF/exception-section.test which use the yaml2obj to generate xcoff object with exception sections and replace the canned binaries in the test case.

I think my preferred option would be to use yaml2bj/obj2yaml to test each other, with an additional manual step for now:

Write yaml2obj code for the desired behaviour, and obj2yaml code which can convert the object back into YAML, which you can then check with FileCheck.
After running the test, use your system tools to manually check the temporary object file created by the test looks correct. If so, land this change.
In a second patch (this one), use yaml2obj to create your test inputs and verify that llvm-readobj can dump them correctly.
You could also manually verify using your system tools that the input looks as you would expect again.
Add llvm-readobj checks to your yaml2obj tests.

llvm/test/tools/llvm-readobj/XCOFF/exception-section.test
10	This isn't any different to how ELF is usually dumped. For example, when dumping ELF relocations (see for example https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/llvm-readobj/ELF/relocations.test), the ELF format refers to a symbol index, but the printout displays the symbol name. In general, if a user is trying to find out information, it is more likely that they are interested in the symbol not the index of that symbol, so printing the name first is the more useful thing.
llvm/test/tools/llvm-readobj/XCOFF/invalid-exception-section.test
2
4	`dd` isn't used anywhere else in the tests as far as I can tell, which means adding it might break some users who do not have that tool. Instead, you can use python to achieve the same effect, by reading in the file and modifying the specific bytes. (Although this is where yaml2obj is more useful)
5	Use `echo` not `printf`.
8
llvm/tools/llvm-readobj/XCOFFDumper.cpp
144	I don't think you've tested this warning?
llvm/tools/llvm-readobj/llvm-readobj.cpp
510–511	Okay, your explanation makes sense, thanks. Please put it in comments somewhere in the file, e.g. "this data appears early in XCOFF files so display it first" etc.

jhenderson added inline comments.Sep 2 2022, 1:06 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
231	Why the `public` directive? It makes it look like the fields are `private`, except they aren't because this is a `struct`.
238	I'd also consider replacing `Reason` with `Reason != 0`. This makes it a mirror of the `getSymbolIndex` function.
249–250	Why the duplication? In fact, why the `extern template` at all?
llvm/lib/Object/XCOFFObjectFile.cpp
407	I think it would help the understandability of these changes if the refactoring was split into a separate prerequisite patch. You'd thus have two patches (well three if you include the yaml2obj patch discussed out of line): NFC refactoring to make the loader section code reusable. Implementation of the exception section.
430
436–437	I'd use the slightly shorter names `SectionOffset` and `SectionSize`.
442	Don't start blocks with a blank line.
446–451	All the magic numbers make this code completely opaque. Why "3", why "16", why "1" etc. I think you'd be better off assigning them to const variables so you can name them. Also, I think the consensus is that lambdas should have name style like variables, because they are function objects rather than pure functions. In other words, I'd name this `GetSectionName` (no need to abbreviate).
808–812	Could a template function (or `auto` lambda) allow you to avoid this duplication (passing in the `sections32/sections64` functions as parameters)?
1044	Probably delete this blank line.
1050–1051	This assertion is independent of finding the section, so should probably the first line of the function.

address comment.

llvm/include/llvm/Object/XCOFFObjectFile.h
231	just for readable to separate the data member and function.
249–250	"extern template" means the template is initiated somewhere. for the template has member function. without the "extern template". there maybe member function maybe be initiate in different compile unit. it waste the the code.
llvm/lib/Object/XCOFFObjectFile.cpp
407	the function getLoaderSectionAddress is mapped into getSectionFileOffsetToRawData(XCOFF::STYP_LOADER). and the function getSectionFileOffsetToRawData has more functionality than only get getLoaderSectionAddress. if split into two patch. I do not think it is NFC patch.
446–451	the functionality of the mapping the value SectionTypeFlags into string. https://github.com/llvm/llvm-project/blob/main/llvm/include/llvm/BinaryFormat/XCOFF.h
808–812	good idea.

Harbormaster completed remote builds in B185628: Diff 458744.Sep 8 2022, 8:37 AM

jhenderson added inline comments.Sep 9 2022, 1:00 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
231	just for readable to separate the data member and function. The addition of an explicit and unnecessary `public` makes things less readable for me, not more. If you wish to explicitly call out that the method is public, put the method before the members and label the whole struct with `public`. Also nit: add a blank line between your methods, where your methods are multi line.
249–250	We don't bother with extern templates in many other places, so I'm not sure why you felt that this particular case needed it? Actually, after writing that, I see it's the same for other XCOFFObjectFile declarations, so it's probably not worth me worrying about in this patch, although it's still the case that there's limited use of this feature outside this file. I'd still like to know why you feel like these structs need it specifically, when in most cases within LLVM we don't bother.
llvm/lib/Object/XCOFFObjectFile.cpp
407	All the more reason to split this up then: if the first patch isn't a pure refactor, and adds new functionality, that functionality should be added and tested for the loader section. If on the other hand it is not used, the additional functionality can be added in the second patch.
446–451	Correct me if I'm wrong, but it looks to me like this is just an over-complicated way of mapping the section type values to section names. Wouldn't a switch statement be significantly clearer? Something like: StringRef getSectionName(int32_t SectType) { switch(SectType) { case STYP_PAD: return "pad"; case STYPE_DWARF: return "dwarf"; ... default: return "<unknown: " + SectType + ">"; } } (The default case covers the situation where a user provides a type that isn't a known type, but could be replaced by something else appropriate).
llvm/test/tools/llvm-readobj/XCOFF/exception-section.test
19	It would probably help readers/maintainers if this had some small comments in the lines immediately below with arrows pointing up at the start of each field in the data. There are several examples of this in the yaml2obj DWARF tests. Something like: SectionData: "0000000000000000003400030000005c0002000000010000000001140002000001400002" ^ ^-FieldName +-SymIndex (add more labels with field names as appropriate).
26	Nit: get rid of double blank line. Same in the other test.
llvm/test/tools/llvm-readobj/XCOFF/invalid-exception-section.test
8	Not addressed, and now duplicated in the invalid_sym case too.
21–25	Do you need any symbols at all for this test case? If so, would 1 suffice?
36	Do you really need this much data for this test case? A single entry would surely be sufficient, and you wouldn't even need any symbols at all.

address comment.

llvm/include/llvm/Object/XCOFFObjectFile.h
249–250	in the patch, it only llvm-readobj/XCOFFDumper.cpp used the member functions of ExceptionSectionEntry. but I instanced the member functions of ExceptionSectionEntry in the XCOFFObjectFile.cpp instead other files. just because we still want to implement the decode the Exception Section in llvm-objdump later, I am not sure whether we use use the member functions of ExceptionSectionEntry in XCOFFObjectFile.cpp later. I only instanced the ExceptionSectionEntry in the XCOFFObjectFile.cpp explicitly and extern instanced the ExceptionSectionEntry explicitly, It will guarantee the members functions of ExceptionSectionEntry only be instantiated once.
llvm/lib/Object/XCOFFObjectFile.cpp
407	for the in the patch https://reviews.llvm.org/D110320 and https://reviews.llvm.org/D106643 , in the getLoaderSectionAddress(). there is no test case for the code "return createError(toString(std::move(E)) + ": loader section with offset 0x" + Twine::utohexstr(OffsetToLoaderSection) + " and size 0x" + Twine::utohexstr(SizeOfLoaderSection) + " goes past the end of the file"); " I do not think we will used a canned invalid xcoff object file to test invalid loader Section. I will not touch the code the getLoaderSectionAddress() in current patch. and After the current patch landed, I will create a new NFC patch to refactor the getLoaderSectionAddress().(which will delete the function getLoaderSectionAddress())
446–451	yes, it is mapping the section type values to section names, I think your way is easy to understand but much more code(at lease 30 lines codes, at least 20 lines more codes). and if the enum of SectType has 100 different value, that means I have to write about 200 lines?
llvm/test/tools/llvm-readobj/XCOFF/invalid-exception-section.test
21–25	yes,agree.
36	agree, thanks

Harbormaster completed remote builds in B186199: Diff 459514.Sep 12 2022, 11:32 AM

jhenderson added inline comments.Sep 13 2022, 12:09 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
249–250	Right, I understand what the extern template achieves, but your answer doesn't explain why this struct is special compared to many other classes and structs in LLVM that don't use extern template. Anyway, like I said, this is a discussion for another time.
llvm/lib/Object/XCOFFObjectFile.cpp
407	Sounds reasonable to me.
446–451	If the code is simpler to understand, you should always prefer that approach to the opaque option, even if the opaque option is many fewer lines of code. The switch/case approach may well be more efficient too, since compilers can easily optimize a switch case into a single jump. Listing them out, the advantages of switch/case are: Easy to follow. Clear mapping of value to name (doesn't require maintaining two parallel but intrinsically linked arrays/enums). Improved compiler diagnostics (e.g. compilers can warn if no default case and not all cases in an enum are covered). Potentially improved performance. The advantage of the existing approach: Fewer lines of code. SectType doesn't have 100 values, so your argument is a false comparison. Regardless, even if it did, you'd still have to have a 100 element array and make sure that the order in that array exactly matched the order of the enum values, so it's not exactly like it's any more maintainable.
457

DiggerLin updated this revision to Diff 459799.Sep 13 2022, 10:33 AM

DiggerLin marked 3 inline comments as done.

Harbormaster completed remote builds in B186407: Diff 459799.Sep 13 2022, 10:58 AM

jhenderson added inline comments.Sep 14 2022, 12:29 AM

llvm/lib/Object/XCOFFObjectFile.cpp
454	`SectionName` is unitialized if this function is called with an unknown section type. Either add a default case to the switch, or, preferably (in my opinion), assign it to some other initial value, preferably that includes the section type value, so that a user can see what the input is that is passed in but has gone wrong. If there's a straightforward way to test this, e.g. using a gtest unit test, than that would be good, but I accept that this might not be the case at the moment.
472

address comment

DiggerLin added inline comments.Sep 14 2022, 10:19 AM

llvm/lib/Object/XCOFFObjectFile.cpp
454	For in the code, I have covered all the enumeration values. if I use a default label, there will be compile error as "error in switch which covers all enumeration values [-Werror,-Wcovered-switch-default]"

Harbormaster completed remote builds in B186664: Diff 460145.Sep 14 2022, 10:58 AM

jhenderson added inline comments.Sep 15 2022, 12:38 AM

llvm/lib/Object/XCOFFObjectFile.cpp
454	`256` is unnecessarily large, given the maximum possible size of any of these names. Reduce it so that the typical max size is the maximum size of one of the normal cases.
455	The `<unknown:0x1234>` style is used in other places, so I think it's reasonable to use it here. On ELF, we use hex numbers for the section type too. You may wish to do that here too.

DiggerLin updated this revision to Diff 460401.Sep 15 2022, 7:29 AM

Harbormaster completed remote builds in B186854: Diff 460401.Sep 15 2022, 7:54 AM

One remaining nit, otherwise LGTM.

llvm/lib/Object/XCOFFObjectFile.cpp
455	Do you need to add the `0x` explicitly? I haven't looked at `uthexstr`, so it might not need it, but I think we should have the 0x to make it clear it is in hex.

This revision is now accepted and ready to land.Sep 16 2022, 12:18 AM

DiggerLin marked an inline comment as done.Sep 19 2022, 6:48 AM

DiggerLin added inline comments.

llvm/lib/Object/XCOFFObjectFile.cpp
455	Twine::utohexstr will have "0x" as prefix.

This revision was landed with ongoing or failed builds.Sep 19 2022, 7:56 AM

Closed by commit rGdcd5abd4c482: [AIX] llvm-readobj support a new option --exception-section for xcoff object… (authored by zhijian <zhijian@ca.ibm.com>). · Explain Why

This revision was automatically updated to reflect the committed changes.

zhijian <zhijian@ca.ibm.com> added a commit: rGdcd5abd4c482: [AIX] llvm-readobj support a new option --exception-section for xcoff object….

pscoro mentioned this in D132146: [PowerPC] XCOFF exception section support on the direct assembler path and exception language and reason code lowering from trap intrinsics.Sep 19 2022, 9:59 AM

pscoro mentioned this in D134195: [PowerPC] XCOFF exception section support on the integrated assembler path.Sep 19 2022, 10:02 AM

shchenz mentioned this in rGce004fb4f2e6: [PowerPC] XCOFF exception section support on the direct assembler path.Sep 26 2022, 7:24 PM

shchenz mentioned this in rG2234098291de: [PowerPC] XCOFF exception section support on the integrated assembler path.Nov 20 2022, 10:16 PM

Revision Contents

Path

Size

llvm/

docs/

CommandGuide/

llvm-readobj.rst

4 lines

include/

llvm/

Object/

XCOFFObjectFile.h

38 lines

lib/

Object/

XCOFFObjectFile.cpp

98 lines

test/

tools/

llvm-readobj/

XCOFF/

exception-section.test

55 lines

invalid-exception-section.test

38 lines

tools/

llvm-readobj/

1 line

1 line

45 lines

8 lines

Diff 461208

llvm/docs/CommandGuide/llvm-readobj.rst

Show First 20 Lines • Show All 324 Lines • ▼ Show 20 Lines

----------------------

The following options are implemented only for the XCOFF file format.

.. option:: --auxiliary-header

Display XCOFF Auxiliary header.

.. option:: --exception-section

Display XCOFF exception section entries.

jhendersonUnsubmitted

Done

.. option:: --exception-section

- Display exception section entries of XCOFF object file.

+ Display XCOFF exception section entries.

EXIT STATUS

I think you can simplify to the inline suggestion.

jhenderson: I think you can simplify to the inline suggestion.

EXIT STATUS

-----------

:program:`llvm-readobj` returns 0 under normal operation. It returns a non-zero

exit code if there were any errors.

llvm/include/llvm/Object/XCOFFObjectFile.h

Show First 20 Lines • Show All 214 Lines • ▼ Show 20 Lines struct LoaderSectionHeader64 {

support::ubig32_t LengthOfStrTbl; support::ubig32_t LengthOfStrTbl;

support::big64_t OffsetToImpid; support::big64_t OffsetToImpid;

support::big64_t OffsetToStrTbl; support::big64_t OffsetToStrTbl;

support::big64_t OffsetToSymTbl; support::big64_t OffsetToSymTbl;

char Padding[16]; char Padding[16];

support::big32_t OffsetToRelEnt; support::big32_t OffsetToRelEnt;

}; };

template <typename AddressType> struct ExceptionSectionEntry {

union {

support::ubig32_t SymbolIdx;

AddressType TrapInstAddr;

};

uint8_t LangId;

uint8_t Reason;

uint32_t getSymbolIndex() const {

jhendersonUnsubmitted

Done

Why the public directive? It makes it look like the fields are private, except they aren't because this is a struct.

jhenderson: Why the `public` directive? It makes it look like the fields are `private`, except they aren't…

DiggerLinAuthorUnsubmitted

Done

just for readable to separate the data member and function.

DiggerLin: just for readable to separate the data member and function.

jhendersonUnsubmitted

Done

just for readable to separate the data member and function.

The addition of an explicit and unnecessary public makes things less readable for me, not more. If you wish to explicitly call out that the method is public, put the method before the members and label the whole struct with public.

Also nit: add a blank line between your methods, where your methods are multi line.

jhenderson: > just for readable to separate the data member and function. The addition of an explicit and…

assert(Reason == 0 && "Get symbol table index of the function only when "

"the e_reason field is 0.");

return SymbolIdx;

}

uint64_t getTrapInstAddr() const {

assert(Reason != 0 && "Zero is not a valid trap exception reason code.");

jhendersonUnsubmitted

Done

uint64_t getTrapInstAddr() const {

- assert(Reason && " Zero is not a valid trap exception reason code.");

+ assert(Reason && "Zero is not a valid trap exception reason code.");

return TrapInstAddr;

I'd also consider replacing Reason with Reason != 0. This makes it a mirror of the getSymbolIndex function.

jhenderson: I'd also consider replacing `Reason` with `Reason != 0`. This makes it a mirror of the…

return TrapInstAddr;

}

uint8_t getLangID() const { return LangId; }

uint8_t getReason() const { return Reason; }

};

typedef ExceptionSectionEntry<support::ubig32_t> ExceptionSectionEntry32;

typedef ExceptionSectionEntry<support::ubig64_t> ExceptionSectionEntry64;

// Explicit extern template declarations.

extern template struct ExceptionSectionEntry<support::ubig32_t>;

extern template struct ExceptionSectionEntry<support::ubig64_t>;

jhendersonUnsubmitted

Done

Why the duplication? In fact, why the extern template at all?

jhenderson: Why the duplication? In fact, why the `extern template` at all?

DiggerLinAuthorUnsubmitted

Done

"extern template" means the template is initiated somewhere.
for the template has member function. without the "extern template". there maybe member function maybe be initiate in different compile unit. it waste the the code.

DiggerLin: "extern template" means the template is initiated somewhere. for the template has member…

jhendersonUnsubmitted

Done

We don't bother with extern templates in many other places, so I'm not sure why you felt that this particular case needed it?

Actually, after writing that, I see it's the same for other XCOFFObjectFile declarations, so it's probably not worth me worrying about in this patch, although it's still the case that there's limited use of this feature outside this file. I'd still like to know why you feel like these structs need it specifically, when in most cases within LLVM we don't bother.

jhenderson: We don't bother with extern templates in many other places, so I'm not sure why you felt that…

DiggerLinAuthorUnsubmitted

Done

in the patch, it only llvm-readobj/XCOFFDumper.cpp used the member functions of ExceptionSectionEntry. but I instanced the member functions of ExceptionSectionEntry in the XCOFFObjectFile.cpp instead other files. just because we still want to implement the decode the Exception Section in llvm-objdump later, I am not sure whether we use use the member functions of ExceptionSectionEntry in XCOFFObjectFile.cpp later. I only instanced the ExceptionSectionEntry in the XCOFFObjectFile.cpp explicitly and extern instanced the ExceptionSectionEntry explicitly, It will guarantee the members functions of ExceptionSectionEntry only be instantiated once.

DiggerLin: in the patch, it only llvm-readobj/XCOFFDumper.cpp used the member functions of…

jhendersonUnsubmitted

Done

Right, I understand what the extern template achieves, but your answer doesn't explain why this struct is special compared to many other classes and structs in LLVM that don't use extern template.

Anyway, like I said, this is a discussion for another time.

jhenderson: Right, I understand what the extern template achieves, but your answer doesn't explain why this…

struct XCOFFStringTable { struct XCOFFStringTable {

uint32_t Size; uint32_t Size;

const char *Data; const char *Data;

}; };

struct XCOFFCsectAuxEnt32 { struct XCOFFCsectAuxEnt32 {

support::ubig32_t SectionOrLength; support::ubig32_t SectionOrLength;

support::ubig32_t ParameterHashIndex; support::ubig32_t ParameterHashIndex;

▲ Show 20 Lines • Show All 228 Lines • ▼ Show 20 Lines private:

size_t getSectionHeaderSize() const; size_t getSectionHeaderSize() const;

const XCOFFSectionHeader32 *toSection32(DataRefImpl Ref) const; const XCOFFSectionHeader32 *toSection32(DataRefImpl Ref) const;

const XCOFFSectionHeader64 *toSection64(DataRefImpl Ref) const; const XCOFFSectionHeader64 *toSection64(DataRefImpl Ref) const;

uintptr_t getSectionHeaderTableAddress() const; uintptr_t getSectionHeaderTableAddress() const;

uintptr_t getEndOfSymbolTableAddress() const; uintptr_t getEndOfSymbolTableAddress() const;

Expected<uintptr_t> getLoaderSectionAddress() const; Expected<uintptr_t> getLoaderSectionAddress() const;

DataRefImpl getSectionByType(XCOFF::SectionTypeFlags SectType) const;

uint64_t getSectionFileOffsetToRawData(DataRefImpl Sec) const;

Expected<uintptr_t>

getSectionFileOffsetToRawData(XCOFF::SectionTypeFlags SectType) const;

// This returns a pointer to the start of the storage for the name field of // This returns a pointer to the start of the storage for the name field of

// the 32-bit or 64-bit SectionHeader struct. This string is *not* necessarily // the 32-bit or 64-bit SectionHeader struct. This string is *not* necessarily

// null-terminated. // null-terminated.

const char *getSectionNameInternal(DataRefImpl Sec) const; const char *getSectionNameInternal(DataRefImpl Sec) const;

static bool isReservedSectionNumber(int16_t SectionNumber); static bool isReservedSectionNumber(int16_t SectionNumber);

// Constructor and "create" factory function. The constructor is only a thin // Constructor and "create" factory function. The constructor is only a thin

▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines public:

getNumberOfRelocationEntries(const XCOFFSectionHeader<T> &Sec) const; getNumberOfRelocationEntries(const XCOFFSectionHeader<T> &Sec) const;

template <typename Shdr, typename Reloc> template <typename Shdr, typename Reloc>

Expected<ArrayRef<Reloc>> relocations(const Shdr &Sec) const; Expected<ArrayRef<Reloc>> relocations(const Shdr &Sec) const;

// Loader section related interfaces. // Loader section related interfaces.

Expected<StringRef> getImportFileTable() const; Expected<StringRef> getImportFileTable() const;

// Exception-related interface.

template <typename ExceptEnt>

Expected<ArrayRef<ExceptEnt>> getExceptionEntries() const;

// This function returns string table entry. // This function returns string table entry.

Expected<StringRef> getStringTableEntry(uint32_t Offset) const; Expected<StringRef> getStringTableEntry(uint32_t Offset) const;

// This function returns the string table. // This function returns the string table.

StringRef getStringTable() const; StringRef getStringTable() const;

const XCOFF::SymbolAuxType *getSymbolAuxType(uintptr_t AuxEntryAddress) const; const XCOFF::SymbolAuxType *getSymbolAuxType(uintptr_t AuxEntryAddress) const;

▲ Show 20 Lines • Show All 215 Lines • Show Last 20 Lines

llvm/lib/Object/XCOFFObjectFile.cpp

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines

template <typename AddressType> template <typename AddressType>

uint8_t XCOFFRelocation<AddressType>::getRelocatedLength() const { uint8_t XCOFFRelocation<AddressType>::getRelocatedLength() const {

// The relocation encodes the bit length being relocated minus 1. Add back // The relocation encodes the bit length being relocated minus 1. Add back

// the 1 to get the actual length being relocated. // the 1 to get the actual length being relocated.

return (Info & XR_BIASED_LENGTH_MASK) + 1; return (Info & XR_BIASED_LENGTH_MASK) + 1;

} }

template struct ExceptionSectionEntry<support::ubig32_t>;

template struct ExceptionSectionEntry<support::ubig64_t>;

uintptr_t uintptr_t

XCOFFObjectFile::getAdvancedSymbolEntryAddress(uintptr_t CurrentAddress, XCOFFObjectFile::getAdvancedSymbolEntryAddress(uintptr_t CurrentAddress,

uint32_t Distance) { uint32_t Distance) {

jhendersonUnsubmitted

Done

This file hasn't been clang-formatted. Please fix.

jhenderson: This file hasn't been clang-formatted. Please fix.

return getWithOffset(CurrentAddress, Distance * XCOFF::SymbolTableEntrySize); return getWithOffset(CurrentAddress, Distance * XCOFF::SymbolTableEntrySize);

} }

const XCOFF::SymbolAuxType * const XCOFF::SymbolAuxType *

XCOFFObjectFile::getSymbolAuxType(uintptr_t AuxEntryAddress) const { XCOFFObjectFile::getSymbolAuxType(uintptr_t AuxEntryAddress) const {

assert(is64Bit() && "64-bit interface called on a 32-bit object file."); assert(is64Bit() && "64-bit interface called on a 32-bit object file.");

return viewAs<XCOFF::SymbolAuxType>( return viewAs<XCOFF::SymbolAuxType>(

getWithOffset(AuxEntryAddress, SymbolAuxTypeOffset)); getWithOffset(AuxEntryAddress, SymbolAuxTypeOffset));

▲ Show 20 Lines • Show All 287 Lines • ▼ Show 20 Lines

} }

uint64_t XCOFFObjectFile::getSectionAlignment(DataRefImpl Sec) const { uint64_t XCOFFObjectFile::getSectionAlignment(DataRefImpl Sec) const {

uint64_t Result = 0; uint64_t Result = 0;

llvm_unreachable("Not yet implemented!"); llvm_unreachable("Not yet implemented!");

return Result; return Result;

} }

uint64_t XCOFFObjectFile::getSectionFileOffsetToRawData(DataRefImpl Sec) const {

if (is64Bit())

return toSection64(Sec)->FileOffsetToRawData;

return toSection32(Sec)->FileOffsetToRawData;

}

Expected<uintptr_t> XCOFFObjectFile::getLoaderSectionAddress() const { Expected<uintptr_t> XCOFFObjectFile::getLoaderSectionAddress() const {

uint64_t OffsetToLoaderSection = 0; uint64_t OffsetToLoaderSection = 0;

uint64_t SizeOfLoaderSection = 0; uint64_t SizeOfLoaderSection = 0;

jhendersonUnsubmitted

Done

I think it would help the understandability of these changes if the refactoring was split into a separate prerequisite patch. You'd thus have two patches (well three if you include the yaml2obj patch discussed out of line):

NFC refactoring to make the loader section code reusable.
Implementation of the exception section.

jhenderson: I think it would help the understandability of these changes if the refactoring was split into…

DiggerLinAuthorUnsubmitted

Done

the function getLoaderSectionAddress is mapped into getSectionFileOffsetToRawData(XCOFF::STYP_LOADER).

and the function getSectionFileOffsetToRawData has more functionality than only get getLoaderSectionAddress. if split into two patch. I do not think it is NFC patch.

DiggerLin: the function getLoaderSectionAddress is mapped into getSectionFileOffsetToRawData(XCOFF…

jhendersonUnsubmitted

Done

All the more reason to split this up then: if the first patch isn't a pure refactor, and adds new functionality, that functionality should be added and tested for the loader section. If on the other hand it is not used, the additional functionality can be added in the second patch.

jhenderson: All the more reason to split this up then: if the first patch isn't a pure refactor, and adds…

DiggerLinAuthorUnsubmitted

Done

for the in the patch https://reviews.llvm.org/D110320 and https://reviews.llvm.org/D106643 , in the getLoaderSectionAddress(). there is no test case for the code "return createError(toString(std::move(E)) +

": loader section with offset 0x" +
Twine::utohexstr(OffsetToLoaderSection) +
" and size 0x" + Twine::utohexstr(SizeOfLoaderSection) +
" goes past the end of the file");

"
I do not think we will used a canned invalid xcoff object file to test invalid loader Section.

I will not touch the code the getLoaderSectionAddress() in current patch.
and After the current patch landed, I will create a new NFC patch to refactor the getLoaderSectionAddress().(which will delete the function getLoaderSectionAddress())

DiggerLin: for the in the patch https://reviews.llvm.org/D110320 and https://reviews.llvm.org/D106643 , in…

jhendersonUnsubmitted

Done

Sounds reasonable to me.

jhenderson: Sounds reasonable to me.

if (is64Bit()) { if (is64Bit()) {

for (const auto &Sec64 : sections64()) for (const auto &Sec64 : sections64())

if (Sec64.getSectionType() == XCOFF::STYP_LOADER) { if (Sec64.getSectionType() == XCOFF::STYP_LOADER) {

OffsetToLoaderSection = Sec64.FileOffsetToRawData; OffsetToLoaderSection = Sec64.FileOffsetToRawData;

SizeOfLoaderSection = Sec64.SectionSize; SizeOfLoaderSection = Sec64.SectionSize;

break; break;

} }

} else { } else {

for (const auto &Sec32 : sections32()) for (const auto &Sec32 : sections32())

if (Sec32.getSectionType() == XCOFF::STYP_LOADER) { if (Sec32.getSectionType() == XCOFF::STYP_LOADER) {

OffsetToLoaderSection = Sec32.FileOffsetToRawData; OffsetToLoaderSection = Sec32.FileOffsetToRawData;

SizeOfLoaderSection = Sec32.SectionSize; SizeOfLoaderSection = Sec32.SectionSize;

break; break;

} }

// No loader section is not an error. // No loader section is not an error.

if (!SizeOfLoaderSection) if (!SizeOfLoaderSection)

return 0; return 0;

uintptr_t LoderSectionStart = uintptr_t LoderSectionStart =

reinterpret_cast<uintptr_t>(base() + OffsetToLoaderSection); reinterpret_cast<uintptr_t>(base() + OffsetToLoaderSection);

jhendersonUnsubmitted

Done

DataRefImpl DRI = getSectionByType(SectType);

- if (DRI.p == 0) // No section is not error.

+ if (DRI.p == 0) // No section is not an error.

return 0;

jhenderson:

if (Error E = if (Error E =

Binary::checkOffset(Data, LoderSectionStart, SizeOfLoaderSection)) Binary::checkOffset(Data, LoderSectionStart, SizeOfLoaderSection))

return createError(toString(std::move(E)) + return createError(toString(std::move(E)) +

": loader section with offset 0x" + ": loader section with offset 0x" +

Twine::utohexstr(OffsetToLoaderSection) + Twine::utohexstr(OffsetToLoaderSection) +

" and size 0x" + Twine::utohexstr(SizeOfLoaderSection) + " and size 0x" + Twine::utohexstr(SizeOfLoaderSection) +

" goes past the end of the file"); " goes past the end of the file");

jhendersonUnsubmitted

Done

I'd use the slightly shorter names SectionOffset and SectionSize.

jhenderson: I'd use the slightly shorter names `SectionOffset` and `SectionSize`.

return LoderSectionStart; return LoderSectionStart;

} }

Expected<uintptr_t> XCOFFObjectFile::getSectionFileOffsetToRawData(

jhendersonUnsubmitted

Done

Don't start blocks with a blank line.

jhenderson: Don't start blocks with a blank line.

XCOFF::SectionTypeFlags SectType) const {

DataRefImpl DRI = getSectionByType(SectType);

if (DRI.p == 0) // No section is not an error.

return 0;

uint64_t SectionOffset = getSectionFileOffsetToRawData(DRI);

uint64_t SizeOfSection = getSectionSize(DRI);

jhendersonUnsubmitted

Done

All the magic numbers make this code completely opaque. Why "3", why "16", why "1" etc. I think you'd be better off assigning them to const variables so you can name them.

Also, I think the consensus is that lambdas should have name style like variables, because they are function objects rather than pure functions. In other words, I'd name this GetSectionName (no need to abbreviate).

jhenderson: All the magic numbers make this code completely opaque. Why "3", why "16", why "1" etc. I think…

DiggerLinAuthorUnsubmitted

Done

the functionality of the mapping the value SectionTypeFlags into string.

https://github.com/llvm/llvm-project/blob/main/llvm/include/llvm/BinaryFormat/XCOFF.h

DiggerLin: the functionality of the mapping the value SectionTypeFlags into string. https://github.

jhendersonUnsubmitted

Done

Correct me if I'm wrong, but it looks to me like this is just an over-complicated way of mapping the section type values to section names. Wouldn't a switch statement be significantly clearer? Something like:

StringRef getSectionName(int32_t SectType) {
  switch(SectType) {
  case STYP_PAD:
    return "pad";
  case STYPE_DWARF:
    return "dwarf";
  ...
  default:
    return "<unknown: " + SectType + ">";
  }
}

(The default case covers the situation where a user provides a type that isn't a known type, but could be replaced by something else appropriate).

jhenderson: Correct me if I'm wrong, but it looks to me like this is just an over-complicated way of…

DiggerLinAuthorUnsubmitted

Done

yes, it is mapping the section type values to section names, I think your way is easy to understand but much more code(at lease 30 lines codes, at least 20 lines more codes). and if the enum of SectType has 100 different value, that means I have to write about 200 lines?

DiggerLin: yes, it is mapping the section type values to section names, I think your way is easy to…

jhendersonUnsubmitted

Not Done

If the code is simpler to understand, you should always prefer that approach to the opaque option, even if the opaque option is many fewer lines of code. The switch/case approach may well be more efficient too, since compilers can easily optimize a switch case into a single jump.

Listing them out, the advantages of switch/case are:

Easy to follow.
Clear mapping of value to name (doesn't require maintaining two parallel but intrinsically linked arrays/enums).
Improved compiler diagnostics (e.g. compilers can warn if no default case and not all cases in an enum are covered).
Potentially improved performance.

The advantage of the existing approach:

Fewer lines of code.

SectType doesn't have 100 values, so your argument is a false comparison. Regardless, even if it did, you'd still have to have a 100 element array and make sure that the order in that array exactly matched the order of the enum values, so it's not exactly like it's any more maintainable.

jhenderson: If the code is simpler to understand, you should always prefer that approach to the opaque…

uintptr_t SectionStart = reinterpret_cast<uintptr_t>(base() + SectionOffset);

if (Error E = Binary::checkOffset(Data, SectionStart, SizeOfSection)) {

SmallString<32> UnknownType;

jhendersonUnsubmitted

Done

SectionName is unitialized if this function is called with an unknown section type. Either add a default case to the switch, or, preferably (in my opinion), assign it to some other initial value, preferably that includes the section type value, so that a user can see what the input is that is passed in but has gone wrong.

If there's a straightforward way to test this, e.g. using a gtest unit test, than that would be good, but I accept that this might not be the case at the moment.

jhenderson: `SectionName` is unitialized if this function is called with an unknown section type. Either…

DiggerLinAuthorUnsubmitted

Done

For in the code, I have covered all the enumeration values.
if I use a default label, there will be compile error as "error in switch which covers all enumeration values [-Werror,-Wcovered-switch-default]"

DiggerLin: For in the code, I have covered all the enumeration values. if I use a default label, there…

jhendersonUnsubmitted

Not Done

256 is unnecessarily large, given the maximum possible size of any of these names. Reduce it so that the typical max size is the maximum size of one of the normal cases.

jhenderson: `256` is unnecessarily large, given the maximum possible size of any of these names. Reduce it…

Twine(("<Unknown:") + Twine::utohexstr(SectType) + ">")

jhendersonUnsubmitted

Not Done

SmallString<256> UnknownType;

- Twine(("Unknown SectType(") + Twine(SectType) +")").toVector(UnknownType);

+ Twine("<unknown:" + Twine(SectType) +">").toVector(UnknownType);

const char *SectionName = UnknownType.c_str();

The <unknown:0x1234> style is used in other places, so I think it's reasonable to use it here. On ELF, we use hex numbers for the section type too. You may wish to do that here too.

jhenderson: The `<unknown:0x1234>` style is used in other places, so I think it's reasonable to use it here.

jhendersonUnsubmitted

Not Done

Do you need to add the 0x explicitly? I haven't looked at uthexstr, so it might not need it, but I think we should have the 0x to make it clear it is in hex.

jhenderson: Do you need to add the `0x` explicitly? I haven't looked at `uthexstr`, so it might not need it…

DiggerLinAuthorUnsubmitted

Done

Twine::utohexstr will have "0x" as prefix.

DiggerLin: Twine::utohexstr will have "0x" as prefix.

.toVector(UnknownType);

const char *SectionName = UnknownType.c_str();

jhendersonUnsubmitted

Done

"debug", "typchk", "ovrflo"};

- SmallString<256> UnknowType;

+ SmallString<256> UnknownType;

auto GetSectionName = [&]() {

jhenderson:

switch (SectType) {

#define ECASE(Value, String) \

case XCOFF::Value: \

SectionName = String; \

break

ECASE(STYP_PAD, "pad");

ECASE(STYP_DWARF, "dwarf");

ECASE(STYP_TEXT, "text");

ECASE(STYP_DATA, "data");

ECASE(STYP_BSS, "bss");

ECASE(STYP_EXCEPT, "expect");

ECASE(STYP_INFO, "info");

ECASE(STYP_TDATA, "tdata");

jhendersonUnsubmitted

Not Done

ECASE(STYP_DEBUG, "debug");

- ECASE(STYP_TYPCHK, "tyechk");

+ ECASE(STYP_TYPCHK, "typchk");

ECASE(STYP_OVRFLO, "ovrflo");

jhenderson:

ECASE(STYP_TBSS, "tbss");

ECASE(STYP_LOADER, "loader");

ECASE(STYP_DEBUG, "debug");

ECASE(STYP_TYPCHK, "typchk");

ECASE(STYP_OVRFLO, "ovrflo");

#undef ECASE

}

return createError(toString(std::move(E)) + ": " + SectionName +

" section with offset 0x" +

Twine::utohexstr(SectionOffset) + " and size 0x" +

Twine::utohexstr(SizeOfSection) +

" goes past the end of the file");

}

return SectionStart;

}

bool XCOFFObjectFile::isSectionCompressed(DataRefImpl Sec) const { bool XCOFFObjectFile::isSectionCompressed(DataRefImpl Sec) const {

return false; return false;

} }

bool XCOFFObjectFile::isSectionText(DataRefImpl Sec) const { bool XCOFFObjectFile::isSectionText(DataRefImpl Sec) const {

return getSectionFlags(Sec) & XCOFF::STYP_TEXT; return getSectionFlags(Sec) & XCOFF::STYP_TEXT;

} }

▲ Show 20 Lines • Show All 293 Lines • ▼ Show 20 Lines return createStringError(object_error::invalid_section_index,

") is invalid"); ") is invalid");

DataRefImpl DRI; DataRefImpl DRI;

DRI.p = getWithOffset(getSectionHeaderTableAddress(), DRI.p = getWithOffset(getSectionHeaderTableAddress(),

getSectionHeaderSize() * (Num - 1)); getSectionHeaderSize() * (Num - 1));

return DRI; return DRI;

} }

DataRefImpl

XCOFFObjectFile::getSectionByType(XCOFF::SectionTypeFlags SectType) const {

DataRefImpl DRI;

auto GetSectionAddr = [&](const auto &Sections) {

for (const auto &Sec : Sections)

if (Sec.getSectionType() == SectType)

return reinterpret_cast<uintptr_t>(&Sec);

return 0ul;

};

if (is64Bit())

DRI.p = GetSectionAddr(sections64());

else

DRI.p = GetSectionAddr(sections32());

return DRI;

}

jhendersonUnsubmitted

Done

Could a template function (or auto lambda) allow you to avoid this duplication (passing in the sections32/sections64 functions as parameters)?

jhenderson: Could a template function (or `auto` lambda) allow you to avoid this duplication (passing in…

DiggerLinAuthorUnsubmitted

Done

good idea.

DiggerLin: good idea.

Expected<StringRef> Expected<StringRef>

XCOFFObjectFile::getSymbolSectionName(XCOFFSymbolRef SymEntPtr) const { XCOFFObjectFile::getSymbolSectionName(XCOFFSymbolRef SymEntPtr) const {

const int16_t SectionNum = SymEntPtr.getSectionNumber(); const int16_t SectionNum = SymEntPtr.getSectionNumber();

switch (SectionNum) { switch (SectionNum) {

case XCOFF::N_DEBUG: case XCOFF::N_DEBUG:

return "N_DEBUG"; return "N_DEBUG";

case XCOFF::N_ABS: case XCOFF::N_ABS:

▲ Show 20 Lines • Show All 206 Lines • ▼ Show 20 Lines return createError(

Twine::utohexstr(NumRelocEntries * sizeof(Reloc)) + Twine::utohexstr(NumRelocEntries * sizeof(Reloc)) +

" go past the end of the file"); " go past the end of the file");

const Reloc *StartReloc = RelocationOrErr.get(); const Reloc *StartReloc = RelocationOrErr.get();

return ArrayRef<Reloc>(StartReloc, StartReloc + NumRelocEntries); return ArrayRef<Reloc>(StartReloc, StartReloc + NumRelocEntries);

} }

template <typename ExceptEnt>

Expected<ArrayRef<ExceptEnt>> XCOFFObjectFile::getExceptionEntries() const {

assert(is64Bit() && sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry64) ||

!is64Bit() && sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry32));

Expected<uintptr_t> ExceptionSectOrErr =

getSectionFileOffsetToRawData(XCOFF::STYP_EXCEPT);

if (!ExceptionSectOrErr)

return ExceptionSectOrErr.takeError();

jhendersonUnsubmitted

Done

Probably delete this blank line.

jhenderson: Probably delete this blank line.

DataRefImpl DRI = getSectionByType(XCOFF::STYP_EXCEPT);

if (DRI.p == 0)

return ArrayRef<ExceptEnt>();

ExceptEnt *ExceptEntStart =

reinterpret_cast<ExceptEnt *>(*ExceptionSectOrErr);

jhendersonUnsubmitted

Done

This assertion is independent of finding the section, so should probably the first line of the function.

jhenderson: This assertion is independent of finding the section, so should probably the first line of the…

return ArrayRef<ExceptEnt>(

ExceptEntStart, ExceptEntStart + getSectionSize(DRI) / sizeof(ExceptEnt));

jhendersonUnsubmitted

Done

uint64_t SizeOfSection = getSectionSize(DRI);

- if (is64Bit())

- assert(sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry64));

- else

- assert(sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry32));

+ assert((is64Bit() && sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry64) || (!is64Bit() && sizeof(ExceptEnt) == sizeof(ExceptionSectionEntry32));

ExceptEnt *ExceptEntStart =

I recommend folding this if into the assertions, possibly like the inline suggestion. Otherwise you have an empty if without assertions, which seems messy (although the optimizer should get rid of it in this case).

jhenderson: I recommend folding this `if` into the assertions, possibly like the inline suggestion.

DiggerLinAuthorUnsubmitted

Done

thanks for explain. fixed

DiggerLin: thanks for explain. fixed

}

template Expected<ArrayRef<ExceptionSectionEntry32>>

XCOFFObjectFile::getExceptionEntries() const;

template Expected<ArrayRef<ExceptionSectionEntry64>>

XCOFFObjectFile::getExceptionEntries() const;

Expected<XCOFFStringTable> Expected<XCOFFStringTable>

XCOFFObjectFile::parseStringTable(const XCOFFObjectFile *Obj, uint64_t Offset) { XCOFFObjectFile::parseStringTable(const XCOFFObjectFile *Obj, uint64_t Offset) {

// If there is a string table, then the buffer must contain at least 4 bytes // If there is a string table, then the buffer must contain at least 4 bytes

// for the string table's size. Not having a string table is not an error. // for the string table's size. Not having a string table is not an error.

if (Error E = Binary::checkOffset( if (Error E = Binary::checkOffset(

Obj->Data, reinterpret_cast<uintptr_t>(Obj->base() + Offset), 4)) { Obj->Data, reinterpret_cast<uintptr_t>(Obj->base() + Offset), 4)) {

consumeError(std::move(E)); consumeError(std::move(E));

return XCOFFStringTable{0, nullptr}; return XCOFFStringTable{0, nullptr};

▲ Show 20 Lines • Show All 544 Lines • Show Last 20 Lines

llvm/test/tools/llvm-readobj/XCOFF/exception-section.test

This file was added.

				## Test the --exception-section option.

				# RUN: yaml2obj --docnum=1 %s -o %t_xcoff32.o
				# RUN: yaml2obj --docnum=2 %s -o %t_xcoff64.o
				# RUN: llvm-readobj --exception-section %t_xcoff32.o \|\
				# RUN: FileCheck %s --check-prefixes=CHECK
				# RUN: llvm-readobj --exception-section %t_xcoff64.o \|\
				# RUN: FileCheck %s --check-prefixes=CHECK

				--- !XCOFF
				jhendersonUnsubmitted Done Reply Inline Actions I think it would be more similar to other dumping formats to print this row as: `Symbol: .bar (12)`. jhenderson: I think it would be more similar to other dumping formats to print this row as: `Symbol: .bar…
				DiggerLinAuthorUnsubmitted Done Reply Inline Actions according to https://www.ibm.com/docs/en/aix/7.2?topic=formats-xcoff-object-file-format#XCOFF__iua3i23ajbau e_addr.e_symndx+ Symbol table index for function the value of the field is for symbol index, so putting symbol index at first and putting symbol name in brackets are reasonable. DiggerLin: according to https://www.ibm.com/docs/en/aix/7.2?topic=formats-xcoff-object-file…
				jhendersonUnsubmitted Done Reply Inline Actions This isn't any different to how ELF is usually dumped. For example, when dumping ELF relocations (see for example https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/llvm-readobj/ELF/relocations.test), the ELF format refers to a symbol index, but the printout displays the symbol name. In general, if a user is trying to find out information, it is more likely that they are interested in the symbol not the index of that symbol, so printing the name first is the more useful thing. jhenderson: This isn't any different to how ELF is usually dumped. For example, when dumping ELF…
				FileHeader:
				MagicNumber: 0x1DF
				Sections:
				- Name: .text
				Flags: [ STYP_TEXT ]
				- Name: .except
				Flags: [ STYP_EXCEPT ]
				SectionData: "000000000000000000340003"
				## ^------- -SymbolIndex=0
				jhendersonUnsubmitted Done Reply Inline Actions It would probably help readers/maintainers if this had some small comments in the lines immediately below with arrows pointing up at the start of each field in the data. There are several examples of this in the yaml2obj DWARF tests. Something like: SectionData: "0000000000000000003400030000005c0002000000010000000001140002000001400002" ^ ^-FieldName +-SymIndex (add more labels with field names as appropriate). jhenderson: It would probably help readers/maintainers if this had some small comments in the lines…
				## ^- -LangID=0
				## ^- -Reason=0
				## ^------- -Trap Instr Addr=0x34
				## ^- -LangID=0
				## ^- -Reason=3
				Symbols:
				- Name: .bar
				jhendersonUnsubmitted Done Reply Inline Actions Nit: get rid of double blank line. Same in the other test. jhenderson: Nit: get rid of double blank line. Same in the other test.
				Section: .text

				--- !XCOFF
				FileHeader:
				MagicNumber: 0x1F7
				Sections:
				- Name: .text
				Flags: [ STYP_TEXT ]
				- Name: .except
				Flags: [ STYP_EXCEPT ]
				SectionData: "0000000000000000000000000000000000340003"
				## ^--------------- -SymbolIndex=0
				## ^- -LangID=0
				## ^- -Reason=0
				## ^-------------- -Trap Instr Addr=0x34
				## ^- -LangID=0
				## ^- -Reason=3
				Symbols:
				- Name: .bar
				Section: .text

				# CHECK: Exception section {
				# CHECK-NEXT: Symbol: .bar (0)
				# CHECK-NEXT: LangID: 0
				# CHECK-NEXT: Reason: 0
				# CHECK-NEXT: Trap Instr Addr: 0x34
				# CHECK-NEXT: LangID: 0
				# CHECK-NEXT: Reason: 3
				# CHECK-NEXT: }

llvm/test/tools/llvm-readobj/XCOFF/invalid-exception-section.test

This file was added.

## Test decoding an invalid exception section and symbol index.

jhendersonUnsubmitted

Done

- ## Test decoding a invalid exception section.

+ ## Test decoding an invalid exception section.

# RUN: dd bs=1 count=186 if=%p/Inputs/exception-section.o of=%t_invalid.o

jhenderson:

# RUN: yaml2obj --docnum=1 %s -o %t_invalid_size.o

# RUN: yaml2obj --docnum=2 %s -o %t_invalid_sym.o

jhendersonUnsubmitted

Done

dd isn't used anywhere else in the tests as far as I can tell, which means adding it might break some users who do not have that tool. Instead, you can use python to achieve the same effect, by reading in the file and modifying the specific bytes.

(Although this is where yaml2obj is more useful)

jhenderson: `dd` isn't used anywhere else in the tests as far as I can tell, which means adding it might…

# RUN: llvm-readobj --exception-section %t_invalid_size.o 2>&1 |\

jhendersonUnsubmitted

Done

Use echo not printf.

jhenderson: Use `echo` not `printf`.

# RUN: FileCheck -DFILE=%t_invalid_size.o %s --check-prefixes=CHECK-WARN-SIZE

# RUN: llvm-readobj --exception-section %t_invalid_sym.o 2>&1 |\

# RUN: FileCheck -DFILE=%t_invalid_sym.o %s --check-prefixes=CHECK-WARN-SYM

jhendersonUnsubmitted

Done

# RUN: dd bs=1 skip=190 seek=190 count=2000 if=%p/Inputs/exception-section.o of=%t_invalid.o

- # RUN: llvm-readobj --exception-section %t_invalid.o 2>&1 |\

+ # RUN: llvm-readobj --exception-section %t_invalid.o 2>&1 |\

# RUN: FileCheck -DFILE=%t_invalid.o %s

jhenderson:

jhendersonUnsubmitted

Done

Not addressed, and now duplicated in the invalid_sym case too.

jhenderson: Not addressed, and now duplicated in the invalid_sym case too.

--- !XCOFF

FileHeader:

MagicNumber: 0x1DF

Sections:

- Name: .text

Flags: [ STYP_TEXT ]

- Name: .except

Size: 1000

Flags: [ STYP_EXCEPT ]

SectionData: "000000000000"

Symbols:

- Name: .bar

Section: .text

--- !XCOFF

FileHeader:

jhendersonUnsubmitted

Done

Do you need any symbols at all for this test case? If so, would 1 suffice?

jhenderson: Do you need any symbols at all for this test case? If so, would 1 suffice?

DiggerLinAuthorUnsubmitted

Done

yes,agree.

DiggerLin: yes,agree.

MagicNumber: 0x1F7

Sections:

- Name: .text

Flags: [ STYP_TEXT ]

- Name: .except

Flags: [ STYP_EXCEPT ]

SectionData: "00000004000000000000"

Symbols:

- Name: .bar

Section: .text

jhendersonUnsubmitted

Done

Do you really need this much data for this test case? A single entry would surely be sufficient, and you wouldn't even need any symbols at all.

jhenderson: Do you really need this much data for this test case? A single entry would surely be sufficient…

DiggerLinAuthorUnsubmitted

Done

agree, thanks

DiggerLin: agree, thanks

# CHECK-WARN-SIZE: warning: '[[FILE]]': The end of the file was unexpectedly encountered: expect section with offset 0x64 and size 0x3e8 goes past the end of the file

# CHECK-WARN-SYM: warning: '[[FILE]]': symbol index 4 exceeds symbol count 1

llvm/tools/llvm-readobj/ObjDumper.h

Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	public:
mergeCodeViewTypes(llvm::codeview::MergingTypeTableBuilder &CVIDs,		mergeCodeViewTypes(llvm::codeview::MergingTypeTableBuilder &CVIDs,
llvm::codeview::MergingTypeTableBuilder &CVTypes,		llvm::codeview::MergingTypeTableBuilder &CVTypes,
llvm::codeview::GlobalTypeTableBuilder &GlobalCVIDs,		llvm::codeview::GlobalTypeTableBuilder &GlobalCVIDs,
llvm::codeview::GlobalTypeTableBuilder &GlobalCVTypes,		llvm::codeview::GlobalTypeTableBuilder &GlobalCVTypes,
bool GHash) {}		bool GHash) {}

// Only implement for XCOFF		// Only implement for XCOFF
virtual void printAuxiliaryHeader() {}		virtual void printAuxiliaryHeader() {}
		virtual void printExceptionSection() {}

// Only implemented for MachO.		// Only implemented for MachO.
virtual void printMachODataInCode() { }		virtual void printMachODataInCode() { }
virtual void printMachOVersionMin() { }		virtual void printMachOVersionMin() { }
virtual void printMachODysymtab() { }		virtual void printMachODysymtab() { }
virtual void printMachOSegment() { }		virtual void printMachOSegment() { }
virtual void printMachOIndirectSymbols() { }		virtual void printMachOIndirectSymbols() { }
virtual void printMachOLinkerOptions() { }		virtual void printMachOLinkerOptions() { }
▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

llvm/tools/llvm-readobj/Opts.td

	Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines
	def coff_imports : FF<"coff-imports", "Display import table">, Group<grp_coff>;			def coff_imports : FF<"coff-imports", "Display import table">, Group<grp_coff>;
	def coff_load_config : FF<"coff-load-config", "Display load config">, Group<grp_coff>;			def coff_load_config : FF<"coff-load-config", "Display load config">, Group<grp_coff>;
	def coff_resources : FF<"coff-resources", "Display .rsrc section">, Group<grp_coff>;			def coff_resources : FF<"coff-resources", "Display .rsrc section">, Group<grp_coff>;
	def coff_tls_directory : FF<"coff-tls-directory", "Display TLS directory">, Group<grp_coff>;			def coff_tls_directory : FF<"coff-tls-directory", "Display TLS directory">, Group<grp_coff>;

	// XCOFF specific options.			// XCOFF specific options.
	def grp_xcoff : OptionGroup<"kind">, HelpText<"OPTIONS (XCOFF specific)">;			def grp_xcoff : OptionGroup<"kind">, HelpText<"OPTIONS (XCOFF specific)">;
	def auxiliary_header : FF<"auxiliary-header" , "Display the auxiliary header">, Group<grp_xcoff>;			def auxiliary_header : FF<"auxiliary-header" , "Display the auxiliary header">, Group<grp_xcoff>;
				def exception_section : FF<"exception-section" , "Display the exception section entries">, Group<grp_xcoff>;

	def help : FF<"help", "Display this help">;			def help : FF<"help", "Display this help">;
	def version : FF<"version", "Display the version">;			def version : FF<"version", "Display the version">;

	// Ignored for GNU readelf compatibility.			// Ignored for GNU readelf compatibility.
	def wide : FF<"wide", "Ignored for GNU readelf compatibility">;			def wide : FF<"wide", "Ignored for GNU readelf compatibility">;
	def : F<"W", "Ignored for GNU readelf compatibility">, Alias<wide>;			def : F<"W", "Ignored for GNU readelf compatibility">, Alias<wide>;

	Show All 36 Lines

llvm/tools/llvm-readobj/XCOFFDumper.cpp

Show All 33 Lines	public:
void printSectionHeaders() override;		void printSectionHeaders() override;
void printRelocations() override;		void printRelocations() override;
void printSymbols() override;		void printSymbols() override;
void printDynamicSymbols() override;		void printDynamicSymbols() override;
void printUnwindInfo() override;		void printUnwindInfo() override;
void printStackMap() const override;		void printStackMap() const override;
void printNeededLibraries() override;		void printNeededLibraries() override;
void printStringTable() override;		void printStringTable() override;
		void printExceptionSection() override;

ScopedPrinter &getScopedPrinter() const { return W; }		ScopedPrinter &getScopedPrinter() const { return W; }

private:		private:
template <typename T> void printSectionHeaders(ArrayRef<T> Sections);		template <typename T> void printSectionHeaders(ArrayRef<T> Sections);
template <typename T> void printGenericSectionHeader(T &Sec) const;		template <typename T> void printGenericSectionHeader(T &Sec) const;
template <typename T> void printOverflowSectionHeader(T &Sec) const;		template <typename T> void printOverflowSectionHeader(T &Sec) const;
		template <typename T>
		void printExceptionSectionEntry(const T &ExceptionSectEnt) const;
		template <typename T> void printExceptionSectionEntries() const;
template <typename T> const T *getAuxEntPtr(uintptr_t AuxAddress);		template <typename T> const T *getAuxEntPtr(uintptr_t AuxAddress);
void printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr);		void printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr);
void printCsectAuxEnt(XCOFFCsectAuxRef AuxEntRef);		void printCsectAuxEnt(XCOFFCsectAuxRef AuxEntRef);
void printSectAuxEntForStat(const XCOFFSectAuxEntForStat *AuxEntPtr);		void printSectAuxEntForStat(const XCOFFSectAuxEntForStat *AuxEntPtr);
void printExceptionAuxEnt(const XCOFFExceptionAuxEnt *AuxEntPtr);		void printExceptionAuxEnt(const XCOFFExceptionAuxEnt *AuxEntPtr);
void printFunctionAuxEnt(const XCOFFFunctionAuxEnt32 *AuxEntPtr);		void printFunctionAuxEnt(const XCOFFFunctionAuxEnt32 *AuxEntPtr);
void printFunctionAuxEnt(const XCOFFFunctionAuxEnt64 *AuxEntPtr);		void printFunctionAuxEnt(const XCOFFFunctionAuxEnt64 *AuxEntPtr);
void printBlockAuxEnt(const XCOFFBlockAuxEnt32 *AuxEntPtr);		void printBlockAuxEnt(const XCOFFBlockAuxEnt32 *AuxEntPtr);
▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines

void XCOFFDumper::printSectionHeaders() {		void XCOFFDumper::printSectionHeaders() {
if (Obj.is64Bit())		if (Obj.is64Bit())
printSectionHeaders(Obj.sections64());		printSectionHeaders(Obj.sections64());
else		else
printSectionHeaders(Obj.sections32());		printSectionHeaders(Obj.sections32());
}		}

		template <typename T>
		void XCOFFDumper::printExceptionSectionEntry(const T &ExceptionSectEnt) const {
		if (ExceptionSectEnt.getReason())
		W.printHex("Trap Instr Addr", ExceptionSectEnt.getTrapInstAddr());
		else {
		uint32_t SymIdx = ExceptionSectEnt.getSymbolIndex();
		Expected<StringRef> ErrOrSymbolName = Obj.getSymbolNameByIndex(SymIdx);
		if (Error E = ErrOrSymbolName.takeError()) {
		jhendersonUnsubmitted Done Reply Inline Actions `unwrapOrError` should be considered deprecated in llvm-readobj, as it stops llvm-readobj from continuing, which is not useful for dumping tools, especially given that the error in this case won't prevent other files or sections from being dumped. Instead, prefer reporting problems as warnings (via `reportUniqueWarning` or equivalent). See the ELF dumper for good examples. jhenderson: `unwrapOrError` should be considered deprecated in llvm-readobj, as it stops llvm-readobj from…
		reportUniqueWarning(std::move(E));
		jhendersonUnsubmitted Done Reply Inline Actions I don't think you've tested this warning? jhenderson: I don't think you've tested this warning?
		return;
		pscoroUnsubmitted Done Reply Inline Actions On the example I am running this just prints "Symbol Index: )", although I checked the individual SymIdx and SymName variables and they have the right values. pscoro: On the example I am running this just prints "Symbol Index: )", although I checked the…
		}
		StringRef SymName = *ErrOrSymbolName;

		W.printNumber("Symbol", SymName, SymIdx);
		}
		W.printNumber("LangID", ExceptionSectEnt.getLangID());
		W.printNumber("Reason", ExceptionSectEnt.getReason());
		jhendersonUnsubmitted Done Reply Inline Actions Ditto: don't use `unwrapOrError`. jhenderson: Ditto: don't use `unwrapOrError`.
		}

		template <typename T> void XCOFFDumper::printExceptionSectionEntries() const {
		Expected<ArrayRef<T>> ExceptSectEntsOrErr = Obj.getExceptionEntries<T>();
		if (Error E = ExceptSectEntsOrErr.takeError()) {
		reportUniqueWarning(std::move(E));
		return;
		}
		ArrayRef<T> ExceptSectEnts = *ExceptSectEntsOrErr;

		DictScope DS(W, "Exception section");
		if (ExceptSectEnts.empty())
		return;
		for (auto &Ent : ExceptSectEnts)
		printExceptionSectionEntry(Ent);
		}

		void XCOFFDumper::printExceptionSection() {
		if (Obj.is64Bit())
		printExceptionSectionEntries<ExceptionSectionEntry64>();
		else
		printExceptionSectionEntries<ExceptionSectionEntry32>();
		}

void XCOFFDumper::printRelocations() {		void XCOFFDumper::printRelocations() {
if (Obj.is64Bit())		if (Obj.is64Bit())
printRelocations<XCOFFSectionHeader64, XCOFFRelocation64>(Obj.sections64());		printRelocations<XCOFFSectionHeader64, XCOFFRelocation64>(Obj.sections64());
else		else
printRelocations<XCOFFSectionHeader32, XCOFFRelocation32>(Obj.sections32());		printRelocations<XCOFFSectionHeader32, XCOFFRelocation32>(Obj.sections32());
}		}

const EnumEntry<XCOFF::RelocationType> RelocationTypeNameclass[] = {		const EnumEntry<XCOFF::RelocationType> RelocationTypeNameclass[] = {
▲ Show 20 Lines • Show All 803 Lines • Show Last 20 Lines

llvm/tools/llvm-readobj/llvm-readobj.cpp

Show First 20 Lines • Show All 156 Lines • ▼ Show 20 Lines
static bool COFFExports;		static bool COFFExports;
static bool COFFImports;		static bool COFFImports;
static bool COFFLoadConfig;		static bool COFFLoadConfig;
static bool COFFResources;		static bool COFFResources;
static bool COFFTLSDirectory;		static bool COFFTLSDirectory;

// XCOFF specific options.		// XCOFF specific options.
static bool XCOFFAuxiliaryHeader;		static bool XCOFFAuxiliaryHeader;
		static bool XCOFFExceptionSection;

OutputStyleTy Output = OutputStyleTy::LLVM;		OutputStyleTy Output = OutputStyleTy::LLVM;
static std::vector<std::string> InputFilenames;		static std::vector<std::string> InputFilenames;
} // namespace opts		} // namespace opts

static StringRef ToolName;		static StringRef ToolName;

namespace llvm {		namespace llvm {
▲ Show 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	static void parseOptions(const opt::InputArgList &Args) {
opts::COFFExports = Args.hasArg(OPT_coff_exports);		opts::COFFExports = Args.hasArg(OPT_coff_exports);
opts::COFFImports = Args.hasArg(OPT_coff_imports);		opts::COFFImports = Args.hasArg(OPT_coff_imports);
opts::COFFLoadConfig = Args.hasArg(OPT_coff_load_config);		opts::COFFLoadConfig = Args.hasArg(OPT_coff_load_config);
opts::COFFResources = Args.hasArg(OPT_coff_resources);		opts::COFFResources = Args.hasArg(OPT_coff_resources);
opts::COFFTLSDirectory = Args.hasArg(OPT_coff_tls_directory);		opts::COFFTLSDirectory = Args.hasArg(OPT_coff_tls_directory);

// XCOFF specific options.		// XCOFF specific options.
opts::XCOFFAuxiliaryHeader = Args.hasArg(OPT_auxiliary_header);		opts::XCOFFAuxiliaryHeader = Args.hasArg(OPT_auxiliary_header);
		opts::XCOFFExceptionSection = Args.hasArg(OPT_exception_section);

opts::InputFilenames = Args.getAllArgValues(OPT_INPUT);		opts::InputFilenames = Args.getAllArgValues(OPT_INPUT);
}		}

namespace {		namespace {
struct ReadObjTypeTableBuilder {		struct ReadObjTypeTableBuilder {
ReadObjTypeTableBuilder()		ReadObjTypeTableBuilder()
: IDTable(Allocator), TypeTable(Allocator), GlobalIDTable(Allocator),		: IDTable(Allocator), TypeTable(Allocator), GlobalIDTable(Allocator),
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	if (Dumper->canCompareSymbols()) {
FileStr);		FileStr);
}		}
}		}
Dumper->printFileSummary(FileStr, Obj, opts::InputFilenames, A);		Dumper->printFileSummary(FileStr, Obj, opts::InputFilenames, A);

if (opts::FileHeaders)		if (opts::FileHeaders)
Dumper->printFileHeaders();		Dumper->printFileHeaders();

		// Auxiliary header in XOCFF is right after the file header, so print the data
		// here.
if (Obj.isXCOFF() && opts::XCOFFAuxiliaryHeader)		if (Obj.isXCOFF() && opts::XCOFFAuxiliaryHeader)
Dumper->printAuxiliaryHeader();		Dumper->printAuxiliaryHeader();

// This is only used for ELF currently. In some cases, when an object is		// This is only used for ELF currently. In some cases, when an object is
// corrupt (e.g. truncated), we can't dump anything except the file header.		// corrupt (e.g. truncated), we can't dump anything except the file header.
if (!ContentErrString.empty())		if (!ContentErrString.empty())
reportError(createError(ContentErrString), FileStr);		reportError(createError(ContentErrString), FileStr);

▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	if (opts::MachOSegment)
Dumper->printMachOSegment();		Dumper->printMachOSegment();
if (opts::MachOVersionMin)		if (opts::MachOVersionMin)
Dumper->printMachOVersionMin();		Dumper->printMachOVersionMin();
if (opts::MachODysymtab)		if (opts::MachODysymtab)
Dumper->printMachODysymtab();		Dumper->printMachODysymtab();
if (opts::CGProfile)		if (opts::CGProfile)
Dumper->printCGProfile();		Dumper->printCGProfile();
}		}

		if (Obj.isXCOFF() && opts::XCOFFExceptionSection)
		Dumper->printExceptionSection();
		jhendersonUnsubmitted Done Reply Inline Actions Is there a reasson you're not putting this up with the other XCOFF-specific dumping option? jhenderson: Is there a reasson you're not putting this up with the other XCOFF-specific dumping option?
		DiggerLinAuthorUnsubmitted Done Reply Inline Actions If there are several options in the command line at the same time. I want to keep the output as order of content xcoff object file as much as possible. If you do not agree with this, I can put the all the XCOFF-specific dumping option together. DiggerLin: If there are several options in the command line at the same time. I want to keep the output…
		jhendersonUnsubmitted Done Reply Inline Actions Okay, your explanation makes sense, thanks. Please put it in comments somewhere in the file, e.g. "this data appears early in XCOFF files so display it first" etc. jhenderson: Okay, your explanation makes sense, thanks. Please put it in comments somewhere in the file, e.

if (opts::PrintStackMap)		if (opts::PrintStackMap)
Dumper->printStackMap();		Dumper->printStackMap();
if (opts::PrintStackSizes)		if (opts::PrintStackSizes)
Dumper->printStackSizes();		Dumper->printStackSizes();
}		}

/// Dumps each object file in \a Arc;		/// Dumps each object file in \a Arc;
static void dumpArchive(const Archive *Arc, ScopedPrinter &Writer) {		static void dumpArchive(const Archive *Arc, ScopedPrinter &Writer) {
▲ Show 20 Lines • Show All 171 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[AIX] llvm-readobj support a new option --exception-section for xcoff object file.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 461208

llvm/docs/CommandGuide/llvm-readobj.rst

llvm/include/llvm/Object/XCOFFObjectFile.h

llvm/lib/Object/XCOFFObjectFile.cpp

llvm/test/tools/llvm-readobj/XCOFF/exception-section.test

llvm/test/tools/llvm-readobj/XCOFF/invalid-exception-section.test

llvm/tools/llvm-readobj/ObjDumper.h

llvm/tools/llvm-readobj/Opts.td

llvm/tools/llvm-readobj/XCOFFDumper.cpp

llvm/tools/llvm-readobj/llvm-readobj.cpp

[AIX] llvm-readobj support a new option --exception-section for xcoff object file.
ClosedPublic