This is an archive of the discontinued LLVM Phabricator instance.

Paths

llvm/include/llvm/BinaryFormat/XCOFF.h
llvm/include/llvm/Object/XCOFFObjectFile.h
llvm/lib/Object/XCOFFObjectFile.cpp
llvm/test/tools/llvm-objdump/XCOFF/Inputs/xcoff-section-headers64.o
llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test
llvm/test/tools/llvm-readobj/XCOFF/Inputs/file-aux-wrong64.o
llvm/test/tools/llvm-readobj/XCOFF/Inputs/symbol64.o
llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test
llvm/test/tools/llvm-readobj/XCOFF/symbols64.test
llvm/tools/llvm-objdump/XCOFFDump.cpp
llvm/tools/llvm-readobj/XCOFFDumper.cpp
llvm/tools/obj2yaml/xcoff2yaml.cpp
llvm/unittests/Object/XCOFFObjectFileTest.cpp

Differential D85774

[XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing
ClosedPublic

Authored by jasonliu on Aug 11 2020, 1:00 PM.


Details

Reviewers

DiggerLin
daltenty
hubert.reinterpretcast
jhenderson
Xiangling_L
MaskRay

Group Reviewers

Restricted Project

Commits

rG8e84311a84b3: [XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing

Summary

Add the ability to parse the symbol table of 64-bit objects.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline


DiggerLin added inline comments.Aug 13 2020, 1:21 PM

llvm/lib/Object/XCOFFObjectFile.cpp
411	Can we add a new member function such as getNumberOfSymbolTableEntries() { return is64Bit() ? getNumberOfSymbolTableEntries64() : getLogicalNumberOfSymbolTableEntries32(); }? The function could also be used in XCOFFObjectFile::create() and getSymbolNameByIndex().

jasonliu added inline comments.Aug 13 2020, 1:31 PM

llvm/lib/Object/XCOFFObjectFile.cpp
205	Please see my other comments regarding combining the 32bit and 64 bit version into 1 function.
209	Please see my other comments regarding combining the 32bit and 64 bit version into 1 function.
411	Regarding all the comments asking whether we could combine the 32-bit and 64-bit versions into one function: I don't think it's a good idea, because people would overlook the fact that the two versions return different types underneath.
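
For illustration, the combined getter proposed above could look roughly like the sketch below. The header structs, their field types, and the clamping of a negative 32-bit count to zero are assumptions made for this example; this is not the actual XCOFFObjectFile code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified file headers. The field types are assumptions
// chosen to show that the two formats may store the count differently.
struct FileHeader32 {
  int32_t NumberOfSymTableEntries;
};
struct FileHeader64 {
  uint32_t NumberOfSymTableEntries;
};

// One entry point for both formats. Exactly one pointer is expected to be
// non-null, so the caller no longer needs its own is64Bit() check.
uint32_t getNumberOfSymbolTableEntries(const FileHeader32 *H32,
                                       const FileHeader64 *H64) {
  assert((H32 != nullptr) != (H64 != nullptr) && "need exactly one header");
  if (H64)
    return H64->NumberOfSymTableEntries;
  // Assumption: a negative raw 32-bit count is treated as "no entries".
  return H32->NumberOfSymTableEntries < 0
             ? 0u
             : static_cast<uint32_t>(H32->NumberOfSymTableEntries);
}
```

The single return type is convenient for callers, but it also hides which width the count came from, which is the concern raised in the reply above.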

In D85774#2216424, @jasonliu wrote:
I just wonder whether we can implement two separate structures, XCOFFSymbolEntry32 and XCOFFSymbolEntry64, without as many unions as the current implementation uses.
Agreed that the current implementation has many unions, and it's hard for people to work out what exactly is inside the structure.
But separating them into two structures, namely XCOFFSymbolEntry32 and XCOFFSymbolEntry64, would mean a lot more if (Obj->is64Bit()) checks across all tooling, which sacrifices a lot of usability. A lot more logic would look duplicated.
One potential solution I thought about is to introduce a getter for every data member and mark the data members private, so that most of the time users of the structure do not need to look inside it to figure out how to retrieve certain data. The downside is that we would introduce a lot of getters, and I'm not sure it would be worth the effort.

It seems to me like this should be using inheritance here. You have a base class that has the common members, and provides pure virtual declarations of the various getters, with the sub-classes defining them to do the right thing. Yes, it would introduce a number of getters, but I feel like it would make everything a bit cleaner from a usability standpoint. In most cases, you then don't need any is64Bit queries, because the getters hide that from you.
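
A minimal sketch of the inheritance approach described above, assuming heavily simplified raw entries. All struct, class, and function names here are hypothetical and chosen for illustration; the real on-disk layouts have more fields and use big-endian storage types.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified raw entries; the real on-disk layouts differ.
struct RawSymbolEntry32 {
  uint32_t Value;
  int16_t SectionNumber;
};
struct RawSymbolEntry64 {
  uint64_t Value;
  int16_t SectionNumber;
};

// Common interface: getters return the larger of the two underlying types.
class SymbolEntryBase {
public:
  virtual ~SymbolEntryBase() = default;
  virtual uint64_t getValue() const = 0;
  virtual int16_t getSectionNumber() const = 0;
};

class SymbolEntry32 final : public SymbolEntryBase {
  const RawSymbolEntry32 &Raw;

public:
  explicit SymbolEntry32(const RawSymbolEntry32 &R) : Raw(R) {}
  uint64_t getValue() const override { return Raw.Value; }
  int16_t getSectionNumber() const override { return Raw.SectionNumber; }
};

class SymbolEntry64 final : public SymbolEntryBase {
  const RawSymbolEntry64 &Raw;

public:
  explicit SymbolEntry64(const RawSymbolEntry64 &R) : Raw(R) {}
  uint64_t getValue() const override { return Raw.Value; }
  int16_t getSectionNumber() const override { return Raw.SectionNumber; }
};

// A consumer written against the base class needs no is64Bit() query.
// Illustrative predicate only: positive section number and matching value.
bool isDefinedAt(const SymbolEntryBase &Sym, uint64_t Address) {
  return Sym.getSectionNumber() > 0 && Sym.getValue() == Address;
}
```

The cost is that callers must hold the entry through a reference or pointer to the base class, so something has to own the concrete object.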

On a testing note, there are several places in the new code which detect some kind of error. You need testing for these code paths too.

llvm/lib/Object/XCOFFObjectFile.cpp
411	From my experience working with tools that had to support 32-bit and 64-bit ELF, you don't worry about the underlying type in most cases and always use the larger type. The same probably applies here. Of course, it becomes a bit moot if you add a common getter interface as suggested out-of-line, because those getters will have to return the larger of the two return types anyway. Is there a strong reason to not use the larger type everywhere?
756	Related to my comments elsewhere - it looks to me like most consumers will need to handle both 32 and 64-bit versions, so they'll always have to do this dance. Thus your concern about how the caller uses them is misplaced - the caller is more likely to do the wrong thing i.e. call the wrong version than have problems with the return types.
792	Better than `errorCodeToError(/some error code/)` is to use `createStringError()` or `createFileError()` to provide more context to the failure (how did the parsing fail? where? etc).
807	I don't think you want to use `int` here. There's always going to be a positive number of entries, and there are no subtractions etc. involving `Index` here. Better would be an unsigned type of some form (presumably the return type of `getNumberOfAuxEntries()`).
816	Same comment as before - use `createStringError()` or `createFileError()`.
llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test
14–15	I'm not going to stop you checking in a pre-compiled object, as I'm not an XCOFF maintainer, but as you are continuing to add more functionality here, I strongly advise you to write a yaml2obj XCOFF port, to avoid pre-canned binaries. You'll find pre-built binaries extremely inconvenient to work with as you maintain things going forward. Not only that, but they are harmful to the git repository size, especially if you have to occasionally rebuild them. Using yaml2obj may also be about the only way you can test most parse failure paths. If yaml2obj isn't viable, at least consider llvm-mc or similar, if possible.
llvm/tools/llvm-readobj/XCOFFDumper.cpp
412–417	@grimar has gone to a lot of effort to get rid of `unwrapOrError` from the ELF dumping code. I'd prefer it if we could avoid using it here too. It is generally better in dumping tools to report a warning and abort dumping the current section than to emit an error and terminate the program, since it gives the user more of the information they've asked for.

hubert.reinterpretcast added inline comments.Aug 14 2020, 9:12 AM

llvm/lib/Object/XCOFFObjectFile.cpp
411	Is there a strong reason to not use the larger type everywhere? I don't know what strength this reason has, but we had noticed that some of the tools do not reflect the width of the 32-bit format fields very well (even for relatively uninterpreted output). Where the producer of the binary is under development, developers are better served if the tools emit the correct width for fields in the format.

jasonliu added inline comments.Aug 14 2020, 12:38 PM

llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test
14–15	I agree that we would want to move away from pre-canned binaries at some point. When writing a yaml2obj port, we would still require tools such as llvm-readobj and llvm-objdump to make sure our yaml2obj implementation is correct. So we still have a chicken-or-egg problem here. I think the current plan is to use pre-canned binary to develop the tooling support first. Then use the verified tooling support to verify XCOFF object file generation from llc. Then we could replace the pre-canned binary with llvm-mc/llc.

grimar added inline comments.Aug 17 2020, 3:20 AM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
412–417	Yeah. Having `unwrapOrError` available is my concern. I am trying to clean up the ELF dumper, but other files (e.g. COFF) are still using it, though ideally I'd just remove this API from the llvm-readobj code; it seems to do more harm than good in the long term. At least I'd be happy if people stopped adding more calls to the code.

DiggerLin added inline comments.Aug 17 2020, 6:02 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	Must the name be consistent with the AIX OS header syms.h? What about changing it to AUX_EXCEPT? That is consistent with our current style.

jasonliu added inline comments.Aug 17 2020, 6:31 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	The current style in this file seems to be (correct me if I'm wrong): use a descriptive name if it's not directly taken from the OS header. Otherwise, take the name directly from the header and add a detailed description/comment to it. In this case, it follows the latter. Other enum members do not have '_' because the OS version does not have it either.

jasonliu added inline comments.Aug 17 2020, 6:33 AM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
412–417	Thanks. Agreed. Will avoid using `unwrapOrError` in future code.

hubert.reinterpretcast added inline comments.Aug 17 2020, 7:51 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	The issue here is that the name from the OS header is a reserved name. So by reason of not wanting undefined behaviour, we cannot use the name taken from the OS header. Unfortunately, that means we cannot be consistent in terms of using the name taken from the OS header without switching all the enumerator names to be descriptive and in the LLVM style.

jasonliu added inline comments.Aug 17 2020, 7:58 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	hmm... Is the namespace `XCOFF` not enough to prevent the undefined behavior happening? Or are we afraid of people just use `using namespace XCOFF` to defeat it?

hubert.reinterpretcast added inline comments.Aug 17 2020, 8:04 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	The practical cause of undefined behaviour in such a case is usually that the instance of the identifier here is misparsed, or otherwise has surprising behaviour, either because it is defined as a macro or is an extension keyword. Whether the name turns out to be in scope elsewhere is not a factor for such mechanisms.

jasonliu added inline comments.Aug 17 2020, 8:12 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	Got it. I will switch the style of SymbolAuxType in the next revision. We will need to come up with a plan to switch the rest of the classes in this file (There are a lot).

jhenderson added inline comments.Aug 18 2020, 2:37 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
300	+1 to dropping the underscore. I think it's okay for that to be the only change, but have no strong opinion either way, so happy with whichever you prefer.
305	Not that it really matters, but it's more traditional to order enums in ascending numerical order. Any particular reason you've done this in the reverse order?
llvm/lib/Object/XCOFFObjectFile.cpp
411	I think in the context of printing the appropriately formatted output, you'd want to switch on the source type (i.e. `is64Bit` or whatever), at the formatting time. Certainly, this is how we've done it in our own internal code bases I work on, and there are examples of this in a number of other LLVM utilities. For example in https://github.com/llvm/llvm-project/blob/master/llvm/lib/DebugInfo/DWARF/DWARFCompileUnit.cpp#L17, the `dump` function dumps the offset with a width according to the DWARF format (i.e. 32 or 64 bit), but the `getLength` function returns a 64-bit value always. Similarly https://github.com/llvm/llvm-project/blob/master/llvm/tools/llvm-objdump/ELFDump.cpp#L257 identifies the ELF format kind and uses that in determining the width of offset, size and address fields (which are stored as uint64_t) when printing ELF program header tables. There are certainly plenty of places where this hasn't been done. Sometimes this is a mistake, other times it's for consistency with GNU output, but I think the preferred approach is the "store large, explicitly specify format on output" approach.
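
As a rough sketch of the "store large, explicitly specify format on output" approach described above: the value is always held as a uint64_t, and the field width is chosen only at print time. The helper name `formatAddress` and the `0x` prefix are assumptions for this example, not taken from any LLVM tool.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper: the value is stored in the larger type regardless of
// the object format; only the printed width depends on the format.
std::string formatAddress(uint64_t Value, bool Is64Bit) {
  char Buffer[32];
  // 8 hex digits for a 32-bit object, 16 for a 64-bit one.
  std::snprintf(Buffer, sizeof(Buffer), "0x%0*llx", Is64Bit ? 16 : 8,
                static_cast<unsigned long long>(Value));
  return Buffer;
}
```

With this shape, `formatAddress(0x1234, false)` yields `0x00001234` and `formatAddress(0x1234, true)` yields `0x0000000000001234`, so the output still reflects the width of the fields in the format.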

jhenderson added inline comments.Aug 18 2020, 3:12 AM

llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test
14–15	Yeah, chicken-or-egg problem is a bit of an issue. I'm not sure there's always a clear answer to this. The one I've encouraged for yaml2obj DWARF support testing is to actually inspect the hex output (with sufficient additional commenting to make it clear what the output represents). By keeping the initial functionality small enough, you can bootstrap up from there. The issue is that a lot of our low-level tool testing (i.e. testing of things like llvm-readobj) has switched over to yaml2obj, but clearly we can't (in theory) then use llvm-readobj to test the basic output of yaml2obj or we end up with a circular test dependency - a bug in a common library might not obviously manifest itself in this context, but would if using a tool from outside the ecosystem. Another strategy which I've used occasionally for testing DWARF parsing before the yaml2obj support existed was writing assembly using just .byte/.quad etc directives to craft the input format precisely, without relying on the higher-level assembly directives (like .file/.loc etc). This may not work in all situations though.

jasonliu added inline comments.Aug 18 2020, 10:00 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
305	I sort of "copied" the list from the OS header, and that's just the order it appeared in there. I don't think it's particularly important to keep the current order, so I could change it to ascending order.

jhenderson added inline comments.Aug 19 2020, 12:10 AM

llvm/include/llvm/BinaryFormat/XCOFF.h
305	I haven't got any particular preference, so am happy to defer to whatever you prefer on this one.

Addresses comments.
Add in test case to test errors.
Use view/reference class to encapsulate 32-bit and 64-bit differences instead.

In D85774#2217649, @jhenderson wrote:

It seems to me like this should be using inheritance here. You have a base class that has the common members, and provides pure virtual declarations of the various getters, with the sub-classes defining them to do the right thing. Yes, it would introduce a number of getters, but I feel like it would make everything a bit cleaner from a usability standpoint. In most cases, you then don't need any is64Bit queries, because the getters hide that from you.

@jhenderson I tried to use inheritance as suggested. But inheritance would mean I need to use pointers to enable runtime polymorphism. Then there is a lifetime issue that needs to be managed when using pointers. The easiest way to handle that is to return via unique_ptr. But using unique_ptr introduces a usability issue on the caller/user side, as we would see std::move and SymbolRef.get() before getting to the query we want. Also, the new/delete underneath the unique_ptr is not very efficient.
In the end, I tried to solve this in a similar manner to COFF, which uses a view class without inheritance. Although the downside is that we basically have an if query in every call to the view class to differentiate which version (32/64) we have, the good thing is that the caller side is much cleaner.
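
The view-class idea described above can be sketched as follows. This is a simplified illustration under stated assumptions: `SymbolView` and the raw entry structs are hypothetical names, and the fields are reduced to two.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified raw entries; the real on-disk layouts differ.
struct RawSymbolEntry32 {
  uint32_t Value;
  int16_t SectionNumber;
};
struct RawSymbolEntry64 {
  uint64_t Value;
  int16_t SectionNumber;
};

// Cheap-to-copy view over one entry. Exactly one pointer is non-null.
class SymbolView {
  const RawSymbolEntry32 *Entry32 = nullptr;
  const RawSymbolEntry64 *Entry64 = nullptr;

public:
  explicit SymbolView(const RawSymbolEntry32 *E) : Entry32(E) {
    assert(E && "symbol table entry pointer must not be null");
  }
  explicit SymbolView(const RawSymbolEntry64 *E) : Entry64(E) {
    assert(E && "symbol table entry pointer must not be null");
  }

  bool is64Bit() const { return Entry64 != nullptr; }

  // The width check lives here, once per getter, instead of at every caller.
  uint64_t getValue() const {
    return Entry32 ? static_cast<uint64_t>(Entry32->Value) : Entry64->Value;
  }
  int16_t getSectionNumber() const {
    return Entry32 ? Entry32->SectionNumber : Entry64->SectionNumber;
  }
};
```

Because the view holds only two pointers, it can be returned and passed by value, which avoids the unique_ptr ownership questions at the price of a branch in each getter.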

DiggerLin added inline comments.Sep 10 2020, 12:56 PM

llvm/include/llvm/Object/XCOFFObjectFile.h
158	change to return Entry32 ? Entry32->ParameterHashIndex : Entry64->ParameterHashIndex and change in the following functions too ?
176	change to return reinterpret_cast<uintptr_t>(Entry32 ? Entry32 : Entry64)
400	this means for getSymbolEntryAddressByIndex(uint32_t SymbolTableIndex) const ?
484	assert(OwningObjectPtr != nullptr) here ?
495–528	maybe we can use a macro here. #define GETVALUE(X) Entry32 ? Entry32->X : Entry64 ->X int16_t getSectionNumber() const { return GETVALUE(SectionNumber); } uint16_t getSymbolType() const { return GETVALUE(SymbolType); } and so on
535	getAddress() may confuse with getting the address of the symbol. maybe good to rename to getEntryAddress() ?
llvm/lib/Object/XCOFFObjectFile.cpp
600	several place use above NumberOfSymTableEntries , maybe good to provide a helper function.
757	change to const uint64_t SymbolTableSize ?
823–825	Not every symbol has a csect entry. What about returning Optional<XCOFFCsectAuxRef> from XCOFFSymbolRef::getXCOFFCsectAuxRef()?
827	I think assert(isCsectSymbol()) may be better. Sometimes a developer may call getXCOFFCsectAuxRef() on a non-csect symbol; that is not an object file parse failure.
llvm/tools/llvm-objdump/XCOFFDump.cpp
49	I can not see benefit to change from XCOFFSymbolRef SymRef(Sym.getRawDataRefImpl(), Obj); to XCOFFSymbolRef SymRef = Obj->toSymbolRef(Sym.getRawDataRefImpl());

jasonliu added inline comments.Sep 10 2020, 1:48 PM

llvm/include/llvm/Object/XCOFFObjectFile.h
158	I was initially worried about MSVC breakage here: https://github.com/llvm/llvm-project/commit/210314ae8c59bc0a8801c2528eda892cd5960c31 But after taking a closer look, it seems to be only a problem for conversion from a ubig32_t value to a uint64_t, which does not apply here. So I will give it a try.

jasonliu added inline comments.Sep 10 2020, 5:27 PM

llvm/lib/Object/XCOFFObjectFile.cpp
823–825	I returned Expected<XCOFFCsectAuxRef> partly because of the original comment on this function: TODO: The function needs to return an error if there is no csect auxiliary entry. I believe that's a change from your previous commit. Is there any reason that you changed your mind? In 32 bit mode, you could have a csect symbol, but without any auxiliary entry. That should return an error (which I haven't detected here, but I should). Also, in 64 bit mode, it's possible that you have a csect symbol that has auxiliary entries, but do not have a csect auxiliary entry, that should be an error situation right? So it also makes sense to return error in that case. I think the interface would be too complicated if we return an Expected wrap with an Optional, or the other way around.
827	Sure.
llvm/tools/llvm-objdump/XCOFFDump.cpp
49	I did it for consistency reason, i.e: always get XCOFFSymbolRef via toSymbolRef.

jasonliu added inline comments.Sep 10 2020, 5:39 PM

llvm/include/llvm/Object/XCOFFObjectFile.h
400	Sorry, what's the difference between Symbol and SymbolEntry? I'm also seeing `getSymbolIndex` and `getSymbolNameByIndex` around this function. Any reason they are not "SymbolEntry"?

DiggerLin added inline comments.Sep 11 2020, 6:24 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
400	the value of symbol maybe a symbol relocation address , I was confused getSymbolAddress with getting the relocation address at my first glance of the code, getSymbolEntryAddress, that means we need the SymbolEntry address not relocation address of a symbol.

jasonliu marked 4 inline comments as done.Sep 11 2020, 10:46 AM

jasonliu added inline comments.

llvm/include/llvm/Object/XCOFFObjectFile.h
400	I don't think we have a symbol relocation address IMO. The address related to relocation could be relocation entry's address, or the virtual address data member inside of a relocation entry. But those addresses are not related to symbol in any ways. Also, there is a function from base class which we overrides here called getSymbolAddress, which we don't want to change. So it would make sense to keep the same naming style here.
535	Do you still find it confusing after seeing my other comments about `Symbol` vs `SymbolEntry`? I would think it's fine since we don't really have other addresses we could get in here.

jasonliu added inline comments.Sep 11 2020, 11:23 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
400	hmm... Just realized what you meant. getSymbolAddress actually returns toSymbolRef.getValue(), which is a relocatable address. This function is supposed to return the address of the symbol table entry within the object file.
535	Will change.

Address comments from Digger.

MaskRay added inline comments.Oct 2 2020, 12:05 PM

llvm/include/llvm/Object/XCOFFObjectFile.h
145	You probably don't need these assert. The dereference will crash anyway

hubert.reinterpretcast added inline comments.Oct 2 2020, 12:46 PM

llvm/include/llvm/Object/XCOFFObjectFile.h
145	That's not true. There are some number of addressable bytes containing 0 starting from address 0x0 on AIX.

DiggerLin added inline comments.Mar 18 2021, 11:12 AM

llvm/include/llvm/Object/XCOFFObjectFile.h
158	thanks let me know.
171	what about return reinterpret_cast<uintptr_t>(Entry32 ? Entry32 : Entry64) ?
202	Use GETVALUE(SymbolAlignmentAndType)?
425	Not sure whether we will want Distance to be negative in the future. I think changing it to int32_t Distance would mean that we can move backward.
435	not sure whether we want to define a enum for the LanguageID in this patch. The values for this field are defined in the e_lang field in "Exception Section"
484	"Symbol table pointer can not be nullptr!" --> "Symbol table entry pointer can not be nullptr!"
493	using GETVALUE(Value) for consistent ?
llvm/lib/Object/XCOFFObjectFile.cpp
223	const int16_t SectNum ?
498	const int16_t SectionNum ?

DiggerLin added inline comments.Mar 18 2021, 1:51 PM

llvm/lib/Object/XCOFFObjectFile.cpp
805–812	const int16_t SectNum

DiggerLin added inline comments.Mar 18 2021, 2:09 PM

llvm/lib/Object/XCOFFObjectFile.cpp
839–845	create a static function getSymbolAuxType in a file scope maybe better? All the Aux symbol of 64bit all has the AuxType . we can use the function for other type too in other place later ?

DiggerLin added inline comments.Mar 19 2021, 8:07 AM

llvm/lib/Object/XCOFFObjectFile.cpp
797	if we not enable -ffunction-sections , function entry is label.
839–845	there already has a function on XCOFFCsectAuxRef ::getAuxType64()

Esme mentioned this in D100375: [yaml2obj] Enable support for parsing 64-bit XCOFF..Apr 13 2021, 3:24 AM

Hi Jason, what's your plan about the patch? When will you move forward with it?
Currently D100375 relies on the patch for its 64-bit llvm-readobj. In addition, D97656, D99164 and D98003 should be rebased on it.
Besides, I would like to add support for the line number dump in llvm-objdump if this patch is ready.
It looks like this is a fundamental patch for tools implementation. And it looks good except for some comments that haven't been addressed yet.
Please let me know if I can be of any help. Thanks!

@Esme I will rebase and address comments asap.

Rebase and Address comments.

llvm/include/llvm/Object/XCOFFObjectFile.h
171	No, you could not do that. It only works if Entry32 and Entry64 are the same type. But they are not here.
425	I don't see a need to jump backward now. If it's needed in the future, we could always change in the future patch.
435	I think we are already doing enum mapping in tools/llvm-readobj/XCOFFDumper.cpp. I don't see a strong need to create an enum for it.
493	I would prefer to be more explicit here because we are doing a conversion to larger value for 32 bit version, which is different from the rest of GETVALUE(Value).
llvm/lib/Object/XCOFFObjectFile.cpp
797	Thanks. I brought back the old behavior and added the FIXME to say that this function does not return a correct value if we have -ffunction-sections enabled.
839–845	Yes, there is a XCOFFCsectAuxRef ::getAuxType64(), but if you notice, this function is used to create an XCOFFCsectAuxRef object. So you don't have that function available in the creator. And I don't think a static function is needed, because when you created XCOFFCsectAuxRef object through this function, then you could call the XCOFFCsectAuxRef ::getAuxType64() to get your type. So this lambda should only exists in this function.
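
To illustrate the reply on line 171 above: a conditional expression whose two arms are pointers to unrelated types has no common type, so the suggested form does not compile. Casting each arm separately does. The struct and function names below are simplified stand-ins for this example.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified raw entries of two unrelated types.
struct RawSymbolEntry32 {
  uint32_t Value;
};
struct RawSymbolEntry64 {
  uint64_t Value;
};

// `reinterpret_cast<uintptr_t>(Entry32 ? Entry32 : Entry64)` is ill-formed:
// the two pointer operands have unrelated pointee types, so the conditional
// has no common type. Casting each arm first gives both arms type uintptr_t.
uintptr_t getEntryAddress(const RawSymbolEntry32 *Entry32,
                          const RawSymbolEntry64 *Entry64) {
  return Entry32 ? reinterpret_cast<uintptr_t>(Entry32)
                 : reinterpret_cast<uintptr_t>(Entry64);
}
```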

jasonliu updated this revision to Diff 338896.Apr 20 2021, 9:16 AM

jasonliu marked 2 inline comments as done.

Harbormaster completed remote builds in B99734: Diff 338892.Apr 20 2021, 9:44 AM

Harbormaster completed remote builds in B99735: Diff 338896.Apr 20 2021, 10:04 AM

Esme added a child revision: D101272: [llvm-objdump][XCOFF][AIX] Enable the -l (--line-numbers) option..Apr 25 2021, 7:19 PM

Address clang-tidy comment and added binary file.

Ping.

jsji added a reviewer: Restricted Project.Apr 27 2021, 9:00 AM

Harbormaster completed remote builds in B101173: Diff 340857.Apr 27 2021, 9:08 AM

jhenderson added inline comments.Apr 28 2021, 6:28 AM

llvm/lib/Object/XCOFFObjectFile.cpp
831	Is this error user-facing (I'm assuming so)? Assuming it is, you should record here which symbol is causing the problem. Otherwise the user will be faced with an error along the lines of this: error: this csec symbol contains no auxiliary entry which is not really actionable (imagine the input had 100000 symbols in - the user can't realistically go through each to find the offending one).
848	Similar to my above comment - which entry was not found? Give the user more context so that they can act on the problem.
llvm/tools/llvm-readobj/XCOFFDumper.cpp
416	This will write the error inline, rather than to stderr. Are you sure that's what you want? it isn't what most dumping tools do on failure.

Address comments.

jasonliu marked 3 inline comments as done.Apr 28 2021, 9:07 AM

Harbormaster completed remote builds in B101435: Diff 341230.Apr 28 2021, 10:18 AM

jhenderson added inline comments.Apr 29 2021, 12:22 AM

llvm/lib/Object/XCOFFObjectFile.cpp
831	I believe this requires `std::move(Error);`, as you're returning an `Expected`, not an `Error`.
835–836	I think it would make more sense to insert the name in the middle of the message to make it a bit more concise. Something like: "csect symbol `name` contains no auxiliary entry"
863	I'd suggest quoting somehow the symbol name, so that any whitespace or similar that happens to end up in the name (rare, but possible to write using assembly, at least for other platforms) is easily understood to be part of the name.
llvm/tools/llvm-readobj/XCOFFDumper.cpp
414–415	Use `reportError` or `reportWarning` so that the error is reported in a clean manner and consistent with other llvm-readobj varieties, and not `report_fatal_error` which looks like a crash. General rule of thumb: try to avoid using `report_fatal_error`, especially in tool code where it is easy to report errors properly. In llvm-readobj for ELF, we try to avoid even using `reportError` where possible, as that stops the tool from continuing dumping, which can be problematic occasionally. We prefer `reportWarning` (or more specifically the local `reportUniqueWarning` which avoids reporting the same warning multiple times) and bailing out of the current routine. Take a look at ELFDumper.cpp for examples.
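
The "warn once, skip the current item, keep dumping" pattern described above can be sketched without any LLVM dependency as follows. `WarningReporter`, `Symbol`, and `dumpSymbols` are hypothetical stand-ins written for this example; they are not the llvm-readobj API, and the real `reportUniqueWarning` has a different signature.

```cpp
#include <cassert>
#include <cstddef>
#include <iostream>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for a dumper's warning helper.
class WarningReporter {
  std::set<std::string> Seen;

public:
  // Prints each distinct message once. Returns true if it was printed.
  bool reportUniqueWarning(const std::string &Msg) {
    if (!Seen.insert(Msg).second)
      return false;
    std::cerr << "warning: " << Msg << '\n';
    return true;
  }
};

struct Symbol {
  std::string Name;
  bool HasCsectAux;
};

// Dumps what it can. Returns the number of symbols dumped without a warning.
std::size_t dumpSymbols(const std::vector<Symbol> &Symbols,
                        WarningReporter &Reporter) {
  std::size_t Dumped = 0;
  for (const Symbol &Sym : Symbols) {
    if (!Sym.HasCsectAux) {
      // Quote the name so stray whitespace is visibly part of it.
      Reporter.reportUniqueWarning("csect symbol \"" + Sym.Name +
                                   "\" contains no auxiliary entry");
      continue; // Skip only this symbol; keep dumping the rest.
    }
    std::cout << "Symbol: " << Sym.Name << '\n';
    ++Dumped;
  }
  return Dumped;
}
```

The user still gets output for every well-formed symbol, and the warning names the offending one.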

Address comments.

llvm/tools/llvm-readobj/XCOFFDumper.cpp
414–415	Thanks for the elaboration. That clears up things a lot for me. I will use reportUniqueWarning here.

Harbormaster completed remote builds in B101730: Diff 341639.Apr 29 2021, 4:24 PM

Esme mentioned this in D97656: [llvm-objcopy] Initial XCOFF32 support..Apr 29 2021, 9:41 PM

jhenderson added inline comments.Apr 30 2021, 1:34 AM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
414–415	`reportUniqueWarning` can take an `Error` directly, so you can just do: if (!ErrOrCsectAuxRef) reportUniqueWarning(ErrOrCsectAuxRef.takeError()); Also, be careful, as the program continues, so referencing `ErrOrCsectAuxRef` after this may result in things going wrong...

Address comments.

Harbormaster completed remote builds in B101920: Diff 341894.Apr 30 2021, 8:21 AM

EGuesnet added a subscriber: EGuesnet.May 5 2021, 4:51 AM

@jhenderson @DiggerLin @Esme
Any more comments?

In D85774#2738962, @jasonliu wrote:

@jhenderson @DiggerLin @Esme
Any more comments?

I'll try to take another look in the next day or two. The pre-merge bots are failing. Is that an issue with this patch?

In D85774#2738969, @jhenderson wrote:

In D85774#2738962, @jasonliu wrote:

@jhenderson @DiggerLin @Esme
Any more comments?

I'll try to take another look in the next day or two. The pre-merge bots are failing. Is that an issue with this patch?

Thanks a lot.
I think it's more of an issue that the binaries did not get into the phabricator probably. Not really an issue with the patch itself.

Only some minor nits, otherwise LGTM. I haven't attempted to review the test coverage for all the new code, as I don't feel like I'm in a good position to do that, not being an XCOFF developer. I assume someone else has though.

llvm/include/llvm/Object/XCOFFObjectFile.h
483
485
llvm/lib/Object/XCOFFObjectFile.cpp
797
808	Basically any time you use `consumeError`, add a comment explaining why it's justified that we don't report the error to the user.

This revision is now accepted and ready to land.May 10 2021, 1:14 AM

DiggerLin added inline comments.May 10 2021, 12:37 PM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
390–391	This only holds for 32-bit: "By convention, the csect auxiliary entry in an XCOFF32 file must be the last auxiliary entry for any external symbol that has more than one auxiliary entry". For 64-bit, it may be better to look for x_auxtype == AUX_CSECT.
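
The difference between the two formats pointed out above can be sketched like this. The `AuxEntry` struct and `findCsectAux` are hypothetical, and the numeric type values are assumptions that should be checked against XCOFF.h.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Assumed numeric values for the 64-bit auxiliary type field; check them
// against XCOFF.h before relying on them.
constexpr uint8_t AuxCsect = 251;
constexpr uint8_t AuxFile = 252;

// Hypothetical, simplified auxiliary entry.
struct AuxEntry {
  uint8_t AuxType; // Only meaningful in 64-bit objects.
};

// Returns the csect auxiliary entry, or nullptr if there is none.
const AuxEntry *findCsectAux(const std::vector<AuxEntry> &AuxEntries,
                             bool Is64Bit) {
  if (AuxEntries.empty())
    return nullptr;
  // XCOFF32: by convention the csect auxiliary entry is the last one.
  if (!Is64Bit)
    return &AuxEntries.back();
  // XCOFF64: each auxiliary entry carries a type, so search for it.
  for (const AuxEntry &Entry : AuxEntries)
    if (Entry.AuxType == AuxCsect)
      return &Entry;
  return nullptr;
}
```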

DiggerLin added inline comments.May 10 2021, 1:03 PM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
187	if (AuxEntPtr->AuxType != XCOFF::AUX_FILE ) , it should not be parsed as XCOFF::AUX_FILE it may better to print out raw data in the printSymbol()
236	if (AuxEntPtr->AuxType != XCOFF::AUX_CSECT) , it should not be parsed as XCOFF::AUX_CSECT above it may better to print out raw data in the printSymbol()

Address comments.

jasonliu marked 2 inline comments as done.May 13 2021, 11:18 AM

jasonliu added inline comments.

llvm/tools/llvm-readobj/XCOFFDumper.cpp
187	I think it's the caller's responsibility to make sure they are passing in the right auxiliary type. So we should assume when we enter this function, we have the right auxiliary type here. So I modified it and made it an assert instead.
236	Same above. I modified it to be an assertion instead.
390–391	Good point. Updated the code.

DiggerLin added inline comments.May 13 2021, 12:07 PM

llvm/lib/Object/XCOFFObjectFile.cpp
94	SymbolAuxType is only for the 64 bits. add assert(is64Bit() ) before return ?
llvm/tools/llvm-readobj/XCOFFDumper.cpp
378	as you mention "I think it's the caller's responsibility to make sure they are passing in the right auxiliary type. " I think we need to check the AuxType == XCOFF::AUX_FILE for 64 bits. if not , print out the raw data as AUX_CSECT did ?

DiggerLin added inline comments.May 13 2021, 12:22 PM

llvm/tools/llvm-readobj/XCOFFDumper.cpp
416	We iterate over the auxiliary entries in lines 382~396 and then iterate over them again in printCsectAuxEnt(). I think we can improve on that. Also, in 64-bit mode, the auxiliary entries may be reordered by the implementation above: it prints all non-AUX_CSECT auxiliary entries first and then the AUX_CSECT entry, even if the AUX_CSECT entry is in the middle of the auxiliary entries, and if there are two AUX_CSECT auxiliary entries, only one is printed.

jasonliu marked 2 inline comments as done.May 13 2021, 12:47 PM

jasonliu added inline comments.

llvm/tools/llvm-readobj/XCOFFDumper.cpp
416	I don't think we reiterate in printCsectAuxEnt() over other auxiliary entries. We do something similar to lines 382~396 in `getXCOFFCsectAuxRef`, but that function is specifically designed to only get the XCOFFCsectAuxRef, so it's really not intended to pass information out of there. I don't think the re-iteration should be a big concern, because in theory lines 382~396 won't be executed all that much. If they are, I would rather have it implemented properly (i.e. actually recognize the missing auxiliary entry) instead of just printing out raw data.

jasonliu updated this revision to Diff 345261.May 13 2021, 1:12 PM

jasonliu marked an inline comment as done.

LGTM once the comment below is addressed.

llvm/tools/llvm-readobj/XCOFFDumper.cpp
366	It needs "continue;" here.

and please run clang-format

Address comments.

Harbormaster completed remote builds in B104384: Diff 345283.May 13 2021, 4:28 PM

I took a quick look at the pre-merge output. The message is that the object isn't recognised as a valid object file, not that it wasn't found, which suggests either the object or code is broken in some manner, or you're missing a dependent patch. Does this patch depend on another patch that isn't in main yet?

llvm/tools/llvm-readobj/XCOFFDumper.cpp
357–372	`i` -> `I` (you're changing most of the loop body - you might as well fix this whilst you're here)
363	Test case?
392–394	My English language ping went off at the word "till" in these two sentences. I'd probably change it to "to". Also, use "first" rather than "1st", I suggest in both places. Also "skips" -> "skip" for grammatical consistency.
396	`i` -> `I`

Address comments.

In D85774#2758956, @jhenderson wrote:

I took a quick look at the pre-merge output. The message is that the object isn't recognised as a valid object file, not that it wasn't found, which suggests either the object or code is broken in some manner, or you're missing a dependent patch. Does this patch depend on another patch that isn't in main yet?

No, this patch does not depend on other patches. I think this is caused by the git-generated binary in the patch not being reassembled into the same file?

Harbormaster completed remote builds in B104539: Diff 345492.May 14 2021, 11:41 AM

In D85774#2760205, @jasonliu wrote:

In D85774#2758956, @jhenderson wrote:

I took a quick look at the pre-merge output. The message is that the object isn't recognised as a valid object file, not that it wasn't found, which suggests either the object or code is broken in some manner, or you're missing a dependent patch. Does this patch depend on another patch that isn't in main yet?

No, this patch does not depend on other patches. I think this is caused by the git-generated binary in the patch not being reassembled into the same file?

Sounds vaguely plausible, I guess.

llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test
2–3	Fix a couple of grammar issues and a premature line-wrap.
5–6
20	The output on this line looks incorrect to me. Possibly a bug in the code resulting from a missing new line?

Address comments.

jasonliu marked 2 inline comments as done.May 17 2021, 8:51 AM

jasonliu added inline comments.

llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test
20	I added a newline after all the raw bytes. Other than that, I think the output is good. We have 18 bytes per symbol table entry, and we are printing 18 raw bytes here.

Harbormaster completed remote builds in B104842: Diff 345903.May 17 2021, 10:07 AM

jhenderson added inline comments.May 18 2021, 12:17 AM

llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test
20	I don't know if it is, but you probably want `00fb` indented to line up nicely with the previous line. You can then confirm that this indentation is maintained by enabling `--match-full-lines` and `--strict-whitespace` in FileCheck. (If you do that, you'll need to remove the space after `CHECK-NEXT:`)

Adjust the printout.

jasonliu added inline comments.May 28 2021, 8:21 AM

llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test
20	Hi James, I adjusted the code to print `00fb` on the same line. I think that actually works better because we could have multiple auxiliary entries' data to print out. Putting each on its own line is easier to parse.

Harbormaster completed remote builds in B106719: Diff 348533.May 28 2021, 8:54 AM

Esme mentioned this in D98003: [obj2yaml][XCOFF] Dump sections.May 30 2021, 7:58 PM

MaryamBen mentioned this in D103696: [XCOFF][AIX] Add support for XCOFF 64 bit Object files.Jun 4 2021, 6:18 AM

jsji added a child revision: D103696: [XCOFF][AIX] Add support for XCOFF 64 bit Object files.Jun 4 2021, 6:54 AM

Looks good again. Sorry for the delay - I was off work last week.

This revision was landed with ongoing or failed builds.Jun 7 2021, 10:25 AM

Closed by commit rG8e84311a84b3: [XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing (authored by jasonliu).

This revision was automatically updated to reflect the committed changes.

jasonliu added a commit: rG8e84311a84b3: [XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing.

In D85774#2801934, @jhenderson wrote:

Looks good again. Sorry for the delay - I was off work last week.

No worries. Thanks for all the reviews.

MaryamBen added a child revision: D104639: [AIX][XCOFF] Add support for 64-bit file header and section header writing.Jun 21 2021, 6:46 AM

Revision Contents

Path	Size
llvm/include/llvm/BinaryFormat/XCOFF.h	9 lines
llvm/include/llvm/Object/XCOFFObjectFile.h	268 lines
llvm/lib/Object/XCOFFObjectFile.cpp	231 lines
llvm/test/tools/llvm-objdump/XCOFF/Inputs/xcoff-section-headers64.o
llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test	96 lines
llvm/test/tools/llvm-readobj/XCOFF/Inputs/file-aux-wrong64.o
llvm/test/tools/llvm-readobj/XCOFF/Inputs/symbol64.o
llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test	19 lines
llvm/test/tools/llvm-readobj/XCOFF/symbols64.test	387 lines
llvm/tools/llvm-objdump/XCOFFDump.cpp	24 lines
llvm/tools/llvm-readobj/XCOFFDumper.cpp	151 lines
llvm/tools/obj2yaml/xcoff2yaml.cpp	12 lines
llvm/unittests/Object/XCOFFObjectFileTest.cpp	112 lines

Diff 350352

llvm/include/llvm/BinaryFormat/XCOFF.h

	[290 lines not shown]
	};			};

	enum CFileCpuId : uint8_t {			enum CFileCpuId : uint8_t {
	TCPU_PPC64 = 2, ///< PowerPC common architecture 64-bit mode.			TCPU_PPC64 = 2, ///< PowerPC common architecture 64-bit mode.
	TCPU_COM = 3, ///< POWER and PowerPC architecture common.			TCPU_COM = 3, ///< POWER and PowerPC architecture common.
	TCPU_970 = 19 ///< PPC970 - PowerPC 64-bit architecture.			TCPU_970 = 19 ///< PPC970 - PowerPC 64-bit architecture.
	};			};

				enum SymbolAuxType : uint8_t {
				AUX_EXCEPT = 255, ///< Identifies an exception auxiliary entry.
				DiggerLin (Not Done): Must the name be consistent with the AIX OS file syms.h? What about changing it to AUX_EXCEPT? That is consistent with our current style.
				jasonliu (Author, Done): The current style in this file seems to be (correct me if I'm wrong): use a descriptive name if it's not directly taken from the OS header. Otherwise, take the name directly from the header and add a detailed description/comment to it. In this case, it follows the latter. Other enum members do not have '_' because the OS versions do not have it either.
				hubert.reinterpretcast (Not Done): The issue here is that the name from the OS header is a reserved name. So by reason of not wanting undefined behaviour, we cannot use the name taken from the OS header. Unfortunately, that means we cannot be consistent in terms of using the name taken from the OS header without switching all the enumerator names to be descriptive and in the LLVM style.
				jasonliu (Author, Done): Hmm... Is the namespace `XCOFF` not enough to prevent the undefined behavior from happening? Or are we afraid of people just using `using namespace XCOFF` to defeat it?
				hubert.reinterpretcast (Done): The practical cause of undefined behaviour in such a case is usually that the instance of the identifier here is misparsed, or otherwise has surprising behaviour, either because it is defined as a macro or is an extension keyword. Whether the name turns out to be in scope elsewhere is not a factor for such mechanisms.
				jasonliu (Author, Done): Got it. I will switch the style of SymbolAuxType in the next revision. We will need to come up with a plan to switch the rest of the classes in this file (there are a lot).
				jhenderson (Not Done): +1 to dropping the underscore. I think it's okay for that to be the only change, but have no strong opinion either way, so happy with whichever you prefer.
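
The point in this thread about namespaces not protecting against macros can be sketched as follows. In C++, names that begin with an underscore followed by an uppercase letter are reserved to the implementation, so a system header may define them as macros. The macro name `OS_HEADER_NAME` and the namespace `xcoff_demo` below are hypothetical stand-ins; the real AIX header is not quoted in this review.

```cpp
#include <cstdint>

// Stand-in for a system header that defines a name as a macro.
// (Hypothetical name, used only to show the mechanism.)
#define OS_HEADER_NAME 255

namespace xcoff_demo {
// The preprocessor runs before namespaces are considered, so the line below
// would be rewritten to `enum Broken : std::uint8_t { 255 = 255 };` and
// would fail to compile, no matter which namespace it is in:
//   enum Broken : std::uint8_t { OS_HEADER_NAME = 255 };

// A name that no header is entitled to claim avoids the problem.
enum SymbolAuxType : std::uint8_t { AUX_EXCEPT = 255 };
} // namespace xcoff_demo

static_assert(xcoff_demo::AUX_EXCEPT == OS_HEADER_NAME,
              "same value, different spelling");
```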
				AUX_FCN = 254, ///< Identifies a function auxiliary entry.
				AUX_SYM = 253, ///< Identifies a symbol auxiliary entry.
				AUX_FILE = 252, ///< Identifies a file auxiliary entry.
				AUX_CSECT = 251, ///< Identifies a csect auxiliary entry.
				AUX_SECT = 250 ///< Identifies a SECT auxiliary entry.
				jhenderson (Not Done): Not that it really matters, but it's more traditional to order enums in ascending numerical order. Any particular reason you've done this in the reverse order?
				jasonliu (Author, Done): I sort of "copied" the list from the OS header, and that's just the order it appeared in there. I don't think it's particularly important to keep the current order; I could change it to ascending order.
				jhenderson (Not Done): I haven't got any particular preference, so am happy to defer to whatever you prefer on this one.
				}; // 64-bit XCOFF file only.

	StringRef getMappingClassString(XCOFF::StorageMappingClass SMC);			StringRef getMappingClassString(XCOFF::StorageMappingClass SMC);
	StringRef getRelocationTypeString(XCOFF::RelocationType Type);			StringRef getRelocationTypeString(XCOFF::RelocationType Type);
	SmallString<32> parseParmsType(uint32_t Value, unsigned ParmsNum);			SmallString<32> parseParmsType(uint32_t Value, unsigned ParmsNum);

	struct TracebackTable {			struct TracebackTable {
	enum LanguageID : uint8_t {			enum LanguageID : uint8_t {
	C,			C,
	Fortran,			Fortran,
	[115 lines not shown]

llvm/include/llvm/Object/XCOFFObjectFile.h

[91 lines not shown]

struct XCOFFSectionHeader64 : XCOFFSectionHeader<XCOFFSectionHeader64> {

support::big64_t FileOffsetToRelocationInfo; support::big64_t FileOffsetToRelocationInfo;

support::big64_t FileOffsetToLineNumberInfo; support::big64_t FileOffsetToLineNumberInfo;

support::ubig32_t NumberOfRelocations; support::ubig32_t NumberOfRelocations;

support::ubig32_t NumberOfLineNumbers; support::ubig32_t NumberOfLineNumbers;

support::big32_t Flags; support::big32_t Flags;

char Padding[4]; char Padding[4];

}; };

struct XCOFFSymbolEntry {

enum { NAME_IN_STR_TBL_MAGIC = 0x0 };

typedef struct {

support::big32_t Magic; // Zero indicates name in string table.

support::ubig32_t Offset;

} NameInStrTblType;

typedef struct {

uint8_t LanguageId;

uint8_t CpuTypeId;

} CFileLanguageIdAndTypeIdType;

union {

char SymbolName[XCOFF::NameSize];

NameInStrTblType NameInStrTbl;

};

support::ubig32_t Value; // Symbol value; storage class-dependent.

support::big16_t SectionNumber;

union {

support::ubig16_t SymbolType;

CFileLanguageIdAndTypeIdType CFileLanguageIdAndTypeId;

};

XCOFF::StorageClass StorageClass;

uint8_t NumberOfAuxEntries;

};

struct XCOFFStringTable { struct XCOFFStringTable {

uint32_t Size; uint32_t Size;

const char *Data; const char *Data;

}; };

struct XCOFFCsectAuxEnt32 { struct XCOFFCsectAuxEnt32 {

support::ubig32_t SectionOrLength;

DiggerLin (Not Done):

The type name Entry32Type is not easy to understand.
Change it to SymNameAndValue32? That would indicate that the first 12 bytes are related to the symbol name and symbol value.

jasonliu (Author, Done):

I don't think the name Entry32Type matters much, though. It just tells the user these are 32-bit-only entries. It's what's inside that matters.
Changing it to SymNameAndValue32 means an extra-long name to retrieve its actual entry, and the name would overlap with its members as well.
For example:
SymEntry.SymNameAndValue32.SymbolName
vs
SymEntry.Entry32Type.SymbolName

support::ubig32_t ParameterHashIndex;

support::ubig16_t TypeChkSectNum;

uint8_t SymbolAlignmentAndType;

XCOFF::StorageMappingClass StorageMappingClass;

support::ubig32_t StabInfoIndex;

support::ubig16_t StabSectNum;

DiggerLin (Not Done):

There is only SectionOrLengthLowByte64 in the union; SectionOrLengthHighByte64 can be deleted from the comment.

jasonliu (Author, Done):

But for 64-bit, SectionOrLength is represented by SectionOrLengthLowByte64 and SectionOrLengthHighByte64 combined. Deleting the high byte from the comment would make people think the high byte doesn't matter here.
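
To illustrate why both halves matter, here is a minimal sketch of combining a high and a low 32-bit field into one 64-bit value, in the spirit of `getSectionOrLength64()` in this patch. `combineSectionOrLength` is a hypothetical free function, and plain `uint32_t` stands in for the big-endian `support::ubig32_t` fields.

```cpp
#include <cstdint>

// Hypothetical helper: the 64-bit SectionOrLength is stored as two 32-bit
// halves, so both are needed to recover the full value.
constexpr std::uint64_t combineSectionOrLength(std::uint32_t High,
                                               std::uint32_t Low) {
  // Widen before shifting: shifting a 32-bit value by 32 bits is undefined.
  return (static_cast<std::uint64_t>(High) << 32) | Low;
}
```

Dropping the high half would silently truncate any csect length of 4 GiB or more.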

};

struct XCOFFCsectAuxEnt64 {

support::ubig32_t SectionOrLengthLowByte;

support::ubig32_t ParameterHashIndex;

support::ubig16_t TypeChkSectNum;

uint8_t SymbolAlignmentAndType;

XCOFF::StorageMappingClass StorageMappingClass;

support::ubig32_t SectionOrLengthHighByte;

uint8_t Pad;

XCOFF::SymbolAuxType AuxType;

};

class XCOFFCsectAuxRef {

public:

static constexpr uint8_t SymbolTypeMask = 0x07; static constexpr uint8_t SymbolTypeMask = 0x07;

static constexpr uint8_t SymbolAlignmentMask = 0xF8; static constexpr uint8_t SymbolAlignmentMask = 0xF8;

static constexpr size_t SymbolAlignmentBitOffset = 3; static constexpr size_t SymbolAlignmentBitOffset = 3;

- support::ubig32_t

- SectionOrLength; // If the symbol type is XTY_SD or XTY_CM, the csect

- // length.

+ XCOFFCsectAuxRef(const XCOFFCsectAuxEnt32 *Entry32) : Entry32(Entry32) {}

+ XCOFFCsectAuxRef(const XCOFFCsectAuxEnt64 *Entry64) : Entry64(Entry64) {}

+ // For getSectionOrLength(),

+ // If the symbol type is XTY_SD or XTY_CM, the csect length.

// If the symbol type is XTY_LD, the symbol table // If the symbol type is XTY_LD, the symbol table

// index of the containing csect. // index of the containing csect.

// If the symbol type is XTY_ER, 0. // If the symbol type is XTY_ER, 0.

- support::ubig32_t ParameterHashIndex;

- support::ubig16_t TypeChkSectNum;

- uint8_t SymbolAlignmentAndType;

- XCOFF::StorageMappingClass StorageMappingClass;

- support::ubig32_t StabInfoIndex;

- support::ubig16_t StabSectNum;

+ uint64_t getSectionOrLength() const {

+ return Entry32 ? getSectionOrLength32() : getSectionOrLength64();

+ }

+ uint32_t getSectionOrLength32() const {

+ assert(Entry32 && "32-bit interface called on 64-bit object file.");

MaskRay (Not Done):

You probably don't need these asserts. The dereference will crash anyway.

hubert.reinterpretcast (Not Done):

That's not true. There are some number of addressable bytes containing 0 starting from address 0x0 on AIX.
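
The exchange above can be made concrete with a small sketch of a reference wrapper that holds either a 32-bit or a 64-bit entry pointer and guards each width-specific accessor with an assert. `DemoEntry32`, `DemoEntry64` and `DemoAuxRef` are simplified, hypothetical stand-ins for the types in this patch.

```cpp
#include <cassert>
#include <cstdint>

// Simplified stand-ins for the 32-bit and 64-bit auxiliary entry layouts.
struct DemoEntry32 { std::uint32_t StabInfoIndex; };
struct DemoEntry64 { std::uint8_t AuxType; };

class DemoAuxRef {
public:
  explicit DemoAuxRef(const DemoEntry32 *E) : E32(E) {}
  explicit DemoAuxRef(const DemoEntry64 *E) : E64(E) {}

  bool is32() const { return E32 != nullptr; }

  std::uint32_t getStabInfoIndex32() const {
    // Dereferencing a null E32 is undefined behaviour. On a platform where
    // address 0 is readable it may return 0 instead of crashing, so the
    // assert is what actually reports the misuse.
    assert(E32 && "32-bit interface called on 64-bit object file.");
    return E32->StabInfoIndex;
  }

  std::uint8_t getAuxType64() const {
    assert(E64 && "64-bit interface called on 32-bit object file.");
    return E64->AuxType;
  }

private:
  const DemoEntry32 *E32 = nullptr;
  const DemoEntry64 *E64 = nullptr;
};
```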

return Entry32->SectionOrLength;

}

uint64_t getSectionOrLength64() const {

assert(Entry64 && "64-bit interface called on 32-bit object file.");

return (static_cast<uint64_t>(Entry64->SectionOrLengthHighByte) << 32) |

Entry64->SectionOrLengthLowByte;

}

#define GETVALUE(X) Entry32 ? Entry32->X : Entry64->X

uint32_t getParameterHashIndex() const {

return GETVALUE(ParameterHashIndex);

DiggerLin (Not Done):

Change to
return Entry32 ? Entry32->ParameterHashIndex : Entry64->ParameterHashIndex

and change the following functions too?

jasonliu (Author, Done):

I was initially worried about MSVC breakage here: https://github.com/llvm/llvm-project/commit/210314ae8c59bc0a8801c2528eda892cd5960c31
But after taking a closer look, it seems to be a problem only for conversion from a ubig32_t value to a uint64_t, which does not apply here.
So I will give it a try.

DiggerLin (Not Done):

Thanks, let me know.

}

uint16_t getTypeChkSectNum() const { return GETVALUE(TypeChkSectNum); }

XCOFF::StorageMappingClass getStorageMappingClass() const {

return GETVALUE(StorageMappingClass);

}

uintptr_t getEntryAddress() const {

return Entry32 ? reinterpret_cast<uintptr_t>(Entry32)

: reinterpret_cast<uintptr_t>(Entry64);

}

DiggerLin (Not Done):

What about return reinterpret_cast<uintptr_t>(Entry32 ? Entry32 : Entry64)?

jasonliu (Author, Done):

No, you could not do that. It only works if Entry32 and Entry64 are the same type, but they are not here.
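
A minimal sketch of the type issue mentioned in this reply: the two arms of a conditional expression need a common type, and pointers to unrelated structs have none, so each arm has to be cast before the operator is applied. `A32`, `A64` and `entryAddress` are hypothetical names used only for this illustration.

```cpp
#include <cassert>
#include <cstdint>

struct A32 { std::uint32_t V; };
struct A64 { std::uint64_t V; };

// `P32 ? P32 : P64` would not compile here: `const A32 *` and `const A64 *`
// are unrelated pointer types, so the conditional operator has no common
// type to convert them to. Casting each arm first gives both arms the same
// type.
inline std::uintptr_t entryAddress(const A32 *P32, const A64 *P64) {
  assert((P32 || P64) && "one of the entry pointers must be set");
  return P32 ? reinterpret_cast<std::uintptr_t>(P32)
             : reinterpret_cast<std::uintptr_t>(P64);
}
```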

uint16_t getAlignmentLog2() const { uint16_t getAlignmentLog2() const {

return (SymbolAlignmentAndType & SymbolAlignmentMask) >> return (getSymbolAlignmentAndType() & SymbolAlignmentMask) >>

SymbolAlignmentBitOffset; SymbolAlignmentBitOffset;

} }

DiggerLin (Done):

Change to return reinterpret_cast<uintptr_t>(Entry32 ? Entry32 : Entry64)

uint8_t getSymbolType() const { uint8_t getSymbolType() const {

return SymbolAlignmentAndType & SymbolTypeMask; return getSymbolAlignmentAndType() & SymbolTypeMask;

} }

bool isLabel() const { return getSymbolType() == XCOFF::XTY_LD; } bool isLabel() const { return getSymbolType() == XCOFF::XTY_LD; }

uint32_t getStabInfoIndex32() const {

assert(Entry32 && "32-bit interface called on 64-bit object file.");

return Entry32->StabInfoIndex;

}

uint16_t getStabSectNum32() const {

assert(Entry32 && "32-bit interface called on 64-bit object file.");

return Entry32->StabSectNum;

}

XCOFF::SymbolAuxType getAuxType64() const {

assert(Entry64 && "64-bit interface called on 32-bit object file.");

return Entry64->AuxType;

}

private:

uint8_t getSymbolAlignmentAndType() const {

return GETVALUE(SymbolAlignmentAndType);

}

DiggerLin (Done):

Reuse GETVALUE(SymbolAlignmentAndType)?

#undef GETVALUE

const XCOFFCsectAuxEnt32 *Entry32 = nullptr;

const XCOFFCsectAuxEnt64 *Entry64 = nullptr;

}; };

struct XCOFFFileAuxEnt { struct XCOFFFileAuxEnt {

typedef struct { typedef struct {

support::big32_t Magic; // Zero indicates name in string table. support::big32_t Magic; // Zero indicates name in string table.

support::ubig32_t Offset; support::ubig32_t Offset;

char NamePad[XCOFF::FileNamePadSize]; char NamePad[XCOFF::FileNamePadSize];

} NameInStrTblType; } NameInStrTblType;

union { union {

char Name[XCOFF::NameSize + XCOFF::FileNamePadSize]; char Name[XCOFF::NameSize + XCOFF::FileNamePadSize];

NameInStrTblType NameInStrTbl; NameInStrTblType NameInStrTbl;

}; };

XCOFF::CFileStringType Type; XCOFF::CFileStringType Type;

uint8_t ReservedZeros[2]; uint8_t ReservedZeros[2];

uint8_t AuxType; // 64-bit XCOFF file only. XCOFF::SymbolAuxType AuxType; // 64-bit XCOFF file only.

}; };

struct XCOFFSectAuxEntForStat { struct XCOFFSectAuxEntForStat {

support::ubig32_t SectionLength; support::ubig32_t SectionLength;

support::ubig16_t NumberOfRelocEnt; support::ubig16_t NumberOfRelocEnt;

support::ubig16_t NumberOfLineNum; support::ubig16_t NumberOfLineNum;

uint8_t Pad[10]; uint8_t Pad[10];

}; }; // 32-bit XCOFF file only.

struct XCOFFRelocation32 { struct XCOFFRelocation32 {

// Masks for packing/unpacking the r_rsize field of relocations. // Masks for packing/unpacking the r_rsize field of relocations.

// The msb is used to indicate if the bits being relocated are signed or // The msb is used to indicate if the bits being relocated are signed or

// unsigned. // unsigned.

static constexpr uint8_t XR_SIGN_INDICATOR_MASK = 0x80; static constexpr uint8_t XR_SIGN_INDICATOR_MASK = 0x80;

[17 lines not shown]

public: public:

bool isRelocationSigned() const; bool isRelocationSigned() const;

bool isFixupIndicated() const; bool isFixupIndicated() const;

// Returns the number of bits being relocated. // Returns the number of bits being relocated.

uint8_t getRelocatedLength() const; uint8_t getRelocatedLength() const;

}; };

class XCOFFSymbolRef;

class XCOFFObjectFile : public ObjectFile { class XCOFFObjectFile : public ObjectFile {

private: private:

const void *FileHeader = nullptr; const void *FileHeader = nullptr;

const void *SectionHeaderTable = nullptr; const void *SectionHeaderTable = nullptr;

const XCOFFSymbolEntry *SymbolTblPtr = nullptr; const void *SymbolTblPtr = nullptr;

XCOFFStringTable StringTable = {0, nullptr}; XCOFFStringTable StringTable = {0, nullptr};

const XCOFFFileHeader32 *fileHeader32() const; const XCOFFFileHeader32 *fileHeader32() const;

const XCOFFFileHeader64 *fileHeader64() const; const XCOFFFileHeader64 *fileHeader64() const;

const XCOFFSectionHeader32 *sectionHeaderTable32() const; const XCOFFSectionHeader32 *sectionHeaderTable32() const;

const XCOFFSectionHeader64 *sectionHeaderTable64() const; const XCOFFSectionHeader64 *sectionHeaderTable64() const;

size_t getFileHeaderSize() const; size_t getFileHeaderSize() const;

size_t getSectionHeaderSize() const; size_t getSectionHeaderSize() const;

const XCOFFSectionHeader32 *toSection32(DataRefImpl Ref) const; const XCOFFSectionHeader32 *toSection32(DataRefImpl Ref) const;

const XCOFFSectionHeader64 *toSection64(DataRefImpl Ref) const; const XCOFFSectionHeader64 *toSection64(DataRefImpl Ref) const;

uintptr_t getSectionHeaderTableAddress() const; uintptr_t getSectionHeaderTableAddress() const;

uintptr_t getEndOfSymbolTableAddress() const; uintptr_t getEndOfSymbolTableAddress() const;

// This returns a pointer to the start of the storage for the name field of // This returns a pointer to the start of the storage for the name field of

// the 32-bit or 64-bit SectionHeader struct. This string is *not* necessarily // the 32-bit or 64-bit SectionHeader struct. This string is *not* necessarily

// null-terminated. // null-terminated.

const char *getSectionNameInternal(DataRefImpl Sec) const; const char *getSectionNameInternal(DataRefImpl Sec) const;

// This function returns string table entry.

Expected<StringRef> getStringTableEntry(uint32_t Offset) const;

static bool isReservedSectionNumber(int16_t SectionNumber); static bool isReservedSectionNumber(int16_t SectionNumber);

// Constructor and "create" factory function. The constructor is only a thin // Constructor and "create" factory function. The constructor is only a thin

// wrapper around the base constructor. The "create" function fills out the // wrapper around the base constructor. The "create" function fills out the

// XCOFF-specific information and performs the error checking along the way. // XCOFF-specific information and performs the error checking along the way.

XCOFFObjectFile(unsigned Type, MemoryBufferRef Object); XCOFFObjectFile(unsigned Type, MemoryBufferRef Object);

static Expected<std::unique_ptr<XCOFFObjectFile>> create(unsigned Type, static Expected<std::unique_ptr<XCOFFObjectFile>> create(unsigned Type,

MemoryBufferRef MBR); MemoryBufferRef MBR);

[62 lines not shown]

public:

SubtargetFeatures getFeatures() const override; SubtargetFeatures getFeatures() const override;

Expected<uint64_t> getStartAddress() const override; Expected<uint64_t> getStartAddress() const override;

StringRef mapDebugSectionName(StringRef Name) const override; StringRef mapDebugSectionName(StringRef Name) const override;

bool isRelocatableObject() const override; bool isRelocatableObject() const override;

// Below here is the non-inherited interface. // Below here is the non-inherited interface.

bool is64Bit() const; bool is64Bit() const;

- const XCOFFSymbolEntry *getPointerToSymbolTable() const {

- assert(!is64Bit() && "Symbol table handling not supported yet.");

- return SymbolTblPtr;

- }

+ const void *getPointerToSymbolTable() const { return SymbolTblPtr; }

- Expected<StringRef>

- getSymbolSectionName(const XCOFFSymbolEntry *SymEntPtr) const;

+ Expected<StringRef> getSymbolSectionName(XCOFFSymbolRef Ref) const;

- const XCOFFSymbolEntry *toSymbolEntry(DataRefImpl Ref) const;

+ XCOFFSymbolRef toSymbolRef(DataRefImpl Ref) const;

// File header related interfaces. // File header related interfaces.

uint16_t getMagic() const; uint16_t getMagic() const;

uint16_t getNumberOfSections() const; uint16_t getNumberOfSections() const;

int32_t getTimeStamp() const; int32_t getTimeStamp() const;

// Symbol table offset and entry count are handled differently between // Symbol table offset and entry count are handled differently between

// XCOFF32 and XCOFF64. // XCOFF32 and XCOFF64.

uint32_t getSymbolTableOffset32() const; uint32_t getSymbolTableOffset32() const;

uint64_t getSymbolTableOffset64() const; uint64_t getSymbolTableOffset64() const;

// Note that this value is signed and might return a negative value. Negative // Note that this value is signed and might return a negative value. Negative

// values are reserved for future use. // values are reserved for future use.

int32_t getRawNumberOfSymbolTableEntries32() const; int32_t getRawNumberOfSymbolTableEntries32() const;

// The sanitized value appropriate to use as an index into the symbol table. // The sanitized value appropriate to use as an index into the symbol table.

uint32_t getLogicalNumberOfSymbolTableEntries32() const; uint32_t getLogicalNumberOfSymbolTableEntries32() const;

uint32_t getNumberOfSymbolTableEntries64() const; uint32_t getNumberOfSymbolTableEntries64() const;

// Return getLogicalNumberOfSymbolTableEntries32 or

// getNumberOfSymbolTableEntries64 depending on the object mode.

uint32_t getNumberOfSymbolTableEntries() const;

uint32_t getSymbolIndex(uintptr_t SymEntPtr) const; uint32_t getSymbolIndex(uintptr_t SymEntPtr) const;

uintptr_t getSymbolEntryAddressByIndex(uint32_t SymbolTableIndex) const;

DiggerLin (Not Done):

Should this be named getSymbolEntryAddressByIndex(uint32_t SymbolTableIndex) const?

jasonliu (Author, Done):

Sorry, what's the difference between Symbol and SymbolEntry?
I'm also seeing getSymbolIndex and getSymbolNameByIndex around this function. Any reason they are not "SymbolEntry"?

DiggerLin (Not Done):

The value of a symbol may be a symbol relocation address. At first glance I confused getSymbolAddress with getting the relocation address. getSymbolEntryAddress makes it clear that we want the address of the SymbolEntry, not the relocation address of a symbol.

jasonliu (Author, Done):

I don't think we have a symbol relocation address, IMO. The address related to relocation could be the relocation entry's address, or the virtual address data member inside a relocation entry. But those addresses are not related to the symbol in any way.
Also, there is a function from the base class that we override here called getSymbolAddress, which we don't want to change. So it would make sense to keep the same naming style here.

jasonliu (Author, Done):

Hmm... Just realized what you meant.
getSymbolAddress actually returns toSymbolRef.getValue(), which is a relocatable address.
This function is supposed to return the address of the symbol table entry within the object file.
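
The distinction settled in this thread is between the symbol's value and the location of its table entry. A minimal sketch of the latter, assuming the 18-byte entry size mentioned elsewhere in this review: the entry address is the table start plus the index times the entry size. The function names below are hypothetical and only mirror the idea behind `getSymbolEntryAddressByIndex`, `getAdvancedSymbolEntryAddress` and `getSymbolIndex`.

```cpp
#include <cstdint>

// Assumption: each symbol table entry occupies 18 bytes.
constexpr std::uint32_t SymbolTableEntrySize = 18;

// Hypothetical sketch: address of the entry `Distance` entries after Current.
constexpr std::uintptr_t advanceEntryAddress(std::uintptr_t Current,
                                             std::uint32_t Distance) {
  return Current + static_cast<std::uintptr_t>(Distance) * SymbolTableEntrySize;
}

// Address of the entry at Index, counted from the start of the table.
constexpr std::uintptr_t entryAddressByIndex(std::uintptr_t TableStart,
                                             std::uint32_t Index) {
  return advanceEntryAddress(TableStart, Index);
}

// Inverse mapping: table index of the entry at EntryAddr.
constexpr std::uint32_t symbolIndex(std::uintptr_t TableStart,
                                    std::uintptr_t EntryAddr) {
  return static_cast<std::uint32_t>((EntryAddr - TableStart) /
                                    SymbolTableEntrySize);
}
```

None of these touch the symbol's `Value` field, which is what `getSymbolAddress` is described as returning.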

Expected<StringRef> getSymbolNameByIndex(uint32_t SymbolTableIndex) const; Expected<StringRef> getSymbolNameByIndex(uint32_t SymbolTableIndex) const;

Expected<StringRef> getCFileName(const XCOFFFileAuxEnt *CFileEntPtr) const; Expected<StringRef> getCFileName(const XCOFFFileAuxEnt *CFileEntPtr) const;

uint16_t getOptionalHeaderSize() const; uint16_t getOptionalHeaderSize() const;

uint16_t getFlags() const; uint16_t getFlags() const;

// Section header table related interfaces. // Section header table related interfaces.

ArrayRef<XCOFFSectionHeader32> sections32() const; ArrayRef<XCOFFSectionHeader32> sections32() const;

ArrayRef<XCOFFSectionHeader64> sections64() const; ArrayRef<XCOFFSectionHeader64> sections64() const;

int32_t getSectionFlags(DataRefImpl Sec) const; int32_t getSectionFlags(DataRefImpl Sec) const;

Expected<DataRefImpl> getSectionByNum(int16_t Num) const; Expected<DataRefImpl> getSectionByNum(int16_t Num) const;

void checkSymbolEntryPointer(uintptr_t SymbolEntPtr) const; void checkSymbolEntryPointer(uintptr_t SymbolEntPtr) const;

// Relocation-related interfaces. // Relocation-related interfaces.

Expected<uint32_t> Expected<uint32_t>

getLogicalNumberOfRelocationEntries(const XCOFFSectionHeader32 &Sec) const; getLogicalNumberOfRelocationEntries(const XCOFFSectionHeader32 &Sec) const;

Expected<ArrayRef<XCOFFRelocation32>> Expected<ArrayRef<XCOFFRelocation32>>

relocations(const XCOFFSectionHeader32 &) const; relocations(const XCOFFSectionHeader32 &) const;

// This function returns string table entry.

Expected<StringRef> getStringTableEntry(uint32_t Offset) const;

DiggerLin (Not Done):

Not sure whether we will want Distance to be negative in the future. I think changing it to int32_t Distance would mean that we can move backward.

jasonliu (Author, Done):

I don't see a need to jump backward now. If it's needed in the future, we can always change it in a future patch.

const XCOFF::SymbolAuxType *getSymbolAuxType(uintptr_t AuxEntryAddress) const;

static uintptr_t getAdvancedSymbolEntryAddress(uintptr_t CurrentAddress,

uint32_t Distance);

static bool classof(const Binary *B) { return B->isXCOFF(); } static bool classof(const Binary *B) { return B->isXCOFF(); }

}; // XCOFFObjectFile }; // XCOFFObjectFile

- class XCOFFSymbolRef {

- const DataRefImpl SymEntDataRef;

+ typedef struct {

+ uint8_t LanguageId;

DiggerLin (Not Done):

Not sure whether we want to define an enum for the LanguageID in this patch.
The values for this field are defined in the e_lang field in "Exception Section".

jasonliu (Author, Done):

I think we are already doing the enum mapping in tools/llvm-readobj/XCOFFDumper.cpp. I don't see a strong need to create an enum for it.

- const XCOFFObjectFile *const OwningObjectPtr;

+ uint8_t CpuTypeId;

} CFileLanguageIdAndTypeIdType;

struct XCOFFSymbolEntry32 {

typedef struct {

support::big32_t Magic; // Zero indicates name in string table.

support::ubig32_t Offset;

} NameInStrTblType;

union {

char SymbolName[XCOFF::NameSize];

NameInStrTblType NameInStrTbl;

};

support::ubig32_t Value; // Symbol value; storage class-dependent.

support::big16_t SectionNumber;

union {

support::ubig16_t SymbolType;

CFileLanguageIdAndTypeIdType CFileLanguageIdAndTypeId;

};

XCOFF::StorageClass StorageClass;

uint8_t NumberOfAuxEntries;

};

struct XCOFFSymbolEntry64 {

support::ubig64_t Value; // Symbol value; storage class-dependent.

support::ubig32_t Offset;

support::big16_t SectionNumber;

union {

support::ubig16_t SymbolType;

CFileLanguageIdAndTypeIdType CFileLanguageIdAndTypeId;

};

XCOFF::StorageClass StorageClass;

uint8_t NumberOfAuxEntries;

};

class XCOFFSymbolRef {

public: public:

enum { NAME_IN_STR_TBL_MAGIC = 0x0 };

XCOFFSymbolRef(DataRefImpl SymEntDataRef, XCOFFSymbolRef(DataRefImpl SymEntDataRef,

const XCOFFObjectFile *OwningObjectPtr) const XCOFFObjectFile *OwningObjectPtr)

: SymEntDataRef(SymEntDataRef), OwningObjectPtr(OwningObjectPtr){}; : OwningObjectPtr(OwningObjectPtr) {

assert(OwningObjectPtr && "OwningObjectPtr cannot be nullptr!");

jhendersonUnsubmitted

Done

: OwningObjectPtr(OwningObjectPtr) {

- assert(OwningObjectPtr && "OwningObjectPtr can not be nullptr!");

+ assert(OwningObjectPtr && "OwningObjectPtr cannot be nullptr!");

assert(SymEntDataRef.p != 0 &&

jhenderson:

assert(SymEntDataRef.p != 0 &&

DiggerLinUnsubmitted

Done

Add assert(OwningObjectPtr != nullptr) here?

DiggerLinUnsubmitted

Done

"Symbol table pointer can not be nullptr!" --> "Symbol table entry pointer can not be nullptr!"


"Symbol table entry pointer cannot be nullptr!");

jhendersonUnsubmitted

Done

assert(SymEntDataRef.p != 0 &&

- "Symbol table entry pointer can not be nullptr!");

+ "Symbol table entry pointer cannot be nullptr!");

if (OwningObjectPtr->is64Bit())

jhenderson:

if (OwningObjectPtr->is64Bit())

Entry64 = reinterpret_cast<const XCOFFSymbolEntry64 *>(SymEntDataRef.p);

else

Entry32 = reinterpret_cast<const XCOFFSymbolEntry32 *>(SymEntDataRef.p);

}

uint64_t getValue() const { return Entry32 ? getValue32() : getValue64(); }

DiggerLinUnsubmitted

Not Done

Use GETVALUE(Value) here for consistency?

jasonliuAuthorUnsubmitted

Done

I would prefer to be more explicit here because we are doing a conversion to larger value for 32 bit version, which is different from the rest of GETVALUE(Value).
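
The distinction drawn here can be illustrated with a minimal standalone sketch. The types Entry32, Entry64, and SymbolView below are hypothetical simplifications, not the classes from this patch. The only point is that the common accessor widens the 32-bit value explicitly, while the width-specific accessors keep the on-disk type visible.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified stand-ins for the two on-disk entry layouts.
struct Entry32 {
  uint32_t Value;
  int16_t SectionNumber;
};
struct Entry64 {
  uint64_t Value;
  int16_t SectionNumber;
};

// Hypothetical view class: exactly one of the two pointers is non-null.
class SymbolView {
public:
  explicit SymbolView(const Entry32 *E) : E32(E) {
    assert(E && "Entry pointer cannot be nullptr!");
  }
  explicit SymbolView(const Entry64 *E) : E64(E) {
    assert(E && "Entry pointer cannot be nullptr!");
  }

  // Width-specific accessors return the type stored in the entry.
  uint32_t getValue32() const {
    assert(E32 && "32-bit accessor called on a 64-bit entry.");
    return E32->Value;
  }
  uint64_t getValue64() const {
    assert(E64 && "64-bit accessor called on a 32-bit entry.");
    return E64->Value;
  }

  // Common accessor: the 32-bit value is zero-extended on purpose.
  uint64_t getValue() const {
    return E32 ? static_cast<uint64_t>(getValue32()) : getValue64();
  }

  // Same type in both layouts, so no conversion is involved.
  int16_t getSectionNumber() const {
    return E32 ? E32->SectionNumber : E64->SectionNumber;
  }

private:
  const Entry32 *E32 = nullptr;
  const Entry64 *E64 = nullptr;
};
```

Callers that only need the value can use getValue(); code that cares about the on-disk width can still call getValue32() or getValue64() directly.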


uint32_t getValue32() const { return Entry32->Value; }

uint64_t getValue64() const { return Entry64->Value; }

#define GETVALUE(X) Entry32 ? Entry32->X : Entry64->X

int16_t getSectionNumber() const { return GETVALUE(SectionNumber); }

uint16_t getSymbolType() const { return GETVALUE(SymbolType); }

uint8_t getLanguageIdForCFile() const {

assert(getStorageClass() == XCOFF::C_FILE &&

"This interface is for C_FILE only.");

return GETVALUE(CFileLanguageIdAndTypeId.LanguageId);

}

uint8_t getCPUTypeIddForCFile() const {

assert(getStorageClass() == XCOFF::C_FILE &&

"This interface is for C_FILE only.");

return GETVALUE(CFileLanguageIdAndTypeId.CpuTypeId);

}

XCOFF::StorageClass getStorageClass() const { return GETVALUE(StorageClass); }

uint8_t getNumberOfAuxEntries() const { return GETVALUE(NumberOfAuxEntries); }

XCOFF::StorageClass getStorageClass() const; #undef GETVALUE

uint8_t getNumberOfAuxEntries() const;

const XCOFFCsectAuxEnt32 *getXCOFFCsectAuxEnt32() const;

uint16_t getType() const;

int16_t getSectionNumber() const;

bool hasCsectAuxEnt() const; uintptr_t getEntryAddress() const {

return Entry32 ? reinterpret_cast<uintptr_t>(Entry32)

: reinterpret_cast<uintptr_t>(Entry64);

}

Expected<StringRef> getName() const;

DiggerLinUnsubmitted

Done

maybe we can use a macro here.
#define GETVALUE(X) Entry32 ? Entry32->X : Entry64 ->X

int16_t getSectionNumber() const {

return GETVALUE(SectionNumber);

}

uint16_t getSymbolType() const {

   return GETVALUE(SymbolType);
}

and so on
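
One caveat with a macro of this shape: without outer parentheses, an expression such as GETVALUE(X) + 1 would attach the + 1 only to the last operand of the conditional. The sketch below uses hypothetical, simplified types (Sym32, Sym64, SymView) and a made-up helper getSectionNumberPlusOne() to show a parenthesized variant. It is not the code from the patch, where the macro appears only directly in return statements and is undefined right after use.

```cpp
#include <cstdint>

// Hypothetical, simplified entry layouts sharing field names.
struct Sym32 {
  int16_t SectionNumber;
  uint16_t SymbolType;
};
struct Sym64 {
  int16_t SectionNumber;
  uint16_t SymbolType;
};

struct SymView {
  const Sym32 *E32 = nullptr;
  const Sym64 *E64 = nullptr;

// The outer parentheses keep the whole conditional together when the
// macro is used inside a larger expression.
#define GETVALUE(X) (E32 ? E32->X : E64->X)

  int16_t getSectionNumber() const { return GETVALUE(SectionNumber); }
  uint16_t getSymbolType() const { return GETVALUE(SymbolType); }

  // Illustration only: with an unparenthesized macro, the "+ 1" would
  // apply to the 64-bit branch alone.
  int getSectionNumberPlusOne() const { return GETVALUE(SectionNumber) + 1; }

#undef GETVALUE
};
```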


bool isFunction() const; bool isFunction() const;

bool isCsectSymbol() const;

Expected<XCOFFCsectAuxRef> getXCOFFCsectAuxRef() const;

private:

const XCOFFObjectFile *OwningObjectPtr;

const XCOFFSymbolEntry32 *Entry32 = nullptr;

DiggerLinUnsubmitted

Not Done

getAddress() may be confused with getting the address of the symbol. Maybe rename it to getEntryAddress()?

jasonliuAuthorUnsubmitted

Done

Do you still find it confusing after seeing my other comments about Symbol vs SymbolEntry?
I would think it's fine since we don't really have other addresses we could get in here.


jasonliuAuthorUnsubmitted

Done

Will change.


const XCOFFSymbolEntry64 *Entry64 = nullptr;

}; };

class TBVectorExt { class TBVectorExt {

friend class XCOFFTracebackTable; friend class XCOFFTracebackTable;

uint16_t Data; uint16_t Data;

uint32_t VecParmsInfo; uint32_t VecParmsInfo;


llvm/lib/Object/XCOFFObjectFile.cpp


namespace llvm { namespace llvm {

using namespace XCOFF; using namespace XCOFF;

namespace object { namespace object {

static const uint8_t FunctionSym = 0x20; static const uint8_t FunctionSym = 0x20;

static const uint8_t SymTypeMask = 0x07;

static const uint16_t NoRelMask = 0x0001; static const uint16_t NoRelMask = 0x0001;

static const size_t SymbolAuxTypeOffset = 17;

// Checks that [Ptr, Ptr + Size) bytes fall inside the memory buffer // Checks that [Ptr, Ptr + Size) bytes fall inside the memory buffer

// 'M'. Returns a pointer to the underlying object on success. // 'M'. Returns a pointer to the underlying object on success.

template <typename T> template <typename T>

static Expected<const T *> getObject(MemoryBufferRef M, const void *Ptr, static Expected<const T *> getObject(MemoryBufferRef M, const void *Ptr,

const uint64_t Size = sizeof(T)) { const uint64_t Size = sizeof(T)) {

uintptr_t Addr = reinterpret_cast<uintptr_t>(Ptr); uintptr_t Addr = reinterpret_cast<uintptr_t>(Ptr);

if (Error E = Binary::checkOffset(M, Addr, Size)) if (Error E = Binary::checkOffset(M, Addr, Size))


} }

uint8_t XCOFFRelocation32::getRelocatedLength() const { uint8_t XCOFFRelocation32::getRelocatedLength() const {

// The relocation encodes the bit length being relocated minus 1. Add back // The relocation encodes the bit length being relocated minus 1. Add back

// the 1 to get the actual length being relocated. // the 1 to get the actual length being relocated.

return (Info & XR_BIASED_LENGTH_MASK) + 1; return (Info & XR_BIASED_LENGTH_MASK) + 1;

} }

uintptr_t

XCOFFObjectFile::getAdvancedSymbolEntryAddress(uintptr_t CurrentAddress,

uint32_t Distance) {

return getWithOffset(CurrentAddress, Distance * XCOFF::SymbolTableEntrySize);

}

const XCOFF::SymbolAuxType *

XCOFFObjectFile::getSymbolAuxType(uintptr_t AuxEntryAddress) const {

assert(is64Bit() && "64-bit interface called on a 32-bit object file.");

DiggerLinUnsubmitted

Done

SymbolAuxType exists only in the 64-bit format.
Add assert(is64Bit()) before the return?

return viewAs<XCOFF::SymbolAuxType>(

getWithOffset(AuxEntryAddress, SymbolAuxTypeOffset));

}

void XCOFFObjectFile::checkSectionAddress(uintptr_t Addr, void XCOFFObjectFile::checkSectionAddress(uintptr_t Addr,

uintptr_t TableAddress) const { uintptr_t TableAddress) const {

if (Addr < TableAddress) if (Addr < TableAddress)

report_fatal_error("Section header outside of section header table."); report_fatal_error("Section header outside of section header table.");

uintptr_t Offset = Addr - TableAddress; uintptr_t Offset = Addr - TableAddress;

if (Offset >= getSectionHeaderSize() * getNumberOfSections()) if (Offset >= getSectionHeaderSize() * getNumberOfSections())

report_fatal_error("Section header outside of section header table."); report_fatal_error("Section header outside of section header table.");


XCOFFObjectFile::toSection64(DataRefImpl Ref) const { XCOFFObjectFile::toSection64(DataRefImpl Ref) const {

assert(is64Bit() && "64-bit interface called on a 32-bit object file."); assert(is64Bit() && "64-bit interface called on a 32-bit object file.");

#ifndef NDEBUG #ifndef NDEBUG

checkSectionAddress(Ref.p, getSectionHeaderTableAddress()); checkSectionAddress(Ref.p, getSectionHeaderTableAddress());

#endif #endif

return viewAs<XCOFFSectionHeader64>(Ref.p); return viewAs<XCOFFSectionHeader64>(Ref.p);

} }

const XCOFFSymbolEntry *XCOFFObjectFile::toSymbolEntry(DataRefImpl Ref) const { XCOFFSymbolRef XCOFFObjectFile::toSymbolRef(DataRefImpl Ref) const {

assert(!is64Bit() && "Symbol table support not implemented for 64-bit.");

assert(Ref.p != 0 && "Symbol table pointer can not be nullptr!"); assert(Ref.p != 0 && "Symbol table pointer can not be nullptr!");

#ifndef NDEBUG #ifndef NDEBUG

checkSymbolEntryPointer(Ref.p); checkSymbolEntryPointer(Ref.p);

#endif #endif

auto SymEntPtr = viewAs<XCOFFSymbolEntry>(Ref.p); return XCOFFSymbolRef(Ref, this);

return SymEntPtr;

} }

const XCOFFFileHeader32 *XCOFFObjectFile::fileHeader32() const { const XCOFFFileHeader32 *XCOFFObjectFile::fileHeader32() const {

assert(!is64Bit() && "32-bit interface called on 64-bit object file."); assert(!is64Bit() && "32-bit interface called on 64-bit object file.");

return static_cast<const XCOFFFileHeader32 *>(FileHeader); return static_cast<const XCOFFFileHeader32 *>(FileHeader);

} }

const XCOFFFileHeader64 *XCOFFObjectFile::fileHeader64() const { const XCOFFFileHeader64 *XCOFFObjectFile::fileHeader64() const {


const XCOFFSectionHeader64 * const XCOFFSectionHeader64 *

XCOFFObjectFile::sectionHeaderTable64() const { XCOFFObjectFile::sectionHeaderTable64() const {

assert(is64Bit() && "64-bit interface called on a 32-bit object file."); assert(is64Bit() && "64-bit interface called on a 32-bit object file.");

return static_cast<const XCOFFSectionHeader64 *>(SectionHeaderTable); return static_cast<const XCOFFSectionHeader64 *>(SectionHeaderTable);

} }

void XCOFFObjectFile::moveSymbolNext(DataRefImpl &Symb) const { void XCOFFObjectFile::moveSymbolNext(DataRefImpl &Symb) const {

const XCOFFSymbolEntry *SymEntPtr = toSymbolEntry(Symb); uintptr_t NextSymbolAddr = getAdvancedSymbolEntryAddress(

SymEntPtr += SymEntPtr->NumberOfAuxEntries + 1; Symb.p, toSymbolRef(Symb).getNumberOfAuxEntries() + 1);

#ifndef NDEBUG #ifndef NDEBUG

// This function is used by basic_symbol_iterator, which allows to // This function is used by basic_symbol_iterator, which allows to

// point to the end-of-symbol-table address. // point to the end-of-symbol-table address.

if (reinterpret_cast<uintptr_t>(SymEntPtr) != getEndOfSymbolTableAddress()) if (NextSymbolAddr != getEndOfSymbolTableAddress())

checkSymbolEntryPointer(reinterpret_cast<uintptr_t>(SymEntPtr)); checkSymbolEntryPointer(NextSymbolAddr);

#endif #endif

Symb.p = reinterpret_cast<uintptr_t>(SymEntPtr); Symb.p = NextSymbolAddr;

} }

Expected<StringRef> Expected<StringRef>

XCOFFObjectFile::getStringTableEntry(uint32_t Offset) const { XCOFFObjectFile::getStringTableEntry(uint32_t Offset) const {

// The byte offset is relative to the start of the string table. // The byte offset is relative to the start of the string table.

// A byte offset value of 0 is a null or zero-length symbol // A byte offset value of 0 is a null or zero-length symbol

// name. A byte offset in the range 1 to 3 (inclusive) points into the length // name. A byte offset in the range 1 to 3 (inclusive) points into the length

// field; as a soft-error recovery mechanism, we treat such cases as having an // field; as a soft-error recovery mechanism, we treat such cases as having an

// offset of 0. // offset of 0.

if (Offset < 4) if (Offset < 4)

return StringRef(nullptr, 0); return StringRef(nullptr, 0);

if (StringTable.Data != nullptr && StringTable.Size > Offset) if (StringTable.Data != nullptr && StringTable.Size > Offset)

return (StringTable.Data + Offset); return (StringTable.Data + Offset);

return make_error<GenericBinaryError>("Bad offset for string table entry", return make_error<GenericBinaryError>("Bad offset for string table entry",

object_error::parse_failed); object_error::parse_failed);

} }

Expected<StringRef> Expected<StringRef>

XCOFFObjectFile::getCFileName(const XCOFFFileAuxEnt *CFileEntPtr) const { XCOFFObjectFile::getCFileName(const XCOFFFileAuxEnt *CFileEntPtr) const {

if (CFileEntPtr->NameInStrTbl.Magic != if (CFileEntPtr->NameInStrTbl.Magic != XCOFFSymbolRef::NAME_IN_STR_TBL_MAGIC)

XCOFFSymbolEntry::NAME_IN_STR_TBL_MAGIC)

return generateXCOFFFixedNameStringRef(CFileEntPtr->Name); return generateXCOFFFixedNameStringRef(CFileEntPtr->Name);

return getStringTableEntry(CFileEntPtr->NameInStrTbl.Offset); return getStringTableEntry(CFileEntPtr->NameInStrTbl.Offset);

} }

Expected<StringRef> XCOFFObjectFile::getSymbolName(DataRefImpl Symb) const { Expected<StringRef> XCOFFObjectFile::getSymbolName(DataRefImpl Symb) const {

const XCOFFSymbolEntry *SymEntPtr = toSymbolEntry(Symb); return toSymbolRef(Symb).getName();

// A storage class value with the high-order bit on indicates that the name is

// a symbolic debugger stabstring.

if (SymEntPtr->StorageClass & 0x80)

return StringRef("Unimplemented Debug Name");

if (SymEntPtr->NameInStrTbl.Magic != XCOFFSymbolEntry::NAME_IN_STR_TBL_MAGIC)

return generateXCOFFFixedNameStringRef(SymEntPtr->SymbolName);

return getStringTableEntry(SymEntPtr->NameInStrTbl.Offset);

} }

Expected<uint64_t> XCOFFObjectFile::getSymbolAddress(DataRefImpl Symb) const { Expected<uint64_t> XCOFFObjectFile::getSymbolAddress(DataRefImpl Symb) const {

assert(!is64Bit() && "Symbol table support not implemented for 64-bit."); return toSymbolRef(Symb).getValue();

return toSymbolEntry(Symb)->Value;

} }

uint64_t XCOFFObjectFile::getSymbolValueImpl(DataRefImpl Symb) const { uint64_t XCOFFObjectFile::getSymbolValueImpl(DataRefImpl Symb) const {

DiggerLinUnsubmitted

Not Done

If we define getValue() for XCOFFSymbolRef, we can rewrite the code as:
Expected<uint64_t> XCOFFObjectFile::getSymbolAddress(DataRefImpl Symb) const {
  return XCOFFSymbolRef(Symb, this).getValue();
}

jasonliuAuthorUnsubmitted

Done

Please see my other comments regarding combining the 32bit and 64 bit version into 1 function.


assert(!is64Bit() && "Symbol table support not implemented for 64-bit."); return toSymbolRef(Symb).getValue();

return toSymbolEntry(Symb)->Value;

} }

uint64_t XCOFFObjectFile::getCommonSymbolSizeImpl(DataRefImpl Symb) const { uint64_t XCOFFObjectFile::getCommonSymbolSizeImpl(DataRefImpl Symb) const {

DiggerLinUnsubmitted

Not Done

Same as the comment above.

jasonliuAuthorUnsubmitted

Done

Please see my other comments regarding combining the 32bit and 64 bit version into 1 function.


uint64_t Result = 0; uint64_t Result = 0;

llvm_unreachable("Not yet implemented!"); llvm_unreachable("Not yet implemented!");

return Result; return Result;

} }

Expected<SymbolRef::Type> Expected<SymbolRef::Type>

XCOFFObjectFile::getSymbolType(DataRefImpl Symb) const { XCOFFObjectFile::getSymbolType(DataRefImpl Symb) const {

llvm_unreachable("Not yet implemented!"); llvm_unreachable("Not yet implemented!");

return SymbolRef::ST_Other; return SymbolRef::ST_Other;

} }

Expected<section_iterator> Expected<section_iterator>

XCOFFObjectFile::getSymbolSection(DataRefImpl Symb) const { XCOFFObjectFile::getSymbolSection(DataRefImpl Symb) const {

const XCOFFSymbolEntry *SymEntPtr = toSymbolEntry(Symb); const int16_t SectNum = toSymbolRef(Symb).getSectionNumber();

DiggerLinUnsubmitted

Done

const int16_t SectNum ?


int16_t SectNum = SymEntPtr->SectionNumber;

if (isReservedSectionNumber(SectNum)) if (isReservedSectionNumber(SectNum))

return section_end(); return section_end();

Expected<DataRefImpl> ExpSec = getSectionByNum(SectNum); Expected<DataRefImpl> ExpSec = getSectionByNum(SectNum);

if (!ExpSec) if (!ExpSec)

return ExpSec.takeError(); return ExpSec.takeError();

if (is64Bit())

report_fatal_error("64-bit support not implemented yet"); report_fatal_error("64-bit support not implemented yet");

const XCOFFRelocation32 *Reloc = viewAs<XCOFFRelocation32>(Rel.p); const XCOFFRelocation32 *Reloc = viewAs<XCOFFRelocation32>(Rel.p);

const uint32_t Index = Reloc->SymbolIndex; const uint32_t Index = Reloc->SymbolIndex;

if (Index >= getLogicalNumberOfSymbolTableEntries32()) if (Index >= getLogicalNumberOfSymbolTableEntries32())

return symbol_end(); return symbol_end();

DataRefImpl SymDRI; DataRefImpl SymDRI;

SymDRI.p = reinterpret_cast<uintptr_t>(getPointerToSymbolTable() + Index); SymDRI.p = getSymbolEntryAddressByIndex(Index);

return symbol_iterator(SymbolRef(SymDRI, this)); return symbol_iterator(SymbolRef(SymDRI, this));

} }

uint64_t XCOFFObjectFile::getRelocationType(DataRefImpl Rel) const { uint64_t XCOFFObjectFile::getRelocationType(DataRefImpl Rel) const {

if (is64Bit()) if (is64Bit())

report_fatal_error("64-bit support not implemented yet"); report_fatal_error("64-bit support not implemented yet");

return viewAs<XCOFFRelocation32>(Rel.p)->Type; return viewAs<XCOFFRelocation32>(Rel.p)->Type;

} }


Expected<uint32_t> XCOFFObjectFile::getSymbolFlags(DataRefImpl Symb) const { Expected<uint32_t> XCOFFObjectFile::getSymbolFlags(DataRefImpl Symb) const {

uint32_t Result = 0; uint32_t Result = 0;

llvm_unreachable("Not yet implemented!"); llvm_unreachable("Not yet implemented!");

return Result; return Result;

} }

basic_symbol_iterator XCOFFObjectFile::symbol_begin() const { basic_symbol_iterator XCOFFObjectFile::symbol_begin() const {

if (is64Bit())

report_fatal_error("64-bit support not implemented yet");

DataRefImpl SymDRI; DataRefImpl SymDRI;

SymDRI.p = reinterpret_cast<uintptr_t>(SymbolTblPtr); SymDRI.p = reinterpret_cast<uintptr_t>(SymbolTblPtr);

return basic_symbol_iterator(SymbolRef(SymDRI, this)); return basic_symbol_iterator(SymbolRef(SymDRI, this));

} }

basic_symbol_iterator XCOFFObjectFile::symbol_end() const { basic_symbol_iterator XCOFFObjectFile::symbol_end() const {

if (is64Bit())

report_fatal_error("64-bit support not implemented yet");

DataRefImpl SymDRI; DataRefImpl SymDRI;

SymDRI.p = reinterpret_cast<uintptr_t>( const uint32_t NumberOfSymbolTableEntries = getNumberOfSymbolTableEntries();

SymbolTblPtr + getLogicalNumberOfSymbolTableEntries32()); SymDRI.p = getSymbolEntryAddressByIndex(NumberOfSymbolTableEntries);

return basic_symbol_iterator(SymbolRef(SymDRI, this)); return basic_symbol_iterator(SymbolRef(SymDRI, this));

DiggerLinUnsubmitted

Not Done

Can we add a new member function such as
getNumberOfSymbolTableEntries() {
  return is64Bit() ? getNumberOfSymbolTableEntries64()
                   : getLogicalNumberOfSymbolTableEntries32();
}
The function could also be used in XCOFFObjectFile::create() and getSymbolNameByIndex().

jasonliuAuthorUnsubmitted

Done

About all the comments asking whether we could combine the 32-bit and 64-bit versions into one function:
I don't think it's a good idea, because people would overlook the fact that they return different types underneath.

jhendersonUnsubmitted

Not Done

From my experience working with tools that had to support 32-bit and 64-bit ELF, you don't worry about the underlying type in most cases and always use the larger type. The same probably applies here. Of course, it becomes a bit moot if you add a common getter interface as suggested out-of-line, because those getters will have to return the larger of the two return types anyway.

Is there a strong reason to not use the larger type everywhere?


hubert.reinterpretcastUnsubmitted

Not Done

> Is there a strong reason to not use the larger type everywhere?

I don't know what strength this reason has, but we had noticed that some of the tools do not reflect the width of the 32-bit format fields very well (even for relatively uninterpreted output). Where the producer of the binary is under development, developers are better served if the tools emit the correct width for fields in the format.

jhendersonUnsubmitted

Not Done

I think in the context of printing the appropriately formatted output, you'd want to switch on the source type (i.e. is64Bit or whatever), at the formatting time. Certainly, this is how we've done it in our own internal code bases I work on, and there are examples of this in a number of other LLVM utilities. For example in https://github.com/llvm/llvm-project/blob/master/llvm/lib/DebugInfo/DWARF/DWARFCompileUnit.cpp#L17, the dump function dumps the offset with a width according to the DWARF format (i.e. 32 or 64 bit), but the getLength function returns a 64-bit value always. Similarly https://github.com/llvm/llvm-project/blob/master/llvm/tools/llvm-objdump/ELFDump.cpp#L257 identifies the ELF format kind and uses that in determining the width of offset, size and address fields (which are stored as uint64_t) when printing ELF program header tables.

There are certainly plenty of places where this hasn't been done. Sometimes this is a mistake, other times it's for consistency with GNU output, but I think the preferred approach is the "store large, explicitly specify format on output" approach.
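
A minimal standalone sketch of the "store large, explicitly specify format on output" approach might look like the following. The helper formatAddress() and its Is64Bit parameter are hypothetical and do not correspond to any existing LLVM function. The value is always carried as uint64_t, and the field width is picked only when printing.

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper: the value is stored as uint64_t regardless of the
// object format; the number of hex digits is chosen at formatting time.
std::string formatAddress(uint64_t Value, bool Is64Bit) {
  char Buf[32];
  const int Digits = Is64Bit ? 16 : 8; // 64-bit vs 32-bit field width
  std::snprintf(Buf, sizeof(Buf), "0x%0*llx", Digits,
                static_cast<unsigned long long>(Value));
  return Buf;
}
```

With this split, getters never need a 32-bit and a 64-bit flavour; only the printing code consults the format.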


} }

section_iterator XCOFFObjectFile::section_begin() const { section_iterator XCOFFObjectFile::section_begin() const {

DataRefImpl DRI; DataRefImpl DRI;

DRI.p = getSectionHeaderTableAddress(); DRI.p = getSectionHeaderTableAddress();

return section_iterator(SectionRef(DRI, this)); return section_iterator(SectionRef(DRI, this));

} }

Expected<DataRefImpl> XCOFFObjectFile::getSectionByNum(int16_t Num) const {

DataRefImpl DRI; DataRefImpl DRI;

DRI.p = getWithOffset(getSectionHeaderTableAddress(), DRI.p = getWithOffset(getSectionHeaderTableAddress(),

getSectionHeaderSize() * (Num - 1)); getSectionHeaderSize() * (Num - 1));

return DRI; return DRI;

} }

Expected<StringRef> Expected<StringRef>

XCOFFObjectFile::getSymbolSectionName(const XCOFFSymbolEntry *SymEntPtr) const { XCOFFObjectFile::getSymbolSectionName(XCOFFSymbolRef SymEntPtr) const {

assert(!is64Bit() && "Symbol table support not implemented for 64-bit."); const int16_t SectionNum = SymEntPtr.getSectionNumber();

DiggerLinUnsubmitted

Done

const int16_t SectionNum ?


int16_t SectionNum = SymEntPtr->SectionNumber;

switch (SectionNum) { switch (SectionNum) {

case XCOFF::N_DEBUG: case XCOFF::N_DEBUG:

return "N_DEBUG"; return "N_DEBUG";

case XCOFF::N_ABS: case XCOFF::N_ABS:

return "N_ABS"; return "N_ABS";

case XCOFF::N_UNDEF: case XCOFF::N_UNDEF:

return "N_UNDEF"; return "N_UNDEF";


uint64_t XCOFFObjectFile::getSymbolTableOffset64() const { uint64_t XCOFFObjectFile::getSymbolTableOffset64() const {

return fileHeader64()->SymbolTableOffset; return fileHeader64()->SymbolTableOffset;

} }

uint32_t XCOFFObjectFile::getNumberOfSymbolTableEntries64() const { uint32_t XCOFFObjectFile::getNumberOfSymbolTableEntries64() const {

return fileHeader64()->NumberOfSymTableEntries; return fileHeader64()->NumberOfSymTableEntries;

} }

uintptr_t XCOFFObjectFile::getEndOfSymbolTableAddress() const { uint32_t XCOFFObjectFile::getNumberOfSymbolTableEntries() const {

uint32_t NumberOfSymTableEntries = return is64Bit() ? getNumberOfSymbolTableEntries64()

is64Bit() ? getNumberOfSymbolTableEntries64()

: getLogicalNumberOfSymbolTableEntries32(); : getLogicalNumberOfSymbolTableEntries32();

}

uintptr_t XCOFFObjectFile::getEndOfSymbolTableAddress() const {

const uint32_t NumberOfSymTableEntries = getNumberOfSymbolTableEntries();

return getWithOffset(reinterpret_cast<uintptr_t>(SymbolTblPtr), return getWithOffset(reinterpret_cast<uintptr_t>(SymbolTblPtr),

XCOFF::SymbolTableEntrySize * NumberOfSymTableEntries); XCOFF::SymbolTableEntrySize * NumberOfSymTableEntries);

} }

void XCOFFObjectFile::checkSymbolEntryPointer(uintptr_t SymbolEntPtr) const { void XCOFFObjectFile::checkSymbolEntryPointer(uintptr_t SymbolEntPtr) const {

if (SymbolEntPtr < reinterpret_cast<uintptr_t>(SymbolTblPtr)) if (SymbolEntPtr < reinterpret_cast<uintptr_t>(SymbolTblPtr))

report_fatal_error("Symbol table entry is outside of symbol table."); report_fatal_error("Symbol table entry is outside of symbol table.");


} }

uint32_t XCOFFObjectFile::getSymbolIndex(uintptr_t SymbolEntPtr) const { uint32_t XCOFFObjectFile::getSymbolIndex(uintptr_t SymbolEntPtr) const {

return (reinterpret_cast<const char *>(SymbolEntPtr) - return (reinterpret_cast<const char *>(SymbolEntPtr) -

reinterpret_cast<const char *>(SymbolTblPtr)) / reinterpret_cast<const char *>(SymbolTblPtr)) /

XCOFF::SymbolTableEntrySize; XCOFF::SymbolTableEntrySize;

} }

uintptr_t XCOFFObjectFile::getSymbolEntryAddressByIndex(uint32_t Index) const {

return getAdvancedSymbolEntryAddress(

reinterpret_cast<uintptr_t>(getPointerToSymbolTable()), Index);

}

Expected<StringRef> Expected<StringRef>

XCOFFObjectFile::getSymbolNameByIndex(uint32_t Index) const { XCOFFObjectFile::getSymbolNameByIndex(uint32_t Index) const {

if (is64Bit()) const uint32_t NumberOfSymTableEntries = getNumberOfSymbolTableEntries();

report_fatal_error("64-bit symbol table support not implemented yet.");

if (Index >= getLogicalNumberOfSymbolTableEntries32()) if (Index >= NumberOfSymTableEntries)

DiggerLinUnsubmitted

Not Done

Several places use NumberOfSymTableEntries as above; it may be good to provide a helper function.

return errorCodeToError(object_error::invalid_symbol_index); return errorCodeToError(object_error::invalid_symbol_index);

DataRefImpl SymDRI; DataRefImpl SymDRI;

SymDRI.p = reinterpret_cast<uintptr_t>(getPointerToSymbolTable() + Index); SymDRI.p = getSymbolEntryAddressByIndex(Index);

return getSymbolName(SymDRI); return getSymbolName(SymDRI);

} }

uint16_t XCOFFObjectFile::getFlags() const { uint16_t XCOFFObjectFile::getFlags() const {

return is64Bit() ? fileHeader64()->Flags : fileHeader32()->Flags; return is64Bit() ? fileHeader64()->Flags : fileHeader32()->Flags;

} }

const char *XCOFFObjectFile::getSectionNameInternal(DataRefImpl Sec) const { const char *XCOFFObjectFile::getSectionNameInternal(DataRefImpl Sec) const {

if (Obj->getNumberOfSections()) {

auto SecHeadersOrErr = getObject<void>(Data, Base + CurOffset, auto SecHeadersOrErr = getObject<void>(Data, Base + CurOffset,

Obj->getNumberOfSections() * Obj->getNumberOfSections() *

Obj->getSectionHeaderSize()); Obj->getSectionHeaderSize());

if (Error E = SecHeadersOrErr.takeError()) if (Error E = SecHeadersOrErr.takeError())

return std::move(E); return std::move(E);

Obj->SectionHeaderTable = SecHeadersOrErr.get(); Obj->SectionHeaderTable = SecHeadersOrErr.get();

} }

// 64-bit object supports only file header and section headers for now. const uint32_t NumberOfSymbolTableEntries =

if (Obj->is64Bit()) Obj->getNumberOfSymbolTableEntries();

return std::move(Obj);

// If there is no symbol table we are done parsing the memory buffer. // If there is no symbol table we are done parsing the memory buffer.

if (Obj->getLogicalNumberOfSymbolTableEntries32() == 0) if (NumberOfSymbolTableEntries == 0)

return std::move(Obj); return std::move(Obj);

// Parse symbol table. // Parse symbol table.

CurOffset = Obj->fileHeader32()->SymbolTableOffset; CurOffset = Obj->is64Bit() ? Obj->getSymbolTableOffset64()

uint64_t SymbolTableSize = (uint64_t)(sizeof(XCOFFSymbolEntry)) * : Obj->getSymbolTableOffset32();

DiggerLinUnsubmitted

Not Done

Can we add a member function such as
uint64_t XCOFFObjectFile::getSymbolTableOffset() const {
  return is64Bit() ? fileHeader64()->SymbolTableOffset
                   : fileHeader32()->SymbolTableOffset;
}
and then use
CurOffset = getSymbolTableOffset();
here?

jasonliuAuthorUnsubmitted

Done

getSymbolTableOffset64 and getSymbolTableOffset32 return different types.
Combining them to return the same type is likely to introduce errors when callers use them.

jhendersonUnsubmitted

Not Done

Related to my comments elsewhere - it looks to me like most consumers will need to handle both 32 and 64-bit versions, so they'll always have to do this dance. Thus your concern about how the caller uses them is misplaced - the caller is more likely to do the wrong thing i.e. call the wrong version than have problems with the return types.


Obj->getLogicalNumberOfSymbolTableEntries32(); const uint64_t SymbolTableSize =

DiggerLinUnsubmitted

Done

change to const uint64_t SymbolTableSize ?


static_cast<uint64_t>(XCOFF::SymbolTableEntrySize) *

NumberOfSymbolTableEntries;

auto SymTableOrErr = auto SymTableOrErr =

getObject<XCOFFSymbolEntry>(Data, Base + CurOffset, SymbolTableSize); getObject<void *>(Data, Base + CurOffset, SymbolTableSize);

if (Error E = SymTableOrErr.takeError()) if (Error E = SymTableOrErr.takeError())

return std::move(E); return std::move(E);

Obj->SymbolTblPtr = SymTableOrErr.get(); Obj->SymbolTblPtr = SymTableOrErr.get();

CurOffset += SymbolTableSize; CurOffset += SymbolTableSize;

// Parse String table. // Parse String table.

Expected<XCOFFStringTable> StringTableOrErr = Expected<XCOFFStringTable> StringTableOrErr =

parseStringTable(Obj.get(), CurOffset); parseStringTable(Obj.get(), CurOffset);

if (Error E = StringTableOrErr.takeError()) if (Error E = StringTableOrErr.takeError())

return std::move(E); return std::move(E);

Obj->StringTable = StringTableOrErr.get(); Obj->StringTable = StringTableOrErr.get();

return std::move(Obj); return std::move(Obj);

} }

Expected<std::unique_ptr<ObjectFile>> Expected<std::unique_ptr<ObjectFile>>

ObjectFile::createXCOFFObjectFile(MemoryBufferRef MemBufRef, ObjectFile::createXCOFFObjectFile(MemoryBufferRef MemBufRef,

unsigned FileType) { unsigned FileType) {

return XCOFFObjectFile::create(FileType, MemBufRef); return XCOFFObjectFile::create(FileType, MemBufRef);

} }

XCOFF::StorageClass XCOFFSymbolRef::getStorageClass() const { bool XCOFFSymbolRef::isFunction() const {

return OwningObjectPtr->toSymbolEntry(SymEntDataRef)->StorageClass; if (!isCsectSymbol())

} return false;

uint8_t XCOFFSymbolRef::getNumberOfAuxEntries() const { if (getSymbolType() & FunctionSym)

return OwningObjectPtr->toSymbolEntry(SymEntDataRef)->NumberOfAuxEntries; return true;

}

// TODO: The function needs to return an error if there is no csect auxiliary Expected<XCOFFCsectAuxRef> ExpCsectAuxEnt = getXCOFFCsectAuxRef();

// entry. if (!ExpCsectAuxEnt)

const XCOFFCsectAuxEnt32 *XCOFFSymbolRef::getXCOFFCsectAuxEnt32() const { return false;

jhendersonUnsubmitted

Not Done

Better than errorCodeToError(/*some error code*/) is to use createStringError() or createFileError() to provide more context to the failure (how did the parsing fail? where? etc).


assert(!OwningObjectPtr->is64Bit() &&

"32-bit interface called on 64-bit object file.");

assert(hasCsectAuxEnt() && "No Csect Auxiliary Entry is found.");

// In XCOFF32, the csect auxilliary entry is always the last auxiliary const XCOFFCsectAuxRef CsectAuxRef = ExpCsectAuxEnt.get();

// entry for the symbol.

uintptr_t AuxAddr = getWithOffset(

SymEntDataRef.p, XCOFF::SymbolTableEntrySize * getNumberOfAuxEntries());

#ifndef NDEBUG // A function definition should be a label definition.

OwningObjectPtr->checkSymbolEntryPointer(AuxAddr); // FIXME: This is not necessarily the case when -ffunction-sections is

DiggerLinUnsubmitted

Not Done

If we do not enable -ffunction-sections, the function entry is a label.


jasonliuAuthorUnsubmitted

Done

Thanks. I brought back the old behavior and added the FIXME to say that this function does not return a correct value if we have -ffunction-sections enabled.


jhendersonUnsubmitted

Done

// A function definition should be a label definition.

- // FIXME: This is not necessary the case when -ffunction-sections is enabled.

+ // FIXME: This is not necessarily the case when -ffunction-sections is enabled.

if (!CsectAuxRef.isLabel())


#endif // enabled.

if (!CsectAuxRef.isLabel())

return false;

return reinterpret_cast<const XCOFFCsectAuxEnt32 *>(AuxAddr); if (CsectAuxRef.getStorageMappingClass() != XCOFF::XMC_PR)

} return false;

uint16_t XCOFFSymbolRef::getType() const { const int16_t SectNum = getSectionNumber();

return OwningObjectPtr->toSymbolEntry(SymEntDataRef)->SymbolType; Expected<DataRefImpl> SI = OwningObjectPtr->getSectionByNum(SectNum);

if (!SI) {

jhendersonUnsubmitted

Not Done

I don't think you want to use int here. There's always going to be a positive number of entries, and there are no subtractions etc. involving Index here. Better would be an unsigned type of some form (presumably the return type of getNumberOfAuxEntries()).


// If we could not get the section, then this symbol should not be

jhendersonUnsubmitted

Done

Basically any time you use consumeError, add a comment explaining why it's justified that we don't report the error to the user.


// a function. So consume the error and return `false` to move on.

consumeError(SI.takeError());

return false;

} }

DiggerLinUnsubmitted

Done

const int16_t SectNum


int16_t XCOFFSymbolRef::getSectionNumber() const { return (OwningObjectPtr->getSectionFlags(SI.get()) & XCOFF::STYP_TEXT);

return OwningObjectPtr->toSymbolEntry(SymEntDataRef)->SectionNumber;

} }

jhendersonUnsubmitted

Not Done

Same comment as before - use createStringError() or createFileError().


// TODO: The function name needs to be changed to express the purpose of the bool XCOFFSymbolRef::isCsectSymbol() const {

// function.

bool XCOFFSymbolRef::hasCsectAuxEnt() const {

XCOFF::StorageClass SC = getStorageClass(); XCOFF::StorageClass SC = getStorageClass();

return (SC == XCOFF::C_EXT || SC == XCOFF::C_WEAKEXT || return (SC == XCOFF::C_EXT || SC == XCOFF::C_WEAKEXT ||

SC == XCOFF::C_HIDEXT); SC == XCOFF::C_HIDEXT);

} }

bool XCOFFSymbolRef::isFunction() const { Expected<XCOFFCsectAuxRef> XCOFFSymbolRef::getXCOFFCsectAuxRef() const {

if (OwningObjectPtr->is64Bit()) assert(isCsectSymbol() &&

report_fatal_error("64-bit support is unimplemented yet."); "Calling csect symbol interface with a non-csect symbol.");

DiggerLinUnsubmitted

Not Done

Not all symbols have a csect entry.
What about returning
Optional<XCOFFCsectAuxRef> from XCOFFSymbolRef::getXCOFFCsectAuxRef()?


jasonliuAuthorUnsubmitted

Done

I returned Expected<XCOFFCsectAuxRef> partly because of the original comment on this function:
TODO: The function needs to return an error if there is no csect auxiliary
entry.

I believe that's a change from your previous commit. Is there any reason that you changed your mind?
In 32-bit mode, you could have a csect symbol without any auxiliary entry. That should return an error (which I haven't detected here, but I should).
Also, in 64-bit mode, it's possible to have a csect symbol that has auxiliary entries but no csect auxiliary entry. That should be an error situation, right? So it also makes sense to return an error in that case.
I think the interface would be too complicated if we returned an Expected wrapping an Optional, or the other way around.
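The two failure cases described in this comment can be sketched without LLVM. This is only an illustration under stated assumptions: `findCsectAux`, `ParseError` and `CsectAuxIndexOrErr` are hypothetical stand-ins (a `std::variant` plays the role of `Expected`), the 18-byte entry size and the `0xFB` value for AUX_CSECT are taken from the test output in this review, and placing the auxiliary type in the last byte of an entry is an assumption suggested by the raw dump ending in `00fb`.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <variant>
#include <vector>

// Hypothetical stand-ins; these are not LLVM types.
constexpr std::size_t EntrySize = 18;     // bytes per symbol table entry
constexpr std::size_t AuxTypeOffset = 17; // assumed: type is the last byte
constexpr std::uint8_t AuxCsect = 0xFB;   // AUX_CSECT

struct ParseError {
  std::string Message;
};

// Either the 1-based index of the csect auxiliary entry, or an error.
using CsectAuxIndexOrErr = std::variant<std::uint8_t, ParseError>;

// AuxBytes holds the symbol's consecutive 18-byte auxiliary entries.
CsectAuxIndexOrErr findCsectAux(bool Is64Bit, const std::string &Name,
                                const std::vector<std::uint8_t> &AuxBytes,
                                std::uint8_t NumAux) {
  // A csect symbol without any auxiliary entry is malformed in both modes.
  if (NumAux == 0)
    return ParseError{"csect symbol \"" + Name +
                      "\" contains no auxiliary entry"};

  // XCOFF32: the csect auxiliary entry is always the last one.
  if (!Is64Bit)
    return NumAux;

  // XCOFF64: walk from the last entry to the first, checking the type byte.
  for (std::uint8_t Index = NumAux; Index > 0; --Index) {
    std::size_t TypePos = (Index - 1) * EntrySize + AuxTypeOffset;
    if (TypePos < AuxBytes.size() && AuxBytes[TypePos] == AuxCsect)
      return Index;
  }
  return ParseError{"a csect auxiliary entry is not found for symbol \"" +
                    Name + "\""};
}
```

With a single error-or-value return, callers handle one failure channel; an Optional nested inside it would add a third state ("well-formed but absent") that neither case above needs.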


if (getType() & FunctionSym) uint8_t NumberOfAuxEntries = getNumberOfAuxEntries();

DiggerLinUnsubmitted

Not Done

I think assert(isCsectSymbol()) may be better.
Sometimes a developer may call getXCOFFCsectAuxRef() on a non-csect symbol; that is not an object file parse failure.


jasonliuAuthorUnsubmitted

Done

Sure.


return true;

if (!hasCsectAuxEnt()) Expected<StringRef> NameOrErr = getName();

return false; if (auto Err = NameOrErr.takeError())

return std::move(Err);

jhendersonUnsubmitted

Done

Is this error user-facing (I'm assuming so)? Assuming it is, you should record here which symbol is causing the problem. Otherwise the user will be faced with an error along the lines of this:

error: this csec symbol contains no auxiliary entry

which is not really actionable (imagine the input had 100000 symbols in - the user can't realistically go through each to find the offending one).


jhendersonUnsubmitted

Done

I believe this requires std::move(Error);, as you're returning an Expected, not an Error.


const XCOFFCsectAuxEnt32 *CsectAuxEnt = getXCOFFCsectAuxEnt32(); if (!NumberOfAuxEntries) {

return createStringError(object_error::parse_failed,

"csect symbol \"" + *NameOrErr +

"\" contains no auxiliary entry");

jhendersonUnsubmitted

Done

I think it would make more sense to insert the name in the middle of the message to make it a bit more concise. Something like:
"csect symbol name contains no auxiliary entry"


}

// A function definition should be a label definition. if (!OwningObjectPtr->is64Bit()) {

if ((CsectAuxEnt->SymbolAlignmentAndType & SymTypeMask) != XCOFF::XTY_LD) // In XCOFF32, the csect auxilliary entry is always the last auxiliary

return false; // entry for the symbol.

uintptr_t AuxAddr = XCOFFObjectFile::getAdvancedSymbolEntryAddress(

getEntryAddress(), NumberOfAuxEntries);

return XCOFFCsectAuxRef(viewAs<XCOFFCsectAuxEnt32>(AuxAddr));

}

DiggerLinUnsubmitted

Not Done

Would creating a file-scope static function getSymbolAuxType be better?
All 64-bit auxiliary symbols have the AuxType, so we could use the function for other types in other places later.


DiggerLinUnsubmitted

Not Done

There is already a function XCOFFCsectAuxRef::getAuxType64().


jasonliuAuthorUnsubmitted

Done

Yes, there is an XCOFFCsectAuxRef::getAuxType64(), but notice that this function is used to create an XCOFFCsectAuxRef object, so that function is not available in the creator.
I also don't think a static function is needed: once you have created an XCOFFCsectAuxRef object through this function, you can call XCOFFCsectAuxRef::getAuxType64() to get the type. So this lambda should only exist in this function.


if (CsectAuxEnt->StorageMappingClass != XCOFF::XMC_PR) // XCOFF64 uses SymbolAuxType to identify the auxiliary entry type.

return false; // We need to iterate through all the auxiliary entries to find it.

jhendersonUnsubmitted

Done

Similar to my above comment - which entry was not found? Give the user more context so that they can act on the problem.


for (uint8_t Index = NumberOfAuxEntries; Index > 0; --Index) {

uintptr_t AuxAddr = XCOFFObjectFile::getAdvancedSymbolEntryAddress(

getEntryAddress(), Index);

if (*OwningObjectPtr->getSymbolAuxType(AuxAddr) ==

XCOFF::SymbolAuxType::AUX_CSECT) {

#ifndef NDEBUG

OwningObjectPtr->checkSymbolEntryPointer(AuxAddr);

#endif

return XCOFFCsectAuxRef(viewAs<XCOFFCsectAuxEnt64>(AuxAddr));

}

int16_t SectNum = getSectionNumber(); return createStringError(

Expected<DataRefImpl> SI = OwningObjectPtr->getSectionByNum(SectNum); object_error::parse_failed,

if (!SI) "a csect auxiliary entry is not found for symbol \"" + *NameOrErr + "\"");

jhendersonUnsubmitted

Done

I'd suggest quoting somehow the symbol name, so that any whitespace or similar that happens to end up in the name (rare, but possible to write using assembly, at least for other platforms) is easily understood to be part of the name.


return false; }

return (OwningObjectPtr->getSectionFlags(SI.get()) & XCOFF::STYP_TEXT); Expected<StringRef> XCOFFSymbolRef::getName() const {

// A storage class value with the high-order bit on indicates that the name is

// a symbolic debugger stabstring.

if (getStorageClass() & 0x80)

return StringRef("Unimplemented Debug Name");

if (Entry32) {

if (Entry32->NameInStrTbl.Magic != XCOFFSymbolRef::NAME_IN_STR_TBL_MAGIC)

return generateXCOFFFixedNameStringRef(Entry32->SymbolName);

return OwningObjectPtr->getStringTableEntry(Entry32->NameInStrTbl.Offset);

}

return OwningObjectPtr->getStringTableEntry(Entry64->Offset);

} }
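The storage-class test at the top of getName() can be looked at in isolation. The sketch below is illustrative only: `isDebugStabstringName` is a hypothetical helper name, and the sample values are the storage classes printed in the llvm-readobj test output in this review (C_EXT 0x2, C_FILE 0x67, C_HIDEXT 0x6B).

```cpp
#include <cstdint>

// Hypothetical helper mirroring the check `getStorageClass() & 0x80`:
// a storage class with the high-order bit set marks a debugger stabstring name.
constexpr bool isDebugStabstringName(std::uint8_t StorageClass) {
  return (StorageClass & 0x80) != 0;
}

// None of the storage classes seen in the test output has the high bit set.
static_assert(!isDebugStabstringName(0x02), "C_EXT is not a debug name");
static_assert(!isDebugStabstringName(0x67), "C_FILE is not a debug name");
static_assert(!isDebugStabstringName(0x6B), "C_HIDEXT is not a debug name");
```

Any value from 0x80 upwards would take the "Unimplemented Debug Name" path instead of consulting the string table.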

// Explictly instantiate template classes. // Explictly instantiate template classes.

template struct XCOFFSectionHeader<XCOFFSectionHeader32>; template struct XCOFFSectionHeader<XCOFFSectionHeader32>;

template struct XCOFFSectionHeader<XCOFFSectionHeader64>; template struct XCOFFSectionHeader<XCOFFSectionHeader64>;

bool doesXCOFFTracebackTableBegin(ArrayRef<uint8_t> Bytes) { bool doesXCOFFTracebackTableBegin(ArrayRef<uint8_t> Bytes) {

if (Bytes.size() < 4) if (Bytes.size() < 4)


llvm/test/tools/llvm-objdump/XCOFF/Inputs/xcoff-section-headers64.o

This binary file was added.

llvm/test/tools/llvm-objdump/XCOFF/disassemble-symbol-description64.test

This file was added.

				# REQUIRES: powerpc-registered-target

				# RUN: llvm-objdump -D %p/Inputs/xcoff-section-headers64.o | \
				# RUN: FileCheck --check-prefixes=COMMON,PLAIN %s

				# RUN: llvm-objdump -D --symbol-description %p/Inputs/xcoff-section-headers64.o | \
				# RUN: FileCheck --check-prefixes=COMMON,DESC %s

				# RUN: not --crash llvm-objdump -D -r --symbol-description %p/Inputs/xcoff-section-headers64.o 2>&1 | \
				# RUN: FileCheck --check-prefix=ERROR %s
				# ERROR: 64-bit support not implemented yet

				## xcoff-section-headers64.o Compiled with IBM XL C/C++ for AIX, V16.1.0
				## compiler command: xlc -q64 -qtls -o xcoff-section-headers64.o -c test.c

jhendersonUnsubmitted

Not Done

I'm not going to stop you checking in a pre-compiled object, as I'm not an XCOFF maintainer, but as you are continuing to add more functionality here, I strongly advise you to write a yaml2obj XCOFF port, to avoid pre-canned binaries. You'll find pre-built binaries extremely inconvenient to work with as you maintain things going forward. Not only that, but they are harmful to the git repository size, especially if you have to occasionally rebuild them. Using yaml2obj may also be about the only way you can test most parse failure paths. If yaml2obj isn't viable, at least consider llvm-mc or similar, if possible.

jasonliuAuthorUnsubmitted

Done

I agree that we would want to move away from pre-canned binaries at some point. When writing a yaml2obj port, we would still require tools such as llvm-readobj and llvm-objdump to make sure our yaml2obj implementation is correct. So we still have a chicken-or-egg problem here. I think the current plan is to use pre-canned binaries to develop the tooling support first, then use the verified tooling support to verify XCOFF object file generation from llc. Then we could replace the pre-canned binaries with llvm-mc/llc.

jhendersonUnsubmitted

Not Done

Yeah, the chicken-or-egg problem is a bit of an issue. I'm not sure there's always a clear answer to this. The one I've encouraged for yaml2obj DWARF support testing is to actually inspect the hex output (with sufficient additional commenting to make it clear what the output represents). By keeping the initial functionality small enough, you can bootstrap up from there.

The issue is that a lot of our low-level tool testing (i.e. testing of things like llvm-readobj) has switched over to yaml2obj, but clearly we can't (in theory) then use llvm-readobj to test the basic output of yaml2obj or we end up with a circular test dependency: a bug in a common library might not obviously manifest itself in this context, but would if using a tool from outside the ecosystem.

Another strategy which I've used occasionally for testing DWARF parsing before the yaml2obj support existed was writing assembly using just .byte/.quad etc. directives to craft the input format precisely, without relying on the higher-level assembly directives (like .file/.loc etc.). This may not work in all situations though.

				## test.c:
				## int a;
				## int b = 12345;
				## __thread int c;
				## __thread double d = 3.14159;
				##
				## int func(void) {
				## return a;
				## }

				COMMON: Inputs/xcoff-section-headers64.o: file format aix5coff64-rs6000
				COMMON: Disassembly of section .text:
				COMMON-EMPTY:
				PLAIN: 0000000000000000 <.func>:
				DESC: 0000000000000000 (idx: 6) .func:
				COMMON-NEXT: 0: e8 62 00 08 ld 3, 8(2)
				COMMON-NEXT: 4: e8 63 00 02 lwa 3, 0(3)
				COMMON-NEXT: 8: 4e 80 00 20 blr
				COMMON-NEXT: c: 00 00 00 00 <unknown>
				COMMON-NEXT: 10: 00 00 20 40 <unknown>
				COMMON-NEXT: 14: 00 00 00 01 <unknown>
				COMMON-NEXT: 18: 00 00 00 0c <unknown>
				COMMON-NEXT: 1c: 00 04 66 75 <unknown>
				COMMON-NEXT: 20: 6e 63 00 00 xoris 3, 19, 0
				COMMON-NEXT: ...
				COMMON-EMPTY:
				COMMON-NEXT: Disassembly of section .data:
				COMMON-EMPTY:
				PLAIN: 0000000000000080 <func>:
				DESC: 0000000000000080 (idx: 12) func[TC]:
				COMMON-NEXT: 80: 00 00 00 00 <unknown>
				COMMON-NEXT: 84: 00 00 00 a8 <unknown>
				COMMON-EMPTY:
				PLAIN: 0000000000000088 <a>:
				DESC: 0000000000000088 (idx: 16) a[TC]:
				COMMON-NEXT: 88: 00 00 00 00 <unknown>
				COMMON-NEXT: 8c: 00 00 00 c8 <unknown>
				COMMON-EMPTY:
				PLAIN: 0000000000000090 <b>:
				DESC: 0000000000000090 (idx: 20) b[TC]:
				COMMON-NEXT: 90: 00 00 00 00 <unknown>
				COMMON-NEXT: 94: 00 00 00 c0 <unknown>
				COMMON-EMPTY:
				PLAIN: 0000000000000098 <c>:
				DESC: 0000000000000098 (idx: 24) c[TC]:
				COMMON-NEXT: 98: 00 00 00 00 <unknown>
				COMMON-NEXT: 9c: 00 00 00 08 <unknown>
				COMMON-EMPTY:
				PLAIN: 00000000000000a0 <d>:
				DESC: 00000000000000a0 (idx: 28) d[TC]:
				COMMON-NEXT: ...
				COMMON-EMPTY:
				PLAIN: 00000000000000a8 <func>:
				DESC: 00000000000000a8 (idx: 10) func[DS]:
				COMMON-NEXT: ...
				COMMON-NEXT: b4: 00 00 00 80 <unknown>
				COMMON-NEXT: ...
				COMMON-EMPTY:
				PLAIN: 00000000000000c0 <b>:
				DESC: 00000000000000c0 (idx: 18) b[RW]:
				COMMON-NEXT: c0: 00 00 30 39 <unknown>
				COMMON-NEXT: c4: 00 00 00 00 <unknown>
				COMMON-EMPTY:
				COMMON-NEXT: Disassembly of section .bss:
				COMMON-EMPTY:
				PLAIN: 00000000000000c8 <a>:
				DESC: 00000000000000c8 (idx: 14) a[RW]:
				COMMON-NEXT: ...
				COMMON-EMPTY:
				COMMON-NEXT: Disassembly of section .tdata:
				COMMON-EMPTY:
				PLAIN: 0000000000000000 <d>:
				DESC: 0000000000000000 (idx: 26) d[TL]:
				COMMON-NEXT: 0: 40 09 21 f9 bdnzfl 9, 0x21f8
				COMMON-NEXT: 4: f0 1b 86 6e <unknown>
				COMMON-EMPTY:
				COMMON-NEXT: Disassembly of section .tbss:
				COMMON-EMPTY:
				PLAIN: 0000000000000008 <c>:
				DESC: 0000000000000008 (idx: 22) c[UL]:
				COMMON-NEXT: ...

llvm/test/tools/llvm-readobj/XCOFF/Inputs/file-aux-wrong64.o

This binary file was added.

llvm/test/tools/llvm-readobj/XCOFF/Inputs/symbol64.o

This binary file was added.

llvm/test/tools/llvm-readobj/XCOFF/file-aux-wrong64.test

This file was added.

## This file tests the raw data output ability when a file auxiliary entry does

## not have the matching auxiliary type.

jhendersonUnsubmitted

Done

- ## This file tests the raw data output ability when file auxiliary

- ## entry do not have the matching auxiliary type.

+ ## This file tests the raw data output ability when a file auxiliary entry does

+ ## not have the matching auxiliary type.

# RUN: llvm-readobj --symbols %p/Inputs/file-aux-wrong64.o | \

Fix a couple of grammar issues and a premature line-wrap.


# RUN: llvm-readobj --symbols %p/Inputs/file-aux-wrong64.o | FileCheck %s

# CHECK: Symbols [

jhendersonUnsubmitted

Done

## entry do not have the matching auxiliary type.

- # RUN: llvm-readobj --symbols %p/Inputs/file-aux-wrong64.o | \

- # RUN: FileCheck %s

+ # RUN: llvm-readobj --symbols %p/Inputs/file-aux-wrong64.o | FileCheck %s

# CHECK: Symbols [


# CHECK-NEXT: Symbol {

# CHECK-NEXT: Index: 0

# CHECK-NEXT: Name: .file

# CHECK-NEXT: Value (SymbolTableIndex): 0x0

# CHECK-NEXT: Section: N_DEBUG

# CHECK-NEXT: Source Language ID: 0xC

# CHECK-NEXT: CPU Version ID: TCPU_PPC64 (0x2)

# CHECK-NEXT: StorageClass: C_FILE (0x67)

# CHECK-NEXT: NumberOfAuxEntries: 1

# CHECK-NEXT: !Unexpected raw auxiliary entry data:

# CHECK-NEXT: 612e7300 00000000 00000000 00000000 00fb

# CHECK-NEXT: }

# CHECK-NEXT: ]

jhendersonUnsubmitted

Not Done

The output on this line looks incorrect to me. Possibly a bug in the code resulting from a missing new line?


jasonliuAuthorUnsubmitted

Done

I added a newline after all the raw bytes. Other than that, I think the output is good. We have 18 bytes per symbol table entry, and we are printing 18 raw bytes here.


jhendersonUnsubmitted

Not Done

I don't know if it is, but you probably want 00fb indented to line up nicely with the previous line. You can then confirm that this indentation is maintained by enabling --match-full-lines and --strict-whitespace in FileCheck. (If you do that, you'll need to remove the space after CHECK-NEXT:)


jasonliuAuthorUnsubmitted

Done

Hi James, I adjusted the code to print 00fb on the same line. I think that actually works better because we could have the data of multiple auxiliary entries to print out, and putting each entry on a single line is easier to parse.
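The single-line layout settled on in this thread can be sketched as follows. This is a minimal illustration, not the llvm-readobj implementation: `formatRawBytes` is a hypothetical helper, and the grouping of four bytes per space-separated chunk is inferred from the CHECK line in the test above.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper: renders raw bytes as lowercase hex on a single line,
// inserting a space before every group of 4 bytes except the first.
std::string formatRawBytes(const std::vector<std::uint8_t> &Bytes) {
  static const char Digits[] = "0123456789abcdef";
  std::string Out;
  for (std::size_t I = 0; I < Bytes.size(); ++I) {
    if (I != 0 && I % 4 == 0)
      Out += ' ';
    Out += Digits[Bytes[I] >> 4];   // high nibble
    Out += Digits[Bytes[I] & 0x0F]; // low nibble
  }
  return Out;
}
```

For an 18-byte entry this yields four full groups followed by a final two-byte group, which matches the shape of the expected output `612e7300 00000000 00000000 00000000 00fb`.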


llvm/test/tools/llvm-readobj/XCOFF/symbols64.test

This file was added.

				## This file tests the ability of llvm-readobj to display the symbol table for a
				## 64-bit XCOFF object file.
				## The object file used is generated by the following source file
				## and command on AIX:
				##
				## > cat test8.c
				##
				## extern int i;
				## extern int TestforXcoff;
				## extern int fun(int i);
				## static int static_i;
				## char* p="abcd";
				## int fun1(int j) {
				## static_i++;
				## j++;
				## j=j+*p;
				## return j;
				## }
				##
				## int main() {
				## i++;
				## fun(i);
				## return fun1(i);
				## }
				##
				## > xlc -q64 -c test8.c -o symbol64.o

				# RUN: llvm-readobj --symbols %p/Inputs/symbol64.o | \
				# RUN: FileCheck --check-prefix=SYMBOL64 %s

				# SYMBOL64: File: {{.*}}symbol64.o
				# SYMBOL64-NEXT: Format: aix5coff64-rs6000
				# SYMBOL64-NEXT: Arch: powerpc64
				# SYMBOL64-NEXT: AddressSize: 64bit
				# SYMBOL64-NEXT: Symbols [
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 0
				# SYMBOL64-NEXT: Name: .file
				# SYMBOL64-NEXT: Value (SymbolTableIndex): 0x0
				# SYMBOL64-NEXT: Section: N_DEBUG
				# SYMBOL64-NEXT: Source Language ID: TB_C (0x0)
				# SYMBOL64-NEXT: CPU Version ID: TCPU_PPC64 (0x2)
				# SYMBOL64-NEXT: StorageClass: C_FILE (0x67)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 3
				# SYMBOL64-NEXT: File Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 1
				# SYMBOL64-NEXT: Name: test64.c
				# SYMBOL64-NEXT: Type: XFT_FN (0x0)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_FILE (0xFC)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: File Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 2
				# SYMBOL64-NEXT: Name: Mon Aug 10 16:07:48 2020
				# SYMBOL64-NEXT: Type: XFT_CT (0x1)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_FILE (0xFC)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: File Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 3
				# SYMBOL64-NEXT: Name: IBM XL C for AIX, Version 16.1.0.6
				# SYMBOL64-NEXT: Type: XFT_CV (0x2)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_FILE (0xFC)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 4
				# SYMBOL64-NEXT: Name:
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x0
				# SYMBOL64-NEXT: Section: .text
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 5
				# SYMBOL64-NEXT: SectionLen: 256
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 7
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_PR (0x0)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 6
				# SYMBOL64-NEXT: Name: .fun1
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x0
				# SYMBOL64-NEXT: Section: .text
				# SYMBOL64-NEXT: Type: 0x20
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 7
				# SYMBOL64-NEXT: ContainingCsectSymbolIndex: 4
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 0
				# SYMBOL64-NEXT: SymbolType: XTY_LD (0x2)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_PR (0x0)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 8
				# SYMBOL64-NEXT: Name: .main
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x80
				# SYMBOL64-NEXT: Section: .text
				# SYMBOL64-NEXT: Type: 0x20
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 9
				# SYMBOL64-NEXT: ContainingCsectSymbolIndex: 4
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 0
				# SYMBOL64-NEXT: SymbolType: XTY_LD (0x2)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_PR (0x0)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 10
				# SYMBOL64-NEXT: Name: TOC
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x100
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 11
				# SYMBOL64-NEXT: SectionLen: 0
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 2
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC0 (0xF)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 12
				# SYMBOL64-NEXT: Name:
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x128
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 13
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 14
				# SYMBOL64-NEXT: Name:
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x168
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 15
				# SYMBOL64-NEXT: SectionLen: 5
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_RO (0x1)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 16
				# SYMBOL64-NEXT: Name: _$STATIC_BSS
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x170
				# SYMBOL64-NEXT: Section: .bss
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 17
				# SYMBOL64-NEXT: SectionLen: 4
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 2
				# SYMBOL64-NEXT: SymbolType: XTY_CM (0x3)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_RW (0x5)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 18
				# SYMBOL64-NEXT: Name: _$STATIC_BSS
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x108
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 19
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 20
				# SYMBOL64-NEXT: Name: fun1
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x130
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 21
				# SYMBOL64-NEXT: SectionLen: 24
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_DS (0xA)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 22
				# SYMBOL64-NEXT: Name: fun1
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x100
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 23
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 24
				# SYMBOL64-NEXT: Name: p
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x160
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 25
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_RW (0x5)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 26
				# SYMBOL64-NEXT: Name: p
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x110
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 27
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 28
				# SYMBOL64-NEXT: Name: main
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x148
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 29
				# SYMBOL64-NEXT: SectionLen: 24
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_DS (0xA)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 30
				# SYMBOL64-NEXT: Name: main
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x118
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 31
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 32
				# SYMBOL64-NEXT: Name: i
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x0
				# SYMBOL64-NEXT: Section: N_UNDEF
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 33
				# SYMBOL64-NEXT: SectionLen: 0
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 0
				# SYMBOL64-NEXT: SymbolType: XTY_ER (0x0)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_UA (0x4)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 34
				# SYMBOL64-NEXT: Name: i
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x120
				# SYMBOL64-NEXT: Section: .data
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_HIDEXT (0x6B)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 35
				# SYMBOL64-NEXT: SectionLen: 8
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 3
				# SYMBOL64-NEXT: SymbolType: XTY_SD (0x1)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_TC (0x3)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: Symbol {
				# SYMBOL64-NEXT: Index: 36
				# SYMBOL64-NEXT: Name: .fun
				# SYMBOL64-NEXT: Value (RelocatableAddress): 0x0
				# SYMBOL64-NEXT: Section: N_UNDEF
				# SYMBOL64-NEXT: Type: 0x0
				# SYMBOL64-NEXT: StorageClass: C_EXT (0x2)
				# SYMBOL64-NEXT: NumberOfAuxEntries: 1
				# SYMBOL64-NEXT: CSECT Auxiliary Entry {
				# SYMBOL64-NEXT: Index: 37
				# SYMBOL64-NEXT: SectionLen: 0
				# SYMBOL64-NEXT: ParameterHashIndex: 0x0
				# SYMBOL64-NEXT: TypeChkSectNum: 0x0
				# SYMBOL64-NEXT: SymbolAlignmentLog2: 0
				# SYMBOL64-NEXT: SymbolType: XTY_ER (0x0)
				# SYMBOL64-NEXT: StorageMappingClass: XMC_PR (0x0)
				# SYMBOL64-NEXT: Auxiliary Type: AUX_CSECT (0xFB)
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: }
				# SYMBOL64-NEXT: ]

llvm/tools/llvm-objdump/XCOFFDump.cpp

Show All 40 Lines	Error objdump::getXCOFFRelocationValueString(const XCOFFObjectFile *Obj,

Result.append(SymName.begin(), SymName.end());		Result.append(SymName.begin(), SymName.end());
return Error::success();		return Error::success();
}		}

Optional<XCOFF::StorageMappingClass>		Optional<XCOFF::StorageMappingClass>
objdump::getXCOFFSymbolCsectSMC(const XCOFFObjectFile *Obj,		objdump::getXCOFFSymbolCsectSMC(const XCOFFObjectFile *Obj,
const SymbolRef &Sym) {		const SymbolRef &Sym) {
XCOFFSymbolRef SymRef(Sym.getRawDataRefImpl(), Obj);		const XCOFFSymbolRef SymRef = Obj->toSymbolRef(Sym.getRawDataRefImpl());
		DiggerLin (Not Done): I cannot see the benefit of changing `XCOFFSymbolRef SymRef(Sym.getRawDataRefImpl(), Obj);` to `XCOFFSymbolRef SymRef = Obj->toSymbolRef(Sym.getRawDataRefImpl());`.
		jasonliu (Author, Done): I did it for consistency, i.e. always get an XCOFFSymbolRef via toSymbolRef.

if (SymRef.hasCsectAuxEnt())		if (!SymRef.isCsectSymbol())
return SymRef.getXCOFFCsectAuxEnt32()->StorageMappingClass;		return None;

		auto CsectAuxEntOrErr = SymRef.getXCOFFCsectAuxRef();
		if (!CsectAuxEntOrErr)
return None;		return None;

		return CsectAuxEntOrErr.get().getStorageMappingClass();
}		}

bool objdump::isLabel(const XCOFFObjectFile *Obj, const SymbolRef &Sym) {		bool objdump::isLabel(const XCOFFObjectFile *Obj, const SymbolRef &Sym) {

XCOFFSymbolRef SymRef(Sym.getRawDataRefImpl(), Obj);		const XCOFFSymbolRef SymRef = Obj->toSymbolRef(Sym.getRawDataRefImpl());

if (SymRef.hasCsectAuxEnt())		if (!SymRef.isCsectSymbol())
return SymRef.getXCOFFCsectAuxEnt32()->isLabel();		return false;

		auto CsectAuxEntOrErr = SymRef.getXCOFFCsectAuxRef();
		if (!CsectAuxEntOrErr)
return false;		return false;

		return CsectAuxEntOrErr.get().isLabel();
}		}

std::string objdump::getXCOFFSymbolDescription(const SymbolInfoTy &SymbolInfo,		std::string objdump::getXCOFFSymbolDescription(const SymbolInfoTy &SymbolInfo,
StringRef SymbolName) {		StringRef SymbolName) {
assert(SymbolInfo.isXCOFF() && "Must be a XCOFFSymInfo.");		assert(SymbolInfo.isXCOFF() && "Must be a XCOFFSymInfo.");

std::string Result;		std::string Result;
// Dummy symbols have no symbol index.		// Dummy symbols have no symbol index.
Show All 16 Lines

llvm/tools/llvm-readobj/XCOFFDumper.cpp

Show All 34 Lines	public:
void printStackMap() const override;		void printStackMap() const override;
void printNeededLibraries() override;		void printNeededLibraries() override;

private:		private:
template <typename T> void printSectionHeaders(ArrayRef<T> Sections);		template <typename T> void printSectionHeaders(ArrayRef<T> Sections);
template <typename T> void printGenericSectionHeader(T &Sec) const;		template <typename T> void printGenericSectionHeader(T &Sec) const;
template <typename T> void printOverflowSectionHeader(T &Sec) const;		template <typename T> void printOverflowSectionHeader(T &Sec) const;
void printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr);		void printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr);
void printCsectAuxEnt32(const XCOFFCsectAuxEnt32 *AuxEntPtr);		void printCsectAuxEnt(XCOFFCsectAuxRef AuxEntRef);
void printSectAuxEntForStat(const XCOFFSectAuxEntForStat *AuxEntPtr);		void printSectAuxEntForStat(const XCOFFSectAuxEntForStat *AuxEntPtr);
void printSymbol(const SymbolRef &);		void printSymbol(const SymbolRef &);
void printRelocations(ArrayRef<XCOFFSectionHeader32> Sections);		void printRelocations(ArrayRef<XCOFFSectionHeader32> Sections);
const XCOFFObjectFile &Obj;		const XCOFFObjectFile &Obj;
};		};
} // anonymous namespace		} // anonymous namespace

void XCOFFDumper::printFileHeaders() {		void XCOFFDumper::printFileHeaders() {
▲ Show 20 Lines • Show All 107 Lines • ▼ Show 20 Lines

static const EnumEntry<XCOFF::CFileStringType> FileStringType[] = {		static const EnumEntry<XCOFF::CFileStringType> FileStringType[] = {
#define ECase(X) \		#define ECase(X) \
{ #X, XCOFF::X }		{ #X, XCOFF::X }
ECase(XFT_FN), ECase(XFT_CT), ECase(XFT_CV), ECase(XFT_CD)		ECase(XFT_FN), ECase(XFT_CT), ECase(XFT_CV), ECase(XFT_CD)
#undef ECase		#undef ECase
};		};

		static const EnumEntry<XCOFF::SymbolAuxType> SymAuxType[] = {
		#define ECase(X) \
		{ #X, XCOFF::X }
		ECase(AUX_EXCEPT), ECase(AUX_FCN), ECase(AUX_SYM), ECase(AUX_FILE),
		ECase(AUX_CSECT), ECase(AUX_SECT)
		#undef ECase
		};

void XCOFFDumper::printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr) {		void XCOFFDumper::printFileAuxEnt(const XCOFFFileAuxEnt *AuxEntPtr) {
if (Obj.is64Bit())		assert((!Obj.is64Bit() || AuxEntPtr->AuxType == XCOFF::AUX_FILE) &&
report_fatal_error(		"Mismatched auxiliary type!");
"Printing for File Auxiliary Entry in 64-bit is unimplemented.");
StringRef FileName =		StringRef FileName =
unwrapOrError(Obj.getFileName(), Obj.getCFileName(AuxEntPtr));		unwrapOrError(Obj.getFileName(), Obj.getCFileName(AuxEntPtr));
DictScope SymDs(W, "File Auxiliary Entry");		DictScope SymDs(W, "File Auxiliary Entry");
W.printNumber("Index",		W.printNumber("Index",
Obj.getSymbolIndex(reinterpret_cast<uintptr_t>(AuxEntPtr)));		Obj.getSymbolIndex(reinterpret_cast<uintptr_t>(AuxEntPtr)));
W.printString("Name", FileName);		W.printString("Name", FileName);
W.printEnum("Type", static_cast<uint8_t>(AuxEntPtr->Type),		W.printEnum("Type", static_cast<uint8_t>(AuxEntPtr->Type),
makeArrayRef(FileStringType));		makeArrayRef(FileStringType));
		if (Obj.is64Bit()) {
		W.printEnum("Auxiliary Type", static_cast<uint8_t>(AuxEntPtr->AuxType),
		DiggerLin (Not Done): If AuxEntPtr->AuxType != XCOFF::AUX_FILE, the entry should not be parsed as XCOFF::AUX_FILE. It may be better to print out the raw data in printSymbol().
		jasonliu (Author, Done): I think it's the caller's responsibility to make sure they are passing in the right auxiliary type, so we should assume that when we enter this function we have the right auxiliary type. I modified it and made it an assert instead.
		makeArrayRef(SymAuxType));
		}
}		}

static const EnumEntry<XCOFF::StorageMappingClass> CsectStorageMappingClass[] =		static const EnumEntry<XCOFF::StorageMappingClass> CsectStorageMappingClass[] =
{		{
#define ECase(X) \		#define ECase(X) \
{ #X, XCOFF::X }		{ #X, XCOFF::X }
ECase(XMC_PR), ECase(XMC_RO), ECase(XMC_DB), ECase(XMC_GL),		ECase(XMC_PR), ECase(XMC_RO), ECase(XMC_DB), ECase(XMC_GL),
ECase(XMC_XO), ECase(XMC_SV), ECase(XMC_SV64), ECase(XMC_SV3264),		ECase(XMC_XO), ECase(XMC_SV), ECase(XMC_SV64), ECase(XMC_SV3264),
ECase(XMC_TI), ECase(XMC_TB), ECase(XMC_RW), ECase(XMC_TC0),		ECase(XMC_TI), ECase(XMC_TB), ECase(XMC_RW), ECase(XMC_TC0),
ECase(XMC_TC), ECase(XMC_TD), ECase(XMC_DS), ECase(XMC_UA),		ECase(XMC_TC), ECase(XMC_TD), ECase(XMC_DS), ECase(XMC_UA),
ECase(XMC_BS), ECase(XMC_UC), ECase(XMC_TL), ECase(XMC_UL),		ECase(XMC_BS), ECase(XMC_UC), ECase(XMC_TL), ECase(XMC_UL),
ECase(XMC_TE)		ECase(XMC_TE)
#undef ECase		#undef ECase
};		};

static const EnumEntry<XCOFF::SymbolType> CsectSymbolTypeClass[] = {		static const EnumEntry<XCOFF::SymbolType> CsectSymbolTypeClass[] = {
#define ECase(X) \		#define ECase(X) \
{ #X, XCOFF::X }		{ #X, XCOFF::X }
ECase(XTY_ER), ECase(XTY_SD), ECase(XTY_LD), ECase(XTY_CM)		ECase(XTY_ER), ECase(XTY_SD), ECase(XTY_LD), ECase(XTY_CM)
#undef ECase		#undef ECase
};		};

void XCOFFDumper::printCsectAuxEnt32(const XCOFFCsectAuxEnt32 *AuxEntPtr) {		void XCOFFDumper::printCsectAuxEnt(XCOFFCsectAuxRef AuxEntRef) {
assert(!Obj.is64Bit() && "32-bit interface called on 64-bit object file.");		assert((!Obj.is64Bit() || AuxEntRef.getAuxType64() == XCOFF::AUX_CSECT) &&
		"Mismatched auxiliary type!");

DictScope SymDs(W, "CSECT Auxiliary Entry");		DictScope SymDs(W, "CSECT Auxiliary Entry");
W.printNumber("Index",		W.printNumber("Index", Obj.getSymbolIndex(AuxEntRef.getEntryAddress()));
Obj.getSymbolIndex(reinterpret_cast<uintptr_t>(AuxEntPtr)));		W.printNumber(AuxEntRef.isLabel() ? "ContainingCsectSymbolIndex"
if (AuxEntPtr->isLabel())		: "SectionLen",
W.printNumber("ContainingCsectSymbolIndex", AuxEntPtr->SectionOrLength);		AuxEntRef.getSectionOrLength());
else		W.printHex("ParameterHashIndex", AuxEntRef.getParameterHashIndex());
W.printNumber("SectionLen", AuxEntPtr->SectionOrLength);		W.printHex("TypeChkSectNum", AuxEntRef.getTypeChkSectNum());
W.printHex("ParameterHashIndex", AuxEntPtr->ParameterHashIndex);
W.printHex("TypeChkSectNum", AuxEntPtr->TypeChkSectNum);
// Print out symbol alignment and type.		// Print out symbol alignment and type.
W.printNumber("SymbolAlignmentLog2", AuxEntPtr->getAlignmentLog2());		W.printNumber("SymbolAlignmentLog2", AuxEntRef.getAlignmentLog2());
W.printEnum("SymbolType", AuxEntPtr->getSymbolType(),		W.printEnum("SymbolType", AuxEntRef.getSymbolType(),
makeArrayRef(CsectSymbolTypeClass));		makeArrayRef(CsectSymbolTypeClass));
W.printEnum("StorageMappingClass",		W.printEnum("StorageMappingClass",
static_cast<uint8_t>(AuxEntPtr->StorageMappingClass),		static_cast<uint8_t>(AuxEntRef.getStorageMappingClass()),
makeArrayRef(CsectStorageMappingClass));		makeArrayRef(CsectStorageMappingClass));
W.printHex("StabInfoIndex", AuxEntPtr->StabInfoIndex);
W.printHex("StabSectNum", AuxEntPtr->StabSectNum);		if (Obj.is64Bit()) {
		W.printEnum("Auxiliary Type", static_cast<uint8_t>(XCOFF::AUX_CSECT),
		makeArrayRef(SymAuxType));
		} else {
		W.printHex("StabInfoIndex", AuxEntRef.getStabInfoIndex32());
		W.printHex("StabSectNum", AuxEntRef.getStabSectNum32());
		DiggerLin (Done): If AuxEntPtr->AuxType != XCOFF::AUX_CSECT, the entry should not be parsed as XCOFF::AUX_CSECT above. It may be better to print out the raw data in printSymbol().
		jasonliu (Author, Done): Same as above. I modified it to be an assertion instead.
		}
}		}

void XCOFFDumper::printSectAuxEntForStat(		void XCOFFDumper::printSectAuxEntForStat(
const XCOFFSectAuxEntForStat *AuxEntPtr) {		const XCOFFSectAuxEntForStat *AuxEntPtr) {
assert(!Obj.is64Bit() && "32-bit interface called on 64-bit object file.");		assert(!Obj.is64Bit() && "32-bit interface called on 64-bit object file.");

DictScope SymDs(W, "Sect Auxiliary Entry For Stat");		DictScope SymDs(W, "Sect Auxiliary Entry For Stat");
W.printNumber("Index",		W.printNumber("Index",
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
static const EnumEntry<XCOFF::CFileCpuId> CFileCpuIdClass[] = {		static const EnumEntry<XCOFF::CFileCpuId> CFileCpuIdClass[] = {
#define ECase(X) \		#define ECase(X) \
{ #X, XCOFF::X }		{ #X, XCOFF::X }
ECase(TCPU_PPC64), ECase(TCPU_COM), ECase(TCPU_970)		ECase(TCPU_PPC64), ECase(TCPU_COM), ECase(TCPU_970)
#undef ECase		#undef ECase
};		};

void XCOFFDumper::printSymbol(const SymbolRef &S) {		void XCOFFDumper::printSymbol(const SymbolRef &S) {
if (Obj.is64Bit())
report_fatal_error("64-bit support is unimplemented.");

DataRefImpl SymbolDRI = S.getRawDataRefImpl();		DataRefImpl SymbolDRI = S.getRawDataRefImpl();
const XCOFFSymbolEntry *SymbolEntPtr = Obj.toSymbolEntry(SymbolDRI);		XCOFFSymbolRef SymbolEntRef = Obj.toSymbolRef(SymbolDRI);

XCOFFSymbolRef XCOFFSymRef(SymbolDRI, &Obj);		uint8_t NumberOfAuxEntries = SymbolEntRef.getNumberOfAuxEntries();
uint8_t NumberOfAuxEntries = XCOFFSymRef.getNumberOfAuxEntries();

DictScope SymDs(W, "Symbol");		DictScope SymDs(W, "Symbol");

StringRef SymbolName =		StringRef SymbolName =
unwrapOrError(Obj.getFileName(), Obj.getSymbolName(SymbolDRI));		unwrapOrError(Obj.getFileName(), SymbolEntRef.getName());

W.printNumber("Index",		W.printNumber("Index", Obj.getSymbolIndex(SymbolEntRef.getEntryAddress()));
Obj.getSymbolIndex(reinterpret_cast<uintptr_t>(SymbolEntPtr)));
W.printString("Name", SymbolName);		W.printString("Name", SymbolName);
W.printHex(GetSymbolValueName(SymbolEntPtr->StorageClass),		W.printHex(GetSymbolValueName(SymbolEntRef.getStorageClass()),
SymbolEntPtr->Value);		SymbolEntRef.getValue());
		DiggerLin (Not Done): Can we add a new member function getValue() to XCOFFSymbolRef? Then we will not see any Obj.is64Bit() here, just XCOFFSymRef->getValue().
		jasonliu (Author, Done): Same reason why we don't want to combine getSymbolTableOffset(): they are returning different values. Callers have to be aware of that.

StringRef SectionName =		StringRef SectionName =
unwrapOrError(Obj.getFileName(), Obj.getSymbolSectionName(SymbolEntPtr));		unwrapOrError(Obj.getFileName(), Obj.getSymbolSectionName(SymbolEntRef));

W.printString("Section", SectionName);		W.printString("Section", SectionName);
if (XCOFFSymRef.getStorageClass() == XCOFF::C_FILE) {		if (SymbolEntRef.getStorageClass() == XCOFF::C_FILE) {
W.printEnum("Source Language ID",		W.printEnum("Source Language ID", SymbolEntRef.getLanguageIdForCFile(),
SymbolEntPtr->CFileLanguageIdAndTypeId.LanguageId,
makeArrayRef(CFileLangIdClass));		makeArrayRef(CFileLangIdClass));
W.printEnum("CPU Version ID",		W.printEnum("CPU Version ID", SymbolEntRef.getCPUTypeIddForCFile(),
SymbolEntPtr->CFileLanguageIdAndTypeId.CpuTypeId,
makeArrayRef(CFileCpuIdClass));		makeArrayRef(CFileCpuIdClass));
} else		} else
W.printHex("Type", SymbolEntPtr->SymbolType);		W.printHex("Type", SymbolEntRef.getSymbolType());

W.printEnum("StorageClass", static_cast<uint8_t>(SymbolEntPtr->StorageClass),		W.printEnum("StorageClass",
		static_cast<uint8_t>(SymbolEntRef.getStorageClass()),
makeArrayRef(SymStorageClass));		makeArrayRef(SymStorageClass));
W.printNumber("NumberOfAuxEntries", SymbolEntPtr->NumberOfAuxEntries);		W.printNumber("NumberOfAuxEntries", NumberOfAuxEntries);

if (NumberOfAuxEntries == 0)		if (NumberOfAuxEntries == 0)
return;		return;

switch (XCOFFSymRef.getStorageClass()) {		switch (SymbolEntRef.getStorageClass()) {
case XCOFF::C_FILE:		case XCOFF::C_FILE:
// If the symbol is C_FILE and has auxiliary entries...		// If the symbol is C_FILE and has auxiliary entries...
for (int i = 1; i <= NumberOfAuxEntries; i++) {		for (int I = 1; I <= NumberOfAuxEntries; I++) {
		uintptr_t AuxAddress = XCOFFObjectFile::getAdvancedSymbolEntryAddress(
		SymbolEntRef.getEntryAddress(), I);

		if (Obj.is64Bit() &&
		*Obj.getSymbolAuxType(AuxAddress) != XCOFF::SymbolAuxType::AUX_FILE) {
		W.startLine() << "!Unexpected raw auxiliary entry data:\n";
		jhenderson (Not Done): Test case?
		W.startLine() << format_bytes(
		ArrayRef<uint8_t>(
		reinterpret_cast<const uint8_t *>(AuxAddress),
		DiggerLin (Not Done): It needs a `continue;` here.
		XCOFF::SymbolTableEntrySize),
		0, XCOFF::SymbolTableEntrySize)
		<< "\n";
		continue;
		}

		jhenderson (Not Done): `i` -> `I` (you're changing most of the loop body - you might as well fix this whilst you're here)
const XCOFFFileAuxEnt *FileAuxEntPtr =		const XCOFFFileAuxEnt *FileAuxEntPtr =
reinterpret_cast<const XCOFFFileAuxEnt *>(SymbolEntPtr + i);		reinterpret_cast<const XCOFFFileAuxEnt *>(AuxAddress);
#ifndef NDEBUG		#ifndef NDEBUG
Obj.checkSymbolEntryPointer(reinterpret_cast<uintptr_t>(FileAuxEntPtr));		Obj.checkSymbolEntryPointer(reinterpret_cast<uintptr_t>(FileAuxEntPtr));
#endif		#endif
printFileAuxEnt(FileAuxEntPtr);		printFileAuxEnt(FileAuxEntPtr);
		DiggerLin (Done): As you mention, "I think it's the caller's responsibility to make sure they are passing in the right auxiliary type." I think we need to check AuxType == XCOFF::AUX_FILE for 64-bit. If it does not match, print out the raw data as was done for AUX_CSECT?
}		}
break;		break;
case XCOFF::C_EXT:		case XCOFF::C_EXT:
case XCOFF::C_WEAKEXT:		case XCOFF::C_WEAKEXT:
case XCOFF::C_HIDEXT:		case XCOFF::C_HIDEXT: {
// If the symbol is for a function, and it has more than 1 auxiliary entry,		// If the symbol is for a function, and it has more than 1 auxiliary entry,
// then one of them must be function auxiliary entry which we do not		// then one of them must be function auxiliary entry which we do not
// support yet.		// support yet.
if (XCOFFSymRef.isFunction() && NumberOfAuxEntries >= 2)		if (SymbolEntRef.isFunction() && NumberOfAuxEntries >= 2)
report_fatal_error("Function auxiliary entry printing is unimplemented.");		report_fatal_error("Function auxiliary entry printing is unimplemented.");

// If there is more than 1 auxiliary entry, instead of printing out		// If there is more than 1 auxiliary entry, instead of printing out
// error information, print out the raw Auxiliary entry from 1st till		// error information, print out the raw Auxiliary entry.
		DiggerLin (Done): This is only for 32-bit: "By convention, the csect auxiliary entry in an XCOFF32 file must be the last auxiliary entry for any external symbol that has more than one auxiliary entry." For 64-bit, it may need to look for x_auxtype == AUX_CSECT.
		jasonliu (Author, Done): Good point. Updated the code.
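The distinction raised in this thread (position-based lookup for XCOFF32, type-based lookup for XCOFF64) can be sketched in isolation. This is a minimal standalone illustration, not the code from this patch: `findCsectAuxIndex` and the `AuxTypes` vector are hypothetical, and the only value taken from the review is AUX_CSECT = 0xFB as shown in the test output.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// AUX_CSECT value as printed in the SYMBOL64 test output (0xFB).
constexpr uint8_t AuxCsect = 0xFB;

// Hypothetical helper: returns the 1-based index of the csect auxiliary
// entry among a symbol's auxiliary entries, or 0 if none is found.
// AuxTypes holds one type byte per auxiliary entry and is only consulted
// for 64-bit objects; for 32-bit objects the entry is found by position.
std::size_t findCsectAuxIndex(bool Is64Bit,
                              const std::vector<uint8_t> &AuxTypes) {
  if (AuxTypes.empty())
    return 0;
  // XCOFF32 convention quoted above: the csect entry is the last one.
  if (!Is64Bit)
    return AuxTypes.size();
  // XCOFF64: search for an entry whose type byte is AUX_CSECT.
  for (std::size_t I = 0; I < AuxTypes.size(); ++I)
    if (AuxTypes[I] == AuxCsect)
      return I + 1;
  return 0;
}

// Small usage check; 0x00 is an arbitrary non-csect placeholder value.
void checkFindCsectAuxIndex() {
  assert(findCsectAuxIndex(false, {0x00, 0x00, 0x00}) == 3u);
  assert(findCsectAuxIndex(true, {0x00, 0xFB, 0x00}) == 2u);
  assert(findCsectAuxIndex(true, {0x00, 0x00}) == 0u);
}
```

Returning 0 for "not found" keeps the sketch free of optional types; real code would more likely report the missing entry as an error.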
// the last - 1. The last one must be a CSECT Auxiliary Entry.		// For 32-bit object, print from first to the last - 1. The last one must be
for (int i = 1; i < NumberOfAuxEntries; i++) {		// a CSECT Auxiliary Entry.
		// For 64-bit object, print from first to last and skips if SymbolAuxType is
		jhenderson (Not Done): My English language ping went off at the word "till" in these two sentences. I'd probably change it to "to". Also, I suggest using "first" rather than "1st" in both places, and "skips" -> "skip" for grammatical consistency.
		// AUX_CSECT.
		for (int I = 1; I <= NumberOfAuxEntries; I++) {
		jhenderson (Not Done): `i` -> `I`
		if (I == NumberOfAuxEntries && !Obj.is64Bit())
		break;

		uintptr_t AuxAddress = XCOFFObjectFile::getAdvancedSymbolEntryAddress(
		SymbolEntRef.getEntryAddress(), I);
		if (Obj.is64Bit() &&
		*Obj.getSymbolAuxType(AuxAddress) == XCOFF::SymbolAuxType::AUX_CSECT)
		continue;

W.startLine() << "!Unexpected raw auxiliary entry data:\n";		W.startLine() << "!Unexpected raw auxiliary entry data:\n";
W.startLine() << format_bytes(		W.startLine() << format_bytes(
ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(SymbolEntPtr + i),		ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(AuxAddress),
XCOFF::SymbolTableEntrySize));		XCOFF::SymbolTableEntrySize));
}		}

// The symbol's last auxiliary entry is a CSECT Auxiliary Entry.		auto ErrOrCsectAuxRef = SymbolEntRef.getXCOFFCsectAuxRef();
printCsectAuxEnt32(XCOFFSymRef.getXCOFFCsectAuxEnt32());		if (!ErrOrCsectAuxRef)
		reportUniqueWarning(ErrOrCsectAuxRef.takeError());
		else
		jhenderson (Not Done): Use `reportError` or `reportWarning` so that the error is reported in a clean manner and consistent with other llvm-readobj varieties, and not `report_fatal_error`, which looks like a crash. General rule of thumb: try to avoid using `report_fatal_error`, especially in tool code where it is easy to report errors properly. In llvm-readobj for ELF, we try to avoid even using `reportError` where possible, as that stops the tool from continuing dumping, which can be problematic occasionally. We prefer `reportWarning` (or more specifically the local `reportUniqueWarning`, which avoids reporting the same warning multiple times) and bailing out of the current routine. Take a look at ELFDumper.cpp for examples.
		jasonliu (Author, Done): Thanks for the elaboration. That clears up things a lot for me. I will use reportUniqueWarning here.
		jhenderson (Done): `reportUniqueWarning` can take an `Error` directly, so you can just do: `if (!ErrOrCsectAuxRef) reportUniqueWarning(ErrOrCsectAuxRef.takeError());` Also, be careful, as the program continues, so referencing `ErrOrCsectAuxRef` after this may result in things going wrong...
		printCsectAuxEnt(*ErrOrCsectAuxRef);
		jhenderson (Done): This will write the error inline, rather than to stderr. Are you sure that's what you want? It isn't what most dumping tools do on failure.
		DiggerLin (Not Done): We have iterated over the auxiliary entries in lines 382~396 and iterate again in printCsectAuxEnt(). I think we can improve on it. Also, in 64-bit, the auxiliary entries may be reordered by the above implementation: it will print out all non-AUX_CSECT auxiliary entries first and then the AUX_CSECT entry, even if the AUX_CSECT entry is in the middle of the auxiliary entries; and if there are two AUX_CSECT auxiliary entries, we only print out one.
		jasonliu (Author, Done): I don't think we iterate again in printCsectAuxEnt() for other auxiliary entries. We did something similar to 382~396 in `getXCOFFCsectAuxRef`, but that function is specifically designed to only get the XCOFFCsectAuxRef, so it's really not intended to pass information out of there. I don't really think the re-iteration should be a big concern, because in theory 382~396 won't be executed all that much. If it is, I would rather have it implemented properly (i.e. actually recognize the missing auxiliary entry) instead of just printing out raw data.
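To make the ordering concern in this thread concrete, here is a small standalone sketch that only models the order of output, not the patch itself. Both functions and their string labels are hypothetical. The two-phase model assumes the csect lookup returns the first AUX_CSECT entry it finds, which the review does not state.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// AUX_CSECT value as printed in the SYMBOL64 test output (0xFB).
constexpr uint8_t AuxCsect = 0xFB;

// Models the two-phase order described in the review for 64-bit objects:
// raw dumps of every non-csect entry first, then a single csect entry.
// Assumption: the csect lookup picks the first AUX_CSECT entry.
std::vector<std::string> twoPhaseOrder(const std::vector<uint8_t> &AuxTypes) {
  std::vector<std::string> Out;
  for (std::size_t I = 0; I < AuxTypes.size(); ++I)
    if (AuxTypes[I] != AuxCsect)
      Out.push_back("raw#" + std::to_string(I + 1));
  for (std::size_t I = 0; I < AuxTypes.size(); ++I)
    if (AuxTypes[I] == AuxCsect) {
      Out.push_back("csect#" + std::to_string(I + 1));
      break;
    }
  return Out;
}

// One possible alternative: a single pass that keeps symbol-table order
// and prints every csect entry it meets.
std::vector<std::string> singlePassOrder(const std::vector<uint8_t> &AuxTypes) {
  std::vector<std::string> Out;
  for (std::size_t I = 0; I < AuxTypes.size(); ++I)
    Out.push_back((AuxTypes[I] == AuxCsect ? "csect#" : "raw#") +
                  std::to_string(I + 1));
  return Out;
}

// Usage check; 0x00 is an arbitrary non-csect placeholder value.
void checkOrdering() {
  const std::vector<uint8_t> Aux = {0xFB, 0x00, 0xFB};
  const std::vector<std::string> TwoPhase = {"raw#2", "csect#1"};
  const std::vector<std::string> OnePass = {"csect#1", "raw#2", "csect#3"};
  assert(twoPhaseOrder(Aux) == TwoPhase);
  assert(singlePassOrder(Aux) == OnePass);
}
```

With a csect entry in first position and a second one at the end, the two-phase model prints the raw entry before the csect entry and drops the second csect entry, which is the behaviour DiggerLin describes.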

		jhenderson (Not Done): @grimar has gone to a lot of effort to get rid of `unwrapOrError` from the ELF dumping code. I'd prefer it if we could avoid using it here too. It is generally better in dumping tools to report a warning and abort dumping the current section than to emit an error and terminate the program, since it gives the user more of the information they've asked for.
		grimar (Not Done): Yeah. Having `unwrapOrError` available is my concern. I am trying to clean up the ELF dumper, but other files (e.g. COFF) are still using it. Ideally I'd just remove this API from the llvm-readobj code, since it seems to do more harm than good in the long term. At least I'd be happy if people stopped adding more calls to the code.
		jasonliu (Author, Done): Thanks. Agreed. Will avoid using `unwrapOrError` in future code.
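The "warn, skip the current item, keep dumping" pattern recommended in the comments above can be sketched without any LLVM dependency. Everything here is a hypothetical stand-in: `WarningReporter::reportUnique` only imitates the deduplicating behaviour attributed to llvm-readobj's `reportUniqueWarning`, and `Section` and `dumpAll` are invented for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <iostream>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for a deduplicating warning reporter: each distinct
// message is written to stderr once. Returns true if the message was new.
class WarningReporter {
  std::set<std::string> Seen;

public:
  bool reportUnique(const std::string &Msg) {
    if (!Seen.insert(Msg).second)
      return false;
    std::cerr << "warning: " << Msg << '\n';
    return true;
  }
  std::size_t count() const { return Seen.size(); }
};

// Invented input type: a section that may fail to parse.
struct Section {
  std::string Name;
  bool Corrupt;
};

// Dumps every readable section. A corrupt section produces a warning and is
// skipped, so the remaining sections are still dumped instead of the tool
// terminating. Returns the names of the sections that were dumped.
std::vector<std::string> dumpAll(const std::vector<Section> &Sections,
                                 WarningReporter &W) {
  std::vector<std::string> Dumped;
  for (const Section &S : Sections) {
    if (S.Corrupt) {
      W.reportUnique("unable to read section '" + S.Name + "'");
      continue; // Bail out of this item only.
    }
    Dumped.push_back(S.Name);
  }
  return Dumped;
}

// Usage check: two corrupt '.data' sections yield a single warning.
void checkDumpAll() {
  WarningReporter W;
  const std::vector<Section> Sections = {
      {".text", false}, {".data", true}, {".bss", false}, {".data", true}};
  const std::vector<std::string> Dumped = dumpAll(Sections, W);
  assert(Dumped.size() == 2u);
  assert(W.count() == 1u);
}
```

The trade-off is the one the reviewers describe: the user still gets the output for every entry that parsed, at the cost of the tool not stopping at the first malformed entry.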
break;		break;
		}
case XCOFF::C_STAT:		case XCOFF::C_STAT:
if (NumberOfAuxEntries > 1)		if (NumberOfAuxEntries > 1)
report_fatal_error(		report_fatal_error(
"C_STAT symbol should not have more than 1 auxiliary entry.");		"C_STAT symbol should not have more than 1 auxiliary entry.");

const XCOFFSectAuxEntForStat *StatAuxEntPtr;		const XCOFFSectAuxEntForStat *StatAuxEntPtr;
StatAuxEntPtr =		StatAuxEntPtr = reinterpret_cast<const XCOFFSectAuxEntForStat *>(
reinterpret_cast<const XCOFFSectAuxEntForStat *>(SymbolEntPtr + 1);		XCOFFObjectFile::getAdvancedSymbolEntryAddress(
		SymbolEntRef.getEntryAddress(), 1));
#ifndef NDEBUG		#ifndef NDEBUG
Obj.checkSymbolEntryPointer(reinterpret_cast<uintptr_t>(StatAuxEntPtr));		Obj.checkSymbolEntryPointer(reinterpret_cast<uintptr_t>(StatAuxEntPtr));
#endif		#endif
printSectAuxEntForStat(StatAuxEntPtr);		printSectAuxEntForStat(StatAuxEntPtr);
break;		break;
case XCOFF::C_DWARF:		case XCOFF::C_DWARF:
case XCOFF::C_BLOCK:		case XCOFF::C_BLOCK:
case XCOFF::C_FCN:		case XCOFF::C_FCN:
report_fatal_error("Symbol table entry printing for this storage class "		report_fatal_error("Symbol table entry printing for this storage class "
"type is unimplemented.");		"type is unimplemented.");
break;		break;
default:		default:
for (int i = 1; i <= NumberOfAuxEntries; i++) {		for (int i = 1; i <= NumberOfAuxEntries; i++) {
W.startLine() << "!Unexpected raw auxiliary entry data:\n";		W.startLine() << "!Unexpected raw auxiliary entry data:\n";
W.startLine() << format_bytes(		W.startLine() << format_bytes(
ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(SymbolEntPtr + i),		ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(
		XCOFFObjectFile::getAdvancedSymbolEntryAddress(
		SymbolEntRef.getEntryAddress(), i)),
XCOFF::SymbolTableEntrySize));		XCOFF::SymbolTableEntrySize));
}		}
break;		break;
}		}
}		}

void XCOFFDumper::printSymbols() {		void XCOFFDumper::printSymbols() {
ListScope Group(W, "Symbols");		ListScope Group(W, "Symbols");
▲ Show 20 Lines • Show All 111 Lines • Show Last 20 Lines

llvm/tools/obj2yaml/xcoff2yaml.cpp

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	void XCOFFDumper::dumpHeader() {
YAMLObj.Header.Flags = Obj.getFlags();		YAMLObj.Header.Flags = Obj.getFlags();
}		}

 std::error_code XCOFFDumper::dumpSymbols() {
   std::vector<XCOFFYAML::Symbol> &Symbols = YAMLObj.Symbols;

   for (const SymbolRef &S : Obj.symbols()) {
     DataRefImpl SymbolDRI = S.getRawDataRefImpl();
-    const XCOFFSymbolEntry *SymbolEntPtr = Obj.toSymbolEntry(SymbolDRI);
+    const XCOFFSymbolRef SymbolEntRef = Obj.toSymbolRef(SymbolDRI);
     XCOFFYAML::Symbol Sym;

     Expected<StringRef> SymNameRefOrErr = Obj.getSymbolName(SymbolDRI);
     if (!SymNameRefOrErr) {
       return errorToErrorCode(SymNameRefOrErr.takeError());
     }
     Sym.SymbolName = SymNameRefOrErr.get();

-    Sym.Value = SymbolEntPtr->Value;
+    Sym.Value = SymbolEntRef.getValue();

     Expected<StringRef> SectionNameRefOrErr =
-        Obj.getSymbolSectionName(SymbolEntPtr);
+        Obj.getSymbolSectionName(SymbolEntRef);
     if (!SectionNameRefOrErr)
       return errorToErrorCode(SectionNameRefOrErr.takeError());

     Sym.SectionName = SectionNameRefOrErr.get();

-    Sym.Type = SymbolEntPtr->SymbolType;
-    Sym.StorageClass = SymbolEntPtr->StorageClass;
-    Sym.NumberOfAuxEntries = SymbolEntPtr->NumberOfAuxEntries;
+    Sym.Type = SymbolEntRef.getSymbolType();
+    Sym.StorageClass = SymbolEntRef.getStorageClass();
+    Sym.NumberOfAuxEntries = SymbolEntRef.getNumberOfAuxEntries();
     Symbols.push_back(Sym);
   }

   return std::error_code();
 }

 std::error_code xcoff2yaml(raw_ostream &Out,
                            const object::XCOFFObjectFile &Obj) {
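The dumpSymbols change above replaces direct field reads on a 32-bit entry pointer with accessor calls on a reference type. The following is a minimal sketch of why accessors help, assuming the entry layouts implied by the byte arrays in this revision's unit tests (18-byte big-endian entries, with the value stored at a different position in 32-bit and 64-bit objects). `SymbolView`, `readBE`, `File32` and `File64` are hypothetical names for illustration only; this is not LLVM's `XCOFFSymbolRef`.

```cpp
#include <cstdint>

// Hypothetical view over one 18-byte symbol table entry (not LLVM code).
struct SymbolView {
  const std::uint8_t *Entry;
  bool Is64Bit;

  // Reads a big-endian integer of Size bytes starting at Offset.
  constexpr std::uint64_t readBE(unsigned Offset, unsigned Size) const {
    std::uint64_t V = 0;
    for (unsigned I = 0; I < Size; ++I)
      V = (V << 8) | Entry[Offset + I];
    return V;
  }

  // Assumed layout: 64-bit entries begin with an 8-byte value, while
  // 32-bit entries keep a 4-byte value after the 8-byte name field.
  constexpr std::uint64_t getValue() const {
    return Is64Bit ? readBE(0, 8) : readBE(8, 4);
  }
  // The trailing fields sit at the same offsets in both layouts.
  constexpr std::uint16_t getSymbolType() const {
    return static_cast<std::uint16_t>(readBE(14, 2));
  }
  constexpr std::uint8_t getStorageClass() const { return Entry[16]; }
  constexpr std::uint8_t getNumberOfAuxEntries() const { return Entry[17]; }
};

// The C_File symbol entries copied from the 32-bit and 64-bit test inputs.
static constexpr std::uint8_t File32[18] = {
    0x2e, 0x66, 0x69, 0x6c, 0x65, 0x00, 0x00, 0x00, 0x00,
    0x00, 0x00, 0x00, 0xff, 0xfe, 0x00, 0x03, 0x67, 0x01};
static constexpr std::uint8_t File64[18] = {
    0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
    0x00, 0x00, 0x04, 0xff, 0xfe, 0x00, 0x02, 0x67, 0x01};

// Callers use the same accessors for either layout.
static_assert(SymbolView{File32, false}.getStorageClass() == 0x67, "");
static_assert(SymbolView{File64, true}.getStorageClass() == 0x67, "");
```

With this shape, a caller such as dumpSymbols does not need to know which layout backs the entry, which is roughly the property the accessor-based calls in the diff rely on.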

llvm/unittests/Object/XCOFFObjectFileTest.cpp

   Expected<XCOFFTracebackTable> TTOrErr =
       XCOFFTracebackTable::create(TBTableData, Size);

   EXPECT_THAT_ERROR(
       TTOrErr.takeError(),
       FailedWithMessage(
           "unexpected end of data at offset 0x2c while reading [0x2c, 0x2d)"));
   EXPECT_EQ(Size, 44u);
 }

		TEST(XCOFFObjectFileTest, XCOFFGetCsectAuxRef32) {
		uint8_t XCOFF32Binary[] = {
		// File header.
		0x01, 0xdf, 0x00, 0x01, 0x5f, 0x58, 0xf8, 0x95, 0x00, 0x00, 0x00, 0x3c,
		0x00, 0x00, 0x00, 0x04, 0x00, 0x00, 0x00, 0x00,

		// Section header for empty .data section.
		0x2e, 0x64, 0x61, 0x74, 0x61, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x40,

		// Start of symbol table.
		// C_File symbol.
		0x2e, 0x66, 0x69, 0x6c, 0x65, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0xff, 0xfe, 0x00, 0x03, 0x67, 0x01,
		// File Auxiliary Entry.
		0x61, 0x2e, 0x63, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00,

		// Csect symbol.
		0x2e, 0x64, 0x61, 0x74, 0x61, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x01, 0x00, 0x00, 0x6b, 0x01,
		// Csect auxiliary entry.
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x21, 0x05,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00};

		ArrayRef<uint8_t> XCOFF32Ref(XCOFF32Binary, sizeof(XCOFF32Binary));
		Expected<std::unique_ptr<ObjectFile>> XCOFFObjOrErr =
		object::ObjectFile::createObjectFile(
		MemoryBufferRef(toStringRef(XCOFF32Ref), "dummyXCOFF"),
		file_magic::xcoff_object_32);
		ASSERT_THAT_EXPECTED(XCOFFObjOrErr, Succeeded());

		const XCOFFObjectFile &File = *cast<XCOFFObjectFile>((*XCOFFObjOrErr).get());
		DataRefImpl Ref;
		Ref.p = File.getSymbolEntryAddressByIndex(2);
		XCOFFSymbolRef SymRef = File.toSymbolRef(Ref);
		Expected<XCOFFCsectAuxRef> CsectRefOrErr = SymRef.getXCOFFCsectAuxRef();
		ASSERT_THAT_EXPECTED(CsectRefOrErr, Succeeded());

		// Set csect symbol's auxiliary entry count to 0.
		XCOFF32Binary[113] = 0;
		Expected<XCOFFCsectAuxRef> ExpectErr = SymRef.getXCOFFCsectAuxRef();
		EXPECT_THAT_ERROR(
		ExpectErr.takeError(),
		FailedWithMessage("csect symbol \".data\" contains no auxiliary entry"));
		}

		TEST(XCOFFObjectFileTest, XCOFFGetCsectAuxRef64) {
		uint8_t XCOFF64Binary[] = {
		// File header.
		0x01, 0xf7, 0x00, 0x01, 0x5f, 0x59, 0x25, 0xeb, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x60, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x04,

		// Section header for empty .data section.
		0x2e, 0x64, 0x61, 0x74, 0x61, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x40, 0x00, 0x00, 0x00, 0x00,

		// Start of symbol table.
		// C_File symbol.
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x04,
		0xff, 0xfe, 0x00, 0x02, 0x67, 0x01,
		// File Auxiliary Entry.
		0x61, 0x2e, 0x63, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
		0x00, 0x00, 0x00, 0x00, 0x00, 0xfc,

		// Csect symbol.
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x0a,
		0x00, 0x01, 0x00, 0x00, 0x6b, 0x01,
		// Csect auxiliary entry.
		0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x21, 0x05,
		0x00, 0x00, 0x00, 0x00, 0x00, 0xfb,

		// String table.
		0x00, 0x00, 0x00, 0x10, 0x2e, 0x66, 0x69, 0x6c, 0x65, 0x00, 0x2e, 0x64,
		0x61, 0x74, 0x61, 0x00};

		ArrayRef<uint8_t> XCOFF64Ref(XCOFF64Binary, sizeof(XCOFF64Binary));
		Expected<std::unique_ptr<ObjectFile>> XCOFFObjOrErr =
		object::ObjectFile::createObjectFile(
		MemoryBufferRef(toStringRef(XCOFF64Ref), "dummyXCOFF"),
		file_magic::xcoff_object_64);
		ASSERT_THAT_EXPECTED(XCOFFObjOrErr, Succeeded());

		const XCOFFObjectFile &File = *cast<XCOFFObjectFile>((*XCOFFObjOrErr).get());
		DataRefImpl Ref;
		Ref.p = File.getSymbolEntryAddressByIndex(2);
		XCOFFSymbolRef SymRef = File.toSymbolRef(Ref);
		Expected<XCOFFCsectAuxRef> CsectRefOrErr = SymRef.getXCOFFCsectAuxRef();
		ASSERT_THAT_EXPECTED(CsectRefOrErr, Succeeded());

		// Inject incorrect auxiliary type value.
		XCOFF64Binary[167] = static_cast<uint8_t>(XCOFF::AUX_SYM);
		Expected<XCOFFCsectAuxRef> NotFoundErr = SymRef.getXCOFFCsectAuxRef();
		EXPECT_THAT_ERROR(
		NotFoundErr.takeError(),
		FailedWithMessage(
		"a csect auxiliary entry is not found for symbol \".data\""));

		// Set csect symbol's auxiliary entry count to 0.
		XCOFF64Binary[149] = 0;
		Expected<XCOFFCsectAuxRef> ExpectErr = SymRef.getXCOFFCsectAuxRef();
		EXPECT_THAT_ERROR(
		ExpectErr.takeError(),
		FailedWithMessage("csect symbol \".data\" contains no auxiliary entry"));
		}
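
The indices patched in the two tests (113, 149 and 167) are not explained in the diff. They appear to follow from the layout of the byte arrays: the symbol table starts right after the file header and the single section header, every symbol or auxiliary entry is 18 bytes, and the patched field is the last byte of its entry. The sketch below checks that arithmetic; the constant and function names are hypothetical and the sizes are read off the test arrays, not taken from LLVM headers.

```cpp
#include <cstddef>

// Sizes read off the byte arrays in the tests (hypothetical names).
constexpr std::size_t SymbolEntrySize = 18; // symbol and aux entries alike
constexpr std::size_t LastByteOffset = 17;  // aux count, or 64-bit aux type

// Byte offset of symbol table entry Index, given the table's start offset.
constexpr std::size_t entryOffset(std::size_t SymTabStart, std::size_t Index) {
  return SymTabStart + Index * SymbolEntrySize;
}

// 32-bit: 20-byte file header + 40-byte section header, table at 0x3c.
// Entry 2 is the csect symbol; its last byte is the aux entry count.
static_assert(entryOffset(0x3c, 2) + LastByteOffset == 113, "");

// 64-bit: 24-byte file header + 72-byte section header, table at 0x60.
static_assert(entryOffset(0x60, 2) + LastByteOffset == 149, "");
// Entry 3 is the csect auxiliary entry; its last byte is the aux type.
static_assert(entryOffset(0x60, 3) + LastByteOffset == 167, "");
```

Deriving the indices from named sizes like this, rather than hard-coding them, would make the tests easier to update if the input arrays change, though that is a suggestion and not part of this revision.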

Diff 350352