This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
openmp/libomptarget/plugins/
-
libomptarget/
-
plugins/
-
amdgpu/src/
-
src/
1/3
rtl.cpp
-
common/elf_common/
-
elf_common/
-
CMakeLists.txt
-
ELFSymbols.h
1/3
ELFSymbols.cpp

Differential D131309

[Libomptarget] Add utility functions for loading an ELF symbol by name
ClosedPublic

Authored by jhuber6 on Aug 5 2022, 7:38 PM.

Download Raw Diff

Details

Reviewers

JonChesterfield
jdoerfert
tra
yaxunl

Commits

rG894531f59beb: [Libomptarget] Add utility functions for loading an ELF symbol by name

Summary

The SHT_HASH sections in an ELF are used to look up a symbol in the
symbol table using a symbol's name. This is done by obtaining the
SHT_HASH section and using its sh_link attribute to access the
associated symbol table, from which we can access the string table
containing the associated name. We can then search for the symbol using
the hash of the name and the buckets and chains in the hash table
itself

This patch adds utility functions that allow us to look up a symbol in
an ELF file by name. It will first attempt to look through the hash
tables, and then search the section tables manually if failed. This
allows us to pull out constants necessary for setting up offloading
without first loading the object.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Aug 5 2022, 7:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 5 2022, 7:38 PM

Herald added a subscriber: StephenFan. · View Herald Transcript

jhuber6 requested review of this revision.Aug 5 2022, 7:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 5 2022, 7:38 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B179656: Diff 450464.Aug 5 2022, 8:54 PM

Format

Harbormaster completed remote builds in B179689: Diff 450513.Aug 6 2022, 5:39 AM

Could you call this function from the amdgpu plugin in the same patch? In particular the sketchy looking hashtable logic is likely to be a copy from there with modifications to fit the llvm libs and it'll be easier to review with that in the diff scope.

Basically I can't remember the details of that lookup but remember writing it carefully at the time + it's been in use in that plugin for a year or so. I'd like to verify that the logic is unchanged by the move between files.

In D131309#3706189, @JonChesterfield wrote:

Could you call this function from the amdgpu plugin in the same patch? In particular the sketchy looking hashtable logic is likely to be a copy from there with modifications to fit the llvm libs and it'll be easier to review with that in the diff scope.

Basically I can't remember the details of that lookup but remember writing it carefully at the time + it's been in use in that plugin for a year or so. I'd like to verify that the logic is unchanged by the move between files.

I'd prefer not to make this patch really messy with all the required changes in Libomptarget, I can make a new patch for that and put it on the stack for this one. Here's the relevant file if you'd rather look at what's in-source first https://github.com/llvm/llvm-project/blob/main/openmp/libomptarget/plugins/amdgpu/src/rtl.cpp#L1630. For the functionality I figured the runtime test I added was sufficient, though I could make it more complicated.

jhuber6 added a child revision: D131401: [Libomptarget] Replace libelf with LLVM's Elf libraries.Aug 8 2022, 8:06 AM

jhuber6 added reviewers: jhenderson, yaxunl.Aug 10 2022, 12:48 PM

I think you might be reinventing the wheel here. llvm-readobj already has code for printing the symbols derived from the hash table - see printHashTableSymbols. Can you reuse that code? Obviously, this might require refactoring some of the llvm-readobj logic, so that it is in libObject rather than the tool specifically.

In D131309#3715186, @jhenderson wrote:

I think you might be reinventing the wheel here. llvm-readobj already has code for printing the symbols derived from the hash table - see printHashTableSymbols. Can you reuse that code? Obviously, this might require refactoring some of the llvm-readobj logic, so that it is in libObject rather than the tool specifically.

The usage is somewhat different, as far as I can tell the llvm-readobj implementation is only concerned with printing all the symbols in the hash table. This function is intended to be used to look up a specific symbol if we already know the name.

In D131309#3715793, @jhuber6 wrote:

In D131309#3715186, @jhenderson wrote:

I think you might be reinventing the wheel here. llvm-readobj already has code for printing the symbols derived from the hash table - see printHashTableSymbols. Can you reuse that code? Obviously, this might require refactoring some of the llvm-readobj logic, so that it is in libObject rather than the tool specifically.

The usage is somewhat different, as far as I can tell the llvm-readobj implementation is only concerned with printing all the symbols in the hash table. This function is intended to be used to look up a specific symbol if we already know the name.

Oops, yes, I see. Not sure how I missed that.

Could you clarify the use-case a bit more, please? The hash table is primarily for dynamic linking purposes, and consequently section headers aren't guaranteed to exist when doing this kind of lookup, with the dynamic tags like DT_HASH being used instead.

llvm/include/llvm/Object/ELF.h
518–519 ↗	(On Diff #450513)	You need to make sure that this value is within the ELF, and that there's enough size for the hash table. You should also validate that the sh_size is large enough to fit a valid hash table in, so that you don't try to look for e.g. the nbuckets field if the size is 0 or the Bucket or Chain arrays if the size is too small.
520–521 ↗	(On Diff #450513)	Similarly, you should validate that the symbol table fits within the ELF.
529 ↗	(On Diff #450513)	I'd use `STN_UNDEF` rather than 0, since that's what the spec uses.
530 ↗	(On Diff #450513)	`I` needs validating to ensure it doesn't fall outside the range of `SymTab`.
llvm/unittests/Object/ELFObjectFileTest.cpp
812 ↗	(On Diff #450513)	Your testing needs to be expanded to cover the various failure cases that you check for in your function.

In D131309#3718458, @jhenderson wrote:

Oops, yes, I see. Not sure how I missed that.

Could you clarify the use-case a bit more, please? The hash table is primarily for dynamic linking purposes, and consequently section headers aren't guaranteed to exist when doing this kind of lookup, with the dynamic tags like DT_HASH being used instead.

Offloading execution to an external accelerator is very similar to loading a shared library via dlopen. The images we execute on an external device are basically dynamic objects we load at runtime. When loading these images there are certain symbols and constants we emit inside the ELF that are used to control how the image itself is executed (how many blocks and threads to launch for example). This method is currently used to look up these known constants by-name so we can read with without first needing to load the whole image. This method is supposed to be faster than iterating the entire symbol table, checking each one.

Interesting point about DT_HASH. Would this just require an extra lookup from the dynamic table if we don't have section headers? I'm assuming they'd both point to the same symbol / string table, or .

yaxunl added inline comments.Aug 12 2022, 7:57 AM

llvm/unittests/Object/ELFObjectFileTest.cpp
878 ↗	(On Diff #450513)	I am wondering whether it is useful to add a test for getting symbol values here.

Addressing comments and adding more error handling.

For the tests I had to manually edit the values with a const_cast, the ELF parsing was changing some things I think. I think this is fine for now since the interface requires a section header, if in the future we need to be able to do this without section headers we could probably make another interface to get the same pointers.

Harbormaster completed remote builds in B180926: Diff 452194.Aug 12 2022, 10:29 AM

jhenderson added inline comments.Aug 15 2022, 1:25 AM

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	All these checks still need test cases.
524–526 ↗	(On Diff #452194)	This is certainly part of what is needed, but it doesn't address the issue of if the sh_size is too small for the hash table to be valid (e.g. it is too small for the nchain or nbucket fields, or the size derived from this information is different to sh_size).
534–535 ↗	(On Diff #452194)	I would very much prefer not to use `auto` here. It's not obvious what the type of Bucket and Chain are.
530 ↗	(On Diff #450513)	Not sure I see where this check is made, even though it's marked as done?
llvm/unittests/Object/ELFObjectFileTest.cpp
823–825 ↗	(On Diff #452194)	`SHF_ALLOC` flags and `Link: .dynsym` are implicit for SHT_HASH sections in YAML, so you can omit those two fields for simplicity.
826 ↗	(On Diff #452194)	I think the `AddressAlign` field is unnecessary? (Also for the other sections)
831–833 ↗	(On Diff #452194)	Similar to above, `SHF_ALLOC` and `Link: .dynstr` are implicit, so don't need specifying.
842–845 ↗	(On Diff #452194)	I think it's probably irrelevant to the test what the contents/address of the section is: I don't believe anything validates that the symbols appear in the claimed section. You could even just use SHN_ABS dynamic symbols, if you wanted to avoid the question entirely. Removing these details would improve the signal to noise ratio in the setup data.
864 ↗	(On Diff #452194)	Nit: the preferred style for these comments is as inline - clang-format recognises this specific syntax and formats it slightly differently to other inline comments. Applies below too.
867 ↗	(On Diff #452194)	There's a lot of unhelpful `auto` in this test, which I don't think really conforms to LLVM's style guide.
871 ↗	(On Diff #452194)	Seems simpler?
940 ↗	(On Diff #452194)	Rather than const_cast here, you should be able to use an `StName` field in the YAML. See https://github.com/llvm/llvm-project/blob/main/llvm/test/tools/yaml2obj/ELF/symbol-name.yaml for examples.
943 ↗	(On Diff #452194)	Use `FailedWithMessage` to check the error message contents. Same elsewhere.
945 ↗	(On Diff #452194)
946–947 ↗	(On Diff #452194)	You can use the .symtab's `ShOffset` field in the YAML to force this without needing the const_cast.
952 ↗	(On Diff #452194)
955 ↗	(On Diff #452194)	You can use the .hash's `ShOffset` field in the YAML to force this without needing the const_cast.
878 ↗	(On Diff #450513)	I'm in two minds about it. However, if preferring to test just the symbol names, the other symbol properties can be omitted to simplify the YAML.

Addressing comments.

jhuber6 marked 16 inline comments as done.Aug 15 2022, 6:32 AM

jhuber6 added inline comments.

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	Checking these errors explicitly is excessive. This isn't any new logic so it should already be covered by existing tests. I added one for calling this with an incorrect section but the other error paths are already covered as far as I'm aware.
llvm/unittests/Object/ELFObjectFileTest.cpp
867 ↗	(On Diff #452194)	ELF uses a lot of type-aliases depending on the input. It's much easier to just use `auto` rather than importing the names and using `Expected<typename ELF64LE::xyz>` for everything.

Harbormaster completed remote builds in B181266: Diff 452649.Aug 15 2022, 7:39 AM

jhenderson added inline comments.Aug 16 2022, 2:20 AM

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	I'm not suggesting the called code itself needs additional testing, merely testing that shows that if the called code reports an error, the error is correctly handled and returned up the call stack. The new if statements are new logic. As things stand, there is zero testing for those statements.
506 ↗	(On Diff #452649)	I'm looking at this error and wondering if it would be useful to print the actual type (possibly just as a number), so that if a user encounters this area, it might be slightly easier to find the problem. Up to you whether you think it's useful or not though.
527–528 ↗	(On Diff #452649)	If HashTab's size is 0, then you can't read the nbucket or nchain field, so this will result in reading outside the section. You'll need to split this into a check to make sure it's big enough for the nchain/nbucket fields, and then a separate one to make sure the arrays fit.
543–544 ↗	(On Diff #452649)	It's not clear in the final message what the bit after the colon means. I'd add some additional words clarifying that it is the number of symbols.
llvm/unittests/Object/ELFObjectFileTest.cpp
916 ↗	(On Diff #452649)	You might be able to avoid duplicating this string between some of the test cases, by having a factory function that constructs the string by having a prefix (which ends at the end of the section you want to modify) and a suffix (which starts immediately after it), and then an optional field that can be inserted in the middle. Essentially you could have lines like the following then in the different tests: makeYAMLWithInvalidDynsym("ShOffset: 9999"); makeYAMLWithInvalidHash("ShSize: 1"); or perhaps: makeYAML(DynsymPrefix, "ShOffset: 9999", DynsymSuffix); makeYAML(HashPrefix, "ShSize: 1", HashSuffix); Potentially, you could have that function construct the ELF and return the hash table too, to reduce the boiler plate.
995 ↗	(On Diff #452649)	I was slightly confused by first reading of this comment, so I'd add the "index" to be specific.
1035 ↗	(On Diff #452649)	"is" -> "it" here and in all the other comments. Not sure which "table" is referred to in this comment. Probably should be "check that it fails if a symbol has an invalid st_name."

jhuber6 marked 6 inline comments as done.Aug 16 2022, 9:10 AM

jhuber6 added inline comments.

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	I'll add a test for an invalid `st_link`. That should be good enough to assert that this part functions properly. I think it's safe to assert that getting a section from an index is sufficiently tested.
llvm/unittests/Object/ELFObjectFileTest.cpp
916 ↗	(On Diff #452649)	The solution I had previously that modified the ELF in-place was more compact because I could iteratively make different field invalid using the same input. I could go back to that method, otherwise I think it's too much work to make some factory generator just for saving a few string constants in a test.

Addressing comments.

Harbormaster completed remote builds in B181549: Diff 453037.Aug 16 2022, 9:54 AM

jhenderson added inline comments.Aug 18 2022, 12:48 AM

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	(I assume you meant sh_link). That will cover the `getSection` if statement, but it doesn't cover the `sections()` if statement or the `getStringTableForSymtab`: every conditional should have at least one test case that triggers both the true and false cases, or you haven't fully tested the if. Putting it in other terms, if things were easier to mock, I'd suggest mocking out the return values of each of those functions, so that in one case they produced an error and in the other case they produced a valid value that can be used later on, and then having test cases that used both mocked versions. In practice, what you have to do instead is craft some invalid ELF to trigger the error. I don't care how you achieve that, as long as the respective functions trigger errors.
506 ↗	(On Diff #452649)	Just pinging this point in case you missed it, as you've not responded one way or another (even with just a "no I don't think it's useful).
527–528 ↗	(On Diff #452649)	Thanks for the fix, although there are now two failure cases here, both of which need separate testing (i.e. one with a very small/nearly empty SHT_HASH and another where it has enough space for the header fields, but not the full bucket array).
llvm/unittests/Object/ELFObjectFileTest.cpp
1080 ↗	(On Diff #453037)	`sh_link` not `st_link`
916 ↗	(On Diff #452649)	Yes, I see your point. I do have a strong aversion to const_cast, but I don't really have much justification to it in this particular context. I see there is limited usage within LLVM unit tests of it too. I'd like a second opinion from a regular contributor in this area: @MaskRay, do you have any thoughts?

jhuber6 marked an inline comment as done.Aug 18 2022, 4:24 AM

jhuber6 added inline comments.

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	At this point half of the existing ELF runtime test is going to be this one feature. I personally don't see the utility in testing errors that already have coverage somewhere else. If we already know calling some function with an invalid ELF will return the correct error I don't see why repeating that same pattern elsewhere warrants extra tests since none of this logic is unique or new.
506 ↗	(On Diff #452649)	I don't think it's useful and this error message is very similar to other messages.

jhenderson added inline comments.Aug 18 2022, 4:50 AM

llvm/include/llvm/Object/ELF.h
505–516 ↗	(On Diff #452194)	Most of ELF.h is tested via lit tests rather than unittests, so it's a bit of a mischaracterisation that this feature will be half the runtime tests. If you put a code coverage tool on this function, you wouldn't get 100% coverage without testing both halves of the ifs in question. A lot of work has gone into making sure that many/most/all of these sorts of functions have 100% coverage, both for the positive and the negative cases. That means testing needs to trigger every case where an error is handled (not just where it is reported) to show that the error handling is correct and that the error is appropriately propagated up the call stack. Note that I'm not arguing this for the sake of it: bugs can and do happen even in simple looking code around error handling. I'm not budging on this point, because I am not willing to accept only partial test coverage, when it is relatively straightforward to add said coverage. I think you'll find that this is the same for others who regularly work on and review code in this area.

Adding a test for an invalid ELF header failing to get sections, an invalid link index, and an invalid hash size greater than the size of the header.

jhuber6 marked 3 inline comments as done.Aug 18 2022, 5:21 AM

Harbormaster completed remote builds in B181972: Diff 453626.Aug 18 2022, 6:53 AM

MaskRay added inline comments.Aug 18 2022, 12:04 PM

llvm/unittests/Object/ELFObjectFileTest.cpp
916 ↗	(On Diff #452649)	I prefer avoiding `const_cast`, but for this case it's fine if an alternative needs too much boilerplate. `SmallString<0> Storage;` and the `toBinary<ELF64LE>(Storage, R"(` use make it pretty clear the underlying object is actually writable.

Many Linux distributions (see clang/lib/Driver/ToolChains/Linux.cpp:240 for Clang behavior; the distros typically patch their system GCC to do the same thing) use the linker option --hash-style=gnu, so most of their executabls/shared objects do not have SHT_HASH/DT_HASH. You may need to add SHT_GNU_HASH/DT_GNU_HASH support in a separate patch.

Actually, on Linux SHT_GNU_HASH occurs more than SHT_HASH. For symbol search's purpose, SHT_HASH is actually legacy now. If I am to implement the functionality, I may not even implement the SHT_HASH part. (I have a SHT_GNU_HASH state summary for many ELF based OSes https://groups.google.com/g/generic-abi/c/9L03yrxXPBc/m/WKuUjZshAQAJ)

llvm/include/llvm/Object/ELF.h
75 ↗	(On Diff #453626)	s/libelf/generic ABI/
546 ↗	(On Diff #453626)	You can save `sh_size / sizeof(Elf_Sym)` as a local variable, then you can replace `I * sizeof(Elf_Sym) >= (*SymTabOrErr)->sh_size` with a simpler check.
551 ↗	(On Diff #453626)	Prefer `substr` which is also available in std::string_view and is the unaliased function in StringRef.

In D131309#3733045, @MaskRay wrote:

Many Linux distributions (clang/lib/Driver/ToolChains/Linux.cpp:240; they typically patch their system GCC to do the same thing) use the linker option --hash-style=gnu, so most of their executabls/shared objects do not have SHT_HASH/DT_HASH. You may need to add SHT_GNU_HASH/DT_GNU_HASH support.

Right now we use this for the offloading runtime library and the generated objects use SHT_HASH so it's sufficient for our purposes. Do we currently do anything with SHT_GNU_HASH? My understanding is that SHT_HASH is standard while SHT_GNU_HASH isn't officially documented so we would need to import some GNU code verbatim to LLVM, but I'm hardly an expert.

llvm/include/llvm/Object/ELF.h
75 ↗	(On Diff #453626)	I just moved this, but I can fix it while I'm here.

In D131309#3733059, @jhuber6 wrote:

In D131309#3733045, @MaskRay wrote:

Many Linux distributions (clang/lib/Driver/ToolChains/Linux.cpp:240; they typically patch their system GCC to do the same thing) use the linker option --hash-style=gnu, so most of their executabls/shared objects do not have SHT_HASH/DT_HASH. You may need to add SHT_GNU_HASH/DT_GNU_HASH support.

Right now we use this for the offloading runtime library and the generated objects use SHT_HASH so it's sufficient for our purposes. Do we currently do anything with SHT_GNU_HASH? My understanding is that SHT_HASH is standard while SHT_GNU_HASH isn't officially documented so we would need to import some GNU code verbatim to LLVM, but I'm hardly an expert.

I added " (I have a SHT_GNU_HASH state summary for many ELF based OSes https://groups.google.com/g/generic-abi/c/9L03yrxXPBc/m/WKuUjZshAQAJ)" perhaps after you were composing the reply.

SHT_GNU_HASH is quite universal now. At least all OS with serious LLVM support (except Solaris) support the format, with some OSes for a long time (almost 10 years; and glibc for 16 years).
SHT_HASH will probably just benefit Solaris and illumnos (I am not sure how the whole GPU stack works there.) (Also, Solaris rtld may be quite different anyway. I am uncertain we can support them with generic code.)

In D131309#3733081, @MaskRay wrote:

I added " (I have a SHT_GNU_HASH state summary for many ELF based OSes https://groups.google.com/g/generic-abi/c/9L03yrxXPBc/m/WKuUjZshAQAJ)" perhaps after you were composing the reply.

SHT_GNU_HASH is quite universal now. At least all OS with serious LLVM support (except Solaris) support it, with some OSes for a long time (almost 10 years; and glibc for 16 years).
SHT_HASH will probably just benefit Solaris and illumnos (I am not sure how the whole GPU stack works there.)

I see, thanks for the information. It seems we have a GnuHash struct to represent the values in LLVM. Do we have the GNU hash function somewhere in the LLVM source? I don't see it used anywhere.

As for this patch, I'd say since this patch is pretty much done and it meets our (LLVM offloading's) immediate needs we may as well support it. Then I can make a follow-up patch that adds the GNU hash support. Since it'll use most of the same logic it should be relatively easy to add-on to this patch.

In D131309#3733102, @jhuber6 wrote:

In D131309#3733081, @MaskRay wrote:

I added " (I have a SHT_GNU_HASH state summary for many ELF based OSes https://groups.google.com/g/generic-abi/c/9L03yrxXPBc/m/WKuUjZshAQAJ)" perhaps after you were composing the reply.

SHT_GNU_HASH is quite universal now. At least all OS with serious LLVM support (except Solaris) support it, with some OSes for a long time (almost 10 years; and glibc for 16 years).
SHT_HASH will probably just benefit Solaris and illumnos (I am not sure how the whole GPU stack works there.)

I see, thanks for the information. It seems we have a GnuHash struct to represent the values in LLVM. Do we have the GNU hash function somewhere in the LLVM source? I don't see it used anywhere.

As for this patch, I'd say since this patch is pretty much done and it meets our (LLVM offloading's) immediate needs we may as well support it. Then I can make a follow-up patch that adds the GNU hash support. Since it'll use most of the same logic it should be relatively easy to add-on to this patch.

I am sorry that I made the comment so late. I am travelling and will have more limited bandwidth next week.
I am quite sure the SHT_HASH will benefit nearly no user. If the SHT_HASH patch is created 3 years ago, it'd probably benefit some *BSD, but now, it probably doesn't add much value.
I agree this patch is pretty much done with decent tests now. Perhaps this can be changed to use SHT_GNU_HASH instead? Quite a large portion of the code can be reused and you will not waste your effort :)

https://flapenguin.me/elf-dt-gnu-hash has a great introduction of the format.

In D131309#3733474, @MaskRay wrote:

In D131309#3733102, @jhuber6 wrote:

In D131309#3733081, @MaskRay wrote:

I added " (I have a SHT_GNU_HASH state summary for many ELF based OSes https://groups.google.com/g/generic-abi/c/9L03yrxXPBc/m/WKuUjZshAQAJ)" perhaps after you were composing the reply.

SHT_GNU_HASH is quite universal now. At least all OS with serious LLVM support (except Solaris) support it, with some OSes for a long time (almost 10 years; and glibc for 16 years).
SHT_HASH will probably just benefit Solaris and illumnos (I am not sure how the whole GPU stack works there.)

I see, thanks for the information. It seems we have a GnuHash struct to represent the values in LLVM. Do we have the GNU hash function somewhere in the LLVM source? I don't see it used anywhere.

As for this patch, I'd say since this patch is pretty much done and it meets our (LLVM offloading's) immediate needs we may as well support it. Then I can make a follow-up patch that adds the GNU hash support. Since it'll use most of the same logic it should be relatively easy to add-on to this patch.

I am sorry that I made the comment so late. I am travelling and will have more limited bandwidth next week.
I am quite sure the SHT_HASH will benefit nearly no user. If the SHT_HASH patch is created 3 years ago, it'd probably benefit some *BSD, but now, it probably doesn't add much value.
I agree this patch is pretty much done with decent tests now. Perhaps this can be changed to use SHT_GNU_HASH instead? Quite a large portion of the code can be reused and you will not waste your effort :)

https://flapenguin.me/elf-dt-gnu-hash has a great introduction of the format.

There's a patch in the stack view that's waiting on this one to land, so it meets the needs of that currently. I'm definitely on board with adding SHT_GNU_HASH support afterwards since it should be relatively simple given what's already here. Is there no utility in supporting both?

OK. I don't understand OpenMP and GPU enough to comment on that patch.
The DT_HASH support will have some utilitization if mips is to be supported. How good is LLVM's support for OpenMP/GPU on mips?

In D131309#3733520, @MaskRay wrote:

OK. I don't understand OpenMP and GPU enough to comment on that patch.
The DT_HASH support will have some utilitization if mips is to be supported. How good is LLVM's support for OpenMP/GPU on mips?

I think libomp should work, but not for GPU offloading with libomptarget as none of the offloading plugins target MIPS. I guess my understanding is that the compiler will typically emit both HASH and GNU_HASH sections and we previously used the old DT_HASH because it was simpler. We could probably switch over to using the GNU version and it would still work. I still feel like if this works already we may as well support this function for both DT_HASH and DT_GNU_HASH, but you're obviously more qualified to make that judgement.

Thanks for the information about mips.
For Linux, I think for x years non-mips distributions have shipped executables with both DT_HASH and DT_GNU_HASH,
for 16-x years, these distributions ship executables with only DT_GNU_HASH.

For example, Go vdso used to support DT_HASH but in 2017 they noticed that DT_GNU_HASH had to be supported: https://go-review.git.corp.google.com/c/go/+/45511/

Nowadays, there are some --hash-style=sysv uses which are all mips specific or ancient tech debt.
AIUI your patch does not need to deal with such ancient objects.
mips is in a weird situation as it uses a technique from last century to sort .dynsym. The .dynsym sorting is incompatible with DT_GNU_HASH.
(https://maskray.me/blog/2021-08-29-all-about-global-offset-table#dt_mips_local_gotno-and-dt_mips_symtabno-dt_mips_gotsym)

Mostly just nits remaining from me, plus a couple of "normal behaviour" test cases I missed earlier.

FWIW, I think implementing SHT_HASH support first, as this patch does, should definitely happen. This is because the SHT_HASH is the thing that is in the gABI, and not SHT_GNU_HASH, however widely the GNU version is supported. Last I checked my company's system didn't use GNU hash tables yet, for example. I wouldn't be opposed to follow-up work for the GNU hash table though.

Presumably LLD has the GNU hash alogrithm in its source code, since it can generate such tables? Could that code be moved to libObject if needed?

llvm/unittests/Object/ELFObjectFileTest.cpp
835–840 ↗	(On Diff #453626)	If I follow this correctly, y is in the same bucket and ahead of x, such that we see y, check its name, and then move onto x. Assuming that is right, it might be worth a brief comment stating why we have y (I think we should have it FWIW, to show that the bucket/chain logic is correct). We definitely also want a test case where the symbol is not found. I think it would also be worth a test case involving multiple buckets, to show that we calculate the bucket correctly (this test case only currently has one bucket, so there's no risk that the wrong bucket would be selected).
923 ↗	(On Diff #453626)	I believe nchain is supposed to always match the size of the .dynsym. Not sure that it matters that much, but might be less confusing/risk of causing problems if it did in these test cases.
1064 ↗	(On Diff #453626)	I'm not sure "an invalid table" is clear as to the meaning. I'd go with "an invalid symbol name offset."
1106 ↗	(On Diff #453626)	Probably change this comment to "check that it fails when not referencing a dynamic symbol table" or something to that effect. "invalid sh_link" sounds like the value is outside the range of the section indexes, which is what the next case does.
1164–1165 ↗	(On Diff #453626)
916 ↗	(On Diff #452649)	Okay, thanks @MaskRay. @jhuber6, if you would prefer to revert to const_cast style, I'm not going to oppose it, although I'd ask that you keep the test cases separate still, with a helper function for the boiler plate to set up the ELF. The reasoning for this is mostly to keep the errors isolated (i.e. "the symbol would be found/nullptr returned except for this one thing").

Updating tests

Harbormaster completed remote builds in B182209: Diff 453980.Aug 19 2022, 7:27 AM

jhenderson added inline comments.Aug 22 2022, 12:02 AM

llvm/unittests/Object/ELFObjectFileTest.cpp
1106 ↗	(On Diff #453626)	You've updated the wrong comment in response to my suggestion here. This test case is the case where the sh_link is valid, but points at a .dynstr. The one you updated is the one that has a sh_link that is not within the section count range.
1164–1165 ↗	(On Diff #453626)	Not addressed?

Addressing comments. Hopefully this covers all nits so we can move forward.

jhuber6 marked 3 inline comments as done.Aug 23 2022, 7:50 AM

Harbormaster completed remote builds in B182849: Diff 454843.Aug 23 2022, 8:56 AM

LGTM, from my point of view, but I note a couple of @MaskRay's comments seem to not have been addressed. Please wait for him to confirm he is happy.

This revision is now accepted and ready to land.Aug 24 2022, 12:31 AM

In D131309#3744828, @jhenderson wrote:

LGTM, from my point of view, but I note a couple of @MaskRay's comments seem to not have been addressed. Please wait for him to confirm he is happy.

The only one I didn't address was using substr because we don't know the size I rely on the const char * being implicitly converted to a StringRef again.

In D131309#3734346, @jhenderson wrote:

Mostly just nits remaining from me, plus a couple of "normal behaviour" test cases I missed earlier.

FWIW, I think implementing SHT_HASH support first, as this patch does, should definitely happen. This is because the SHT_HASH is the thing that is in the gABI, and not SHT_GNU_HASH, however widely the GNU version is supported. Last I checked my company's system didn't use GNU hash tables yet, for example. I wouldn't be opposed to follow-up work for the GNU hash table though.

OK, I did not know you have internal users with DT_HASH. https://maskray.me/blog/2022-08-21-glibc-and-dt-gnu-hash#what-is-dt_gnu_hash My investigation suggests that multiple major Linux distributions have switched to --hash-style=gnu before 2010...
Nowadays DT_HASH is unlikely to be used on non-mips Linux.
My mild objection is because such a feature would nearly assuredly benefit nobody, but I think I'll not hold a strong objection.
In case related, FreeBSD rtld has supported DT_GNU_HASH since 2012.
As of today, their clang driver does not pass --hash-style= to ld, so the default --hash-style=both is used, but this does not mean that DT_HASH is mandatory.

Note that the bloom filter in a GNU hash table is optional. @jhuber6 can just change the hash function, then the existing cost is almost a DT_GNU_HASH implementation.

Presumably LLD has the GNU hash alogrithm in its source code, since it can generate such tables? Could that code be moved to libObject if needed?

In D131309#3748194, @MaskRay wrote:

In D131309#3734346, @jhenderson wrote:

Mostly just nits remaining from me, plus a couple of "normal behaviour" test cases I missed earlier.

FWIW, I think implementing SHT_HASH support first, as this patch does, should definitely happen. This is because the SHT_HASH is the thing that is in the gABI, and not SHT_GNU_HASH, however widely the GNU version is supported. Last I checked my company's system didn't use GNU hash tables yet, for example. I wouldn't be opposed to follow-up work for the GNU hash table though.

OK, I did not know you have internal users with DT_HASH. https://maskray.me/blog/2022-08-21-glibc-and-dt-gnu-hash#what-is-dt_gnu_hash My investigation suggests that multiple major Linux distributions have switched to --hash-style=gnu before 2010...
Nowadays DT_HASH is unlikely to be used on non-mips Linux.
My mild objection is because such a feature would nearly assuredly benefit nobody, but I think I'll not hold a strong objection.
In case related, FreeBSD rtld has supported DT_GNU_HASH since 2012.
As of today, their clang driver does not pass --hash-style= to ld, so the default --hash-style=both is used, but this does not mean that DT_HASH is mandatory.

Note that the bloom filter in a GNU hash table is optional. @jhuber6 can just change the hash function, then the existing cost is almost a DT_GNU_HASH implementation.

Presumably LLD has the GNU hash alogrithm in its source code, since it can generate such tables? Could that code be moved to libObject if needed?

Are you okay with me landing this patch now and adding the GNU hash support later?

jhuber6 added a child revision: D132696: [ELF] Provide the GNU hash function in libObject.Aug 25 2022, 12:24 PM

In D131309#3749907, @jhuber6 wrote:

In D131309#3748194, @MaskRay wrote:

In D131309#3734346, @jhenderson wrote:

Mostly just nits remaining from me, plus a couple of "normal behaviour" test cases I missed earlier.

FWIW, I think implementing SHT_HASH support first, as this patch does, should definitely happen. This is because the SHT_HASH is the thing that is in the gABI, and not SHT_GNU_HASH, however widely the GNU version is supported. Last I checked my company's system didn't use GNU hash tables yet, for example. I wouldn't be opposed to follow-up work for the GNU hash table though.

OK, I did not know you have internal users with DT_HASH. https://maskray.me/blog/2022-08-21-glibc-and-dt-gnu-hash#what-is-dt_gnu_hash My investigation suggests that multiple major Linux distributions have switched to --hash-style=gnu before 2010...
Nowadays DT_HASH is unlikely to be used on non-mips Linux.
My mild objection is because such a feature would nearly assuredly benefit nobody, but I think I'll not hold a strong objection.
In case related, FreeBSD rtld has supported DT_GNU_HASH since 2012.
As of today, their clang driver does not pass --hash-style= to ld, so the default --hash-style=both is used, but this does not mean that DT_HASH is mandatory.

Note that the bloom filter in a GNU hash table is optional. @jhuber6 can just change the hash function, then the existing cost is almost a DT_GNU_HASH implementation.

Presumably LLD has the GNU hash alogrithm in its source code, since it can generate such tables? Could that code be moved to libObject if needed?

Are you okay with me landing this patch now and adding the GNU hash support later?

I wish that we don't add unneeded functionality...

MaskRay requested changes to this revision.Aug 25 2022, 6:53 PM

This revision now requires changes to proceed.Aug 25 2022, 6:53 PM

In D131309#3750804, @MaskRay wrote:

I wish that we don't add unneeded functionality...

It won't be strictly unneeded as the whole point of this was to factor out something done specifically for AMDGPU offloading into a proper ELF helper function. The question then is if we should switch that handling over to use GNU_HASH instead. That's a little out of scope of the original patch series I've been trying to land so I was hoping to get this in first and work on that later. We already support other functions using DT_HASH in Object/Elf.h so I figured it would be fine to support both as many applications provide both as you said.

jhuber6 mentioned this in D132696: [ELF] Provide the GNU hash function in libObject.Aug 25 2022, 7:50 PM

OK, if openmp offloading already uses findOnlyShtHash and doesn't support mips, you may refactor the openmp code to switch to DT_GNU_HASH instead.

While DT_HASH has been mostly distinct in Linux distributions, D85742 added it. The support should be replaced with DT_GNU_HASH.

Changing the hash table amdgpu uses would be an ABI break needing work in the HSA loader and possibly the windows stack as well. I'm not aware of any inclination to do that. Glibc's deprecation decisions have no direct bearing on amdgpu as it doesn't use glibc.

This patch wants to move code from the openmp runtime into llvm. I'm not convinced that has any value relative to leaving it where it already is, perhaps Joseph would be better off applying these review induced changes to the runtime instead.

In D131309#3751136, @JonChesterfield wrote:

This patch wants to move code from the openmp runtime into llvm. I'm not convinced that has any value relative to leaving it where it already is, perhaps Joseph would be better off applying these review induced changes to the runtime instead.

That's probably the best road forward if this is a blocker, as it no longer provides the functionality we need. Most of the code can be directly put in the runtime instead, but it would be nice to have all the runtime and correctness tests.

Use sections() instead of rolling the array manually and duplicating an error check. Still handled by existing tests but the error message changed.

Harbormaster completed remote builds in B183597: Diff 455901.Aug 26 2022, 8:13 AM

@MaskRay is this feature not going to be accepted? The next patch in this series fixes a bug in AMD's support and fixing it has been long overdue now. If you don't agree with this feature let me know so I can abandon the patch and put it in the runtime instead so we can move forward.

In D131309#3751136, @JonChesterfield wrote:

Changing the hash table amdgpu uses would be an ABI break needing work in the HSA loader and possibly the windows stack as well. I'm not aware of any inclination to do that. Glibc's deprecation decisions have no direct bearing on amdgpu as it doesn't use glibc.

This patch wants to move code from the openmp runtime into llvm. I'm not convinced that has any value relative to leaving it where it already is, perhaps Joseph would be better off applying these review induced changes to the runtime instead.

The host binaries do not use DT_HASH for many years. If AMDGPU wants to stick with DT_HASH, it's fine, but deviating from the host is probably a poor decision.

I do not object to using DT_HASH if AMDGPU needs it, but why does the patch expose getSymbolFromHashTable as an lib/Object API while it works well in the current setup that it stays as a private API in openmp/libomptarget/plugins/amdgpu/src/rtl.cpp?

getSymbolFromHashTable is not suitable if it only supports DT_HASH, as this is not useful for most Linux binaries:

most have only DT_GNU_HASH and no DT_HASH
for some having both, DT_GNU_HASH is used and DT_HASH is ignored
the API does some dynamic loader work which I am not sure is suitable to lib/Object. Reloation resolving and dlsym have more rules than implemented here and the implementation does not handle symbol versioning. How fast do you want? If the object only has a couple of dynamic symbols, just iterate over the whole DT_SYMTAB.

In D131309#3757002, @MaskRay wrote:

I do not object to using DT_HASH if AMDGPU needs it, but why does the patch expose getSymbolFromHashTable as an lib/Object API while it works well in the current setup that it stays as a private API in openmp/libomptarget/plugins/amdgpu/src/rtl.cpp?

In the process of transitioning libomptarget from libelf to lib/Object I figured this functionality was tied directly to ELF such that it justified pulling it out and testing directly. If you think it's not a proper fit for lib/Object you obviously have seniority here, where would this functionality go if not here? Or is this generally out of LLVM's scope.

getSymbolFromHashTable is not suitable if it only supports DT_HASH, as this is not useful for most Linux binaries:

most have only DT_GNU_HASH and no DT_HASH

for some having both, DT_GNU_HASH is used and DT_HASH is ignored

I made a patch in D132743 that adds the DT_GNU_HASH implementation, AMDGPU uses DT_HASH so I figured it was acceptable to support both given that DT_HASH isn't officially deprecated as far as I know.

the API does some dynamic loader work which I am not sure is suitable to lib/Object. Relocation resolving and dlsym have more rules than implemented here and the implementation does not handle symbol versioning. How fast do you want? If the object only has a couple of dynamic symbols, just iterate over the whole DT_SYMTAB.

I believe we simply iterated the dynamic symbol table at some point, the number of symbols depends on the user. Large applications have a sufficiently large number of device functions / variables to justify a faster lookup.

Yes, I believe the shortcomings of this implementation as it relates to dynamic loading is that we require section headers and do not handle symbol versions as you said. A better interface just for this could potentially have a getSymbolFromGnuHashTable and getSymbolFromSysVHashTable that don't use any section headers. Then have a function that attempts to find the relevant sections and call either one of those functions based on the ELF with optional versioning information. But as you mentioned this is pretty much re-implementing dlsym, which I don't know if LLVM has any desire to support that kind of functionality outside of our use-case.

if you're against including this functionality we can simply merge it back into libomptarget. Personally I feel there is utility in having ELF hash table traversal present in LLVM, as is making the gnu_hash function readily available as in D132696. If you feel like this shouldn't be provided by LLVM we can easily move it back into libomptarget. Alternatively, if you provide a path forward to provide something that you find useful from this we can move forward with that.

In D131309#3759988, @jhuber6 wrote:

In D131309#3757002, @MaskRay wrote:

I do not object to using DT_HASH if AMDGPU needs it, but why does the patch expose getSymbolFromHashTable as an lib/Object API while it works well in the current setup that it stays as a private API in openmp/libomptarget/plugins/amdgpu/src/rtl.cpp?

In the process of transitioning libomptarget from libelf to lib/Object I figured this functionality was tied directly to ELF such that it justified pulling it out and testing directly. If you think it's not a proper fit for lib/Object you obviously have seniority here, where would this functionality go if not here? Or is this generally out of LLVM's scope.

getSymbolFromHashTable is not suitable if it only supports DT_HASH, as this is not useful for most Linux binaries:

most have only DT_GNU_HASH and no DT_HASH

for some having both, DT_GNU_HASH is used and DT_HASH is ignored

I made a patch in D132743 that adds the DT_GNU_HASH implementation, AMDGPU uses DT_HASH so I figured it was acceptable to support both given that DT_HASH isn't officially deprecated as far as I know.

the API does some dynamic loader work which I am not sure is suitable to lib/Object. Relocation resolving and dlsym have more rules than implemented here and the implementation does not handle symbol versioning. How fast do you want? If the object only has a couple of dynamic symbols, just iterate over the whole DT_SYMTAB.

I believe we simply iterated the dynamic symbol table at some point, the number of symbols depends on the user. Large applications have a sufficiently large number of device functions / variables to justify a faster lookup.

Yes, I believe the shortcomings of this implementation as it relates to dynamic loading is that we require section headers and do not handle symbol versions as you said. A better interface just for this could potentially have a getSymbolFromGnuHashTable and getSymbolFromSysVHashTable that don't use any section headers. Then have a function that attempts to find the relevant sections and call either one of those functions based on the ELF with optional versioning information. But as you mentioned this is pretty much re-implementing dlsym, which I don't know if LLVM has any desire to support that kind of functionality outside of our use-case.

if you're against including this functionality we can simply merge it back into libomptarget. Personally I feel there is utility in having ELF hash table traversal present in LLVM, as is making the gnu_hash function readily available as in D132696. If you feel like this shouldn't be provided by LLVM we can easily move it back into libomptarget. Alternatively, if you provide a path forward to provide something that you find useful from this we can move forward with that.

Having the public API may lure users to improperly use it and what it does (DT_HASH) is very different from the practice.
I think the DT_HASH code should reside in libomptarget

In D131309#3760024, @MaskRay wrote:

Having the public API may lure users to improperly use it and what it does (DT_HASH) is very different from the practice.
I think the DT_HASH code should reside in libomptarget

What if we changed the interface to instead search the ELF for the hash section rather than passing it in. Then we would only use the DT_HASH section if it is the only one present.

In D131309#3760025, @jhuber6 wrote:

In D131309#3760024, @MaskRay wrote:

Having the public API may lure users to improperly use it and what it does (DT_HASH) is very different from the practice.
I think the DT_HASH code should reside in libomptarget

What if we changed the interface to instead search the ELF for the hash section rather than passing it in. Then we would only use the DT_HASH section if it is the only one present.

Can you elaborate? When adding a public API, it should be useful. If this is libomptarget specific (this DT_HASH case pretty much is), keep it there.

In D131309#3760085, @MaskRay wrote:

Can you elaborate? When adding a public API, it should be useful. If this is libomptarget specific (this DT_HASH case pretty much is), keep it there.

I'm asking because you seemed open to this patch earlier if only GNU_HASH was supported. Your main concern was that DT_HASH being present may lure users into using the wrong format. If the interface were changed users would not be able to use the wrong format. If instead you think that this is not useful at all then that's the end of it and I'll abandon this patch. There are other public API functions with single users so I figured it wasn't an issue, but it's your call.

In D131309#3760091, @jhuber6 wrote:

In D131309#3760085, @MaskRay wrote:

Can you elaborate? When adding a public API, it should be useful. If this is libomptarget specific (this DT_HASH case pretty much is), keep it there.

I'm asking because you seemed open to this patch earlier if only GNU_HASH was supported. Your main concern was that DT_HASH being present may lure users into using the wrong format. If the interface were changed users would not be able to use the wrong format. If instead you think that this is not useful at all then that's the end of it and I'll abandon this patch. There are other public API functions with single users so I figured it wasn't an issue, but it's your call.

It's not a wrong format, but it is a not useful format, nearly not in use. So having the functionality as a public API is not useful. I am somewhat concerned of people using this to do dlsym like things, too, but it is minor.
If the amdgpu offloading stuff can implement the functionality without exposing an API in LLVMObject, that'll be great.

In D131309#3765875, @MaskRay wrote:

It's not a wrong format, but it is a not useful format, nearly not in use. So having the functionality as a public API is not useful. I am somewhat concerned of people using this to do dlsym like things, too, but it is minor.
If the amdgpu offloading stuff can implement the functionality without exposing an API in LLVMObject, that'll be great.

Okay, I'll change this patch to implement an API in libomptarget for getting symbols either via GNU_HASH, DT_HASH, or by a linear search of symbols in that order. I already wrote this patch and D132743 with all the testing necessary so I can at least be confident that the implementation works without a runtime test.

Side note, I am curious if there is ever a plan for LLVM to have dlsym features given that we already have the llvm-libc project.

In D131309#3767503, @jhuber6 wrote:

Side note, I am curious if there is ever a plan for LLVM to have dlsym features given that we already have the llvm-libc project.

That probably wants bringing up on the relevant part of Discourse. If there are going to be multiple users of this ultimately, we don't want to have multiple implementations, if we can avoid it.

jhuber6 retitled this revision from [ELF] Add ability to get a symbol by name from the hash table section to [Libomptarget] Add utility functions for loading an ELF symbol by name.Sep 5 2022, 9:07 AM

jhuber6 edited the summary of this revision. (Show Details)

jhuber6 removed reviewers: MaskRay, alexander-shaposhnikov, jhenderson.

Moving code into libomptarget.

Herald added a project: Restricted Project. · View Herald TranscriptSep 5 2022, 9:08 AM

Herald added subscribers: openmp-commits, mgorny. · View Herald Transcript

jhuber6 mentioned this in D132743: [ELF] Add ability to get a symbol by name from the GNU_HASH table section.Sep 5 2022, 9:10 AM

Harbormaster completed remote builds in B185126: Diff 458028.Sep 5 2022, 9:11 AM

Herald added a subscriber: sstefan1. · View Herald TranscriptSep 5 2022, 9:11 AM

I think we've got two users in libomptarget - the amdgpu plugin where the code was originally, and the cuda plugin where we'd like to call it from. That seems a good reason to put it under plugins/common, and we can move it to llvm proper if a user outside of libomptarget shows up. It's not a disaster if we end up with a couple of implementations of this by accident given it's a hashtable lookup.

Pulling in user of this function so it's easier to see what it's replacing.

Herald added subscribers: kosarev, kerbowa, jvesely. · View Herald TranscriptSep 7 2022, 4:48 AM

Harbormaster completed remote builds in B185389: Diff 458422.Sep 7 2022, 4:52 AM

Replacing REPORT with DP.

Implementation looks reasonable. Extra const annotations are fine. I'm not totally convinced implementation complexity has gone down - we're down 70 lines and up 210 - what're the users of the second format of hashtable, and of the linear scan? I'm guessing cuda?

openmp/libomptarget/plugins/amdgpu/src/rtl.cpp
1617	REPORT -> DP for consistency with the rest of the plugin I think
openmp/libomptarget/plugins/common/elf_common/ELFSymbols.cpp
136	So one of these is presumably dead, since the only caller is amdgpu. Does cuda use the other sort of hash table? If not, suggest we delete the unused code path for now

JonChesterfield accepted this revision.Sep 7 2022, 5:03 AM

JonChesterfield added inline comments.

openmp/libomptarget/plugins/amdgpu/src/rtl.cpp
1613–1618	Possible follow up is to have this return an Expected<SymbolInfo> but that's not worth doing inline with this patch

This revision is now accepted and ready to land.Sep 7 2022, 5:03 AM

Harbormaster completed remote builds in B185392: Diff 458426.Sep 7 2022, 5:03 AM

jhuber6 added inline comments.Sep 7 2022, 5:09 AM

openmp/libomptarget/plugins/amdgpu/src/rtl.cpp
1613–1618	Once the plugins are rewritten we'll probably switch over.
openmp/libomptarget/plugins/common/elf_common/ELFSymbols.cpp
136	I don't think CUDA even uses a hash table as it's some weird executable, for that we will need the exhaustive search below. I mostly kept this because we may have more users in the future and it was easier to just include it all since I already wrote it.

JonChesterfield added inline comments.Sep 7 2022, 5:11 AM

openmp/libomptarget/plugins/common/elf_common/ELFSymbols.cpp
136	Yeah, none of that. Delete it please.

Closed by commit rG894531f59beb: [Libomptarget] Add utility functions for loading an ELF symbol by name (authored by jhuber6). · Explain WhySep 7 2022, 10:39 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG894531f59beb: [Libomptarget] Add utility functions for loading an ELF symbol by name.

Revision Contents

Path

Size

openmp/

libomptarget/

plugins/

amdgpu/

src/

rtl.cpp

147 lines

common/

elf_common/

CMakeLists.txt

2 lines

ELFSymbols.h

27 lines

ELFSymbols.cpp

193 lines

Diff 458509

openmp/libomptarget/plugins/amdgpu/src/rtl.cpp

//===--- amdgpu/src/rtl.cpp --------------------------------------- C++ -*-===//		//===--- amdgpu/src/rtl.cpp --------------------------------------- C++ -*-===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// RTL for AMD hsa machine		// RTL for AMD hsa machine
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		#include "llvm/ADT/StringMap.h"
		#include "llvm/ADT/StringRef.h"
		#include "llvm/Frontend/OpenMP/OMPConstants.h"
		#include "llvm/Frontend/OpenMP/OMPGridValues.h"
		#include "llvm/Object/ELF.h"
		#include "llvm/Object/ELFObjectFile.h"

#include <algorithm>		#include <algorithm>
#include <assert.h>		#include <assert.h>
#include <cstdio>		#include <cstdio>
#include <cstdlib>		#include <cstdlib>
#include <cstring>		#include <cstring>
#include <functional>		#include <functional>
#include <libelf.h>		#include <libelf.h>
#include <list>		#include <list>
#include <memory>		#include <memory>
#include <mutex>		#include <mutex>
#include <shared_mutex>		#include <shared_mutex>
#include <unordered_map>		#include <unordered_map>
#include <vector>		#include <vector>

		#include "ELFSymbols.h"
#include "impl_runtime.h"		#include "impl_runtime.h"
#include "interop_hsa.h"		#include "interop_hsa.h"

#include "internal.h"		#include "internal.h"
#include "rt.h"		#include "rt.h"

#include "DeviceEnvironment.h"		#include "DeviceEnvironment.h"
#include "get_elf_mach_gfx_name.h"		#include "get_elf_mach_gfx_name.h"
#include "omptargetplugin.h"		#include "omptargetplugin.h"
#include "print_tracing.h"		#include "print_tracing.h"

#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"
#include "llvm/Frontend/OpenMP/OMPConstants.h"
#include "llvm/Frontend/OpenMP/OMPGridValues.h"

using namespace llvm;		using namespace llvm;
		using namespace llvm::object;

// hostrpc interface, FIXME: consider moving to its own include these are		// hostrpc interface, FIXME: consider moving to its own include these are
// statically linked into amdgpu/plugin if present from hostrpc_services.a,		// statically linked into amdgpu/plugin if present from hostrpc_services.a,
// linked as --whole-archive to override the weak symbols that are used to		// linked as --whole-archive to override the weak symbols that are used to
// implement a fallback for toolchains that do not yet have a hostrpc library.		// implement a fallback for toolchains that do not yet have a hostrpc library.
extern "C" {		extern "C" {
uint64_t hostrpc_assign_buffer(hsa_agent_t Agent, hsa_queue_t *ThisQ,		uint64_t hostrpc_assign_buffer(hsa_agent_t Agent, hsa_queue_t *ThisQ,
uint32_t DeviceId);		uint32_t DeviceId);
▲ Show 20 Lines • Show All 1,543 Lines • ▼ Show 20 Lines
template <typename T> bool enforceUpperBound(T *Value, T Upper) {		template <typename T> bool enforceUpperBound(T *Value, T Upper) {
bool Changed = *Value > Upper;		bool Changed = *Value > Upper;
if (Changed) {		if (Changed) {
*Value = Upper;		*Value = Upper;
}		}
return Changed;		return Changed;
}		}

Elf64_Shdr findOnlyShtHash(Elf Elf) {
size_t N;
int Rc = elf_getshdrnum(Elf, &N);
if (Rc != 0) {
return nullptr;
}

Elf64_Shdr *Result = nullptr;
for (size_t I = 0; I < N; I++) {
Elf_Scn *Scn = elf_getscn(Elf, I);
if (Scn) {
Elf64_Shdr *Shdr = elf64_getshdr(Scn);
if (Shdr) {
if (Shdr->sh_type == SHT_HASH) {
if (Result == nullptr) {
Result = Shdr;
} else {
// multiple SHT_HASH sections not handled
return nullptr;
}
}
}
}
}
return Result;
}

const Elf64_Sym elfLookup(Elf Elf, char Base, Elf64_Shdr SectionHash,
const char *Symname) {

assert(SectionHash);
size_t SectionSymtabIndex = SectionHash->sh_link;
Elf64_Shdr *SectionSymtab =
elf64_getshdr(elf_getscn(Elf, SectionSymtabIndex));
size_t SectionStrtabIndex = SectionSymtab->sh_link;

const Elf64_Sym *Symtab =
reinterpret_cast<const Elf64_Sym *>(Base + SectionSymtab->sh_offset);

const uint32_t *Hashtab =
reinterpret_cast<const uint32_t *>(Base + SectionHash->sh_offset);

// Layout:
// nbucket
// nchain
// bucket[nbucket]
// chain[nchain]
uint32_t Nbucket = Hashtab[0];
const uint32_t *Bucket = &Hashtab[2];
const uint32_t *Chain = &Hashtab[Nbucket + 2];

const size_t Max = strlen(Symname) + 1;
const uint32_t Hash = elf_hash(Symname);
for (uint32_t I = Bucket[Hash % Nbucket]; I != 0; I = Chain[I]) {
char *N = elf_strptr(Elf, SectionStrtabIndex, Symtab[I].st_name);
if (strncmp(Symname, N, Max) == 0) {
return &Symtab[I];
}
}

return nullptr;
}

struct SymbolInfo {		struct SymbolInfo {
void *Addr = nullptr;		const void *Addr = nullptr;
uint32_t Size = UINT32_MAX;		uint32_t Size = UINT32_MAX;
uint32_t ShType = SHT_NULL;		uint32_t ShType = SHT_NULL;
};		};

int getSymbolInfoWithoutLoading(Elf Elf, char Base, const char *Symname,		int getSymbolInfoWithoutLoading(const ELFObjectFile<ELF64LE> &ELFObj,
SymbolInfo *Res) {		StringRef SymName, SymbolInfo *Res) {
if (elf_kind(Elf) != ELF_K_ELF) {		auto SymOrErr = getELFSymbol(ELFObj, SymName);
return 1;		if (!SymOrErr) {
}		std::string ErrorString = toString(SymOrErr.takeError());
		JonChesterfieldUnsubmitted Not Done Reply Inline Actions REPORT -> DP for consistency with the rest of the plugin I think JonChesterfield: REPORT -> DP for consistency with the rest of the plugin I think
		DP("Failed ELF lookup: %s\n", ErrorString.c_str());
		JonChesterfieldUnsubmitted Not Done Reply Inline Actions Possible follow up is to have this return an Expected<SymbolInfo> but that's not worth doing inline with this patch JonChesterfield: Possible follow up is to have this return an Expected<SymbolInfo> but that's not worth doing…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions Once the plugins are rewritten we'll probably switch over. jhuber6: Once the plugins are rewritten we'll probably switch over.
Elf64_Shdr *SectionHash = findOnlyShtHash(Elf);
if (!SectionHash) {
return 1;
}

const Elf64_Sym *Sym = elfLookup(Elf, Base, SectionHash, Symname);
if (!Sym) {
return 1;
}

if (Sym->st_size > UINT32_MAX) {
return 1;
}

if (Sym->st_shndx == SHN_UNDEF) {
return 1;		return 1;
}		}
		if (!*SymOrErr)
Elf_Scn *Section = elf_getscn(Elf, Sym->st_shndx);
if (!Section) {
return 1;		return 1;
}

Elf64_Shdr *Header = elf64_getshdr(Section);		auto SymSecOrErr = ELFObj.getELFFile().getSection((*SymOrErr)->st_shndx);
if (!Header) {		if (!SymSecOrErr) {
		std::string ErrorString = toString(SymOrErr.takeError());
		DP("Failed ELF lookup: %s\n", ErrorString.c_str());
return 1;		return 1;
}		}

Res->Addr = Sym->st_value + Base;		Res->Addr = (*SymOrErr)->st_value + ELFObj.getELFFile().base();
Res->Size = static_cast<uint32_t>(Sym->st_size);		Res->Size = static_cast<uint32_t>((*SymOrErr)->st_size);
Res->ShType = Header->sh_type;		Res->ShType = static_cast<uint32_t>((*SymSecOrErr)->sh_type);
return 0;		return 0;
}		}

int getSymbolInfoWithoutLoading(char Base, size_t ImgSize, const char Symname,		int getSymbolInfoWithoutLoading(char Base, size_t ImgSize, const char SymName,
SymbolInfo *Res) {		SymbolInfo *Res) {
Elf *Elf = elf_memory(Base, ImgSize);		StringRef Buffer = StringRef(Base, ImgSize);
if (Elf) {		auto ElfOrErr = ObjectFile::createELFObjectFile(MemoryBufferRef(Buffer, ""),
int Rc = getSymbolInfoWithoutLoading(Elf, Base, Symname, Res);		/InitContent=/false);
elf_end(Elf);		if (!ElfOrErr) {
return Rc;		REPORT("Failed to load ELF: %s\n", toString(ElfOrErr.takeError()).c_str());
		return 1;
}		}

		if (const auto *ELFObj = dyn_cast<ELF64LEObjectFile>(ElfOrErr->get()))
		return getSymbolInfoWithoutLoading(*ELFObj, SymName, Res);
return 1;		return 1;
}		}

hsa_status_t interopGetSymbolInfo(char *Base, size_t ImgSize,		hsa_status_t interopGetSymbolInfo(char *Base, size_t ImgSize,
const char SymName, void *VarAddr,		const char SymName, const void *VarAddr,
uint32_t *VarSize) {		uint32_t *VarSize) {
SymbolInfo SI;		SymbolInfo SI;
int Rc = getSymbolInfoWithoutLoading(Base, ImgSize, SymName, &SI);		int Rc = getSymbolInfoWithoutLoading(Base, ImgSize, SymName, &SI);
if (Rc == 0) {		if (Rc == 0) {
*VarAddr = SI.Addr;		*VarAddr = SI.Addr;
*VarSize = SI.Size;		*VarSize = SI.Size;
return HSA_STATUS_SUCCESS;		return HSA_STATUS_SUCCESS;
}		}
▲ Show 20 Lines • Show All 754 Lines • ▼ Show 20 Lines	struct KernDescValType {
uint16_t TSize;		uint16_t TSize;
uint16_t WGSize;		uint16_t WGSize;
};		};
struct KernDescValType KernDescVal;		struct KernDescValType KernDescVal;
std::string KernDescNameStr(E->name);		std::string KernDescNameStr(E->name);
KernDescNameStr += "_kern_desc";		KernDescNameStr += "_kern_desc";
const char *KernDescName = KernDescNameStr.c_str();		const char *KernDescName = KernDescNameStr.c_str();

void *KernDescPtr;		const void *KernDescPtr;
uint32_t KernDescSize;		uint32_t KernDescSize;
void *CallStackAddr = nullptr;		void *CallStackAddr = nullptr;
Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, KernDescName,		Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, KernDescName,
&KernDescPtr, &KernDescSize);		&KernDescPtr, &KernDescSize);

if (Err == HSA_STATUS_SUCCESS) {		if (Err == HSA_STATUS_SUCCESS) {
if ((size_t)KernDescSize != sizeof(KernDescVal))		if ((size_t)KernDescSize != sizeof(KernDescVal))
DP("Loading global computation properties '%s' - size mismatch (%u != "		DP("Loading global computation properties '%s' - size mismatch (%u != "
Show All 22 Lines	for (__tgt_offload_entry *E = HostBegin; E != HostEnd; ++E) {
} else {		} else {
DP("Warning: Loading KernDesc '%s' - symbol not found, ", KernDescName);		DP("Warning: Loading KernDesc '%s' - symbol not found, ", KernDescName);

// Flat group size		// Flat group size
std::string WGSizeNameStr(E->name);		std::string WGSizeNameStr(E->name);
WGSizeNameStr += "_wg_size";		WGSizeNameStr += "_wg_size";
const char *WGSizeName = WGSizeNameStr.c_str();		const char *WGSizeName = WGSizeNameStr.c_str();

void *WGSizePtr;		const void *WGSizePtr;
uint32_t WGSize;		uint32_t WGSize;
Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, WGSizeName,		Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, WGSizeName,
&WGSizePtr, &WGSize);		&WGSizePtr, &WGSize);

if (Err == HSA_STATUS_SUCCESS) {		if (Err == HSA_STATUS_SUCCESS) {
if ((size_t)WGSize != sizeof(int16_t)) {		if ((size_t)WGSize != sizeof(int16_t)) {
DP("Loading global computation properties '%s' - size mismatch (%u "		DP("Loading global computation properties '%s' - size mismatch (%u "
"!= "		"!= "
Show All 22 Lines	if (Err == HSA_STATUS_SUCCESS) {
check("Loading WGSize computation property", Err);		check("Loading WGSize computation property", Err);
}		}

// Read execution mode from global in binary		// Read execution mode from global in binary
std::string ExecModeNameStr(E->name);		std::string ExecModeNameStr(E->name);
ExecModeNameStr += "_exec_mode";		ExecModeNameStr += "_exec_mode";
const char *ExecModeName = ExecModeNameStr.c_str();		const char *ExecModeName = ExecModeNameStr.c_str();

void *ExecModePtr;		const void *ExecModePtr;
uint32_t VarSize;		uint32_t VarSize;
Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, ExecModeName,		Err = interopGetSymbolInfo((char *)Image->ImageStart, ImgSize, ExecModeName,
&ExecModePtr, &VarSize);		&ExecModePtr, &VarSize);

if (Err == HSA_STATUS_SUCCESS) {		if (Err == HSA_STATUS_SUCCESS) {
if ((size_t)VarSize != sizeof(llvm::omp::OMPTgtExecModeFlags)) {		if ((size_t)VarSize != sizeof(llvm::omp::OMPTgtExecModeFlags)) {
DP("Loading global computation properties '%s' - size mismatch(%u != "		DP("Loading global computation properties '%s' - size mismatch(%u != "
"%lu)\n",		"%lu)\n",
▲ Show 20 Lines • Show All 193 Lines • Show Last 20 Lines

openmp/libomptarget/plugins/common/elf_common/CMakeLists.txt

	##===----------------------------------------------------------------------===##			##===----------------------------------------------------------------------===##
	#			#
	# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	# See https://llvm.org/LICENSE.txt for license information.			# See https://llvm.org/LICENSE.txt for license information.
	# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	#			#
	##===----------------------------------------------------------------------===##			##===----------------------------------------------------------------------===##
	#			#
	# Common ELF functionality for target plugins			# Common ELF functionality for target plugins
	#			#
	##===----------------------------------------------------------------------===##			##===----------------------------------------------------------------------===##

	add_library(elf_common OBJECT elf_common.cpp)			add_library(elf_common OBJECT elf_common.cpp ELFSymbols.cpp)

	# Build elf_common with PIC to be able to link it with plugin shared libraries.			# Build elf_common with PIC to be able to link it with plugin shared libraries.
	set_property(TARGET elf_common PROPERTY POSITION_INDEPENDENT_CODE ON)			set_property(TARGET elf_common PROPERTY POSITION_INDEPENDENT_CODE ON)
	llvm_update_compile_flags(elf_common)			llvm_update_compile_flags(elf_common)
	set(LINK_LLVM_LIBS LLVMBinaryFormat LLVMObject LLVMSupport)			set(LINK_LLVM_LIBS LLVMBinaryFormat LLVMObject LLVMSupport)
	if (LLVM_LINK_LLVM_DYLIB)			if (LLVM_LINK_LLVM_DYLIB)
	set(LINK_LLVM_LIBS LLVM)			set(LINK_LLVM_LIBS LLVM)
	endif()			endif()
	Show All 9 Lines

openmp/libomptarget/plugins/common/elf_common/ELFSymbols.h

This file was added.

				//===-- ELFSymbols.h - ELF Symbol look-up functionality ---------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// ELF routines for obtaining a symbol from an Elf file without loading it.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_OPENMP_LIBOMPTARGET_PLUGINS_COMMON_ELF_COMMON_ELF_SYMBOLS_H
				#define LLVM_OPENMP_LIBOMPTARGET_PLUGINS_COMMON_ELF_COMMON_ELF_SYMBOLS_H

				#include "llvm/Object/ELF.h"
				#include "llvm/Object/ELFObjectFile.h"

				/// Returns the symbol associated with the \p Name in the \p ELFObj. It will
				/// first search for the hash sections to identify symbols from the hash table.
				/// If that fails it will fall back to a linear search in the case of an
				/// executable file without a hash table.
				llvm::Expected<const typename llvm::object::ELF64LE::Sym *>
				getELFSymbol(const llvm::object::ELFObjectFile<llvm::object::ELF64LE> &ELFObj,
				llvm::StringRef Name);

				#endif

openmp/libomptarget/plugins/common/elf_common/ELFSymbols.cpp

This file was added.

				#include "ELFSymbols.h"

				using namespace llvm;
				using namespace llvm::object;
				using namespace llvm::ELF;

				template <class ELFT>
				static Expected<const typename ELFT::Sym *>
				getSymbolFromGnuHashTable(StringRef Name, const typename ELFT::GnuHash &HashTab,
				ArrayRef<typename ELFT::Sym> SymTab,
				StringRef StrTab) {
				const uint32_t NameHash = hashGnu(Name);
				const typename ELFT::Word NBucket = HashTab.nbuckets;
				const typename ELFT::Word SymOffset = HashTab.symndx;
				ArrayRef<typename ELFT::Off> Filter = HashTab.filter();
				ArrayRef<typename ELFT::Word> Bucket = HashTab.buckets();
				ArrayRef<typename ELFT::Word> Chain = HashTab.values(SymTab.size());

				// Check the bloom filter and exit early if the symbol is not present.
				uint64_t ElfClassBits = ELFT::Is64Bits ? 64 : 32;
				typename ELFT::Off Word =
				Filter[(NameHash / ElfClassBits) % HashTab.maskwords];
				uint64_t Mask = (0x1ull << (NameHash % ElfClassBits)) \|
				(0x1ull << ((NameHash >> HashTab.shift2) % ElfClassBits));
				if ((Word & Mask) != Mask)
				return nullptr;

				// The symbol may or may not be present, check the hash values.
				for (typename ELFT::Word I = Bucket[NameHash % NBucket];
				I >= SymOffset && I < SymTab.size(); I = I + 1) {
				const uint32_t ChainHash = Chain[I - SymOffset];

				if ((NameHash \| 0x1) != (ChainHash \| 0x1))
				continue;

				if (SymTab[I].st_name >= StrTab.size())
				return createError("symbol [index " + Twine(I) +
				"] has invalid st_name: " + Twine(SymTab[I].st_name));
				if (StrTab.drop_front(SymTab[I].st_name).data() == Name)
				return &SymTab[I];

				if (ChainHash & 0x1)
				return nullptr;
				}
				return nullptr;
				}

				template <class ELFT>
				static Expected<const typename ELFT::Sym *>
				getSymbolFromSysVHashTable(StringRef Name, const typename ELFT::Hash &HashTab,
				ArrayRef<typename ELFT::Sym> SymTab,
				StringRef StrTab) {
				const uint32_t Hash = hashSysV(Name);
				const typename ELFT::Word NBucket = HashTab.nbucket;
				ArrayRef<typename ELFT::Word> Bucket = HashTab.buckets();
				ArrayRef<typename ELFT::Word> Chain = HashTab.chains();
				for (typename ELFT::Word I = Bucket[Hash % NBucket]; I != ELF::STN_UNDEF;
				I = Chain[I]) {
				if (I >= SymTab.size())
				return createError(
				"symbol [index " + Twine(I) +
				"] is greater than the number of symbols: " + Twine(SymTab.size()));
				if (SymTab[I].st_name >= StrTab.size())
				return createError("symbol [index " + Twine(I) +
				"] has invalid st_name: " + Twine(SymTab[I].st_name));

				if (StrTab.drop_front(SymTab[I].st_name).data() == Name)
				return &SymTab[I];
				}
				return nullptr;
				}

				template <class ELFT>
				static Expected<const typename ELFT::Sym *>
				getHashTableSymbol(const ELFFile<ELFT> &Elf, const typename ELFT::Shdr &Sec,
				StringRef Name) {
				if (Sec.sh_type != ELF::SHT_HASH && Sec.sh_type != ELF::SHT_GNU_HASH)
				return createError(
				"invalid sh_type for hash table, expected SHT_HASH or SHT_GNU_HASH");
				Expected<typename ELFT::ShdrRange> SectionsOrError = Elf.sections();
				if (!SectionsOrError)
				return SectionsOrError.takeError();

				auto SymTabOrErr = getSection<ELFT>(*SectionsOrError, Sec.sh_link);
				if (!SymTabOrErr)
				return SymTabOrErr.takeError();

				auto StrTabOrErr =
				Elf.getStringTableForSymtab(*SymTabOrErr, SectionsOrError);
				if (!StrTabOrErr)
				return StrTabOrErr.takeError();
				StringRef StrTab = *StrTabOrErr;

				auto SymsOrErr = Elf.symbols(*SymTabOrErr);
				if (!SymsOrErr)
				return SymsOrErr.takeError();
				ArrayRef<typename ELFT::Sym> SymTab = *SymsOrErr;

				// If this is a GNU hash table we verify its size and search the symbol
				// table using the GNU hash table format.
				if (Sec.sh_type == ELF::SHT_GNU_HASH) {
				const typename ELFT::GnuHash *HashTab =
				reinterpret_cast<const typename ELFT::GnuHash *>(Elf.base() +
				Sec.sh_offset);
				if (Sec.sh_offset + Sec.sh_size >= Elf.getBufSize())
				return createError("section has invalid sh_offset: " +
				Twine(Sec.sh_offset));
				if (Sec.sh_size < sizeof(typename ELFT::GnuHash) \|\|
				Sec.sh_size <
				sizeof(typename ELFT::GnuHash) +
				sizeof(typename ELFT::Word) * HashTab->maskwords +
				sizeof(typename ELFT::Word) * HashTab->nbuckets +
				sizeof(typename ELFT::Word) * (SymTab.size() - HashTab->symndx))
				return createError("section has invalid sh_size: " + Twine(Sec.sh_size));
				return getSymbolFromGnuHashTable<ELFT>(Name, *HashTab, SymTab, StrTab);
				}

				// If this is a Sys-V hash table we verify its size and search the symbol
				// table using the Sys-V hash table format.
				if (Sec.sh_type == ELF::SHT_HASH) {
				const typename ELFT::Hash *HashTab =
				reinterpret_cast<const typename ELFT::Hash *>(Elf.base() +
				Sec.sh_offset);
				if (Sec.sh_offset + Sec.sh_size >= Elf.getBufSize())
				return createError("section has invalid sh_offset: " +
				Twine(Sec.sh_offset));
				if (Sec.sh_size < sizeof(typename ELFT::Hash) \|\|
				Sec.sh_size < sizeof(typename ELFT::Hash) +
				sizeof(typename ELFT::Word) * HashTab->nbucket +
				sizeof(typename ELFT::Word) * HashTab->nchain)
				return createError("section has invalid sh_size: " + Twine(Sec.sh_size));

				return getSymbolFromSysVHashTable<ELFT>(Name, *HashTab, SymTab, StrTab);
				}

				return nullptr;
				JonChesterfieldUnsubmitted Not Done Reply Inline Actions So one of these is presumably dead, since the only caller is amdgpu. Does cuda use the other sort of hash table? If not, suggest we delete the unused code path for now JonChesterfield: So one of these is presumably dead, since the only caller is amdgpu. Does cuda use the other…
				jhuber6AuthorUnsubmitted Done Reply Inline Actions I don't think CUDA even uses a hash table as it's some weird executable, for that we will need the exhaustive search below. I mostly kept this because we may have more users in the future and it was easier to just include it all since I already wrote it. jhuber6: I don't think CUDA even uses a hash table as it's some weird executable, for that we will need…
				JonChesterfieldUnsubmitted Not Done Reply Inline Actions Yeah, none of that. Delete it please. JonChesterfield: Yeah, none of that. Delete it please.
				}

				template <class ELFT>
				static Expected<const typename ELFT::Sym *>
				getSymTableSymbol(const ELFFile<ELFT> &Elf, const typename ELFT::Shdr &Sec,
				StringRef Name) {
				if (Sec.sh_type != ELF::SHT_SYMTAB && Sec.sh_type != ELF::SHT_DYNSYM)
				return createError(
				"invalid sh_type for hash table, expected SHT_SYMTAB or SHT_DYNSYM");
				Expected<typename ELFT::ShdrRange> SectionsOrError = Elf.sections();
				if (!SectionsOrError)
				return SectionsOrError.takeError();

				auto StrTabOrErr = Elf.getStringTableForSymtab(Sec, *SectionsOrError);
				if (!StrTabOrErr)
				return StrTabOrErr.takeError();
				StringRef StrTab = *StrTabOrErr;

				auto SymsOrErr = Elf.symbols(&Sec);
				if (!SymsOrErr)
				return SymsOrErr.takeError();
				ArrayRef<typename ELFT::Sym> SymTab = *SymsOrErr;

				for (const typename ELFT::Sym &Sym : SymTab)
				if (StrTab.drop_front(Sym.st_name).data() == Name)
				return &Sym;

				return nullptr;
				}

				Expected<const typename ELF64LE::Sym *>
				getELFSymbol(const ELFObjectFile<ELF64LE> &ELFObj, StringRef Name) {
				// First try to look up the symbol via the hash table.
				for (ELFSectionRef Sec : ELFObj.sections()) {
				if (Sec.getType() != SHT_HASH && Sec.getType() != SHT_GNU_HASH)
				continue;

				auto HashTabOrErr = ELFObj.getELFFile().getSection(Sec.getIndex());
				if (!HashTabOrErr)
				return HashTabOrErr.takeError();
				return getHashTableSymbol<ELF64LE>(ELFObj.getELFFile(), **HashTabOrErr,
				Name);
				}

				// If this is an executable file check the entire standard symbol table.
				for (ELFSectionRef Sec : ELFObj.sections()) {
				if (Sec.getType() != SHT_SYMTAB)
				continue;

				auto SymTabOrErr = ELFObj.getELFFile().getSection(Sec.getIndex());
				if (!SymTabOrErr)
				return SymTabOrErr.takeError();
				return getSymTableSymbol<ELF64LE>(ELFObj.getELFFile(), **SymTabOrErr, Name);
				}

				return nullptr;
				}

This is an archive of the discontinued LLVM Phabricator instance.

[Libomptarget] Add utility functions for loading an ELF symbol by nameClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 458509

openmp/libomptarget/plugins/amdgpu/src/rtl.cpp

openmp/libomptarget/plugins/common/elf_common/CMakeLists.txt

openmp/libomptarget/plugins/common/elf_common/ELFSymbols.h

openmp/libomptarget/plugins/common/elf_common/ELFSymbols.cpp

[Libomptarget] Add utility functions for loading an ELF symbol by name
ClosedPublic