This is an archive of the discontinued LLVM Phabricator instance.

Differential D55352

[elfabi] Introduce tool for ELF TextAPI
ClosedPublic

Authored by amontanez on Dec 5 2018, 8:32 PM.

Details

Reviewers

jakehehrlich
phosek
ruiu
echristo
jhenderson
mcgrathr

Commits

rG31f0f659a8f4: [elfabi] Introduce tool for ELF TextAPI
rL350341: [elfabi] Introduce tool for ELF TextAPI

Summary

Follow up for D53051

This patch introduces the tool associated with the ELF implementation of TextAPI (previously llvm-tapi, renamed for better distinction). This tool will house a number of features related to analysis and manipulation of shared objects' exposed interfaces. The first major feature for this tool is support for producing binary stubs that are useful for compile-time linking of shared objects. This patch introduces the beginnings of support for reading binary ELF objects to work towards that goal.

Added:

elfabi tool.
Support for reading architecture from a binary ELF file into an ELFStub.
Support for writing .tbe files.

Diff Detail

Repository: rL LLVM

Event Timeline

amontanez created this revision.Dec 5 2018, 8:32 PM

Herald added subscribers: llvm-commits, mgorny.Dec 5 2018, 8:32 PM

jakehehrlich added inline comments.Dec 6 2018, 2:19 PM

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
29 ↗	(On Diff #176915)	If something should never be null, you generally should use a reference. If you can't use a reference it is often (but not always) the case that something is wrong with the design. I generally prefer references to pointers.
48 ↗	(On Diff #176915)	Yeah I think this should just be a single function and the other classes shouldn't exist.
52 ↗	(On Diff #176915)	Don't consume the error, return it. Also I do this awful thing throughout llvm-objcopy where I just hard fail the program and exit as soon as something bad happens...you haven't done that but I'd like to restress that you shouldn't do that.
71 ↗	(On Diff #176915)	technically it could just be any binary format you didn't expect. Also this should probably return an Expected unique pointer that is never null.
llvm/tools/llvm-elfabi/ELFObjHandler.h
34 ↗	(On Diff #176915)	Can you just make this a function? I don't think it needs a whole class at the moment and I'm not sure it ever will.
43 ↗	(On Diff #176915)	I don't think this interface makes sense. What does constructing an ELFStubBuilder without an ELF object file accomplish? Also if we 100% expect this to be empty, maybe the interface that is used should return a new ELFStub rather than requiring that we give it an empty one.
57 ↗	(On Diff #176915)	It doesn't seem like this should be a class. It contains no state and has one function. Functions are your friend, unnecessary classes are the enemy.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
24–25 ↗	(On Diff #176915)	Having this as global is bad. Globals (sans use cases like options) mean one of two things 1) you shouldn't have the class in the first place 2) you're maintaining global state which is almost always bad.
61 ↗	(On Diff #176915)	As you might have heard me just speak with Roland the SoName should be optional and never synthetic.
68 ↗	(On Diff #176915)	Return an error rather than terminating even here. This is an example of that terrible thing that I did. I also let that reality seep into all the nooks and crannies of my interfaces.
79 ↗	(On Diff #176915)	Writing should just write, not modify. If you update the ELFStub it should go 1) Modify 2) Write.
llvm/tools/llvm-elfabi/llvm-elfabi.h
22 ↗	(On Diff #176915)	I realize I do this in llvm-objcopy but I do it so that I can exit with an error from wherever. It's really better to propagate errors all the way back to main or near main and handle them once there if possible. That might not be a perfect fit but if you find that you can plumb all the errors back to main, you should remove these functions and this file.
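
The advice in the comments above (return errors instead of exiting, and never hand back a null pointer on success) can be sketched in plain C++ without LLVM. This is only an illustration under stated assumptions: `Stub`, `StubOrError`, `readStub` and `run` are hypothetical names, and `StubOrError` is a simplified stand-in for `Expected<std::unique_ptr<ELFStub>>`, not the LLVM class.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for ELFStub; only one field is modeled.
struct Stub {
  int Arch = 0;
};

// Hypothetical stand-in for Expected<std::unique_ptr<ELFStub>>: holds either
// a non-null stub or an error message.
class StubOrError {
public:
  StubOrError(std::unique_ptr<Stub> S) : Value(std::move(S)) {
    assert(Value && "a success value must never be null");
  }
  StubOrError(std::string Msg) : Message(std::move(Msg)) {}

  explicit operator bool() const { return Value != nullptr; }
  Stub &operator*() { return *Value; }
  const std::string &error() const { return Message; }

private:
  std::unique_ptr<Stub> Value;
  std::string Message;
};

// Library-style reader: reports failure to the caller and never exits.
StubOrError readStub(const std::string &Bytes) {
  if (Bytes.empty())
    return StubOrError(std::string("input is empty"));
  std::unique_ptr<Stub> S = std::make_unique<Stub>();
  S->Arch = static_cast<unsigned char>(Bytes[0]);
  return StubOrError(std::move(S));
}

// Main-like driver: the only place where an error becomes an exit code.
int run(const std::string &Bytes, std::string &Diagnostic) {
  StubOrError Result = readStub(Bytes);
  if (!Result) {
    Diagnostic = "error: " + Result.error();
    return 1;
  }
  return 0;
}
```

With this shape, only `run` decides what a failure means for the process; `readStub` stays usable from a library or a unit test.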

jakehehrlich added inline comments.Dec 6 2018, 2:19 PM

llvm/tools/llvm-elfabi/llvm-elfabi.cpp
96 ↗	(On Diff #176915)	Wait to declare this until you construct it later.
100 ↗	(On Diff #176915)	I think "unable to read file" is vague and likely covered by error already. Maybe "unable to read .tbe file '%s'" where "%s" should be the filename even if the filename might be mentioned in the error. That gives more information. You should have a test to confirm that this error happens and is handled correctly. Also do you want to take the error here?
106 ↗	(On Diff #176915)	As discussed offline, we should use Expected, check for an error (and try the next if there is one) and then create a new Error that wraps both and then chooses which one to report based on some heuristic (basically move the heuristic to the error reporting rather than the file parsing stage so that no heuristic is ever used for parsing). I know I said checking the extension would be easy but checking for ELF magic is also probably easy and more reliable so I guess now I agree with you (I was just thinking it would be hard to check the magic here). It will probably make sense to also include the file path in the error that you create so that it can report which file the error was an issue for. A really cool heuristic would be: If the first 4 bytes of the memory are ELF magic, report the ELF error (but state the assumption). If the extension is .tbe, report the TBE error (but state the assumption). Also you can search the file for If neither of those things is true, tell the user you don't know what's going on and report both. It's also acceptable to search the whole file looking for a sign that If you'd want I'd also still accept just looking at the extension.
121 ↗	(On Diff #176915)	Don't declare until you construct later.
125 ↗	(On Diff #176915)	That should probably be a part of the error already if possible.
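
The reporting heuristic proposed for line 106 above (prefer the ELF error when the file starts with ELF magic, prefer the TBE error when the extension is .tbe, otherwise report both) could look roughly like the following. It is a sketch in standard C++ only; `Report`, `hasELFMagic`, `hasTBEExtension` and `chooseReport` are hypothetical names and nothing here is taken from the patch itself.

```cpp
#include <cassert>
#include <string>

// Which reader's error to surface when every reader has failed.
enum class Report { ELFError, TBEError, Both };

// True if the buffer starts with the 4-byte ELF magic: 0x7f 'E' 'L' 'F'.
bool hasELFMagic(const std::string &Bytes) {
  return Bytes.size() >= 4 && Bytes.compare(0, 4, "\x7f" "ELF") == 0;
}

// True if the path ends in ".tbe".
bool hasTBEExtension(const std::string &Path) {
  const std::string Ext = ".tbe";
  return Path.size() >= Ext.size() &&
         Path.compare(Path.size() - Ext.size(), Ext.size(), Ext) == 0;
}

// The heuristic only selects what to report; it is never used to decide
// how the file is parsed.
Report chooseReport(const std::string &Path, const std::string &Bytes) {
  if (hasELFMagic(Bytes))
    return Report::ELFError;
  if (hasTBEExtension(Path))
    return Report::TBEError;
  return Report::Both;
}

// Usage example: a text file with an unrelated extension reports both errors.
void exampleChooseReport() {
  assert(chooseReport("notes.txt", "just some text") == Report::Both);
}
```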

Changed:

Classes removed, functions are all standalone with no state.
Now using Expected<std::unique_ptr<ELFStub>> for everything that would previously return a std::unique_ptr<ELFStub>.
Added tests to check for proper failure output.

Removed:

Automatic DT_SONAME generation.

amontanez marked 2 inline comments as done.Dec 10 2018, 1:03 PM

amontanez added inline comments.

llvm/tools/llvm-elfabi/llvm-elfabi.cpp
61 ↗	(On Diff #176915)	I've noted this and will make SoName optional in a separate patch. For now I have removed the code that would generate a SoName from the file name if the SoName was empty.
106 ↗	(On Diff #176915)	If the YAML reader fails, it will always output to stderr. Having the YAML reader second causes this to only occur if both fail. I can later augment llvm::yaml::Input so we can have more control over where YAML outputs the error to, but for now I've defaulted to outputting both errors in the situation that an unreadable file is encountered. This makes slightly more sense given that right now I can't completely filter out the YAML error if the heuristics determine the binary error is of more interest. This function's implementation isn't scalable, but it also doesn't impact design decisions of other code in this tool. I have marked a todo so I can revisit this separately. Let me know if you feel I should address this immediately.

jakehehrlich added inline comments.Dec 10 2018, 4:19 PM

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
29 ↗	(On Diff #177586)	nit: You can just put all these inside of llvm::elfabi namespace rather than repeatedly declaring them as such.
42 ↗	(On Diff #177586)	nit: Can you add some rough TODOs here on what else you need to add to get this fully featured?
61 ↗	(On Diff #177586)	nit: The type should be automatically inferred so you don't need the explicit template parameter.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
36–44 ↗	(On Diff #177586)	nit: Can you just inline these and return from main instead of calling these? I think we don't expect any other exit points. Having these functions around is just kind of asking for them to be used and I'm skeptical that's a good idea.
120–131 ↗	(On Diff #177586)	I think this should be implemented as a new aggregate error that you return. You can move the errors into the aggregate error and then the main function can report the aggregate error.

Added:

ErrorCollector for collectively handling multiple related errors.

Fixed other nits.

amontanez marked 2 inline comments as done.Dec 11 2018, 3:39 PM

jakehehrlich added inline comments.Dec 11 2018, 5:56 PM

llvm/tools/llvm-elfabi/ErrorCollector.cpp
36 ↗	(On Diff #177794)	Rather than this, when you make the error you should be left with an empty vector of errors. Long term I'd like to see this be a move of the vector into an aggregate error but for now I'm fine just consuming and clearing.
63 ↗	(On Diff #177794)	You don't need to clear since the destructor will be called after this anyway.
llvm/tools/llvm-elfabi/ErrorCollector.h
44 ↗	(On Diff #177794)	I think I'd prefer this be marked private.
61 ↗	(On Diff #177794)	I think I'd prefer this marked private for now.

Changed:

ErrorCollector now clears errors when makeError() is called.

Fixed a few other nits.

amontanez marked 2 inline comments as done.Dec 11 2018, 6:43 PM

amontanez added inline comments.

llvm/tools/llvm-elfabi/ErrorCollector.cpp
36 ↗	(On Diff #177794)	Agreed. I marked TODO at the top of this function since that's something that can be done later when it's slightly higher priority.

jakehehrlich accepted this revision.Dec 11 2018, 6:45 PM

This revision is now accepted and ready to land.Dec 11 2018, 6:45 PM

I'd like someone outside of our team like @jhenderson to review this. Landing a new tool without some outside eyes looking at it isn't very community friendly IMO.

In D55352#1327949, @jakehehrlich wrote:

I'd like someone outside of our team like @jhenderson to review this. Landing a new tool without some outside eyes looking at it isn't very community friendly IMO.

I'll try to find some time in the next 2-3 days to look over this.

Higuoxing added a subscriber: Higuoxing.Dec 12 2018, 2:38 AM

amontanez added a child revision: D55619: [elfabi] Add option to manually specify file read format.Dec 12 2018, 2:17 PM

Small change to binary-read-arch.test to make it easier to debug test failure.

jhenderson added inline comments.Dec 13 2018, 3:02 AM

llvm/test/tools/llvm-elfabi/binary-read-arch.test
12 ↗	(On Diff #177811)	Nit here and elsewhere - remove all the extra spaces between the CHECK/CHECK-NEXT and the actual text being checked. (It's okay to have the spaces after CHECK: to make the patterns match with CHECK-NEXT, but this test has extra on top of that).
3 ↗	(On Diff #177966)	Why the cat command? You can have FileCheck read a file via --input-file (also applies to other tests).
llvm/test/tools/llvm-elfabi/fail-file-open.test
5 ↗	(On Diff #177811)	Perhaps the error check here could include "NotAFileInTestingDir" at the end?
llvm/test/tools/llvm-elfabi/replace-soname-tbe.test
2 ↗	(On Diff #177966)	Nit: inconsistent use of '--' and '-' for switch prefixes.
llvm/test/tools/llvm-elfabi/tbe-read-basic.test
17 ↗	(On Diff #177966)	Don't know if it belongs in this particular test, but I feel like we should have at least one test that shows the current version is correct.
llvm/tools/LLVMBuild.txt
35 ↗	(On Diff #177811)	I'm not entirely convinced by the "elfabi" name since it isn't really an ABI, but I don't have a clearly better name.
llvm/tools/llvm-elfabi/ELFObjHandler.cpp
32 ↗	(On Diff #177966)	Nit: you should be able to remove the object::, since you're using the namespace object.
39 ↗	(On Diff #177966)	Will the ElfHeader likely be used for anything else? If not, you should fetch it inline, I think.
41 ↗	(On Diff #177966)	No need for this comment. It's obvious from the function name.
54 ↗	(On Diff #177966)	Are you anticipating this function becoming more complicated? At the moment, there's no point in it returning an error, or even really existing, since you could just set the TargetStub.Arch member at the call site instead.
llvm/tools/llvm-elfabi/ELFObjHandler.h
28 ↗	(On Diff #177966)	Pointer -> Reference. The sentence as a whole sounds a bit clunky. Maybe "Source ELFObjectFile". I don't think you need to explicitly state that it's a pointer or reference (it's in the type signature).
33 ↗	(On Diff #177966)	This comment smells a bit like this should be a member function of ELFStub. Thoughts on that?
34–35 ↗	(On Diff #177966)	who's -> whose As above, I don't think you need to explicitly say that TargetStub and Header are references. What is the ELF file header for however? That needs explaining more here.
llvm/tools/llvm-elfabi/ErrorCollector.h
1 ↗	(On Diff #177966)	My understanding of `Error` is that you don't need this class. `Error` can actually represent multiple errors. You can use `joinErrors()` to combine two `Error` instances into a single one. See the docs here.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
31 ↗	(On Diff #177966)	I think the convention tends to be for cl::opt variable names to essentially match the switch name, so this should just be "Soname" (or "SoName", or even "SOName").
32 ↗	(On Diff #177966)	Perhaps this should be --soname to match the equivalent linker flag.
39 ↗	(On Diff #177966)	Similar to populateArch, why is this a separate function and not just done inline (or possibly a member function of ELFStub)?
43 ↗	(On Diff #177966)	Again, I'm not sure there's any value in this function being out-of-line.
70 ↗	(On Diff #177966)	Move this down to first use.
85 ↗	(On Diff #177966)	I think `StubFromELF` rather than `StubFromELF.get()` is more common usage in this case. This could probably be written a bit simpler as follows:
	Expected<std::unique_ptr<ELFStub>> StubFromELF =
	    readELFFile(FileReadBuffer->getMemBufferRef());
	if (StubFromELF) {
	  return std::move(StubFromELF);
	}
	EC.addError(StubFromELF.takeError(), "BinaryRead");
111 ↗	(On Diff #177966)	This message sounds a bit clunky. Perhaps "No file readers were able to read..."?
128 ↗	(On Diff #177966)	Nit: No need for explicit return 0 in a main in C++ these days.
68 ↗	(On Diff #176915)	FWIW, in functions in the tool itself, I have no issue with errors raised within functions. I don't think it's always necessary to propagate back up to the top-level. The issue is when they are in library functions.

Changed:

Addressed several areas of feedback.

Forgot to include a few changes for a couple files.

Hopefully this clarifies the direction on some things brought up. I didn't explicitly mention that everything outside llvm-elfabi.cpp is designed as if it were part of a library. Let me know if this resolves some concerns you had or brings up new ones. I really appreciate your insight; push back again on anything you still feel needs to be changed and I'll look into it again.

llvm/tools/LLVMBuild.txt
35 ↗	(On Diff #177811)	That's very understandable. We didn't have any better ideas for names, so we went with elfabi. I'm totally open to suggestions since I understand changing this later would be a huge pain. This tool views ELF shared objects roughly from a compile-time linker's perspective, so maybe there's a better name to be had that stems from that.
llvm/tools/llvm-elfabi/ELFObjHandler.cpp
39 ↗	(On Diff #177966)	Done. I don't think we'll need to explicitly use it again.
54 ↗	(On Diff #177966)	I've changed it to `void` as I doubt this will become more complicated over time. Even though it is very simple, I would like to keep it as a function. The plan is to have one function for each ELFStub member. `buildStub()` streamlines the process of fetching the arguments needed for each function from an `ELFFile<ELFT>`, but I don't want the implementation of `buildStub()` to be the sole point of documentation on how to populate `ELFStub.Arch`. I'd like it to be very clear what information is required to populate each member. While this is simple for `Arch`, it becomes a little more involved for some of the other members. Having `populateArch()` helps with consistency in this situation. D55629 helps to better illustrate the direction I'd like to take. This is also a result of trying to design a consistent, unit-testable interface. If you depend on using complete `ELFObjectFile`s, things become significantly harder to unit test. I understand I can't unit test a tool, but that was how I originally designed it, and would be useful if this does get moved to a library.
llvm/tools/llvm-elfabi/ELFObjHandler.h
33 ↗	(On Diff #177966)	While that is definitely an appealing approach, we've committed to not having the TextAPI library depend on libObject (Juergen would like libObject to depend on some parts of the MachO TextAPI implementation). The result of that discussion was that the ELF implementation for TextAPI will mostly just contain the YAML reader/writer and the ELFStub internal representation for now. Overall, everything in this tool outside of llvm-elfabi.cpp is designed as if it were part of a library. There's two main reasons for this: It's not completely clear where the ELF binary readers/writers belong since they won't be in TextAPI. An argument could be made that they should be in a library, but a counter argument could be that binary readers/writers are only really used for the tool. I'd rather put it in the tool for now to avoid bikeshedding another library. That allows me to start adding stubbing functionality asap. Designing everything as if it were part of a library prevents this tool from becoming a monolith that can't later be turned into a library.
llvm/tools/llvm-elfabi/ErrorCollector.h
1 ↗	(On Diff #177966)	I looked into that initially, and I don't feel it solves our problem. While joinErrors() solves the problem of collecting multiple errors into one, `ErrorCollector` solves our issue of determining whether or not to consume errors. In `llvm-elfabi.cpp`, each file reader might fail and produce an error. If any reader succeeds, though, the other errors should be consumed as they become irrelevant. If none succeed, they should all be reported (in the future Jake and I would like to have more fine grained control over which are reported, but that's not in the scope of this patch). This was the cleanest way I felt we could resolve the issue without having very messy use of consumeError() throughout `readInputFile()`.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
39 ↗	(On Diff #177966)	You're right, there's not much need for `updateTBEVersion()`. I could see there being more logic in the future if there's a desire to write a TBE using an older writer, but it probably doesn't warrant a function right now. I've inlined it in main().
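
The policy described for `ErrorCollector` above (try each file reader in turn, drop the accumulated errors as soon as one reader succeeds, and report all of them only if every reader fails) can be sketched without LLVM as follows. This is an assumption-laden illustration: `ReadResult`, `Reader` and `readWithFallback` are hypothetical names, and plain strings stand in for `llvm::Error`, so the checked/unchecked semantics of real `Error` values are not modeled.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical result of one reader: Ok is true on success, otherwise
// Message says why this reader rejected the input.
struct ReadResult {
  bool Ok;
  std::string Message;
};

using Reader = std::function<ReadResult(const std::string &)>;

// Tries each (tag, reader) pair in order. On the first success the errors
// gathered so far are discarded because they are no longer relevant. If no
// reader succeeds, every tagged error is returned to the caller.
std::vector<std::string>
readWithFallback(const std::string &Input,
                 const std::vector<std::pair<std::string, Reader>> &Readers) {
  assert(!Readers.empty() && "at least one reader is required");
  std::vector<std::string> Errors;
  for (const auto &TaggedReader : Readers) {
    ReadResult R = TaggedReader.second(Input);
    if (R.Ok)
      return {}; // Success: earlier failures are intentionally dropped.
    Errors.push_back(TaggedReader.first + ": " + R.Message);
  }
  return Errors;
}
```

An empty result means some reader accepted the input; a non-empty result carries one tagged message per failed reader, in the order the readers were tried.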

I'm happy with everything I've reviewed, including the related changes, but time means that I haven't had a chance to review the ErrorCollector class in more detail, and I'm not sure I'll get a chance this side of the New Year. If others are happy for this to go in, feel free to commit. I may come back and review the last bit at a later point.

amontanez added a child revision: D55839: [elfabi] Add support for writing ELF header for binary stubs.Dec 18 2018, 10:31 AM

I got a few more minutes this morning, but that's it from me until the New Year. I don't mind this going in before then, once my comments have been addressed and somebody else reviews it.

llvm/tools/llvm-elfabi/ErrorCollector.cpp
23 ↗	(On Diff #178124)	Should Err here be an r-value reference? I'm not sure copying it strictly makes sense. Tag should be a StringRef (copy it at the point where necessary, i.e. at the Tags.push_back() line).
40 ↗	(On Diff #178124)	errc::interrupted doesn't feel like the right answer here (what was interrupted?). This should probably be something else. I guess it may be worth checking the errc of all errors and using that if it's the same, or picking one if not?
46 ↗	(On Diff #178124)	Unless the indexes are going to be useful for something else, I don't think they give us anything. I also am not sure there's a need to print the "Encountered multiple errors" bit and then print them all as a single error. Instead, I'd print each of the errors separately (i.e. with the WithColor::error() method or equivalent), since each is a distinct message. One strong reason for doing this is that each is indeed a distinct error, so should be prefixed with "error:". This helps IDEs and the like display the different errors in a meaningful way (if you plugged this into Visual Studio for example, the Error List view would simply say "Encountered multiple errors" and you'd have to go to the build output to actually view them, whereas reporting them separately would result in separate errors being listed). Additionally, printing separate errors is easier to read when viewing the output in a terminal (because each is individually highlighted).
53–56 ↗	(On Diff #178124)	`return Errors.empty();`
70 ↗	(On Diff #178124)	Does logging mark Error as checked? If not, we don't need the abort() (and can probably also get rid of the "Aborted due to...." message too) by simply skipping the consumeError() loop in the destructor, and letting the unchecked Error assertion fire as normal.
llvm/tools/llvm-elfabi/ErrorCollector.h
62 ↗	(On Diff #178124)	Could we avoid the abbreviation here? Everywhere else it's Errors, not Errs (so this would be `allErrorsHandled`).
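
The suggestion for line 46 above, printing every collected error as its own "error:"-prefixed line rather than one combined message, amounts to something like this sketch. It uses only the standard library; `logErrors` is a hypothetical name, and coloring (which `WithColor::error()` would provide in LLVM) is left out.

```cpp
#include <cassert>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Writes one "error: ..." line per message so that tools scanning for that
// prefix see each failure as a separate diagnostic.
void logErrors(std::ostream &OS, const std::vector<std::string> &Messages) {
  for (const std::string &Message : Messages)
    OS << "error: " << Message << '\n';
}

// Usage example: two failures produce two independent diagnostic lines.
void exampleLogErrors() {
  std::ostringstream OS;
  logErrors(OS, {"first failure", "second failure"});
  assert(OS.str() == "error: first failure\nerror: second failure\n");
}
```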

Changed:

Populating ELFStub.Arch is now inline.
ErrorCollector.log() now prefixes each error with "error: " in red.
ErrorCollector.makeError() now uses joinErrors() in lieu of a specialized Error type (marked as TODO) that gives more control over how to handle each error.
addError() now takes a r-value reference Error.

Quick bugfix in ErrorCollector::makeError() and change to make the function more concise.

amontanez marked 3 inline comments as done.Dec 19 2018, 2:50 PM

amontanez added inline comments.

llvm/tools/llvm-elfabi/ErrorCollector.cpp
46 ↗	(On Diff #178124)	I've fixed this for `ErrorCollector::log()`, but it doesn't work as cleanly for the previous `ErrorCollector::makeError()` since color information would get stripped out. I've switched `ErrorCollector::makeError()` to use `joinErrors()` for now.
70 ↗	(On Diff #178124)	I tried omitting this and only one error is printed since the program terminates early when the first unhandled error is encountered.

Changed:

Fixed test formatting.

LGTM.

Closed by commit rL350341: [elfabi] Introduce tool for ELF TextAPI (authored by amontanez).Jan 3 2019, 10:36 AM

This revision was automatically updated to reflect the committed changes.

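Revision Contents

Path	Size
llvm/trunk/test/CMakeLists.txt	1 line
llvm/trunk/test/tools/llvm-elfabi/binary-read-arch.test	15 lines
llvm/trunk/test/tools/llvm-elfabi/fail-file-open.test	5 lines
llvm/trunk/test/tools/llvm-elfabi/read-unsupported-file.test	7 lines
llvm/trunk/test/tools/llvm-elfabi/replace-soname-tbe.test	16 lines
llvm/trunk/test/tools/llvm-elfabi/tbe-emits-current-version.test	13 lines
llvm/trunk/test/tools/llvm-elfabi/tbe-read-basic.test	25 lines
llvm/trunk/tools/LLVMBuild.txt	1 line
llvm/trunk/tools/llvm-elfabi/CMakeLists.txt	11 lines
llvm/trunk/tools/llvm-elfabi/ELFObjHandler.h	33 lines
llvm/trunk/tools/llvm-elfabi/ELFObjHandler.cpp	68 lines
llvm/trunk/tools/llvm-elfabi/ErrorCollector.h	75 lines
llvm/trunk/tools/llvm-elfabi/ErrorCollector.cpp	70 lines
llvm/trunk/tools/llvm-elfabi/LLVMBuild.txt	22 lines
llvm/trunk/tools/llvm-elfabi/llvm-elfabi.cpp	120 lines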

Diff 180100

llvm/trunk/test/CMakeLists.txt

[55 lines not shown; enclosing context: set(LLVM_TEST_DEPENDS]
  llvm-cxxfilt
  llvm-cxxmap
  llvm-diff
  llvm-dis
  llvm-dlltool
  dsymutil
  llvm-dwarfdump
  llvm-dwp
+ llvm-elfabi
  llvm-exegesis
  llvm-extract
  llvm-isel-fuzzer
  llvm-lib
  llvm-link
  llvm-lto2
  llvm-mc
  llvm-mca
[105 lines not shown]

llvm/trunk/test/tools/llvm-elfabi/binary-read-arch.test

				# RUN: yaml2obj %s > %t
				# RUN: llvm-elfabi %t --emit-tbe=- | FileCheck %s

				!ELF
				FileHeader:
				  Class: ELFCLASS64
				  Data: ELFDATA2LSB
				  Type: ET_DYN
				  Machine: EM_X86_64

				# CHECK: --- !tapi-tbe
				# CHECK-NEXT: TbeVersion: {{[1-9]\d*\.(0|([1-9]\d*))}}
				# CHECK-NEXT: Arch: x86_64
				# CHECK-NEXT: Symbols: {}
				# CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-elfabi/fail-file-open.test

				# RUN: not llvm-elfabi %s.NotAFileInTestingDir --emit-tbe=%t 2>&1 | FileCheck %s

				This file will not be read. An invalid file path is fed to llvm-elfabi.

				# CHECK: error: Could not open `{{.*}}.NotAFileInTestingDir`

llvm/trunk/test/tools/llvm-elfabi/read-unsupported-file.test

				# RUN: not llvm-elfabi %s --emit-tbe=%t 2>&1 | FileCheck %s

				This is just some text that cannot be read by llvm-elfabi.

				# CHECK: The file was not recognized as a valid object file
				# CHECK: YAML failed reading as TBE
				# CHECK: No file readers succeeded reading `{{.*}}` (unsupported/malformed file?)

llvm/trunk/test/tools/llvm-elfabi/replace-soname-tbe.test

				# RUN: yaml2obj %s > %t
				# RUN: llvm-elfabi %t --emit-tbe=- --soname=best.so | FileCheck %s

				!ELF
				FileHeader:
				  Class: ELFCLASS64
				  Data: ELFDATA2LSB
				  Type: ET_DYN
				  Machine: EM_AARCH64

				# CHECK: --- !tapi-tbe
				# CHECK-NEXT: TbeVersion: {{[1-9]\d*\.(0|([1-9]\d*))}}
				# CHECK-NEXT: SoName: best.so
				# CHECK-NEXT: Arch: AArch64
				# CHECK-NEXT: Symbols: {}
				# CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-elfabi/tbe-emits-current-version.test

				# RUN: llvm-elfabi %s --emit-tbe=- | FileCheck %s

				--- !tapi-tbe
				TbeVersion: 1.0
				Arch: AArch64
				Symbols: {}
				...

				# As the tbe reader/writer is updated, update this check to ensure --emit-tbe
				# uses the latest tbe writer by default.

				# CHECK: --- !tapi-tbe
				# CHECK-NEXT: TbeVersion: 1.0

llvm/trunk/test/tools/llvm-elfabi/tbe-read-basic.test

				# RUN: llvm-elfabi %s --emit-tbe=- | FileCheck %s

				--- !tapi-tbe
				SoName: somelib.so
				TbeVersion: 1.0
				Arch: x86_64
				Symbols:
				  foo: { Type: Func }
				  bar: { Type: Object, Size: 42 }
				  baz: { Type: Object, Size: 8 }
				  not: { Type: Object, Undefined: true, Size: 128 }
				  nor: { Type: Func, Undefined: true }
				...

				# CHECK: --- !tapi-tbe
				# CHECK-NEXT: TbeVersion: {{[1-9]\d*\.(0|([1-9]\d*))}}
				# CHECK-NEXT: SoName: somelib.so
				# CHECK-NEXT: Arch: x86_64
				# CHECK-NEXT: Symbols:
				# CHECK-NEXT: bar: { Type: Object, Size: 42 }
				# CHECK-NEXT: baz: { Type: Object, Size: 8 }
				# CHECK-NEXT: foo: { Type: Func }
				# CHECK-NEXT: nor: { Type: Func, Undefined: true }
				# CHECK-NEXT: not: { Type: Object, Size: 128, Undefined: true }
				# CHECK-NEXT: ...

llvm/trunk/tools/LLVMBuild.txt

[26 lines not shown]
  llvm-cat
  llvm-cfi-verify
  llvm-cov
  llvm-cvtres
  llvm-diff
  llvm-dis
  llvm-dwarfdump
  llvm-dwp
+ llvm-elfabi
  llvm-exegesis
  llvm-extract
  llvm-jitlistener
  llvm-link
  llvm-lto
  llvm-mc
  llvm-mca
  llvm-modextract
[18 lines not shown]

llvm/trunk/tools/llvm-elfabi/CMakeLists.txt

				set(LLVM_LINK_COMPONENTS
				  Object
				  Support
				  TextAPI
				  )

				add_llvm_tool(llvm-elfabi
				  ELFObjHandler.cpp
				  ErrorCollector.cpp
				  llvm-elfabi.cpp
				  )

llvm/trunk/tools/llvm-elfabi/ELFObjHandler.h

				//===- ELFObjHandler.h ------------------------------------------*- C++ -*-===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===-----------------------------------------------------------------------===/
				///
				/// This supports reading and writing of elf dynamic shared objects.
				///
				//===-----------------------------------------------------------------------===/

				#ifndef LLVM_TOOLS_ELFABI_ELFOBJHANDLER_H
				#define LLVM_TOOLS_ELFABI_ELFOBJHANDLER_H

				#include "llvm/Object/ELFObjectFile.h"
				#include "llvm/Object/ELFTypes.h"
				#include "llvm/TextAPI/ELF/ELFStub.h"

				namespace llvm {

				class MemoryBuffer;

				namespace elfabi {

				/// Attempt to read a binary ELF file from a MemoryBuffer.
				Expected<std::unique_ptr<ELFStub>> readELFFile(MemoryBufferRef Buf);

				} // end namespace elfabi
				} // end namespace llvm

				#endif // LLVM_TOOLS_ELFABI_ELFOBJHANDLER_H

llvm/trunk/tools/llvm-elfabi/ELFObjHandler.cpp

				//===- ELFObjHandler.cpp --------------------------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===-----------------------------------------------------------------------===/

				#include "ELFObjHandler.h"
				#include "llvm/Object/Binary.h"
				#include "llvm/Object/ELFObjectFile.h"
				#include "llvm/Object/ELFTypes.h"
				#include "llvm/Support/Errc.h"
				#include "llvm/Support/Error.h"
				#include "llvm/Support/MemoryBuffer.h"
				#include "llvm/TextAPI/ELF/ELFStub.h"

				using llvm::MemoryBufferRef;
				using llvm::object::ELFObjectFile;

				using namespace llvm;
				using namespace llvm::object;
				using namespace llvm::elfabi;
				using namespace llvm::ELF;

				namespace llvm {
				namespace elfabi {

				/// Returns a new ELFStub with all members populated from an ELFObjectFile.
				/// @param ElfObj Source ELFObjectFile.
				template <class ELFT>
				Expected<std::unique_ptr<ELFStub>>
				buildStub(const ELFObjectFile<ELFT> &ElfObj) {
				  std::unique_ptr<ELFStub> DestStub = make_unique<ELFStub>();
				  const ELFFile<ELFT> *ElfFile = ElfObj.getELFFile();

				  DestStub->Arch = ElfFile->getHeader()->e_machine;

				  // TODO: Populate SoName from .dynamic entries and linked string table.
				  // TODO: Populate NeededLibs from .dynamic entries and linked string table.
				  // TODO: Populate Symbols from .dynsym table and linked string table.

				  return std::move(DestStub);
				}

				Expected<std::unique_ptr<ELFStub>> readELFFile(MemoryBufferRef Buf) {
				  Expected<std::unique_ptr<Binary>> BinOrErr = createBinary(Buf);
				  if (!BinOrErr) {
				    return BinOrErr.takeError();
				  }

				  Binary *Bin = BinOrErr->get();
				  if (auto Obj = dyn_cast<ELFObjectFile<ELF32LE>>(Bin)) {
				    return buildStub(*Obj);
				  } else if (auto Obj = dyn_cast<ELFObjectFile<ELF64LE>>(Bin)) {
				    return buildStub(*Obj);
				  } else if (auto Obj = dyn_cast<ELFObjectFile<ELF32BE>>(Bin)) {
				    return buildStub(*Obj);
				  } else if (auto Obj = dyn_cast<ELFObjectFile<ELF64BE>>(Bin)) {
				    return buildStub(*Obj);
				  }

				  return createStringError(errc::not_supported, "Unsupported binary format");
				}

				} // end namespace elfabi
				} // end namespace llvm
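
For readers unfamiliar with why `readELFFile` dispatches on four `ELFObjectFile` instantiations: an ELF header announces its own class (32/64-bit) and byte order in `e_ident`, and `e_machine` is the 16-bit field that follows `e_type`. The following standalone sketch reads that field from raw bytes. It is not how the patch does it (the patch relies on libObject); `readMachine` is a hypothetical helper written only against the published ELF header layout.

```cpp
#include <cassert>
#include <string>

// Reads e_machine from raw ELF header bytes, or returns -1 if the buffer
// does not look like an ELF header.
int readMachine(const std::string &Bytes) {
  // e_ident occupies bytes 0-15, e_type bytes 16-17, e_machine bytes 18-19.
  if (Bytes.size() < 20 || Bytes.compare(0, 4, "\x7f" "ELF") != 0)
    return -1;
  const unsigned char Class = Bytes[4]; // EI_CLASS: 1 = 32-bit, 2 = 64-bit
  const unsigned char Data = Bytes[5];  // EI_DATA: 1 = little, 2 = big endian
  if ((Class != 1 && Class != 2) || (Data != 1 && Data != 2))
    return -1;
  const unsigned First = static_cast<unsigned char>(Bytes[18]);
  const unsigned Second = static_cast<unsigned char>(Bytes[19]);
  // Little endian stores the low byte first; big endian stores it last.
  return Data == 1 ? static_cast<int>(First | (Second << 8))
                   : static_cast<int>((First << 8) | Second);
}

// Usage example: a 64-bit little-endian header with EM_X86_64 (62).
void exampleReadMachine() {
  std::string Header(20, '\0');
  Header.replace(0, 4, "\x7f" "ELF");
  Header[4] = 2;
  Header[5] = 1;
  Header[18] = 62;
  assert(readMachine(Header) == 62);
}
```

Because `e_machine` sits at the same offset in 32-bit and 64-bit headers, only the byte order affects how it is decoded here; later fields do differ by class, which is one reason the real code is templated on the ELF type.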

llvm/trunk/tools/llvm-elfabi/ErrorCollector.h

				//===- ErrorCollector.h -----------------------------------------*- C++ -*-===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===-----------------------------------------------------------------------===/
				///
				/// This class collects errors that should be reported or ignored in aggregate.
				///
				/// Like llvm::Error, an ErrorCollector cannot be copied. Unlike llvm::Error,
				/// an ErrorCollector may be destroyed if it was originally constructed to treat
				/// errors as non-fatal. In this case, all Errors are consumed upon destruction.
				/// An ErrorCollector may be initially constructed (or escalated) such that
				/// errors are treated as fatal. This causes a crash if an attempt is made to
				/// delete the ErrorCollector when some Errors have not been retrieved via
				/// makeError().
				///
				//===-----------------------------------------------------------------------===/

				#ifndef LLVM_TOOLS_ELFABI_ERRORCOLLECTOR_H
				#define LLVM_TOOLS_ELFABI_ERRORCOLLECTOR_H

				#include "llvm/Support/Error.h"
				#include <vector>

				namespace llvm {
				namespace elfabi {

				class ErrorCollector {
				public:
				/// Upon destruction, an ErrorCollector will crash if UseFatalErrors=true and
				/// there are remaining errors that haven't been fetched by makeError().
				ErrorCollector(bool UseFatalErrors = true) : ErrorsAreFatal(UseFatalErrors) {}
				// Don't allow copying.
				ErrorCollector(const ErrorCollector &Stub) = delete;
				ErrorCollector &operator=(const ErrorCollector &Other) = delete;
				~ErrorCollector();

				// TODO: Add move constructor and operator= when a testable situation arises.

				/// Returns a single error that contains messages for all stored Errors.
				Error makeError();

				/// Adds an error with a descriptive tag that helps with identification.
				/// If the error is an Error::success(), it is checked and discarded.
				void addError(Error &&E, StringRef Tag);

				/// This ensures an ErrorCollector will treat unhandled errors as fatal.
				/// This function should be called if errors that usually can be ignored
				/// are suddenly of concern (i.e. attempt multiple things that return Error,
				/// but only care about the Errors if no attempt succeeds).
				void escalateToFatal();

				private:
				/// Logs all errors to a raw_ostream.
				void log(raw_ostream &OS);

				/// Returns true if all errors have been retrieved through makeError(), or
				/// false if errors have been added since the last makeError() call.
				bool allErrorsHandled() const;

				/// Dump output and crash.
				LLVM_ATTRIBUTE_NORETURN void fatalUnhandledError();

				bool ErrorsAreFatal;
				std::vector<Error> Errors;
				std::vector<std::string> Tags;
				};

				} // end namespace elfabi
				} // end namespace llvm

				#endif // LLVM_TOOLS_ELFABI_ERRORCOLLECTOR_H

llvm/trunk/tools/llvm-elfabi/ErrorCollector.cpp

				//===- ErrorCollector.cpp -------------------------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===-----------------------------------------------------------------------===/

				#include "ErrorCollector.h"
				#include "llvm/Support/Errc.h"
				#include "llvm/Support/Error.h"
				#include "llvm/Support/raw_ostream.h"
				#include "llvm/Support/WithColor.h"
				#include <vector>

				using namespace llvm;
				using namespace llvm::elfabi;

				void ErrorCollector::escalateToFatal() {
				ErrorsAreFatal = true;
				}

				void ErrorCollector::addError(Error &&Err, StringRef Tag) {
				if (Err) {
				Errors.push_back(std::move(Err));
				Tags.push_back(Tag.str());
				}
				}

				Error ErrorCollector::makeError() {
				// TODO: Make this return something (an AggregateError?) that gives more
				// individual control over each error and which might be of interest.
				Error JoinedErrors = Error::success();
				for (Error &E : Errors) {
				JoinedErrors = joinErrors(std::move(JoinedErrors), std::move(E));
				}
				Errors.clear();
				Tags.clear();
				return JoinedErrors;
				}

				void ErrorCollector::log(raw_ostream &OS) {
				OS << "Encountered multiple errors:\n";
				for (size_t i = 0; i < Errors.size(); ++i) {
				WithColor::error(OS) << "(" << Tags[i] << ") " << Errors[i];
				if (i != Errors.size() - 1)
				OS << "\n";
				}
				}

				bool ErrorCollector::allErrorsHandled() const {
				return Errors.empty();
				}

				ErrorCollector::~ErrorCollector() {
				if (ErrorsAreFatal && !allErrorsHandled())
				fatalUnhandledError();

				for (Error &E : Errors) {
				consumeError(std::move(E));
				}
				}

				LLVM_ATTRIBUTE_NORETURN void ErrorCollector::fatalUnhandledError() {
				errs() << "Program aborted due to unhandled Error(s):\n";
				log(errs());
				errs() << "\n";
				abort();
				}
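The collect-then-report policy described in ErrorCollector.h can be sketched without LLVM. The class below is a simplified stand-in, assuming errors are plain strings (an empty string plays the role of `Error::success()`); `SimpleErrorCollector` and `takeAll` are hypothetical names, and unlike `makeError()` in the patch, this sketch folds the tags into the joined message.

```cpp
#include <cstddef>
#include <cstdlib>
#include <iostream>
#include <string>
#include <vector>

// Simplified stand-in for ErrorCollector: strings instead of llvm::Error.
class SimpleErrorCollector {
public:
  explicit SimpleErrorCollector(bool UseFatalErrors = true)
      : ErrorsAreFatal(UseFatalErrors) {}
  // Like the original, the collector cannot be copied.
  SimpleErrorCollector(const SimpleErrorCollector &) = delete;
  SimpleErrorCollector &operator=(const SimpleErrorCollector &) = delete;

  ~SimpleErrorCollector() {
    // Fatal mode: pending errors at destruction abort the program.
    if (ErrorsAreFatal && !Messages.empty()) {
      std::cerr << "Program aborted due to unhandled error(s):\n"
                << join() << "\n";
      std::abort();
    }
    // Non-fatal mode: pending errors are silently dropped.
  }

  // Records a tagged error; an empty message is treated as success.
  void addError(const std::string &Message, const std::string &Tag) {
    if (!Message.empty())
      Messages.push_back("(" + Tag + ") " + Message);
  }

  // Errors that could be ignored so far now must be retrieved.
  void escalateToFatal() { ErrorsAreFatal = true; }

  // Returns all pending messages joined by newlines and marks them handled.
  std::string takeAll() {
    std::string Joined = join();
    Messages.clear();
    return Joined;
  }

private:
  std::string join() const {
    std::string Joined;
    for (std::size_t I = 0; I < Messages.size(); ++I) {
      if (I != 0)
        Joined += "\n";
      Joined += Messages[I];
    }
    return Joined;
  }

  bool ErrorsAreFatal;
  std::vector<std::string> Messages;
};
```

A caller would typically construct the collector as non-fatal, add an error per failed attempt, and call `escalateToFatal()` followed by `takeAll()` only once every attempt has failed.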

llvm/trunk/tools/llvm-elfabi/LLVMBuild.txt

				;===- ./tools/llvm-elfabi/LLVMBuild.txt ------------------------*- Conf -*--===;
				;
				; The LLVM Compiler Infrastructure
				;
				; This file is distributed under the University of Illinois Open Source
				; License. See LICENSE.TXT for details.
				;
				;===------------------------------------------------------------------------===;
				;
				; This is an LLVMBuild description file for the components in this subdirectory.
				;
				; For more information on the LLVMBuild system, please see:
				;
				; http://llvm.org/docs/LLVMBuild.html
				;
				;===------------------------------------------------------------------------===;

				[component_0]
				type = Tool
				name = llvm-elfabi
				parent = Tools
				required_libraries = Object Support TextAPI

llvm/trunk/tools/llvm-elfabi/llvm-elfabi.cpp

				//===- llvm-elfabi.cpp ----------------------------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===-----------------------------------------------------------------------===/

				#include "ELFObjHandler.h"
				#include "ErrorCollector.h"
				#include "llvm/Support/CommandLine.h"
				#include "llvm/Support/Errc.h"
				#include "llvm/Support/FileOutputBuffer.h"
				#include "llvm/Support/MemoryBuffer.h"
				#include "llvm/Support/Path.h"
				#include "llvm/Support/raw_ostream.h"
				#include "llvm/Support/WithColor.h"
				#include "llvm/TextAPI/ELF/TBEHandler.h"
				#include <string>

				using namespace llvm;
				using namespace llvm::elfabi;

				// Command line flags:
				cl::opt<std::string> InputFilePath(cl::Positional, cl::desc("input"),
				cl::Required);
				cl::opt<std::string>
				EmitTBE("emit-tbe",
				cl::desc("Emit a text-based ELF stub (.tbe) from the input file"),
				cl::value_desc("path"));
				cl::opt<std::string> SOName(
				"soname",
				cl::desc("Manually set the DT_SONAME entry of any emitted files"),
				cl::value_desc("name"));

				/// writeTBE() writes a Text-Based ELF stub to a file using the latest version
				/// of the YAML parser.
				static Error writeTBE(StringRef FilePath, ELFStub &Stub) {
				std::error_code SysErr;

				// Open file for writing.
				raw_fd_ostream Out(FilePath, SysErr);
				if (SysErr)
				return createStringError(SysErr, "Couldn't open `%s` for writing",
				FilePath.data());
				// Write file.
				Error YAMLErr = writeTBEToOutputStream(Out, Stub);
				if (YAMLErr)
				return YAMLErr;

				return Error::success();
				}

				/// readInputFile populates an ELFStub by attempting to read the
				/// input file using both the TBE and binary ELF parsers.
				static Expected<std::unique_ptr<ELFStub>> readInputFile(StringRef FilePath) {
				// Read in file.
				ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrError =
				MemoryBuffer::getFile(FilePath);
				if (!BufOrError) {
				return createStringError(BufOrError.getError(), "Could not open `%s`",
				FilePath.data());
				}

				std::unique_ptr<MemoryBuffer> FileReadBuffer = std::move(*BufOrError);
				ErrorCollector EC(/*UseFatalErrors=*/false);

				// First try to read as a binary (fails fast if not binary).
				Expected<std::unique_ptr<ELFStub>> StubFromELF =
				readELFFile(FileReadBuffer->getMemBufferRef());
				if (StubFromELF) {
				return std::move(*StubFromELF);
				}
				EC.addError(StubFromELF.takeError(), "BinaryRead");

				// Fall back to reading as a tbe.
				Expected<std::unique_ptr<ELFStub>> StubFromTBE =
				readTBEFromBuffer(FileReadBuffer->getBuffer());
				if (StubFromTBE) {
				return std::move(*StubFromTBE);
				}
				EC.addError(StubFromTBE.takeError(), "YamlParse");

				// If both readers fail, build a new error that includes all information.
				EC.addError(createStringError(errc::not_supported,
				"No file readers succeeded reading `%s` "
				"(unsupported/malformed file?)",
				FilePath.data()),
				"ReadInputFile");
				EC.escalateToFatal();
				return EC.makeError();
				}

				int main(int argc, char *argv[]) {
				// Parse arguments.
				cl::ParseCommandLineOptions(argc, argv);

				Expected<std::unique_ptr<ELFStub>> StubOrErr = readInputFile(InputFilePath);
				if (!StubOrErr) {
				Error ReadError = StubOrErr.takeError();
				WithColor::error() << ReadError << "\n";
				exit(1);
				}

				std::unique_ptr<ELFStub> TargetStub = std::move(StubOrErr.get());

				// Write out .tbe file.
				if (EmitTBE.getNumOccurrences() == 1) {
				TargetStub->TbeVersion = TBEVersionCurrent;
				if (SOName.getNumOccurrences() == 1) {
				TargetStub->SoName = SOName;
				}
				Error TBEWriteError = writeTBE(EmitTBE, *TargetStub);
				if (TBEWriteError) {
				WithColor::error() << TBEWriteError << "\n";
				exit(1);
				}
				}
				}
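`readInputFile` follows a try-each-reader pattern: return the first successful result, and only if every reader fails, report all of their errors together. Below is a minimal sketch of that control flow in plain C++, assuming string payloads and string errors; `Reader`, `ReadResult`, and `readWithFallback` are hypothetical names and not part of the patch.

```cpp
#include <functional>
#include <string>
#include <vector>

// Outcome of trying every reader in turn.
struct ReadResult {
  bool Ok;
  std::string Value; // parsed payload on success
  std::string Error; // joined, tagged diagnostics on failure
};

// One candidate reader with a tag used to label its error.
struct Reader {
  std::string Tag;
  // Returns true and fills Out on success; otherwise fills Err.
  std::function<bool(const std::string &Input, std::string &Out,
                     std::string &Err)>
      Read;
};

// Try each reader in order. The first success wins and earlier failures are
// discarded; if none succeed, all failures are reported together.
inline ReadResult readWithFallback(const std::string &Input,
                                   const std::vector<Reader> &Readers) {
  std::string Collected;
  for (const Reader &R : Readers) {
    std::string Out, Err;
    if (R.Read(Input, Out, Err))
      return {true, Out, ""};
    if (!Collected.empty())
      Collected += "\n";
    Collected += "(" + R.Tag + ") " + Err;
  }
  // Mirror the final summary error that the tool adds after both readers fail.
  if (!Collected.empty())
    Collected += "\n";
  Collected += "(ReadInputFile) no file readers succeeded";
  return {false, "", Collected};
}
```

Ordering the cheap, fail-fast reader first (the binary reader in the patch) keeps the common case inexpensive, while the aggregated message still shows why each reader rejected the input.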

Revision Contents

Diff 180100
