This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
test/tools/llvm-elfabi/
-
tools/
-
llvm-elfabi/
2/2
invalid-bin-target.test
1/1
missing-bin-target.test
1/1
write-elf32be-ehdr.test
-
write-elf32le-ehdr.test
-
write-elf64be-ehdr.test
-
write-elf64le-ehdr.test
-
tools/llvm-elfabi/
-
llvm-elfabi/
8/8
ELFObjHandler.h
11/20
ELFObjHandler.cpp
7/10
llvm-elfabi.cpp

Differential D55839

[elfabi] Add support for writing ELF header for binary stubs
Needs ReviewPublic

Authored by jakehehrlich on Dec 18 2018, 10:28 AM.

Download Raw Diff

Details

Reviewers

phosek
mcgrathr
jhenderson
ruiu
amontanez

Summary

This change introduces the beginnings of ELF binary stub write support for elfabi. Specifying an output file path as well as --output-target=<target> will write a binary ELF stub to the specified output file path. For this patch, only the ELF file header is written to the file.

Diff Detail

Repository: rL LLVM

Event Timeline

amontanez created this revision.Dec 18 2018, 10:28 AM

Herald added a subscriber: llvm-commits. · View Herald TranscriptDec 18 2018, 10:28 AM

amontanez added a parent revision: D55352: [elfabi] Introduce tool for ELF TextAPI.Dec 18 2018, 10:31 AM

The structure looks most good but I've got a lot of little requests.

llvm/test/tools/llvm-elfabi/invalid-bin-target.test
8	There's no need to have these extra symbols here.
llvm/test/tools/llvm-elfabi/missing-bin-target.test
8	ditto.
llvm/tools/llvm-elfabi/ELFObjHandler.cpp
240	Not sure I like this interface but if you do want todo something like this 1) use uint8_t instead of 'char' and 2) MutableArrayRef wraps the pointer and size for you so you don't have to carry them around while still allowing you to modify the contents as you need to.
242	Calling getBinarySize twice since the user already has to call it isn't ideal. Also if you want this check it seems like an assertion would be better. Also if you just return a Buffer from here rather than telling the user how big of a buffer to construct (though you force them to use a specific kind of buffer) then you can avoid the check all together.
256	I think using this technique is most justified by larger code (like what you'll have later) so I'm cool keeping these but at this size it feels like it could just all go in a header.
llvm/tools/llvm-elfabi/ELFObjHandler.h
42	Seems like this can be inlined.
63	The whole 'Writer' thing is supposed to allow for a base class to exist so you don't have to know what Writer you're using, but nothing like that is happening here. If we're not doing anything like that do you think we could just make these all functions?
65	This can be private or if you turn these into functions, you can make them static in the defining object.
69–81	Seems like you might be able to simplify this to just return a buffer of some kind.
89	This can be private.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
93	Use uint8_t instead of char for raw binary data. llvm doesn't even use full C++11 but in C++17 we'll use std::byte
109	I think it would be nicer if this took the output format as a parameter.
117	Use == and don't construct a StringRef.

ruiu added a subscriber: ruiu.Dec 18 2018, 5:01 PM

ruiu added inline comments.

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
234–235	0x00 and 0u are all just 0, so I'd just write "0". Sign-extending 0 yields just 0, so "u" is redundant.
llvm/tools/llvm-elfabi/ELFObjHandler.h
63	I agree with Jake that because this can be done with functions it's probably better to do this using functions without a class. In addition to that, it looks like a "Impl" class that doesn't inherit any class a bit weird, because usually an "Impl" class implements an abstract interface of some other class (I'm not suggesting you define an abstract class and an implementation class, but just pointing out that the current name is perhaps not that good.)
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
158–159	It might be discussed before, but what is a value of returning an Error from a function and then print that error & exit from the main function? Propagating an error all the way to the main function makes function signatures more complex (any function that can fail or calls a function that can fail has to have a function signature of ErrorOr<T> instead of just T). If this is just a command, then maybe just printing out an error message and exit is better?

jakehehrlich added inline comments.Dec 19 2018, 2:56 PM

llvm/tools/llvm-elfabi/llvm-elfabi.cpp
87	Since you have an error and are currently returning an error, that error should incorporate the specific information from this error and not make it more vauge.
158–159	Yeah there's some line where things will only ever go in this code and some line where things would go into an eventual library. It isn't 100% clear where that line is right now. I'm in favor of earing on the side of caution and propagating more than less for the time being.

This should address most of the comments. All functions except for the single writeBinaryStub() have become static functions, and the class has been removed. I'll be updating D55864 to match these changes as soon as possible.

jhenderson added inline comments.Jan 21 2019, 2:17 AM

llvm/test/tools/llvm-elfabi/invalid-bin-target.test
11	Super nit: Space between # and CHECK. Same applies to other tests.
llvm/test/tools/llvm-elfabi/write-elf32be-ehdr.test
11–13	I believe this information is derived from the ElfHeader, so there's not much point in testing it, since you already test it below when testing the ElfHeader itself. Same applies to other tests.
llvm/tools/llvm-elfabi/ELFObjHandler.cpp
214	"calculates what the size" -> "calculates the size"
262	You should probably set the e_shentsize and e_phentsize fields too, since those are constant.
291	Same comment as above. It seems weird for `Stub` to be a non-const reference.
303	Add a blank line here.
307–308	You can do these two lines in one, and lose the braces too, i.e: if (Error BinaryWriteError = writeELFBinaryToBuffer<ELFT>(Stub, BufRef)) return BinaryWriteError;
312–313	Ditto.
llvm/tools/llvm-elfabi/ELFObjHandler.h
38	"begins the process"? Sounds to me like this function should do the whole process, given its name.
43	It seems odd to me that Stub is a non-const reference. I wouldn't expect this function to modify it based on the current description.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
146	It probably reads easier if there's a new line between each if block.

Comments addressed.

One small comment from me, otherwise looks good from my point of view, assuming the other reviewers are happy.

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
264–265	Now that you are doing this, please make sure that it is tested too.

Updated tests.

amontanez added a child revision: D55864: [elfabi] Write program headers, .dynamic, .dynstr, and .shstrtab.Jan 23 2019, 12:37 PM

jakehehrlich added inline comments.Jan 23 2019, 12:46 PM

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
220	Calculating the size and writing can be a bit fragile. That said you can't write until you have enough space allocated and because we want to use a FileBuffer and avoid copying, we only want to write once. A trick I've seen used is to think about 'WriteCommands' write commands know the maximum index that they write to and can perform the write themselves. So rather than performing size calculation and writes separately, you construct a list of write commands. From this list you then traverse it to get the maximum index written to, and then traverse it once more to write into the now allocated buffer. This way you don't have parallel code but you avoid reallocating a mapped buffer.
246	Use ELFMAG* instead of each of the actual constants here.
254	It turns out that there are use cases where this can be other things. This is a good default but sometimes the user should have to specify this. @jhenderson has hit this issue in BSD land. I can't seem to find the code for that however. Hopefully James can weigh in. Either way I don't think its something you need to worry about right now but it was possibly an oversight on our part.
305–306	You can append to an error I believe so that you don't have to consume it.

amontanez mentioned this in D55864: [elfabi] Write program headers, .dynamic, .dynstr, and .shstrtab.Jan 23 2019, 6:19 PM

jhenderson added inline comments.Jan 31 2019, 5:46 AM

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
254	The issue in llvm-objcopy was that it was copying an existing file and discarding the EI_OSABI and EI_ABIVERSION. Certainly it might be useful in the future to be able to say what the value should be, but as this is not currently required, I think it can be delayed until there's a request for it. On the other hand, converting a binary into a text file then back into a binary should probably support this at some point.
257	Nit: this should start with a capital letter and end with a full stop.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
151–152	You might want to consider factoring these out into a single function that takes an `Error`.

I'm picking this up. Looking for review on this again. I'm going to drive this to completion enough to use on Fuchsia and then maintain it. After it has the features we want for Fuchsia I'll work on adding features that other people want but at a slower pace (like I did after a while with llvm-objcopy) eventually I'll ramp down but honestly, unlike llvm-objcopy, I could see this tool stabilizing.

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
220	I went ahead and implemented my own advice here. Seems to solve the problem of separating calculation of size and writing of data. The alternative is to have a type that mirrors the format of the stub but contains layout information and then to perform layout and then writing individually like we do in llvm-objcopy.
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
151–152	Do you remember what you meant by this? I'm starting this back up again.

Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2019, 4:38 PM

jakehehrlich updated this revision to Diff 196930.Apr 26 2019, 4:39 PM

jhenderson added inline comments.Apr 30 2019, 2:20 AM

llvm/tools/llvm-elfabi/ELFObjHandler.cpp
43	clang-format? (I think it should be `const T &Value`)
239	out -> Out
llvm/tools/llvm-elfabi/llvm-elfabi.cpp
151–152	The WithColor::error() and exit(1) are repeated in a couple of places. It might be nice if they were a simple function that takes an llvm::Errror and does not return, e.g: void reportError(Error Err) { WithColor::error() << Err << "\n"; exit(1); } although I feel like there might also be other functions available to do the same thing.

In D55839#1481114, @jakehehrlich wrote:

I'm picking this up. Looking for review on this again. I'm going to drive this to completion enough to use on Fuchsia and then maintain it. After it has the features we want for Fuchsia I'll work on adding features that other people want but at a slower pace (like I did after a while with llvm-objcopy) eventually I'll ramp down but honestly, unlike llvm-objcopy, I could see this tool stabilizing.

@jakehehrlich Is this the diff for adding the binary stubs support we were talking about in D60974 ?

Revision Contents

Path

Size

llvm/

test/

tools/

llvm-elfabi/

invalid-bin-target.test

10 lines

missing-bin-target.test

10 lines

write-elf32be-ehdr.test

26 lines

write-elf32le-ehdr.test

26 lines

write-elf64be-ehdr.test

26 lines

write-elf64le-ehdr.test

26 lines

tools/

llvm-elfabi/

ELFObjHandler.h

17 lines

ELFObjHandler.cpp

125 lines

llvm-elfabi.cpp

33 lines

Diff 182938

llvm/test/tools/llvm-elfabi/invalid-bin-target.test

This file was added.

				# RUN: not llvm-elfabi %s --output-target=nope %t 2>&1 \| FileCheck %s

				--- !tapi-tbe
				SoName: somelib.so
				TbeVersion: 1.0
				Arch: x86_64
				Symbols: {}
				...
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions There's no need to have these extra symbols here. jakehehrlich: There's no need to have these extra symbols here.

				# CHECK: llvm-elfabi: for the -output-target option: Cannot find option named 'nope'!
				jhendersonUnsubmitted Done Reply Inline Actions Super nit: Space between # and CHECK. Same applies to other tests. jhenderson: Super nit: Space between # and CHECK. Same applies to other tests.

llvm/test/tools/llvm-elfabi/missing-bin-target.test

This file was added.

				# RUN: not llvm-elfabi %s %t 2>&1 \| FileCheck %s

				--- !tapi-tbe
				SoName: somelib.so
				TbeVersion: 1.0
				Arch: x86_64
				Symbols: {}
				...
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions ditto. jakehehrlich: ditto.

				# CHECK: No binary output target specified.

llvm/test/tools/llvm-elfabi/write-elf32be-ehdr.test

This file was added.

				# RUN: llvm-elfabi %s --output-target=elf32-big %t
				# RUN: llvm-readobj --file-headers %t \| FileCheck %s

				--- !tapi-tbe
				TbeVersion: 1.0
				Arch: x86_64
				Symbols: {}
				...

				# CHECK: ElfHeader {
				# CHECK-NEXT: Ident {
				# CHECK-NEXT: Magic: (7F 45 4C 46)
				# CHECK-NEXT: Class: 32-bit (0x1)
				jhendersonUnsubmitted Done Reply Inline Actions I believe this information is derived from the ElfHeader, so there's not much point in testing it, since you already test it below when testing the ElfHeader itself. Same applies to other tests. jhenderson: I believe this information is derived from the ElfHeader, so there's not much point in testing…
				# CHECK-NEXT: DataEncoding: BigEndian (0x2)
				# CHECK-NEXT: FileVersion: 1
				# CHECK-NEXT: OS/ABI: SystemV (0x0)
				# CHECK-NEXT: ABIVersion: 0
				# CHECK-NEXT: Unused: (00 00 00 00 00 00 00)
				# CHECK-NEXT: }
				# CHECK-NEXT: Type: SharedObject (0x3)
				# CHECK-NEXT: Machine: EM_X86_64 (0x3E)
				# CHECK-NEXT: Version: 1
				# CHECK-NEXT: Entry: 0x0
				# CHECK: Flags [ (0x0)
				# CHECK-NEXT: ]
				# CHECK-NEXT: HeaderSize: 52

llvm/test/tools/llvm-elfabi/write-elf32le-ehdr.test

This file was added.

				# RUN: llvm-elfabi %s --output-target=elf32-little %t
				# RUN: llvm-readobj --file-headers %t \| FileCheck %s

				--- !tapi-tbe
				TbeVersion: 1.0
				Arch: x86_64
				Symbols: {}
				...

				# CHECK: ElfHeader {
				# CHECK-NEXT: Ident {
				# CHECK-NEXT: Magic: (7F 45 4C 46)
				# CHECK-NEXT: Class: 32-bit (0x1)
				# CHECK-NEXT: DataEncoding: LittleEndian (0x1)
				# CHECK-NEXT: FileVersion: 1
				# CHECK-NEXT: OS/ABI: SystemV (0x0)
				# CHECK-NEXT: ABIVersion: 0
				# CHECK-NEXT: Unused: (00 00 00 00 00 00 00)
				# CHECK-NEXT: }
				# CHECK-NEXT: Type: SharedObject (0x3)
				# CHECK-NEXT: Machine: EM_X86_64 (0x3E)
				# CHECK-NEXT: Version: 1
				# CHECK-NEXT: Entry: 0x0
				# CHECK: Flags [ (0x0)
				# CHECK-NEXT: ]
				# CHECK-NEXT: HeaderSize: 52

llvm/test/tools/llvm-elfabi/write-elf64be-ehdr.test

This file was added.

				# RUN: llvm-elfabi %s --output-target=elf64-big %t
				# RUN: llvm-readobj --file-headers %t \| FileCheck %s

				--- !tapi-tbe
				TbeVersion: 1.0
				Arch: x86_64
				Symbols: {}
				...

				# CHECK: ElfHeader {
				# CHECK-NEXT: Ident {
				# CHECK-NEXT: Magic: (7F 45 4C 46)
				# CHECK-NEXT: Class: 64-bit (0x2)
				# CHECK-NEXT: DataEncoding: BigEndian (0x2)
				# CHECK-NEXT: FileVersion: 1
				# CHECK-NEXT: OS/ABI: SystemV (0x0)
				# CHECK-NEXT: ABIVersion: 0
				# CHECK-NEXT: Unused: (00 00 00 00 00 00 00)
				# CHECK-NEXT: }
				# CHECK-NEXT: Type: SharedObject (0x3)
				# CHECK-NEXT: Machine: EM_X86_64 (0x3E)
				# CHECK-NEXT: Version: 1
				# CHECK-NEXT: Entry: 0x0
				# CHECK: Flags [ (0x0)
				# CHECK-NEXT: ]
				# CHECK-NEXT: HeaderSize: 64

llvm/test/tools/llvm-elfabi/write-elf64le-ehdr.test

This file was added.

				# RUN: llvm-elfabi %s --output-target=elf64-little %t
				# RUN: llvm-readobj --file-headers %t \| FileCheck %s

				--- !tapi-tbe
				TbeVersion: 1.0
				Arch: AArch64
				Symbols: {}
				...

				# CHECK: ElfHeader {
				# CHECK-NEXT: Ident {
				# CHECK-NEXT: Magic: (7F 45 4C 46)
				# CHECK-NEXT: Class: 64-bit (0x2)
				# CHECK-NEXT: DataEncoding: LittleEndian (0x1)
				# CHECK-NEXT: FileVersion: 1
				# CHECK-NEXT: OS/ABI: SystemV (0x0)
				# CHECK-NEXT: ABIVersion: 0
				# CHECK-NEXT: Unused: (00 00 00 00 00 00 00)
				# CHECK-NEXT: }
				# CHECK-NEXT: Type: SharedObject (0x3)
				# CHECK-NEXT: Machine: EM_AARCH64 (0xB7)
				# CHECK-NEXT: Version: 1
				# CHECK-NEXT: Entry: 0x0
				# CHECK: Flags [ (0x0)
				# CHECK-NEXT: ]
				# CHECK-NEXT: HeaderSize: 64

llvm/tools/llvm-elfabi/ELFObjHandler.h

	Show All 17 Lines
	#include "llvm/TextAPI/ELF/ELFStub.h"			#include "llvm/TextAPI/ELF/ELFStub.h"

	namespace llvm {			namespace llvm {

	class MemoryBuffer;			class MemoryBuffer;

	namespace elfabi {			namespace elfabi {

				enum class ELFTarget {
				ELF32LE,
				ELF32BE,
				ELF64LE,
				ELF64BE
				};

	/// Attempt to read a binary ELF file from a MemoryBuffer.			/// Attempt to read a binary ELF file from a MemoryBuffer.
	Expected<std::unique_ptr<ELFStub>> readELFFile(MemoryBufferRef Buf);			Expected<std::unique_ptr<ELFStub>> readELFFile(MemoryBufferRef Buf);

				/// Attempt to write a binary ELF stub.
				/// This function determines appropriate ELFType using the passed ELFTarget and
				/// then writes a binary ELF stub to a specified file path.
				jhendersonUnsubmitted Done Reply Inline Actions "begins the process"? Sounds to me like this function should do the whole process, given its name. jhenderson: "begins the process"? Sounds to me like this function should do the whole process, given its…
				///
				/// @param FilePath File path for writing the ELF binary.
				/// @param Stub Source ELFStub to generate a binary ELF stub from.
				/// @param OutputFormat Target ELFType to write binary as.
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Seems like this can be inlined. jakehehrlich: Seems like this can be inlined.
				Error writeBinaryStub(StringRef FilePath, const ELFStub &Stub,
				jhendersonUnsubmitted Done Reply Inline Actions It seems odd to me that Stub is a non-const reference. I wouldn't expect this function to modify it based on the current description. jhenderson: It seems odd to me that Stub is a non-const reference. I wouldn't expect this function to…
				ELFTarget OutputFormat);

	} // end namespace elfabi			} // end namespace elfabi
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_ELFABI_ELFOBJHANDLER_H			#endif // LLVM_TOOLS_ELFABI_ELFOBJHANDLER_H
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions This can be private or if you turn these into functions, you can make them static in the defining object. jakehehrlich: This can be private or if you turn these into functions, you can make them static in the…
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions This can be private. jakehehrlich: This can be private.
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Seems like you might be able to simplify this to just return a buffer of some kind. jakehehrlich: Seems like you might be able to simplify this to just return a buffer of some kind.
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions The whole 'Writer' thing is supposed to allow for a base class to exist so you don't have to know what Writer you're using, but nothing like that is happening here. If we're not doing anything like that do you think we could just make these all functions? jakehehrlich: The whole 'Writer' thing is supposed to allow for a base class to exist so you don't have to…
				ruiuUnsubmitted Done Reply Inline Actions I agree with Jake that because this can be done with functions it's probably better to do this using functions without a class. In addition to that, it looks like a "Impl" class that doesn't inherit any class a bit weird, because usually an "Impl" class implements an abstract interface of some other class (I'm not suggesting you define an abstract class and an implementation class, but just pointing out that the current name is perhaps not that good.) ruiu: I agree with Jake that because this can be done with functions it's probably better to do this…

llvm/tools/llvm-elfabi/ELFObjHandler.cpp

//===- ELFObjHandler.cpp --------------------------------------------------===//		//===- ELFObjHandler.cpp --------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===-----------------------------------------------------------------------===/		//===-----------------------------------------------------------------------===/

#include "ELFObjHandler.h"		#include "ELFObjHandler.h"
#include "llvm/Object/Binary.h"		#include "llvm/Object/Binary.h"
#include "llvm/Object/ELFObjectFile.h"		#include "llvm/Object/ELFObjectFile.h"
#include "llvm/Object/ELFTypes.h"		#include "llvm/Object/ELFTypes.h"
#include "llvm/Support/Errc.h"		#include "llvm/Support/Errc.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
		#include "llvm/Support/FileOutputBuffer.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/TextAPI/ELF/ELFStub.h"		#include "llvm/TextAPI/ELF/ELFStub.h"

using llvm::MemoryBufferRef;		using llvm::MemoryBufferRef;
using llvm::object::ELFObjectFile;		using llvm::object::ELFObjectFile;

using namespace llvm;		using namespace llvm;
using namespace llvm::object;		using namespace llvm::object;
Show All 11 Lines
};		};

/// This function behaves similarly to StringRef::substr(), but attempts to		/// This function behaves similarly to StringRef::substr(), but attempts to
/// terminate the returned StringRef at the first null terminator. If no null		/// terminate the returned StringRef at the first null terminator. If no null
/// terminator is found, an error is returned.		/// terminator is found, an error is returned.
///		///
/// @param Str Source string to create a substring from.		/// @param Str Source string to create a substring from.
/// @param Offset The start index of the desired substring.		/// @param Offset The start index of the desired substring.
static Expected<StringRef> terminatedSubstr(StringRef Str, size_t Offset) {		static Expected<StringRef> terminatedSubstr(StringRef Str, size_t Offset) {
		jhendersonUnsubmitted Not Done Reply Inline Actions clang-format? (I think it should be `const T &Value`) jhenderson: clang-format? (I think it should be `const T &Value`)
size_t StrEnd = Str.find('\0', Offset);		size_t StrEnd = Str.find('\0', Offset);
if (StrEnd == StringLiteral::npos) {		if (StrEnd == StringLiteral::npos) {
return createError(		return createError(
"String overran bounds of string table (no null terminator)");		"String overran bounds of string table (no null terminator)");
}		}

size_t StrLen = StrEnd - Offset;		size_t StrLen = StrEnd - Offset;
return Str.substr(Offset, StrLen);		return Str.substr(Offset, StrLen);
▲ Show 20 Lines • Show All 154 Lines • ▼ Show 20 Lines	if (auto Obj = dyn_cast<ELFObjectFile<ELF32LE>>(Bin)) {
return buildStub(*Obj);		return buildStub(*Obj);
} else if (auto Obj = dyn_cast<ELFObjectFile<ELF64BE>>(Bin)) {		} else if (auto Obj = dyn_cast<ELFObjectFile<ELF64BE>>(Bin)) {
return buildStub(*Obj);		return buildStub(*Obj);
}		}

return createStringError(errc::not_supported, "Unsupported binary format");		return createStringError(errc::not_supported, "Unsupported binary format");
}		}

		/// This function calculates the size a binary ELF stub will be.
		jhendersonUnsubmitted Done Reply Inline Actions "calculates what the size" -> "calculates the size" jhenderson: "calculates what the size" -> "calculates the size"
		/// `Stub` is used to determine the exact binary size. This calculation includes
		/// padding that may be added between sections to ensure proper alignment.
		///
		/// @param Stub The ELFStub that will be used for size calculation.
		/// @return Size (in bytes) of the final binary stub.
		template <class ELFT> static size_t getBinarySize(const ELFStub &Stub) {
		jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions Calculating the size and writing can be a bit fragile. That said you can't write until you have enough space allocated and because we want to use a FileBuffer and avoid copying, we only want to write once. A trick I've seen used is to think about 'WriteCommands' write commands know the maximum index that they write to and can perform the write themselves. So rather than performing size calculation and writes separately, you construct a list of write commands. From this list you then traverse it to get the maximum index written to, and then traverse it once more to write into the now allocated buffer. This way you don't have parallel code but you avoid reallocating a mapped buffer. jakehehrlich: Calculating the size and writing can be a bit fragile. That said you can't write until you have…
		jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions I went ahead and implemented my own advice here. Seems to solve the problem of separating calculation of size and writing of data. The alternative is to have a type that mirrors the format of the stub but contains layout information and then to perform layout and then writing individually like we do in llvm-objcopy. jakehehrlich: I went ahead and implemented my own advice here. Seems to solve the problem of separating…
		using Elf_Ehdr = typename ELFT::Ehdr;
		return sizeof(Elf_Ehdr);
		// TODO: Calculate size of section headers.
		// TODO: Calculate size of program headers.
		// TODO: Calculate size of .dynsym section.
		// TODO: Calculate size of .dynstr section.
		// TODO: Calculate size of .dynamic section.
		// TODO: Calculate size of .shstrtab section.
		}

		/// This initializes an ELF file header with information specific to a binary
		/// dynamic shared object.
		/// Offsets, indexes, links, etc. for section and program headers are just
		/// zero-initialized as they will be updated elsewhere.
		///
		ruiuUnsubmitted Done Reply Inline Actions 0x00 and 0u are all just 0, so I'd just write "0". Sign-extending 0 yields just 0, so "u" is redundant. ruiu: 0x00 and 0u are all just 0, so I'd just write "0". Sign-extending 0 yields just 0, so "u" is…
		/// @param ElfHeader Target ELFT::Ehdr to populate.
		/// @param Machine Target architecture (e_machine from ELF specifications).
		template <class ELFT>
		static void initELFHeader(typename ELFT::Ehdr &ElfHeader, uint16_t Machine) {
		jhendersonUnsubmitted Not Done Reply Inline Actions out -> Out jhenderson: out -> Out
		using Elf_Ehdr = typename ELFT::Ehdr;
		jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Not sure I like this interface but if you do want todo something like this 1) use uint8_t instead of 'char' and 2) MutableArrayRef wraps the pointer and size for you so you don't have to carry them around while still allowing you to modify the contents as you need to. jakehehrlich: Not sure I like this interface but if you do want todo something like this 1) use uint8_t…
		using Elf_Phdr = typename ELFT::Phdr;
		using Elf_Shdr = typename ELFT::Shdr;
		jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Calling getBinarySize twice since the user already has to call it isn't ideal. Also if you want this check it seems like an assertion would be better. Also if you just return a Buffer from here rather than telling the user how big of a buffer to construct (though you force them to use a specific kind of buffer) then you can avoid the check all together. jakehehrlich: Calling getBinarySize twice since the user already has to call it isn't ideal. Also if you want…

		memset(&ElfHeader, 0, sizeof(Elf_Ehdr));
		// ELF identification
		ElfHeader.e_ident[EI_MAG0] = 0x7f; // ELFMAG0
		jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions Use ELFMAG* instead of each of the actual constants here. jakehehrlich: Use ELFMAG* instead of each of the actual constants here.
		ElfHeader.e_ident[EI_MAG1] = 'E'; // ELFMAG1
		ElfHeader.e_ident[EI_MAG2] = 'L'; // ELFMAG2
		ElfHeader.e_ident[EI_MAG3] = 'F'; // ELFMAG3
		ElfHeader.e_ident[EI_CLASS] = ELFT::Is64Bits ? ELFCLASS64 : ELFCLASS32;
		bool IsLittleEndian = ELFT::TargetEndianness == support::little;
		ElfHeader.e_ident[EI_DATA] = IsLittleEndian ? ELFDATA2LSB : ELFDATA2MSB;
		ElfHeader.e_ident[EI_VERSION] = EV_CURRENT;
		ElfHeader.e_ident[EI_OSABI] = ELFOSABI_NONE;
		jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions It turns out that there are use cases where this can be other things. This is a good default but sometimes the user should have to specify this. @jhenderson has hit this issue in BSD land. I can't seem to find the code for that however. Hopefully James can weigh in. Either way I don't think its something you need to worry about right now but it was possibly an oversight on our part. jakehehrlich: It turns out that there are use cases where this can be other things. This is a good default…
		jhendersonUnsubmitted Not Done Reply Inline Actions The issue in llvm-objcopy was that it was copying an existing file and discarding the EI_OSABI and EI_ABIVERSION. Certainly it might be useful in the future to be able to say what the value should be, but as this is not currently required, I think it can be delayed until there's a request for it. On the other hand, converting a binary into a text file then back into a binary should probably support this at some point. jhenderson: The issue in llvm-objcopy was that it was copying an existing file and discarding the EI_OSABI…
		ElfHeader.e_ident[EI_ABIVERSION] = 0;

		jakehehrlichAuthorUnsubmitted Done Reply Inline Actions I think using this technique is most justified by larger code (like what you'll have later) so I'm cool keeping these but at this size it feels like it could just all go in a header. jakehehrlich: I think using this technique is most justified by larger code (like what you'll have later) so…
		// remainder of ELF header
		jhendersonUnsubmitted Not Done Reply Inline Actions Nit: this should start with a capital letter and end with a full stop. jhenderson: Nit: this should start with a capital letter and end with a full stop.
		ElfHeader.e_type = ET_DYN;
		ElfHeader.e_machine = Machine;
		ElfHeader.e_version = EV_CURRENT;
		ElfHeader.e_entry = 0;
		ElfHeader.e_flags = 0;
		jhendersonUnsubmitted Done Reply Inline Actions You should probably set the e_shentsize and e_phentsize fields too, since those are constant. jhenderson: You should probably set the e_shentsize and e_phentsize fields too, since those are constant.
		ElfHeader.e_ehsize = sizeof(Elf_Ehdr);
		ElfHeader.e_phentsize = sizeof(Elf_Phdr);
		ElfHeader.e_shentsize = sizeof(Elf_Shdr);
		jhendersonUnsubmitted Done Reply Inline Actions Now that you are doing this, please make sure that it is tested too. jhenderson: Now that you are doing this, please make sure that it is tested too.
		}

		/// This function uses an ELFStub to generate an ELF binary stub that is written
		/// to a buffer.
		///
		/// @param FilePath File path for writing the ELF binary.
		/// @param Stub Source ELFStub to generate a binary ELF stub from.
		template <class ELFT>
		static Error writeELFBinaryToBuffer(const ELFStub &Stub,
		MutableArrayRef<uint8_t> BufRef) {
		using Elf_Ehdr = typename ELFT::Ehdr;
		uint8_t *Buf = BufRef.data();
		Elf_Ehdr ElfHeader = reinterpret_cast<Elf_Ehdr >(Buf);
		initELFHeader<ELFT>(*ElfHeader, Stub.Arch);
		// TODO: Write section headers.
		// TODO: Write program headers.
		// TODO: Write .dynsym section.
		// TODO: Write .dynstr section.
		// TODO: Write .dynamic section.
		// TODO: Write .shstrtab section.
		return Error::success();
		}

		/// This function opens a file for writing and then writes a binary ELF stub to
		/// the file.
		///
		jhendersonUnsubmitted Done Reply Inline Actions Same comment as above. It seems weird for `Stub` to be a non-const reference. jhenderson: Same comment as above. It seems weird for `Stub` to be a non-const reference.
		/// @param FilePath File path for writing the ELF binary.
		/// @param Stub Source ELFStub to generate a binary ELF stub from.
		template <class ELFT>
		static Error writeELFBinaryToFile(StringRef FilePath, const ELFStub &Stub) {
		// Open file for writing.
		Expected<std::unique_ptr<FileOutputBuffer>> BufOrError =
		FileOutputBuffer::create(FilePath, getBinarySize<ELFT>(Stub));
		if (!BufOrError) {
		Error FileReadError = BufOrError.takeError();
		std::string Message;
		raw_string_ostream Stream(Message);
		Stream << FileReadError;
		jhendersonUnsubmitted Done Reply Inline Actions Add a blank line here. jhenderson: Add a blank line here.
		Stream << " when trying to open `" << FilePath <<"` for writing";
		consumeError(std::move(FileReadError));
		return createStringError(errc::invalid_argument, Stream.str().c_str());
		jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions You can append to an error I believe so that you don't have to consume it. jakehehrlich: You can append to an error I believe so that you don't have to consume it.
		}

		jhendersonUnsubmitted Done Reply Inline Actions You can do these two lines in one, and lose the braces too, i.e: if (Error BinaryWriteError = writeELFBinaryToBuffer<ELFT>(Stub, BufRef)) return BinaryWriteError; jhenderson: You can do these two lines in one, and lose the braces too, i.e: ``` if (Error…
		// Write binary to file.
		std::unique_ptr<FileOutputBuffer> Buf = std::move(*BufOrError);
		MutableArrayRef<uint8_t> BufRef(Buf->getBufferStart(), Buf->getBufferSize());
		if (Error BinaryWriteError = writeELFBinaryToBuffer<ELFT>(Stub, BufRef))
		return BinaryWriteError;
		jhendersonUnsubmitted Done Reply Inline Actions Ditto. jhenderson: Ditto.

		if (Error FileWriteError = Buf->commit())
		return FileWriteError;

		return Error::success();
		}

		// This function wraps the ELFT writeELFBinaryToFile() so writeBinaryStub()
		// can be called without having to use ELFType templates directly.
		Error writeBinaryStub(StringRef FilePath, const ELFStub &Stub,
		ELFTarget OutputFormat) {
		if (OutputFormat == ELFTarget::ELF32LE) {
		return writeELFBinaryToFile<ELF32LE>(FilePath, Stub);
		} else if (OutputFormat == ELFTarget::ELF32BE) {
		return writeELFBinaryToFile<ELF32BE>(FilePath, Stub);
		} else if (OutputFormat == ELFTarget::ELF64LE) {
		return writeELFBinaryToFile<ELF64LE>(FilePath, Stub);
		} else if (OutputFormat == ELFTarget::ELF64BE) {
		return writeELFBinaryToFile<ELF64BE>(FilePath, Stub);
		}
		return createStringError(errc::invalid_argument,
		"Invalid binary output target");
		}

} // end namespace elfabi		} // end namespace elfabi
} // end namespace llvm		} // end namespace llvm

llvm/tools/llvm-elfabi/llvm-elfabi.cpp

	Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines
	cl::opt<std::string>			cl::opt<std::string>
	EmitTBE("emit-tbe",			EmitTBE("emit-tbe",
	cl::desc("Emit a text-based ELF stub (.tbe) from the input file"),			cl::desc("Emit a text-based ELF stub (.tbe) from the input file"),
	cl::value_desc("path"));			cl::value_desc("path"));
	cl::opt<std::string> SOName(			cl::opt<std::string> SOName(
	"soname",			"soname",
	cl::desc("Manually set the DT_SONAME entry of any emitted files"),			cl::desc("Manually set the DT_SONAME entry of any emitted files"),
	cl::value_desc("name"));			cl::value_desc("name"));
				cl::opt<ELFTarget> BinaryOutputTarget(
				"output-target", cl::desc("Create a binary stub for the specified target"),
				cl::values(clEnumValN(ELFTarget::ELF32LE, "elf32-little",
				"32-bit little-endian ELF stub"),
				clEnumValN(ELFTarget::ELF32BE, "elf32-big",
				"32-bit big-endian ELF stub"),
				clEnumValN(ELFTarget::ELF64LE, "elf64-little",
				"64-bit little-endian ELF stub"),
				clEnumValN(ELFTarget::ELF64BE, "elf64-big",
				"64-bit big-endian ELF stub")));
				cl::opt<std::string> BinaryOutputFilePath(cl::Positional, cl::desc("output"));

	/// writeTBE() writes a Text-Based ELF stub to a file using the latest version			/// writeTBE() writes a Text-Based ELF stub to a file using the latest version
	/// of the YAML parser.			/// of the YAML parser.
	static Error writeTBE(StringRef FilePath, ELFStub &Stub) {			static Error writeTBE(StringRef FilePath, ELFStub &Stub) {
	std::error_code SysErr;			std::error_code SysErr;

	// Open file for writing.			// Open file for writing.
	raw_fd_ostream Out(FilePath, SysErr);			raw_fd_ostream Out(FilePath, SysErr);
	if (SysErr)			if (SysErr)
	return createStringError(SysErr, "Couldn't open `%s` for writing",			return createStringError(SysErr, "Couldn't open `%s` for writing",
	FilePath.data());			FilePath.data());
	// Write file.			// Write file.
	Error YAMLErr = writeTBEToOutputStream(Out, Stub);			Error YAMLErr = writeTBEToOutputStream(Out, Stub);
	if (YAMLErr)			if (YAMLErr)
	return YAMLErr;			return YAMLErr;

	return Error::success();			return Error::success();
	}			}

	/// readInputFile populates an ELFStub by attempting to read the			/// readInputFile populates an ELFStub by attempting to read the
	/// input file using both the TBE and binary ELF parsers.			/// input file using both the TBE and binary ELF parsers.
	static Expected<std::unique_ptr<ELFStub>> readInputFile(StringRef FilePath) {			static Expected<std::unique_ptr<ELFStub>> readInputFile(StringRef FilePath) {
	// Read in file.			// Read in file.
	ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrError =			ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrError =
	MemoryBuffer::getFile(FilePath);			MemoryBuffer::getFile(FilePath);
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Since you have an error and are currently returning an error, that error should incorporate the specific information from this error and not make it more vauge. jakehehrlich: Since you have an error and are currently returning an error, that error should incorporate the…
	if (!BufOrError) {			if (!BufOrError) {
	return createStringError(BufOrError.getError(), "Could not open `%s`",			return createStringError(BufOrError.getError(), "Could not open `%s`",
	FilePath.data());			FilePath.data());
	}			}

	std::unique_ptr<MemoryBuffer> FileReadBuffer = std::move(*BufOrError);			std::unique_ptr<MemoryBuffer> FileReadBuffer = std::move(*BufOrError);
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Use uint8_t instead of char for raw binary data. llvm doesn't even use full C++11 but in C++17 we'll use std::byte jakehehrlich: Use uint8_t instead of char for raw binary data. llvm doesn't even use full C++11 but in C++17…
	ErrorCollector EC(/UseFatalErrors=/false);			ErrorCollector EC(/UseFatalErrors=/false);

	// First try to read as a binary (fails fast if not binary).			// First try to read as a binary (fails fast if not binary).
	if (InputFileFormat.getNumOccurrences() == 0 \|\|			if (InputFileFormat.getNumOccurrences() == 0 \|\|
	InputFileFormat == FileFormat::ELF) {			InputFileFormat == FileFormat::ELF) {
	Expected<std::unique_ptr<ELFStub>> StubFromELF =			Expected<std::unique_ptr<ELFStub>> StubFromELF =
	readELFFile(FileReadBuffer->getMemBufferRef());			readELFFile(FileReadBuffer->getMemBufferRef());
	if (StubFromELF) {			if (StubFromELF) {
	return std::move(*StubFromELF);			return std::move(*StubFromELF);
	}			}
	EC.addError(StubFromELF.takeError(), "BinaryRead");			EC.addError(StubFromELF.takeError(), "BinaryRead");
	}			}

	// Fall back to reading as a tbe.			// Fall back to reading as a tbe.
	if (InputFileFormat.getNumOccurrences() == 0 \|\|			if (InputFileFormat.getNumOccurrences() == 0 \|\|
	InputFileFormat == FileFormat::TBE) {			InputFileFormat == FileFormat::TBE) {
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions I think it would be nicer if this took the output format as a parameter. jakehehrlich: I think it would be nicer if this took the output format as a parameter.
	Expected<std::unique_ptr<ELFStub>> StubFromTBE =			Expected<std::unique_ptr<ELFStub>> StubFromTBE =
	readTBEFromBuffer(FileReadBuffer->getBuffer());			readTBEFromBuffer(FileReadBuffer->getBuffer());
	if (StubFromTBE) {			if (StubFromTBE) {
	return std::move(*StubFromTBE);			return std::move(*StubFromTBE);
	}			}
	EC.addError(StubFromTBE.takeError(), "YamlParse");			EC.addError(StubFromTBE.takeError(), "YamlParse");
	}			}

				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Use == and don't construct a StringRef. jakehehrlich: Use == and don't construct a StringRef.
	// If both readers fail, build a new error that includes all information.			// If both readers fail, build a new error that includes all information.
	EC.addError(createStringError(errc::not_supported,			EC.addError(createStringError(errc::not_supported,
	"No file readers succeeded reading `%s` "			"No file readers succeeded reading `%s` "
	"(unsupported/malformed file?)",			"(unsupported/malformed file?)",
	FilePath.data()),			FilePath.data()),
	"ReadInputFile");			"ReadInputFile");
	EC.escalateToFatal();			EC.escalateToFatal();
	return EC.makeError();			return EC.makeError();
	}			}

	int main(int argc, char *argv[]) {			int main(int argc, char *argv[]) {
	// Parse arguments.			// Parse arguments.
	cl::ParseCommandLineOptions(argc, argv);			cl::ParseCommandLineOptions(argc, argv);

	Expected<std::unique_ptr<ELFStub>> StubOrErr = readInputFile(InputFilePath);			Expected<std::unique_ptr<ELFStub>> StubOrErr = readInputFile(InputFilePath);
	if (!StubOrErr) {			if (!StubOrErr) {
	Error ReadError = StubOrErr.takeError();			Error ReadError = StubOrErr.takeError();
	WithColor::error() << ReadError << "\n";			WithColor::error() << ReadError << "\n";
	exit(1);			exit(1);
	}			}

	std::unique_ptr<ELFStub> TargetStub = std::move(StubOrErr.get());			std::unique_ptr<ELFStub> TargetStub = std::move(StubOrErr.get());

	// Write out .tbe file.			// Change SoName before emitting stubs.
	if (EmitTBE.getNumOccurrences() == 1) {
	TargetStub->TbeVersion = TBEVersionCurrent;
	if (SOName.getNumOccurrences() == 1) {			if (SOName.getNumOccurrences() == 1) {
	TargetStub->SoName = SOName;			TargetStub->SoName = SOName;
	}			}

				// Write out .tbe file.
				jhendersonUnsubmitted Done Reply Inline Actions It probably reads easier if there's a new line between each if block. jhenderson: It probably reads easier if there's a new line between each if block.
				if (EmitTBE.getNumOccurrences() == 1) {
				TargetStub->TbeVersion = TBEVersionCurrent;
	Error TBEWriteError = writeTBE(EmitTBE, *TargetStub);			Error TBEWriteError = writeTBE(EmitTBE, *TargetStub);
	if (TBEWriteError) {			if (TBEWriteError) {
	WithColor::error() << TBEWriteError << "\n";			WithColor::error() << TBEWriteError << "\n";
	exit(1);			exit(1);
				jhendersonUnsubmitted Not Done Reply Inline Actions You might want to consider factoring these out into a single function that takes an `Error`. jhenderson: You might want to consider factoring these out into a single function that takes an `Error`.
				jakehehrlichAuthorUnsubmitted Not Done Reply Inline Actions Do you remember what you meant by this? I'm starting this back up again. jakehehrlich: Do you remember what you meant by this? I'm starting this back up again.
				jhendersonUnsubmitted Not Done Reply Inline Actions The WithColor::error() and exit(1) are repeated in a couple of places. It might be nice if they were a simple function that takes an llvm::Errror and does not return, e.g: void reportError(Error Err) { WithColor::error() << Err << "\n"; exit(1); } although I feel like there might also be other functions available to do the same thing. jhenderson: The WithColor::error() and exit(1) are repeated in a couple of places. It might be nice if they…
	}			}
	}			}

				// Write out binary ELF stub.
				if (BinaryOutputFilePath.getNumOccurrences() == 1) {
				if (BinaryOutputTarget.getNumOccurrences() == 0) {
				WithColor::error() << "No binary output target specified.\n";
				ruiuUnsubmitted Done Reply Inline Actions It might be discussed before, but what is a value of returning an Error from a function and then print that error & exit from the main function? Propagating an error all the way to the main function makes function signatures more complex (any function that can fail or calls a function that can fail has to have a function signature of ErrorOr<T> instead of just T). If this is just a command, then maybe just printing out an error message and exit is better? ruiu: It might be discussed before, but what is a value of returning an Error from a function and…
				jakehehrlichAuthorUnsubmitted Done Reply Inline Actions Yeah there's some line where things will only ever go in this code and some line where things would go into an eventual library. It isn't 100% clear where that line is right now. I'm in favor of earing on the side of caution and propagating more than less for the time being. jakehehrlich: Yeah there's some line where things will only ever go in this code and some line where things…
				exit(1);
				}
				Error BinaryWriteError = writeBinaryStub(BinaryOutputFilePath, *TargetStub,
				BinaryOutputTarget);
				if (BinaryWriteError) {
				WithColor::error() << BinaryWriteError << "\n";
				exit(1);
				}
				}
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[elfabi] Add support for writing ELF header for binary stubsNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 182938

llvm/test/tools/llvm-elfabi/invalid-bin-target.test

llvm/test/tools/llvm-elfabi/missing-bin-target.test

llvm/test/tools/llvm-elfabi/write-elf32be-ehdr.test

llvm/test/tools/llvm-elfabi/write-elf32le-ehdr.test

llvm/test/tools/llvm-elfabi/write-elf64be-ehdr.test

llvm/test/tools/llvm-elfabi/write-elf64le-ehdr.test

llvm/tools/llvm-elfabi/ELFObjHandler.h

llvm/tools/llvm-elfabi/ELFObjHandler.cpp

llvm/tools/llvm-elfabi/llvm-elfabi.cpp

[elfabi] Add support for writing ELF header for binary stubs
Needs ReviewPublic