This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/ObjCopy/
-
llvm/
-
ObjCopy/
2/4
Buffer.h
-
COFF/
-
COFFObjcopy.h
1/2
CopyConfig.h
-
ELF/
-
ELFConfig.h
-
ELFObjcopy.h
-
MachO/
-
MachOObjcopy.h
2/6
ObjCopy.h
-
wasm/
-
WasmObjcopy.h
-
lib/
1
CMakeLists.txt
-
LLVMBuild.txt
-
ObjCopy/
-
Buffer.cpp
-
CMakeLists.txt
-
COFF/
-
COFFObjcopy.cpp
-
Object.h
3/7
Object.cpp
-
Reader.h
-
Reader.cpp
-
Writer.h
-
Writer.cpp
-
CopyConfig.cpp
-
ELF/
-
ELFConfig.cpp
-
ELFObjcopy.cpp
-
Object.h
-
Object.cpp
-
LLVMBuild.txt
-
MachO/
-
MachOLayoutBuilder.h
-
MachOLayoutBuilder.cpp
-
MachOObjcopy.cpp
-
MachOReader.h
-
MachOReader.cpp
-
MachOWriter.h
-
MachOWriter.cpp
-
Object.h
-
Object.cpp
-
ObjCopy.cpp
-
Util.h
-
wasm/
-
Object.h
-
Object.cpp
1
Reader.h
-
Reader.cpp
-
WasmObjcopy.cpp
-
Writer.h
-
Writer.cpp
-
tools/llvm-objcopy/
-
llvm-objcopy/
-
Buffer.h
-
Buffer.cpp
-
CMakeLists.txt
-
COFF/
-
COFFObjcopy.h
-
COFFObjcopy.cpp
-
Object.h
-
Object.cpp
-
Reader.h
-
Reader.cpp
-
Writer.h
-
Writer.cpp
-
CopyConfig.h
-
CopyConfig.cpp
-
ELF/
-
ELFConfig.h
-
ELFConfig.cpp
-
ELFObjcopy.h
-
ELFObjcopy.cpp
-
Object.h
-
Object.cpp
-
MachO/
-
MachOLayoutBuilder.h
-
MachOLayoutBuilder.cpp
-
MachOObjcopy.h
-
MachOObjcopy.cpp
-
MachOReader.h
-
MachOReader.cpp
-
MachOWriter.h
-
MachOWriter.cpp
-
Object.h
-
Object.cpp
-
ParseConfig.h
-
ParseConfig.cpp
-
llvm-objcopy.h
1
llvm-objcopy.cpp
-
wasm/
-
Object.h
-
Object.cpp
-
Reader.h
-
Reader.cpp
-
WasmObjcopy.h
-
WasmObjcopy.cpp
-
Writer.h
-
Writer.cpp
-
unittests/
-
CMakeLists.txt
-
ObjCopy/
1
CMakeLists.txt
3
ObjCopyTest.cpp

Differential D88827

[llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate library.
ClosedPublic

Authored by avl on Oct 5 2020, 5:29 AM.

Download Raw Diff

Details

Reviewers

jhenderson
rupprecht
grimar
alexander-shaposhnikov
MaskRay
• espindola
mstorsjo
sbc100
echristo

Commits

rGf75da0c8e65c: [llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate…

Summary

This patch moves core implementation of llvm-objcopy into ObjCopy library
(http://lists.llvm.org/pipermail/llvm-dev/2020-September/145075.html).
The functionality for parsing input options is left inside tools/llvm-objcopy.
The interface of ObjCopy library:

ObjCopy/ELF/ELFObjcopy.h

Error executeObjcopyOnIHex(const CopyConfig &Config, MemoryBuffer &In,
                           Buffer &Out);
Error executeObjcopyOnRawBinary(const CopyConfig &Config, MemoryBuffer &In,
                                Buffer &Out);
Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::ELFObjectFileBase &In, Buffer &Out);

ObjCopy/COFF/COFFObjcopy.h

Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::COFFObjectFile &In, Buffer &Out);

ObjCopy/MachO/MachOObjcopy.h

Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::MachOObjectFile &In, Buffer &Out);

ObjCopy/wasm/WasmObjcopy.h

Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::WasmObjectFile &In, Buffer &Out);

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

avl created this revision.Oct 5 2020, 5:29 AM

Herald added a reviewer: • espindola. · View Herald TranscriptOct 5 2020, 5:30 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, abrachet, sunfish and 4 others. · View Herald Transcript

avl requested review of this revision.Oct 5 2020, 5:30 AM

Herald added a subscriber: aheejin. · View Herald TranscriptOct 5 2020, 5:30 AM

Harbormaster completed remote builds in B73983: Diff 296161.Oct 5 2020, 5:43 AM

avl edited the summary of this revision. (Show Details)Oct 5 2020, 5:58 AM

This looks fine to me generally. Have a suggestion though:
in few places there are changes that are related to clang-formatting of original code it seems.
I'd commit them as a separate NFC cleanup (you don't need a review for that) and then rebase this diff.

llvm/unittests/ObjCopy/ObjCopyTest.cpp
40	Perhaps just fail inside?
51	You can avod having this `ASSERT_TRUE` if you use `cast<T>`, I think. Also, you can use `T &` instead of a pointer, because it is expected that the value is always non-null.
100	Do you need `Machine: EM_X86_64`? By default it is `EM_NONE`, so it should work without an explicit value I guess.

This change needs some analysis & review, it might take some time (especially because this week we have the LLVM conference).

A few considerations / observations / questions.

While this diff moves the implementation of llvm-objcopy into a library it seems like the current interfaces / design are not ideal for a library.

For example, CopyConfig essentially represents parsed command-line options and it is "string-heavy".
Another manifestation of this problem is that the "public" headers (include/llvm/ObjCopy/...) expose a great deal of the internal implementation details etc.

Designing a good interface is a nontrivial task. It is important to understand the scope of the problem and potentially dissect it into subparts.

For example if the plan (at least initially) is to reuse the code for reading / writing object files then one of the first steps would be factoring out (and, probably, cleaning up) the model ("class Object") and
exposing the minimal interface for reading / writing . In particular, the internal details of implementation and the associated complexity (e.g. class Reader, class Writer) would live inside the library.

addressed comments.

Herald added a subscriber: mgrang. · View Herald TranscriptOct 6 2020, 7:36 AM

@alexshap

This refactoring tries to reuse this part of the functionality:

Error executeObjcopyOnBinary(const CopyConfig &Config, object::ELFObjectFileBase &In, Buffer &Out);

get source file "In", apply transformations described by "Config", write output into the "Out".

i.e. it is not trying to refactor low level of the objcopy code("class Object" and others). It still should be internal thing of ObjCopy library.

I agree that the patch currently makes internal details (class Object, class Reader, class Writer) to be public.
They should be hidden inside library code. I addressed this into the new version of the patch.

Harbormaster completed remote builds in B74139: Diff 296459.Oct 6 2020, 7:51 AM

apply clang-tidy comments.

Harbormaster completed remote builds in B74176: Diff 296524.Oct 6 2020, 12:58 PM

Looks pretty good from my point of view. Some general comments:

The functions in the library header need doxygen style comments to explain the interface. Possibly these could be added in a follow-up patch.
I think if we are moving files around, now is a good time to run a blanket clang-format on the files. Whether that should be done as part of the patch or a separate follow-on one, I don't know.
Probably not this patch, but we should consider our testing strategy for the libray and llvm-objcopy going forward. Some examples for this dicussion: should we move the existing lit tests? Should we port/duplicate some of them as gtest unit tests? How should new library features be tested (gtest or lit)? How should new llvm-strip/objcopy/... options be tested (bearing in mind in theory there should be library testing etc etc)? I think this sort of discussion probably belongs on the mailing list.

I think keeping the surface area of the library down to just the executeObjcopyOn* functions plus the config struct is a pretty good example of a facade style design pattern, and I think makes sense in this context for at least the first version. This assumes that the use-case is a simple in-process read-in/transform/write-out (possibly to a memory buffer for further operations by other tools). We might want to expose other bits of the process and provide other ways of driving the objcopy process in the future (e.g. making it more interactive somehow), but I think that is an extension for later. We do need to nail down the initial use-case(s) though to make sure we don't produce something that isn't useful.

llvm/include/llvm/ObjCopy/Buffer.h
23–24	Maybe as part of a separate patch, it would be worth taking a look at this TODO. It would be great if the Buffer could be removed from the library API and generic LLVM buffers used instead (for example an in-memory buffer or a file buffer, depending on what people want to do).
llvm/include/llvm/ObjCopy/CopyConfig.h
126–127	This comment probably needs updating to better match the new usage - but see out-of-line comment.
llvm/include/llvm/ObjCopy/ObjCopy.h
19–22	I think details of the output format shouldn't be described in the comment - theoretically objcopy could even mutate from one object format to another (see e.g. the IHEX stuff).
26–28
llvm/lib/ObjCopy/COFF/Object.cpp
1	Whilst you're moving this and the equivalent files for other formats around, could you please rename them to be obvious from the filename which format they are for (same goes for their headers), please? For example, COFF/Object.cpp -> COFF/COFFObject.cpp. The reason for this is that when using the Visual Studio IDE, all the "Object.cpp" files end up listed next to each other in the file browser, and the only way of figuring out which is which is by opening them and seeing.
llvm/lib/ObjCopy/wasm/Reader.h
9

The functions in the library header need doxygen style comments to explain the interface. Possibly these could be added in a follow-up patch.

Ok.

I think if we are moving files around, now is a good time to run a blanket clang-format on the files. Whether that should be done as part of the patch or a separate follow-on one, I don't know.

Ok, but I think it is better to do in a separate patch.

Probably not this patch, but we should consider our testing strategy for the libray and llvm-objcopy going forward. Some examples for this dicussion: should we move the existing lit tests? Should we port/duplicate some of them as gtest unit tests? How should new library features be tested (gtest or lit)? How should new llvm-strip/objcopy/... options be tested (bearing in mind in theory there should be library testing etc etc)? I think this sort of discussion probably belongs on the mailing list.

OK, I will start that thread soon.

llvm/include/llvm/ObjCopy/Buffer.h
23–24	agreed, but I think it is better to do in separate patch.
llvm/include/llvm/ObjCopy/CopyConfig.h
126–127	ok.
llvm/include/llvm/ObjCopy/ObjCopy.h
19–22	ok.
llvm/lib/ObjCopy/COFF/Object.cpp
1	ok.

I have some general comments / concerns (in addition to the inline comment).
The interface of the library is important and once it's committed and people start using the library in multiple places it might be harder to make changes / fix issues
(unfortunately this has already happened in LLVM a few times in the past) .

CopyConfig as a part of the interface in its current form seems to be not "in a perfect shape". E.g. some ELF-specific options are inside the struct ELF, some others are just regular fields.

Some fields make sense in the context of the tool but they don't (at least at quick glance) in the context of a library function.

class Buffer - I agree with @jhenderson's comment. Just want to note that in general there are multiple ways how to address it, but this requires some thinking.

1, 2 - perhaps, this can be refactored / reorganized / better documented, but it would be great to have it done before the introduction of the library.

I'd like to point out that not all the operations supported by llvm-objcopy can be performed in-memory (e.g. splitting dwo will create a new file). If a library function has such side effects (creates new objects on the local file system) it would be good to have it documented.

cc: @mtrent , @dblaikie, @echristo

llvm/lib/ObjCopy/COFF/Object.cpp
1	@jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good idea and the provided justification doesn't appear to be sufficient. Since this file contains the implementation of what's declared in Object.h I would strongly prefer to have it named Object.cpp given it is already located in the corresponding folder. Visual Studio IDE might have some peculiarities but having consistent naming is important, adding such prefixes doesn't seem to be a good approach.

alexander-shaposhnikov added a reviewer: mstorsjo.Oct 8 2020, 4:47 AM

1, 2 - perhaps, this can be refactored / reorganized / better documented, but it would be great to have it done before the introduction of the library.

if possible - I propose to do these points as next patches after this(to make some progress on it). If not - I would work on preliminary patches.

llvm/lib/ObjCopy/COFF/Object.cpp
1	Would it be OK, If both of the files would be renamed Object.h -> COFFObject.h and Object.cpp->COFFObject.cpp ?

avl added inline comments.Oct 9 2020, 8:07 AM

llvm/include/llvm/ObjCopy/Buffer.h

23–24

@alexshap @jhenderson

Speaking of class Buffer refactoring. I do not think that this comment is fully valid:

// The class Buffer abstracts out the common interface of FileOutputBuffer and
// WritableMemoryBuffer so that the hierarchy of Writers depends on this
// abstract interface and doesn't depend on a particular implementation.
// TODO: refactor the buffer classes in LLVM to enable us to use them here
// directly.

It suggests to create some common interface for FileOutputBuffer and WritableMemoryBuffer.
Which is assumed to look similar to this:

class Buffer {
  StringRef Name;

public:
  virtual ~Buffer();
  virtual Error allocate(size_t Size) = 0;
  virtual uint8_t *getBufferStart() = 0;
  virtual Error commit() = 0;

  explicit Buffer(StringRef Name) : Name(Name) {}
  StringRef getName() const { return Name; }
};

There exists a problem with methods commit() and allocate(). commit() is a redundant for WritableMemoryBuffer.
adding it to WritableMemoryBuffer would require to patch all current usages of WritableMemoryBuffer.
So it looks incorrectly to use it for common parent interface of FileOutputBuffer and WritableMemoryBuffer.
allocate() suggests another way of buffer creation. Currently, buffers are created by static creation methods:

static std::unique_ptr<WritableMemoryBuffer>
getNewMemBuffer(size_t Size, const Twine &BufferName = "");

static Expected<std::unique_ptr<FileOutputBuffer>>
create(StringRef FilePath, size_t Size, unsigned Flags = 0);

Adding "virtual Error allocate(size_t Size)" would lead to creation empty buffer by static creation method
and then call to allocate(). This does not seem a good addition to the already existed FileOutputBuffer and WritableMemoryBuffer. FileOutputBuffer and WritableMemoryBuffer assume another use cases than Buffer.

Actually, what is neccessary by objcopy is method:

uint8_t *createBuffer ( size_t Size );

All other functionality is redundant and could be removed from objcopy.

what do you think about following design?

LazyBuffer {
  StringRef Name;

  virtual StringRef getName() const { return Name; }
  virtual uint8_t *createBuffer ( size_t Size ) = 0;
};

MemoryLazyBuffer : public LazyBuffer {
  virtual uint8_t *createBuffer ( size_t Size ) {
    Buffer = WritableMemoryBuffer::getNewMemBuffer(Size, Name);
    return Buffer->getBufferStart();
  }

  std::unique_ptr<WritableMemoryBuffer> Buffer;
};

FileLazyBuffer : public LazyBuffer {
  virtual uint8_t *createBuffer ( size_t Size ) {
    Buffer = FileOutputBuffer::create(Name, Size);
    return Buffer->getBufferStart();
  }

  std::unique_ptr<FileOutputBuffer> Buffer;
};

Usage:

static Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In,
                                    LazyBuffer &Out);


MemoryLazyBuffer MB("name");
executeObjcopyOnBinary(Config, Input, MB);

FileLazyBuffer MB("name");
executeObjcopyOnBinary(Config, Input, MB);
if (MB.Buffer)
  MB.Buffer->commit();

Another alternative is that library always writes to general MemoryBuffer :

static Expected<std::unique_ptr<MemoryBuffer>> executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In);

and later this MemoryBuffer would be written into the file by llvm-objcopy.cpp.

What do you think?

I like the direction this is going, I'll take more of a deep look soon, but wanted to ask: "should this be in Object rather than a separate library?" When I'd originally asked for this to be split into its own library I'd thought that it would get added into libobject.

Thoughts?

-eric

In D88827#2322005, @echristo wrote:

I like the direction this is going, I'll take more of a deep look soon, but wanted to ask: "should this be in Object rather than a separate library?" When I'd originally asked for this to be split into its own library I'd thought that it would get added into libobject.

Thoughts?

-eric

One of the ideals would be to have writable versions of the various ObjectFile classes defined by the Object library. I know this was worked on last year as part of GSOC by @abrachet, but it didn't really end up getting to a usable point. I kind of like the separation of concerns here - the proposed Objcopy library would handle manipulation of object files, whilst the Object library is primarily for parsing and inspecting them. The former builds on the latter, but the latter doesn't need to care about the former. Thus a user who wrote an object dumping tool wouldn't need the Objcopy library. However, I don't have a strong opinion on this, so if a good design could be presented to resolve that, I'd be happy.

In D88827#2319015, @alexshap wrote:

I have some general comments / concerns (in addition to the inline comment).
The interface of the library is important and once it's committed and people start using the library in multiple places it might be harder to make changes / fix issues
(unfortunately this has already happened in LLVM a few times in the past) .

Could we maybe put a big note at the top of the library header saying that the API is still work-in-progress and shouldn't be used, until we've got to the final point? I feel like moving the code first is a good step and then we can iterate on the design once it's in place (I agree that CopyConfig probably needs more work).

llvm/include/llvm/ObjCopy/Buffer.h
23–24	I'm not sure I've looked at the intricacies of the different buffer classes to know what the right approach is. However, if modifying the existing buffers seems like it won't work/will be too invasive, this seems like a fair approach (it's the "Adapter" design pattern in action, I believe). I'm not sure I'd call it `LazyBuffer`, although I don't have a specific better name (maybe just ObjcopyBuffer, depending on how generic we want it to be). Also, unless the objcopy code actually needs the name for anything (aside from error messages, I'm not sure what that would be), I'd not include that in the interface.

In D88827#2322005, @echristo wrote:

I like the direction this is going, I'll take more of a deep look soon, but wanted to ask: "should this be in Object rather than a separate library?" When I'd originally asked for this to be split into its own library I'd thought that it would get added into libobject.

Thoughts?

I think, for the start, it is probably better to make it as a separate library. It could be refactored later and some part could probably be moved into Object library(Object.h/.cpp, Reader.h/.cpp, /Writer.h/.cpp).
If the ObjCopy library would be used in many other places in the end - then it would make sense to put it into the Object library, otherwise it would be better to live it as separate library.
Though, I am not against moving this ObjCopy library into the Object library, if there is opinion that it would be better.

avl added inline comments.Oct 20 2020, 8:59 AM

llvm/lib/ObjCopy/COFF/Object.cpp
1	@jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good idea and the provided justification doesn't appear to be sufficient. Since this file contains the implementation of what's declared in Object.h I would strongly prefer to have it named Object.cpp given it is already located in the corresponding folder. Visual Studio IDE might have some peculiarities but having consistent naming is important, adding such prefixes doesn't seem to be a good approach. @alexshap Could you explain this renaming thing, please? i.e. if both header file COFF/Object.h and src file COFF/Object.cpp would be renamed(COFF/COFFObject.h, COFF/COFFObject.cpp), would it be OK?

addressed comments: renamed files, added doxygen comments into library headers(except CopyConfig.h).

I propose to go ahead with this patch and continue with follow-up patches:

Remove Buffer.h/Buffer.cpp.
cleanup CopyConfig.h.
remove handling of SplitDWO from the library.

Harbormaster completed remote builds in B75868: Diff 299664.Oct 21 2020, 7:07 AM

I really do think this needs to be under object. It can be separate in there as lib/Object/Copy if you want, but I don't think it should be a parallel directory. Hopefully the updates for the move aren't overly onerous here.

-eric

In D88827#2344900, @echristo wrote:

I really do think this needs to be under object. It can be separate in there as lib/Object/Copy if you want, but I don't think it should be a parallel directory. Hopefully the updates for the move aren't overly onerous here.

-eric

Won't that mean that any tool that just wants to read in object files (e.g. lld) will need to build this extra code for no reason? Not a huge deal I guess (especially for lld which due to LTO pulls in all of llvm!) but I imagine there are many tools that just want to read (and not modify) objects.

My 0.02$: (perhaps, this should have been mentioned earlier) the current class CopyConfig contains e.g. file names (again, imo it is good enough for a tool, but not good enough for a library) and this means that if somebody wants to add a section to an object file he won't be able to accomplish this task using the current interface without creating extra files. It kind of defeats the idea. To solve this problem proper abstractions should be introduced / the code needs to be refactored. Personally I would strongly prefer to see the following iterative approach here: refactor the current code in llvm-objcopy step by step until it's ready to be moved into a library with a clean and easy-to-use interface. Maybe I'm missing something, but doing refactoring post factum seems to be a less controllable process and might get us to the state where the code has been move out of the tool, the interface has been modified to accomplish a very specific task and the rest (burden) will stay there for years creating more issues than benefits, moreover, it would introduce some risks.
Regarding where to place these functions - into libObject or create a separate library - libObject already contains several write* functions, (e.g. for archives), so indeed, putting this group of functions (e.g. one can use a bit less verbose name - copy(...)) into libObject seems to be quite natural.

llvm/lib/ObjCopy/COFF/Object.cpp
1	I'm very sorry, but i still think that the old names were good, adding these prefixes is unnecessary and makes things less intuitive (e.g. class Object is described in Object.h).

In D88827#2344916, @sbc100 wrote:

In D88827#2344900, @echristo wrote:

I really do think this needs to be under object. It can be separate in there as lib/Object/Copy if you want, but I don't think it should be a parallel directory. Hopefully the updates for the move aren't overly onerous here.

-eric

Won't that mean that any tool that just wants to read in object files (e.g. lld) will need to build this extra code for no reason? Not a huge deal I guess (especially for lld which due to LTO pulls in all of llvm!) but I imagine there are many tools that just want to read (and not modify) objects.

That will depend on the archiver/linker behaviour, I expect, but at least for standard ld.lld behaviour (I can't speak for others), only the objects that actually are needed are pulled from the archive. Therefore, for example, if LLD didn't reference anything in the objcopy codebase, it wouldn't actually use those archive members, and therefore it wouldn't be added to the tools code. Even if it did, linker options like --gc-sections or equivalent likely would cause the unused code to be removed at link time, if the tool is built appropriately.

@alexshap's point about other files in the Object library allowing writing is persuasive to me. I think an Objcopy sub-directory would make sense though, to avoid the confusion of having both an ObjectFile and Object class/header file etc in Object. This is assuming of course we don't want to jump into refactoring the two classes so that they become one. I suspect that way might prove tricky to get right.

I'm somewhat ambivalent as to whether we move then refactor or refactor then move. If people feel that the latter approach is the right one, then so be it.

Regarding the naming of Object.h, that's fine, leave it as-is. I suspect with some small CMake changes, it should be possible to get the files to appear in a corresponding "solution folder" within the VS IDE, which would help disambiguate things.

In D88827#2346567, @jhenderson wrote:

In D88827#2344916, @sbc100 wrote:

In D88827#2344900, @echristo wrote:

I really do think this needs to be under object. It can be separate in there as lib/Object/Copy if you want, but I don't think it should be a parallel directory. Hopefully the updates for the move aren't overly onerous here.

-eric

Won't that mean that any tool that just wants to read in object files (e.g. lld) will need to build this extra code for no reason? Not a huge deal I guess (especially for lld which due to LTO pulls in all of llvm!) but I imagine there are many tools that just want to read (and not modify) objects.

That will depend on the archiver/linker behaviour, I expect, but at least for standard ld.lld behaviour (I can't speak for others), only the objects that actually are needed are pulled from the archive. Therefore, for example, if LLD didn't reference anything in the objcopy codebase, it wouldn't actually use those archive members, and therefore it wouldn't be added to the tools code. Even if it did, linker options like --gc-sections or equivalent likely would cause the unused code to be removed at link time, if the tool is built appropriately.

You are right, the size of resulting tools won't be increased. I was referring to the fact that such a tool would transitively depend on more sources and would take longer to compile, and would become dirty more often on average after a git pull. For example, its good that one can build llvm-objdump without compiling the whole of llvm.

@alexshap's point about other files in the Object library allowing writing is persuasive to me. I think an Objcopy sub-directory would make sense though, to avoid the confusion of having both an ObjectFile and Object class/header file etc in Object. This is assuming of course we don't want to jump into refactoring the two classes so that they become one. I suspect that way might prove tricky to get right.

I'm somewhat ambivalent as to whether we move then refactor or refactor then move. If people feel that the latter approach is the right one, then so be it.

Regarding the naming of Object.h, that's fine, leave it as-is. I suspect with some small CMake changes, it should be possible to get the files to appear in a corresponding "solution folder" within the VS IDE, which would help disambiguate things.

To summarize comments - I am going to:

create preliminary patch for Buffers.
create preliminary patch for CopyConfig.
move handling of SplitDWO into llvm-objcopy.cpp.
move this functionality into Object library.
do not rename files.

@alexshap @jhenderson @echristo

Folks, Before creating a patch I would like to consult what would be the best option to refactor the Buffer class.
One of the alternatives is described here https://reviews.llvm.org/D88827#2321871 ("Adapter" approach).

I think the better solution would be to use raw_ostream instead of buffers:

current:

Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::ELFObjectFileBase &In, Buffer &Out);

new:

Error executeObjcopyOnBinary(const CopyConfig &Config,
                             object::ELFObjectFileBase &In, raw_ostream &Out);

Generally, using streams could allow us to reduce memory usages. No need to load all data into the memory - the data could be streamed through a smaller buffer. Opposite, the current WritableMemoryBuffer, used in llvm-objcopy, would allocate whole data into the memory. Thus replacing WritableMemoryBuffer with raw_ostream
would allow minimizing memory requirements.

FileOutputBuffer(used by llvm-objcopy) has an advantage over raw_fd_ostream(which might be used if we would like
to store data into the file). FileOutputBuffer::createOnDiskBuffer() allows to use memory mapped file. The similar functionality
could be implemented for raw_fd_ostream. It is possible to add preallocate() method into raw_ostream.

class raw_ostream {

void preallocate(uint64_t size);

}

That method, implemented for raw_fd_ostream, could create a memory-mapped file. The streamed data would be written
into that memory file then. Thus we would be able to use memory-mapped files with raw_fd_ostream.

So, it seems we could use raw_ostream instead of Buffer without losing functionality.
It seems to me that raw_ostream is a good abstraction here and it would be good to use it for llvm-objcopy.

So what is your opinion, would it be OK to use raw_ostream?
Or should we use "Adapter" approach, from https://reviews.llvm.org/D88827#2321871 ?

Using a stream approach sounds reasonable. I don't really know what the benefits are of using a memory mapped file versus other options (I vaguely recall from some older work that they improve performance, but am not sure if that is still the case or not). The one concern I'd have with a stream for writing output is if we ever need to jump back and forth within the object for some reason. Without looking at the existing objcopy code, I don't know if there are any instances where this happens though.

By the way, I think the preallocate method might be better termed reserve as it sounds like it solves a similar intent as std::vector::reserve to me.

Using a stream approach sounds reasonable. I don't really know what the benefits are of using a memory mapped file versus other options (I vaguely recall from some older work that they improve performance, but am not sure if that is still the case or not). The one concern I'd have with a stream for writing output is if we ever need to jump back and forth within the object for some reason. Without looking at the existing objcopy code, I don't know if there are any instances where this happens though.

If there is such necessity then instead of raw_ostream there could be used raw_pwrite_stream, which allows such seek&update functionality.

During this refactoring effort, I am not going to rewrite the existing objcopy writing code.
So I plan to do things this way:

Error executeObjcopyOnBinary(Config,In, raw_ostream &Out) {
  // TODO: refactor "writing" code to output into "raw_ostream &Out"
  // TOFO: directly, without MemBuffer in the middle.
  MemBuffer.allocate(Size)
  // existing writing code.
  MemBuffer.commit();
  Out.write(MemBuffer.getBufferStart(), MemBuffer.getBufferSize());
}

I propose to use raw_ostream now and replace it with raw_pwrite_stream later, if it would be necessary.
If it would be hard to use raw_ostream during such postponed rewriting then we could change raw_ostream into raw_pwrite_stream at that moment.

If it is already clear that we need to use raw_pwrite_stream then I will use it within my patch.

By the way, I think the preallocate method might be better termed reserve as it sounds like it solves a similar intent as std::vector::reserve to me.

avl mentioned this in D91028: [llvm-objcopy][NFC] replace class Buffer/MemBuffer/FileBuffer with streams..Nov 8 2020, 2:58 AM

avl mentioned this in D91693: [Support] Add reserve() method to the raw_ostream..Nov 20 2020, 5:37 AM

sgraenitz mentioned this in D96627: [WIP] Implement JITLoaderGDB ObjectLinkingLayer plugin for ELF x86-64.Feb 17 2021, 3:16 AM

sgraenitz added a subscriber: sgraenitz.Feb 17 2021, 3:19 AM

avl mentioned this in rG4f16e177e104: [llvm-objcopy][NFC] replace class Buffer/MemBuffer/FileBuffer with streams..Mar 10 2021, 12:52 PM

avl mentioned this in D99055: [llvm-objcopy] Refactor CopyConfig structure..Mar 31 2021, 7:35 AM

addressed comments(usages of Buffers are replaced with streams, CopyConfig is refactored,
handling of SplitDWO is moved into llvm-objcopy.cpp, implementation of
llvm-objcopy is moved into the Object library).

Harbormaster completed remote builds in B124160: Diff 372901.Sep 16 2021, 5:42 AM

avl edited the summary of this revision. (Show Details)Sep 16 2021, 6:18 AM

@jhenderson @alexander-shaposhnikov

Would you mind to take a look at this review, please? It implemented all requests which were done previously https://reviews.llvm.org/D88827#2349119.

In D88827#3003632, @avl wrote:

@jhenderson @alexander-shaposhnikov

Would you mind to take a look at this review, please? It implemented all requests which were done previously https://reviews.llvm.org/D88827#2349119.

Sorry I haven't got to this - there are seeral higher-priority items that I need to be focusing on. It's still on my radar though.

ping,

In D88827#3118304, @avl wrote:

ping,

Apologies for not getting to this - a number of other reviews have used up what time I've had to do reviews.

This patch is looking very close to being ready, but not quite there yet.

llvm/include/llvm/Object/ObjCopy/ConfigManager.h
24–25 ↗	(On Diff #372901)	This should be a doc comment if it is needed. I don't think the comment is particularly useful in its current form. The functionality is fairly obvious from its interface.
33 ↗	(On Diff #372901)	The functionality in this and similar functions for wasm and Mach-O should be moved to a .cpp file. It's not unlikely that it will change over time, so we don't want to force rebuilds due to it being unnecessarily in the header.
llvm/include/llvm/Object/ObjCopy/ObjCopy.h
12–18 ↗	(On Diff #372901)	I think you can forward declare most of these classes, and include the headerse in the .cpp rather than having this header include them all. I think about the only ones you likely need are Error.h, and vector.
llvm/lib/ObjCopy/COFF/Object.cpp
1	Actually, I think the way to solve this is to use some CMake functionality to stick specific files in IDE folders, so that they don't end up clashing: in the Visual Studio IDE, for example, I see 4 Object.h files next to each other in the Header Files group, but there's no indication which is which. Instead, we could create COFF, ELF etc sub-groups for the headers and source files. I believe https://cmake.org/cmake/help/latest/command/source_group.html is the relevant piece of CMake.
llvm/lib/Object/CMakeLists.txt
32–47 ↗	(On Diff #372901)	I thought ObjCopy was going to be its own distinct library, not a part of the Object library? It would make more sense to me for it to be separate, as it's a fairly distinct piece of functionality.
llvm/tools/llvm-objcopy/COFF/COFFObjcopy.h
26–28	I've no issue with adding comments like these, but could you spin them off into separate patches, please, as they are independently useful, and this will help reduce the size of this patch. Same applies for other new doc comments.
llvm/tools/llvm-objcopy/ELF/ELFObjcopy.cpp
171	As you're renaming files, it's probably worth doing a complete clang-format of the whole file at the same time. Applies to all files with clang-format issues in otherwise untouched code.
llvm/unittests/Object/ObjCopyTest.cpp
9 ↗	(On Diff #372901)	Here and in other new files, if you haven't already, please make sure the include set is minimal for the code in the new file.
35 ↗	(On Diff #372901)	As this function is called from several places, you may want to add some tracing to it: https://github.com/google/googletest/blob/main/docs/advanced.md#adding-traces-to-assertions If you don't, and the test fails, you may not be able to easily tell which test is causing the failure.
39 ↗	(On Diff #372901)	I believe YAML is an acronym, so should be all-caps in comments.
49–50 ↗	(On Diff #372901)	I was under the impression that temporary files are supposed to be deleted automatically (I might be wrong), so you wouldn't need the `FileRemover`? More generally, the need to create a concrete file in the unit test makes me wonder whether we could enhance the objcopy API to take an object of some kind that could represent data in memory (as an alternative to on-disk). I believe such an object hierarchy already exists within LLVM, although the name escapes me right now.

avl added inline comments.Nov 18 2021, 2:54 AM

llvm/lib/Object/CMakeLists.txt
32–47 ↗	(On Diff #372901)	It was directly requested to implement it as part of Object library during the previous review iteration: https://reviews.llvm.org/D88827#2344900

jhenderson added inline comments.Nov 18 2021, 3:07 AM

llvm/lib/Object/CMakeLists.txt
32–47 ↗	(On Diff #372901)	Right, there was some opposition from @sbc100 though. I think we need to hear their thoughts on that. If we do end up putting it in libObject, should the namespace be changed? Not sure either way.

avl added inline comments.Nov 18 2021, 3:27 AM

llvm/lib/Object/CMakeLists.txt
32–47 ↗	(On Diff #372901)	Do you mean: objcopy->object? objcopy->objcopybase? I think it is fine to have objcopy(like in this patch). Changing objcopy->object creates some names clashes. Thus we need to have other namespace than object for the objcopy part. Having objcopy inside Object library and inside llvm-objcopy tool seems to be no problem.

jhenderson added inline comments.Nov 18 2021, 3:28 AM

llvm/lib/Object/CMakeLists.txt
32–47 ↗	(On Diff #372901)	I was referring to the first of those (using `object` instead of `objcopy`). If using the same namespace would be a problem, I don't see any issue with leaving as-is.

avl added inline comments.Nov 18 2021, 3:38 AM

llvm/unittests/Object/ObjCopyTest.cpp
49–50 ↗	(On Diff #372901)	I was under the impression that temporary files are supposed to be deleted automatically (I might be wrong), so you wouldn't need the FileRemover? I do not see the place where automatic removing is implemented. Also, there are examples of FileRemover usages in other unit tests: https://github.com/llvm/llvm-project/blob/7b6790850968031fe1c098ed6dcc196ddc547ea5/llvm/unittests/Support/MemoryBufferTest.cpp#L105 More generally, the need to create a concrete file in the unit test makes me wonder whether we could enhance the objcopy API to take an object of some kind that could represent data in memory (as an alternative to on-disk). I believe such an object hierarchy already exists within LLVM, although the name escapes me right now. I think it would be a good enhancement. I propose to do it in a separate patch though.

jhenderson added inline comments.Nov 18 2021, 4:03 AM

llvm/unittests/Object/ObjCopyTest.cpp
49–50 ↗	(On Diff #372901)	I do not see the place where automatic removing is implemented For Windows, see references to the OF_Delete flag in Path.inc. For Linux, the behaviour is configured higher up the stack. See `RemoveFileOnSignal` referenced by `TempFile::create`. At the location you cited, `FileRemover` is needed, because we're not working with a `TempFile` there. I think it would be a good enhancement. I propose to do it in a separate patch though. Agreed, but I think it would be useful to see it before this patch gets committed, to avoid a bad interface landing.

avl added inline comments.Nov 18 2021, 4:36 AM

llvm/unittests/Object/ObjCopyTest.cpp
49–50 ↗	(On Diff #372901)	Ah, the suggestion is to use sys::fs::TempFile::create instead of sys::fs::createTemporaryFile(). I see, thanks.

jhenderson added inline comments.Nov 18 2021, 4:38 AM

llvm/unittests/Object/ObjCopyTest.cpp
49–50 ↗	(On Diff #372901)	I didn't realise the two did different things, but it would make sense to do as you are saying, yes.

avl added inline comments.Nov 18 2021, 8:27 AM

llvm/unittests/Object/ObjCopyTest.cpp

49–50 ↗

(On Diff #372901)

I think we could not use sys::fs::TempFile::create() for reusing automatic removing. Please look at the use case:

Expected<sys::fs::TempFile> Temp =
  sys::fs::TempFile::create("a.temp-unittest-%%%%%%", Mode);

Config.Common.OutputFilename = Temp.TmpName;

// Call executeObjcopyOnBinary()
std::error_code EC;
raw_fd_ostream OutStream(Temp->FD, false);
Error Err = objcopy::executeObjcopyOnBinary(Config, *Obj.get(), OutStream);
OutStream.flush();

// if we call Temp.keep() here then the autoremoving functionality would be lost.
// if we call Temp.discard() then the file would be removed.

// if we not call Temp.keep() or Temp.discard()
// then we would try to open the temporarily file second time
// (inside ObjectFile::createObjectFile) which is undesirable.

// Load and check copied file.
Expected<OwningBinary<ObjectFile>> Result =
    ObjectFile::createObjectFile(Config.Common.OutputFilename);

jhenderson added inline comments.Nov 19 2021, 12:00 AM

llvm/unittests/Object/ObjCopyTest.cpp
49–50 ↗	(On Diff #372901)	Fair enough. Stick with this for now, although that maybe suggests we should change the `executeObjcopyOnBinary` interface sooner rather than later, so that it can take an in-memory buffer, instead of an on-disk file.

addressed comments:

deleted doc comments(will add them with the follow-up patch).
moved methods implementation into .cpp file.
optimized headers.
did not modify cmake to create IDE folders(will do it with the follow-up patch).
added traces in the test assertions.
rewrote test to use inmemory output file.

Harbormaster completed remote builds in B135451: Diff 388956.Nov 22 2021, 4:16 PM

A couple of nits remaining, otherwise basically looks good. I'd still like:

@sbc100's response to the library division. I have a very slight preference for separate, but don't care enough about it to push for it, if others are opposed. Adding them as a reviewer in case it gets their attention.
I'd like to see patches based on this patch for the other issues that your deferring (particularly the IDE one).

Once those are dealt with, I'll give this an LGTM.

llvm/include/llvm/Object/ObjCopy/ObjCopy.h
12 ↗	(On Diff #388956)	I think you can forward declare `Binary`, `Archive` and `NewArchiveMember`, right, so that you don't need this header?
llvm/tools/llvm-objcopy/ObjcopyOptions.cpp
627 ↗	(On Diff #388956)	More clang-formatting that can be done as you rename the file.

Oh, ideally I'd also get a second reviewer to give this a once-over, in case I've missed anything too.

@sbc100's response to the library division. I have a very slight preference for separate, but don't care enough about it to push for it, if others are opposed. Adding them as a reviewer in case it gets their attention.

My own preference is also to make ObjCopy to be separate library, though I also do not want to push it if others want opposite. I read this message https://reviews.llvm.org/D88827#2344900 and this https://reviews.llvm.org/D88827#2346399 and this https://reviews.llvm.org/D88827#2346567 responses as a consensus on making it to be part of Object library. If we do not have a consensus, I am open to discuss it further.

llvm/include/llvm/Object/ObjCopy/ObjCopy.h
12 ↗	(On Diff #388956)	std::vector<NewArchiveMember> wants to know sizeof (NewArchiveMember). So, we could not have forward declaration here. i.e. we still need ArchiveWriter.h header.

addressed clang-format issue.

avl added a child revision: D114429: [objcopy][NFC] Add doc comments to the executeObjcopy* functions..Nov 23 2021, 2:58 AM

In D88827#3148315, @avl wrote:

@sbc100's response to the library division. I have a very slight preference for separate, but don't care enough about it to push for it, if others are opposed. Adding them as a reviewer in case it gets their attention.

My own preference is also to make ObjCopy to be separate library, though I also do not want to push it if others want opposite. I read this message https://reviews.llvm.org/D88827#2344900 and this https://reviews.llvm.org/D88827#2346399 and this https://reviews.llvm.org/D88827#2346567 responses as a consensus on making it to be part of Object library. If we do not have a consensus, I am open to discuss it further.

I changed my mind a bit since my early comment you linked, having gone cold on the review and now having come back to it. Here's my thinking: the object manipulation performed by the objcopy code is based on an internal Object class that has nothing to do with the object classes within the libObject library. By putting the two inside the same library, we risk confusion ("which type of Object do I need for this functionality?"). There are fundamental differences which would make reusing the libObject classes for the objcopy code less than ideal - it may not even be possible without a lot of work.

Regarding the Archive writing code being in libObject: it's my tentative opinion that archive functionality shouldn't be in libObject at all: an archive isn't really an object itself: it's a group of objects. The functionality for archives is completely unrelated to the functionality for other file types supported by the object library. As such, I think it should actually be moved into a separate libArchive library or similar.

Finally, there's @sbc100's point earlier: whilst it's true that by having the code in the same library doesn't force tools to link in that code, it does require distributions to include the llvm-objcopy library code anytime it needs the libObject code, because it's part of that library, even though it may not need it. I've seen several instances internally where we generate packages of LLVM libraries, without including all of them, so including the llvm-objcopy code in the libObject library would bloat the size of the package.

@echristo, @alexander-shaposhnikov, do either of you have any further thoughts? What do you think of my points above? I'm not strongly opposed to it going in libObject, but just think it makes a little more sense to be separate. Also, would either of you be able to give this a review more generally?

llvm/include/llvm/Object/ObjCopy/ObjCopy.h
12 ↗	(On Diff #388956)	You learn something new everyday. I thought the type used in std::vector could also be forward declared if it was just used in the return signature. Guess I was wrong.

jhenderson added a reviewer: echristo.Nov 23 2021, 3:28 AM

Harbormaster completed remote builds in B135580: Diff 389132.Nov 23 2021, 5:35 AM

@echristo, @alexander-shaposhnikov Do you have objections to doing Objcopy to be a separate library instead of putting it into the existing Object library?

I agree with concerns raised by @jhenderson (https://reviews.llvm.org/D88827#3148452) and @sbc100 (https://reviews.llvm.org/D88827#2348439): Doing Objcopy as a separate library would allow to reduce code inter-dependency and reduce size of resulting code/package(when objcopy functionality is not neccessary).

@echristo, @alexander-shaposhnikov ping...

@echristo, @alexander-shaposhnikov Would you mind taking part in this review, please? Your opinion might help to make a progress with it. During previous iterations, you preferred the variant when ObjCopy code would be moved into the Object library. There are arguments to make ObjCopy be a separate library. Do you have objections to that?

@echristo, @alexander-shaposhnikov ping...

@jhenderson James, what do you think would be right to do if we do not have a response from others?

Both solutions(make it be a separate library or make it be part of Object) are OK for me.
It would be good if we might have a progress with this patch. Probably we can make a decision and continue with it?

@avl, I'd suggest emailing them directly to prod them, as they aren't responding. If they don't respond after that, I think you should adopt what active reviewers (i.e. me!) are suggesting.

On the note of activity, I will be off for at least 2 weeks, starting tomorrow, so may not be around to continue reviewing in that time!

In D88827#3240313, @jhenderson wrote:

@avl, I'd suggest emailing them directly to prod them, as they aren't responding. If they don't respond after that, I think you should adopt what active reviewers (i.e. me!) are suggesting.

On the note of activity, I will be off for at least 2 weeks, starting tomorrow, so may not be around to continue reviewing in that time!

I see, thanks! will follow that advice.

In D88827#3148452, @jhenderson wrote:

In D88827#3148315, @avl wrote:

@sbc100's response to the library division. I have a very slight preference for separate, but don't care enough about it to push for it, if others are opposed. Adding them as a reviewer in case it gets their attention.

My own preference is also to make ObjCopy to be separate library, though I also do not want to push it if others want opposite. I read this message https://reviews.llvm.org/D88827#2344900 and this https://reviews.llvm.org/D88827#2346399 and this https://reviews.llvm.org/D88827#2346567 responses as a consensus on making it to be part of Object library. If we do not have a consensus, I am open to discuss it further.

I changed my mind a bit since my early comment you linked, having gone cold on the review and now having come back to it. Here's my thinking: the object manipulation performed by the objcopy code is based on an internal Object class that has nothing to do with the object classes within the libObject library. By putting the two inside the same library, we risk confusion ("which type of Object do I need for this functionality?"). There are fundamental differences which would make reusing the libObject classes for the objcopy code less than ideal - it may not even be possible without a lot of work.

The original idea is that these classes -could- be used for this. If that's not true we should go back and fix that rather than have a separate set. That said, we do have a separate set which complicates matters. I think the question I'd ask is "how can we get from here to there?". No straightforward answer here for sure.

Regarding the Archive writing code being in libObject: it's my tentative opinion that archive functionality shouldn't be in libObject at all: an archive isn't really an object itself: it's a group of objects. The functionality for archives is completely unrelated to the functionality for other file types supported by the object library. As such, I think it should actually be moved into a separate libArchive library or similar.

From my perspective: ideally an archive should just be a specific type of object that contains other objects (similar to perhaps a shared library if we did that). Reading it just means cracking it, sharing the archive format information, and then iterating through each object accordingly. Writing would be the reverse. Any thoughts on where we're on a different page? :)

Finally, there's @sbc100's point earlier: whilst it's true that by having the code in the same library doesn't force tools to link in that code, it does require distributions to include the llvm-objcopy library code anytime it needs the libObject code, because it's part of that library, even though it may not need it. I've seen several instances internally where we generate packages of LLVM libraries, without including all of them, so including the llvm-objcopy code in the libObject library would bloat the size of the package.

Are you thinking about things just shipping .a or .o files or something else? Most platforms that I can think of can remove unneeded object code in a binary link so I'm curious about the concern here.

@echristo, @alexander-shaposhnikov, do either of you have any further thoughts? What do you think of my points above? I'm not strongly opposed to it going in libObject, but just think it makes a little more sense to be separate. Also, would either of you be able to give this a review more generally?

I'll do my best :)

-eric

llvm-objcopy has its own data models for some binary format specific things, so placing it under the llvm::object namespace may lead to some conflicts. Placing it into a sub-namespace does not avoid the conflict.
If it keeps using the llvm::objcopy namespace, then placing it under lib/Object may cause some confusion. A separate directory looks good to me.

An archive is a quite different container format. So I get the point that there is a question about whether it fits into lib/Object.
On the other hand, an archive is sometimes an input file type and used nearly indistinguishable with other file typed in syntax (ld.lld, llvm-readelf, ...). Having it in lib/Object seems fine to me...

By placing some less associated code in the same library, there is a risk that dependencies tend to become mixed.
Layered component design has some advantages but in practice it is easy to run into a situation that for all of {a,b,c}.cpp to include all of {a,b,c}.h...
Different lib/* directories tend to force developers to think harder on the layering.

In D88827#3242326, @echristo wrote:

I changed my mind a bit since my early comment you linked, having gone cold on the review and now having come back to it. Here's my thinking: the object manipulation performed by the objcopy code is based on an internal Object class that has nothing to do with the object classes within the libObject library. By putting the two inside the same library, we risk confusion ("which type of Object do I need for this functionality?"). There are fundamental differences which would make reusing the libObject classes for the objcopy code less than ideal - it may not even be possible without a lot of work.

The original idea is that these classes -could- be used for this. If that's not true we should go back and fix that rather than have a separate set. That said, we do have a separate set which complicates matters. I think the question I'd ask is "how can we get from here to there?". No straightforward answer here for sure.

I guess I might have misrepresented the internals of the llvm-objcopy classes: to populate them, they do use the libObject code. The llvm-objcopy Object class acts as another layer that provides manipulation functionality on top of the libObject object files. In some ways, it could actually be used purely for reading though, which raises some interesting thoughts which I can't exactly express. If we wanted to change the existing libObject code to support direct manipulation, without this extra layer, we'd need to change how those classes store data - at the moment, they just consist of references into memory buffers, rather than copies stored within the program. As such, it is impossible to change these references without changing how data is backed. The llvm-objcopy library provides this additional layer in essence. I think this shows that, at least at the moment, the llvm-objcopy Object code is a separate layer. Whethere that then means the separate layer should be in a separate library is probably a different point (it means it can be, but doesn't mean it must be.

Regarding the Archive writing code being in libObject: it's my tentative opinion that archive functionality shouldn't be in libObject at all: an archive isn't really an object itself: it's a group of objects. The functionality for archives is completely unrelated to the functionality for other file types supported by the object library. As such, I think it should actually be moved into a separate libArchive library or similar.

From my perspective: ideally an archive should just be a specific type of object that contains other objects (similar to perhaps a shared library if we did that). Reading it just means cracking it, sharing the archive format information, and then iterating through each object accordingly. Writing would be the reverse. Any thoughts on where we're on a different page? :)

I don't think we really disagree here (although I wouldn't call an archive an object in those words). The point I was largely addressing was the earlier point made by @alexander-shaposhnikov about how the object library already allows manipulation of archives, unlike the other object formats.

Finally, there's @sbc100's point earlier: whilst it's true that by having the code in the same library doesn't force tools to link in that code, it does require distributions to include the llvm-objcopy library code anytime it needs the libObject code, because it's part of that library, even though it may not need it. I've seen several instances internally where we generate packages of LLVM libraries, without including all of them, so including the llvm-objcopy code in the libObject library would bloat the size of the package.

Are you thinking about things just shipping .a or .o files or something else? Most platforms that I can think of can remove unneeded object code in a binary link so I'm curious about the concern here.

Yeah, we have some cases where we distribute .a files (or more specifically Windows .lib files, but the principle is the same). A couple of concrete examples to illustrate the points: firstly, if I want to build LLD, I don't need the llvm-objcopy code. Let's say llvm-objcopy included 5 files. If those 5 files are included in libObject, they need to be compiled and added to that library, as part fo the build process for building LLD, even though those files aren't used in LLD's link, thus unnecessarily slowing down the build as a whole. Secondly, if I have a distribution that wants to ship LLVM libraries sufficient for developers to read objects, we'd end up distributing those same 5 files as part of libObject, unnecessarily. The end executables won't be any bigger in either case of course.

@echristo, @alexander-shaposhnikov, do either of you have any further thoughts? What do you think of my points above? I'm not strongly opposed to it going in libObject, but just think it makes a little more sense to be separate. Also, would either of you be able to give this a review more generally?

I'll do my best :)

-eric

FYI, I'm on leave for the next 2-3 weeks, so won't be able to respond to further points after this for a while.

In D88827#3242580, @MaskRay wrote:

llvm-objcopy has its own data models for some binary format specific things, so placing it under the llvm::object namespace may lead to some conflicts. Placing it into a sub-namespace does not avoid the conflict.

Not sure I follow how placing things into, say, llvm::object::objcopy would have potential for conflicts beyond the current state of the code?

In D88827#3243266, @jhenderson wrote:

In D88827#3242326, @echristo wrote:

Finally, there's @sbc100's point earlier: whilst it's true that by having the code in the same library doesn't force tools to link in that code, it does require distributions to include the llvm-objcopy library code anytime it needs the libObject code, because it's part of that library, even though it may not need it. I've seen several instances internally where we generate packages of LLVM libraries, without including all of them, so including the llvm-objcopy code in the libObject library would bloat the size of the package.

Are you thinking about things just shipping .a or .o files or something else? Most platforms that I can think of can remove unneeded object code in a binary link so I'm curious about the concern here.

Yeah, we have some cases where we distribute .a files (or more specifically Windows .lib files, but the principle is the same). A couple of concrete examples to illustrate the points: firstly, if I want to build LLD, I don't need the llvm-objcopy code. Let's say llvm-objcopy included 5 files. If those 5 files are included in libObject, they need to be compiled and added to that library, as part fo the build process for building LLD, even though those files aren't used in LLD's link, thus unnecessarily slowing down the build as a whole. Secondly, if I have a distribution that wants to ship LLVM libraries sufficient for developers to read objects, we'd end up distributing those same 5 files as part of libObject, unnecessarily. The end executables won't be any bigger in either case of course.

there is also a scenario when end executables would be bigger(though it is not about shipping, this is for debug builds). It is assumed that --gc-sections or similar will remove unused code and then the final executable would be small. That is not exactly true for debug builds. --gc-sections will not remove unused debuginfo. Thus end debug executables would contain unnecessary debuginfo and then will be bigger.

In D88827#3243279, @avl wrote:

In D88827#3243266, @jhenderson wrote:

In D88827#3242326, @echristo wrote:

Finally, there's @sbc100's point earlier: whilst it's true that by having the code in the same library doesn't force tools to link in that code, it does require distributions to include the llvm-objcopy library code anytime it needs the libObject code, because it's part of that library, even though it may not need it. I've seen several instances internally where we generate packages of LLVM libraries, without including all of them, so including the llvm-objcopy code in the libObject library would bloat the size of the package.

Are you thinking about things just shipping .a or .o files or something else? Most platforms that I can think of can remove unneeded object code in a binary link so I'm curious about the concern here.

Yeah, we have some cases where we distribute .a files (or more specifically Windows .lib files, but the principle is the same). A couple of concrete examples to illustrate the points: firstly, if I want to build LLD, I don't need the llvm-objcopy code. Let's say llvm-objcopy included 5 files. If those 5 files are included in libObject, they need to be compiled and added to that library, as part fo the build process for building LLD, even though those files aren't used in LLD's link, thus unnecessarily slowing down the build as a whole. Secondly, if I have a distribution that wants to ship LLVM libraries sufficient for developers to read objects, we'd end up distributing those same 5 files as part of libObject, unnecessarily. The end executables won't be any bigger in either case of course.

there is also a scenario when end executables would be bigger(though it is not about shipping, this is for debug builds). It is assumed that --gc-sections or similar will remove unused code and then the final executable would be small. That is not exactly true for debug builds. --gc-sections will not remove unused debuginfo. Thus end debug executables would contain unnecessary debuginfo and then will be bigger.

This isn't generally true: even in debug builds and without --gc-sections, linkers tend not to link in unused objects into the final executable.

This isn't generally true: even in debug builds and without --gc-sections, linkers tend not to link in unused objects into the final executable.

I remember there was a patch to make it possible - https://reviews.llvm.org/D54747 . Which finally was not applied and current behavior that it is not working(if I did not miss something new).

In D88827#3243298, @avl wrote:

This isn't generally true: even in debug builds and without --gc-sections, linkers tend not to link in unused objects into the final executable.

I remember there was a patch to make it possible - https://reviews.llvm.org/D54747 . Which finally was not applied and current behavior that it is not working(if I did not miss something new).

The difference here is that if you feed the linker a specific set of object files, it will include all of them (minus what --gc-sections removes). But if you feed the linker a static library, it will only include the object files that actually are referenced (unless the library is added with the --whole-archive option or something similar).

In D88827#3243317, @mstorsjo wrote:

In D88827#3243298, @avl wrote:

This isn't generally true: even in debug builds and without --gc-sections, linkers tend not to link in unused objects into the final executable.

I remember there was a patch to make it possible - https://reviews.llvm.org/D54747 . Which finally was not applied and current behavior that it is not working(if I did not miss something new).

The difference here is that if you feed the linker a specific set of object files, it will include all of them (minus what --gc-sections removes). But if you feed the linker a static library, it will only include the object files that actually are referenced (unless the library is added with the --whole-archive option or something similar).

Ah, right. We are talking about library here.

@jhenderson accurately expressed my considerations / original reasoning. Sorry about the very late reply. Perhaps, it's worth adding a few words.
I kind of envisioned the following minimalistic interface exposed by the library (essentially just a single function):

Error copy(const Config &C, const object::Object &O, raw_ostream &Out)

My hope was that we would refactor Config significantly and clean it up with 2 major goals in mind: (1) have proper separation of concerns and structure: Config { ... <common options>, COFFConfig, ELFConfig, MachOConfig };
(in particular, the fields which are not required anymore would be removed) (2) the functionality which is specific to the tool (e.g. creating a .DWO file) would live in the tool and would not be a part of the library. The same applies e.g. to setting file attributes.
The question wether it should be a separate library of a part of libObject - I had mixed feelings, based on the interface I thought it was appropriate to have it in libObject, but making it a separate library also makes sense to me.

@alexander-shaposhnikov My understaning is that current state of this patch is pretty close to above description:

I kind of envisioned the following minimalistic interface exposed by the library (essentially just a single function):

Error copy(const Config &C, const object::Object &O, raw_ostream &Out)

Error executeObjcopyOnBinary(const MultiFormatConfig &Config,
                             object::Binary &In, raw_ostream &Out);

This function does dispatch on input formats and calls format-specific function.

It is not a single function though. I think we may refactor it further with separate patches.

My hope was that we would refactor Config significantly and clean it up with 2 major goals in mind: (1) have proper separation of concerns and structure: Config { ... <common options>, COFFConfig, ELFConfig, MachOConfig };
(in particular, the fields which are not required anymore would be removed)

Config was refactored by following patches :

https://reviews.llvm.org/D99055
https://reviews.llvm.org/D102277
https://reviews.llvm.org/D103260

(2) the functionality which is specific to the tool (e.g. creating a .DWO file) would live in the tool and would not be a part of the library. The same applies e.g. to setting file attributes.

these patches did it possible to leave creating a .DWO file, setting attributes, creating streams inside the tool:

https://reviews.llvm.org/D98582
https://reviews.llvm.org/D98511
https://reviews.llvm.org/D91028
https://reviews.llvm.org/D98426
https://reviews.llvm.org/D95478

@echristo Are you satisfied with response by @jhenderson ?

following is a reminder, if messages are lost in the history:

That is @echristo message - https://reviews.llvm.org/D88827#3242326
That is @jhenderson response - https://reviews.llvm.org/D88827#3243266

move implementation into the separate library ObjCopy.

Harbormaster completed remote builds in B149890: Diff 409152.Feb 15 2022, 10:55 PM

LGTM, with two nits.

llvm/include/llvm/ObjCopy/ObjCopy.h
23–28	This doesn't need to be in the header, right? I'd get rid of it, as it's not really part fo the public interface (`executeObjcopyOnArchive` would be the right thing to use, I believe(?)).
llvm/tools/llvm-objcopy/llvm-objcopy.cpp
9	Delete this blank line then resort the headers.

This revision is now accepted and ready to land.Feb 16 2022, 1:35 AM

avl added inline comments.Feb 16 2022, 2:19 AM

llvm/include/llvm/ObjCopy/ObjCopy.h
23–28	This also is used in lib/ObjCopy/MachO/MachOObjcopy.cpp. But, indeed, it might be removed from the public interface - include/llvm/ObjCopy/ObjCopy.h. Would it be OK if declaration of createNewArchiveMembers would be put into the new local header llvm/lib/ObjCopy/Archive.h ?

jhenderson added inline comments.Feb 16 2022, 2:20 AM

llvm/include/llvm/ObjCopy/ObjCopy.h
23–28	Yes, I think that should be okay.

addressed comments - moved createNewArchiveMembers into separate header.

Harbormaster completed remote builds in B149977: Diff 409266.Feb 16 2022, 9:04 AM

avl edited the summary of this revision. (Show Details)Feb 17 2022, 1:55 AM

This revision was landed with ongoing or failed builds.Feb 17 2022, 2:12 AM

Closed by commit rGf75da0c8e65c: [llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate… (authored by avl). · Explain Why

This revision was automatically updated to reflect the committed changes.

avl added a commit: rGf75da0c8e65c: [llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate….

@jhenderson Thank you for the review!

As far as I know, you also have to update the entries in clang/docs/tools/clang-formatted-files.txt.

llvm/lib/CMakeLists.txt
22	Can we be consistent about capitalization of "ObjCopy" or "Objcopy"? The new lib capitalizes the C, but e.g. `ObjcopyOptions.cpp` doesn't. (Don't care which way, just be consistent). % git grep Objcopy \| wc -l 150 % git grep ObjCopy \| wc -l 77
llvm/unittests/ObjCopy/CMakeLists.txt
2	I think you have to add_subdirectory this in llvm/unittests/CMakeLists.txt to get this test binary actually built.

@thakis Thanks! will implement comments with followup commit.

@thakis addressed comments in https://reviews.llvm.org/rG0b57e6c46b707c0e7a123efe82abf3c1e7b5a503. As to the consistent naming - will do it in the separate patch.

I'm also seeing a bunch of warnings that look related:

/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/ELF/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/MachO/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/MachO/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Reader.o) in output file used for input files: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Reader.o and: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Reader.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Writer.o) in output file used for input files: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Writer.o and: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Writer.o (due to use of basename, truncation, blank padding or duplicate input files)

In D88827#3335814, @thakis wrote:

I'm also seeing a bunch of warnings that look related:

/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/ELF/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/MachO/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Object.o) in output file used for input files: obj/llvm/lib/ObjCopy/MachO/ObjCopy.Object.o and: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Object.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Reader.o) in output file used for input files: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Reader.o and: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Reader.o (due to use of basename, truncation, blank padding or duplicate input files)
/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/libtool: warning same member name (ObjCopy.Writer.o) in output file used for input files: obj/llvm/lib/ObjCopy/wasm/ObjCopy.Writer.o and: obj/llvm/lib/ObjCopy/COFF/ObjCopy.Writer.o (due to use of basename, truncation, blank padding or duplicate input files)

My understanding is that it is not a problem for linking. But it may be a problem if one would want to extract the files from libLLVMObjCopy.a. To avoid this problem we need to rename files. Will do it with separate patch.

avl mentioned this in D120345: [objcopy][NFC] Rename files to avoid clashing of archive members..Feb 22 2022, 12:11 PM

Esme mentioned this in D97656: [llvm-objcopy] Initial XCOFF32 support..Feb 22 2022, 7:28 PM

avl mentioned this in rG25d7b4fb446b: [objcopy][NFC] Rename files to avoid clashing of archive members..Feb 23 2022, 2:07 AM

avl mentioned this in D121005: [objcopy][NFC] Move NameOrPattern::create() into CommonConfig.h.Mar 4 2022, 9:06 AM

avl mentioned this in rG2dc4a80ec026: [objcopy][NFC] Move NameOrPattern::create() into CommonConfig.h.Mar 9 2022, 2:05 AM

Revision Contents

Path

Size

llvm/

	include/	llvm/	ObjCopy/
		tools/	llvm-objcopy/

Buffer.h

6 lines

COFF/

COFFObjcopy.h

6 lines

CopyConfig.h

40 lines

ELF/

ELFConfig.h

6 lines

ELFObjcopy.h

6 lines

MachO/

MachOObjcopy.h

6 lines

wasm/

WasmObjcopy.h

6 lines

include/

llvm/

ObjCopy/

ObjCopy.h

34 lines

lib/

CMakeLists.txt

1 line

LLVMBuild.txt

1 line

ObjCopy/

CMakeLists.txt

27 lines

CopyConfig.cpp

50 lines

ObjCopy.cpp

131 lines

	ObjCopy/

LLVMBuild.txt

48 lines

	lib/	ObjCopy/
	tools/	llvm-objcopy/

Buffer.cpp

2 lines

COFF/

6 lines

6 lines

148 lines

8 lines

226 lines

8 lines

449 lines

ELF/

2 lines

6 lines

10 lines

2686 lines

MachO/

MachOLayoutBuilder.h

2 lines

MachOLayoutBuilder.cpp

388 lines

9 lines

7 lines

341 lines

9 lines

514 lines

354 lines

197 lines

	Util.h
	llvm-objcopy.h

24 lines

wasm/

6 lines

36 lines

6 lines

33 lines

6 lines

8 lines

78 lines

tools/

llvm-objcopy/

Buffer.h

Buffer.cpp

CMakeLists.txt

20 lines

COFF/

ELF/

MachO/

MachOLayoutBuilder.cpp

60 lines

	ParseConfig.cpp
	CopyConfig.cpp

47 lines

llvm-objcopy.h

llvm-objcopy.cpp

119 lines

wasm/

unittests/

1 line

ObjCopy/

CMakeLists.txt

11 lines

ObjCopyTest.cpp

120 lines

Diff 296524

llvm/include/llvm/ObjCopy/Buffer.h

This file was moved from llvm/tools/llvm-objcopy/Buffer.h.

//===- Buffer.h -------------------------------------------------- C++ --===//		//===- Buffer.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_BUFFER_H		#ifndef LLVM_OBJCOPY_BUFFER_H
#define LLVM_TOOLS_OBJCOPY_BUFFER_H		#define LLVM_OBJCOPY_BUFFER_H

#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/Support/FileOutputBuffer.h"		#include "llvm/Support/FileOutputBuffer.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include <memory>		#include <memory>

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {

// The class Buffer abstracts out the common interface of FileOutputBuffer and		// The class Buffer abstracts out the common interface of FileOutputBuffer and
// WritableMemoryBuffer so that the hierarchy of Writers depends on this		// WritableMemoryBuffer so that the hierarchy of Writers depends on this
// abstract interface and doesn't depend on a particular implementation.		// abstract interface and doesn't depend on a particular implementation.
// TODO: refactor the buffer classes in LLVM to enable us to use them here		// TODO: refactor the buffer classes in LLVM to enable us to use them here
// directly.		// directly.
		jhendersonUnsubmitted Not Done Reply Inline Actions Maybe as part of a separate patch, it would be worth taking a look at this TODO. It would be great if the Buffer could be removed from the library API and generic LLVM buffers used instead (for example an in-memory buffer or a file buffer, depending on what people want to do). jhenderson: Maybe as part of a separate patch, it would be worth taking a look at this TODO. It would be…
		avlAuthorUnsubmitted Done Reply Inline Actions agreed, but I think it is better to do in separate patch. avl: agreed, but I think it is better to do in separate patch.
		avlAuthorUnsubmitted Done Reply Inline Actions @alexshap @jhenderson Speaking of class Buffer refactoring. I do not think that this comment is fully valid: // The class Buffer abstracts out the common interface of FileOutputBuffer and // WritableMemoryBuffer so that the hierarchy of Writers depends on this // abstract interface and doesn't depend on a particular implementation. // TODO: refactor the buffer classes in LLVM to enable us to use them here // directly. It suggests to create some common interface for FileOutputBuffer and WritableMemoryBuffer. Which is assumed to look similar to this: class Buffer { StringRef Name; public: virtual ~Buffer(); virtual Error allocate(size_t Size) = 0; virtual uint8_t getBufferStart() = 0; virtual Error commit() = 0; explicit Buffer(StringRef Name) : Name(Name) {} StringRef getName() const { return Name; } }; There exists a problem with methods commit() and allocate(). commit() is a redundant for WritableMemoryBuffer. adding it to WritableMemoryBuffer would require to patch all current usages of WritableMemoryBuffer. So it looks incorrectly to use it for common parent interface of FileOutputBuffer and WritableMemoryBuffer. allocate() suggests another way of buffer creation. Currently, buffers are created by static creation methods: static std::unique_ptr<WritableMemoryBuffer> getNewMemBuffer(size_t Size, const Twine &BufferName = ""); static Expected<std::unique_ptr<FileOutputBuffer>> create(StringRef FilePath, size_t Size, unsigned Flags = 0); Adding "virtual Error allocate(size_t Size)" would lead to creation empty buffer by static creation method and then call to allocate(). This does not seem a good addition to the already existed FileOutputBuffer and WritableMemoryBuffer. FileOutputBuffer and WritableMemoryBuffer assume another use cases than Buffer. Actually, what is neccessary by objcopy is method: uint8_t createBuffer ( size_t Size ); All other functionality is redundant and could be removed from objcopy. what do you think about following design? LazyBuffer { StringRef Name; virtual StringRef getName() const { return Name; } virtual uint8_t createBuffer ( size_t Size ) = 0; }; MemoryLazyBuffer : public LazyBuffer { virtual uint8_t createBuffer ( size_t Size ) { Buffer = WritableMemoryBuffer::getNewMemBuffer(Size, Name); return Buffer->getBufferStart(); } std::unique_ptr<WritableMemoryBuffer> Buffer; }; FileLazyBuffer : public LazyBuffer { virtual uint8_t createBuffer ( size_t Size ) { Buffer = FileOutputBuffer::create(Name, Size); return Buffer->getBufferStart(); } std::unique_ptr<FileOutputBuffer> Buffer; }; Usage: static Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, LazyBuffer &Out); MemoryLazyBuffer MB("name"); executeObjcopyOnBinary(Config, Input, MB); FileLazyBuffer MB("name"); executeObjcopyOnBinary(Config, Input, MB); if (MB.Buffer) MB.Buffer->commit(); Another alternative is that library always writes to general MemoryBuffer : static Expected<std::unique_ptr<MemoryBuffer>> executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In); and later this MemoryBuffer would be written into the file by llvm-objcopy.cpp. What do you think? avl:* @alexshap @jhenderson Speaking of class Buffer refactoring. I do not think that this comment…
		jhendersonUnsubmitted Not Done Reply Inline Actions I'm not sure I've looked at the intricacies of the different buffer classes to know what the right approach is. However, if modifying the existing buffers seems like it won't work/will be too invasive, this seems like a fair approach (it's the "Adapter" design pattern in action, I believe). I'm not sure I'd call it `LazyBuffer`, although I don't have a specific better name (maybe just ObjcopyBuffer, depending on how generic we want it to be). Also, unless the objcopy code actually needs the name for anything (aside from error messages, I'm not sure what that would be), I'd not include that in the interface. jhenderson: I'm not sure I've looked at the intricacies of the different buffer classes to know what the…
class Buffer {		class Buffer {
StringRef Name;		StringRef Name;

public:		public:
virtual ~Buffer();		virtual ~Buffer();
virtual Error allocate(size_t Size) = 0;		virtual Error allocate(size_t Size) = 0;
virtual uint8_t *getBufferStart() = 0;		virtual uint8_t *getBufferStart() = 0;
virtual Error commit() = 0;		virtual Error commit() = 0;
Show All 27 Lines	public:
explicit MemBuffer(StringRef Name) : Buffer(Name) {}		explicit MemBuffer(StringRef Name) : Buffer(Name) {}

std::unique_ptr<WritableMemoryBuffer> releaseMemoryBuffer();		std::unique_ptr<WritableMemoryBuffer> releaseMemoryBuffer();
};		};

} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_BUFFER_H		#endif // LLVM_OBJCOPY_BUFFER_H

llvm/include/llvm/ObjCopy/COFF/COFFObjcopy.h

This file was moved from llvm/tools/llvm-objcopy/COFF/COFFObjcopy.h.

	//===- COFFObjcopy.h --------------------------------------------- C++ --===//			//===- COFFObjcopy.h --------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_OBJCOPY_COFFOBJCOPY_H			#ifndef LLVM_OBJCOPY_COFF_COFFOBJCOPY_H
	#define LLVM_TOOLS_OBJCOPY_COFFOBJCOPY_H			#define LLVM_OBJCOPY_COFF_COFFOBJCOPY_H

	namespace llvm {			namespace llvm {
	class Error;			class Error;

	namespace object {			namespace object {
	class COFFObjectFile;			class COFFObjectFile;
	} // end namespace object			} // end namespace object

	namespace objcopy {			namespace objcopy {
	struct CopyConfig;			struct CopyConfig;
	class Buffer;			class Buffer;

	namespace coff {			namespace coff {
	Error executeObjcopyOnBinary(const CopyConfig &Config,			Error executeObjcopyOnBinary(const CopyConfig &Config,
	object::COFFObjectFile &In, Buffer &Out);			object::COFFObjectFile &In, Buffer &Out);

	} // end namespace coff			} // end namespace coff
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_OBJCOPY_COFFOBJCOPY_H			#endif // LLVM_OBJCOPY_COFF_COFFOBJCOPY_H

llvm/include/llvm/ObjCopy/CopyConfig.h

This file was moved from llvm/tools/llvm-objcopy/CopyConfig.h.

//===- CopyConfig.h -------------------------------------------------------===//		//===- CopyConfig.h -------------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_LLVM_OBJCOPY_COPY_CONFIG_H		#ifndef LLVM_OBJCOPY_COPYCONFIG_H
#define LLVM_TOOLS_LLVM_OBJCOPY_COPY_CONFIG_H		#define LLVM_OBJCOPY_COPYCONFIG_H

#include "ELF/ELFConfig.h"		#include "ELF/ELFConfig.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/BitmaskEnum.h"		#include "llvm/ADT/BitmaskEnum.h"
#include "llvm/ADT/DenseSet.h"		#include "llvm/ADT/DenseSet.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	public:

bool isPositiveMatch() const { return IsPositiveMatch; }		bool isPositiveMatch() const { return IsPositiveMatch; }
bool operator==(StringRef S) const {		bool operator==(StringRef S) const {
return R ? R->match(S) : G ? G->match(S) : Name == S;		return R ? R->match(S) : G ? G->match(S) : Name == S;
}		}
bool operator!=(StringRef S) const { return !operator==(S); }		bool operator!=(StringRef S) const { return !operator==(S); }
};		};

// Matcher that checks symbol or section names against the command line flags		// Matcher that checks symbol or section names against the command line flags
// provided for that option.		// provided for that option.
		jhendersonUnsubmitted Not Done Reply Inline Actions This comment probably needs updating to better match the new usage - but see out-of-line comment. jhenderson: This comment probably needs updating to better match the new usage - but see out-of-line…
		avlAuthorUnsubmitted Done Reply Inline Actions ok. avl: ok.
class NameMatcher {		class NameMatcher {
std::vector<NameOrPattern> PosMatchers;		std::vector<NameOrPattern> PosMatchers;
std::vector<NameOrPattern> NegMatchers;		std::vector<NameOrPattern> NegMatchers;

public:		public:
Error addMatcher(Expected<NameOrPattern> Matcher) {		Error addMatcher(Expected<NameOrPattern> Matcher) {
if (!Matcher)		if (!Matcher)
return Matcher.takeError();		return Matcher.takeError();
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	if (!ELF) {
if (!ELFConfig)		if (!ELFConfig)
return ELFConfig.takeError();		return ELFConfig.takeError();
ELF = *ELFConfig;		ELF = *ELFConfig;
}		}
return Error::success();		return Error::success();
}		}
};		};

// Configuration for the overall invocation of this tool. When invoked as
// objcopy, will always contain exactly one CopyConfig. When invoked as strip,
// will contain one or more CopyConfigs.
struct DriverConfig {
SmallVector<CopyConfig, 1> CopyConfigs;
BumpPtrAllocator Alloc;
};

// ParseObjcopyOptions returns the config and sets the input arguments. If a
// help flag is set then ParseObjcopyOptions will print the help messege and
// exit. ErrorCallback is used to handle recoverable errors. An Error returned
// by the callback aborts the parsing and is then returned by this function.
Expected<DriverConfig>
parseObjcopyOptions(ArrayRef<const char *> ArgsArr,
llvm::function_ref<Error(Error)> ErrorCallback);

// ParseInstallNameToolOptions returns the config and sets the input arguments.
// If a help flag is set then ParseInstallNameToolOptions will print the help
// messege and exit.
Expected<DriverConfig>
parseInstallNameToolOptions(ArrayRef<const char *> ArgsArr);

// ParseBitcodeStripOptions returns the config and sets the input arguments.
// If a help flag is set then ParseBitcodeStripOptions will print the help
// messege and exit.
Expected<DriverConfig> parseBitcodeStripOptions(ArrayRef<const char *> ArgsArr);

// ParseStripOptions returns the config and sets the input arguments. If a
// help flag is set then ParseStripOptions will print the help messege and
// exit. ErrorCallback is used to handle recoverable errors. An Error returned
// by the callback aborts the parsing and is then returned by this function.
Expected<DriverConfig>
parseStripOptions(ArrayRef<const char *> ArgsArr,
llvm::function_ref<Error(Error)> ErrorCallback);
} // namespace objcopy		} // namespace objcopy
} // namespace llvm		} // namespace llvm

#endif		#endif // LLVM_OBJCOPY_COPYCONFIG_H

llvm/include/llvm/ObjCopy/ELF/ELFConfig.h

This file was moved from llvm/tools/llvm-objcopy/ELF/ELFConfig.h.

	//===- ELFConfig.h ----------------------------------------------- C++ --===//			//===- ELFConfig.h ----------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_OBJCOPY_ELFCONFIG_H			#ifndef LLVM_OBJCOPY_ELF_ELFCONFIG_H
	#define LLVM_TOOLS_OBJCOPY_ELFCONFIG_H			#define LLVM_OBJCOPY_ELF_ELFCONFIG_H

	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/Object/ELFTypes.h"			#include "llvm/Object/ELFTypes.h"
	#include "llvm/Support/Error.h"			#include "llvm/Support/Error.h"
	#include <vector>			#include <vector>

	namespace llvm {			namespace llvm {
	Show All 17 Lines
	};			};

	Expected<ELFCopyConfig> parseConfig(const CopyConfig &Config);			Expected<ELFCopyConfig> parseConfig(const CopyConfig &Config);

	} // namespace elf			} // namespace elf
	} // namespace objcopy			} // namespace objcopy
	} // namespace llvm			} // namespace llvm

	#endif			#endif // LLVM_OBJCOPY_ELF_ELFCONFIG_H

llvm/include/llvm/ObjCopy/ELF/ELFObjcopy.h

This file was moved from llvm/tools/llvm-objcopy/ELF/ELFObjcopy.h.

//===- ELFObjcopy.h ---------------------------------------------- C++ --===//		//===- ELFObjcopy.h ---------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_ELFOBJCOPY_H		#ifndef LLVM_OBJCOPY_ELF_ELFOBJCOPY_H
#define LLVM_TOOLS_OBJCOPY_ELFOBJCOPY_H		#define LLVM_OBJCOPY_ELF_ELFOBJCOPY_H

namespace llvm {		namespace llvm {
class Error;		class Error;
class MemoryBuffer;		class MemoryBuffer;

namespace object {		namespace object {
class ELFObjectFileBase;		class ELFObjectFileBase;
} // end namespace object		} // end namespace object
Show All 9 Lines	Error executeObjcopyOnRawBinary(const CopyConfig &Config, MemoryBuffer &In,
Buffer &Out);		Buffer &Out);
Error executeObjcopyOnBinary(const CopyConfig &Config,		Error executeObjcopyOnBinary(const CopyConfig &Config,
object::ELFObjectFileBase &In, Buffer &Out);		object::ELFObjectFileBase &In, Buffer &Out);

} // end namespace elf		} // end namespace elf
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_ELFOBJCOPY_H		#endif // LLVM_OBJCOPY_ELF_ELFOBJCOPY_H

llvm/include/llvm/ObjCopy/MachO/MachOObjcopy.h

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOObjcopy.h.

	//===- MachOObjcopy.h -------------------------------------------- C++ --===//			//===- MachOObjcopy.h -------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_OBJCOPY_MACHOOBJCOPY_H			#ifndef LLVM_OBJCOPY_MACHO_MACHOOBJCOPY_H
	#define LLVM_TOOLS_OBJCOPY_MACHOOBJCOPY_H			#define LLVM_OBJCOPY_MACHO_MACHOOBJCOPY_H

	namespace llvm {			namespace llvm {
	class Error;			class Error;

	namespace object {			namespace object {
	class MachOObjectFile;			class MachOObjectFile;
	class MachOUniversalBinary;			class MachOUniversalBinary;
	} // end namespace object			} // end namespace object

	namespace objcopy {			namespace objcopy {
	struct CopyConfig;			struct CopyConfig;
	class Buffer;			class Buffer;

	namespace macho {			namespace macho {
	Error executeObjcopyOnBinary(const CopyConfig &Config,			Error executeObjcopyOnBinary(const CopyConfig &Config,
	object::MachOObjectFile &In, Buffer &Out);			object::MachOObjectFile &In, Buffer &Out);

	Error executeObjcopyOnMachOUniversalBinary(			Error executeObjcopyOnMachOUniversalBinary(
	CopyConfig &Config, const object::MachOUniversalBinary &In, Buffer &Out);			CopyConfig &Config, const object::MachOUniversalBinary &In, Buffer &Out);

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_OBJCOPY_MACHOOBJCOPY_H			#endif // LLVM_OBJCOPY_MACHO_MACHOOBJCOPY_H

llvm/include/llvm/ObjCopy/ObjCopy.h

This file was added.

//===- ObjCopy.h ------------------------------------------------*- C++ -*-===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#ifndef LLVM_OBJCOPY_OBJCOPY_H

#define LLVM_OBJCOPY_OBJCOPY_H

#include "Buffer.h"

#include "CopyConfig.h"

#include "llvm/Object/Archive.h"

namespace llvm {

namespace objcopy {

/// The function applies transformation described by Config to

/// the specified binary and writes result into the Out. It does

/// the dispatch based on the format of the input binary

/// (ELF, MachO or COFF).

jhendersonUnsubmitted

Not Done

namespace objcopy {

- /// The function applies transformation described by Config to

- /// the specified binary and writes result into the Out. It does

- /// the dispatch based on the format of the input binary

- /// (ELF, MachO or COFF).

+ /// Applies the transformations described by \p Config to

+ /// \p In and writes the result into \p Out.

+ /// \returns any Error encountered whilst performing the operation.

Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In,

I think details of the output format shouldn't be described in the comment - theoretically objcopy could even mutate from one object format to another (see e.g. the IHEX stuff).

jhenderson: I think details of the output format shouldn't be described in the comment - theoretically…

avlAuthorUnsubmitted

Done

ok.

avl: ok.

Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In,

Buffer &Out);

/// The function applies transformation described by Config to

/// the specified archive. It does the dispatch based on the format

/// of the input binary (ELF, MachO or COFF).

jhendersonUnsubmitted

Not Done

Buffer &Out);

- /// The function applies transformation described by Config to

- /// the specified archive. It does the dispatch based on the format

- /// of the input binary (ELF, MachO or COFF).

+ /// Applies the transformations described by \p Config to each member in archive \p Ar.

+ /// \returns any Error encountered whilst performing the operation.

Error executeObjcopyOnArchive(CopyConfig &Config, const object::Archive &Ar);

jhenderson:

jhendersonUnsubmitted

Not Done

This doesn't need to be in the header, right? I'd get rid of it, as it's not really part fo the public interface (executeObjcopyOnArchive would be the right thing to use, I believe(?)).

jhenderson: This doesn't need to be in the header, right? I'd get rid of it, as it's not really part fo the…

avlAuthorUnsubmitted

Done

This also is used in lib/ObjCopy/MachO/MachOObjcopy.cpp. But, indeed, it might be removed from the public interface - include/llvm/ObjCopy/ObjCopy.h. Would it be OK if declaration of createNewArchiveMembers would be put into the new local header llvm/lib/ObjCopy/Archive.h ?

avl: This also is used in lib/ObjCopy/MachO/MachOObjcopy.cpp. But, indeed, it might be removed from…

jhendersonUnsubmitted

Not Done

Yes, I think that should be okay.

jhenderson: Yes, I think that should be okay.

Error executeObjcopyOnArchive(CopyConfig &Config, const object::Archive &Ar);

} // end namespace objcopy

} // end namespace llvm

#endif // LLVM_OBJCOPY_OBJCOPY_H

llvm/include/llvm/ObjCopy/wasm/WasmObjcopy.h

This file was moved from llvm/tools/llvm-objcopy/wasm/WasmObjcopy.h.

	//===- WasmObjcopy.h -------------------------------------------- C++ --===//			//===- WasmObjcopy.h -------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_LLVM_OBJCOPY_WASM_WASMOBJCOPY_H			#ifndef LLVM_OBJCOPY_WASM_WASMOBJCOPY_H
	#define LLVM_TOOLS_LLVM_OBJCOPY_WASM_WASMOBJCOPY_H			#define LLVM_OBJCOPY_WASM_WASMOBJCOPY_H

	namespace llvm {			namespace llvm {
	class Error;			class Error;

	namespace object {			namespace object {
	class WasmObjectFile;			class WasmObjectFile;
	} // end namespace object			} // end namespace object

	namespace objcopy {			namespace objcopy {
	struct CopyConfig;			struct CopyConfig;
	class Buffer;			class Buffer;

	namespace wasm {			namespace wasm {
	Error executeObjcopyOnBinary(const CopyConfig &Config,			Error executeObjcopyOnBinary(const CopyConfig &Config,
	object::WasmObjectFile &In, Buffer &Out);			object::WasmObjectFile &In, Buffer &Out);

	} // end namespace wasm			} // end namespace wasm
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_LLVM_OBJCOPY_WASM_WASMOBJCOPY_H			#endif // LLVM_OBJCOPY_WASM_WASMOBJCOPY_H

llvm/lib/CMakeLists.txt

	Show All 13 Lines
	add_subdirectory(Extensions)			add_subdirectory(Extensions)
	add_subdirectory(Frontend)			add_subdirectory(Frontend)
	add_subdirectory(Transforms)			add_subdirectory(Transforms)
	add_subdirectory(Linker)			add_subdirectory(Linker)
	add_subdirectory(Analysis)			add_subdirectory(Analysis)
	add_subdirectory(LTO)			add_subdirectory(LTO)
	add_subdirectory(MC)			add_subdirectory(MC)
	add_subdirectory(MCA)			add_subdirectory(MCA)
	add_subdirectory(Object)			add_subdirectory(Object)
				thakisUnsubmitted Not Done Reply Inline Actions Can we be consistent about capitalization of "ObjCopy" or "Objcopy"? The new lib capitalizes the C, but e.g. `ObjcopyOptions.cpp` doesn't. (Don't care which way, just be consistent). % git grep Objcopy \| wc -l 150 % git grep ObjCopy \| wc -l 77 thakis: Can we be consistent about capitalization of "ObjCopy" or "Objcopy"? The new lib capitalizes…
				add_subdirectory(ObjCopy)
	add_subdirectory(ObjectYAML)			add_subdirectory(ObjectYAML)
	add_subdirectory(Option)			add_subdirectory(Option)
	add_subdirectory(Remarks)			add_subdirectory(Remarks)
	add_subdirectory(DebugInfo)			add_subdirectory(DebugInfo)
	add_subdirectory(ExecutionEngine)			add_subdirectory(ExecutionEngine)
	add_subdirectory(Target)			add_subdirectory(Target)
	add_subdirectory(AsmParser)			add_subdirectory(AsmParser)
	add_subdirectory(LineEditor)			add_subdirectory(LineEditor)
	Show All 9 Lines

llvm/lib/LLVMBuild.txt

This file was copied to llvm/lib/ObjCopy/LLVMBuild.txt.

	Show All 31 Lines
	Linker			Linker
	InterfaceStub			InterfaceStub
	IR			IR
	IRReader			IRReader
	LTO			LTO
	MC			MC
	MCA			MCA
	Object			Object
				ObjCopy
	BinaryFormat			BinaryFormat
	ObjectYAML			ObjectYAML
	Option			Option
	Remarks			Remarks
	Passes			Passes
	ProfileData			ProfileData
	Support			Support
	TableGen			TableGen
	Show All 12 Lines

llvm/lib/ObjCopy/Buffer.cpp

This file was moved from llvm/tools/llvm-objcopy/Buffer.cpp.

	//===- Buffer.cpp ---------------------------------------------------------===//			//===- Buffer.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Buffer.h"			#include "llvm/ObjCopy/Buffer.h"
	#include "llvm/Support/FileOutputBuffer.h"			#include "llvm/Support/FileOutputBuffer.h"
	#include "llvm/Support/FileSystem.h"			#include "llvm/Support/FileSystem.h"
	#include "llvm/Support/MemoryBuffer.h"			#include "llvm/Support/MemoryBuffer.h"
	#include "llvm/Support/Process.h"			#include "llvm/Support/Process.h"
	#include <memory>			#include <memory>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/CMakeLists.txt

This file was added.

				add_llvm_component_library(LLVMObjCopy
				Buffer.cpp
				CopyConfig.cpp
				ObjCopy.cpp
				COFF/COFFObjcopy.cpp
				COFF/Object.cpp
				COFF/Reader.cpp
				COFF/Writer.cpp
				ELF/ELFConfig.cpp
				ELF/ELFObjcopy.cpp
				ELF/Object.cpp
				MachO/MachOObjcopy.cpp
				MachO/MachOReader.cpp
				MachO/MachOWriter.cpp
				MachO/MachOLayoutBuilder.cpp
				MachO/Object.cpp
				wasm/Object.cpp
				wasm/Reader.cpp
				wasm/Writer.cpp
				wasm/WasmObjcopy.cpp

				ADDITIONAL_HEADER_DIRS
				${LLVM_MAIN_INCLUDE_DIR}/llvm/ObjCopy

				DEPENDS
				intrinsics_gen
				)

llvm/lib/ObjCopy/COFF/COFFObjcopy.cpp

This file was moved from llvm/tools/llvm-objcopy/COFF/COFFObjcopy.cpp.

	//===- COFFObjcopy.cpp ----------------------------------------------------===//			//===- COFFObjcopy.cpp ----------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "COFFObjcopy.h"			#include "llvm/ObjCopy/COFF/COFFObjcopy.h"
	#include "Buffer.h"
	#include "CopyConfig.h"
	#include "Object.h"			#include "Object.h"
	#include "Reader.h"			#include "Reader.h"
	#include "Writer.h"			#include "Writer.h"
				#include "llvm/ObjCopy/Buffer.h"
				#include "llvm/ObjCopy/CopyConfig.h"

	#include "llvm/Object/Binary.h"			#include "llvm/Object/Binary.h"
	#include "llvm/Object/COFF.h"			#include "llvm/Object/COFF.h"
	#include "llvm/Support/CRC.h"			#include "llvm/Support/CRC.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	#include "llvm/Support/Path.h"			#include "llvm/Support/Path.h"
	#include <cassert>			#include <cassert>

	▲ Show 20 Lines • Show All 270 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/COFF/Object.h

This file was moved from llvm/tools/llvm-objcopy/COFF/Object.h.

//===- Object.h -------------------------------------------------- C++ --===//		//===- Object.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_COFF_OBJECT_H		#ifndef LLVM_LIB_OBJCOPY_COFF_OBJECT_H
#define LLVM_TOOLS_OBJCOPY_COFF_OBJECT_H		#define LLVM_LIB_OBJCOPY_COFF_OBJECT_H

#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"
#include "llvm/BinaryFormat/COFF.h"		#include "llvm/BinaryFormat/COFF.h"
#include "llvm/Object/COFF.h"		#include "llvm/Object/COFF.h"
▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines	void copyPeHeader(PeHeader1Ty &Dest, const PeHeader2Ty &Src) {
Dest.LoaderFlags = Src.LoaderFlags;		Dest.LoaderFlags = Src.LoaderFlags;
Dest.NumberOfRvaAndSize = Src.NumberOfRvaAndSize;		Dest.NumberOfRvaAndSize = Src.NumberOfRvaAndSize;
}		}

} // end namespace coff		} // end namespace coff
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_COFF_OBJECT_H		#endif // LLVM_LIB_OBJCOPY_COFF_OBJECT_H

llvm/lib/ObjCopy/COFF/Object.cpp

This file was moved from llvm/tools/llvm-objcopy/COFF/Object.cpp.

	//===- Object.cpp ---------------------------------------------------------===//			//===- Object.cpp ---------------------------------------------------------===//
				jhendersonUnsubmitted Not Done Reply Inline Actions Whilst you're moving this and the equivalent files for other formats around, could you please rename them to be obvious from the filename which format they are for (same goes for their headers), please? For example, COFF/Object.cpp -> COFF/COFFObject.cpp. The reason for this is that when using the Visual Studio IDE, all the "Object.cpp" files end up listed next to each other in the file browser, and the only way of figuring out which is which is by opening them and seeing. jhenderson: Whilst you're moving this and the equivalent files for other formats around, could you please…
				avlAuthorUnsubmitted Done Reply Inline Actions ok. avl: ok.
				alexander-shaposhnikovUnsubmitted Not Done Reply Inline Actions @jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good idea and the provided justification doesn't appear to be sufficient. Since this file contains the implementation of what's declared in Object.h I would strongly prefer to have it named Object.cpp given it is already located in the corresponding folder. Visual Studio IDE might have some peculiarities but having consistent naming is important, adding such prefixes doesn't seem to be a good approach. alexander-shaposhnikov: @jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good idea…
				avlAuthorUnsubmitted Done Reply Inline Actions Would it be OK, If both of the files would be renamed Object.h -> COFFObject.h and Object.cpp->COFFObject.cpp ? avl: Would it be OK, If both of the files would be renamed Object.h -> COFFObject.h and Object.cpp…
				avlAuthorUnsubmitted Done Reply Inline Actions @jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good idea and the provided justification doesn't appear to be sufficient. Since this file contains the implementation of what's declared in Object.h I would strongly prefer to have it named Object.cpp given it is already located in the corresponding folder. Visual Studio IDE might have some peculiarities but having consistent naming is important, adding such prefixes doesn't seem to be a good approach. @alexshap Could you explain this renaming thing, please? i.e. if both header file COFF/Object.h and src file COFF/Object.cpp would be renamed(COFF/COFFObject.h, COFF/COFFObject.cpp), would it be OK? avl: > @jhenderson, I'm sorry to disagree, but renaming files this way doesn't seem to be a good…
				alexander-shaposhnikovUnsubmitted Not Done Reply Inline Actions I'm very sorry, but i still think that the old names were good, adding these prefixes is unnecessary and makes things less intuitive (e.g. class Object is described in Object.h). alexander-shaposhnikov: I'm very sorry, but i still think that the old names were good, adding these prefixes is…
				jhendersonUnsubmitted Not Done Reply Inline Actions Actually, I think the way to solve this is to use some CMake functionality to stick specific files in IDE folders, so that they don't end up clashing: in the Visual Studio IDE, for example, I see 4 Object.h files next to each other in the Header Files group, but there's no indication which is which. Instead, we could create COFF, ELF etc sub-groups for the headers and source files. I believe https://cmake.org/cmake/help/latest/command/source_group.html is the relevant piece of CMake. jhenderson: Actually, I think the way to solve this is to use some CMake functionality to stick specific…
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Object.h"			#include "Object.h"
	▲ Show 20 Lines • Show All 139 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/COFF/Reader.h

This file was moved from llvm/tools/llvm-objcopy/COFF/Reader.h.

//===- Reader.h -------------------------------------------------- C++ --===//		//===- Reader.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_COFF_READER_H		#ifndef LLVM_LIB_OBJCOPY_COFF_READER_H
#define LLVM_TOOLS_OBJCOPY_COFF_READER_H		#define LLVM_LIB_OBJCOPY_COFF_READER_H

#include "Buffer.h"
#include "llvm/BinaryFormat/COFF.h"		#include "llvm/BinaryFormat/COFF.h"
		#include "llvm/ObjCopy/Buffer.h"
#include "llvm/Object/COFF.h"		#include "llvm/Object/COFF.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {
namespace coff {		namespace coff {

struct Object;		struct Object;
Show All 12 Lines	public:
explicit COFFReader(const COFFObjectFile &O) : COFFObj(O) {}		explicit COFFReader(const COFFObjectFile &O) : COFFObj(O) {}
Expected<std::unique_ptr<Object>> create() const;		Expected<std::unique_ptr<Object>> create() const;
};		};

} // end namespace coff		} // end namespace coff
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_COFF_READER_H		#endif // LLVM_LIB_OBJCOPY_COFF_READER_H

llvm/lib/ObjCopy/COFF/Reader.cpp

This file was moved from llvm/tools/llvm-objcopy/COFF/Reader.cpp.

	//===- Reader.cpp ---------------------------------------------------------===//			//===- Reader.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Reader.h"			#include "Reader.h"
	#include "Object.h"			#include "Object.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/BinaryFormat/COFF.h"			#include "llvm/BinaryFormat/COFF.h"
	#include "llvm/Object/COFF.h"			#include "llvm/Object/COFF.h"
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"
	#include <cstddef>			#include <cstddef>
	#include <cstdint>			#include <cstdint>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace coff {			namespace coff {

	using namespace object;			using namespace object;
	using namespace COFF;			using namespace COFF;

	Error COFFReader::readExecutableHeaders(Object &Obj) const {			Error COFFReader::readExecutableHeaders(Object &Obj) const {
	const dos_header *DH = COFFObj.getDOSHeader();			const dos_header *DH = COFFObj.getDOSHeader();
	Obj.Is64 = COFFObj.is64();			Obj.Is64 = COFFObj.is64();
	if (!DH)			if (!DH)
	return Error::success();			return Error::success();

	Obj.IsPE = true;			Obj.IsPE = true;
	Obj.DosHeader = *DH;			Obj.DosHeader = *DH;
	if (DH->AddressOfNewExeHeader > sizeof(*DH))			if (DH->AddressOfNewExeHeader > sizeof(*DH))
	Obj.DosStub = ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(&DH[1]),			Obj.DosStub = ArrayRef<uint8_t>(reinterpret_cast<const uint8_t *>(&DH[1]),
	DH->AddressOfNewExeHeader - sizeof(*DH));			DH->AddressOfNewExeHeader - sizeof(*DH));

	if (COFFObj.is64()) {			if (COFFObj.is64()) {
	Obj.PeHeader = *COFFObj.getPE32PlusHeader();			Obj.PeHeader = *COFFObj.getPE32PlusHeader();
	} else {			} else {
	const pe32_header *PE32 = COFFObj.getPE32Header();			const pe32_header *PE32 = COFFObj.getPE32Header();
	copyPeHeader(Obj.PeHeader, *PE32);			copyPeHeader(Obj.PeHeader, *PE32);
	// The pe32plus_header (stored in Object) lacks the BaseOfData field.			// The pe32plus_header (stored in Object) lacks the BaseOfData field.
	Obj.BaseOfData = PE32->BaseOfData;			Obj.BaseOfData = PE32->BaseOfData;
	}			}

	for (size_t I = 0; I < Obj.PeHeader.NumberOfRvaAndSize; I++) {			for (size_t I = 0; I < Obj.PeHeader.NumberOfRvaAndSize; I++) {
	const data_directory *Dir = COFFObj.getDataDirectory(I);			const data_directory *Dir = COFFObj.getDataDirectory(I);
	if (!Dir)			if (!Dir)
	return errorCodeToError(object_error::parse_failed);			return errorCodeToError(object_error::parse_failed);
	Obj.DataDirectories.emplace_back(*Dir);			Obj.DataDirectories.emplace_back(*Dir);
	}			}
	return Error::success();			return Error::success();
	}			}

	Error COFFReader::readSections(Object &Obj) const {			Error COFFReader::readSections(Object &Obj) const {
	std::vector<Section> Sections;			std::vector<Section> Sections;
	// Section indexing starts from 1.			// Section indexing starts from 1.
	for (size_t I = 1, E = COFFObj.getNumberOfSections(); I <= E; I++) {			for (size_t I = 1, E = COFFObj.getNumberOfSections(); I <= E; I++) {
	Expected<const coff_section *> SecOrErr = COFFObj.getSection(I);			Expected<const coff_section *> SecOrErr = COFFObj.getSection(I);
	if (!SecOrErr)			if (!SecOrErr)
	return SecOrErr.takeError();			return SecOrErr.takeError();
	const coff_section Sec = SecOrErr;			const coff_section Sec = SecOrErr;
	Sections.push_back(Section());			Sections.push_back(Section());
	Section &S = Sections.back();			Section &S = Sections.back();
	S.Header = *Sec;			S.Header = *Sec;
	S.Header.Characteristics &= ~COFF::IMAGE_SCN_LNK_NRELOC_OVFL;			S.Header.Characteristics &= ~COFF::IMAGE_SCN_LNK_NRELOC_OVFL;
	ArrayRef<uint8_t> Contents;			ArrayRef<uint8_t> Contents;
	if (Error E = COFFObj.getSectionContents(Sec, Contents))			if (Error E = COFFObj.getSectionContents(Sec, Contents))
	return E;			return E;
	S.setContentsRef(Contents);			S.setContentsRef(Contents);
	ArrayRef<coff_relocation> Relocs = COFFObj.getRelocations(Sec);			ArrayRef<coff_relocation> Relocs = COFFObj.getRelocations(Sec);
	for (const coff_relocation &R : Relocs)			for (const coff_relocation &R : Relocs)
	S.Relocs.push_back(R);			S.Relocs.push_back(R);
	if (Expected<StringRef> NameOrErr = COFFObj.getSectionName(Sec))			if (Expected<StringRef> NameOrErr = COFFObj.getSectionName(Sec))
	S.Name = *NameOrErr;			S.Name = *NameOrErr;
	else			else
	return NameOrErr.takeError();			return NameOrErr.takeError();
	}			}
	Obj.addSections(Sections);			Obj.addSections(Sections);
	return Error::success();			return Error::success();
	}			}

	Error COFFReader::readSymbols(Object &Obj, bool IsBigObj) const {			Error COFFReader::readSymbols(Object &Obj, bool IsBigObj) const {
	std::vector<Symbol> Symbols;			std::vector<Symbol> Symbols;
	Symbols.reserve(COFFObj.getRawNumberOfSymbols());			Symbols.reserve(COFFObj.getRawNumberOfSymbols());
	ArrayRef<Section> Sections = Obj.getSections();			ArrayRef<Section> Sections = Obj.getSections();
	for (uint32_t I = 0, E = COFFObj.getRawNumberOfSymbols(); I < E;) {			for (uint32_t I = 0, E = COFFObj.getRawNumberOfSymbols(); I < E;) {
	Expected<COFFSymbolRef> SymOrErr = COFFObj.getSymbol(I);			Expected<COFFSymbolRef> SymOrErr = COFFObj.getSymbol(I);
	if (!SymOrErr)			if (!SymOrErr)
	return SymOrErr.takeError();			return SymOrErr.takeError();
	COFFSymbolRef SymRef = *SymOrErr;			COFFSymbolRef SymRef = *SymOrErr;

	Symbols.push_back(Symbol());			Symbols.push_back(Symbol());
	Symbol &Sym = Symbols.back();			Symbol &Sym = Symbols.back();
	// Copy symbols from the original form into an intermediate coff_symbol32.			// Copy symbols from the original form into an intermediate coff_symbol32.
	if (IsBigObj)			if (IsBigObj)
	copySymbol(Sym.Sym,			copySymbol(Sym.Sym,
	reinterpret_cast<const coff_symbol32 >(SymRef.getRawPtr()));			reinterpret_cast<const coff_symbol32 >(SymRef.getRawPtr()));
	else			else
	copySymbol(Sym.Sym,			copySymbol(Sym.Sym,
	reinterpret_cast<const coff_symbol16 >(SymRef.getRawPtr()));			reinterpret_cast<const coff_symbol16 >(SymRef.getRawPtr()));
	auto NameOrErr = COFFObj.getSymbolName(SymRef);			auto NameOrErr = COFFObj.getSymbolName(SymRef);
	if (!NameOrErr)			if (!NameOrErr)
	return NameOrErr.takeError();			return NameOrErr.takeError();
	Sym.Name = *NameOrErr;			Sym.Name = *NameOrErr;

	ArrayRef<uint8_t> AuxData = COFFObj.getSymbolAuxData(SymRef);			ArrayRef<uint8_t> AuxData = COFFObj.getSymbolAuxData(SymRef);
	size_t SymSize = IsBigObj ? sizeof(coff_symbol32) : sizeof(coff_symbol16);			size_t SymSize = IsBigObj ? sizeof(coff_symbol32) : sizeof(coff_symbol16);
	assert(AuxData.size() == SymSize * SymRef.getNumberOfAuxSymbols());			assert(AuxData.size() == SymSize * SymRef.getNumberOfAuxSymbols());
	// The auxillary symbols are structs of sizeof(coff_symbol16) each.			// The auxillary symbols are structs of sizeof(coff_symbol16) each.
	// In the big object format (where symbols are coff_symbol32), each			// In the big object format (where symbols are coff_symbol32), each
	// auxillary symbol is padded with 2 bytes at the end. Copy each			// auxillary symbol is padded with 2 bytes at the end. Copy each
	// auxillary symbol to the Sym.AuxData vector. For file symbols,			// auxillary symbol to the Sym.AuxData vector. For file symbols,
	// the whole range of aux symbols are interpreted as one null padded			// the whole range of aux symbols are interpreted as one null padded
	// string instead.			// string instead.
	if (SymRef.isFileRecord())			if (SymRef.isFileRecord())
	Sym.AuxFile = StringRef(reinterpret_cast<const char *>(AuxData.data()),			Sym.AuxFile = StringRef(reinterpret_cast<const char *>(AuxData.data()),
	AuxData.size())			AuxData.size())
	.rtrim('\0');			.rtrim('\0');
	else			else
	for (size_t I = 0; I < SymRef.getNumberOfAuxSymbols(); I++)			for (size_t I = 0; I < SymRef.getNumberOfAuxSymbols(); I++)
	Sym.AuxData.push_back(AuxData.slice(I * SymSize, sizeof(AuxSymbol)));			Sym.AuxData.push_back(AuxData.slice(I * SymSize, sizeof(AuxSymbol)));

	// Find the unique id of the section			// Find the unique id of the section
	if (SymRef.getSectionNumber() <=			if (SymRef.getSectionNumber() <=
	0) // Special symbol (undefined/absolute/debug)			0) // Special symbol (undefined/absolute/debug)
	Sym.TargetSectionId = SymRef.getSectionNumber();			Sym.TargetSectionId = SymRef.getSectionNumber();
	else if (static_cast<uint32_t>(SymRef.getSectionNumber() - 1) <			else if (static_cast<uint32_t>(SymRef.getSectionNumber() - 1) <
	Sections.size())			Sections.size())
	Sym.TargetSectionId = Sections[SymRef.getSectionNumber() - 1].UniqueId;			Sym.TargetSectionId = Sections[SymRef.getSectionNumber() - 1].UniqueId;
	else			else
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"section number out of range");			"section number out of range");
	// For section definitions, check if it is comdat associative, and if			// For section definitions, check if it is comdat associative, and if
	// it is, find the target section unique id.			// it is, find the target section unique id.
	const coff_aux_section_definition *SD = SymRef.getSectionDefinition();			const coff_aux_section_definition *SD = SymRef.getSectionDefinition();
	const coff_aux_weak_external *WE = SymRef.getWeakExternal();			const coff_aux_weak_external *WE = SymRef.getWeakExternal();
	if (SD && SD->Selection == IMAGE_COMDAT_SELECT_ASSOCIATIVE) {			if (SD && SD->Selection == IMAGE_COMDAT_SELECT_ASSOCIATIVE) {
	int32_t Index = SD->getNumber(IsBigObj);			int32_t Index = SD->getNumber(IsBigObj);
	if (Index <= 0 \|\| static_cast<uint32_t>(Index - 1) >= Sections.size())			if (Index <= 0 \|\| static_cast<uint32_t>(Index - 1) >= Sections.size())
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"unexpected associative section index");			"unexpected associative section index");
	Sym.AssociativeComdatTargetSectionId = Sections[Index - 1].UniqueId;			Sym.AssociativeComdatTargetSectionId = Sections[Index - 1].UniqueId;
	} else if (WE) {			} else if (WE) {
	// This is a raw symbol index for now, but store it in the Symbol			// This is a raw symbol index for now, but store it in the Symbol
	// until we've added them to the Object, which assigns the final			// until we've added them to the Object, which assigns the final
	// unique ids.			// unique ids.
	Sym.WeakTargetSymbolId = WE->TagIndex;			Sym.WeakTargetSymbolId = WE->TagIndex;
	}			}
	I += 1 + SymRef.getNumberOfAuxSymbols();			I += 1 + SymRef.getNumberOfAuxSymbols();
	}			}
	Obj.addSymbols(Symbols);			Obj.addSymbols(Symbols);
	return Error::success();			return Error::success();
	}			}

	Error COFFReader::setSymbolTargets(Object &Obj) const {			Error COFFReader::setSymbolTargets(Object &Obj) const {
	std::vector<const Symbol *> RawSymbolTable;			std::vector<const Symbol *> RawSymbolTable;
	for (const Symbol &Sym : Obj.getSymbols()) {			for (const Symbol &Sym : Obj.getSymbols()) {
	RawSymbolTable.push_back(&Sym);			RawSymbolTable.push_back(&Sym);
	for (size_t I = 0; I < Sym.Sym.NumberOfAuxSymbols; I++)			for (size_t I = 0; I < Sym.Sym.NumberOfAuxSymbols; I++)
	RawSymbolTable.push_back(nullptr);			RawSymbolTable.push_back(nullptr);
	}			}
	for (Symbol &Sym : Obj.getMutableSymbols()) {			for (Symbol &Sym : Obj.getMutableSymbols()) {
	// Convert WeakTargetSymbolId from the original raw symbol index to			// Convert WeakTargetSymbolId from the original raw symbol index to
	// a proper unique id.			// a proper unique id.
	if (Sym.WeakTargetSymbolId) {			if (Sym.WeakTargetSymbolId) {
	if (*Sym.WeakTargetSymbolId >= RawSymbolTable.size())			if (*Sym.WeakTargetSymbolId >= RawSymbolTable.size())
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"weak external reference out of range");			"weak external reference out of range");
	const Symbol Target = RawSymbolTable[Sym.WeakTargetSymbolId];			const Symbol Target = RawSymbolTable[Sym.WeakTargetSymbolId];
	if (Target == nullptr)			if (Target == nullptr)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"invalid SymbolTableIndex");			"invalid SymbolTableIndex");
	Sym.WeakTargetSymbolId = Target->UniqueId;			Sym.WeakTargetSymbolId = Target->UniqueId;
	}			}
	}			}
	for (Section &Sec : Obj.getMutableSections()) {			for (Section &Sec : Obj.getMutableSections()) {
	for (Relocation &R : Sec.Relocs) {			for (Relocation &R : Sec.Relocs) {
	if (R.Reloc.SymbolTableIndex >= RawSymbolTable.size())			if (R.Reloc.SymbolTableIndex >= RawSymbolTable.size())
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"SymbolTableIndex out of range");			"SymbolTableIndex out of range");
	const Symbol *Sym = RawSymbolTable[R.Reloc.SymbolTableIndex];			const Symbol *Sym = RawSymbolTable[R.Reloc.SymbolTableIndex];
	if (Sym == nullptr)			if (Sym == nullptr)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"invalid SymbolTableIndex");			"invalid SymbolTableIndex");
	R.Target = Sym->UniqueId;			R.Target = Sym->UniqueId;
	R.TargetName = Sym->Name;			R.TargetName = Sym->Name;
	}			}
	}			}
	return Error::success();			return Error::success();
	}			}

	Expected<std::unique_ptr<Object>> COFFReader::create() const {			Expected<std::unique_ptr<Object>> COFFReader::create() const {
	auto Obj = std::make_unique<Object>();			auto Obj = std::make_unique<Object>();

	bool IsBigObj = false;			bool IsBigObj = false;
	if (const coff_file_header *CFH = COFFObj.getCOFFHeader()) {			if (const coff_file_header *CFH = COFFObj.getCOFFHeader()) {
	Obj->CoffFileHeader = *CFH;			Obj->CoffFileHeader = *CFH;
	} else {			} else {
	const coff_bigobj_file_header *CBFH = COFFObj.getCOFFBigObjHeader();			const coff_bigobj_file_header *CBFH = COFFObj.getCOFFBigObjHeader();
	if (!CBFH)			if (!CBFH)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"no COFF file header returned");			"no COFF file header returned");
	// Only copying the few fields from the bigobj header that we need			// Only copying the few fields from the bigobj header that we need
	// and won't recreate in the end.			// and won't recreate in the end.
	Obj->CoffFileHeader.Machine = CBFH->Machine;			Obj->CoffFileHeader.Machine = CBFH->Machine;
	Obj->CoffFileHeader.TimeDateStamp = CBFH->TimeDateStamp;			Obj->CoffFileHeader.TimeDateStamp = CBFH->TimeDateStamp;
	IsBigObj = true;			IsBigObj = true;
	}			}

	if (Error E = readExecutableHeaders(*Obj))			if (Error E = readExecutableHeaders(*Obj))
	return std::move(E);			return std::move(E);
	if (Error E = readSections(*Obj))			if (Error E = readSections(*Obj))
	return std::move(E);			return std::move(E);
	if (Error E = readSymbols(*Obj, IsBigObj))			if (Error E = readSymbols(*Obj, IsBigObj))
	return std::move(E);			return std::move(E);
	if (Error E = setSymbolTargets(*Obj))			if (Error E = setSymbolTargets(*Obj))
	return std::move(E);			return std::move(E);

	return std::move(Obj);			return std::move(Obj);
	}			}

	} // end namespace coff			} // end namespace coff
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/COFF/Writer.h

This file was moved from llvm/tools/llvm-objcopy/COFF/Writer.h.

//===- Writer.h -------------------------------------------------- C++ --===//		//===- Writer.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_COFF_WRITER_H		#ifndef LLVM_LIB_OBJCOPY_COFF_WRITER_H
#define LLVM_TOOLS_OBJCOPY_COFF_WRITER_H		#define LLVM_LIB_OBJCOPY_COFF_WRITER_H

#include "Buffer.h"
#include "llvm/MC/StringTableBuilder.h"		#include "llvm/MC/StringTableBuilder.h"
		#include "llvm/ObjCopy/Buffer.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include <cstddef>		#include <cstddef>
#include <utility>		#include <utility>

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {
namespace coff {		namespace coff {

Show All 32 Lines	public:
COFFWriter(Object &Obj, Buffer &Buf)		COFFWriter(Object &Obj, Buffer &Buf)
: Obj(Obj), Buf(Buf), StrTabBuilder(StringTableBuilder::WinCOFF) {}		: Obj(Obj), Buf(Buf), StrTabBuilder(StringTableBuilder::WinCOFF) {}
};		};

} // end namespace coff		} // end namespace coff
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_COFF_WRITER_H		#endif // LLVM_LIB_OBJCOPY_COFF_WRITER_H

llvm/lib/ObjCopy/COFF/Writer.cpp

This file was moved from llvm/tools/llvm-objcopy/COFF/Writer.cpp.

	//===- Writer.cpp ---------------------------------------------------------===//			//===- Writer.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Writer.h"			#include "Writer.h"
	#include "Object.h"			#include "Object.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/BinaryFormat/COFF.h"			#include "llvm/BinaryFormat/COFF.h"
	#include "llvm/Object/COFF.h"			#include "llvm/Object/COFF.h"
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"
	#include <cstddef>			#include <cstddef>
	#include <cstdint>			#include <cstdint>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace coff {			namespace coff {

	using namespace object;			using namespace object;
	using namespace COFF;			using namespace COFF;

	Error COFFWriter::finalizeRelocTargets() {			Error COFFWriter::finalizeRelocTargets() {
	for (Section &Sec : Obj.getMutableSections()) {			for (Section &Sec : Obj.getMutableSections()) {
	for (Relocation &R : Sec.Relocs) {			for (Relocation &R : Sec.Relocs) {
	const Symbol *Sym = Obj.findSymbol(R.Target);			const Symbol *Sym = Obj.findSymbol(R.Target);
	if (Sym == nullptr)			if (Sym == nullptr)
	return createStringError(object_error::invalid_symbol_index,			return createStringError(object_error::invalid_symbol_index,
	"relocation target '%s' (%zu) not found",			"relocation target '%s' (%zu) not found",
	R.TargetName.str().c_str(), R.Target);			R.TargetName.str().c_str(), R.Target);
	R.Reloc.SymbolTableIndex = Sym->RawIndex;			R.Reloc.SymbolTableIndex = Sym->RawIndex;
	}			}
	}			}
	return Error::success();			return Error::success();
	}			}

	Error COFFWriter::finalizeSymbolContents() {			Error COFFWriter::finalizeSymbolContents() {
	for (Symbol &Sym : Obj.getMutableSymbols()) {			for (Symbol &Sym : Obj.getMutableSymbols()) {
	if (Sym.TargetSectionId <= 0) {			if (Sym.TargetSectionId <= 0) {
	// Undefined, or a special kind of symbol. These negative values			// Undefined, or a special kind of symbol. These negative values
	// are stored in the SectionNumber field which is unsigned.			// are stored in the SectionNumber field which is unsigned.
	Sym.Sym.SectionNumber = static_cast<uint32_t>(Sym.TargetSectionId);			Sym.Sym.SectionNumber = static_cast<uint32_t>(Sym.TargetSectionId);
	} else {			} else {
	const Section *Sec = Obj.findSection(Sym.TargetSectionId);			const Section *Sec = Obj.findSection(Sym.TargetSectionId);
	if (Sec == nullptr)			if (Sec == nullptr)
	return createStringError(object_error::invalid_symbol_index,			return createStringError(object_error::invalid_symbol_index,
	"symbol '%s' points to a removed section",			"symbol '%s' points to a removed section",
	Sym.Name.str().c_str());			Sym.Name.str().c_str());
	Sym.Sym.SectionNumber = Sec->Index;			Sym.Sym.SectionNumber = Sec->Index;

	if (Sym.Sym.NumberOfAuxSymbols == 1 &&			if (Sym.Sym.NumberOfAuxSymbols == 1 &&
	Sym.Sym.StorageClass == IMAGE_SYM_CLASS_STATIC) {			Sym.Sym.StorageClass == IMAGE_SYM_CLASS_STATIC) {
	coff_aux_section_definition *SD =			coff_aux_section_definition *SD =
	reinterpret_cast<coff_aux_section_definition *>(			reinterpret_cast<coff_aux_section_definition *>(
	Sym.AuxData[0].Opaque);			Sym.AuxData[0].Opaque);
	uint32_t SDSectionNumber;			uint32_t SDSectionNumber;
	if (Sym.AssociativeComdatTargetSectionId == 0) {			if (Sym.AssociativeComdatTargetSectionId == 0) {
	// Not a comdat associative section; just set the Number field to			// Not a comdat associative section; just set the Number field to
	// the number of the section itself.			// the number of the section itself.
	SDSectionNumber = Sec->Index;			SDSectionNumber = Sec->Index;
	} else {			} else {
	Sec = Obj.findSection(Sym.AssociativeComdatTargetSectionId);			Sec = Obj.findSection(Sym.AssociativeComdatTargetSectionId);
	if (Sec == nullptr)			if (Sec == nullptr)
	return createStringError(			return createStringError(
	object_error::invalid_symbol_index,			object_error::invalid_symbol_index,
	"symbol '%s' is associative to a removed section",			"symbol '%s' is associative to a removed section",
	Sym.Name.str().c_str());			Sym.Name.str().c_str());
	SDSectionNumber = Sec->Index;			SDSectionNumber = Sec->Index;
	}			}
	// Update the section definition with the new section number.			// Update the section definition with the new section number.
	SD->NumberLowPart = static_cast<uint16_t>(SDSectionNumber);			SD->NumberLowPart = static_cast<uint16_t>(SDSectionNumber);
	SD->NumberHighPart = static_cast<uint16_t>(SDSectionNumber >> 16);			SD->NumberHighPart = static_cast<uint16_t>(SDSectionNumber >> 16);
	}			}
	}			}
	// Check that we actually have got AuxData to match the weak symbol target			// Check that we actually have got AuxData to match the weak symbol target
	// we want to set. Only >= 1 would be required, but only == 1 makes sense.			// we want to set. Only >= 1 would be required, but only == 1 makes sense.
	if (Sym.WeakTargetSymbolId && Sym.Sym.NumberOfAuxSymbols == 1) {			if (Sym.WeakTargetSymbolId && Sym.Sym.NumberOfAuxSymbols == 1) {
	coff_aux_weak_external *WE =			coff_aux_weak_external *WE =
	reinterpret_cast<coff_aux_weak_external *>(Sym.AuxData[0].Opaque);			reinterpret_cast<coff_aux_weak_external *>(Sym.AuxData[0].Opaque);
	const Symbol Target = Obj.findSymbol(Sym.WeakTargetSymbolId);			const Symbol Target = Obj.findSymbol(Sym.WeakTargetSymbolId);
	if (Target == nullptr)			if (Target == nullptr)
	return createStringError(object_error::invalid_symbol_index,			return createStringError(object_error::invalid_symbol_index,
	"symbol '%s' is missing its weak target",			"symbol '%s' is missing its weak target",
	Sym.Name.str().c_str());			Sym.Name.str().c_str());
	WE->TagIndex = Target->RawIndex;			WE->TagIndex = Target->RawIndex;
	}			}
	}			}
	return Error::success();			return Error::success();
	}			}

	void COFFWriter::layoutSections() {			void COFFWriter::layoutSections() {
	for (auto &S : Obj.getMutableSections()) {			for (auto &S : Obj.getMutableSections()) {
	if (S.Header.SizeOfRawData > 0)			if (S.Header.SizeOfRawData > 0)
	S.Header.PointerToRawData = FileSize;			S.Header.PointerToRawData = FileSize;
	FileSize += S.Header.SizeOfRawData; // For executables, this is already			FileSize += S.Header.SizeOfRawData; // For executables, this is already
	// aligned to FileAlignment.			// aligned to FileAlignment.
	if (S.Relocs.size() >= 0xffff) {			if (S.Relocs.size() >= 0xffff) {
	S.Header.Characteristics \|= COFF::IMAGE_SCN_LNK_NRELOC_OVFL;			S.Header.Characteristics \|= COFF::IMAGE_SCN_LNK_NRELOC_OVFL;
	S.Header.NumberOfRelocations = 0xffff;			S.Header.NumberOfRelocations = 0xffff;
	S.Header.PointerToRelocations = FileSize;			S.Header.PointerToRelocations = FileSize;
	FileSize += sizeof(coff_relocation);			FileSize += sizeof(coff_relocation);
	} else {			} else {
	S.Header.NumberOfRelocations = S.Relocs.size();			S.Header.NumberOfRelocations = S.Relocs.size();
	S.Header.PointerToRelocations = S.Relocs.size() ? FileSize : 0;			S.Header.PointerToRelocations = S.Relocs.size() ? FileSize : 0;
	}			}

	FileSize += S.Relocs.size() * sizeof(coff_relocation);			FileSize += S.Relocs.size() * sizeof(coff_relocation);
	FileSize = alignTo(FileSize, FileAlignment);			FileSize = alignTo(FileSize, FileAlignment);

	if (S.Header.Characteristics & IMAGE_SCN_CNT_INITIALIZED_DATA)			if (S.Header.Characteristics & IMAGE_SCN_CNT_INITIALIZED_DATA)
	SizeOfInitializedData += S.Header.SizeOfRawData;			SizeOfInitializedData += S.Header.SizeOfRawData;
	}			}
	}			}

	size_t COFFWriter::finalizeStringTable() {			size_t COFFWriter::finalizeStringTable() {
	for (const auto &S : Obj.getSections())			for (const auto &S : Obj.getSections())
	if (S.Name.size() > COFF::NameSize)			if (S.Name.size() > COFF::NameSize)
	StrTabBuilder.add(S.Name);			StrTabBuilder.add(S.Name);

	for (const auto &S : Obj.getSymbols())			for (const auto &S : Obj.getSymbols())
	if (S.Name.size() > COFF::NameSize)			if (S.Name.size() > COFF::NameSize)
	StrTabBuilder.add(S.Name);			StrTabBuilder.add(S.Name);

	StrTabBuilder.finalize();			StrTabBuilder.finalize();

	for (auto &S : Obj.getMutableSections()) {			for (auto &S : Obj.getMutableSections()) {
	memset(S.Header.Name, 0, sizeof(S.Header.Name));			memset(S.Header.Name, 0, sizeof(S.Header.Name));
	if (S.Name.size() > COFF::NameSize) {			if (S.Name.size() > COFF::NameSize) {
	snprintf(S.Header.Name, sizeof(S.Header.Name), "/%d",			snprintf(S.Header.Name, sizeof(S.Header.Name), "/%d",
	(int)StrTabBuilder.getOffset(S.Name));			(int)StrTabBuilder.getOffset(S.Name));
	} else {			} else {
	memcpy(S.Header.Name, S.Name.data(), S.Name.size());			memcpy(S.Header.Name, S.Name.data(), S.Name.size());
	}			}
	}			}
	for (auto &S : Obj.getMutableSymbols()) {			for (auto &S : Obj.getMutableSymbols()) {
	if (S.Name.size() > COFF::NameSize) {			if (S.Name.size() > COFF::NameSize) {
	S.Sym.Name.Offset.Zeroes = 0;			S.Sym.Name.Offset.Zeroes = 0;
	S.Sym.Name.Offset.Offset = StrTabBuilder.getOffset(S.Name);			S.Sym.Name.Offset.Offset = StrTabBuilder.getOffset(S.Name);
	} else {			} else {
	strncpy(S.Sym.Name.ShortName, S.Name.data(), COFF::NameSize);			strncpy(S.Sym.Name.ShortName, S.Name.data(), COFF::NameSize);
	}			}
	}			}
	return StrTabBuilder.getSize();			return StrTabBuilder.getSize();
	}			}

	template <class SymbolTy>			template <class SymbolTy>
	std::pair<size_t, size_t> COFFWriter::finalizeSymbolTable() {			std::pair<size_t, size_t> COFFWriter::finalizeSymbolTable() {
	size_t RawSymIndex = 0;			size_t RawSymIndex = 0;
	for (auto &S : Obj.getMutableSymbols()) {			for (auto &S : Obj.getMutableSymbols()) {
	// Symbols normally have NumberOfAuxSymbols set correctly all the time.			// Symbols normally have NumberOfAuxSymbols set correctly all the time.
	// For file symbols, we need to know the output file's symbol size to be			// For file symbols, we need to know the output file's symbol size to be
	// able to calculate the number of slots it occupies.			// able to calculate the number of slots it occupies.
	if (!S.AuxFile.empty())			if (!S.AuxFile.empty())
	S.Sym.NumberOfAuxSymbols =			S.Sym.NumberOfAuxSymbols =
	alignTo(S.AuxFile.size(), sizeof(SymbolTy)) / sizeof(SymbolTy);			alignTo(S.AuxFile.size(), sizeof(SymbolTy)) / sizeof(SymbolTy);
	S.RawIndex = RawSymIndex;			S.RawIndex = RawSymIndex;
	RawSymIndex += 1 + S.Sym.NumberOfAuxSymbols;			RawSymIndex += 1 + S.Sym.NumberOfAuxSymbols;
	}			}
	return std::make_pair(RawSymIndex * sizeof(SymbolTy), sizeof(SymbolTy));			return std::make_pair(RawSymIndex * sizeof(SymbolTy), sizeof(SymbolTy));
	}			}

	Error COFFWriter::finalize(bool IsBigObj) {			Error COFFWriter::finalize(bool IsBigObj) {
	size_t SymTabSize, SymbolSize;			size_t SymTabSize, SymbolSize;
	std::tie(SymTabSize, SymbolSize) = IsBigObj			std::tie(SymTabSize, SymbolSize) = IsBigObj
	? finalizeSymbolTable<coff_symbol32>()			? finalizeSymbolTable<coff_symbol32>()
	: finalizeSymbolTable<coff_symbol16>();			: finalizeSymbolTable<coff_symbol16>();

	if (Error E = finalizeRelocTargets())			if (Error E = finalizeRelocTargets())
	return E;			return E;
	if (Error E = finalizeSymbolContents())			if (Error E = finalizeSymbolContents())
	return E;			return E;

	size_t SizeOfHeaders = 0;			size_t SizeOfHeaders = 0;
	FileAlignment = 1;			FileAlignment = 1;
	size_t PeHeaderSize = 0;			size_t PeHeaderSize = 0;
	if (Obj.IsPE) {			if (Obj.IsPE) {
	Obj.DosHeader.AddressOfNewExeHeader =			Obj.DosHeader.AddressOfNewExeHeader =
	sizeof(Obj.DosHeader) + Obj.DosStub.size();			sizeof(Obj.DosHeader) + Obj.DosStub.size();
	SizeOfHeaders += Obj.DosHeader.AddressOfNewExeHeader + sizeof(PEMagic);			SizeOfHeaders += Obj.DosHeader.AddressOfNewExeHeader + sizeof(PEMagic);

	FileAlignment = Obj.PeHeader.FileAlignment;			FileAlignment = Obj.PeHeader.FileAlignment;
	Obj.PeHeader.NumberOfRvaAndSize = Obj.DataDirectories.size();			Obj.PeHeader.NumberOfRvaAndSize = Obj.DataDirectories.size();

	PeHeaderSize = Obj.Is64 ? sizeof(pe32plus_header) : sizeof(pe32_header);			PeHeaderSize = Obj.Is64 ? sizeof(pe32plus_header) : sizeof(pe32_header);
	SizeOfHeaders +=			SizeOfHeaders +=
	PeHeaderSize + sizeof(data_directory) * Obj.DataDirectories.size();			PeHeaderSize + sizeof(data_directory) * Obj.DataDirectories.size();
	}			}
	Obj.CoffFileHeader.NumberOfSections = Obj.getSections().size();			Obj.CoffFileHeader.NumberOfSections = Obj.getSections().size();
	SizeOfHeaders +=			SizeOfHeaders +=
	IsBigObj ? sizeof(coff_bigobj_file_header) : sizeof(coff_file_header);			IsBigObj ? sizeof(coff_bigobj_file_header) : sizeof(coff_file_header);
	SizeOfHeaders += sizeof(coff_section) * Obj.getSections().size();			SizeOfHeaders += sizeof(coff_section) * Obj.getSections().size();
	SizeOfHeaders = alignTo(SizeOfHeaders, FileAlignment);			SizeOfHeaders = alignTo(SizeOfHeaders, FileAlignment);

	Obj.CoffFileHeader.SizeOfOptionalHeader =			Obj.CoffFileHeader.SizeOfOptionalHeader =
	PeHeaderSize + sizeof(data_directory) * Obj.DataDirectories.size();			PeHeaderSize + sizeof(data_directory) * Obj.DataDirectories.size();

	FileSize = SizeOfHeaders;			FileSize = SizeOfHeaders;
	SizeOfInitializedData = 0;			SizeOfInitializedData = 0;

	layoutSections();			layoutSections();

	if (Obj.IsPE) {			if (Obj.IsPE) {
	Obj.PeHeader.SizeOfHeaders = SizeOfHeaders;			Obj.PeHeader.SizeOfHeaders = SizeOfHeaders;
	Obj.PeHeader.SizeOfInitializedData = SizeOfInitializedData;			Obj.PeHeader.SizeOfInitializedData = SizeOfInitializedData;

	if (!Obj.getSections().empty()) {			if (!Obj.getSections().empty()) {
	const Section &S = Obj.getSections().back();			const Section &S = Obj.getSections().back();
	Obj.PeHeader.SizeOfImage =			Obj.PeHeader.SizeOfImage =
	alignTo(S.Header.VirtualAddress + S.Header.VirtualSize,			alignTo(S.Header.VirtualAddress + S.Header.VirtualSize,
	Obj.PeHeader.SectionAlignment);			Obj.PeHeader.SectionAlignment);
	}			}

	// If the PE header had a checksum, clear it, since it isn't valid			// If the PE header had a checksum, clear it, since it isn't valid
	// any longer. (We don't calculate a new one.)			// any longer. (We don't calculate a new one.)
	Obj.PeHeader.CheckSum = 0;			Obj.PeHeader.CheckSum = 0;
	}			}

	size_t StrTabSize = finalizeStringTable();			size_t StrTabSize = finalizeStringTable();

	size_t PointerToSymbolTable = FileSize;			size_t PointerToSymbolTable = FileSize;
	// StrTabSize <= 4 is the size of an empty string table, only consisting			// StrTabSize <= 4 is the size of an empty string table, only consisting
	// of the length field.			// of the length field.
	if (SymTabSize == 0 && StrTabSize <= 4 && Obj.IsPE) {			if (SymTabSize == 0 && StrTabSize <= 4 && Obj.IsPE) {
	// For executables, don't point to the symbol table and skip writing			// For executables, don't point to the symbol table and skip writing
	// the length field, if both the symbol and string tables are empty.			// the length field, if both the symbol and string tables are empty.
	PointerToSymbolTable = 0;			PointerToSymbolTable = 0;
	StrTabSize = 0;			StrTabSize = 0;
	}			}

	size_t NumRawSymbols = SymTabSize / SymbolSize;			size_t NumRawSymbols = SymTabSize / SymbolSize;
	Obj.CoffFileHeader.PointerToSymbolTable = PointerToSymbolTable;			Obj.CoffFileHeader.PointerToSymbolTable = PointerToSymbolTable;
	Obj.CoffFileHeader.NumberOfSymbols = NumRawSymbols;			Obj.CoffFileHeader.NumberOfSymbols = NumRawSymbols;
	FileSize += SymTabSize + StrTabSize;			FileSize += SymTabSize + StrTabSize;
	FileSize = alignTo(FileSize, FileAlignment);			FileSize = alignTo(FileSize, FileAlignment);

	return Error::success();			return Error::success();
	}			}

	void COFFWriter::writeHeaders(bool IsBigObj) {			void COFFWriter::writeHeaders(bool IsBigObj) {
	uint8_t *Ptr = Buf.getBufferStart();			uint8_t *Ptr = Buf.getBufferStart();
	if (Obj.IsPE) {			if (Obj.IsPE) {
	memcpy(Ptr, &Obj.DosHeader, sizeof(Obj.DosHeader));			memcpy(Ptr, &Obj.DosHeader, sizeof(Obj.DosHeader));
	Ptr += sizeof(Obj.DosHeader);			Ptr += sizeof(Obj.DosHeader);
	memcpy(Ptr, Obj.DosStub.data(), Obj.DosStub.size());			memcpy(Ptr, Obj.DosStub.data(), Obj.DosStub.size());
	Ptr += Obj.DosStub.size();			Ptr += Obj.DosStub.size();
	memcpy(Ptr, PEMagic, sizeof(PEMagic));			memcpy(Ptr, PEMagic, sizeof(PEMagic));
	Ptr += sizeof(PEMagic);			Ptr += sizeof(PEMagic);
	}			}
	if (!IsBigObj) {			if (!IsBigObj) {
	memcpy(Ptr, &Obj.CoffFileHeader, sizeof(Obj.CoffFileHeader));			memcpy(Ptr, &Obj.CoffFileHeader, sizeof(Obj.CoffFileHeader));
	Ptr += sizeof(Obj.CoffFileHeader);			Ptr += sizeof(Obj.CoffFileHeader);
	} else {			} else {
	// Generate a coff_bigobj_file_header, filling it in with the values			// Generate a coff_bigobj_file_header, filling it in with the values
	// from Obj.CoffFileHeader. All extra fields that don't exist in			// from Obj.CoffFileHeader. All extra fields that don't exist in
	// coff_file_header can be set to hardcoded values.			// coff_file_header can be set to hardcoded values.
	coff_bigobj_file_header BigObjHeader;			coff_bigobj_file_header BigObjHeader;
	BigObjHeader.Sig1 = IMAGE_FILE_MACHINE_UNKNOWN;			BigObjHeader.Sig1 = IMAGE_FILE_MACHINE_UNKNOWN;
	BigObjHeader.Sig2 = 0xffff;			BigObjHeader.Sig2 = 0xffff;
	BigObjHeader.Version = BigObjHeader::MinBigObjectVersion;			BigObjHeader.Version = BigObjHeader::MinBigObjectVersion;
	BigObjHeader.Machine = Obj.CoffFileHeader.Machine;			BigObjHeader.Machine = Obj.CoffFileHeader.Machine;
	BigObjHeader.TimeDateStamp = Obj.CoffFileHeader.TimeDateStamp;			BigObjHeader.TimeDateStamp = Obj.CoffFileHeader.TimeDateStamp;
	memcpy(BigObjHeader.UUID, BigObjMagic, sizeof(BigObjMagic));			memcpy(BigObjHeader.UUID, BigObjMagic, sizeof(BigObjMagic));
	BigObjHeader.unused1 = 0;			BigObjHeader.unused1 = 0;
	BigObjHeader.unused2 = 0;			BigObjHeader.unused2 = 0;
	BigObjHeader.unused3 = 0;			BigObjHeader.unused3 = 0;
	BigObjHeader.unused4 = 0;			BigObjHeader.unused4 = 0;
	// The value in Obj.CoffFileHeader.NumberOfSections is truncated, thus			// The value in Obj.CoffFileHeader.NumberOfSections is truncated, thus
	// get the original one instead.			// get the original one instead.
	BigObjHeader.NumberOfSections = Obj.getSections().size();			BigObjHeader.NumberOfSections = Obj.getSections().size();
	BigObjHeader.PointerToSymbolTable = Obj.CoffFileHeader.PointerToSymbolTable;			BigObjHeader.PointerToSymbolTable = Obj.CoffFileHeader.PointerToSymbolTable;
	BigObjHeader.NumberOfSymbols = Obj.CoffFileHeader.NumberOfSymbols;			BigObjHeader.NumberOfSymbols = Obj.CoffFileHeader.NumberOfSymbols;

	memcpy(Ptr, &BigObjHeader, sizeof(BigObjHeader));			memcpy(Ptr, &BigObjHeader, sizeof(BigObjHeader));
	Ptr += sizeof(BigObjHeader);			Ptr += sizeof(BigObjHeader);
	}			}
	if (Obj.IsPE) {			if (Obj.IsPE) {
	if (Obj.Is64) {			if (Obj.Is64) {
	memcpy(Ptr, &Obj.PeHeader, sizeof(Obj.PeHeader));			memcpy(Ptr, &Obj.PeHeader, sizeof(Obj.PeHeader));
	Ptr += sizeof(Obj.PeHeader);			Ptr += sizeof(Obj.PeHeader);
	} else {			} else {
	pe32_header PeHeader;			pe32_header PeHeader;
	copyPeHeader(PeHeader, Obj.PeHeader);			copyPeHeader(PeHeader, Obj.PeHeader);
	// The pe32plus_header (stored in Object) lacks the BaseOfData field.			// The pe32plus_header (stored in Object) lacks the BaseOfData field.
	PeHeader.BaseOfData = Obj.BaseOfData;			PeHeader.BaseOfData = Obj.BaseOfData;

	memcpy(Ptr, &PeHeader, sizeof(PeHeader));			memcpy(Ptr, &PeHeader, sizeof(PeHeader));
	Ptr += sizeof(PeHeader);			Ptr += sizeof(PeHeader);
	}			}
	for (const auto &DD : Obj.DataDirectories) {			for (const auto &DD : Obj.DataDirectories) {
	memcpy(Ptr, &DD, sizeof(DD));			memcpy(Ptr, &DD, sizeof(DD));
	Ptr += sizeof(DD);			Ptr += sizeof(DD);
	}			}
	}			}
	for (const auto &S : Obj.getSections()) {			for (const auto &S : Obj.getSections()) {
	memcpy(Ptr, &S.Header, sizeof(S.Header));			memcpy(Ptr, &S.Header, sizeof(S.Header));
	Ptr += sizeof(S.Header);			Ptr += sizeof(S.Header);
	}			}
	}			}

	void COFFWriter::writeSections() {			void COFFWriter::writeSections() {
	for (const auto &S : Obj.getSections()) {			for (const auto &S : Obj.getSections()) {
	uint8_t *Ptr = Buf.getBufferStart() + S.Header.PointerToRawData;			uint8_t *Ptr = Buf.getBufferStart() + S.Header.PointerToRawData;
	ArrayRef<uint8_t> Contents = S.getContents();			ArrayRef<uint8_t> Contents = S.getContents();
	std::copy(Contents.begin(), Contents.end(), Ptr);			std::copy(Contents.begin(), Contents.end(), Ptr);

	// For executable sections, pad the remainder of the raw data size with			// For executable sections, pad the remainder of the raw data size with
	// 0xcc, which is int3 on x86.			// 0xcc, which is int3 on x86.
	if ((S.Header.Characteristics & IMAGE_SCN_CNT_CODE) &&			if ((S.Header.Characteristics & IMAGE_SCN_CNT_CODE) &&
	S.Header.SizeOfRawData > Contents.size())			S.Header.SizeOfRawData > Contents.size())
	memset(Ptr + Contents.size(), 0xcc,			memset(Ptr + Contents.size(), 0xcc,
	S.Header.SizeOfRawData - Contents.size());			S.Header.SizeOfRawData - Contents.size());

	Ptr += S.Header.SizeOfRawData;			Ptr += S.Header.SizeOfRawData;

	if (S.Relocs.size() >= 0xffff) {			if (S.Relocs.size() >= 0xffff) {
	object::coff_relocation R;			object::coff_relocation R;
	R.VirtualAddress = S.Relocs.size() + 1;			R.VirtualAddress = S.Relocs.size() + 1;
	R.SymbolTableIndex = 0;			R.SymbolTableIndex = 0;
	R.Type = 0;			R.Type = 0;
	memcpy(Ptr, &R, sizeof(R));			memcpy(Ptr, &R, sizeof(R));
	Ptr += sizeof(R);			Ptr += sizeof(R);
	}			}
	for (const auto &R : S.Relocs) {			for (const auto &R : S.Relocs) {
	memcpy(Ptr, &R.Reloc, sizeof(R.Reloc));			memcpy(Ptr, &R.Reloc, sizeof(R.Reloc));
	Ptr += sizeof(R.Reloc);			Ptr += sizeof(R.Reloc);
	}			}
	}			}
	}			}

	template <class SymbolTy> void COFFWriter::writeSymbolStringTables() {			template <class SymbolTy> void COFFWriter::writeSymbolStringTables() {
	uint8_t *Ptr = Buf.getBufferStart() + Obj.CoffFileHeader.PointerToSymbolTable;			uint8_t *Ptr = Buf.getBufferStart() + Obj.CoffFileHeader.PointerToSymbolTable;
	for (const auto &S : Obj.getSymbols()) {			for (const auto &S : Obj.getSymbols()) {
	// Convert symbols back to the right size, from coff_symbol32.			// Convert symbols back to the right size, from coff_symbol32.
	copySymbol<SymbolTy, coff_symbol32>(reinterpret_cast<SymbolTy >(Ptr),			copySymbol<SymbolTy, coff_symbol32>(reinterpret_cast<SymbolTy >(Ptr),
	S.Sym);			S.Sym);
	Ptr += sizeof(SymbolTy);			Ptr += sizeof(SymbolTy);
	if (!S.AuxFile.empty()) {			if (!S.AuxFile.empty()) {
	// For file symbols, just write the string into the aux symbol slots,			// For file symbols, just write the string into the aux symbol slots,
	// assuming that the unwritten parts are initialized to zero in the memory			// assuming that the unwritten parts are initialized to zero in the memory
	// mapped file.			// mapped file.
	std::copy(S.AuxFile.begin(), S.AuxFile.end(), Ptr);			std::copy(S.AuxFile.begin(), S.AuxFile.end(), Ptr);
	Ptr += S.Sym.NumberOfAuxSymbols * sizeof(SymbolTy);			Ptr += S.Sym.NumberOfAuxSymbols * sizeof(SymbolTy);
	} else {			} else {
	// For other auxillary symbols, write their opaque payload into one symbol			// For other auxillary symbols, write their opaque payload into one symbol
	// table slot each. For big object files, the symbols are larger than the			// table slot each. For big object files, the symbols are larger than the
	// opaque auxillary symbol struct and we leave padding at the end of each			// opaque auxillary symbol struct and we leave padding at the end of each
	// entry.			// entry.
	for (const AuxSymbol &AuxSym : S.AuxData) {			for (const AuxSymbol &AuxSym : S.AuxData) {
	ArrayRef<uint8_t> Ref = AuxSym.getRef();			ArrayRef<uint8_t> Ref = AuxSym.getRef();
	std::copy(Ref.begin(), Ref.end(), Ptr);			std::copy(Ref.begin(), Ref.end(), Ptr);
	Ptr += sizeof(SymbolTy);			Ptr += sizeof(SymbolTy);
	}			}
	}			}
	}			}
	if (StrTabBuilder.getSize() > 4 \|\| !Obj.IsPE) {			if (StrTabBuilder.getSize() > 4 \|\| !Obj.IsPE) {
	// Always write a string table in object files, even an empty one.			// Always write a string table in object files, even an empty one.
	StrTabBuilder.write(Ptr);			StrTabBuilder.write(Ptr);
	Ptr += StrTabBuilder.getSize();			Ptr += StrTabBuilder.getSize();
	}			}
	}			}

	Error COFFWriter::write(bool IsBigObj) {			Error COFFWriter::write(bool IsBigObj) {
	if (Error E = finalize(IsBigObj))			if (Error E = finalize(IsBigObj))
	return E;			return E;

	if (Error E = Buf.allocate(FileSize))			if (Error E = Buf.allocate(FileSize))
	return E;			return E;

	writeHeaders(IsBigObj);			writeHeaders(IsBigObj);
	writeSections();			writeSections();
	if (IsBigObj)			if (IsBigObj)
	writeSymbolStringTables<coff_symbol32>();			writeSymbolStringTables<coff_symbol32>();
	else			else
	writeSymbolStringTables<coff_symbol16>();			writeSymbolStringTables<coff_symbol16>();

	if (Obj.IsPE)			if (Obj.IsPE)
	if (Error E = patchDebugDirectory())			if (Error E = patchDebugDirectory())
	return E;			return E;

	return Buf.commit();			return Buf.commit();
	}			}

	Expected<uint32_t> COFFWriter::virtualAddressToFileAddress(uint32_t RVA) {			Expected<uint32_t> COFFWriter::virtualAddressToFileAddress(uint32_t RVA) {
	for (const auto &S : Obj.getSections()) {			for (const auto &S : Obj.getSections()) {
	if (RVA >= S.Header.VirtualAddress &&			if (RVA >= S.Header.VirtualAddress &&
	RVA < S.Header.VirtualAddress + S.Header.SizeOfRawData)			RVA < S.Header.VirtualAddress + S.Header.SizeOfRawData)
	return S.Header.PointerToRawData + RVA - S.Header.VirtualAddress;			return S.Header.PointerToRawData + RVA - S.Header.VirtualAddress;
	}			}
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"debug directory payload not found");			"debug directory payload not found");
	}			}

	// Locate which sections contain the debug directories, iterate over all			// Locate which sections contain the debug directories, iterate over all
	// the debug_directory structs in there, and set the PointerToRawData field			// the debug_directory structs in there, and set the PointerToRawData field
	// in all of them, according to their new physical location in the file.			// in all of them, according to their new physical location in the file.
	Error COFFWriter::patchDebugDirectory() {			Error COFFWriter::patchDebugDirectory() {
	if (Obj.DataDirectories.size() < DEBUG_DIRECTORY)			if (Obj.DataDirectories.size() < DEBUG_DIRECTORY)
	return Error::success();			return Error::success();
	const data_directory *Dir = &Obj.DataDirectories[DEBUG_DIRECTORY];			const data_directory *Dir = &Obj.DataDirectories[DEBUG_DIRECTORY];
	if (Dir->Size <= 0)			if (Dir->Size <= 0)
	return Error::success();			return Error::success();
	for (const auto &S : Obj.getSections()) {			for (const auto &S : Obj.getSections()) {
	if (Dir->RelativeVirtualAddress >= S.Header.VirtualAddress &&			if (Dir->RelativeVirtualAddress >= S.Header.VirtualAddress &&
	Dir->RelativeVirtualAddress <			Dir->RelativeVirtualAddress <
	S.Header.VirtualAddress + S.Header.SizeOfRawData) {			S.Header.VirtualAddress + S.Header.SizeOfRawData) {
	if (Dir->RelativeVirtualAddress + Dir->Size >			if (Dir->RelativeVirtualAddress + Dir->Size >
	S.Header.VirtualAddress + S.Header.SizeOfRawData)			S.Header.VirtualAddress + S.Header.SizeOfRawData)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"debug directory extends past end of section");			"debug directory extends past end of section");

	size_t Offset = Dir->RelativeVirtualAddress - S.Header.VirtualAddress;			size_t Offset = Dir->RelativeVirtualAddress - S.Header.VirtualAddress;
	uint8_t *Ptr = Buf.getBufferStart() + S.Header.PointerToRawData + Offset;			uint8_t *Ptr = Buf.getBufferStart() + S.Header.PointerToRawData + Offset;
	uint8_t *End = Ptr + Dir->Size;			uint8_t *End = Ptr + Dir->Size;
	while (Ptr < End) {			while (Ptr < End) {
	debug_directory Debug = reinterpret_cast<debug_directory >(Ptr);			debug_directory Debug = reinterpret_cast<debug_directory >(Ptr);
	if (!Debug->AddressOfRawData)			if (!Debug->AddressOfRawData)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"debug directory payload outside of "			"debug directory payload outside of "
	"mapped sections not supported");			"mapped sections not supported");
	if (Expected<uint32_t> FilePosOrErr =			if (Expected<uint32_t> FilePosOrErr =
	virtualAddressToFileAddress(Debug->AddressOfRawData))			virtualAddressToFileAddress(Debug->AddressOfRawData))
	Debug->PointerToRawData = *FilePosOrErr;			Debug->PointerToRawData = *FilePosOrErr;
	else			else
	return FilePosOrErr.takeError();			return FilePosOrErr.takeError();
	Ptr += sizeof(debug_directory);			Ptr += sizeof(debug_directory);
	Offset += sizeof(debug_directory);			Offset += sizeof(debug_directory);
	}			}
	// Debug directory found and patched, all done.			// Debug directory found and patched, all done.
	return Error::success();			return Error::success();
	}			}
	}			}
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"debug directory not found");			"debug directory not found");
	}			}

	Error COFFWriter::write() {			Error COFFWriter::write() {
	bool IsBigObj = Obj.getSections().size() > MaxNumberOfSections16;			bool IsBigObj = Obj.getSections().size() > MaxNumberOfSections16;
	if (IsBigObj && Obj.IsPE)			if (IsBigObj && Obj.IsPE)
	return createStringError(object_error::parse_failed,			return createStringError(object_error::parse_failed,
	"too many sections for executable");			"too many sections for executable");
	return write(IsBigObj);			return write(IsBigObj);
	}			}

	} // end namespace coff			} // end namespace coff
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/CopyConfig.cpp

This file was added.

				//===- CopyConfig.cpp -----------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ObjCopy/CopyConfig.h"

				namespace llvm {
				namespace objcopy {

				Expected<NameOrPattern>
				NameOrPattern::create(StringRef Pattern, MatchStyle MS,
				llvm::function_ref<Error(Error)> ErrorCallback) {
				switch (MS) {
				case MatchStyle::Literal:
				return NameOrPattern(Pattern);
				case MatchStyle::Wildcard: {
				SmallVector<char, 32> Data;
				bool IsPositiveMatch = true;
				if (Pattern[0] == '!') {
				IsPositiveMatch = false;
				Pattern = Pattern.drop_front();
				}
				Expected<GlobPattern> GlobOrErr = GlobPattern::create(Pattern);

				// If we couldn't create it as a glob, report the error, but try again with
				// a literal if the error reporting is non-fatal.
				if (!GlobOrErr) {
				if (Error E = ErrorCallback(GlobOrErr.takeError()))
				return std::move(E);
				return create(Pattern, MatchStyle::Literal, ErrorCallback);
				}

				return NameOrPattern(std::make_shared<GlobPattern>(*GlobOrErr),
				IsPositiveMatch);
				}
				case MatchStyle::Regex: {
				SmallVector<char, 32> Data;
				return NameOrPattern(std::make_shared<Regex>(
				("^" + Pattern.ltrim('^').rtrim('$') + "$").toStringRef(Data)));
				}
				}
				llvm_unreachable("Unhandled llvm.objcopy.MatchStyle enum");
				}

				} // namespace objcopy
				} // namespace llvm

llvm/lib/ObjCopy/ELF/ELFConfig.cpp

This file was moved from llvm/tools/llvm-objcopy/ELF/ELFConfig.cpp.

	//===- ELFConfig.cpp ------------------------------------------------------===//			//===- ELFConfig.cpp ------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "CopyConfig.h"
	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/StringSwitch.h"			#include "llvm/ADT/StringSwitch.h"
				#include "llvm/ObjCopy/CopyConfig.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	#include "llvm/Support/Error.h"			#include "llvm/Support/Error.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace elf {			namespace elf {

	static Expected<NewSymbolInfo> parseNewSymbolInfo(StringRef FlagValue,			static Expected<NewSymbolInfo> parseNewSymbolInfo(StringRef FlagValue,
	▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/ELF/ELFObjcopy.cpp

This file was moved from llvm/tools/llvm-objcopy/ELF/ELFObjcopy.cpp.

	//===- ELFObjcopy.cpp -----------------------------------------------------===//			//===- ELFObjcopy.cpp -----------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "ELFObjcopy.h"			#include "llvm/ObjCopy/ELF/ELFObjcopy.h"
	#include "Buffer.h"
	#include "CopyConfig.h"
	#include "Object.h"			#include "Object.h"
	#include "llvm/ADT/BitmaskEnum.h"			#include "llvm/ADT/BitmaskEnum.h"
	#include "llvm/ADT/DenseSet.h"			#include "llvm/ADT/DenseSet.h"
	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/ADT/Twine.h"			#include "llvm/ADT/Twine.h"
	#include "llvm/BinaryFormat/ELF.h"			#include "llvm/BinaryFormat/ELF.h"
	#include "llvm/MC/MCTargetOptions.h"			#include "llvm/MC/MCTargetOptions.h"
				#include "llvm/ObjCopy/Buffer.h"
				#include "llvm/ObjCopy/CopyConfig.h"
	#include "llvm/Object/Binary.h"			#include "llvm/Object/Binary.h"
	#include "llvm/Object/ELFObjectFile.h"			#include "llvm/Object/ELFObjectFile.h"
	#include "llvm/Object/ELFTypes.h"			#include "llvm/Object/ELFTypes.h"
	#include "llvm/Object/Error.h"			#include "llvm/Object/Error.h"
	#include "llvm/Option/Option.h"			#include "llvm/Option/Option.h"
	#include "llvm/Support/Casting.h"			#include "llvm/Support/Casting.h"
	#include "llvm/Support/Compression.h"			#include "llvm/Support/Compression.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	▲ Show 20 Lines • Show All 838 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/ELF/Object.h

This file was moved from llvm/tools/llvm-objcopy/ELF/Object.h.

//===- Object.h -------------------------------------------------- C++ --===//		//===- Object.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_OBJCOPY_OBJECT_H		#ifndef LLVM_LIB_OBJCOPY_ELF_OBJECT_H
#define LLVM_TOOLS_OBJCOPY_OBJECT_H		#define LLVM_LIB_OBJCOPY_ELF_OBJECT_H

#include "Buffer.h"
#include "CopyConfig.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/BinaryFormat/ELF.h"		#include "llvm/BinaryFormat/ELF.h"
#include "llvm/MC/StringTableBuilder.h"		#include "llvm/MC/StringTableBuilder.h"
		#include "llvm/ObjCopy/Buffer.h"
		#include "llvm/ObjCopy/CopyConfig.h"
#include "llvm/Object/ELFObjectFile.h"		#include "llvm/Object/ELFObjectFile.h"
#include "llvm/Support/Errc.h"		#include "llvm/Support/Errc.h"
#include "llvm/Support/FileOutputBuffer.h"		#include "llvm/Support/FileOutputBuffer.h"
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
#include <functional>		#include <functional>
#include <memory>		#include <memory>
#include <set>		#include <set>
▲ Show 20 Lines • Show All 1,065 Lines • ▼ Show 20 Lines	bool isRelocatable() const {
return (Type != ELF::ET_DYN && Type != ELF::ET_EXEC) \|\| MustBeRelocatable;		return (Type != ELF::ET_DYN && Type != ELF::ET_EXEC) \|\| MustBeRelocatable;
}		}
};		};

} // end namespace elf		} // end namespace elf
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_OBJCOPY_OBJECT_H		#endif // LLVM_LIB_OBJCOPY_ELF_OBJECT_H

llvm/lib/ObjCopy/ELF/Object.cpp

This file was moved from llvm/tools/llvm-objcopy/ELF/Object.cpp.

Show First 20 Lines • Show All 1,284 Lines • ▼ Show 20 Lines	auto Data = ArrayRef<uint8_t>(
MemBuf->getBufferSize());		MemBuf->getBufferSize());
auto &DataSection = Obj->addSection<Section>(Data);		auto &DataSection = Obj->addSection<Section>(Data);
DataSection.Name = ".data";		DataSection.Name = ".data";
DataSection.Type = ELF::SHT_PROGBITS;		DataSection.Type = ELF::SHT_PROGBITS;
DataSection.Size = Data.size();		DataSection.Size = Data.size();
DataSection.Flags = ELF::SHF_ALLOC \| ELF::SHF_WRITE;		DataSection.Flags = ELF::SHF_ALLOC \| ELF::SHF_WRITE;

std::string SanitizedFilename = MemBuf->getBufferIdentifier().str();		std::string SanitizedFilename = MemBuf->getBufferIdentifier().str();
std::replace_if(std::begin(SanitizedFilename), std::end(SanitizedFilename),		std::replace_if(std::begin(SanitizedFilename), std::end(SanitizedFilename),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - std::replace_if(std::begin(SanitizedFilename), std::end(SanitizedFilename), - [](char C) { return !isalnum(C); }, '_'); + std::replace_if( + std::begin(SanitizedFilename), std::end(SanitizedFilename), + [](char C) { return !isalnum(C); }, '_'); Lint: Pre-merge checks: clang-format: please reformat the code ``` - std::replace_if(std::begin(SanitizedFilename)…
[](char C) { return !isalnum(C); }, '_');		[](char C) { return !isalnum(C); }, '_');
Twine Prefix = Twine("_binary_") + SanitizedFilename;		Twine Prefix = Twine("_binary_") + SanitizedFilename;

SymTab->addSymbol(Prefix + "_start", STB_GLOBAL, STT_NOTYPE, &DataSection,		SymTab->addSymbol(Prefix + "_start", STB_GLOBAL, STT_NOTYPE, &DataSection,
/Value=/0, NewSymbolVisibility, 0, 0);		/Value=/0, NewSymbolVisibility, 0, 0);
SymTab->addSymbol(Prefix + "_end", STB_GLOBAL, STT_NOTYPE, &DataSection,		SymTab->addSymbol(Prefix + "_end", STB_GLOBAL, STT_NOTYPE, &DataSection,
/Value=/DataSection.Size, NewSymbolVisibility, 0, 0);		/Value=/DataSection.Size, NewSymbolVisibility, 0, 0);
SymTab->addSymbol(Prefix + "_size", STB_GLOBAL, STT_NOTYPE, nullptr,		SymTab->addSymbol(Prefix + "_size", STB_GLOBAL, STT_NOTYPE, nullptr,
▲ Show 20 Lines • Show All 1,385 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/LLVMBuild.txt

This file was copied from llvm/lib/LLVMBuild.txt.

	;===- ./lib/LLVMBuild.txt --------------------------------------- Conf ---===;			;===- ./lib/ObjCopy/LLVMBuild.txt ------------------------------ Conf ---===;
	;			;
	; Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			; Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	; See https://llvm.org/LICENSE.txt for license information.			; See https://llvm.org/LICENSE.txt for license information.
	; SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			; SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	;			;
	;===------------------------------------------------------------------------===;			;===------------------------------------------------------------------------===;
	;			;
	; This is an LLVMBuild description file for the components in this subdirectory.			; This is an LLVMBuild description file for the components in this subdirectory.
	;			;
	; For more information on the LLVMBuild system, please see:			; For more information on the LLVMBuild system, please see:
	;			;
	; http://llvm.org/docs/LLVMBuild.html			; http://llvm.org/docs/LLVMBuild.html
	;			;
	;===------------------------------------------------------------------------===;			;===------------------------------------------------------------------------===;

	[common]
	subdirectories =
	Analysis
	AsmParser
	Bitcode
	Bitstream
	CodeGen
	DebugInfo
	Demangle
	DWARFLinker
	ExecutionEngine
	Extensions
	Frontend
	FuzzMutate
	LineEditor
	Linker
	InterfaceStub
	IR
	IRReader
	LTO
	MC
	MCA
	Object
	BinaryFormat
	ObjectYAML
	Option
	Remarks
	Passes
	ProfileData
	Support
	TableGen
	TextAPI
	Target
	Testing
	ToolDrivers
	Transforms
	WindowsManifest
	XRay

	[component_0]			[component_0]
	type = Group			type = Library
	name = Libraries			name = ObjCopy
	parent = $ROOT			parent = Libraries
				required_libraries = Object Support MC

llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.h

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.h.

	//===- MachOLayoutBuilder.h -------------------------------------- C++ --===//			//===- MachOLayoutBuilder.h -------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_OBJCOPY_MACHO_MACHOLAYOUTBUILDER_H			#ifndef LLVM_OBJCOPY_MACHO_MACHOLAYOUTBUILDER_H
	#define LLVM_OBJCOPY_MACHO_MACHOLAYOUTBUILDER_H			#define LLVM_OBJCOPY_MACHO_MACHOLAYOUTBUILDER_H

	#include "MachOObjcopy.h"
	#include "Object.h"			#include "Object.h"
				#include "llvm/ObjCopy/MachO/MachOObjcopy.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	class MachOLayoutBuilder {			class MachOLayoutBuilder {
	Object &O;			Object &O;
	bool Is64Bit;			bool Is64Bit;
	Show All 29 Lines

llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.cpp

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.cpp.

	//===- MachOLayoutBuilder.cpp ------------------------------------ C++ --===//			//===- MachOLayoutBuilder.cpp ------------------------------------ C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "MachOLayoutBuilder.h"			#include "MachOLayoutBuilder.h"
	#include "llvm/Support/Alignment.h"			#include "llvm/Support/Alignment.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	uint32_t MachOLayoutBuilder::computeSizeOfCmds() const {			uint32_t MachOLayoutBuilder::computeSizeOfCmds() const {
	uint32_t Size = 0;			uint32_t Size = 0;
	for (const LoadCommand &LC : O.LoadCommands) {			for (const LoadCommand &LC : O.LoadCommands) {
	const MachO::macho_load_command &MLC = LC.MachOLoadCommand;			const MachO::macho_load_command &MLC = LC.MachOLoadCommand;
	auto cmd = MLC.load_command_data.cmd;			auto cmd = MLC.load_command_data.cmd;
	switch (cmd) {			switch (cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	Size += sizeof(MachO::segment_command) +			Size += sizeof(MachO::segment_command) +
	sizeof(MachO::section) * LC.Sections.size();			sizeof(MachO::section) * LC.Sections.size();
	continue;			continue;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	Size += sizeof(MachO::segment_command_64) +			Size += sizeof(MachO::segment_command_64) +
	sizeof(MachO::section_64) * LC.Sections.size();			sizeof(MachO::section_64) * LC.Sections.size();
	continue;			continue;
	}			}

	switch (cmd) {			switch (cmd) {
	#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \			#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \
	case MachO::LCName: \			case MachO::LCName: \
	Size += sizeof(MachO::LCStruct) + LC.Payload.size(); \			Size += sizeof(MachO::LCStruct) + LC.Payload.size(); \
	break;			break;
	#include "llvm/BinaryFormat/MachO.def"			#include "llvm/BinaryFormat/MachO.def"
	#undef HANDLE_LOAD_COMMAND			#undef HANDLE_LOAD_COMMAND
	}			}
	}			}

	return Size;			return Size;
	}			}

	void MachOLayoutBuilder::constructStringTable() {			void MachOLayoutBuilder::constructStringTable() {
	for (std::unique_ptr<SymbolEntry> &Sym : O.SymTable.Symbols)			for (std::unique_ptr<SymbolEntry> &Sym : O.SymTable.Symbols)
	StrTableBuilder.add(Sym->Name);			StrTableBuilder.add(Sym->Name);
	StrTableBuilder.finalize();			StrTableBuilder.finalize();
	}			}

	void MachOLayoutBuilder::updateSymbolIndexes() {			void MachOLayoutBuilder::updateSymbolIndexes() {
	uint32_t Index = 0;			uint32_t Index = 0;
	for (auto &Symbol : O.SymTable.Symbols)			for (auto &Symbol : O.SymTable.Symbols)
	Symbol->Index = Index++;			Symbol->Index = Index++;
	}			}

	// Updates the index and the number of local/external/undefined symbols.			// Updates the index and the number of local/external/undefined symbols.
	void MachOLayoutBuilder::updateDySymTab(MachO::macho_load_command &MLC) {			void MachOLayoutBuilder::updateDySymTab(MachO::macho_load_command &MLC) {
	assert(MLC.load_command_data.cmd == MachO::LC_DYSYMTAB);			assert(MLC.load_command_data.cmd == MachO::LC_DYSYMTAB);
	// Make sure that nlist entries in the symbol table are sorted by the those			// Make sure that nlist entries in the symbol table are sorted by the those
	// types. The order is: local < defined external < undefined external.			// types. The order is: local < defined external < undefined external.
	assert(llvm::is_sorted(O.SymTable.Symbols,			assert(llvm::is_sorted(O.SymTable.Symbols,
	[](const std::unique_ptr<SymbolEntry> &A,			[](const std::unique_ptr<SymbolEntry> &A,
	const std::unique_ptr<SymbolEntry> &B) {			const std::unique_ptr<SymbolEntry> &B) {
	bool AL = A->isLocalSymbol(),			bool AL = A->isLocalSymbol(),
	BL = B->isLocalSymbol();			BL = B->isLocalSymbol();
	if (AL != BL)			if (AL != BL)
	return AL;			return AL;
	return !AL && !A->isUndefinedSymbol() &&			return !AL && !A->isUndefinedSymbol() &&
	B->isUndefinedSymbol();			B->isUndefinedSymbol();
	}) &&			}) &&
	"Symbols are not sorted by their types.");			"Symbols are not sorted by their types.");

	uint32_t NumLocalSymbols = 0;			uint32_t NumLocalSymbols = 0;
	auto Iter = O.SymTable.Symbols.begin();			auto Iter = O.SymTable.Symbols.begin();
	auto End = O.SymTable.Symbols.end();			auto End = O.SymTable.Symbols.end();
	for (; Iter != End; ++Iter) {			for (; Iter != End; ++Iter) {
	if ((*Iter)->isExternalSymbol())			if ((*Iter)->isExternalSymbol())
	break;			break;

	++NumLocalSymbols;			++NumLocalSymbols;
	}			}

	uint32_t NumExtDefSymbols = 0;			uint32_t NumExtDefSymbols = 0;
	for (; Iter != End; ++Iter) {			for (; Iter != End; ++Iter) {
	if ((*Iter)->isUndefinedSymbol())			if ((*Iter)->isUndefinedSymbol())
	break;			break;

	++NumExtDefSymbols;			++NumExtDefSymbols;
	}			}

	MLC.dysymtab_command_data.ilocalsym = 0;			MLC.dysymtab_command_data.ilocalsym = 0;
	MLC.dysymtab_command_data.nlocalsym = NumLocalSymbols;			MLC.dysymtab_command_data.nlocalsym = NumLocalSymbols;
	MLC.dysymtab_command_data.iextdefsym = NumLocalSymbols;			MLC.dysymtab_command_data.iextdefsym = NumLocalSymbols;
	MLC.dysymtab_command_data.nextdefsym = NumExtDefSymbols;			MLC.dysymtab_command_data.nextdefsym = NumExtDefSymbols;
	MLC.dysymtab_command_data.iundefsym = NumLocalSymbols + NumExtDefSymbols;			MLC.dysymtab_command_data.iundefsym = NumLocalSymbols + NumExtDefSymbols;
	MLC.dysymtab_command_data.nundefsym =			MLC.dysymtab_command_data.nundefsym =
	O.SymTable.Symbols.size() - (NumLocalSymbols + NumExtDefSymbols);			O.SymTable.Symbols.size() - (NumLocalSymbols + NumExtDefSymbols);
	}			}

	// Recomputes and updates offset and size fields in load commands and sections			// Recomputes and updates offset and size fields in load commands and sections
	// since they could be modified.			// since they could be modified.
	uint64_t MachOLayoutBuilder::layoutSegments() {			uint64_t MachOLayoutBuilder::layoutSegments() {
	auto HeaderSize =			auto HeaderSize =
	Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);			Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);
	const bool IsObjectFile =			const bool IsObjectFile =
	O.Header.FileType == MachO::HeaderFileType::MH_OBJECT;			O.Header.FileType == MachO::HeaderFileType::MH_OBJECT;
	uint64_t Offset = IsObjectFile ? (HeaderSize + O.Header.SizeOfCmds) : 0;			uint64_t Offset = IsObjectFile ? (HeaderSize + O.Header.SizeOfCmds) : 0;
	for (LoadCommand &LC : O.LoadCommands) {			for (LoadCommand &LC : O.LoadCommands) {
	auto &MLC = LC.MachOLoadCommand;			auto &MLC = LC.MachOLoadCommand;
	StringRef Segname;			StringRef Segname;
	uint64_t SegmentVmAddr;			uint64_t SegmentVmAddr;
	uint64_t SegmentVmSize;			uint64_t SegmentVmSize;
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	SegmentVmAddr = MLC.segment_command_data.vmaddr;			SegmentVmAddr = MLC.segment_command_data.vmaddr;
	SegmentVmSize = MLC.segment_command_data.vmsize;			SegmentVmSize = MLC.segment_command_data.vmsize;
	Segname = StringRef(MLC.segment_command_data.segname,			Segname = StringRef(MLC.segment_command_data.segname,
	strnlen(MLC.segment_command_data.segname,			strnlen(MLC.segment_command_data.segname,
	sizeof(MLC.segment_command_data.segname)));			sizeof(MLC.segment_command_data.segname)));
	break;			break;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	SegmentVmAddr = MLC.segment_command_64_data.vmaddr;			SegmentVmAddr = MLC.segment_command_64_data.vmaddr;
	SegmentVmSize = MLC.segment_command_64_data.vmsize;			SegmentVmSize = MLC.segment_command_64_data.vmsize;
	Segname = StringRef(MLC.segment_command_64_data.segname,			Segname = StringRef(MLC.segment_command_64_data.segname,
	strnlen(MLC.segment_command_64_data.segname,			strnlen(MLC.segment_command_64_data.segname,
	sizeof(MLC.segment_command_64_data.segname)));			sizeof(MLC.segment_command_64_data.segname)));
	break;			break;
	default:			default:
	continue;			continue;
	}			}

	if (Segname == "__LINKEDIT") {			if (Segname == "__LINKEDIT") {
	// We update the __LINKEDIT segment later (in layoutTail).			// We update the __LINKEDIT segment later (in layoutTail).
	assert(LC.Sections.empty() && "__LINKEDIT segment has sections");			assert(LC.Sections.empty() && "__LINKEDIT segment has sections");
	LinkEditLoadCommand = &MLC;			LinkEditLoadCommand = &MLC;
	continue;			continue;
	}			}

	// Update file offsets and sizes of sections.			// Update file offsets and sizes of sections.
	uint64_t SegOffset = Offset;			uint64_t SegOffset = Offset;
	uint64_t SegFileSize = 0;			uint64_t SegFileSize = 0;
	uint64_t VMSize = 0;			uint64_t VMSize = 0;
	for (std::unique_ptr<Section> &Sec : LC.Sections) {			for (std::unique_ptr<Section> &Sec : LC.Sections) {
	assert(SegmentVmAddr <= Sec->Addr &&			assert(SegmentVmAddr <= Sec->Addr &&
	"Section's address cannot be smaller than Segment's one");			"Section's address cannot be smaller than Segment's one");
	uint32_t SectOffset = Sec->Addr - SegmentVmAddr;			uint32_t SectOffset = Sec->Addr - SegmentVmAddr;
	if (IsObjectFile) {			if (IsObjectFile) {
	if (Sec->isVirtualSection()) {			if (Sec->isVirtualSection()) {
	Sec->Offset = 0;			Sec->Offset = 0;
	} else {			} else {
	uint64_t PaddingSize =			uint64_t PaddingSize =
	offsetToAlignment(SegFileSize, Align(1ull << Sec->Align));			offsetToAlignment(SegFileSize, Align(1ull << Sec->Align));
	Sec->Offset = SegOffset + SegFileSize + PaddingSize;			Sec->Offset = SegOffset + SegFileSize + PaddingSize;
	Sec->Size = Sec->Content.size();			Sec->Size = Sec->Content.size();
	SegFileSize += PaddingSize + Sec->Size;			SegFileSize += PaddingSize + Sec->Size;
	}			}
	} else {			} else {
	if (Sec->isVirtualSection()) {			if (Sec->isVirtualSection()) {
	Sec->Offset = 0;			Sec->Offset = 0;
	} else {			} else {
	Sec->Offset = SegOffset + SectOffset;			Sec->Offset = SegOffset + SectOffset;
	Sec->Size = Sec->Content.size();			Sec->Size = Sec->Content.size();
	SegFileSize = std::max(SegFileSize, SectOffset + Sec->Size);			SegFileSize = std::max(SegFileSize, SectOffset + Sec->Size);
	}			}
	}			}
	VMSize = std::max(VMSize, SectOffset + Sec->Size);			VMSize = std::max(VMSize, SectOffset + Sec->Size);
	}			}

	if (IsObjectFile) {			if (IsObjectFile) {
	Offset += SegFileSize;			Offset += SegFileSize;
	} else {			} else {
	Offset = alignTo(Offset + SegFileSize, PageSize);			Offset = alignTo(Offset + SegFileSize, PageSize);
	SegFileSize = alignTo(SegFileSize, PageSize);			SegFileSize = alignTo(SegFileSize, PageSize);
	// Use the original vmsize if the segment is __PAGEZERO.			// Use the original vmsize if the segment is __PAGEZERO.
	VMSize =			VMSize =
	Segname == "__PAGEZERO" ? SegmentVmSize : alignTo(VMSize, PageSize);			Segname == "__PAGEZERO" ? SegmentVmSize : alignTo(VMSize, PageSize);
	}			}

	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	MLC.segment_command_data.cmdsize =			MLC.segment_command_data.cmdsize =
	sizeof(MachO::segment_command) +			sizeof(MachO::segment_command) +
	sizeof(MachO::section) * LC.Sections.size();			sizeof(MachO::section) * LC.Sections.size();
	MLC.segment_command_data.nsects = LC.Sections.size();			MLC.segment_command_data.nsects = LC.Sections.size();
	MLC.segment_command_data.fileoff = SegOffset;			MLC.segment_command_data.fileoff = SegOffset;
	MLC.segment_command_data.vmsize = VMSize;			MLC.segment_command_data.vmsize = VMSize;
	MLC.segment_command_data.filesize = SegFileSize;			MLC.segment_command_data.filesize = SegFileSize;
	break;			break;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	MLC.segment_command_64_data.cmdsize =			MLC.segment_command_64_data.cmdsize =
	sizeof(MachO::segment_command_64) +			sizeof(MachO::segment_command_64) +
	sizeof(MachO::section_64) * LC.Sections.size();			sizeof(MachO::section_64) * LC.Sections.size();
	MLC.segment_command_64_data.nsects = LC.Sections.size();			MLC.segment_command_64_data.nsects = LC.Sections.size();
	MLC.segment_command_64_data.fileoff = SegOffset;			MLC.segment_command_64_data.fileoff = SegOffset;
	MLC.segment_command_64_data.vmsize = VMSize;			MLC.segment_command_64_data.vmsize = VMSize;
	MLC.segment_command_64_data.filesize = SegFileSize;			MLC.segment_command_64_data.filesize = SegFileSize;
	break;			break;
	}			}
	}			}

	return Offset;			return Offset;
	}			}

	uint64_t MachOLayoutBuilder::layoutRelocations(uint64_t Offset) {			uint64_t MachOLayoutBuilder::layoutRelocations(uint64_t Offset) {
	for (LoadCommand &LC : O.LoadCommands)			for (LoadCommand &LC : O.LoadCommands)
	for (std::unique_ptr<Section> &Sec : LC.Sections) {			for (std::unique_ptr<Section> &Sec : LC.Sections) {
	Sec->RelOff = Sec->Relocations.empty() ? 0 : Offset;			Sec->RelOff = Sec->Relocations.empty() ? 0 : Offset;
	Sec->NReloc = Sec->Relocations.size();			Sec->NReloc = Sec->Relocations.size();
	Offset += sizeof(MachO::any_relocation_info) * Sec->NReloc;			Offset += sizeof(MachO::any_relocation_info) * Sec->NReloc;
	}			}

	return Offset;			return Offset;
	}			}

	Error MachOLayoutBuilder::layoutTail(uint64_t Offset) {			Error MachOLayoutBuilder::layoutTail(uint64_t Offset) {
	// If we are building the layout of an executable or dynamic library			// If we are building the layout of an executable or dynamic library
	// which does not have any segments other than __LINKEDIT,			// which does not have any segments other than __LINKEDIT,
	// the Offset can be equal to zero by this time. It happens because of the			// the Offset can be equal to zero by this time. It happens because of the
	// convention that in such cases the file offsets specified by LC_SEGMENT			// convention that in such cases the file offsets specified by LC_SEGMENT
	// start with zero (unlike the case of a relocatable object file).			// start with zero (unlike the case of a relocatable object file).
	const uint64_t HeaderSize =			const uint64_t HeaderSize =
	Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);			Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);
	assert((!(O.Header.FileType == MachO::HeaderFileType::MH_OBJECT) \|\|			assert((!(O.Header.FileType == MachO::HeaderFileType::MH_OBJECT) \|\|
	Offset >= HeaderSize + O.Header.SizeOfCmds) &&			Offset >= HeaderSize + O.Header.SizeOfCmds) &&
	"Incorrect tail offset");			"Incorrect tail offset");
	Offset = std::max(Offset, HeaderSize + O.Header.SizeOfCmds);			Offset = std::max(Offset, HeaderSize + O.Header.SizeOfCmds);

	// The order of LINKEDIT elements is as follows:			// The order of LINKEDIT elements is as follows:
	// rebase info, binding info, weak binding info, lazy binding info, export			// rebase info, binding info, weak binding info, lazy binding info, export
	// trie, data-in-code, symbol table, indirect symbol table, symbol table			// trie, data-in-code, symbol table, indirect symbol table, symbol table
	// strings, code signature.			// strings, code signature.
	uint64_t NListSize = Is64Bit ? sizeof(MachO::nlist_64) : sizeof(MachO::nlist);			uint64_t NListSize = Is64Bit ? sizeof(MachO::nlist_64) : sizeof(MachO::nlist);
	uint64_t StartOfLinkEdit = Offset;			uint64_t StartOfLinkEdit = Offset;
	uint64_t StartOfRebaseInfo = StartOfLinkEdit;			uint64_t StartOfRebaseInfo = StartOfLinkEdit;
	uint64_t StartOfBindingInfo = StartOfRebaseInfo + O.Rebases.Opcodes.size();			uint64_t StartOfBindingInfo = StartOfRebaseInfo + O.Rebases.Opcodes.size();
	uint64_t StartOfWeakBindingInfo = StartOfBindingInfo + O.Binds.Opcodes.size();			uint64_t StartOfWeakBindingInfo = StartOfBindingInfo + O.Binds.Opcodes.size();
	uint64_t StartOfLazyBindingInfo =			uint64_t StartOfLazyBindingInfo =
	StartOfWeakBindingInfo + O.WeakBinds.Opcodes.size();			StartOfWeakBindingInfo + O.WeakBinds.Opcodes.size();
	uint64_t StartOfExportTrie =			uint64_t StartOfExportTrie =
	StartOfLazyBindingInfo + O.LazyBinds.Opcodes.size();			StartOfLazyBindingInfo + O.LazyBinds.Opcodes.size();
	uint64_t StartOfFunctionStarts = StartOfExportTrie + O.Exports.Trie.size();			uint64_t StartOfFunctionStarts = StartOfExportTrie + O.Exports.Trie.size();
	uint64_t StartOfDataInCode =			uint64_t StartOfDataInCode =
	StartOfFunctionStarts + O.FunctionStarts.Data.size();			StartOfFunctionStarts + O.FunctionStarts.Data.size();
	uint64_t StartOfSymbols = StartOfDataInCode + O.DataInCode.Data.size();			uint64_t StartOfSymbols = StartOfDataInCode + O.DataInCode.Data.size();
	uint64_t StartOfIndirectSymbols =			uint64_t StartOfIndirectSymbols =
	StartOfSymbols + NListSize * O.SymTable.Symbols.size();			StartOfSymbols + NListSize * O.SymTable.Symbols.size();
	uint64_t StartOfSymbolStrings =			uint64_t StartOfSymbolStrings =
	StartOfIndirectSymbols +			StartOfIndirectSymbols +
	sizeof(uint32_t) * O.IndirectSymTable.Symbols.size();			sizeof(uint32_t) * O.IndirectSymTable.Symbols.size();
	uint64_t StartOfCodeSignature =			uint64_t StartOfCodeSignature =
	StartOfSymbolStrings + StrTableBuilder.getSize();			StartOfSymbolStrings + StrTableBuilder.getSize();
	uint64_t LinkEditSize =			uint64_t LinkEditSize =
	(StartOfCodeSignature + O.CodeSignature.Data.size()) - StartOfLinkEdit;			(StartOfCodeSignature + O.CodeSignature.Data.size()) - StartOfLinkEdit;

	// Now we have determined the layout of the contents of the __LINKEDIT			// Now we have determined the layout of the contents of the __LINKEDIT
	// segment. Update its load command.			// segment. Update its load command.
	if (LinkEditLoadCommand) {			if (LinkEditLoadCommand) {
	MachO::macho_load_command *MLC = LinkEditLoadCommand;			MachO::macho_load_command *MLC = LinkEditLoadCommand;
	switch (LinkEditLoadCommand->load_command_data.cmd) {			switch (LinkEditLoadCommand->load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	MLC->segment_command_data.cmdsize = sizeof(MachO::segment_command);			MLC->segment_command_data.cmdsize = sizeof(MachO::segment_command);
	MLC->segment_command_data.fileoff = StartOfLinkEdit;			MLC->segment_command_data.fileoff = StartOfLinkEdit;
	MLC->segment_command_data.vmsize = alignTo(LinkEditSize, PageSize);			MLC->segment_command_data.vmsize = alignTo(LinkEditSize, PageSize);
	MLC->segment_command_data.filesize = LinkEditSize;			MLC->segment_command_data.filesize = LinkEditSize;
	break;			break;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	MLC->segment_command_64_data.cmdsize = sizeof(MachO::segment_command_64);			MLC->segment_command_64_data.cmdsize = sizeof(MachO::segment_command_64);
	MLC->segment_command_64_data.fileoff = StartOfLinkEdit;			MLC->segment_command_64_data.fileoff = StartOfLinkEdit;
	MLC->segment_command_64_data.vmsize = alignTo(LinkEditSize, PageSize);			MLC->segment_command_64_data.vmsize = alignTo(LinkEditSize, PageSize);
	MLC->segment_command_64_data.filesize = LinkEditSize;			MLC->segment_command_64_data.filesize = LinkEditSize;
	break;			break;
	}			}
	}			}

	for (LoadCommand &LC : O.LoadCommands) {			for (LoadCommand &LC : O.LoadCommands) {
	auto &MLC = LC.MachOLoadCommand;			auto &MLC = LC.MachOLoadCommand;
	auto cmd = MLC.load_command_data.cmd;			auto cmd = MLC.load_command_data.cmd;
	switch (cmd) {			switch (cmd) {
	case MachO::LC_CODE_SIGNATURE:			case MachO::LC_CODE_SIGNATURE:
	MLC.linkedit_data_command_data.dataoff = StartOfCodeSignature;			MLC.linkedit_data_command_data.dataoff = StartOfCodeSignature;
	MLC.linkedit_data_command_data.datasize = O.CodeSignature.Data.size();			MLC.linkedit_data_command_data.datasize = O.CodeSignature.Data.size();
	break;			break;
	case MachO::LC_SYMTAB:			case MachO::LC_SYMTAB:
	MLC.symtab_command_data.symoff = StartOfSymbols;			MLC.symtab_command_data.symoff = StartOfSymbols;
	MLC.symtab_command_data.nsyms = O.SymTable.Symbols.size();			MLC.symtab_command_data.nsyms = O.SymTable.Symbols.size();
	MLC.symtab_command_data.stroff = StartOfSymbolStrings;			MLC.symtab_command_data.stroff = StartOfSymbolStrings;
	MLC.symtab_command_data.strsize = StrTableBuilder.getSize();			MLC.symtab_command_data.strsize = StrTableBuilder.getSize();
	break;			break;
	case MachO::LC_DYSYMTAB: {			case MachO::LC_DYSYMTAB: {
	if (MLC.dysymtab_command_data.ntoc != 0 \|\|			if (MLC.dysymtab_command_data.ntoc != 0 \|\|
	MLC.dysymtab_command_data.nmodtab != 0 \|\|			MLC.dysymtab_command_data.nmodtab != 0 \|\|
	MLC.dysymtab_command_data.nextrefsyms != 0 \|\|			MLC.dysymtab_command_data.nextrefsyms != 0 \|\|
	MLC.dysymtab_command_data.nlocrel != 0 \|\|			MLC.dysymtab_command_data.nlocrel != 0 \|\|
	MLC.dysymtab_command_data.nextrel != 0)			MLC.dysymtab_command_data.nextrel != 0)
	return createStringError(llvm::errc::not_supported,			return createStringError(llvm::errc::not_supported,
	"shared library is not yet supported");			"shared library is not yet supported");

	if (!O.IndirectSymTable.Symbols.empty()) {			if (!O.IndirectSymTable.Symbols.empty()) {
	MLC.dysymtab_command_data.indirectsymoff = StartOfIndirectSymbols;			MLC.dysymtab_command_data.indirectsymoff = StartOfIndirectSymbols;
	MLC.dysymtab_command_data.nindirectsyms =			MLC.dysymtab_command_data.nindirectsyms =
	O.IndirectSymTable.Symbols.size();			O.IndirectSymTable.Symbols.size();
	}			}

	updateDySymTab(MLC);			updateDySymTab(MLC);
	break;			break;
	}			}
	case MachO::LC_DATA_IN_CODE:			case MachO::LC_DATA_IN_CODE:
	MLC.linkedit_data_command_data.dataoff = StartOfDataInCode;			MLC.linkedit_data_command_data.dataoff = StartOfDataInCode;
	MLC.linkedit_data_command_data.datasize = O.DataInCode.Data.size();			MLC.linkedit_data_command_data.datasize = O.DataInCode.Data.size();
	break;			break;
	case MachO::LC_FUNCTION_STARTS:			case MachO::LC_FUNCTION_STARTS:
	MLC.linkedit_data_command_data.dataoff = StartOfFunctionStarts;			MLC.linkedit_data_command_data.dataoff = StartOfFunctionStarts;
	MLC.linkedit_data_command_data.datasize = O.FunctionStarts.Data.size();			MLC.linkedit_data_command_data.datasize = O.FunctionStarts.Data.size();
	break;			break;
	case MachO::LC_DYLD_INFO:			case MachO::LC_DYLD_INFO:
	case MachO::LC_DYLD_INFO_ONLY:			case MachO::LC_DYLD_INFO_ONLY:
	MLC.dyld_info_command_data.rebase_off =			MLC.dyld_info_command_data.rebase_off =
	O.Rebases.Opcodes.empty() ? 0 : StartOfRebaseInfo;			O.Rebases.Opcodes.empty() ? 0 : StartOfRebaseInfo;
	MLC.dyld_info_command_data.rebase_size = O.Rebases.Opcodes.size();			MLC.dyld_info_command_data.rebase_size = O.Rebases.Opcodes.size();
	MLC.dyld_info_command_data.bind_off =			MLC.dyld_info_command_data.bind_off =
	O.Binds.Opcodes.empty() ? 0 : StartOfBindingInfo;			O.Binds.Opcodes.empty() ? 0 : StartOfBindingInfo;
	MLC.dyld_info_command_data.bind_size = O.Binds.Opcodes.size();			MLC.dyld_info_command_data.bind_size = O.Binds.Opcodes.size();
	MLC.dyld_info_command_data.weak_bind_off =			MLC.dyld_info_command_data.weak_bind_off =
	O.WeakBinds.Opcodes.empty() ? 0 : StartOfWeakBindingInfo;			O.WeakBinds.Opcodes.empty() ? 0 : StartOfWeakBindingInfo;
	MLC.dyld_info_command_data.weak_bind_size = O.WeakBinds.Opcodes.size();			MLC.dyld_info_command_data.weak_bind_size = O.WeakBinds.Opcodes.size();
	MLC.dyld_info_command_data.lazy_bind_off =			MLC.dyld_info_command_data.lazy_bind_off =
	O.LazyBinds.Opcodes.empty() ? 0 : StartOfLazyBindingInfo;			O.LazyBinds.Opcodes.empty() ? 0 : StartOfLazyBindingInfo;
	MLC.dyld_info_command_data.lazy_bind_size = O.LazyBinds.Opcodes.size();			MLC.dyld_info_command_data.lazy_bind_size = O.LazyBinds.Opcodes.size();
	MLC.dyld_info_command_data.export_off =			MLC.dyld_info_command_data.export_off =
	O.Exports.Trie.empty() ? 0 : StartOfExportTrie;			O.Exports.Trie.empty() ? 0 : StartOfExportTrie;
	MLC.dyld_info_command_data.export_size = O.Exports.Trie.size();			MLC.dyld_info_command_data.export_size = O.Exports.Trie.size();
	break;			break;
	// Note that LC_ENCRYPTION_INFO.cryptoff despite its name and the comment in			// Note that LC_ENCRYPTION_INFO.cryptoff despite its name and the comment in
	// <mach-o/loader.h> is not an offset in the binary file, instead, it is a			// <mach-o/loader.h> is not an offset in the binary file, instead, it is a
	// relative virtual address. At the moment modification of the __TEXT			// relative virtual address. At the moment modification of the __TEXT
	// segment of executables isn't supported anyway (e.g. data in code entries			// segment of executables isn't supported anyway (e.g. data in code entries
	// are not recalculated). Moreover, in general			// are not recalculated). Moreover, in general
	// LC_ENCRYPT_INFO/LC_ENCRYPTION_INFO_64 are nontrivial to update because			// LC_ENCRYPT_INFO/LC_ENCRYPTION_INFO_64 are nontrivial to update because
	// without making additional assumptions (e.g. that the entire __TEXT			// without making additional assumptions (e.g. that the entire __TEXT
	// segment should be encrypted) we do not know how to recalculate the			// segment should be encrypted) we do not know how to recalculate the
	// boundaries of the encrypted part. For now just copy over these load			// boundaries of the encrypted part. For now just copy over these load
	// commands until we encounter a real world usecase where			// commands until we encounter a real world usecase where
	// LC_ENCRYPT_INFO/LC_ENCRYPTION_INFO_64 need to be adjusted.			// LC_ENCRYPT_INFO/LC_ENCRYPTION_INFO_64 need to be adjusted.
	case MachO::LC_ENCRYPTION_INFO:			case MachO::LC_ENCRYPTION_INFO:
	case MachO::LC_ENCRYPTION_INFO_64:			case MachO::LC_ENCRYPTION_INFO_64:
	case MachO::LC_LOAD_DYLINKER:			case MachO::LC_LOAD_DYLINKER:
	case MachO::LC_MAIN:			case MachO::LC_MAIN:
	case MachO::LC_RPATH:			case MachO::LC_RPATH:
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	case MachO::LC_VERSION_MIN_MACOSX:			case MachO::LC_VERSION_MIN_MACOSX:
	case MachO::LC_VERSION_MIN_IPHONEOS:			case MachO::LC_VERSION_MIN_IPHONEOS:
	case MachO::LC_VERSION_MIN_TVOS:			case MachO::LC_VERSION_MIN_TVOS:
	case MachO::LC_VERSION_MIN_WATCHOS:			case MachO::LC_VERSION_MIN_WATCHOS:
	case MachO::LC_BUILD_VERSION:			case MachO::LC_BUILD_VERSION:
	case MachO::LC_ID_DYLIB:			case MachO::LC_ID_DYLIB:
	case MachO::LC_LOAD_DYLIB:			case MachO::LC_LOAD_DYLIB:
	case MachO::LC_LOAD_WEAK_DYLIB:			case MachO::LC_LOAD_WEAK_DYLIB:
	case MachO::LC_UUID:			case MachO::LC_UUID:
	case MachO::LC_SOURCE_VERSION:			case MachO::LC_SOURCE_VERSION:
	// Nothing to update.			// Nothing to update.
	break;			break;
	default:			default:
	// Abort if it's unsupported in order to prevent corrupting the object.			// Abort if it's unsupported in order to prevent corrupting the object.
	return createStringError(llvm::errc::not_supported,			return createStringError(llvm::errc::not_supported,
	"unsupported load command (cmd=0x%x)", cmd);			"unsupported load command (cmd=0x%x)", cmd);
	}			}
	}			}

	return Error::success();			return Error::success();
	}			}

	Error MachOLayoutBuilder::layout() {			Error MachOLayoutBuilder::layout() {
	O.Header.NCmds = O.LoadCommands.size();			O.Header.NCmds = O.LoadCommands.size();
	O.Header.SizeOfCmds = computeSizeOfCmds();			O.Header.SizeOfCmds = computeSizeOfCmds();
	constructStringTable();			constructStringTable();
	updateSymbolIndexes();			updateSymbolIndexes();
	uint64_t Offset = layoutSegments();			uint64_t Offset = layoutSegments();
	Offset = layoutRelocations(Offset);			Offset = layoutRelocations(Offset);
	return layoutTail(Offset);			return layoutTail(Offset);
	}			}

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/MachO/MachOObjcopy.cpp

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOObjcopy.cpp.

//===- MachOObjcopy.cpp ------------------------------------------ C++ --===//		//===- MachOObjcopy.cpp ------------------------------------------ C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "MachOObjcopy.h"		#include "llvm/ObjCopy/MachO/MachOObjcopy.h"
#include "../CopyConfig.h"
#include "../llvm-objcopy.h"
#include "MachOReader.h"		#include "MachOReader.h"
#include "MachOWriter.h"		#include "MachOWriter.h"
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code +#include "Util.h" Lint: Pre-merge checks: clang-format: please reformat the code ``` +#include "Util.h" ```
#include "llvm/ADT/DenseSet.h"		#include "llvm/ADT/DenseSet.h"
		#include "llvm/ObjCopy/CopyConfig.h"
		#include "llvm/ObjCopy/ObjCopy.h"
#include "llvm/Object/ArchiveWriter.h"		#include "llvm/Object/ArchiveWriter.h"
#include "llvm/Object/MachOUniversal.h"		#include "llvm/Object/MachOUniversal.h"
#include "llvm/Object/MachOUniversalWriter.h"		#include "llvm/Object/MachOUniversalWriter.h"
#include "llvm/Support/Errc.h"		#include "llvm/Support/Errc.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
		#include "Util.h"
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -#include "Util.h" Lint: Pre-merge checks: clang-format: please reformat the code ``` -#include "Util.h" ```

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {
namespace macho {		namespace macho {

using namespace object;		using namespace object;
using SectionPred = std::function<bool(const std::unique_ptr<Section> &Sec)>;		using SectionPred = std::function<bool(const std::unique_ptr<Section> &Sec)>;
using LoadCommandPred = std::function<bool(const LoadCommand &LC)>;		using LoadCommandPred = std::function<bool(const LoadCommand &LC)>;
▲ Show 20 Lines • Show All 404 Lines • ▼ Show 20 Lines	if (!ObjOrErr) {
return createStringError(std::errc::invalid_argument,		return createStringError(std::errc::invalid_argument,
"slice for '%s' of the universal Mach-O binary "		"slice for '%s' of the universal Mach-O binary "
"'%s' is not a Mach-O object or an archive",		"'%s' is not a Mach-O object or an archive",
O.getArchFlagName().c_str(),		O.getArchFlagName().c_str(),
Config.InputFilename.str().c_str());		Config.InputFilename.str().c_str());
}		}
std::string ArchFlagName = O.getArchFlagName();		std::string ArchFlagName = O.getArchFlagName();
MemBuffer MB(ArchFlagName);		MemBuffer MB(ArchFlagName);
if (Error E = executeObjcopyOnBinary(Config, **ObjOrErr, MB))		if (Error E = objcopy::executeObjcopyOnBinary(Config, **ObjOrErr, MB))
return E;		return E;
std::unique_ptr<WritableMemoryBuffer> OutputBuffer =		std::unique_ptr<WritableMemoryBuffer> OutputBuffer =
MB.releaseMemoryBuffer();		MB.releaseMemoryBuffer();
Expected<std::unique_ptr<Binary>> BinaryOrErr =		Expected<std::unique_ptr<Binary>> BinaryOrErr =
object::createBinary(*OutputBuffer);		object::createBinary(*OutputBuffer);
if (!BinaryOrErr)		if (!BinaryOrErr)
return BinaryOrErr.takeError();		return BinaryOrErr.takeError();
Binaries.emplace_back(std::move(*BinaryOrErr), std::move(OutputBuffer));		Binaries.emplace_back(std::move(*BinaryOrErr), std::move(OutputBuffer));
Show All 16 Lines

llvm/lib/ObjCopy/MachO/MachOReader.h

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOReader.h.

//===- MachOReader.h --------------------------------------------- C++ --===//		//===- MachOReader.h --------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "MachOObjcopy.h"		#ifndef LLVM_LIB_OBJCOPY_MACHO_MACHOREADER_H
		#define LLVM_LIB_OBJCOPY_MACHO_MACHOREADER_H

#include "Object.h"		#include "Object.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
		#include "llvm/ObjCopy/MachO/MachOObjcopy.h"
#include "llvm/Object/MachO.h"		#include "llvm/Object/MachO.h"
#include <memory>		#include <memory>

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {
namespace macho {		namespace macho {

// The hierarchy of readers is responsible for parsing different inputs:		// The hierarchy of readers is responsible for parsing different inputs:
Show All 27 Lines	public:
explicit MachOReader(const object::MachOObjectFile &Obj) : MachOObj(Obj) {}		explicit MachOReader(const object::MachOObjectFile &Obj) : MachOObj(Obj) {}

Expected<std::unique_ptr<Object>> create() const override;		Expected<std::unique_ptr<Object>> create() const override;
};		};

} // end namespace macho		} // end namespace macho
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

		#endif // LLVM_LIB_OBJCOPY_MACHO_MACHOREADER_H

llvm/lib/ObjCopy/MachO/MachOReader.cpp

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOReader.cpp.

	//===- MachOReader.cpp ------------------------------------------- C++ --===//			//===- MachOReader.cpp ------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "MachOReader.h"			#include "MachOReader.h"
	#include "Object.h"			#include "Object.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"
	#include "llvm/Object/MachO.h"			#include "llvm/Object/MachO.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	#include <memory>			#include <memory>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	void MachOReader::readHeader(Object &O) const {			void MachOReader::readHeader(Object &O) const {
	O.Header.Magic = MachOObj.getHeader().magic;			O.Header.Magic = MachOObj.getHeader().magic;
	O.Header.CPUType = MachOObj.getHeader().cputype;			O.Header.CPUType = MachOObj.getHeader().cputype;
	O.Header.CPUSubType = MachOObj.getHeader().cpusubtype;			O.Header.CPUSubType = MachOObj.getHeader().cpusubtype;
	O.Header.FileType = MachOObj.getHeader().filetype;			O.Header.FileType = MachOObj.getHeader().filetype;
	O.Header.NCmds = MachOObj.getHeader().ncmds;			O.Header.NCmds = MachOObj.getHeader().ncmds;
	O.Header.SizeOfCmds = MachOObj.getHeader().sizeofcmds;			O.Header.SizeOfCmds = MachOObj.getHeader().sizeofcmds;
	O.Header.Flags = MachOObj.getHeader().flags;			O.Header.Flags = MachOObj.getHeader().flags;
	}			}

	template <typename SectionType>			template <typename SectionType>
	Section constructSectionCommon(SectionType Sec, uint32_t Index) {			Section constructSectionCommon(SectionType Sec, uint32_t Index) {
	StringRef SegName(Sec.segname, strnlen(Sec.segname, sizeof(Sec.segname)));			StringRef SegName(Sec.segname, strnlen(Sec.segname, sizeof(Sec.segname)));
	StringRef SectName(Sec.sectname, strnlen(Sec.sectname, sizeof(Sec.sectname)));			StringRef SectName(Sec.sectname, strnlen(Sec.sectname, sizeof(Sec.sectname)));
	Section S(SegName, SectName);			Section S(SegName, SectName);
	S.Index = Index;			S.Index = Index;
	S.Addr = Sec.addr;			S.Addr = Sec.addr;
	S.Size = Sec.size;			S.Size = Sec.size;
	S.Offset = Sec.offset;			S.Offset = Sec.offset;
	S.Align = Sec.align;			S.Align = Sec.align;
	S.RelOff = Sec.reloff;			S.RelOff = Sec.reloff;
	S.NReloc = Sec.nreloc;			S.NReloc = Sec.nreloc;
	S.Flags = Sec.flags;			S.Flags = Sec.flags;
	S.Reserved1 = Sec.reserved1;			S.Reserved1 = Sec.reserved1;
	S.Reserved2 = Sec.reserved2;			S.Reserved2 = Sec.reserved2;
	S.Reserved3 = 0;			S.Reserved3 = 0;
	return S;			return S;
	}			}

	template <typename SectionType>			template <typename SectionType>
	Section constructSection(SectionType Sec, uint32_t Index);			Section constructSection(SectionType Sec, uint32_t Index);

	template <> Section constructSection(MachO::section Sec, uint32_t Index) {			template <> Section constructSection(MachO::section Sec, uint32_t Index) {
	return constructSectionCommon(Sec, Index);			return constructSectionCommon(Sec, Index);
	}			}

	template <> Section constructSection(MachO::section_64 Sec, uint32_t Index) {			template <> Section constructSection(MachO::section_64 Sec, uint32_t Index) {
	Section S = constructSectionCommon(Sec, Index);			Section S = constructSectionCommon(Sec, Index);
	S.Reserved3 = Sec.reserved3;			S.Reserved3 = Sec.reserved3;
	return S;			return S;
	}			}

	template <typename SectionType, typename SegmentType>			template <typename SectionType, typename SegmentType>
	Expected<std::vector<std::unique_ptr<Section>>>			Expected<std::vector<std::unique_ptr<Section>>>
	extractSections(const object::MachOObjectFile::LoadCommandInfo &LoadCmd,			extractSections(const object::MachOObjectFile::LoadCommandInfo &LoadCmd,
	const object::MachOObjectFile &MachOObj,			const object::MachOObjectFile &MachOObj,
	uint32_t &NextSectionIndex) {			uint32_t &NextSectionIndex) {
	auto End = LoadCmd.Ptr + LoadCmd.C.cmdsize;			auto End = LoadCmd.Ptr + LoadCmd.C.cmdsize;
	const SectionType *Curr =			const SectionType *Curr =
	reinterpret_cast<const SectionType *>(LoadCmd.Ptr + sizeof(SegmentType));			reinterpret_cast<const SectionType *>(LoadCmd.Ptr + sizeof(SegmentType));
	std::vector<std::unique_ptr<Section>> Sections;			std::vector<std::unique_ptr<Section>> Sections;
	for (; reinterpret_cast<const void *>(Curr) < End; Curr++) {			for (; reinterpret_cast<const void *>(Curr) < End; Curr++) {
	if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) {			if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) {
	SectionType Sec;			SectionType Sec;
	memcpy((void *)&Sec, Curr, sizeof(SectionType));			memcpy((void *)&Sec, Curr, sizeof(SectionType));
	MachO::swapStruct(Sec);			MachO::swapStruct(Sec);
	Sections.push_back(			Sections.push_back(
	std::make_unique<Section>(constructSection(Sec, NextSectionIndex)));			std::make_unique<Section>(constructSection(Sec, NextSectionIndex)));
	} else {			} else {
	Sections.push_back(			Sections.push_back(
	std::make_unique<Section>(constructSection(*Curr, NextSectionIndex)));			std::make_unique<Section>(constructSection(*Curr, NextSectionIndex)));
	}			}

	Section &S = *Sections.back();			Section &S = *Sections.back();

	Expected<object::SectionRef> SecRef =			Expected<object::SectionRef> SecRef =
	MachOObj.getSection(NextSectionIndex++);			MachOObj.getSection(NextSectionIndex++);
	if (!SecRef)			if (!SecRef)
	return SecRef.takeError();			return SecRef.takeError();

	Expected<ArrayRef<uint8_t>> Data =			Expected<ArrayRef<uint8_t>> Data =
	MachOObj.getSectionContents(SecRef->getRawDataRefImpl());			MachOObj.getSectionContents(SecRef->getRawDataRefImpl());
	if (!Data)			if (!Data)
	return Data.takeError();			return Data.takeError();

	S.Content =			S.Content =
	StringRef(reinterpret_cast<const char *>(Data->data()), Data->size());			StringRef(reinterpret_cast<const char *>(Data->data()), Data->size());

	S.Relocations.reserve(S.NReloc);			S.Relocations.reserve(S.NReloc);
	for (auto RI = MachOObj.section_rel_begin(SecRef->getRawDataRefImpl()),			for (auto RI = MachOObj.section_rel_begin(SecRef->getRawDataRefImpl()),
	RE = MachOObj.section_rel_end(SecRef->getRawDataRefImpl());			RE = MachOObj.section_rel_end(SecRef->getRawDataRefImpl());
	RI != RE; ++RI) {			RI != RE; ++RI) {
	RelocationInfo R;			RelocationInfo R;
	R.Symbol = nullptr; // We'll fill this field later.			R.Symbol = nullptr; // We'll fill this field later.
	R.Info = MachOObj.getRelocation(RI->getRawDataRefImpl());			R.Info = MachOObj.getRelocation(RI->getRawDataRefImpl());
	R.Scattered = MachOObj.isRelocationScattered(R.Info);			R.Scattered = MachOObj.isRelocationScattered(R.Info);
	R.Extern = !R.Scattered && MachOObj.getPlainRelocationExternal(R.Info);			R.Extern = !R.Scattered && MachOObj.getPlainRelocationExternal(R.Info);
	S.Relocations.push_back(R);			S.Relocations.push_back(R);
	}			}

	assert(S.NReloc == S.Relocations.size() &&			assert(S.NReloc == S.Relocations.size() &&
	"Incorrect number of relocations");			"Incorrect number of relocations");
	}			}
	return std::move(Sections);			return std::move(Sections);
	}			}

	Error MachOReader::readLoadCommands(Object &O) const {			Error MachOReader::readLoadCommands(Object &O) const {
	// For MachO sections indices start from 1.			// For MachO sections indices start from 1.
	uint32_t NextSectionIndex = 1;			uint32_t NextSectionIndex = 1;
	for (auto LoadCmd : MachOObj.load_commands()) {			for (auto LoadCmd : MachOObj.load_commands()) {
	LoadCommand LC;			LoadCommand LC;
	switch (LoadCmd.C.cmd) {			switch (LoadCmd.C.cmd) {
	case MachO::LC_CODE_SIGNATURE:			case MachO::LC_CODE_SIGNATURE:
	O.CodeSignatureCommandIndex = O.LoadCommands.size();			O.CodeSignatureCommandIndex = O.LoadCommands.size();
	break;			break;
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	if (Expected<std::vector<std::unique_ptr<Section>>> Sections =			if (Expected<std::vector<std::unique_ptr<Section>>> Sections =
	extractSections<MachO::section, MachO::segment_command>(			extractSections<MachO::section, MachO::segment_command>(
	LoadCmd, MachOObj, NextSectionIndex))			LoadCmd, MachOObj, NextSectionIndex))
	LC.Sections = std::move(*Sections);			LC.Sections = std::move(*Sections);
	else			else
	return Sections.takeError();			return Sections.takeError();
	break;			break;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	if (Expected<std::vector<std::unique_ptr<Section>>> Sections =			if (Expected<std::vector<std::unique_ptr<Section>>> Sections =
	extractSections<MachO::section_64, MachO::segment_command_64>(			extractSections<MachO::section_64, MachO::segment_command_64>(
	LoadCmd, MachOObj, NextSectionIndex))			LoadCmd, MachOObj, NextSectionIndex))
	LC.Sections = std::move(*Sections);			LC.Sections = std::move(*Sections);
	else			else
	return Sections.takeError();			return Sections.takeError();
	break;			break;
	case MachO::LC_SYMTAB:			case MachO::LC_SYMTAB:
	O.SymTabCommandIndex = O.LoadCommands.size();			O.SymTabCommandIndex = O.LoadCommands.size();
	break;			break;
	case MachO::LC_DYSYMTAB:			case MachO::LC_DYSYMTAB:
	O.DySymTabCommandIndex = O.LoadCommands.size();			O.DySymTabCommandIndex = O.LoadCommands.size();
	break;			break;
	case MachO::LC_DYLD_INFO:			case MachO::LC_DYLD_INFO:
	case MachO::LC_DYLD_INFO_ONLY:			case MachO::LC_DYLD_INFO_ONLY:
	O.DyLdInfoCommandIndex = O.LoadCommands.size();			O.DyLdInfoCommandIndex = O.LoadCommands.size();
	break;			break;
	case MachO::LC_DATA_IN_CODE:			case MachO::LC_DATA_IN_CODE:
	O.DataInCodeCommandIndex = O.LoadCommands.size();			O.DataInCodeCommandIndex = O.LoadCommands.size();
	break;			break;
	case MachO::LC_FUNCTION_STARTS:			case MachO::LC_FUNCTION_STARTS:
	O.FunctionStartsCommandIndex = O.LoadCommands.size();			O.FunctionStartsCommandIndex = O.LoadCommands.size();
	break;			break;
	}			}
	#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \			#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \
	case MachO::LCName: \			case MachO::LCName: \
	memcpy((void *)&(LC.MachOLoadCommand.LCStruct##_data), LoadCmd.Ptr, \			memcpy((void *)&(LC.MachOLoadCommand.LCStruct##_data), LoadCmd.Ptr, \
	sizeof(MachO::LCStruct)); \			sizeof(MachO::LCStruct)); \
	if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) \			if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) \
	MachO::swapStruct(LC.MachOLoadCommand.LCStruct##_data); \			MachO::swapStruct(LC.MachOLoadCommand.LCStruct##_data); \
	if (LoadCmd.C.cmdsize > sizeof(MachO::LCStruct)) \			if (LoadCmd.C.cmdsize > sizeof(MachO::LCStruct)) \
	LC.Payload = ArrayRef<uint8_t>( \			LC.Payload = ArrayRef<uint8_t>( \
	reinterpret_cast<uint8_t >(const_cast<char >(LoadCmd.Ptr)) + \			reinterpret_cast<uint8_t >(const_cast<char >(LoadCmd.Ptr)) + \
	sizeof(MachO::LCStruct), \			sizeof(MachO::LCStruct), \
	LoadCmd.C.cmdsize - sizeof(MachO::LCStruct)); \			LoadCmd.C.cmdsize - sizeof(MachO::LCStruct)); \
	break;			break;

	switch (LoadCmd.C.cmd) {			switch (LoadCmd.C.cmd) {
	default:			default:
	memcpy((void *)&(LC.MachOLoadCommand.load_command_data), LoadCmd.Ptr,			memcpy((void *)&(LC.MachOLoadCommand.load_command_data), LoadCmd.Ptr,
	sizeof(MachO::load_command));			sizeof(MachO::load_command));
	if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost)			if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost)
	MachO::swapStruct(LC.MachOLoadCommand.load_command_data);			MachO::swapStruct(LC.MachOLoadCommand.load_command_data);
	if (LoadCmd.C.cmdsize > sizeof(MachO::load_command))			if (LoadCmd.C.cmdsize > sizeof(MachO::load_command))
	LC.Payload = ArrayRef<uint8_t>(			LC.Payload = ArrayRef<uint8_t>(
	reinterpret_cast<uint8_t >(const_cast<char >(LoadCmd.Ptr)) +			reinterpret_cast<uint8_t >(const_cast<char >(LoadCmd.Ptr)) +
	sizeof(MachO::load_command),			sizeof(MachO::load_command),
	LoadCmd.C.cmdsize - sizeof(MachO::load_command));			LoadCmd.C.cmdsize - sizeof(MachO::load_command));
	break;			break;
	#include "llvm/BinaryFormat/MachO.def"			#include "llvm/BinaryFormat/MachO.def"
	}			}
	O.LoadCommands.push_back(std::move(LC));			O.LoadCommands.push_back(std::move(LC));
	}			}
	return Error::success();			return Error::success();
	}			}

	template <typename nlist_t>			template <typename nlist_t>
	SymbolEntry constructSymbolEntry(StringRef StrTable, const nlist_t &nlist) {			SymbolEntry constructSymbolEntry(StringRef StrTable, const nlist_t &nlist) {
	assert(nlist.n_strx < StrTable.size() &&			assert(nlist.n_strx < StrTable.size() &&
	"n_strx exceeds the size of the string table");			"n_strx exceeds the size of the string table");
	SymbolEntry SE;			SymbolEntry SE;
	SE.Name = StringRef(StrTable.data() + nlist.n_strx).str();			SE.Name = StringRef(StrTable.data() + nlist.n_strx).str();
	SE.n_type = nlist.n_type;			SE.n_type = nlist.n_type;
	SE.n_sect = nlist.n_sect;			SE.n_sect = nlist.n_sect;
	SE.n_desc = nlist.n_desc;			SE.n_desc = nlist.n_desc;
	SE.n_value = nlist.n_value;			SE.n_value = nlist.n_value;
	return SE;			return SE;
	}			}

	void MachOReader::readSymbolTable(Object &O) const {			void MachOReader::readSymbolTable(Object &O) const {
	StringRef StrTable = MachOObj.getStringTableData();			StringRef StrTable = MachOObj.getStringTableData();
	for (auto Symbol : MachOObj.symbols()) {			for (auto Symbol : MachOObj.symbols()) {
	SymbolEntry SE =			SymbolEntry SE =
	(MachOObj.is64Bit()			(MachOObj.is64Bit()
	? constructSymbolEntry(StrTable, MachOObj.getSymbol64TableEntry(			? constructSymbolEntry(StrTable, MachOObj.getSymbol64TableEntry(
	Symbol.getRawDataRefImpl()))			Symbol.getRawDataRefImpl()))
	: constructSymbolEntry(StrTable, MachOObj.getSymbolTableEntry(			: constructSymbolEntry(StrTable, MachOObj.getSymbolTableEntry(
	Symbol.getRawDataRefImpl())));			Symbol.getRawDataRefImpl())));

	O.SymTable.Symbols.push_back(std::make_unique<SymbolEntry>(SE));			O.SymTable.Symbols.push_back(std::make_unique<SymbolEntry>(SE));
	}			}
	}			}

	void MachOReader::setSymbolInRelocationInfo(Object &O) const {			void MachOReader::setSymbolInRelocationInfo(Object &O) const {
	std::vector<const Section *> Sections;			std::vector<const Section *> Sections;
	for (auto &LC : O.LoadCommands)			for (auto &LC : O.LoadCommands)
	for (std::unique_ptr<Section> &Sec : LC.Sections)			for (std::unique_ptr<Section> &Sec : LC.Sections)
	Sections.push_back(Sec.get());			Sections.push_back(Sec.get());

	for (LoadCommand &LC : O.LoadCommands)			for (LoadCommand &LC : O.LoadCommands)
	for (std::unique_ptr<Section> &Sec : LC.Sections)			for (std::unique_ptr<Section> &Sec : LC.Sections)
	for (auto &Reloc : Sec->Relocations)			for (auto &Reloc : Sec->Relocations)
	if (!Reloc.Scattered) {			if (!Reloc.Scattered) {
	const uint32_t SymbolNum =			const uint32_t SymbolNum =
	Reloc.getPlainRelocationSymbolNum(MachOObj.isLittleEndian());			Reloc.getPlainRelocationSymbolNum(MachOObj.isLittleEndian());
	if (Reloc.Extern) {			if (Reloc.Extern) {
	Reloc.Symbol = O.SymTable.getSymbolByIndex(SymbolNum);			Reloc.Symbol = O.SymTable.getSymbolByIndex(SymbolNum);
	} else {			} else {
	// FIXME: Refactor error handling in MachOReader and report an error			// FIXME: Refactor error handling in MachOReader and report an error
	// if we encounter an invalid relocation.			// if we encounter an invalid relocation.
	assert(SymbolNum >= 1 && SymbolNum <= Sections.size() &&			assert(SymbolNum >= 1 && SymbolNum <= Sections.size() &&
	"Invalid section index.");			"Invalid section index.");
	Reloc.Sec = Sections[SymbolNum - 1];			Reloc.Sec = Sections[SymbolNum - 1];
	}			}
	}			}
	}			}

	void MachOReader::readRebaseInfo(Object &O) const {			void MachOReader::readRebaseInfo(Object &O) const {
	O.Rebases.Opcodes = MachOObj.getDyldInfoRebaseOpcodes();			O.Rebases.Opcodes = MachOObj.getDyldInfoRebaseOpcodes();
	}			}

	void MachOReader::readBindInfo(Object &O) const {			void MachOReader::readBindInfo(Object &O) const {
	O.Binds.Opcodes = MachOObj.getDyldInfoBindOpcodes();			O.Binds.Opcodes = MachOObj.getDyldInfoBindOpcodes();
	}			}

	void MachOReader::readWeakBindInfo(Object &O) const {			void MachOReader::readWeakBindInfo(Object &O) const {
	O.WeakBinds.Opcodes = MachOObj.getDyldInfoWeakBindOpcodes();			O.WeakBinds.Opcodes = MachOObj.getDyldInfoWeakBindOpcodes();
	}			}

	void MachOReader::readLazyBindInfo(Object &O) const {			void MachOReader::readLazyBindInfo(Object &O) const {
	O.LazyBinds.Opcodes = MachOObj.getDyldInfoLazyBindOpcodes();			O.LazyBinds.Opcodes = MachOObj.getDyldInfoLazyBindOpcodes();
	}			}

	void MachOReader::readExportInfo(Object &O) const {			void MachOReader::readExportInfo(Object &O) const {
	O.Exports.Trie = MachOObj.getDyldInfoExportsTrie();			O.Exports.Trie = MachOObj.getDyldInfoExportsTrie();
	}			}

	void MachOReader::readLinkData(Object &O, Optional<size_t> LCIndex,			void MachOReader::readLinkData(Object &O, Optional<size_t> LCIndex,
	LinkData &LD) const {			LinkData &LD) const {
	if (!LCIndex)			if (!LCIndex)
	return;			return;
	const MachO::linkedit_data_command &LC =			const MachO::linkedit_data_command &LC =
	O.LoadCommands[*LCIndex].MachOLoadCommand.linkedit_data_command_data;			O.LoadCommands[*LCIndex].MachOLoadCommand.linkedit_data_command_data;
	LD.Data =			LD.Data =
	arrayRefFromStringRef(MachOObj.getData().substr(LC.dataoff, LC.datasize));			arrayRefFromStringRef(MachOObj.getData().substr(LC.dataoff, LC.datasize));
	}			}

	void MachOReader::readCodeSignature(Object &O) const {			void MachOReader::readCodeSignature(Object &O) const {
	return readLinkData(O, O.CodeSignatureCommandIndex, O.CodeSignature);			return readLinkData(O, O.CodeSignatureCommandIndex, O.CodeSignature);
	}			}

	void MachOReader::readDataInCodeData(Object &O) const {			void MachOReader::readDataInCodeData(Object &O) const {
	return readLinkData(O, O.DataInCodeCommandIndex, O.DataInCode);			return readLinkData(O, O.DataInCodeCommandIndex, O.DataInCode);
	}			}

	void MachOReader::readFunctionStartsData(Object &O) const {			void MachOReader::readFunctionStartsData(Object &O) const {
	return readLinkData(O, O.FunctionStartsCommandIndex, O.FunctionStarts);			return readLinkData(O, O.FunctionStartsCommandIndex, O.FunctionStarts);
	}			}

	void MachOReader::readIndirectSymbolTable(Object &O) const {			void MachOReader::readIndirectSymbolTable(Object &O) const {
	MachO::dysymtab_command DySymTab = MachOObj.getDysymtabLoadCommand();			MachO::dysymtab_command DySymTab = MachOObj.getDysymtabLoadCommand();
	constexpr uint32_t AbsOrLocalMask =			constexpr uint32_t AbsOrLocalMask =
	MachO::INDIRECT_SYMBOL_LOCAL \| MachO::INDIRECT_SYMBOL_ABS;			MachO::INDIRECT_SYMBOL_LOCAL \| MachO::INDIRECT_SYMBOL_ABS;
	for (uint32_t i = 0; i < DySymTab.nindirectsyms; ++i) {			for (uint32_t i = 0; i < DySymTab.nindirectsyms; ++i) {
	uint32_t Index = MachOObj.getIndirectSymbolTableEntry(DySymTab, i);			uint32_t Index = MachOObj.getIndirectSymbolTableEntry(DySymTab, i);
	if ((Index & AbsOrLocalMask) != 0)			if ((Index & AbsOrLocalMask) != 0)
	O.IndirectSymTable.Symbols.emplace_back(Index, None);			O.IndirectSymTable.Symbols.emplace_back(Index, None);
	else			else
	O.IndirectSymTable.Symbols.emplace_back(			O.IndirectSymTable.Symbols.emplace_back(
	Index, O.SymTable.getSymbolByIndex(Index));			Index, O.SymTable.getSymbolByIndex(Index));
	}			}
	}			}

	void MachOReader::readSwiftVersion(Object &O) const {			void MachOReader::readSwiftVersion(Object &O) const {
	struct ObjCImageInfo {			struct ObjCImageInfo {
	uint32_t Version;			uint32_t Version;
	uint32_t Flags;			uint32_t Flags;
	} ImageInfo;			} ImageInfo;

	for (const LoadCommand &LC : O.LoadCommands)			for (const LoadCommand &LC : O.LoadCommands)
	for (const std::unique_ptr<Section> &Sec : LC.Sections)			for (const std::unique_ptr<Section> &Sec : LC.Sections)
	if (Sec->Sectname == "__objc_imageinfo" &&			if (Sec->Sectname == "__objc_imageinfo" &&
	(Sec->Segname == "__DATA" \|\| Sec->Segname == "__DATA_CONST" \|\|			(Sec->Segname == "__DATA" \|\| Sec->Segname == "__DATA_CONST" \|\|
	Sec->Segname == "__DATA_DIRTY") &&			Sec->Segname == "__DATA_DIRTY") &&
	Sec->Content.size() >= sizeof(ObjCImageInfo)) {			Sec->Content.size() >= sizeof(ObjCImageInfo)) {
	memcpy(&ImageInfo, Sec->Content.data(), sizeof(ObjCImageInfo));			memcpy(&ImageInfo, Sec->Content.data(), sizeof(ObjCImageInfo));
	if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) {			if (MachOObj.isLittleEndian() != sys::IsLittleEndianHost) {
	sys::swapByteOrder(ImageInfo.Version);			sys::swapByteOrder(ImageInfo.Version);
	sys::swapByteOrder(ImageInfo.Flags);			sys::swapByteOrder(ImageInfo.Flags);
	}			}
	O.SwiftVersion = (ImageInfo.Flags >> 8) & 0xff;			O.SwiftVersion = (ImageInfo.Flags >> 8) & 0xff;
	return;			return;
	}			}
	}			}

	Expected<std::unique_ptr<Object>> MachOReader::create() const {			Expected<std::unique_ptr<Object>> MachOReader::create() const {
	auto Obj = std::make_unique<Object>();			auto Obj = std::make_unique<Object>();
	readHeader(*Obj);			readHeader(*Obj);
	if (Error E = readLoadCommands(*Obj))			if (Error E = readLoadCommands(*Obj))
	return std::move(E);			return std::move(E);
	readSymbolTable(*Obj);			readSymbolTable(*Obj);
	setSymbolInRelocationInfo(*Obj);			setSymbolInRelocationInfo(*Obj);
	readRebaseInfo(*Obj);			readRebaseInfo(*Obj);
	readBindInfo(*Obj);			readBindInfo(*Obj);
	readWeakBindInfo(*Obj);			readWeakBindInfo(*Obj);
	readLazyBindInfo(*Obj);			readLazyBindInfo(*Obj);
	readExportInfo(*Obj);			readExportInfo(*Obj);
	readCodeSignature(*Obj);			readCodeSignature(*Obj);
	readDataInCodeData(*Obj);			readDataInCodeData(*Obj);
	readFunctionStartsData(*Obj);			readFunctionStartsData(*Obj);
	readIndirectSymbolTable(*Obj);			readIndirectSymbolTable(*Obj);
	readSwiftVersion(*Obj);			readSwiftVersion(*Obj);
	return std::move(Obj);			return std::move(Obj);
	}			}

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/MachO/MachOWriter.h

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOWriter.h.

//===- MachOWriter.h --------------------------------------------- C++ --===//		//===- MachOWriter.h --------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "../Buffer.h"		#ifndef LLVM_LIB_OBJCOPY_MACHO_MACHOWRITER_H
		#define LLVM_LIB_OBJCOPY_MACHO_MACHOWRITER_H

#include "MachOLayoutBuilder.h"		#include "MachOLayoutBuilder.h"
#include "MachOObjcopy.h"
#include "Object.h"		#include "Object.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
		#include "llvm/ObjCopy/Buffer.h"
		#include "llvm/ObjCopy/MachO/MachOObjcopy.h"
#include "llvm/Object/MachO.h"		#include "llvm/Object/MachO.h"

namespace llvm {		namespace llvm {
class Error;		class Error;

namespace objcopy {		namespace objcopy {
namespace macho {		namespace macho {

Show All 38 Lines	public:
size_t totalSize() const;		size_t totalSize() const;
Error finalize();		Error finalize();
Error write();		Error write();
};		};

} // end namespace macho		} // end namespace macho
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

		#endif // LLVM_LIB_OBJCOPY_MACHO_MACHOWRITER_H

llvm/lib/ObjCopy/MachO/MachOWriter.cpp

This file was moved from llvm/tools/llvm-objcopy/MachO/MachOWriter.cpp.

	//===- MachOWriter.cpp ------------------------------------------- C++ --===//			//===- MachOWriter.cpp ------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "MachOWriter.h"			#include "MachOWriter.h"
	#include "MachOLayoutBuilder.h"			#include "MachOLayoutBuilder.h"
	#include "Object.h"			#include "Object.h"
	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"
	#include "llvm/Object/MachO.h"			#include "llvm/Object/MachO.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"
	#include <memory>			#include <memory>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	size_t MachOWriter::headerSize() const {			size_t MachOWriter::headerSize() const {
	return Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);			return Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);
	}			}

	size_t MachOWriter::loadCommandsSize() const { return O.Header.SizeOfCmds; }			size_t MachOWriter::loadCommandsSize() const { return O.Header.SizeOfCmds; }

	size_t MachOWriter::symTableSize() const {			size_t MachOWriter::symTableSize() const {
	return O.SymTable.Symbols.size() *			return O.SymTable.Symbols.size() *
	(Is64Bit ? sizeof(MachO::nlist_64) : sizeof(MachO::nlist));			(Is64Bit ? sizeof(MachO::nlist_64) : sizeof(MachO::nlist));
	}			}

	size_t MachOWriter::totalSize() const {			size_t MachOWriter::totalSize() const {
	// Going from tail to head and looking for an appropriate "anchor" to			// Going from tail to head and looking for an appropriate "anchor" to
	// calculate the total size assuming that all the offsets are either valid			// calculate the total size assuming that all the offsets are either valid
	// ("true") or 0 (0 indicates that the corresponding part is missing).			// ("true") or 0 (0 indicates that the corresponding part is missing).

	SmallVector<size_t, 7> Ends;			SmallVector<size_t, 7> Ends;
	if (O.SymTabCommandIndex) {			if (O.SymTabCommandIndex) {
	const MachO::symtab_command &SymTabCommand =			const MachO::symtab_command &SymTabCommand =
	O.LoadCommands[*O.SymTabCommandIndex]			O.LoadCommands[*O.SymTabCommandIndex]
	.MachOLoadCommand.symtab_command_data;			.MachOLoadCommand.symtab_command_data;
	if (SymTabCommand.symoff)			if (SymTabCommand.symoff)
	Ends.push_back(SymTabCommand.symoff + symTableSize());			Ends.push_back(SymTabCommand.symoff + symTableSize());
	if (SymTabCommand.stroff)			if (SymTabCommand.stroff)
	Ends.push_back(SymTabCommand.stroff + SymTabCommand.strsize);			Ends.push_back(SymTabCommand.stroff + SymTabCommand.strsize);
	}			}
	if (O.DyLdInfoCommandIndex) {			if (O.DyLdInfoCommandIndex) {
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	if (DyLdInfoCommand.rebase_off) {			if (DyLdInfoCommand.rebase_off) {
	assert((DyLdInfoCommand.rebase_size == O.Rebases.Opcodes.size()) &&			assert((DyLdInfoCommand.rebase_size == O.Rebases.Opcodes.size()) &&
	"Incorrect rebase opcodes size");			"Incorrect rebase opcodes size");
	Ends.push_back(DyLdInfoCommand.rebase_off + DyLdInfoCommand.rebase_size);			Ends.push_back(DyLdInfoCommand.rebase_off + DyLdInfoCommand.rebase_size);
	}			}
	if (DyLdInfoCommand.bind_off) {			if (DyLdInfoCommand.bind_off) {
	assert((DyLdInfoCommand.bind_size == O.Binds.Opcodes.size()) &&			assert((DyLdInfoCommand.bind_size == O.Binds.Opcodes.size()) &&
	"Incorrect bind opcodes size");			"Incorrect bind opcodes size");
	Ends.push_back(DyLdInfoCommand.bind_off + DyLdInfoCommand.bind_size);			Ends.push_back(DyLdInfoCommand.bind_off + DyLdInfoCommand.bind_size);
	}			}
	if (DyLdInfoCommand.weak_bind_off) {			if (DyLdInfoCommand.weak_bind_off) {
	assert((DyLdInfoCommand.weak_bind_size == O.WeakBinds.Opcodes.size()) &&			assert((DyLdInfoCommand.weak_bind_size == O.WeakBinds.Opcodes.size()) &&
	"Incorrect weak bind opcodes size");			"Incorrect weak bind opcodes size");
	Ends.push_back(DyLdInfoCommand.weak_bind_off +			Ends.push_back(DyLdInfoCommand.weak_bind_off +
	DyLdInfoCommand.weak_bind_size);			DyLdInfoCommand.weak_bind_size);
	}			}
	if (DyLdInfoCommand.lazy_bind_off) {			if (DyLdInfoCommand.lazy_bind_off) {
	assert((DyLdInfoCommand.lazy_bind_size == O.LazyBinds.Opcodes.size()) &&			assert((DyLdInfoCommand.lazy_bind_size == O.LazyBinds.Opcodes.size()) &&
	"Incorrect lazy bind opcodes size");			"Incorrect lazy bind opcodes size");
	Ends.push_back(DyLdInfoCommand.lazy_bind_off +			Ends.push_back(DyLdInfoCommand.lazy_bind_off +
	DyLdInfoCommand.lazy_bind_size);			DyLdInfoCommand.lazy_bind_size);
	}			}
	if (DyLdInfoCommand.export_off) {			if (DyLdInfoCommand.export_off) {
	assert((DyLdInfoCommand.export_size == O.Exports.Trie.size()) &&			assert((DyLdInfoCommand.export_size == O.Exports.Trie.size()) &&
	"Incorrect trie size");			"Incorrect trie size");
	Ends.push_back(DyLdInfoCommand.export_off + DyLdInfoCommand.export_size);			Ends.push_back(DyLdInfoCommand.export_off + DyLdInfoCommand.export_size);
	}			}
	}			}

	if (O.DySymTabCommandIndex) {			if (O.DySymTabCommandIndex) {
	const MachO::dysymtab_command &DySymTabCommand =			const MachO::dysymtab_command &DySymTabCommand =
	O.LoadCommands[*O.DySymTabCommandIndex]			O.LoadCommands[*O.DySymTabCommandIndex]
	.MachOLoadCommand.dysymtab_command_data;			.MachOLoadCommand.dysymtab_command_data;

	if (DySymTabCommand.indirectsymoff)			if (DySymTabCommand.indirectsymoff)
	Ends.push_back(DySymTabCommand.indirectsymoff +			Ends.push_back(DySymTabCommand.indirectsymoff +
	sizeof(uint32_t) * O.IndirectSymTable.Symbols.size());			sizeof(uint32_t) * O.IndirectSymTable.Symbols.size());
	}			}

	if (O.CodeSignatureCommandIndex) {			if (O.CodeSignatureCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.CodeSignatureCommandIndex]			O.LoadCommands[*O.CodeSignatureCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;
	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Ends.push_back(LinkEditDataCommand.dataoff +			Ends.push_back(LinkEditDataCommand.dataoff +
	LinkEditDataCommand.datasize);			LinkEditDataCommand.datasize);
	}			}

	if (O.DataInCodeCommandIndex) {			if (O.DataInCodeCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.DataInCodeCommandIndex]			O.LoadCommands[*O.DataInCodeCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;

	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Ends.push_back(LinkEditDataCommand.dataoff +			Ends.push_back(LinkEditDataCommand.dataoff +
	LinkEditDataCommand.datasize);			LinkEditDataCommand.datasize);
	}			}

	if (O.FunctionStartsCommandIndex) {			if (O.FunctionStartsCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.FunctionStartsCommandIndex]			O.LoadCommands[*O.FunctionStartsCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;

	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Ends.push_back(LinkEditDataCommand.dataoff +			Ends.push_back(LinkEditDataCommand.dataoff +
	LinkEditDataCommand.datasize);			LinkEditDataCommand.datasize);
	}			}

	// Otherwise, use the last section / reloction.			// Otherwise, use the last section / reloction.
	for (const LoadCommand &LC : O.LoadCommands)			for (const LoadCommand &LC : O.LoadCommands)
	for (const std::unique_ptr<Section> &S : LC.Sections) {			for (const std::unique_ptr<Section> &S : LC.Sections) {
	Ends.push_back(S->Offset + S->Size);			Ends.push_back(S->Offset + S->Size);
	if (S->RelOff)			if (S->RelOff)
	Ends.push_back(S->RelOff +			Ends.push_back(S->RelOff +
	S->NReloc * sizeof(MachO::any_relocation_info));			S->NReloc * sizeof(MachO::any_relocation_info));
	}			}

	if (!Ends.empty())			if (!Ends.empty())
	return *std::max_element(Ends.begin(), Ends.end());			return *std::max_element(Ends.begin(), Ends.end());

	// Otherwise, we have only Mach header and load commands.			// Otherwise, we have only Mach header and load commands.
	return headerSize() + loadCommandsSize();			return headerSize() + loadCommandsSize();
	}			}

	void MachOWriter::writeHeader() {			void MachOWriter::writeHeader() {
	MachO::mach_header_64 Header;			MachO::mach_header_64 Header;

	Header.magic = O.Header.Magic;			Header.magic = O.Header.Magic;
	Header.cputype = O.Header.CPUType;			Header.cputype = O.Header.CPUType;
	Header.cpusubtype = O.Header.CPUSubType;			Header.cpusubtype = O.Header.CPUSubType;
	Header.filetype = O.Header.FileType;			Header.filetype = O.Header.FileType;
	Header.ncmds = O.Header.NCmds;			Header.ncmds = O.Header.NCmds;
	Header.sizeofcmds = O.Header.SizeOfCmds;			Header.sizeofcmds = O.Header.SizeOfCmds;
	Header.flags = O.Header.Flags;			Header.flags = O.Header.Flags;
	Header.reserved = O.Header.Reserved;			Header.reserved = O.Header.Reserved;

	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(Header);			MachO::swapStruct(Header);

	auto HeaderSize =			auto HeaderSize =
	Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);			Is64Bit ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);
	memcpy(B.getBufferStart(), &Header, HeaderSize);			memcpy(B.getBufferStart(), &Header, HeaderSize);
	}			}

	void MachOWriter::writeLoadCommands() {			void MachOWriter::writeLoadCommands() {
	uint8_t *Begin = B.getBufferStart() + headerSize();			uint8_t *Begin = B.getBufferStart() + headerSize();
	for (const LoadCommand &LC : O.LoadCommands) {			for (const LoadCommand &LC : O.LoadCommands) {
	// Construct a load command.			// Construct a load command.
	MachO::macho_load_command MLC = LC.MachOLoadCommand;			MachO::macho_load_command MLC = LC.MachOLoadCommand;
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(MLC.segment_command_data);			MachO::swapStruct(MLC.segment_command_data);
	memcpy(Begin, &MLC.segment_command_data, sizeof(MachO::segment_command));			memcpy(Begin, &MLC.segment_command_data, sizeof(MachO::segment_command));
	Begin += sizeof(MachO::segment_command);			Begin += sizeof(MachO::segment_command);

	for (const std::unique_ptr<Section> &Sec : LC.Sections)			for (const std::unique_ptr<Section> &Sec : LC.Sections)
	writeSectionInLoadCommand<MachO::section>(*Sec, Begin);			writeSectionInLoadCommand<MachO::section>(*Sec, Begin);
	continue;			continue;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(MLC.segment_command_64_data);			MachO::swapStruct(MLC.segment_command_64_data);
	memcpy(Begin, &MLC.segment_command_64_data,			memcpy(Begin, &MLC.segment_command_64_data,
	sizeof(MachO::segment_command_64));			sizeof(MachO::segment_command_64));
	Begin += sizeof(MachO::segment_command_64);			Begin += sizeof(MachO::segment_command_64);

	for (const std::unique_ptr<Section> &Sec : LC.Sections)			for (const std::unique_ptr<Section> &Sec : LC.Sections)
	writeSectionInLoadCommand<MachO::section_64>(*Sec, Begin);			writeSectionInLoadCommand<MachO::section_64>(*Sec, Begin);
	continue;			continue;
	}			}

	#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \			#define HANDLE_LOAD_COMMAND(LCName, LCValue, LCStruct) \
	case MachO::LCName: \			case MachO::LCName: \
	assert(sizeof(MachO::LCStruct) + LC.Payload.size() == \			assert(sizeof(MachO::LCStruct) + LC.Payload.size() == \
	MLC.load_command_data.cmdsize); \			MLC.load_command_data.cmdsize); \
	if (IsLittleEndian != sys::IsLittleEndianHost) \			if (IsLittleEndian != sys::IsLittleEndianHost) \
	MachO::swapStruct(MLC.LCStruct##_data); \			MachO::swapStruct(MLC.LCStruct##_data); \
	memcpy(Begin, &MLC.LCStruct##_data, sizeof(MachO::LCStruct)); \			memcpy(Begin, &MLC.LCStruct##_data, sizeof(MachO::LCStruct)); \
	Begin += sizeof(MachO::LCStruct); \			Begin += sizeof(MachO::LCStruct); \
	if (!LC.Payload.empty()) \			if (!LC.Payload.empty()) \
	memcpy(Begin, LC.Payload.data(), LC.Payload.size()); \			memcpy(Begin, LC.Payload.data(), LC.Payload.size()); \
	Begin += LC.Payload.size(); \			Begin += LC.Payload.size(); \
	break;			break;

	// Copy the load command as it is.			// Copy the load command as it is.
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	default:			default:
	assert(sizeof(MachO::load_command) + LC.Payload.size() ==			assert(sizeof(MachO::load_command) + LC.Payload.size() ==
	MLC.load_command_data.cmdsize);			MLC.load_command_data.cmdsize);
	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(MLC.load_command_data);			MachO::swapStruct(MLC.load_command_data);
	memcpy(Begin, &MLC.load_command_data, sizeof(MachO::load_command));			memcpy(Begin, &MLC.load_command_data, sizeof(MachO::load_command));
	Begin += sizeof(MachO::load_command);			Begin += sizeof(MachO::load_command);
	if (!LC.Payload.empty())			if (!LC.Payload.empty())
	memcpy(Begin, LC.Payload.data(), LC.Payload.size());			memcpy(Begin, LC.Payload.data(), LC.Payload.size());
	Begin += LC.Payload.size();			Begin += LC.Payload.size();
	break;			break;
	#include "llvm/BinaryFormat/MachO.def"			#include "llvm/BinaryFormat/MachO.def"
	}			}
	}			}
	}			}

	template <typename StructType>			template <typename StructType>
	void MachOWriter::writeSectionInLoadCommand(const Section &Sec, uint8_t *&Out) {			void MachOWriter::writeSectionInLoadCommand(const Section &Sec, uint8_t *&Out) {
	StructType Temp;			StructType Temp;
	assert(Sec.Segname.size() <= sizeof(Temp.segname) && "too long segment name");			assert(Sec.Segname.size() <= sizeof(Temp.segname) && "too long segment name");
	assert(Sec.Sectname.size() <= sizeof(Temp.sectname) &&			assert(Sec.Sectname.size() <= sizeof(Temp.sectname) &&
	"too long section name");			"too long section name");
	memset(&Temp, 0, sizeof(StructType));			memset(&Temp, 0, sizeof(StructType));
	memcpy(Temp.segname, Sec.Segname.data(), Sec.Segname.size());			memcpy(Temp.segname, Sec.Segname.data(), Sec.Segname.size());
	memcpy(Temp.sectname, Sec.Sectname.data(), Sec.Sectname.size());			memcpy(Temp.sectname, Sec.Sectname.data(), Sec.Sectname.size());
	Temp.addr = Sec.Addr;			Temp.addr = Sec.Addr;
	Temp.size = Sec.Size;			Temp.size = Sec.Size;
	Temp.offset = Sec.Offset;			Temp.offset = Sec.Offset;
	Temp.align = Sec.Align;			Temp.align = Sec.Align;
	Temp.reloff = Sec.RelOff;			Temp.reloff = Sec.RelOff;
	Temp.nreloc = Sec.NReloc;			Temp.nreloc = Sec.NReloc;
	Temp.flags = Sec.Flags;			Temp.flags = Sec.Flags;
	Temp.reserved1 = Sec.Reserved1;			Temp.reserved1 = Sec.Reserved1;
	Temp.reserved2 = Sec.Reserved2;			Temp.reserved2 = Sec.Reserved2;

	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(Temp);			MachO::swapStruct(Temp);
	memcpy(Out, &Temp, sizeof(StructType));			memcpy(Out, &Temp, sizeof(StructType));
	Out += sizeof(StructType);			Out += sizeof(StructType);
	}			}

	void MachOWriter::writeSections() {			void MachOWriter::writeSections() {
	for (const LoadCommand &LC : O.LoadCommands)			for (const LoadCommand &LC : O.LoadCommands)
	for (const std::unique_ptr<Section> &Sec : LC.Sections) {			for (const std::unique_ptr<Section> &Sec : LC.Sections) {
	if (Sec->isVirtualSection())			if (Sec->isVirtualSection())
	continue;			continue;

	assert(Sec->Offset && "Section offset can not be zero");			assert(Sec->Offset && "Section offset can not be zero");
	assert((Sec->Size == Sec->Content.size()) && "Incorrect section size");			assert((Sec->Size == Sec->Content.size()) && "Incorrect section size");
	memcpy(B.getBufferStart() + Sec->Offset, Sec->Content.data(),			memcpy(B.getBufferStart() + Sec->Offset, Sec->Content.data(),
	Sec->Content.size());			Sec->Content.size());
	for (size_t Index = 0; Index < Sec->Relocations.size(); ++Index) {			for (size_t Index = 0; Index < Sec->Relocations.size(); ++Index) {
	RelocationInfo RelocInfo = Sec->Relocations[Index];			RelocationInfo RelocInfo = Sec->Relocations[Index];
	if (!RelocInfo.Scattered) {			if (!RelocInfo.Scattered) {
	const uint32_t SymbolNum = RelocInfo.Extern			const uint32_t SymbolNum = RelocInfo.Extern
	? (*RelocInfo.Symbol)->Index			? (*RelocInfo.Symbol)->Index
	: (*RelocInfo.Sec)->Index;			: (*RelocInfo.Sec)->Index;
	RelocInfo.setPlainRelocationSymbolNum(SymbolNum, IsLittleEndian);			RelocInfo.setPlainRelocationSymbolNum(SymbolNum, IsLittleEndian);
	}			}
	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(			MachO::swapStruct(
	reinterpret_cast<MachO::any_relocation_info &>(RelocInfo.Info));			reinterpret_cast<MachO::any_relocation_info &>(RelocInfo.Info));
	memcpy(B.getBufferStart() + Sec->RelOff +			memcpy(B.getBufferStart() + Sec->RelOff +
	Index * sizeof(MachO::any_relocation_info),			Index * sizeof(MachO::any_relocation_info),
	&RelocInfo.Info, sizeof(RelocInfo.Info));			&RelocInfo.Info, sizeof(RelocInfo.Info));
	}			}
	}			}
	}			}

	template <typename NListType>			template <typename NListType>
	void writeNListEntry(const SymbolEntry &SE, bool IsLittleEndian, char *&Out,			void writeNListEntry(const SymbolEntry &SE, bool IsLittleEndian, char *&Out,
	uint32_t Nstrx) {			uint32_t Nstrx) {
	NListType ListEntry;			NListType ListEntry;
	ListEntry.n_strx = Nstrx;			ListEntry.n_strx = Nstrx;
	ListEntry.n_type = SE.n_type;			ListEntry.n_type = SE.n_type;
	ListEntry.n_sect = SE.n_sect;			ListEntry.n_sect = SE.n_sect;
	ListEntry.n_desc = SE.n_desc;			ListEntry.n_desc = SE.n_desc;
	ListEntry.n_value = SE.n_value;			ListEntry.n_value = SE.n_value;

	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	MachO::swapStruct(ListEntry);			MachO::swapStruct(ListEntry);
	memcpy(Out, reinterpret_cast<const char *>(&ListEntry), sizeof(NListType));			memcpy(Out, reinterpret_cast<const char *>(&ListEntry), sizeof(NListType));
	Out += sizeof(NListType);			Out += sizeof(NListType);
	}			}

	void MachOWriter::writeStringTable() {			void MachOWriter::writeStringTable() {
	if (!O.SymTabCommandIndex)			if (!O.SymTabCommandIndex)
	return;			return;
	const MachO::symtab_command &SymTabCommand =			const MachO::symtab_command &SymTabCommand =
	O.LoadCommands[*O.SymTabCommandIndex]			O.LoadCommands[*O.SymTabCommandIndex]
	.MachOLoadCommand.symtab_command_data;			.MachOLoadCommand.symtab_command_data;

	uint8_t StrTable = (uint8_t )B.getBufferStart() + SymTabCommand.stroff;			uint8_t StrTable = (uint8_t )B.getBufferStart() + SymTabCommand.stroff;
	LayoutBuilder.getStringTableBuilder().write(StrTable);			LayoutBuilder.getStringTableBuilder().write(StrTable);
	}			}

	void MachOWriter::writeSymbolTable() {			void MachOWriter::writeSymbolTable() {
	if (!O.SymTabCommandIndex)			if (!O.SymTabCommandIndex)
	return;			return;
	const MachO::symtab_command &SymTabCommand =			const MachO::symtab_command &SymTabCommand =
	O.LoadCommands[*O.SymTabCommandIndex]			O.LoadCommands[*O.SymTabCommandIndex]
	.MachOLoadCommand.symtab_command_data;			.MachOLoadCommand.symtab_command_data;

	char SymTable = (char )B.getBufferStart() + SymTabCommand.symoff;			char SymTable = (char )B.getBufferStart() + SymTabCommand.symoff;
	for (auto Iter = O.SymTable.Symbols.begin(), End = O.SymTable.Symbols.end();			for (auto Iter = O.SymTable.Symbols.begin(), End = O.SymTable.Symbols.end();
	Iter != End; Iter++) {			Iter != End; Iter++) {
	SymbolEntry *Sym = Iter->get();			SymbolEntry *Sym = Iter->get();
	uint32_t Nstrx = LayoutBuilder.getStringTableBuilder().getOffset(Sym->Name);			uint32_t Nstrx = LayoutBuilder.getStringTableBuilder().getOffset(Sym->Name);

	if (Is64Bit)			if (Is64Bit)
	writeNListEntry<MachO::nlist_64>(*Sym, IsLittleEndian, SymTable, Nstrx);			writeNListEntry<MachO::nlist_64>(*Sym, IsLittleEndian, SymTable, Nstrx);
	else			else
	writeNListEntry<MachO::nlist>(*Sym, IsLittleEndian, SymTable, Nstrx);			writeNListEntry<MachO::nlist>(*Sym, IsLittleEndian, SymTable, Nstrx);
	}			}
	}			}

	void MachOWriter::writeRebaseInfo() {			void MachOWriter::writeRebaseInfo() {
	if (!O.DyLdInfoCommandIndex)			if (!O.DyLdInfoCommandIndex)
	return;			return;
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	char Out = (char )B.getBufferStart() + DyLdInfoCommand.rebase_off;			char Out = (char )B.getBufferStart() + DyLdInfoCommand.rebase_off;
	assert((DyLdInfoCommand.rebase_size == O.Rebases.Opcodes.size()) &&			assert((DyLdInfoCommand.rebase_size == O.Rebases.Opcodes.size()) &&
	"Incorrect rebase opcodes size");			"Incorrect rebase opcodes size");
	memcpy(Out, O.Rebases.Opcodes.data(), O.Rebases.Opcodes.size());			memcpy(Out, O.Rebases.Opcodes.data(), O.Rebases.Opcodes.size());
	}			}

	void MachOWriter::writeBindInfo() {			void MachOWriter::writeBindInfo() {
	if (!O.DyLdInfoCommandIndex)			if (!O.DyLdInfoCommandIndex)
	return;			return;
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	char Out = (char )B.getBufferStart() + DyLdInfoCommand.bind_off;			char Out = (char )B.getBufferStart() + DyLdInfoCommand.bind_off;
	assert((DyLdInfoCommand.bind_size == O.Binds.Opcodes.size()) &&			assert((DyLdInfoCommand.bind_size == O.Binds.Opcodes.size()) &&
	"Incorrect bind opcodes size");			"Incorrect bind opcodes size");
	memcpy(Out, O.Binds.Opcodes.data(), O.Binds.Opcodes.size());			memcpy(Out, O.Binds.Opcodes.data(), O.Binds.Opcodes.size());
	}			}

	void MachOWriter::writeWeakBindInfo() {			void MachOWriter::writeWeakBindInfo() {
	if (!O.DyLdInfoCommandIndex)			if (!O.DyLdInfoCommandIndex)
	return;			return;
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	char Out = (char )B.getBufferStart() + DyLdInfoCommand.weak_bind_off;			char Out = (char )B.getBufferStart() + DyLdInfoCommand.weak_bind_off;
	assert((DyLdInfoCommand.weak_bind_size == O.WeakBinds.Opcodes.size()) &&			assert((DyLdInfoCommand.weak_bind_size == O.WeakBinds.Opcodes.size()) &&
	"Incorrect weak bind opcodes size");			"Incorrect weak bind opcodes size");
	memcpy(Out, O.WeakBinds.Opcodes.data(), O.WeakBinds.Opcodes.size());			memcpy(Out, O.WeakBinds.Opcodes.data(), O.WeakBinds.Opcodes.size());
	}			}

	void MachOWriter::writeLazyBindInfo() {			void MachOWriter::writeLazyBindInfo() {
	if (!O.DyLdInfoCommandIndex)			if (!O.DyLdInfoCommandIndex)
	return;			return;
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	char Out = (char )B.getBufferStart() + DyLdInfoCommand.lazy_bind_off;			char Out = (char )B.getBufferStart() + DyLdInfoCommand.lazy_bind_off;
	assert((DyLdInfoCommand.lazy_bind_size == O.LazyBinds.Opcodes.size()) &&			assert((DyLdInfoCommand.lazy_bind_size == O.LazyBinds.Opcodes.size()) &&
	"Incorrect lazy bind opcodes size");			"Incorrect lazy bind opcodes size");
	memcpy(Out, O.LazyBinds.Opcodes.data(), O.LazyBinds.Opcodes.size());			memcpy(Out, O.LazyBinds.Opcodes.data(), O.LazyBinds.Opcodes.size());
	}			}

	void MachOWriter::writeExportInfo() {			void MachOWriter::writeExportInfo() {
	if (!O.DyLdInfoCommandIndex)			if (!O.DyLdInfoCommandIndex)
	return;			return;
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	char Out = (char )B.getBufferStart() + DyLdInfoCommand.export_off;			char Out = (char )B.getBufferStart() + DyLdInfoCommand.export_off;
	assert((DyLdInfoCommand.export_size == O.Exports.Trie.size()) &&			assert((DyLdInfoCommand.export_size == O.Exports.Trie.size()) &&
	"Incorrect export trie size");			"Incorrect export trie size");
	memcpy(Out, O.Exports.Trie.data(), O.Exports.Trie.size());			memcpy(Out, O.Exports.Trie.data(), O.Exports.Trie.size());
	}			}

	void MachOWriter::writeIndirectSymbolTable() {			void MachOWriter::writeIndirectSymbolTable() {
	if (!O.DySymTabCommandIndex)			if (!O.DySymTabCommandIndex)
	return;			return;

	const MachO::dysymtab_command &DySymTabCommand =			const MachO::dysymtab_command &DySymTabCommand =
	O.LoadCommands[*O.DySymTabCommandIndex]			O.LoadCommands[*O.DySymTabCommandIndex]
	.MachOLoadCommand.dysymtab_command_data;			.MachOLoadCommand.dysymtab_command_data;

	uint32_t *Out =			uint32_t *Out =
	(uint32_t *)(B.getBufferStart() + DySymTabCommand.indirectsymoff);			(uint32_t *)(B.getBufferStart() + DySymTabCommand.indirectsymoff);
	for (const IndirectSymbolEntry &Sym : O.IndirectSymTable.Symbols) {			for (const IndirectSymbolEntry &Sym : O.IndirectSymTable.Symbols) {
	uint32_t Entry = (Sym.Symbol) ? (*Sym.Symbol)->Index : Sym.OriginalIndex;			uint32_t Entry = (Sym.Symbol) ? (*Sym.Symbol)->Index : Sym.OriginalIndex;
	if (IsLittleEndian != sys::IsLittleEndianHost)			if (IsLittleEndian != sys::IsLittleEndianHost)
	sys::swapByteOrder(Entry);			sys::swapByteOrder(Entry);
	*Out++ = Entry;			*Out++ = Entry;
	}			}
	}			}

	void MachOWriter::writeLinkData(Optional<size_t> LCIndex, const LinkData &LD) {			void MachOWriter::writeLinkData(Optional<size_t> LCIndex, const LinkData &LD) {
	if (!LCIndex)			if (!LCIndex)
	return;			return;
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*LCIndex].MachOLoadCommand.linkedit_data_command_data;			O.LoadCommands[*LCIndex].MachOLoadCommand.linkedit_data_command_data;
	char Out = (char )B.getBufferStart() + LinkEditDataCommand.dataoff;			char Out = (char )B.getBufferStart() + LinkEditDataCommand.dataoff;
	assert((LinkEditDataCommand.datasize == LD.Data.size()) &&			assert((LinkEditDataCommand.datasize == LD.Data.size()) &&
	"Incorrect data size");			"Incorrect data size");
	memcpy(Out, LD.Data.data(), LD.Data.size());			memcpy(Out, LD.Data.data(), LD.Data.size());
	}			}

	void MachOWriter::writeCodeSignatureData() {			void MachOWriter::writeCodeSignatureData() {
	return writeLinkData(O.CodeSignatureCommandIndex, O.CodeSignature);			return writeLinkData(O.CodeSignatureCommandIndex, O.CodeSignature);
	}			}

	void MachOWriter::writeDataInCodeData() {			void MachOWriter::writeDataInCodeData() {
	return writeLinkData(O.DataInCodeCommandIndex, O.DataInCode);			return writeLinkData(O.DataInCodeCommandIndex, O.DataInCode);
	}			}

	void MachOWriter::writeFunctionStartsData() {			void MachOWriter::writeFunctionStartsData() {
	return writeLinkData(O.FunctionStartsCommandIndex, O.FunctionStarts);			return writeLinkData(O.FunctionStartsCommandIndex, O.FunctionStarts);
	}			}

	void MachOWriter::writeTail() {			void MachOWriter::writeTail() {
	typedef void (MachOWriter::*WriteHandlerType)(void);			typedef void (MachOWriter::*WriteHandlerType)(void);
	typedef std::pair<uint64_t, WriteHandlerType> WriteOperation;			typedef std::pair<uint64_t, WriteHandlerType> WriteOperation;
	SmallVector<WriteOperation, 7> Queue;			SmallVector<WriteOperation, 7> Queue;

	if (O.SymTabCommandIndex) {			if (O.SymTabCommandIndex) {
	const MachO::symtab_command &SymTabCommand =			const MachO::symtab_command &SymTabCommand =
	O.LoadCommands[*O.SymTabCommandIndex]			O.LoadCommands[*O.SymTabCommandIndex]
	.MachOLoadCommand.symtab_command_data;			.MachOLoadCommand.symtab_command_data;
	if (SymTabCommand.symoff)			if (SymTabCommand.symoff)
	Queue.push_back({SymTabCommand.symoff, &MachOWriter::writeSymbolTable});			Queue.push_back({SymTabCommand.symoff, &MachOWriter::writeSymbolTable});
	if (SymTabCommand.stroff)			if (SymTabCommand.stroff)
	Queue.push_back({SymTabCommand.stroff, &MachOWriter::writeStringTable});			Queue.push_back({SymTabCommand.stroff, &MachOWriter::writeStringTable});
	}			}

	if (O.DyLdInfoCommandIndex) {			if (O.DyLdInfoCommandIndex) {
	const MachO::dyld_info_command &DyLdInfoCommand =			const MachO::dyld_info_command &DyLdInfoCommand =
	O.LoadCommands[*O.DyLdInfoCommandIndex]			O.LoadCommands[*O.DyLdInfoCommandIndex]
	.MachOLoadCommand.dyld_info_command_data;			.MachOLoadCommand.dyld_info_command_data;
	if (DyLdInfoCommand.rebase_off)			if (DyLdInfoCommand.rebase_off)
	Queue.push_back(			Queue.push_back(
	{DyLdInfoCommand.rebase_off, &MachOWriter::writeRebaseInfo});			{DyLdInfoCommand.rebase_off, &MachOWriter::writeRebaseInfo});
	if (DyLdInfoCommand.bind_off)			if (DyLdInfoCommand.bind_off)
	Queue.push_back({DyLdInfoCommand.bind_off, &MachOWriter::writeBindInfo});			Queue.push_back({DyLdInfoCommand.bind_off, &MachOWriter::writeBindInfo});
	if (DyLdInfoCommand.weak_bind_off)			if (DyLdInfoCommand.weak_bind_off)
	Queue.push_back(			Queue.push_back(
	{DyLdInfoCommand.weak_bind_off, &MachOWriter::writeWeakBindInfo});			{DyLdInfoCommand.weak_bind_off, &MachOWriter::writeWeakBindInfo});
	if (DyLdInfoCommand.lazy_bind_off)			if (DyLdInfoCommand.lazy_bind_off)
	Queue.push_back(			Queue.push_back(
	{DyLdInfoCommand.lazy_bind_off, &MachOWriter::writeLazyBindInfo});			{DyLdInfoCommand.lazy_bind_off, &MachOWriter::writeLazyBindInfo});
	if (DyLdInfoCommand.export_off)			if (DyLdInfoCommand.export_off)
	Queue.push_back(			Queue.push_back(
	{DyLdInfoCommand.export_off, &MachOWriter::writeExportInfo});			{DyLdInfoCommand.export_off, &MachOWriter::writeExportInfo});
	}			}

	if (O.DySymTabCommandIndex) {			if (O.DySymTabCommandIndex) {
	const MachO::dysymtab_command &DySymTabCommand =			const MachO::dysymtab_command &DySymTabCommand =
	O.LoadCommands[*O.DySymTabCommandIndex]			O.LoadCommands[*O.DySymTabCommandIndex]
	.MachOLoadCommand.dysymtab_command_data;			.MachOLoadCommand.dysymtab_command_data;

	if (DySymTabCommand.indirectsymoff)			if (DySymTabCommand.indirectsymoff)
	Queue.emplace_back(DySymTabCommand.indirectsymoff,			Queue.emplace_back(DySymTabCommand.indirectsymoff,
	&MachOWriter::writeIndirectSymbolTable);			&MachOWriter::writeIndirectSymbolTable);
	}			}

	if (O.CodeSignatureCommandIndex) {			if (O.CodeSignatureCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.CodeSignatureCommandIndex]			O.LoadCommands[*O.CodeSignatureCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;

	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Queue.emplace_back(LinkEditDataCommand.dataoff,			Queue.emplace_back(LinkEditDataCommand.dataoff,
	&MachOWriter::writeCodeSignatureData);			&MachOWriter::writeCodeSignatureData);
	}			}

	if (O.DataInCodeCommandIndex) {			if (O.DataInCodeCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.DataInCodeCommandIndex]			O.LoadCommands[*O.DataInCodeCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;

	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Queue.emplace_back(LinkEditDataCommand.dataoff,			Queue.emplace_back(LinkEditDataCommand.dataoff,
	&MachOWriter::writeDataInCodeData);			&MachOWriter::writeDataInCodeData);
	}			}

	if (O.FunctionStartsCommandIndex) {			if (O.FunctionStartsCommandIndex) {
	const MachO::linkedit_data_command &LinkEditDataCommand =			const MachO::linkedit_data_command &LinkEditDataCommand =
	O.LoadCommands[*O.FunctionStartsCommandIndex]			O.LoadCommands[*O.FunctionStartsCommandIndex]
	.MachOLoadCommand.linkedit_data_command_data;			.MachOLoadCommand.linkedit_data_command_data;

	if (LinkEditDataCommand.dataoff)			if (LinkEditDataCommand.dataoff)
	Queue.emplace_back(LinkEditDataCommand.dataoff,			Queue.emplace_back(LinkEditDataCommand.dataoff,
	&MachOWriter::writeFunctionStartsData);			&MachOWriter::writeFunctionStartsData);
	}			}

	llvm::sort(Queue, [](const WriteOperation &LHS, const WriteOperation &RHS) {			llvm::sort(Queue, [](const WriteOperation &LHS, const WriteOperation &RHS) {
	return LHS.first < RHS.first;			return LHS.first < RHS.first;
	});			});

	for (auto WriteOp : Queue)			for (auto WriteOp : Queue)
	(this->*WriteOp.second)();			(this->*WriteOp.second)();
	}			}

	Error MachOWriter::finalize() { return LayoutBuilder.layout(); }			Error MachOWriter::finalize() { return LayoutBuilder.layout(); }

	Error MachOWriter::write() {			Error MachOWriter::write() {
	if (Error E = B.allocate(totalSize()))			if (Error E = B.allocate(totalSize()))
	return E;			return E;
	memset(B.getBufferStart(), 0, totalSize());			memset(B.getBufferStart(), 0, totalSize());
	writeHeader();			writeHeader();
	writeLoadCommands();			writeLoadCommands();
	writeSections();			writeSections();
	writeTail();			writeTail();
	return B.commit();			return B.commit();
	}			}

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/MachO/Object.h

This file was moved from llvm/tools/llvm-objcopy/MachO/Object.h.

	//===- Object.h - Mach-O object file model ----------------------- C++ --===//			//===- Object.h - Mach-O object file model ----------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_OBJCOPY_MACHO_OBJECT_H			#ifndef LLVM_OBJCOPY_MACHO_OBJECT_H
	#define LLVM_OBJCOPY_MACHO_OBJECT_H			#define LLVM_OBJCOPY_MACHO_OBJECT_H

	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"
	#include "llvm/MC/StringTableBuilder.h"			#include "llvm/MC/StringTableBuilder.h"
	#include "llvm/ObjectYAML/DWARFYAML.h"			#include "llvm/ObjectYAML/DWARFYAML.h"
	#include "llvm/Support/StringSaver.h"			#include "llvm/Support/StringSaver.h"
	#include "llvm/Support/YAMLTraits.h"			#include "llvm/Support/YAMLTraits.h"
	#include <cstdint>			#include <cstdint>
	#include <string>			#include <string>
	#include <vector>			#include <vector>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	struct MachHeader {			struct MachHeader {
	uint32_t Magic;			uint32_t Magic;
	uint32_t CPUType;			uint32_t CPUType;
	uint32_t CPUSubType;			uint32_t CPUSubType;
	uint32_t FileType;			uint32_t FileType;
	uint32_t NCmds;			uint32_t NCmds;
	uint32_t SizeOfCmds;			uint32_t SizeOfCmds;
	uint32_t Flags;			uint32_t Flags;
	uint32_t Reserved = 0;			uint32_t Reserved = 0;
	};			};

	struct RelocationInfo;			struct RelocationInfo;
	struct Section {			struct Section {
	uint32_t Index;			uint32_t Index;
	std::string Segname;			std::string Segname;
	std::string Sectname;			std::string Sectname;
	// CanonicalName is a string formatted as “<Segname>,<Sectname>".			// CanonicalName is a string formatted as “<Segname>,<Sectname>".
	std::string CanonicalName;			std::string CanonicalName;
	uint64_t Addr = 0;			uint64_t Addr = 0;
	uint64_t Size = 0;			uint64_t Size = 0;
	uint32_t Offset = 0;			uint32_t Offset = 0;
	uint32_t Align = 0;			uint32_t Align = 0;
	uint32_t RelOff = 0;			uint32_t RelOff = 0;
	uint32_t NReloc = 0;			uint32_t NReloc = 0;
	uint32_t Flags = 0;			uint32_t Flags = 0;
	uint32_t Reserved1 = 0;			uint32_t Reserved1 = 0;
	uint32_t Reserved2 = 0;			uint32_t Reserved2 = 0;
	uint32_t Reserved3 = 0;			uint32_t Reserved3 = 0;
	StringRef Content;			StringRef Content;
	std::vector<RelocationInfo> Relocations;			std::vector<RelocationInfo> Relocations;

	Section(StringRef SegName, StringRef SectName)			Section(StringRef SegName, StringRef SectName)
	: Segname(std::string(SegName)), Sectname(std::string(SectName)),			: Segname(std::string(SegName)), Sectname(std::string(SectName)),
	CanonicalName((Twine(SegName) + Twine(',') + SectName).str()) {}			CanonicalName((Twine(SegName) + Twine(',') + SectName).str()) {}

	Section(StringRef SegName, StringRef SectName, StringRef Content)			Section(StringRef SegName, StringRef SectName, StringRef Content)
	: Segname(std::string(SegName)), Sectname(std::string(SectName)),			: Segname(std::string(SegName)), Sectname(std::string(SectName)),
	CanonicalName((Twine(SegName) + Twine(',') + SectName).str()),			CanonicalName((Twine(SegName) + Twine(',') + SectName).str()),
	Content(Content) {}			Content(Content) {}

	MachO::SectionType getType() const {			MachO::SectionType getType() const {
	return static_cast<MachO::SectionType>(Flags & MachO::SECTION_TYPE);			return static_cast<MachO::SectionType>(Flags & MachO::SECTION_TYPE);
	}			}

	bool isVirtualSection() const {			bool isVirtualSection() const {
	return (getType() == MachO::S_ZEROFILL \|\|			return (getType() == MachO::S_ZEROFILL \|\|
	getType() == MachO::S_GB_ZEROFILL \|\|			getType() == MachO::S_GB_ZEROFILL \|\|
	getType() == MachO::S_THREAD_LOCAL_ZEROFILL);			getType() == MachO::S_THREAD_LOCAL_ZEROFILL);
	}			}
	};			};

	struct LoadCommand {			struct LoadCommand {
	// The type MachO::macho_load_command is defined in llvm/BinaryFormat/MachO.h			// The type MachO::macho_load_command is defined in llvm/BinaryFormat/MachO.h
	// and it is a union of all the structs corresponding to various load			// and it is a union of all the structs corresponding to various load
	// commands.			// commands.
	MachO::macho_load_command MachOLoadCommand;			MachO::macho_load_command MachOLoadCommand;

	// The raw content of the payload of the load command (located right after the			// The raw content of the payload of the load command (located right after the
	// corresponding struct). In some cases it is either empty or can be			// corresponding struct). In some cases it is either empty or can be
	// copied-over without digging into its structure.			// copied-over without digging into its structure.
	std::vector<uint8_t> Payload;			std::vector<uint8_t> Payload;

	// Some load commands can contain (inside the payload) an array of sections,			// Some load commands can contain (inside the payload) an array of sections,
	// though the contents of the sections are stored separately. The struct			// though the contents of the sections are stored separately. The struct
	// Section describes only sections' metadata and where to find the			// Section describes only sections' metadata and where to find the
	// corresponding content inside the binary.			// corresponding content inside the binary.
	std::vector<std::unique_ptr<Section>> Sections;			std::vector<std::unique_ptr<Section>> Sections;

	// Returns the segment name if the load command is a segment command.			// Returns the segment name if the load command is a segment command.
	Optional<StringRef> getSegmentName() const;			Optional<StringRef> getSegmentName() const;

	// Returns the segment vm address if the load command is a segment command.			// Returns the segment vm address if the load command is a segment command.
	Optional<uint64_t> getSegmentVMAddr() const;			Optional<uint64_t> getSegmentVMAddr() const;
	};			};

	// A symbol information. Fields which starts with "n_" are same as them in the			// A symbol information. Fields which starts with "n_" are same as them in the
	// nlist.			// nlist.
	struct SymbolEntry {			struct SymbolEntry {
	std::string Name;			std::string Name;
	bool Referenced = false;			bool Referenced = false;
	uint32_t Index;			uint32_t Index;
	uint8_t n_type;			uint8_t n_type;
	uint8_t n_sect;			uint8_t n_sect;
	uint16_t n_desc;			uint16_t n_desc;
	uint64_t n_value;			uint64_t n_value;

	bool isExternalSymbol() const { return n_type & MachO::N_EXT; }			bool isExternalSymbol() const { return n_type & MachO::N_EXT; }

	bool isLocalSymbol() const { return !isExternalSymbol(); }			bool isLocalSymbol() const { return !isExternalSymbol(); }

	bool isUndefinedSymbol() const {			bool isUndefinedSymbol() const {
	return (n_type & MachO::N_TYPE) == MachO::N_UNDF;			return (n_type & MachO::N_TYPE) == MachO::N_UNDF;
	}			}

	bool isSwiftSymbol() const {			bool isSwiftSymbol() const {
	return StringRef(Name).startswith("_$s") \|\|			return StringRef(Name).startswith("_$s") \|\|
	StringRef(Name).startswith("_$S");			StringRef(Name).startswith("_$S");
	}			}

	Optional<uint32_t> section() const {			Optional<uint32_t> section() const {
	return n_sect == MachO::NO_SECT ? None : Optional<uint32_t>(n_sect);			return n_sect == MachO::NO_SECT ? None : Optional<uint32_t>(n_sect);
	}			}
	};			};

	/// The location of the symbol table inside the binary is described by LC_SYMTAB			/// The location of the symbol table inside the binary is described by LC_SYMTAB
	/// load command.			/// load command.
	struct SymbolTable {			struct SymbolTable {
	std::vector<std::unique_ptr<SymbolEntry>> Symbols;			std::vector<std::unique_ptr<SymbolEntry>> Symbols;

	using iterator = pointee_iterator<			using iterator = pointee_iterator<
	std::vector<std::unique_ptr<SymbolEntry>>::const_iterator>;			std::vector<std::unique_ptr<SymbolEntry>>::const_iterator>;

	iterator begin() const { return iterator(Symbols.begin()); }			iterator begin() const { return iterator(Symbols.begin()); }
	iterator end() const { return iterator(Symbols.end()); }			iterator end() const { return iterator(Symbols.end()); }

	const SymbolEntry *getSymbolByIndex(uint32_t Index) const;			const SymbolEntry *getSymbolByIndex(uint32_t Index) const;
	SymbolEntry *getSymbolByIndex(uint32_t Index);			SymbolEntry *getSymbolByIndex(uint32_t Index);
	void removeSymbols(			void removeSymbols(
	function_ref<bool(const std::unique_ptr<SymbolEntry> &)> ToRemove);			function_ref<bool(const std::unique_ptr<SymbolEntry> &)> ToRemove);
	};			};

	struct IndirectSymbolEntry {			struct IndirectSymbolEntry {
	// The original value in an indirect symbol table. Higher bits encode extra			// The original value in an indirect symbol table. Higher bits encode extra
	// information (INDIRECT_SYMBOL_LOCAL and INDIRECT_SYMBOL_ABS).			// information (INDIRECT_SYMBOL_LOCAL and INDIRECT_SYMBOL_ABS).
	uint32_t OriginalIndex;			uint32_t OriginalIndex;
	/// The Symbol referenced by this entry. It's None if the index is			/// The Symbol referenced by this entry. It's None if the index is
	/// INDIRECT_SYMBOL_LOCAL or INDIRECT_SYMBOL_ABS.			/// INDIRECT_SYMBOL_LOCAL or INDIRECT_SYMBOL_ABS.
	Optional<SymbolEntry *> Symbol;			Optional<SymbolEntry *> Symbol;

	IndirectSymbolEntry(uint32_t OriginalIndex, Optional<SymbolEntry *> Symbol)			IndirectSymbolEntry(uint32_t OriginalIndex, Optional<SymbolEntry *> Symbol)
	: OriginalIndex(OriginalIndex), Symbol(Symbol) {}			: OriginalIndex(OriginalIndex), Symbol(Symbol) {}
	};			};

	struct IndirectSymbolTable {			struct IndirectSymbolTable {
	std::vector<IndirectSymbolEntry> Symbols;			std::vector<IndirectSymbolEntry> Symbols;
	};			};

	/// The location of the string table inside the binary is described by LC_SYMTAB			/// The location of the string table inside the binary is described by LC_SYMTAB
	/// load command.			/// load command.
	struct StringTable {			struct StringTable {
	std::vector<std::string> Strings;			std::vector<std::string> Strings;
	};			};

	struct RelocationInfo {			struct RelocationInfo {
	// The referenced symbol entry. Set if !Scattered && Extern.			// The referenced symbol entry. Set if !Scattered && Extern.
	Optional<const SymbolEntry *> Symbol;			Optional<const SymbolEntry *> Symbol;
	// The referenced section. Set if !Scattered && !Extern.			// The referenced section. Set if !Scattered && !Extern.
	Optional<const Section *> Sec;			Optional<const Section *> Sec;
	// True if Info is a scattered_relocation_info.			// True if Info is a scattered_relocation_info.
	bool Scattered;			bool Scattered;
	// True if the r_symbolnum points to a section number (i.e. r_extern=0).			// True if the r_symbolnum points to a section number (i.e. r_extern=0).
	bool Extern;			bool Extern;
	MachO::any_relocation_info Info;			MachO::any_relocation_info Info;

	unsigned getPlainRelocationSymbolNum(bool IsLittleEndian) {			unsigned getPlainRelocationSymbolNum(bool IsLittleEndian) {
	if (IsLittleEndian)			if (IsLittleEndian)
	return Info.r_word1 & 0xffffff;			return Info.r_word1 & 0xffffff;
	return Info.r_word1 >> 8;			return Info.r_word1 >> 8;
	}			}

	void setPlainRelocationSymbolNum(unsigned SymbolNum, bool IsLittleEndian) {			void setPlainRelocationSymbolNum(unsigned SymbolNum, bool IsLittleEndian) {
	assert(SymbolNum < (1 << 24) && "SymbolNum out of range");			assert(SymbolNum < (1 << 24) && "SymbolNum out of range");
	if (IsLittleEndian)			if (IsLittleEndian)
	Info.r_word1 = (Info.r_word1 & ~0x00ffffff) \| SymbolNum;			Info.r_word1 = (Info.r_word1 & ~0x00ffffff) \| SymbolNum;
	else			else
	Info.r_word1 = (Info.r_word1 & ~0xffffff00) \| (SymbolNum << 8);			Info.r_word1 = (Info.r_word1 & ~0xffffff00) \| (SymbolNum << 8);
	}			}
	};			};

	/// The location of the rebase info inside the binary is described by			/// The location of the rebase info inside the binary is described by
	/// LC_DYLD_INFO load command. Dyld rebases an image whenever dyld loads it at			/// LC_DYLD_INFO load command. Dyld rebases an image whenever dyld loads it at
	/// an address different from its preferred address. The rebase information is			/// an address different from its preferred address. The rebase information is
	/// a stream of byte sized opcodes whose symbolic names start with			/// a stream of byte sized opcodes whose symbolic names start with
	/// REBASE_OPCODE_. Conceptually the rebase information is a table of tuples:			/// REBASE_OPCODE_. Conceptually the rebase information is a table of tuples:
	/// <seg-index, seg-offset, type>			/// <seg-index, seg-offset, type>
	/// The opcodes are a compressed way to encode the table by only			/// The opcodes are a compressed way to encode the table by only
	/// encoding when a column changes. In addition simple patterns			/// encoding when a column changes. In addition simple patterns
	/// like "every n'th offset for m times" can be encoded in a few			/// like "every n'th offset for m times" can be encoded in a few
	/// bytes.			/// bytes.
	struct RebaseInfo {			struct RebaseInfo {
	// At the moment we do not parse this info (and it is simply copied over),			// At the moment we do not parse this info (and it is simply copied over),
	// but the proper support will be added later.			// but the proper support will be added later.
	ArrayRef<uint8_t> Opcodes;			ArrayRef<uint8_t> Opcodes;
	};			};

	/// The location of the bind info inside the binary is described by			/// The location of the bind info inside the binary is described by
	/// LC_DYLD_INFO load command. Dyld binds an image during the loading process,			/// LC_DYLD_INFO load command. Dyld binds an image during the loading process,
	/// if the image requires any pointers to be initialized to symbols in other			/// if the image requires any pointers to be initialized to symbols in other
	/// images. The bind information is a stream of byte sized opcodes whose			/// images. The bind information is a stream of byte sized opcodes whose
	/// symbolic names start with BIND_OPCODE_. Conceptually the bind information is			/// symbolic names start with BIND_OPCODE_. Conceptually the bind information is
	/// a table of tuples: <seg-index, seg-offset, type, symbol-library-ordinal,			/// a table of tuples: <seg-index, seg-offset, type, symbol-library-ordinal,
	/// symbol-name, addend> The opcodes are a compressed way to encode the table by			/// symbol-name, addend> The opcodes are a compressed way to encode the table by
	/// only encoding when a column changes. In addition simple patterns like for			/// only encoding when a column changes. In addition simple patterns like for
	/// runs of pointers initialized to the same value can be encoded in a few			/// runs of pointers initialized to the same value can be encoded in a few
	/// bytes.			/// bytes.
	struct BindInfo {			struct BindInfo {
	// At the moment we do not parse this info (and it is simply copied over),			// At the moment we do not parse this info (and it is simply copied over),
	// but the proper support will be added later.			// but the proper support will be added later.
	ArrayRef<uint8_t> Opcodes;			ArrayRef<uint8_t> Opcodes;
	};			};

	/// The location of the weak bind info inside the binary is described by			/// The location of the weak bind info inside the binary is described by
	/// LC_DYLD_INFO load command. Some C++ programs require dyld to unique symbols			/// LC_DYLD_INFO load command. Some C++ programs require dyld to unique symbols
	/// so that all images in the process use the same copy of some code/data. This			/// so that all images in the process use the same copy of some code/data. This
	/// step is done after binding. The content of the weak_bind info is an opcode			/// step is done after binding. The content of the weak_bind info is an opcode
	/// stream like the bind_info. But it is sorted alphabetically by symbol name.			/// stream like the bind_info. But it is sorted alphabetically by symbol name.
	/// This enable dyld to walk all images with weak binding information in order			/// This enable dyld to walk all images with weak binding information in order
	/// and look for collisions. If there are no collisions, dyld does no updating.			/// and look for collisions. If there are no collisions, dyld does no updating.
	/// That means that some fixups are also encoded in the bind_info. For			/// That means that some fixups are also encoded in the bind_info. For
	/// instance, all calls to "operator new" are first bound to libstdc++.dylib			/// instance, all calls to "operator new" are first bound to libstdc++.dylib
	/// using the information in bind_info. Then if some image overrides operator			/// using the information in bind_info. Then if some image overrides operator
	/// new that is detected when the weak_bind information is processed and the			/// new that is detected when the weak_bind information is processed and the
	/// call to operator new is then rebound.			/// call to operator new is then rebound.
	struct WeakBindInfo {			struct WeakBindInfo {
	// At the moment we do not parse this info (and it is simply copied over),			// At the moment we do not parse this info (and it is simply copied over),
	// but the proper support will be added later.			// but the proper support will be added later.
	ArrayRef<uint8_t> Opcodes;			ArrayRef<uint8_t> Opcodes;
	};			};

	/// The location of the lazy bind info inside the binary is described by			/// The location of the lazy bind info inside the binary is described by
	/// LC_DYLD_INFO load command. Some uses of external symbols do not need to be			/// LC_DYLD_INFO load command. Some uses of external symbols do not need to be
	/// bound immediately. Instead they can be lazily bound on first use. The			/// bound immediately. Instead they can be lazily bound on first use. The
	/// lazy_bind contains a stream of BIND opcodes to bind all lazy symbols. Normal			/// lazy_bind contains a stream of BIND opcodes to bind all lazy symbols. Normal
	/// use is that dyld ignores the lazy_bind section when loading an image.			/// use is that dyld ignores the lazy_bind section when loading an image.
	/// Instead the static linker arranged for the lazy pointer to initially point			/// Instead the static linker arranged for the lazy pointer to initially point
	/// to a helper function which pushes the offset into the lazy_bind area for the			/// to a helper function which pushes the offset into the lazy_bind area for the
	/// symbol needing to be bound, then jumps to dyld which simply adds the offset			/// symbol needing to be bound, then jumps to dyld which simply adds the offset
	/// to lazy_bind_off to get the information on what to bind.			/// to lazy_bind_off to get the information on what to bind.
	struct LazyBindInfo {			struct LazyBindInfo {
	ArrayRef<uint8_t> Opcodes;			ArrayRef<uint8_t> Opcodes;
	};			};

	/// The location of the export info inside the binary is described by			/// The location of the export info inside the binary is described by
	/// LC_DYLD_INFO load command. The symbols exported by a dylib are encoded in a			/// LC_DYLD_INFO load command. The symbols exported by a dylib are encoded in a
	/// trie. This is a compact representation that factors out common prefixes. It			/// trie. This is a compact representation that factors out common prefixes. It
	/// also reduces LINKEDIT pages in RAM because it encodes all information (name,			/// also reduces LINKEDIT pages in RAM because it encodes all information (name,
	/// address, flags) in one small, contiguous range. The export area is a stream			/// address, flags) in one small, contiguous range. The export area is a stream
	/// of nodes. The first node sequentially is the start node for the trie. Nodes			/// of nodes. The first node sequentially is the start node for the trie. Nodes
	/// for a symbol start with a uleb128 that is the length of the exported symbol			/// for a symbol start with a uleb128 that is the length of the exported symbol
	/// information for the string so far. If there is no exported symbol, the node			/// information for the string so far. If there is no exported symbol, the node
	/// starts with a zero byte. If there is exported info, it follows the length.			/// starts with a zero byte. If there is exported info, it follows the length.
	/// First is a uleb128 containing flags. Normally, it is followed by			/// First is a uleb128 containing flags. Normally, it is followed by
	/// a uleb128 encoded offset which is location of the content named			/// a uleb128 encoded offset which is location of the content named
	/// by the symbol from the mach_header for the image. If the flags			/// by the symbol from the mach_header for the image. If the flags
	/// is EXPORT_SYMBOL_FLAGS_REEXPORT, then following the flags is			/// is EXPORT_SYMBOL_FLAGS_REEXPORT, then following the flags is
	/// a uleb128 encoded library ordinal, then a zero terminated			/// a uleb128 encoded library ordinal, then a zero terminated
	/// UTF8 string. If the string is zero length, then the symbol			/// UTF8 string. If the string is zero length, then the symbol
	/// is re-export from the specified dylib with the same name.			/// is re-export from the specified dylib with the same name.
	/// If the flags is EXPORT_SYMBOL_FLAGS_STUB_AND_RESOLVER, then following			/// If the flags is EXPORT_SYMBOL_FLAGS_STUB_AND_RESOLVER, then following
	/// the flags is two uleb128s: the stub offset and the resolver offset.			/// the flags is two uleb128s: the stub offset and the resolver offset.
	/// The stub is used by non-lazy pointers. The resolver is used			/// The stub is used by non-lazy pointers. The resolver is used
	/// by lazy pointers and must be called to get the actual address to use.			/// by lazy pointers and must be called to get the actual address to use.
	/// After the optional exported symbol information is a byte of			/// After the optional exported symbol information is a byte of
	/// how many edges (0-255) that this node has leaving it,			/// how many edges (0-255) that this node has leaving it,
	/// followed by each edge.			/// followed by each edge.
	/// Each edge is a zero terminated UTF8 of the addition chars			/// Each edge is a zero terminated UTF8 of the addition chars
	/// in the symbol, followed by a uleb128 offset for the node that			/// in the symbol, followed by a uleb128 offset for the node that
	/// edge points to.			/// edge points to.
	struct ExportInfo {			struct ExportInfo {
	ArrayRef<uint8_t> Trie;			ArrayRef<uint8_t> Trie;
	};			};

	struct LinkData {			struct LinkData {
	ArrayRef<uint8_t> Data;			ArrayRef<uint8_t> Data;
	};			};

	struct Object {			struct Object {
	MachHeader Header;			MachHeader Header;
	std::vector<LoadCommand> LoadCommands;			std::vector<LoadCommand> LoadCommands;

	SymbolTable SymTable;			SymbolTable SymTable;
	StringTable StrTable;			StringTable StrTable;

	RebaseInfo Rebases;			RebaseInfo Rebases;
	BindInfo Binds;			BindInfo Binds;
	WeakBindInfo WeakBinds;			WeakBindInfo WeakBinds;
	LazyBindInfo LazyBinds;			LazyBindInfo LazyBinds;
	ExportInfo Exports;			ExportInfo Exports;
	IndirectSymbolTable IndirectSymTable;			IndirectSymbolTable IndirectSymTable;
	LinkData DataInCode;			LinkData DataInCode;
	LinkData FunctionStarts;			LinkData FunctionStarts;
	LinkData CodeSignature;			LinkData CodeSignature;

	Optional<uint32_t> SwiftVersion;			Optional<uint32_t> SwiftVersion;

	/// The index of LC_CODE_SIGNATURE load command if present.			/// The index of LC_CODE_SIGNATURE load command if present.
	Optional<size_t> CodeSignatureCommandIndex;			Optional<size_t> CodeSignatureCommandIndex;
	/// The index of LC_SYMTAB load command if present.			/// The index of LC_SYMTAB load command if present.
	Optional<size_t> SymTabCommandIndex;			Optional<size_t> SymTabCommandIndex;
	/// The index of LC_DYLD_INFO or LC_DYLD_INFO_ONLY load command if present.			/// The index of LC_DYLD_INFO or LC_DYLD_INFO_ONLY load command if present.
	Optional<size_t> DyLdInfoCommandIndex;			Optional<size_t> DyLdInfoCommandIndex;
	/// The index LC_DYSYMTAB load comamnd if present.			/// The index LC_DYSYMTAB load comamnd if present.
	Optional<size_t> DySymTabCommandIndex;			Optional<size_t> DySymTabCommandIndex;
	/// The index LC_DATA_IN_CODE load comamnd if present.			/// The index LC_DATA_IN_CODE load comamnd if present.
	Optional<size_t> DataInCodeCommandIndex;			Optional<size_t> DataInCodeCommandIndex;
	/// The index LC_FUNCTION_STARTS load comamnd if present.			/// The index LC_FUNCTION_STARTS load comamnd if present.
	Optional<size_t> FunctionStartsCommandIndex;			Optional<size_t> FunctionStartsCommandIndex;

	BumpPtrAllocator Alloc;			BumpPtrAllocator Alloc;
	StringSaver NewSectionsContents;			StringSaver NewSectionsContents;

	Object() : NewSectionsContents(Alloc) {}			Object() : NewSectionsContents(Alloc) {}

	Error			Error
	removeSections(function_ref<bool(const std::unique_ptr<Section> &)> ToRemove);			removeSections(function_ref<bool(const std::unique_ptr<Section> &)> ToRemove);

	Error removeLoadCommands(function_ref<bool(const LoadCommand &)> ToRemove);			Error removeLoadCommands(function_ref<bool(const LoadCommand &)> ToRemove);

	void updateLoadCommandIndexes();			void updateLoadCommandIndexes();

	/// Creates a new segment load command in the object and returns a reference			/// Creates a new segment load command in the object and returns a reference
	/// to the newly created load command. The caller should verify that SegName			/// to the newly created load command. The caller should verify that SegName
	/// is not too long (SegName.size() should be less than or equal to 16).			/// is not too long (SegName.size() should be less than or equal to 16).
	LoadCommand &addSegment(StringRef SegName);			LoadCommand &addSegment(StringRef SegName);

	bool is64Bit() const {			bool is64Bit() const {
	return Header.Magic == MachO::MH_MAGIC_64 \|\|			return Header.Magic == MachO::MH_MAGIC_64 \|\|
	Header.Magic == MachO::MH_CIGAM_64;			Header.Magic == MachO::MH_CIGAM_64;
	}			}

	uint64_t nextAvailableSegmentAddress() const;			uint64_t nextAvailableSegmentAddress() const;
	};			};

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_OBJCOPY_MACHO_OBJECT_H			#endif // LLVM_OBJCOPY_MACHO_OBJECT_H

llvm/lib/ObjCopy/MachO/Object.cpp

This file was moved from llvm/tools/llvm-objcopy/MachO/Object.cpp.

	//===- Object.cpp - Mach-O object file model --------------------- C++ --===//			//===- Object.cpp - Mach-O object file model --------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Object.h"			#include "Object.h"
	#include "llvm/ADT/SmallPtrSet.h"			#include "llvm/ADT/SmallPtrSet.h"
	#include <unordered_set>			#include <unordered_set>

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace macho {			namespace macho {

	const SymbolEntry *SymbolTable::getSymbolByIndex(uint32_t Index) const {			const SymbolEntry *SymbolTable::getSymbolByIndex(uint32_t Index) const {
	assert(Index < Symbols.size() && "invalid symbol index");			assert(Index < Symbols.size() && "invalid symbol index");
	return Symbols[Index].get();			return Symbols[Index].get();
	}			}

	SymbolEntry *SymbolTable::getSymbolByIndex(uint32_t Index) {			SymbolEntry *SymbolTable::getSymbolByIndex(uint32_t Index) {
	return const_cast<SymbolEntry *>(			return const_cast<SymbolEntry *>(
	static_cast<const SymbolTable *>(this)->getSymbolByIndex(Index));			static_cast<const SymbolTable *>(this)->getSymbolByIndex(Index));
	}			}

	void SymbolTable::removeSymbols(			void SymbolTable::removeSymbols(
	function_ref<bool(const std::unique_ptr<SymbolEntry> &)> ToRemove) {			function_ref<bool(const std::unique_ptr<SymbolEntry> &)> ToRemove) {
	Symbols.erase(			Symbols.erase(
	std::remove_if(std::begin(Symbols), std::end(Symbols), ToRemove),			std::remove_if(std::begin(Symbols), std::end(Symbols), ToRemove),
	std::end(Symbols));			std::end(Symbols));
	}			}

	void Object::updateLoadCommandIndexes() {			void Object::updateLoadCommandIndexes() {
	// Update indices of special load commands			// Update indices of special load commands
	for (size_t Index = 0, Size = LoadCommands.size(); Index < Size; ++Index) {			for (size_t Index = 0, Size = LoadCommands.size(); Index < Size; ++Index) {
	LoadCommand &LC = LoadCommands[Index];			LoadCommand &LC = LoadCommands[Index];
	switch (LC.MachOLoadCommand.load_command_data.cmd) {			switch (LC.MachOLoadCommand.load_command_data.cmd) {
	case MachO::LC_SYMTAB:			case MachO::LC_SYMTAB:
	SymTabCommandIndex = Index;			SymTabCommandIndex = Index;
	break;			break;
	case MachO::LC_DYSYMTAB:			case MachO::LC_DYSYMTAB:
	DySymTabCommandIndex = Index;			DySymTabCommandIndex = Index;
	break;			break;
	case MachO::LC_DYLD_INFO:			case MachO::LC_DYLD_INFO:
	case MachO::LC_DYLD_INFO_ONLY:			case MachO::LC_DYLD_INFO_ONLY:
	DyLdInfoCommandIndex = Index;			DyLdInfoCommandIndex = Index;
	break;			break;
	case MachO::LC_DATA_IN_CODE:			case MachO::LC_DATA_IN_CODE:
	DataInCodeCommandIndex = Index;			DataInCodeCommandIndex = Index;
	break;			break;
	case MachO::LC_FUNCTION_STARTS:			case MachO::LC_FUNCTION_STARTS:
	FunctionStartsCommandIndex = Index;			FunctionStartsCommandIndex = Index;
	break;			break;
	}			}
	}			}
	}			}

	Error Object::removeLoadCommands(			Error Object::removeLoadCommands(
	function_ref<bool(const LoadCommand &)> ToRemove) {			function_ref<bool(const LoadCommand &)> ToRemove) {
	auto It = std::stable_partition(			auto It = std::stable_partition(
	LoadCommands.begin(), LoadCommands.end(),			LoadCommands.begin(), LoadCommands.end(),
	[&](const LoadCommand &LC) { return !ToRemove(LC); });			[&](const LoadCommand &LC) { return !ToRemove(LC); });
	LoadCommands.erase(It, LoadCommands.end());			LoadCommands.erase(It, LoadCommands.end());

	updateLoadCommandIndexes();			updateLoadCommandIndexes();
	return Error::success();			return Error::success();
	}			}

	Error Object::removeSections(			Error Object::removeSections(
	function_ref<bool(const std::unique_ptr<Section> &)> ToRemove) {			function_ref<bool(const std::unique_ptr<Section> &)> ToRemove) {
	DenseMap<uint32_t, const Section *> OldIndexToSection;			DenseMap<uint32_t, const Section *> OldIndexToSection;
	uint32_t NextSectionIndex = 1;			uint32_t NextSectionIndex = 1;
	for (LoadCommand &LC : LoadCommands) {			for (LoadCommand &LC : LoadCommands) {
	auto It = std::stable_partition(			auto It = std::stable_partition(
	std::begin(LC.Sections), std::end(LC.Sections),			std::begin(LC.Sections), std::end(LC.Sections),
	[&](const std::unique_ptr<Section> &Sec) { return !ToRemove(Sec); });			[&](const std::unique_ptr<Section> &Sec) { return !ToRemove(Sec); });
	for (auto I = LC.Sections.begin(), End = It; I != End; ++I) {			for (auto I = LC.Sections.begin(), End = It; I != End; ++I) {
	OldIndexToSection[(*I)->Index] = I->get();			OldIndexToSection[(*I)->Index] = I->get();
	(*I)->Index = NextSectionIndex++;			(*I)->Index = NextSectionIndex++;
	}			}
	LC.Sections.erase(It, LC.Sections.end());			LC.Sections.erase(It, LC.Sections.end());
	}			}

	auto IsDead = [&](const std::unique_ptr<SymbolEntry> &S) -> bool {			auto IsDead = [&](const std::unique_ptr<SymbolEntry> &S) -> bool {
	Optional<uint32_t> Section = S->section();			Optional<uint32_t> Section = S->section();
	return (Section && !OldIndexToSection.count(*Section));			return (Section && !OldIndexToSection.count(*Section));
	};			};

	SmallPtrSet<const SymbolEntry *, 2> DeadSymbols;			SmallPtrSet<const SymbolEntry *, 2> DeadSymbols;
	for (const std::unique_ptr<SymbolEntry> &Sym : SymTable.Symbols)			for (const std::unique_ptr<SymbolEntry> &Sym : SymTable.Symbols)
	if (IsDead(Sym))			if (IsDead(Sym))
	DeadSymbols.insert(Sym.get());			DeadSymbols.insert(Sym.get());

	for (const LoadCommand &LC : LoadCommands)			for (const LoadCommand &LC : LoadCommands)
	for (const std::unique_ptr<Section> &Sec : LC.Sections)			for (const std::unique_ptr<Section> &Sec : LC.Sections)
	for (const RelocationInfo &R : Sec->Relocations)			for (const RelocationInfo &R : Sec->Relocations)
	if (R.Symbol && R.Symbol && DeadSymbols.count(R.Symbol))			if (R.Symbol && R.Symbol && DeadSymbols.count(R.Symbol))
	return createStringError(std::errc::invalid_argument,			return createStringError(std::errc::invalid_argument,
	"symbol '%s' defined in section with index "			"symbol '%s' defined in section with index "
	"'%u' cannot be removed because it is "			"'%u' cannot be removed because it is "
	"referenced by a relocation in section '%s'",			"referenced by a relocation in section '%s'",
	(*R.Symbol)->Name.c_str(),			(*R.Symbol)->Name.c_str(),
	((R.Symbol)->section()),			((R.Symbol)->section()),
	Sec->CanonicalName.c_str());			Sec->CanonicalName.c_str());
	SymTable.removeSymbols(IsDead);			SymTable.removeSymbols(IsDead);
	for (std::unique_ptr<SymbolEntry> &S : SymTable.Symbols)			for (std::unique_ptr<SymbolEntry> &S : SymTable.Symbols)
	if (S->section())			if (S->section())
	S->n_sect = OldIndexToSection[S->n_sect]->Index;			S->n_sect = OldIndexToSection[S->n_sect]->Index;
	return Error::success();			return Error::success();
	}			}

	uint64_t Object::nextAvailableSegmentAddress() const {			uint64_t Object::nextAvailableSegmentAddress() const {
	uint64_t HeaderSize =			uint64_t HeaderSize =
	is64Bit() ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);			is64Bit() ? sizeof(MachO::mach_header_64) : sizeof(MachO::mach_header);
	uint64_t Addr = HeaderSize + Header.SizeOfCmds;			uint64_t Addr = HeaderSize + Header.SizeOfCmds;
	for (const LoadCommand &LC : LoadCommands) {			for (const LoadCommand &LC : LoadCommands) {
	const MachO::macho_load_command &MLC = LC.MachOLoadCommand;			const MachO::macho_load_command &MLC = LC.MachOLoadCommand;
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	Addr = std::max(Addr,			Addr = std::max(Addr,
	static_cast<uint64_t>(MLC.segment_command_data.vmaddr) +			static_cast<uint64_t>(MLC.segment_command_data.vmaddr) +
	MLC.segment_command_data.vmsize);			MLC.segment_command_data.vmsize);
	break;			break;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	Addr = std::max(Addr, MLC.segment_command_64_data.vmaddr +			Addr = std::max(Addr, MLC.segment_command_64_data.vmaddr +
	MLC.segment_command_64_data.vmsize);			MLC.segment_command_64_data.vmsize);
	break;			break;
	default:			default:
	continue;			continue;
	}			}
	}			}
	return Addr;			return Addr;
	}			}

	template <typename SegmentType>			template <typename SegmentType>
	static void constructSegment(SegmentType &Seg,			static void constructSegment(SegmentType &Seg,
	llvm::MachO::LoadCommandType CmdType,			llvm::MachO::LoadCommandType CmdType,
	StringRef SegName, uint64_t SegVMAddr) {			StringRef SegName, uint64_t SegVMAddr) {
	assert(SegName.size() <= sizeof(Seg.segname) && "too long segment name");			assert(SegName.size() <= sizeof(Seg.segname) && "too long segment name");
	memset(&Seg, 0, sizeof(SegmentType));			memset(&Seg, 0, sizeof(SegmentType));
	Seg.cmd = CmdType;			Seg.cmd = CmdType;
	strncpy(Seg.segname, SegName.data(), SegName.size());			strncpy(Seg.segname, SegName.data(), SegName.size());
	Seg.maxprot \|=			Seg.maxprot \|=
	(MachO::VM_PROT_READ \| MachO::VM_PROT_WRITE \| MachO::VM_PROT_EXECUTE);			(MachO::VM_PROT_READ \| MachO::VM_PROT_WRITE \| MachO::VM_PROT_EXECUTE);
	Seg.initprot \|=			Seg.initprot \|=
	(MachO::VM_PROT_READ \| MachO::VM_PROT_WRITE \| MachO::VM_PROT_EXECUTE);			(MachO::VM_PROT_READ \| MachO::VM_PROT_WRITE \| MachO::VM_PROT_EXECUTE);
	Seg.vmaddr = SegVMAddr;			Seg.vmaddr = SegVMAddr;
	}			}

	LoadCommand &Object::addSegment(StringRef SegName) {			LoadCommand &Object::addSegment(StringRef SegName) {
	LoadCommand LC;			LoadCommand LC;
	const uint64_t SegVMAddr = nextAvailableSegmentAddress();			const uint64_t SegVMAddr = nextAvailableSegmentAddress();
	if (is64Bit())			if (is64Bit())
	constructSegment(LC.MachOLoadCommand.segment_command_64_data,			constructSegment(LC.MachOLoadCommand.segment_command_64_data,
	MachO::LC_SEGMENT_64, SegName, SegVMAddr);			MachO::LC_SEGMENT_64, SegName, SegVMAddr);
	else			else
	constructSegment(LC.MachOLoadCommand.segment_command_data,			constructSegment(LC.MachOLoadCommand.segment_command_data,
	MachO::LC_SEGMENT, SegName, SegVMAddr);			MachO::LC_SEGMENT, SegName, SegVMAddr);

	LoadCommands.push_back(std::move(LC));			LoadCommands.push_back(std::move(LC));
	return LoadCommands.back();			return LoadCommands.back();
	}			}

	/// Extracts a segment name from a string which is possibly non-null-terminated.			/// Extracts a segment name from a string which is possibly non-null-terminated.
	static StringRef extractSegmentName(const char *SegName) {			static StringRef extractSegmentName(const char *SegName) {
	return StringRef(SegName,			return StringRef(SegName,
	strnlen(SegName, sizeof(MachO::segment_command::segname)));			strnlen(SegName, sizeof(MachO::segment_command::segname)));
	}			}

	Optional<StringRef> LoadCommand::getSegmentName() const {			Optional<StringRef> LoadCommand::getSegmentName() const {
	const MachO::macho_load_command &MLC = MachOLoadCommand;			const MachO::macho_load_command &MLC = MachOLoadCommand;
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	return extractSegmentName(MLC.segment_command_data.segname);			return extractSegmentName(MLC.segment_command_data.segname);
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	return extractSegmentName(MLC.segment_command_64_data.segname);			return extractSegmentName(MLC.segment_command_64_data.segname);
	default:			default:
	return None;			return None;
	}			}
	}			}

	Optional<uint64_t> LoadCommand::getSegmentVMAddr() const {			Optional<uint64_t> LoadCommand::getSegmentVMAddr() const {
	const MachO::macho_load_command &MLC = MachOLoadCommand;			const MachO::macho_load_command &MLC = MachOLoadCommand;
	switch (MLC.load_command_data.cmd) {			switch (MLC.load_command_data.cmd) {
	case MachO::LC_SEGMENT:			case MachO::LC_SEGMENT:
	return MLC.segment_command_data.vmaddr;			return MLC.segment_command_data.vmaddr;
	case MachO::LC_SEGMENT_64:			case MachO::LC_SEGMENT_64:
	return MLC.segment_command_64_data.vmaddr;			return MLC.segment_command_64_data.vmaddr;
	default:			default:
	return None;			return None;
	}			}
	}			}

	} // end namespace macho			} // end namespace macho
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/ObjCopy.cpp

This file was added.

				//===- Util.cpp -----------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ObjCopy/ObjCopy.h"
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code +#include "Util.h" Lint: Pre-merge checks: clang-format: please reformat the code ``` +#include "Util.h" ```
				#include "llvm/ObjCopy/Buffer.h"
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: #includes are not sorted properly [llvm-include-order] not useful Lint: Pre-merge checks: clang-tidy: warning: #includes are not sorted properly [llvm-include-order] [[https://github.
				#include "llvm/ObjCopy/COFF/COFFObjcopy.h"
				#include "llvm/ObjCopy/ELF/ELFObjcopy.h"
				#include "llvm/ObjCopy/MachO/MachOObjcopy.h"
				#include "llvm/ObjCopy/wasm/WasmObjcopy.h"
				#include "llvm/Object/ArchiveWriter.h"
				#include "llvm/Object/COFF.h"
				#include "llvm/Object/ELFObjectFile.h"
				#include "llvm/Object/MachO.h"
				#include "llvm/Object/MachOUniversal.h"
				#include "llvm/Object/ObjectFile.h"
				#include "llvm/Object/Wasm.h"
				#include "llvm/Support/FileOutputBuffer.h"
				#include "llvm/Support/FileSystem.h"
				#include "llvm/Support/MemoryBuffer.h"
				#include "llvm/Support/Process.h"
				#include "Util.h"
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -#include "Util.h" Lint: Pre-merge checks: clang-format: please reformat the code ``` -#include "Util.h" ```
				#include <memory>

				using namespace llvm;
				using namespace llvm::object;

				namespace llvm {
				namespace objcopy {

				Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In,
				Buffer &Out) {
				if (auto *ELFBinary = dyn_cast<object::ELFObjectFileBase>(&In)) {
				if (Error E = Config.parseELFConfig())
				return E;
				return elf::executeObjcopyOnBinary(Config, *ELFBinary, Out);
				} else if (auto *COFFBinary = dyn_cast<object::COFFObjectFile>(&In))
				return coff::executeObjcopyOnBinary(Config, *COFFBinary, Out);
				else if (auto *MachOBinary = dyn_cast<object::MachOObjectFile>(&In))
				return macho::executeObjcopyOnBinary(Config, *MachOBinary, Out);
				else if (auto *MachOUniversalBinary =
				dyn_cast<object::MachOUniversalBinary>(&In))
				return macho::executeObjcopyOnMachOUniversalBinary(
				Config, *MachOUniversalBinary, Out);
				else if (auto *WasmBinary = dyn_cast<object::WasmObjectFile>(&In))
				return objcopy::wasm::executeObjcopyOnBinary(Config, *WasmBinary, Out);
				else
				return createStringError(object_error::invalid_file_type,
				"unsupported object file format");
				}

				Expected<std::vector<NewArchiveMember>>
				createNewArchiveMembers(CopyConfig &Config, const Archive &Ar) {
				std::vector<NewArchiveMember> NewArchiveMembers;
				Error Err = Error::success();
				for (const Archive::Child &Child : Ar.children(Err)) {
				Expected<StringRef> ChildNameOrErr = Child.getName();
				if (!ChildNameOrErr)
				return createFileError(Ar.getFileName(), ChildNameOrErr.takeError());

				Expected<std::unique_ptr<Binary>> ChildOrErr = Child.getAsBinary();
				if (!ChildOrErr)
				return createFileError(Ar.getFileName() + "(" + *ChildNameOrErr + ")",
				ChildOrErr.takeError());

				MemBuffer MB(ChildNameOrErr.get());
				if (Error E = executeObjcopyOnBinary(Config, *ChildOrErr->get(), MB))
				return std::move(E);

				Expected<NewArchiveMember> Member =
				NewArchiveMember::getOldMember(Child, Config.DeterministicArchives);
				if (!Member)
				return createFileError(Ar.getFileName(), Member.takeError());
				Member->Buf = MB.releaseMemoryBuffer();
				Member->MemberName = Member->Buf->getBufferIdentifier();
				NewArchiveMembers.push_back(std::move(*Member));
				}
				if (Err)
				return createFileError(Config.InputFilename, std::move(Err));
				return std::move(NewArchiveMembers);
				}

				// For regular archives this function simply calls llvm::writeArchive,
				// For thin archives it writes the archive file itself as well as its members.
				static Error deepWriteArchive(StringRef ArcName,
				ArrayRef<NewArchiveMember> NewMembers,
				bool WriteSymtab, object::Archive::Kind Kind,
				bool Deterministic, bool Thin) {
				if (Error E = writeArchive(ArcName, NewMembers, WriteSymtab, Kind,
				Deterministic, Thin))
				return createFileError(ArcName, std::move(E));

				if (!Thin)
				return Error::success();

				for (const NewArchiveMember &Member : NewMembers) {
				// Internally, FileBuffer will use the buffer created by
				// FileOutputBuffer::create, for regular files (that is the case for
				// deepWriteArchive) FileOutputBuffer::create will return OnDiskBuffer.
				// OnDiskBuffer uses a temporary file and then renames it. So in reality
				// there is no inefficiency / duplicated in-memory buffers in this case. For
				// now in-memory buffers can not be completely avoided since
				// NewArchiveMember still requires them even though writeArchive does not
				// write them on disk.
				FileBuffer FB(Member.MemberName);
				if (Error E = FB.allocate(Member.Buf->getBufferSize()))
				return E;
				std::copy(Member.Buf->getBufferStart(), Member.Buf->getBufferEnd(),
				FB.getBufferStart());
				if (Error E = FB.commit())
				return E;
				}
				return Error::success();
				}

				Error executeObjcopyOnArchive(CopyConfig &Config, const object::Archive &Ar) {
				Expected<std::vector<NewArchiveMember>> NewArchiveMembersOrErr =
				createNewArchiveMembers(Config, Ar);
				if (!NewArchiveMembersOrErr)
				return NewArchiveMembersOrErr.takeError();
				return deepWriteArchive(Config.OutputFilename, *NewArchiveMembersOrErr,
				Ar.hasSymbolTable(), Ar.kind(),
				Config.DeterministicArchives, Ar.isThin());
				}

				} // end namespace objcopy
				} // end namespace llvm

llvm/lib/ObjCopy/Util.h

This file was moved from llvm/tools/llvm-objcopy/llvm-objcopy.h.

	//===- llvm-objcopy.h -------------------------------------------- C++ --===//			//===- Util.h ---------------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_OBJCOPY_OBJCOPY_H			#ifndef LLVM_LIB_OBJCOPY_UTIL_H
	#define LLVM_TOOLS_OBJCOPY_OBJCOPY_H			#define LLVM_LIB_OBJCOPY_UTIL_H

	#include "llvm/Support/Error.h"			#include "llvm/ObjCopy/Buffer.h"
				#include "llvm/ObjCopy/CopyConfig.h"
				#include "llvm/Object/Archive.h"
				#include "llvm/Object/ArchiveWriter.h"

	namespace llvm {			namespace llvm {

	struct NewArchiveMember;

	namespace object {

	class Archive;

	} // end namespace object

	namespace objcopy {			namespace objcopy {
	struct CopyConfig;
	Expected<std::vector<NewArchiveMember>>			Expected<std::vector<NewArchiveMember>>
	createNewArchiveMembers(CopyConfig &Config, const object::Archive &Ar);			createNewArchiveMembers(CopyConfig &Config, const object::Archive &Ar);

	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_OBJCOPY_OBJCOPY_H			#endif // LLVM_LIB_OBJCOPY_UTIL_H

llvm/lib/ObjCopy/wasm/Object.h

This file was moved from llvm/tools/llvm-objcopy/wasm/Object.h.

	//===- Object.h -------------------------------------------------- C++ --===//			//===- Object.h -------------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_LLVM_OBJCOPY_WASM_OBJECT_H			#ifndef LLVM_LIB_OBJCOPY_WASM_OBJECT_H
	#define LLVM_TOOLS_LLVM_OBJCOPY_WASM_OBJECT_H			#define LLVM_LIB_OBJCOPY_WASM_OBJECT_H

	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/Object/Wasm.h"			#include "llvm/Object/Wasm.h"
	#include "llvm/Support/MemoryBuffer.h"			#include "llvm/Support/MemoryBuffer.h"
	#include <vector>			#include <vector>

	namespace llvm {			namespace llvm {
	Show All 20 Lines
	private:			private:
	std::vector<std::unique_ptr<MemoryBuffer>> OwnedContents;			std::vector<std::unique_ptr<MemoryBuffer>> OwnedContents;
	};			};

	} // end namespace wasm			} // end namespace wasm
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TOOLS_LLVM_OBJCOPY_WASM_OBJECT_H			#endif // LLVM_LIB_OBJCOPY_WASM_OBJECT_H

llvm/lib/ObjCopy/wasm/Object.cpp

This file was moved from llvm/tools/llvm-objcopy/wasm/Object.cpp.

	//===- Object.cpp ---------------------------------------------------------===//			//===- Object.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Object.h"			#include "Object.h"

	#include "llvm/Support/LEB128.h"			#include "llvm/Support/LEB128.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace wasm {			namespace wasm {

	using namespace object;			using namespace object;
	using namespace llvm::wasm;			using namespace llvm::wasm;

	void Object::addSectionWithOwnedContents(			void Object::addSectionWithOwnedContents(
	Section NewSection, std::unique_ptr<MemoryBuffer> &&Content) {			Section NewSection, std::unique_ptr<MemoryBuffer> &&Content) {
	Sections.push_back(NewSection);			Sections.push_back(NewSection);
	OwnedContents.emplace_back(std::move(Content));			OwnedContents.emplace_back(std::move(Content));
	}			}

	void Object::removeSections(function_ref<bool(const Section &)> ToRemove) {			void Object::removeSections(function_ref<bool(const Section &)> ToRemove) {
	// TODO: remove reloc sections for the removed section, handle symbols, etc.			// TODO: remove reloc sections for the removed section, handle symbols, etc.
	Sections.erase(			Sections.erase(
	std::remove_if(std::begin(Sections), std::end(Sections), ToRemove),			std::remove_if(std::begin(Sections), std::end(Sections), ToRemove),
	std::end(Sections));			std::end(Sections));
	}			}

	} // end namespace wasm			} // end namespace wasm
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/wasm/Reader.h

This file was moved from llvm/tools/llvm-objcopy/wasm/Reader.h.

//===- Reader.h -------------------------------------------------*- C++ -*-===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_LLVM_OBJCOPY_WASM_READER_H

#ifndef LLVM_LIB_LLVM_OBJCOPY_WASM_READER_H

Lint: Pre-merge checks

clang-tidy: warning: header guard does not follow preferred style [llvm-header-guard]
not useful

Lint: Pre-merge checks: clang-tidy: warning: header guard does not follow preferred style [llvm-header-guard] [[https…

jhendersonUnsubmitted

Not Done

//===----------------------------------------------------------------------===//

- #ifndef LLVM_LIB_LLVM_OBJCOPY_WASM_READER_H

+ #ifndef LLVM_LIB_OBJCOPY_WASM_READER_H

#define LLVM_LIB_LLVM_OBJCOPY_WASM_READER_H

jhenderson:

#define LLVM_TOOLS_LLVM_OBJCOPY_WASM_READER_H

#define LLVM_LIB_LLVM_OBJCOPY_WASM_READER_H

#include "Object.h"

namespace llvm {

namespace objcopy {

namespace wasm {

class Reader {

public:

explicit Reader(const object::WasmObjectFile &O) : WasmObj(O) {}

Expected<std::unique_ptr<Object>> create() const;

private:

const object::WasmObjectFile &WasmObj;

};

} // end namespace wasm

} // end namespace objcopy

} // end namespace llvm

#endif // LLVM_TOOLS_LLVM_OBJCOPY_WASM_READER_H

#endif // LLVM_LIB_LLVM_OBJCOPY_WASM_READER_H

llvm/lib/ObjCopy/wasm/Reader.cpp

This file was moved from llvm/tools/llvm-objcopy/wasm/Reader.cpp.

	//===- Reader.cpp ---------------------------------------------------------===//			//===- Reader.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Reader.h"			#include "Reader.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace wasm {			namespace wasm {

	using namespace object;			using namespace object;
	using namespace llvm::wasm;			using namespace llvm::wasm;

	Expected<std::unique_ptr<Object>> Reader::create() const {			Expected<std::unique_ptr<Object>> Reader::create() const {
	auto Obj = std::make_unique<Object>();			auto Obj = std::make_unique<Object>();
	Obj->Header = WasmObj.getHeader();			Obj->Header = WasmObj.getHeader();
	std::vector<Section> Sections;			std::vector<Section> Sections;
	Obj->Sections.reserve(WasmObj.getNumSections());			Obj->Sections.reserve(WasmObj.getNumSections());
	for (const SectionRef &Sec : WasmObj.sections()) {			for (const SectionRef &Sec : WasmObj.sections()) {
	const WasmSection &WS = WasmObj.getWasmSection(Sec);			const WasmSection &WS = WasmObj.getWasmSection(Sec);
	Obj->Sections.push_back(			Obj->Sections.push_back(
	{static_cast<uint8_t>(WS.Type), WS.Name, WS.Content});			{static_cast<uint8_t>(WS.Type), WS.Name, WS.Content});
	}			}
	return std::move(Obj);			return std::move(Obj);
	}			}

	} // end namespace wasm			} // end namespace wasm
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/lib/ObjCopy/wasm/WasmObjcopy.cpp

This file was moved from llvm/tools/llvm-objcopy/wasm/WasmObjcopy.cpp.

	//===- WasmObjcopy.cpp ----------------------------------------------------===//			//===- WasmObjcopy.cpp ----------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "WasmObjcopy.h"			#include "llvm/ObjCopy/wasm/WasmObjcopy.h"
	#include "Buffer.h"
	#include "CopyConfig.h"
	#include "Object.h"			#include "Object.h"
	#include "Reader.h"			#include "Reader.h"
	#include "Writer.h"			#include "Writer.h"
				#include "llvm/ObjCopy/Buffer.h"
				#include "llvm/ObjCopy/CopyConfig.h"
	#include "llvm/Support/Errc.h"			#include "llvm/Support/Errc.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace wasm {			namespace wasm {

	using namespace object;			using namespace object;

	▲ Show 20 Lines • Show All 91 Lines • Show Last 20 Lines

llvm/lib/ObjCopy/wasm/Writer.h

This file was moved from llvm/tools/llvm-objcopy/wasm/Writer.h.

//===- Writer.h -------------------------------------------------- C++ --===//		//===- Writer.h -------------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_LLVM_OBJCOPY_WASM_WRITER_H		#ifndef LLVM_LIB_OBJCOPY_WASM_WRITER_H
#define LLVM_TOOLS_LLVM_OBJCOPY_WASM_WRITER_H		#define LLVM_LIB_OBJCOPY_WASM_WRITER_H

#include "Buffer.h"
#include "Object.h"		#include "Object.h"
		#include "llvm/ObjCopy/Buffer.h"
#include <cstdint>		#include <cstdint>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {
namespace objcopy {		namespace objcopy {
namespace wasm {		namespace wasm {

class Writer {		class Writer {
Show All 20 Lines	static SectionHeader createSectionHeader(const Section &S,
size_t &SectionSize);		size_t &SectionSize);
size_t finalize();		size_t finalize();
};		};

} // end namespace wasm		} // end namespace wasm
} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_TOOLS_LLVM_OBJCOPY_WASM_WRITER_H		#endif // LLVM_LIB_OBJCOPY_WASM_WRITER_H

llvm/lib/ObjCopy/wasm/Writer.cpp

This file was moved from llvm/tools/llvm-objcopy/wasm/Writer.cpp.

	//===- Writer.cpp ---------------------------------------------------------===//			//===- Writer.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Writer.h"			#include "Writer.h"
	#include "llvm/BinaryFormat/Wasm.h"			#include "llvm/BinaryFormat/Wasm.h"
	#include "llvm/Support/Endian.h"			#include "llvm/Support/Endian.h"
	#include "llvm/Support/LEB128.h"			#include "llvm/Support/LEB128.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	namespace llvm {			namespace llvm {
	namespace objcopy {			namespace objcopy {
	namespace wasm {			namespace wasm {

	using namespace object;			using namespace object;
	using namespace llvm::wasm;			using namespace llvm::wasm;

	Writer::SectionHeader Writer::createSectionHeader(const Section &S,			Writer::SectionHeader Writer::createSectionHeader(const Section &S,
	size_t &SectionSize) {			size_t &SectionSize) {
	SectionHeader Header;			SectionHeader Header;
	raw_svector_ostream OS(Header);			raw_svector_ostream OS(Header);
	OS << S.SectionType;			OS << S.SectionType;
	bool HasName = S.SectionType == WASM_SEC_CUSTOM;			bool HasName = S.SectionType == WASM_SEC_CUSTOM;
	SectionSize = S.Contents.size();			SectionSize = S.Contents.size();
	if (HasName)			if (HasName)
	SectionSize += getULEB128Size(S.Name.size()) + S.Name.size();			SectionSize += getULEB128Size(S.Name.size()) + S.Name.size();
	// Pad the LEB value out to 5 bytes to make it a predictable size, and			// Pad the LEB value out to 5 bytes to make it a predictable size, and
	// match the behavior of clang.			// match the behavior of clang.
	encodeULEB128(SectionSize, OS, 5);			encodeULEB128(SectionSize, OS, 5);
	if (HasName) {			if (HasName) {
	encodeULEB128(S.Name.size(), OS);			encodeULEB128(S.Name.size(), OS);
	OS << S.Name;			OS << S.Name;
	}			}
	// Total section size is the content size plus 1 for the section type and			// Total section size is the content size plus 1 for the section type and
	// 5 for the LEB-encoded size.			// 5 for the LEB-encoded size.
	SectionSize = SectionSize + 1 + 5;			SectionSize = SectionSize + 1 + 5;
	return Header;			return Header;
	}			}

	size_t Writer::finalize() {			size_t Writer::finalize() {
	size_t ObjectSize = sizeof(WasmMagic) + sizeof(WasmVersion);			size_t ObjectSize = sizeof(WasmMagic) + sizeof(WasmVersion);
	SectionHeaders.reserve(Obj.Sections.size());			SectionHeaders.reserve(Obj.Sections.size());
	// Finalize the headers of each section so we know the total size.			// Finalize the headers of each section so we know the total size.
	for (const Section &S : Obj.Sections) {			for (const Section &S : Obj.Sections) {
	size_t SectionSize;			size_t SectionSize;
	SectionHeaders.push_back(createSectionHeader(S, SectionSize));			SectionHeaders.push_back(createSectionHeader(S, SectionSize));
	ObjectSize += SectionSize;			ObjectSize += SectionSize;
	}			}
	return ObjectSize;			return ObjectSize;
	}			}

	Error Writer::write() {			Error Writer::write() {
	size_t FileSize = finalize();			size_t FileSize = finalize();
	if (Error E = Buf.allocate(FileSize))			if (Error E = Buf.allocate(FileSize))
	return E;			return E;

	// Write the header.			// Write the header.
	uint8_t *Ptr = Buf.getBufferStart();			uint8_t *Ptr = Buf.getBufferStart();
	Ptr = std::copy(Obj.Header.Magic.begin(), Obj.Header.Magic.end(), Ptr);			Ptr = std::copy(Obj.Header.Magic.begin(), Obj.Header.Magic.end(), Ptr);
	support::endian::write32le(Ptr, Obj.Header.Version);			support::endian::write32le(Ptr, Obj.Header.Version);
	Ptr += sizeof(Obj.Header.Version);			Ptr += sizeof(Obj.Header.Version);

	// Write each section.			// Write each section.
	for (size_t I = 0, S = SectionHeaders.size(); I < S; ++I) {			for (size_t I = 0, S = SectionHeaders.size(); I < S; ++I) {
	Ptr = std::copy(SectionHeaders[I].begin(), SectionHeaders[I].end(), Ptr);			Ptr = std::copy(SectionHeaders[I].begin(), SectionHeaders[I].end(), Ptr);
	ArrayRef<uint8_t> Contents = Obj.Sections[I].Contents;			ArrayRef<uint8_t> Contents = Obj.Sections[I].Contents;
	Ptr = std::copy(Contents.begin(), Contents.end(), Ptr);			Ptr = std::copy(Contents.begin(), Contents.end(), Ptr);
	}			}
	return Buf.commit();			return Buf.commit();
	}			}

	} // end namespace wasm			} // end namespace wasm
	} // end namespace objcopy			} // end namespace objcopy
	} // end namespace llvm			} // end namespace llvm

llvm/tools/llvm-objcopy/Buffer.h

This file was moved to llvm/include/llvm/ObjCopy/Buffer.h.

llvm/tools/llvm-objcopy/Buffer.cpp

This file was moved to llvm/lib/ObjCopy/Buffer.cpp.

llvm/tools/llvm-objcopy/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	Object			Object
				ObjCopy
	Option			Option
	Support			Support
	MC			MC
	)			)

	set(LLVM_TARGET_DEFINITIONS ObjcopyOpts.td)			set(LLVM_TARGET_DEFINITIONS ObjcopyOpts.td)
	tablegen(LLVM ObjcopyOpts.inc -gen-opt-parser-defs)			tablegen(LLVM ObjcopyOpts.inc -gen-opt-parser-defs)
	add_public_tablegen_target(ObjcopyOptsTableGen)			add_public_tablegen_target(ObjcopyOptsTableGen)

	set(LLVM_TARGET_DEFINITIONS InstallNameToolOpts.td)			set(LLVM_TARGET_DEFINITIONS InstallNameToolOpts.td)
	tablegen(LLVM InstallNameToolOpts.inc -gen-opt-parser-defs)			tablegen(LLVM InstallNameToolOpts.inc -gen-opt-parser-defs)
	add_public_tablegen_target(InstallNameToolOptsTableGen)			add_public_tablegen_target(InstallNameToolOptsTableGen)

	set(LLVM_TARGET_DEFINITIONS BitcodeStripOpts.td)			set(LLVM_TARGET_DEFINITIONS BitcodeStripOpts.td)
	tablegen(LLVM BitcodeStripOpts.inc -gen-opt-parser-defs)			tablegen(LLVM BitcodeStripOpts.inc -gen-opt-parser-defs)
	add_public_tablegen_target(BitcodeStripOptsTableGen)			add_public_tablegen_target(BitcodeStripOptsTableGen)

	set(LLVM_TARGET_DEFINITIONS StripOpts.td)			set(LLVM_TARGET_DEFINITIONS StripOpts.td)
	tablegen(LLVM StripOpts.inc -gen-opt-parser-defs)			tablegen(LLVM StripOpts.inc -gen-opt-parser-defs)
	add_public_tablegen_target(StripOptsTableGen)			add_public_tablegen_target(StripOptsTableGen)

	add_llvm_tool(llvm-objcopy			add_llvm_tool(llvm-objcopy
	Buffer.cpp
	CopyConfig.cpp
	llvm-objcopy.cpp			llvm-objcopy.cpp
	COFF/COFFObjcopy.cpp			ParseConfig.cpp
	COFF/Object.cpp
	COFF/Reader.cpp
	COFF/Writer.cpp
	ELF/ELFConfig.cpp
	ELF/ELFObjcopy.cpp
	ELF/Object.cpp
	MachO/MachOObjcopy.cpp
	MachO/MachOReader.cpp
	MachO/MachOWriter.cpp
	MachO/MachOLayoutBuilder.cpp
	MachO/Object.cpp
	wasm/Object.cpp
	wasm/Reader.cpp
	wasm/Writer.cpp
	wasm/WasmObjcopy.cpp
	DEPENDS			DEPENDS
	ObjcopyOptsTableGen			ObjcopyOptsTableGen
	InstallNameToolOptsTableGen			InstallNameToolOptsTableGen
	StripOptsTableGen			StripOptsTableGen
	)			)

	add_llvm_tool_symlink(llvm-install-name-tool llvm-objcopy)			add_llvm_tool_symlink(llvm-install-name-tool llvm-objcopy)
	add_llvm_tool_symlink(llvm-bitcode-strip llvm-objcopy)			add_llvm_tool_symlink(llvm-bitcode-strip llvm-objcopy)
	Show All 11 Lines

llvm/tools/llvm-objcopy/COFF/COFFObjcopy.h

This file was moved to llvm/include/llvm/ObjCopy/COFF/COFFObjcopy.h.

llvm/tools/llvm-objcopy/COFF/COFFObjcopy.cpp

This file was moved to llvm/lib/ObjCopy/COFF/COFFObjcopy.cpp.

llvm/tools/llvm-objcopy/COFF/Object.h

This file was moved to llvm/lib/ObjCopy/COFF/Object.h.

llvm/tools/llvm-objcopy/COFF/Object.cpp

This file was moved to llvm/lib/ObjCopy/COFF/Object.cpp.

	//===- Object.cpp ---------------------------------------------------------===//
	//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//
	//===----------------------------------------------------------------------===//

	#include "Object.h"
	#include "llvm/ADT/DenseSet.h"
	#include <algorithm>

	namespace llvm {
	namespace objcopy {
	namespace coff {

	using namespace object;

	void Object::addSymbols(ArrayRef<Symbol> NewSymbols) {
	for (Symbol S : NewSymbols) {
	S.UniqueId = NextSymbolUniqueId++;
	Symbols.emplace_back(S);
	}
	updateSymbols();
	}

	void Object::updateSymbols() {
	SymbolMap = DenseMap<size_t, Symbol *>(Symbols.size());
	for (Symbol &Sym : Symbols)
	SymbolMap[Sym.UniqueId] = &Sym;
	}

	const Symbol *Object::findSymbol(size_t UniqueId) const {
	auto It = SymbolMap.find(UniqueId);
	if (It == SymbolMap.end())
	return nullptr;
	return It->second;
	}

	Error Object::removeSymbols(
	function_ref<Expected<bool>(const Symbol &)> ToRemove) {
	Error Errs = Error::success();
	Symbols.erase(std::remove_if(std::begin(Symbols), std::end(Symbols),
	[ToRemove, &Errs](const Symbol &Sym) {
	Expected<bool> ShouldRemove = ToRemove(Sym);
	if (!ShouldRemove) {
	Errs = joinErrors(std::move(Errs),
	ShouldRemove.takeError());
	return false;
	}
	return *ShouldRemove;
	}),
	std::end(Symbols));

	updateSymbols();
	return Errs;
	}

	Error Object::markSymbols() {
	for (Symbol &Sym : Symbols)
	Sym.Referenced = false;
	for (const Section &Sec : Sections) {
	for (const Relocation &R : Sec.Relocs) {
	auto It = SymbolMap.find(R.Target);
	if (It == SymbolMap.end())
	return createStringError(object_error::invalid_symbol_index,
	"relocation target %zu not found", R.Target);
	It->second->Referenced = true;
	}
	}
	return Error::success();
	}

	void Object::addSections(ArrayRef<Section> NewSections) {
	for (Section S : NewSections) {
	S.UniqueId = NextSectionUniqueId++;
	Sections.emplace_back(S);
	}
	updateSections();
	}

	void Object::updateSections() {
	SectionMap = DenseMap<ssize_t, Section *>(Sections.size());
	size_t Index = 1;
	for (Section &S : Sections) {
	SectionMap[S.UniqueId] = &S;
	S.Index = Index++;
	}
	}

	const Section *Object::findSection(ssize_t UniqueId) const {
	auto It = SectionMap.find(UniqueId);
	if (It == SectionMap.end())
	return nullptr;
	return It->second;
	}

	void Object::removeSections(function_ref<bool(const Section &)> ToRemove) {
	DenseSet<ssize_t> AssociatedSections;
	auto RemoveAssociated = [&AssociatedSections](const Section &Sec) {
	return AssociatedSections.count(Sec.UniqueId) == 1;
	};
	do {
	DenseSet<ssize_t> RemovedSections;
	Sections.erase(
	std::remove_if(std::begin(Sections), std::end(Sections),
	[ToRemove, &RemovedSections](const Section &Sec) {
	bool Remove = ToRemove(Sec);
	if (Remove)
	RemovedSections.insert(Sec.UniqueId);
	return Remove;
	}),
	std::end(Sections));
	// Remove all symbols referring to the removed sections.
	AssociatedSections.clear();
	Symbols.erase(
	std::remove_if(
	std::begin(Symbols), std::end(Symbols),
	[&RemovedSections, &AssociatedSections](const Symbol &Sym) {
	// If there are sections that are associative to a removed
	// section,
	// remove those as well as nothing will include them (and we can't
	// leave them dangling).
	if (RemovedSections.count(Sym.AssociativeComdatTargetSectionId) ==
	1)
	AssociatedSections.insert(Sym.TargetSectionId);
	return RemovedSections.count(Sym.TargetSectionId) == 1;
	}),
	std::end(Symbols));
	ToRemove = RemoveAssociated;
	} while (!AssociatedSections.empty());
	updateSections();
	updateSymbols();
	}

	void Object::truncateSections(function_ref<bool(const Section &)> ToTruncate) {
	for (Section &Sec : Sections) {
	if (ToTruncate(Sec)) {
	Sec.clearContents();
	Sec.Relocs.clear();
	Sec.Header.SizeOfRawData = 0;
	}
	}
	}

	} // end namespace coff
	} // end namespace objcopy
	} // end namespace llvm

llvm/tools/llvm-objcopy/COFF/Reader.h

This file was moved to llvm/lib/ObjCopy/COFF/Reader.h.

llvm/tools/llvm-objcopy/COFF/Reader.cpp

This file was moved to llvm/lib/ObjCopy/COFF/Reader.cpp.

llvm/tools/llvm-objcopy/COFF/Writer.h

This file was moved to llvm/lib/ObjCopy/COFF/Writer.h.

llvm/tools/llvm-objcopy/COFF/Writer.cpp

This file was moved to llvm/lib/ObjCopy/COFF/Writer.cpp.

llvm/tools/llvm-objcopy/CopyConfig.h

This file was moved to llvm/include/llvm/ObjCopy/CopyConfig.h.

llvm/tools/llvm-objcopy/CopyConfig.cpp

This file was moved to llvm/tools/llvm-objcopy/ParseConfig.cpp.

llvm/tools/llvm-objcopy/ELF/ELFConfig.h

This file was moved to llvm/include/llvm/ObjCopy/ELF/ELFConfig.h.

llvm/tools/llvm-objcopy/ELF/ELFConfig.cpp

This file was moved to llvm/lib/ObjCopy/ELF/ELFConfig.cpp.

llvm/tools/llvm-objcopy/ELF/ELFObjcopy.h

This file was moved to llvm/include/llvm/ObjCopy/ELF/ELFObjcopy.h.

llvm/tools/llvm-objcopy/ELF/ELFObjcopy.cpp

This file was moved to llvm/lib/ObjCopy/ELF/ELFObjcopy.cpp.

llvm/tools/llvm-objcopy/ELF/Object.h

This file was moved to llvm/lib/ObjCopy/ELF/Object.h.

llvm/tools/llvm-objcopy/ELF/Object.cpp

This file was moved to llvm/lib/ObjCopy/ELF/Object.cpp.

llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.h

This file was moved to llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.h.

llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.cpp

This file was moved to llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.cpp.

llvm/tools/llvm-objcopy/MachO/MachOObjcopy.h

This file was moved to llvm/include/llvm/ObjCopy/MachO/MachOObjcopy.h.

llvm/tools/llvm-objcopy/MachO/MachOObjcopy.cpp

This file was moved to llvm/lib/ObjCopy/MachO/MachOObjcopy.cpp.

llvm/tools/llvm-objcopy/MachO/MachOReader.h

This file was moved to llvm/lib/ObjCopy/MachO/MachOReader.h.

llvm/tools/llvm-objcopy/MachO/MachOReader.cpp

This file was moved to llvm/lib/ObjCopy/MachO/MachOReader.cpp.

llvm/tools/llvm-objcopy/MachO/MachOWriter.h

This file was moved to llvm/lib/ObjCopy/MachO/MachOWriter.h.

llvm/tools/llvm-objcopy/MachO/MachOWriter.cpp

This file was moved to llvm/lib/ObjCopy/MachO/MachOWriter.cpp.

llvm/tools/llvm-objcopy/MachO/Object.h

This file was moved to llvm/lib/ObjCopy/MachO/Object.h.

llvm/tools/llvm-objcopy/MachO/Object.cpp

This file was moved to llvm/lib/ObjCopy/MachO/Object.cpp.

llvm/tools/llvm-objcopy/ParseConfig.h

This file was added.

				//===- ParseConfig.h ------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_TOOLS_LLVM_OBJCOPY_PARSECONFIG_H
				#define LLVM_TOOLS_LLVM_OBJCOPY_PARSECONFIG_H

				#include "llvm/ADT/SmallVector.h"
				#include "llvm/ObjCopy/CopyConfig.h"
				#include "llvm/Support/Allocator.h"
				#include "llvm/Support/Error.h"
				#include "llvm/Support/GlobPattern.h"
				#include "llvm/Support/Regex.h"

				namespace llvm {
				namespace objcopy {

				// Configuration for the overall invocation of this tool. When invoked as
				// objcopy, will always contain exactly one CopyConfig. When invoked as strip,
				// will contain one or more CopyConfigs.
				struct DriverConfig {
				SmallVector<CopyConfig, 1> CopyConfigs;
				BumpPtrAllocator Alloc;
				};

				// ParseObjcopyOptions returns the config and sets the input arguments. If a
				// help flag is set then ParseObjcopyOptions will print the help messege and
				// exit. ErrorCallback is used to handle recoverable errors. An Error returned
				// by the callback aborts the parsing and is then returned by this function.
				Expected<DriverConfig>
				parseObjcopyOptions(ArrayRef<const char *> ArgsArr,
				llvm::function_ref<Error(Error)> ErrorCallback);

				// ParseInstallNameToolOptions returns the config and sets the input arguments.
				// If a help flag is set then ParseInstallNameToolOptions will print the help
				// messege and exit.
				Expected<DriverConfig>
				parseInstallNameToolOptions(ArrayRef<const char *> ArgsArr);

				// ParseBitcodeStripOptions returns the config and sets the input arguments.
				// If a help flag is set then ParseBitcodeStripOptions will print the help
				// messege and exit.
				Expected<DriverConfig> parseBitcodeStripOptions(ArrayRef<const char *> ArgsArr);

				// ParseStripOptions returns the config and sets the input arguments. If a
				// help flag is set then ParseStripOptions will print the help messege and
				// exit. ErrorCallback is used to handle recoverable errors. An Error returned
				// by the callback aborts the parsing and is then returned by this function.
				Expected<DriverConfig>
				parseStripOptions(ArrayRef<const char *> ArgsArr,
				llvm::function_ref<Error(Error)> ErrorCallback);

				} // namespace objcopy
				} // namespace llvm

				#endif // LLVM_TOOLS_LLVM_OBJCOPY_PARSECONFIG_H

llvm/tools/llvm-objcopy/ParseConfig.cpp

This file was moved from llvm/tools/llvm-objcopy/CopyConfig.cpp.

//===- CopyConfig.cpp -----------------------------------------------------===//		//===- ParseConfig.cpp ----------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "CopyConfig.h"		#include "ParseConfig.h"

#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
#include "llvm/Option/Arg.h"		#include "llvm/Option/Arg.h"
#include "llvm/Option/ArgList.h"		#include "llvm/Option/ArgList.h"
#include "llvm/Support/CRC.h"		#include "llvm/Support/CRC.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
▲ Show 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	return createStringError(
"bad format for --set-section-alignment: missing '='");		"bad format for --set-section-alignment: missing '='");
auto Split = StringRef(FlagValue).split('=');		auto Split = StringRef(FlagValue).split('=');
if (Split.first.empty())		if (Split.first.empty())
return createStringError(		return createStringError(
errc::invalid_argument,		errc::invalid_argument,
"bad format for --set-section-alignment: missing section name");		"bad format for --set-section-alignment: missing section name");
uint64_t NewAlign;		uint64_t NewAlign;
if (Split.second.getAsInteger(0, NewAlign))		if (Split.second.getAsInteger(0, NewAlign))
return createStringError(errc::invalid_argument,		return createStringError(
		errc::invalid_argument,
"invalid alignment for --set-section-alignment: '%s'",		"invalid alignment for --set-section-alignment: '%s'",
Split.second.str().c_str());		Split.second.str().c_str());
return std::make_pair(Split.first, NewAlign);		return std::make_pair(Split.first, NewAlign);
}		}

static Expected<SectionFlagsUpdate>		static Expected<SectionFlagsUpdate>
parseSetSectionFlagValue(StringRef FlagValue) {		parseSetSectionFlagValue(StringRef FlagValue) {
if (!StringRef(FlagValue).contains('='))		if (!StringRef(FlagValue).contains('='))
return createStringError(errc::invalid_argument,		return createStringError(errc::invalid_argument,
"bad format for --set-section-flags: missing '='");		"bad format for --set-section-flags: missing '='");
▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	if (!TrimmedLine.empty())
if (Error E = Symbols.addMatcher(NameOrPattern::create(		if (Error E = Symbols.addMatcher(NameOrPattern::create(
Saver.save(TrimmedLine), MS, ErrorCallback)))		Saver.save(TrimmedLine), MS, ErrorCallback)))
return E;		return E;
}		}

return Error::success();		return Error::success();
}		}

Expected<NameOrPattern>
NameOrPattern::create(StringRef Pattern, MatchStyle MS,
llvm::function_ref<Error(Error)> ErrorCallback) {
switch (MS) {
case MatchStyle::Literal:
return NameOrPattern(Pattern);
case MatchStyle::Wildcard: {
SmallVector<char, 32> Data;
bool IsPositiveMatch = true;
if (Pattern[0] == '!') {
IsPositiveMatch = false;
Pattern = Pattern.drop_front();
}
Expected<GlobPattern> GlobOrErr = GlobPattern::create(Pattern);

// If we couldn't create it as a glob, report the error, but try again with
// a literal if the error reporting is non-fatal.
if (!GlobOrErr) {
if (Error E = ErrorCallback(GlobOrErr.takeError()))
return std::move(E);
return create(Pattern, MatchStyle::Literal, ErrorCallback);
}

return NameOrPattern(std::make_shared<GlobPattern>(*GlobOrErr),
IsPositiveMatch);
}
case MatchStyle::Regex: {
SmallVector<char, 32> Data;
return NameOrPattern(std::make_shared<Regex>(
("^" + Pattern.ltrim('^').rtrim('$') + "$").toStringRef(Data)));
}
}
llvm_unreachable("Unhandled llvm.objcopy.MatchStyle enum");
}

static Error addSymbolsToRenameFromFile(StringMap<StringRef> &SymbolsToRename,		static Error addSymbolsToRenameFromFile(StringMap<StringRef> &SymbolsToRename,
BumpPtrAllocator &Alloc,		BumpPtrAllocator &Alloc,
StringRef Filename) {		StringRef Filename) {
StringSaver Saver(Alloc);		StringSaver Saver(Alloc);
SmallVector<StringRef, 16> Lines;		SmallVector<StringRef, 16> Lines;
auto BufOrErr = MemoryBuffer::getFile(Filename);		auto BufOrErr = MemoryBuffer::getFile(Filename);
if (!BufOrErr)		if (!BufOrErr)
return createFileError(Filename, BufOrErr.getError());		return createFileError(Filename, BufOrErr.getError());
▲ Show 20 Lines • Show All 756 Lines • Show Last 20 Lines

llvm/tools/llvm-objcopy/llvm-objcopy.h

This file was moved to llvm/lib/ObjCopy/Util.h.

llvm/tools/llvm-objcopy/llvm-objcopy.cpp

//===- llvm-objcopy.cpp ---------------------------------------------------===//		//===- llvm-objcopy.cpp ---------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Buffer.h"		#include "ParseConfig.h"
		jhendersonUnsubmitted Not Done Reply Inline Actions Delete this blank line then resort the headers. jhenderson: Delete this blank line then resort the headers.
#include "COFF/COFFObjcopy.h"
#include "CopyConfig.h"
#include "ELF/ELFObjcopy.h"
#include "MachO/MachOObjcopy.h"
#include "wasm/WasmObjcopy.h"

#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
		#include "llvm/ObjCopy/Buffer.h"
		#include "llvm/ObjCopy/COFF/COFFObjcopy.h"
		#include "llvm/ObjCopy/CopyConfig.h"
		#include "llvm/ObjCopy/ELF/ELFObjcopy.h"
		#include "llvm/ObjCopy/MachO/MachOObjcopy.h"
		#include "llvm/ObjCopy/ObjCopy.h"
		#include "llvm/ObjCopy/wasm/WasmObjcopy.h"
#include "llvm/Object/Archive.h"		#include "llvm/Object/Archive.h"
#include "llvm/Object/ArchiveWriter.h"		#include "llvm/Object/ArchiveWriter.h"
#include "llvm/Object/Binary.h"		#include "llvm/Object/Binary.h"
#include "llvm/Object/COFF.h"		#include "llvm/Object/COFF.h"
#include "llvm/Object/ELFObjectFile.h"		#include "llvm/Object/ELFObjectFile.h"
#include "llvm/Object/ELFTypes.h"		#include "llvm/Object/ELFTypes.h"
#include "llvm/Object/Error.h"		#include "llvm/Object/Error.h"
#include "llvm/Object/MachO.h"		#include "llvm/Object/MachO.h"
Show All 37 Lines

} // end namespace objcopy		} // end namespace objcopy
} // end namespace llvm		} // end namespace llvm

using namespace llvm;		using namespace llvm;
using namespace llvm::object;		using namespace llvm::object;
using namespace llvm::objcopy;		using namespace llvm::objcopy;

// For regular archives this function simply calls llvm::writeArchive,
// For thin archives it writes the archive file itself as well as its members.
static Error deepWriteArchive(StringRef ArcName,
ArrayRef<NewArchiveMember> NewMembers,
bool WriteSymtab, object::Archive::Kind Kind,
bool Deterministic, bool Thin) {
if (Error E = writeArchive(ArcName, NewMembers, WriteSymtab, Kind,
Deterministic, Thin))
return createFileError(ArcName, std::move(E));

if (!Thin)
return Error::success();

for (const NewArchiveMember &Member : NewMembers) {
// Internally, FileBuffer will use the buffer created by
// FileOutputBuffer::create, for regular files (that is the case for
// deepWriteArchive) FileOutputBuffer::create will return OnDiskBuffer.
// OnDiskBuffer uses a temporary file and then renames it. So in reality
// there is no inefficiency / duplicated in-memory buffers in this case. For
// now in-memory buffers can not be completely avoided since
// NewArchiveMember still requires them even though writeArchive does not
// write them on disk.
FileBuffer FB(Member.MemberName);
if (Error E = FB.allocate(Member.Buf->getBufferSize()))
return E;
std::copy(Member.Buf->getBufferStart(), Member.Buf->getBufferEnd(),
FB.getBufferStart());
if (Error E = FB.commit())
return E;
}
return Error::success();
}

/// The function executeObjcopyOnIHex does the dispatch based on the format		/// The function executeObjcopyOnIHex does the dispatch based on the format
/// of the output specified by the command line options.		/// of the output specified by the command line options.
static Error executeObjcopyOnIHex(CopyConfig &Config, MemoryBuffer &In,		static Error executeObjcopyOnIHex(CopyConfig &Config, MemoryBuffer &In,
Buffer &Out) {		Buffer &Out) {
// TODO: support output formats other than ELF.		// TODO: support output formats other than ELF.
if (Error E = Config.parseELFConfig())		if (Error E = Config.parseELFConfig())
return E;		return E;
return elf::executeObjcopyOnIHex(Config, In, Out);		return elf::executeObjcopyOnIHex(Config, In, Out);
Show All 14 Lines	case FileFormat::Unspecified:
if (Error E = Config.parseELFConfig())		if (Error E = Config.parseELFConfig())
return E;		return E;
return elf::executeObjcopyOnRawBinary(Config, In, Out);		return elf::executeObjcopyOnRawBinary(Config, In, Out);
}		}

llvm_unreachable("unsupported output format");		llvm_unreachable("unsupported output format");
}		}

/// The function executeObjcopyOnBinary does the dispatch based on the format
/// of the input binary (ELF, MachO or COFF).
static Error executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In,
Buffer &Out) {
if (auto *ELFBinary = dyn_cast<object::ELFObjectFileBase>(&In)) {
if (Error E = Config.parseELFConfig())
return E;
return elf::executeObjcopyOnBinary(Config, *ELFBinary, Out);
} else if (auto *COFFBinary = dyn_cast<object::COFFObjectFile>(&In))
return coff::executeObjcopyOnBinary(Config, *COFFBinary, Out);
else if (auto *MachOBinary = dyn_cast<object::MachOObjectFile>(&In))
return macho::executeObjcopyOnBinary(Config, *MachOBinary, Out);
else if (auto *MachOUniversalBinary =
dyn_cast<object::MachOUniversalBinary>(&In))
return macho::executeObjcopyOnMachOUniversalBinary(
Config, *MachOUniversalBinary, Out);
else if (auto *WasmBinary = dyn_cast<object::WasmObjectFile>(&In))
return objcopy::wasm::executeObjcopyOnBinary(Config, *WasmBinary, Out);
else
return createStringError(object_error::invalid_file_type,
"unsupported object file format");
}

namespace llvm {
namespace objcopy {

Expected<std::vector<NewArchiveMember>>
createNewArchiveMembers(CopyConfig &Config, const Archive &Ar) {
std::vector<NewArchiveMember> NewArchiveMembers;
Error Err = Error::success();
for (const Archive::Child &Child : Ar.children(Err)) {
Expected<StringRef> ChildNameOrErr = Child.getName();
if (!ChildNameOrErr)
return createFileError(Ar.getFileName(), ChildNameOrErr.takeError());

Expected<std::unique_ptr<Binary>> ChildOrErr = Child.getAsBinary();
if (!ChildOrErr)
return createFileError(Ar.getFileName() + "(" + *ChildNameOrErr + ")",
ChildOrErr.takeError());

MemBuffer MB(ChildNameOrErr.get());
if (Error E = executeObjcopyOnBinary(Config, *ChildOrErr->get(), MB))
return std::move(E);

Expected<NewArchiveMember> Member =
NewArchiveMember::getOldMember(Child, Config.DeterministicArchives);
if (!Member)
return createFileError(Ar.getFileName(), Member.takeError());
Member->Buf = MB.releaseMemoryBuffer();
Member->MemberName = Member->Buf->getBufferIdentifier();
NewArchiveMembers.push_back(std::move(*Member));
}
if (Err)
return createFileError(Config.InputFilename, std::move(Err));
return std::move(NewArchiveMembers);
}

} // end namespace objcopy
} // end namespace llvm

static Error executeObjcopyOnArchive(CopyConfig &Config,
const object::Archive &Ar) {
Expected<std::vector<NewArchiveMember>> NewArchiveMembersOrErr =
createNewArchiveMembers(Config, Ar);
if (!NewArchiveMembersOrErr)
return NewArchiveMembersOrErr.takeError();
return deepWriteArchive(Config.OutputFilename, *NewArchiveMembersOrErr,
Ar.hasSymbolTable(), Ar.kind(),
Config.DeterministicArchives, Ar.isThin());
}

static Error restoreStatOnFile(StringRef Filename,		static Error restoreStatOnFile(StringRef Filename,
const sys::fs::file_status &Stat,		const sys::fs::file_status &Stat,
bool PreserveDates) {		bool PreserveDates) {
int FD;		int FD;

// Writing to stdout should not be treated as an error here, just		// Writing to stdout should not be treated as an error here, just
// do not set access/modification times or permissions.		// do not set access/modification times or permissions.
if (Filename == "-")		if (Filename == "-")
▲ Show 20 Lines • Show All 161 Lines • Show Last 20 Lines

llvm/tools/llvm-objcopy/wasm/Object.h

This file was moved to llvm/lib/ObjCopy/wasm/Object.h.

llvm/tools/llvm-objcopy/wasm/Object.cpp

This file was moved to llvm/lib/ObjCopy/wasm/Object.cpp.

llvm/tools/llvm-objcopy/wasm/Reader.h

This file was moved to llvm/lib/ObjCopy/wasm/Reader.h.

llvm/tools/llvm-objcopy/wasm/Reader.cpp

This file was moved to llvm/lib/ObjCopy/wasm/Reader.cpp.

llvm/tools/llvm-objcopy/wasm/WasmObjcopy.h

This file was moved to llvm/include/llvm/ObjCopy/wasm/WasmObjcopy.h.

llvm/tools/llvm-objcopy/wasm/WasmObjcopy.cpp

This file was moved to llvm/lib/ObjCopy/wasm/WasmObjcopy.cpp.

llvm/tools/llvm-objcopy/wasm/Writer.h

This file was moved to llvm/lib/ObjCopy/wasm/Writer.h.

llvm/tools/llvm-objcopy/wasm/Writer.cpp

This file was moved to llvm/lib/ObjCopy/wasm/Writer.cpp.

llvm/unittests/CMakeLists.txt

	Show All 29 Lines
	add_subdirectory(InterfaceStub)			add_subdirectory(InterfaceStub)
	add_subdirectory(IR)			add_subdirectory(IR)
	add_subdirectory(LineEditor)			add_subdirectory(LineEditor)
	add_subdirectory(Linker)			add_subdirectory(Linker)
	add_subdirectory(MC)			add_subdirectory(MC)
	add_subdirectory(MI)			add_subdirectory(MI)
	add_subdirectory(Object)			add_subdirectory(Object)
	add_subdirectory(ObjectYAML)			add_subdirectory(ObjectYAML)
				add_subdirectory(ObjCopy)
	add_subdirectory(Option)			add_subdirectory(Option)
	add_subdirectory(Remarks)			add_subdirectory(Remarks)
	add_subdirectory(Passes)			add_subdirectory(Passes)
	add_subdirectory(ProfileData)			add_subdirectory(ProfileData)
	add_subdirectory(Support)			add_subdirectory(Support)
	add_subdirectory(TableGen)			add_subdirectory(TableGen)
	add_subdirectory(Target)			add_subdirectory(Target)
	add_subdirectory(TextAPI)			add_subdirectory(TextAPI)
	add_subdirectory(Transforms)			add_subdirectory(Transforms)
	add_subdirectory(XRay)			add_subdirectory(XRay)
	add_subdirectory(tools)			add_subdirectory(tools)

llvm/unittests/ObjCopy/CMakeLists.txt

This file was added.

				set(LLVM_LINK_COMPONENTS
				Object
				thakisUnsubmitted Not Done Reply Inline Actions I think you have to add_subdirectory this in llvm/unittests/CMakeLists.txt to get this test binary actually built. thakis: I think you have to add_subdirectory this in llvm/unittests/CMakeLists.txt to get this test…
				ObjCopy
				ObjectYAML
				)

				add_llvm_unittest(ObjCopyTests
				ObjCopyTest.cpp
				)

				target_link_libraries(ObjCopyTests PRIVATE LLVMTestingSupport)

llvm/unittests/ObjCopy/ObjCopyTest.cpp

This file was added.

//===- ObjCopyTest.cpp ----------------------------------------------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#include "llvm/ObjCopy/ObjCopy.h"

#include "llvm/ADT/SmallString.h"

#include "llvm/ObjCopy/Buffer.h"

#include "llvm/ObjCopy/COFF/COFFObjcopy.h"

#include "llvm/ObjCopy/CopyConfig.h"

#include "llvm/ObjCopy/ELF/ELFObjcopy.h"

#include "llvm/ObjCopy/MachO/MachOObjcopy.h"

#include "llvm/ObjCopy/wasm/WasmObjcopy.h"

#include "llvm/Object/COFF.h"

#include "llvm/Object/ELFObjectFile.h"

#include "llvm/Object/MachO.h"

#include "llvm/Object/ObjectFile.h"

#include "llvm/Object/Wasm.h"

#include "llvm/ObjectYAML/yaml2obj.h"

#include "llvm/Support/Error.h"

#include "llvm/Support/SmallVectorMemoryBuffer.h"

#include "llvm/Testing/Support/Error.h"

#include "gtest/gtest.h"

using namespace llvm;

using namespace object;

using namespace objcopy;

using namespace yaml;

void copySimpleFileImpl(const char *YamlCreationString,

std::function<bool(const Binary &File)> IsValidFormat) {

auto ErrHandler = [&](const Twine &Msg) { FAIL() << "Error: " << Msg; };

// Create Object file from yaml description.

SmallString<0> Storage;

std::unique_ptr<ObjectFile> Obj =

yaml2ObjectFile(Storage, YamlCreationString, ErrHandler);

grimarUnsubmitted

Not Done

bool ErrorReported = false;

- auto ErrHandler = [&](const Twine &) { ErrorReported = true; };

+ auto ErrHandler = [&](const Twine &Msg) { FAIL() << "failed to blah blah: " << Msg; };

// Create Object file from yaml description.

Perhaps just fail inside?

grimar: Perhaps just fail inside?

ASSERT_TRUE(Obj);

ASSERT_TRUE(IsValidFormat(*Obj));

CopyConfig Config;

Config.OutputFilename = "a.out";

// Call executeObjcopyOnBinary() from ObjCopy library.

objcopy::MemBuffer FB(Config.OutputFilename);

Error Err = objcopy::executeObjcopyOnBinary(Config, *Obj.get(), FB);

ASSERT_FALSE(std::move(Err));

grimarUnsubmitted

Not Done

You can avod having this ASSERT_TRUE if you use cast<T>, I think.
Also, you can use T & instead of a pointer, because it is expected that the value is always non-null.

grimar: You can avod having this `ASSERT_TRUE` if you use `cast<T>`, I think. Also, you can use `T &`…

// Load and check copied file.

std::unique_ptr<WritableMemoryBuffer> WFB = FB.releaseMemoryBuffer();

Expected<std::unique_ptr<ObjectFile>> Result =

ObjectFile::createObjectFile(WFB->getMemBufferRef());

ASSERT_THAT_EXPECTED(Result, Succeeded());

ASSERT_TRUE(IsValidFormat(**Result));

}

TEST(CopySimpleFile, COFF) {

copySimpleFileImpl(

R"(

--- !COFF

header:

Machine: IMAGE_FILE_MACHINE_AMD64

Characteristics: [ ]

sections:

- Name: .text

Characteristics: [ ]

Alignment: 4

SectionData: E800000000C3C3C3

symbols:

...

)",

[](const Binary &File) { return File.isCOFF(); });

}

TEST(CopySimpleFile, ELF) {

copySimpleFileImpl(

R"(

--- !ELF

FileHeader:

Class: ELFCLASS64

Data: ELFDATA2LSB

Type: ET_REL)",

[](const Binary &File) { return File.isELF(); });

}

TEST(CopySimpleFile, MachO) {

copySimpleFileImpl(

R"(

--- !mach-o

FileHeader:

magic: 0xFEEDFACF

cputype: 0x01000007

cpusubtype: 0x80000003

filetype: 0x00000002

ncmds: 0

sizeofcmds: 0

flags: 0x00218085

grimarUnsubmitted

Not Done

Do you need Machine: EM_X86_64? By default it is EM_NONE, so it should work without an explicit value I guess.

grimar: Do you need `Machine: EM_X86_64`? By default it is `EM_NONE`, so it should work without an…

reserved: 0x00000000

...

)",

[](const Binary &File) { return File.isMachO(); });

}

TEST(CopySimpleFile, Wasm) {

copySimpleFileImpl(

R"(

--- !WASM

FileHeader:

Version: 0x00000001

...

# CHECK: --- !WASM

# CHECK: FileHeader:

# CHECK: Version: 0x00000001

# CHECK: ...

)",

[](const Binary &File) { return File.isWasm(); });

}

This is an archive of the discontinued LLVM Phabricator instance.

[llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate library.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 296524

llvm/include/llvm/ObjCopy/Buffer.h

llvm/include/llvm/ObjCopy/COFF/COFFObjcopy.h

llvm/include/llvm/ObjCopy/CopyConfig.h

llvm/include/llvm/ObjCopy/ELF/ELFConfig.h

llvm/include/llvm/ObjCopy/ELF/ELFObjcopy.h

llvm/include/llvm/ObjCopy/MachO/MachOObjcopy.h

llvm/include/llvm/ObjCopy/ObjCopy.h

llvm/include/llvm/ObjCopy/wasm/WasmObjcopy.h

llvm/lib/CMakeLists.txt

llvm/lib/LLVMBuild.txt

llvm/lib/ObjCopy/Buffer.cpp

llvm/lib/ObjCopy/CMakeLists.txt

llvm/lib/ObjCopy/COFF/COFFObjcopy.cpp

llvm/lib/ObjCopy/COFF/Object.h

llvm/lib/ObjCopy/COFF/Object.cpp

llvm/lib/ObjCopy/COFF/Reader.h

llvm/lib/ObjCopy/COFF/Reader.cpp

llvm/lib/ObjCopy/COFF/Writer.h

llvm/lib/ObjCopy/COFF/Writer.cpp

llvm/lib/ObjCopy/CopyConfig.cpp

llvm/lib/ObjCopy/ELF/ELFConfig.cpp

llvm/lib/ObjCopy/ELF/ELFObjcopy.cpp

llvm/lib/ObjCopy/ELF/Object.h

llvm/lib/ObjCopy/ELF/Object.cpp

llvm/lib/ObjCopy/LLVMBuild.txt

llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.h

llvm/lib/ObjCopy/MachO/MachOLayoutBuilder.cpp

llvm/lib/ObjCopy/MachO/MachOObjcopy.cpp

llvm/lib/ObjCopy/MachO/MachOReader.h

llvm/lib/ObjCopy/MachO/MachOReader.cpp

llvm/lib/ObjCopy/MachO/MachOWriter.h

llvm/lib/ObjCopy/MachO/MachOWriter.cpp

llvm/lib/ObjCopy/MachO/Object.h

llvm/lib/ObjCopy/MachO/Object.cpp

llvm/lib/ObjCopy/ObjCopy.cpp

llvm/lib/ObjCopy/Util.h

llvm/lib/ObjCopy/wasm/Object.h

llvm/lib/ObjCopy/wasm/Object.cpp

llvm/lib/ObjCopy/wasm/Reader.h

llvm/lib/ObjCopy/wasm/Reader.cpp

llvm/lib/ObjCopy/wasm/WasmObjcopy.cpp

llvm/lib/ObjCopy/wasm/Writer.h

llvm/lib/ObjCopy/wasm/Writer.cpp

llvm/tools/llvm-objcopy/Buffer.h

llvm/tools/llvm-objcopy/Buffer.cpp

llvm/tools/llvm-objcopy/CMakeLists.txt

llvm/tools/llvm-objcopy/COFF/COFFObjcopy.h

llvm/tools/llvm-objcopy/COFF/COFFObjcopy.cpp

llvm/tools/llvm-objcopy/COFF/Object.h

llvm/tools/llvm-objcopy/COFF/Object.cpp

llvm/tools/llvm-objcopy/COFF/Reader.h

llvm/tools/llvm-objcopy/COFF/Reader.cpp

llvm/tools/llvm-objcopy/COFF/Writer.h

llvm/tools/llvm-objcopy/COFF/Writer.cpp

llvm/tools/llvm-objcopy/CopyConfig.h

llvm/tools/llvm-objcopy/CopyConfig.cpp

llvm/tools/llvm-objcopy/ELF/ELFConfig.h

llvm/tools/llvm-objcopy/ELF/ELFConfig.cpp

llvm/tools/llvm-objcopy/ELF/ELFObjcopy.h

llvm/tools/llvm-objcopy/ELF/ELFObjcopy.cpp

llvm/tools/llvm-objcopy/ELF/Object.h

llvm/tools/llvm-objcopy/ELF/Object.cpp

llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.h

llvm/tools/llvm-objcopy/MachO/MachOLayoutBuilder.cpp

llvm/tools/llvm-objcopy/MachO/MachOObjcopy.h

llvm/tools/llvm-objcopy/MachO/MachOObjcopy.cpp

llvm/tools/llvm-objcopy/MachO/MachOReader.h

llvm/tools/llvm-objcopy/MachO/MachOReader.cpp

llvm/tools/llvm-objcopy/MachO/MachOWriter.h

llvm/tools/llvm-objcopy/MachO/MachOWriter.cpp

llvm/tools/llvm-objcopy/MachO/Object.h

llvm/tools/llvm-objcopy/MachO/Object.cpp

llvm/tools/llvm-objcopy/ParseConfig.h

[llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate library.
ClosedPublic