This is an archive of the discontinued LLVM Phabricator instance.

Relocate some macro definitions into libunwind/include/mach-o/compact_unwind_encoding.h
Decouple target word size from the host word size for the compact unwind entry structs

Herald added a project: Restricted Project. Aug 28 2020, 5:42 PM

Herald added a reviewer: Restricted Project.

Herald added a subscriber: libcxx-commits.

Harbormaster completed remote builds in B69995: Diff 288738.Aug 28 2020, 6:07 PM

just a partial review for now... will come back and do a deeper reading next week after I understand a bit more about the CU format.

lld/MachO/OutputSegment.cpp
54 ↗	(On Diff #288677)	nit: StringRefs can be compared using `==`
lld/MachO/UnwindInfo.cpp
1 ↗	(On Diff #288677)	missing license header
58 ↗	(On Diff #288677)	I think this should be `isNeeded()`, not `isHidden()` -- "hidden" indicates that the section header is absent but that the section data itself should still be emitted
93 ↗	(On Diff #288677)	nit: `targetIsec` would be clearer
139 ↗	(On Diff #288677)	frequencies
lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	why the need to mark everything as mutable?
lld/MachO/Writer.cpp
412–413	This seems a bit hacky. Do we really have to put the `__compact_unwind` sections in an OutputSegment? If we're not going to output them, could we just put all these InputSections in a vector contained in `UnwindInfoSection`? Also, we may want to support emitting object files at some point via `-r`, in which case we'll actually want to emit the `__LD` segment at a valid position.
lld/test/MachO/tools/validate-unwind-info.py
90	this seems unnecessary given that we'll be exiting anyway when this function returns

gkm marked 8 inline comments as done.Aug 29 2020, 1:48 AM

gkm added inline comments.

lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	The work of morphing `__LD,__compact_unwind` into `__TEXT,__unwind_info` happens in `uint64_t getSize() const` and `void writeTo(uint8_t *buf) const`, which can only be logically `const`, but not physically.
lld/MachO/Writer.cpp
412–413	I agree that it could use some improvement. I will revisit. Behaviorally, `__LD,__compact_unwind` is a `MergedOutputSection`, except with different timing and destination: we relocate & write it to a temp buffer from `Writer::assignAddresses()` rather than writing to the final link output from `Writer::writeSections()`. We need to relocate it fully in order to compute the size of the `__TEXT,__unwind_info` section when laying out sections & binding to addresses. You are quite right regarding `-r`, which makes `__LD,__compact_unwind` a completely normal `MergedOutputSection`.
lld/test/MachO/tools/validate-unwind-info.py
90	True, but since I explicitly `sys.exit("... diagnostic ...")` on error, for symmetry I chose to explicitly `sys.exit()` on success. I dislike falling through the floor of main, despite language guarantees, and since `sys.exit()` accepts a string arg on error, I prefer that to using numeric `return STATUS`.

Update according to Jez's simple review-feedback items - i.e., everything except the hackiness surrounding __LD,__compact_unwind's anomalies as a MergedOutputSection.

Harbormaster completed remote builds in B70035: Diff 288839.Aug 29 2020, 11:59 PM

Rework pageBounds as a simple vector rather than a vector of pairs, and expand associated comments

Harbormaster completed remote builds in B70046: Diff 288867.Aug 30 2020, 1:23 PM

s/std::max/std::min/
Revert to simple llvm-lit test format, since bash is unnecessary

pass functionAddressMax via lambda capture rather than via constructed cuEntry
heed clang-tidy's advice

Harbormaster completed remote builds in B70050: Diff 288873.Aug 30 2020, 3:01 PM

Cleanups:

s/totalCuEntries/cuCount/
constify all CompactUnwindEntry64*
simplify level-1 index sentinel address calculation

Harbormaster completed remote builds in B70051: Diff 288874.Aug 30 2020, 3:31 PM

Harbormaster completed remote builds in B70052: Diff 288875.Aug 30 2020, 3:44 PM

Correction in lld/MachO/UnwindInfo.h: s/LLD_MACHO_COMPACT_UNWIND_H/LLD_MACHO_UNWIND_INFO_H/
Move some constant definitions that belong in libunwind/include/mach-o/compact_unwind_encoding.h

Harbormaster completed remote builds in B70054: Diff 288877.Aug 30 2020, 4:46 PM

Revise the arg signature for the comparison lambda for std::lower_bound() to emphasize that the 2nd pointer argument is ignored. Rather, we pass the comparand via lambda capture.

Harbormaster completed remote builds in B70126: Diff 288994.Aug 31 2020, 11:49 AM

Improvements & cleanups to generate-cfi-funcs.py:

add --functions=N arg
move some comments into doc strings

Harbormaster completed remote builds in B70165: Diff 289061.Aug 31 2020, 7:40 PM

Improvements & cleanups to validate-unwind-info.py:

accept input from multiple files or stdin
fail if any expected categories of input are absent

Harbormaster completed remote builds in B70335: Diff 289338.Sep 1 2020, 7:10 PM

int3 added inline comments.Sep 1 2020, 8:21 PM

lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	I would much prefer if we could have a method that explicitly computes the unwind info... but first let me see if I understand the constraints here. From what I understand, the size of the unwind info depends on the values of `CompactUnwindEntry64::functionAddress`, or more precisely, their relative offsets. So we need to assign addresses to our functions before computing the unwind info size. After addresses are assigned, the `cuSection->writeTo()` call below will update the `functionAddress` entries to point to the final addresses, after which we can compute the needed size. Does all that sound right? From what I can tell, ld64 calculates unwind info size based on "tentative addresses", which I think have the right relative offsets, though their final values have yet to be fixed. I wonder if we could do something similar with our current design...

int3 added inline comments.Sep 1 2020, 8:24 PM

lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	Also -- do you know if it's possible for unwind info to point to functions that reside in sections other than `__text`?

gkm added inline comments.Sep 2 2020, 12:05 AM

lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	Yes, that's substantially correct, and I will elaborate: `UnwindInfoSection::getSize()` must process the compact unwind entries and partition them into 4 KiB second-level pages in order to determine the final size of the `__TEXT,__unwind_info`. There are three factors that influence layout & therefore size:
(1) A maximum of 1021 entries can fit into a 4 KiB page.
(2) The difference between address of first and last functions in a second-level page must be less than 2^24. The page will be cut short and remain under-filled when a candidate entry's 24-bit relative offset field would overflow.
(3) We do a linear scan and fold adjacent entries that have the same encoding, thus laying-out fewer entries in the output than are present in the input.
As currently implemented, I use `mutable` data members to cache the necessary and substantial work done in `UnwindInfoSection::getSize() const` so that `UnwindInfoSection::writeTo() const` need not redo it.
I believe it is not strictly necessary to relocate the compact unwind entries in `getSize()`. I can probably work with the raw input sections for page partitioning and entry folding, though the code will be trickier, and I won't escape the need to relocate later in `writeTo()` for sake of LSDA and personality addresses.
I chose to use data members as cache, to relocate early, and to use `mutable` to work-around constness of `getSize()` and `writeTo()` because that results in code that does things in the proper sequence, and in the most straightforward manner for sake of clarity and efficiency.
51–66 ↗	(On Diff #288677)	With assembly language, anything is possible. I assume one can place compiled functions in alternate sections via attributes or pragmas. Those sections might be discontiguous with `__text`, and therefore require layout into distinct second-level pages. I don't know if ld64 supports non-`__text` compact unwind. I will need to conjure some test cases to see how it behaves. If ld64 doesn't handle non-`__text`, then we won't either. If it does, then I will add that later.
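The three layout factors listed in the reply above can be sketched in a few lines. This is only an illustration under stated assumptions, not the code from the diff: `Entry`, `foldAdjacent`, `partitionPages`, and the two constants are hypothetical names, and folding here compares only the encoding, whereas the landed code also compares personality and LSDA.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical, simplified stand-in for a relocated compact unwind entry.
struct Entry {
  uint64_t functionAddress;
  uint32_t encoding;
};

// Limits taken from the discussion: 1021 entries per 4 KiB page, and a
// 24-bit function offset relative to the first entry of the page.
constexpr size_t kMaxEntriesPerPage = 1021;
constexpr uint64_t kMaxOffsetSpan = uint64_t(1) << 24;

// Fold runs of adjacent entries that share an encoding, keeping the first
// entry of each run. The input must already be sorted by functionAddress.
std::vector<Entry> foldAdjacent(const std::vector<Entry> &sorted) {
  std::vector<Entry> folded;
  for (const Entry &e : sorted)
    if (folded.empty() || folded.back().encoding != e.encoding)
      folded.push_back(e);
  return folded;
}

// Return the index of the first entry of each page, followed by a sentinel
// equal to entries.size(). A page closes when it is full, or when the next
// entry's offset from the page's first entry would not fit in 24 bits.
std::vector<size_t> partitionPages(const std::vector<Entry> &entries) {
  std::vector<size_t> bounds;
  size_t pageStart = 0;
  for (size_t i = 0; i < entries.size(); ++i) {
    bool full = i - pageStart == kMaxEntriesPerPage;
    bool tooFar = entries[i].functionAddress -
                      entries[pageStart].functionAddress >= kMaxOffsetSpan;
    if (i == 0 || full || tooFar) {
      bounds.push_back(i);
      pageStart = i;
    }
  }
  bounds.push_back(entries.size());
  return bounds;
}
```

Folding runs before partitioning in this sketch, so the page limits apply to the already reduced entry list; that ordering is an assumption made for illustration.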

lld/MachO/UnwindInfo.cpp:

Remove redundant unwind_info_section_header_lsda_entry, and use existing unwind_info_section_header_lsda_index_entry instead.
Don't cache UnwindInfoSection::isNeeded() result since it is only called once.

lld/test/MachO/tools/generate-cfi-funcs.py:

Permute the register save set in order to generate a larger variety of encodings.
Align generated functions at larger boundary so it works for testing ld64 also.

lld/test/MachO/tools/validate-unwind-info.py

Expand the object-file encoding regexp to capture optional personality+lsda

Harbormaster completed remote builds in B70471: Diff 289583.Sep 2 2020, 4:32 PM

gkm edited the summary of this revision. Sep 2 2020, 10:01 PM

int3 added inline comments.Sep 3 2020, 2:57 PM

lld/MachO/UnwindInfo.h
51–66 ↗	(On Diff #288677)	"I believe it is not strictly necessary to relocate the compact unwind entries in getSize(). I can probably work with the raw input sections for page partitioning and entry folding"
How about doing most of what's currently in `getSize()` in a `finalizeContents()` method instead, much like what is currently being done for the `__LINKEDIT` sections? My main concern -- aside from a mutating method being marked as `const` -- is that compact unwind's `getSize()` has an implicit dependency on the addresses of another section. Other synthetic sections' `finalizeContents()` methods have such a dependency, too, so this would fit the mold. Of course, unlike the `__LINKEDIT` sections, the address assignments of later sections do depend on compact unwind's size, so we'll have to call it in the middle of `assignAddresses`. Such special-casing of CU would be a bit more verbose than what we currently have, but I think it's a good thing to make the uniqueness of compact unwind's requirements explicit.

Do the work in finalize() member rather than getSize() const. Drop mutable from data members. Ahhh! So much nicer!
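The shape of that change can be sketched roughly as follows. This is a hypothetical miniature, not lld's class hierarchy: `SketchSection` and its members are illustrative names, and the "work" is reduced to folding a list of encodings.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <utility>
#include <vector>

// Hypothetical miniature of a synthetic section whose size depends on
// work that can only be done late in the link.
class SketchSection {
public:
  explicit SketchSection(std::vector<uint32_t> encodings)
      : input(std::move(encodings)) {}

  // All mutation happens here, once, at a point the caller chooses.
  void finalize() {
    folded.clear();
    for (uint32_t enc : input)
      if (folded.empty() || folded.back() != enc)
        folded.push_back(enc);
    size = folded.size() * sizeof(uint32_t);
  }

  // Physically const: reads only what finalize() computed, so no data
  // member needs to be marked `mutable`.
  uint64_t getSize() const { return size; }

  void writeTo(uint8_t *buf) const {
    if (!folded.empty())
      std::memcpy(buf, folded.data(), folded.size() * sizeof(uint32_t));
  }

private:
  std::vector<uint32_t> input;
  std::vector<uint32_t> folded;
  uint64_t size = 0;
};
```

The trade-off is that callers must remember to call `finalize()` before `getSize()`; in exchange the ordering dependency is visible at the call site instead of being hidden inside a `const` method.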

Harbormaster completed remote builds in B70743: Diff 290105.Sep 5 2020, 12:53 PM

use %python prefix in test file
minor cleanups

Harbormaster completed remote builds in B70751: Diff 290114.Sep 5 2020, 5:57 PM

Having things in finalize() indeed looks a lot cleaner :D

lld/MachO/SyntheticSections.h
22	this include doesn't seem necessary
lld/MachO/UnwindInfo.cpp
91–92 ↗	(On Diff #290114)	is the `Twine()` call here necessary? I would expect the concatenated StringRefs to already be a Twine
107 ↗	(On Diff #290114)	ditto, avoid `<map>`
120 ↗	(On Diff #290114)	codebase convention is to avoid curly braces for one-liner blocks
127 ↗	(On Diff #290114)	frequencies
147 ↗	(On Diff #290114)	successive
150 ↗	(On Diff #290114)	pageBounds
155 ↗	(On Diff #290114)	within
156–158 ↗	(On Diff #290114)	nit 1: the type here is pretty verbose; a type alias would be nice (`auto` would be acceptable here too, I think). nit 2: `it0` and `itN` aren't the most descriptive names; something like `intervalStart` and `intervalMax` would be nice
167–168 ↗	(On Diff #290114)	why not pass `functionAddressMax` in as the 3rd parameter to `lower_bound`? edit: oh, because the return type isn't the container element type. would be good to have a comment :)
193–196 ↗	(On Diff #290114)	can be rm'ed
211 ↗	(On Diff #290114)	would be good to have a comment to the effect that there are a bunch of integer arrays immediately after the section header (I had to pause and think about what `&uip[1]` meant)
257 ↗	(On Diff #290114)	what does a `kind` of 3 indicate?
lld/MachO/UnwindInfo.h
49 ↗	(On Diff #290114)	try to avoid `<map>` if possible: https://llvm.org/docs/ProgrammersManual.html#map. Also, the key here should be `compact_unwind_encoding_t` to be consistent :)
50 ↗	(On Diff #290114)	nit: I'd prefer structs over pairs for readable field names
lld/MachO/Writer.cpp
412–413	I think it might be worthwhile to duplicate some of the code in `InputSection::writeTo` to specialize it for the compact unwind case. The current `writeTo` code assumes that both the source and target sections referenced by a relocation have their final output addresses determined. But this is not true for the compact unwind section -- it doesn't have a valid output address. It just happens to work because there are no pcrel relocations involved. We should really error out if we see a pcrel relocation while relocating `__compact_unwind`. Creating a separate compact unwind relocating method would free us from having to create `Output{Segments, Sections}` that we discard later. I'm hoping we can retain the invariant where we add OutputSections to OutputSegments only after we have determined that they are needed.
lld/test/MachO/compact-unwind.test
4–6	My understanding of the need for generate-cfi-funcs.py is that some test cases need to be very large in order to exercise the edge cases in the implementation. In addition to providing that functionality, this script also serves as a fuzzer by generating random test cases. Would it be possible for the unit test to instead enumerate a small number of cases that cover all important code paths, in order that we run fewer test cases?
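The suggestion in the `Writer.cpp` comment above, to relocate `__compact_unwind` with a dedicated routine that rejects PC-relative relocations, could look roughly like this. Everything here is a hypothetical sketch: `SketchReloc` and `relocateCompactUnwind` are invented names, the exceptions stand in for lld's own diagnostics, and the in-place addend is read in host byte order.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <vector>

// Hypothetical, simplified relocation record; field names are illustrative.
struct SketchReloc {
  uint32_t offset;     // byte offset within the compact unwind buffer
  bool pcrel;          // true if the relocation is PC-relative
  uint64_t targetAddr; // final address of the referenced symbol or section
};

// Apply absolute 8-byte relocations to `buf`. PC-relative relocations are
// rejected because the buffer has no valid output address of its own.
void relocateCompactUnwind(std::vector<uint8_t> &buf,
                           const std::vector<SketchReloc> &relocs) {
  for (const SketchReloc &r : relocs) {
    if (r.pcrel)
      throw std::runtime_error("pcrel relocation in __compact_unwind");
    if (r.offset + sizeof(uint64_t) > buf.size())
      throw std::out_of_range("relocation offset past end of buffer");
    // The addend is stored in place; add the target address to it.
    uint64_t addend;
    std::memcpy(&addend, buf.data() + r.offset, sizeof(addend));
    uint64_t value = r.targetAddr + addend;
    std::memcpy(buf.data() + r.offset, &value, sizeof(value));
  }
}
```

Because the destination is a plain byte buffer, a routine of this shape would not need an `OutputSegment` or `OutputSection` to exist for `__LD`.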

int3 added inline comments.Sep 8 2020, 5:15 PM

lld/MachO/UnwindInfo.cpp

90–92 ↗	(On Diff #290114)	Is this check really necessary? What kind of errors are we defending against? I would rather we not loop over the relocations unless necessary (for performance)

95–96 ↗	(On Diff #290114)	CU relocations can be section-based. Just checked a simple program:

~/tmp: llvm-readobj --relocations --expand-relocs bar.o

File: bar.o
Format: Mach-O 64-bit x86-64
Arch: x86_64
AddressSize: 64bit
Relocations [
  Section __compact_unwind {
    Relocation {
      Offset: 0x0
      PCRel: 0
      Length: 3
      Type: X86_64_RELOC_UNSIGNED (0)
      Section: __text (1)
    }
  }
]
~/tmp: cat bar.cpp
int foo() {
  return 123;
}

smeenai added inline comments.Sep 8 2020, 5:17 PM

lld/MachO/UnwindInfo.cpp
90–92 ↗	(On Diff #290114)	Yup. I don't have the full context here yet, but https://lld.llvm.org/NewLLD.html#numbers-you-want-to-know is relevant.

gkm marked 18 inline comments as done.Sep 9 2020, 5:55 PM

gkm added inline comments.

lld/MachO/UnwindInfo.cpp
90–92 ↗	(On Diff #290114)	This is an assertion -- a sanity check on my own assumptions. Once I have tested more widely, I will have no further use for it. A `TODO` comment to convey my intention is appropriate.
91–92 ↗	(On Diff #290114)	Regarding `Twine()`: I meant to only pass the initial string constant so that later `+` operands would be chained to it. I did not mean to pass entire concat expression. I used explicit `Twine()` following Saleem's example in an earlier diff. I'm happy to drop it. I see that diag functions already coerce string constants to `Twine` anyway.
95–96 ↗	(On Diff #290114)	Yes. I expect relocs to all reference sections whose address is assigned, i.e., `__TEXT,__text`. The assertion checks if this is ever NOT the case.
167–168 ↗	(On Diff #290114)	I added a comment explaining the choice.
lld/MachO/UnwindInfo.h
50 ↗	(On Diff #290114)	`commonEncodings` works best as a vector of pairs because it is filled from a `DenseMap` iterator which returns precisely the pairs we desire.
lld/MachO/Writer.cpp
412–413	I still owe you an answer for this one ...
lld/test/MachO/compact-unwind.test
4–6	It is already a single test case, made deterministic by passing `--seed` so that `check-lld-macho` runs are consistent. The multiplicity that might be reduced is the number of functions generated for that one test case. However, since I need to test boundary conditions around breaking 1021-entry pages, my options are limited. If there is a way to prefer an installed `llvm-mc` over the locally built one, that would solve it.
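The reply about `commonEncodings` above says the vector of pairs is filled straight from a map iterator. A minimal sketch of that pattern follows, with loud assumptions: `std::unordered_map` stands in for `llvm::DenseMap`, `Encoding` stands in for `compact_unwind_encoding_t`, `pickCommonEncodings` is a hypothetical name, and sorting by descending frequency with a cap of 127 is inferred from the thread rather than copied from the diff.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

using Encoding = uint32_t; // stand-in for compact_unwind_encoding_t
constexpr size_t kCommonEncodingsMax = 127;

std::vector<std::pair<Encoding, size_t>>
pickCommonEncodings(const std::vector<Encoding> &encodings) {
  // Count how often each encoding occurs.
  std::unordered_map<Encoding, size_t> frequencies;
  for (Encoding e : encodings)
    ++frequencies[e];

  // The map iterator yields (encoding, count) pairs, so the vector can be
  // constructed directly from the iterator range.
  std::vector<std::pair<Encoding, size_t>> common(frequencies.begin(),
                                                  frequencies.end());

  // Most frequent first; ties broken by encoding value for determinism.
  std::sort(common.begin(), common.end(),
            [](const std::pair<Encoding, size_t> &a,
               const std::pair<Encoding, size_t> &b) {
              if (a.second != b.second)
                return a.second > b.second;
              return a.first < b.first;
            });

  if (common.size() > kCommonEncodingsMax)
    common.resize(kCommonEncodingsMax);
  return common;
}
```

A struct with named fields, as the reviewer preferred, would need an explicit conversion loop here; the pair form avoids that at some cost in readability.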

gkm marked 5 inline comments as done.Sep 9 2020, 5:58 PM

gkm added inline comments.

lld/MachO/UnwindInfo.cpp
90–92 ↗	(On Diff #290114)	I am only looping over a subset of relocs, numbered 1+ per function. The "+" represents the subset of functions that have personality+lsda.

follow review feedback

Herald added a subscriber: jfb. Sep 9 2020, 6:23 PM

Harbormaster completed remote builds in B71169: Diff 290858.Sep 9 2020, 6:58 PM

finalize __LD,__compact_unwind before relocating

Harbormaster completed remote builds in B71430: Diff 291330.Sep 11 2020, 2:19 PM

Folding adjacent CU entries requires a vector with monotonically increasing functionAddress, and to get that we must first apply std::sort().
Abandon passing pageBreak 2nd compare arg via lambda-capture, because it is a fragile technique. It is only by an accident of the implementation that it works on std::lower_bound. It does not work for std::upper_bound, std::binary_search, or std::equal_range.
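For reference, `std::lower_bound` accepts a search value whose type differs from the element type, provided the comparator takes `(element, value)` in that order. The sketch below shows that form; `SketchEntry` and `findPageBreak` are hypothetical names, and this is not necessarily the form the diff finally adopted.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a compact unwind entry.
struct SketchEntry {
  uint64_t functionAddress;
};

// Return an iterator to the first entry whose address is >= limit.
// `sortedPtrs` must be sorted by ascending functionAddress.
std::vector<const SketchEntry *>::const_iterator
findPageBreak(const std::vector<const SketchEntry *> &sortedPtrs,
              uint64_t limit) {
  return std::lower_bound(
      sortedPtrs.begin(), sortedPtrs.end(), limit,
      // lower_bound calls comp(element, value): element first.
      [](const SketchEntry *entry, uint64_t addr) {
        return entry->functionAddress < addr;
      });
}
```

`std::upper_bound` calls its comparator with the arguments the other way round, `(value, element)`, which is one reason a comparator that ignores its second argument cannot be reused across these algorithms.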

Harbormaster completed remote builds in B71494: Diff 291439.Sep 12 2020, 7:36 PM

gkm added inline comments.Sep 17 2020, 2:56 PM

lld/MachO/Writer.cpp
412–413	I believe this latest rev works well and isn't so ugly. I retain the normal flow for gathering input sections and attaching them to `MergedOutputSection` for `__LD`, but I exclude it from `outputSegments` (without `-r`) so further downstream processing doesn't happen. I do insert `__LD` into `nameToOutputSegment[]` so that the synthetic `UnwindInfoSection::finalize()` can find it for processing. That seems minimally invasive and doesn't need a subclass. All evidence I have seen shows that reloc types in `__LD,__compact_unwind` are always absolute and never PC-relative. The only output address that must be determined is `lld`'s VMA. `OutputSection::writeTo()` accepts a pointer to a buffer, which is normally mapped to the linker's output file. However, when relocating `__compact_unwind`, the buffer is the internal `std::vector<CompactUnwindEntry64>::data()` rather than the output file.

clean-up some ugly special cases by excluding __LD from outputSegments

Harbormaster completed remote builds in B72092: Diff 292632.Sep 17 2020, 2:59 PM

add UNWIND_INFO_COMMON_ENCODINGS_MAX
revise, expand & prune comments
improve variable names for CU entry folding

Harbormaster completed remote builds in B72096: Diff 292643.Sep 17 2020, 3:36 PM

int3 accepted this revision.Sep 17 2020, 6:36 PM

int3 added inline comments.

lld/MachO/OutputSegment.cpp
73–74 ↗	(On Diff #292643)	Instead of filtering it out here, how about checking for `__LD,__compact_unwind` inside `createOutputSections()` while looping over `mergedOutputSections`, and then doing something like `unwindInfoSection->setCompactUnwind(...)`? Benefits: no need for the loop in `UnwindInfoSection::isNeeded()`, no need for `getOutputSection`, and no need to create an outputSegment to be discarded later. That said, the current setup is definitely much cleaner than what we had before, and I'm happy to have this refinement in a follow-up diff
lld/MachO/UnwindInfo.cpp
201 ↗	(On Diff #292643)	s/adddress/addresses are/
288 ↗	(On Diff #292643)	nit: `static_cast<size_t>`
lld/MachO/UnwindInfo.h
14–17 ↗	(On Diff #292643)	nit: group includes with LLD headers first, then LLVM headers, then system headers
lld/MachO/Writer.cpp
421–422	how about using `INT_MAX - 1` here instead, to cover the unlikely case where we have more than 100 sections? ld64 seems to do that
631	nit: Synthetic sections that only need to be used within `Writer` -- like `SymtabSection` -- can be a member of `Writer` instead of `InStruct`.
lld/test/MachO/compact-unwind.test
18	maybe change this seed to something that's not company-specific
lld/test/MachO/tools/validate-unwind-info.py
37	JFYI, the parens are unnecessary, but I'm fine if you prefer them
69	I don't suppose you meant to leave `and False` in?

gkm marked 10 inline comments as done.Sep 18 2020, 8:07 PM

gkm added inline comments.

lld/MachO/OutputSegment.cpp
73–74 ↗	(On Diff #292643)	No followup necessary. I did it all here.

Final round of cleanups & review-feedback integration

gkm removed a reviewer: Restricted Project.Sep 18 2020, 8:33 PM

gkm removed a project: Restricted Project.

This revision is now accepted and ready to land.Sep 18 2020, 8:33 PM

int3 added inline comments.Sep 18 2020, 8:34 PM

lld/MachO/OutputSegment.cpp
65 ↗	(On Diff #292936)	is this still necessary?

Remove redundant change to lld/MachO/OutputSegment.cpp

Harbormaster completed remote builds in B72264: Diff 292936.Sep 18 2020, 9:18 PM

Harbormaster completed remote builds in B72265: Diff 292937.Sep 18 2020, 9:27 PM

This revision was landed with ongoing or failed builds.Sep 18 2020, 10:02 PM

Closed by commit rG2124ca1d5cb6: [lld-macho] create __TEXT,__unwind_info from __LD,__compact_unwind (authored by gkm).

This revision was automatically updated to reflect the committed changes.

gkm added a commit: rG2124ca1d5cb6: [lld-macho] create __TEXT,__unwind_info from __LD,__compact_unwind.

thakis added a subscriber: thakis.Sep 19 2020, 5:33 AM

thakis added inline comments.

lld/test/MachO/tools/generate-cfi-funcs.py
21	llvm still supports python2.7 for now.

thakis added inline comments.Sep 19 2020, 6:17 AM

lld/test/MachO/tools/generate-cfi-funcs.py
21	I went ahead and made the scripts 2.7-compatible in e22a4fd59de668af1cb943e23a6f4bfc93090e0f. Once we drop 2.7 support, it's hopefully easy to revert. Sorry for the trouble!

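Revision Contents

Path	Size
lld/MachO/ (file name not captured)	3 lines
lld/MachO/ (file name not captured)	1 line
lld/MachO/ (file name not captured)	4 lines
lld/MachO/ (file name not captured)	84 lines
lld/MachO/UnwindInfoSection.cpp	284 lines
lld/MachO/Writer.cpp	16 lines
lld/test/MachO/compact-unwind.test	21 lines
lld/test/MachO/tools/generate-cfi-funcs.py	135 lines
lld/test/MachO/tools/validate-unwind-info.py	96 lines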

Diff 292938

lld/MachO/CMakeLists.txt

	set(LLVM_TARGET_DEFINITIONS Options.td)			set(LLVM_TARGET_DEFINITIONS Options.td)
	tablegen(LLVM Options.inc -gen-opt-parser-defs)			tablegen(LLVM Options.inc -gen-opt-parser-defs)
	add_public_tablegen_target(MachOOptionsTableGen)			add_public_tablegen_target(MachOOptionsTableGen)

				include_directories(${LLVM_MAIN_SRC_DIR}/../libunwind/include)

	add_lld_library(lldMachO2			add_lld_library(lldMachO2
	Arch/X86_64.cpp			Arch/X86_64.cpp
				UnwindInfoSection.cpp
	Driver.cpp			Driver.cpp
	DriverUtils.cpp			DriverUtils.cpp
	ExportTrie.cpp			ExportTrie.cpp
	InputFiles.cpp			InputFiles.cpp
	InputSection.cpp			InputSection.cpp
	MergedOutputSection.cpp			MergedOutputSection.cpp
	ObjC.cpp			ObjC.cpp
	OutputSection.cpp			OutputSection.cpp
	Show All 24 Lines

lld/MachO/OutputSegment.h

	Show All 16 Lines

	namespace segment_names {			namespace segment_names {

	constexpr const char pageZero[] = "__PAGEZERO";			constexpr const char pageZero[] = "__PAGEZERO";
	constexpr const char text[] = "__TEXT";			constexpr const char text[] = "__TEXT";
	constexpr const char data[] = "__DATA";			constexpr const char data[] = "__DATA";
	constexpr const char linkEdit[] = "__LINKEDIT";			constexpr const char linkEdit[] = "__LINKEDIT";
	constexpr const char dataConst[] = "__DATA_CONST";			constexpr const char dataConst[] = "__DATA_CONST";
				constexpr const char ld[] = "__LD"; // output only with -r

	} // namespace segment_names			} // namespace segment_names

	class OutputSection;			class OutputSection;
	class InputSection;			class InputSection;

	class OutputSegment {			class OutputSegment {
	public:			public:
	Show All 30 Lines

lld/MachO/SyntheticSections.h

	Show All 13 Lines
	#include "InputSection.h"			#include "InputSection.h"
	#include "OutputSection.h"			#include "OutputSection.h"
	#include "OutputSegment.h"			#include "OutputSegment.h"
	#include "Target.h"			#include "Target.h"

	#include "llvm/ADT/PointerUnion.h"			#include "llvm/ADT/PointerUnion.h"
	#include "llvm/ADT/SetVector.h"			#include "llvm/ADT/SetVector.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	namespace section_names {			namespace section_names {

	constexpr const char pageZero[] = "__pagezero";			constexpr const char pageZero[] = "__pagezero";
	constexpr const char header[] = "__mach_header";			constexpr const char header[] = "__mach_header";
	constexpr const char binding[] = "__binding";			constexpr const char binding[] = "__binding";
	constexpr const char weakBinding[] = "__weak_binding";			constexpr const char weakBinding[] = "__weak_binding";
	constexpr const char lazyBinding[] = "__lazy_binding";			constexpr const char lazyBinding[] = "__lazy_binding";
	constexpr const char export_[] = "__export";			constexpr const char export_[] = "__export";
	constexpr const char symbolTable[] = "__symbol_table";			constexpr const char symbolTable[] = "__symbol_table";
	constexpr const char stringTable[] = "__string_table";			constexpr const char stringTable[] = "__string_table";
	constexpr const char got[] = "__got";			constexpr const char got[] = "__got";
	constexpr const char threadPtrs[] = "__thread_ptrs";			constexpr const char threadPtrs[] = "__thread_ptrs";
				constexpr const char unwindInfo[] = "__unwind_info";
				// these are not synthetic, but in service of synthetic __unwind_info
				constexpr const char compactUnwind[] = "__compact_unwind";
				constexpr const char ehFrame[] = "__eh_frame";

	} // namespace section_names			} // namespace section_names

	class Defined;			class Defined;
	class DylibSymbol;			class DylibSymbol;
	class LoadCommand;			class LoadCommand;

	class SyntheticSection : public OutputSection {			class SyntheticSection : public OutputSection {
	▲ Show 20 Lines • Show All 364 Lines • Show Last 20 Lines

lld/MachO/UnwindInfoSection.h

This file was added.

				//===- UnwindInfoSection.h ------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLD_MACHO_UNWIND_INFO_H
				#define LLD_MACHO_UNWIND_INFO_H

				#include "MergedOutputSection.h"
				#include "SyntheticSections.h"

				#include "mach-o/compact_unwind_encoding.h"
				#include "llvm/ADT/DenseMap.h"

				#include <vector>

				// In 2020, we mostly care about 64-bit targets: x86_64 and arm64
				struct CompactUnwindEntry64 {
				uint64_t functionAddress;
				uint32_t functionLength;
				compact_unwind_encoding_t encoding;
				uint64_t personality;
				uint64_t lsda;
				};

				// FIXME(gkm): someday we might care about 32-bit targets: x86 & arm
				struct CompactUnwindEntry32 {
				uint32_t functionAddress;
				uint32_t functionLength;
				compact_unwind_encoding_t encoding;
				uint32_t personality;
				uint32_t lsda;
				};

				namespace lld {
				namespace macho {

				class UnwindInfoSection : public SyntheticSection {
				public:
				UnwindInfoSection();
				uint64_t getSize() const override { return unwindInfoSize; }
				bool isNeeded() const override;
				void finalize() override;
				void writeTo(uint8_t *buf) const override;
				void setCompactUnwindSection(MergedOutputSection *cuSection) {
				compactUnwindSection = cuSection;
				}

				private:
				std::vector<std::pair<compact_unwind_encoding_t, size_t>> commonEncodings;
				std::vector<uint32_t> personalities;
				std::vector<unwind_info_section_header_lsda_index_entry> lsdaEntries;
				std::vector<CompactUnwindEntry64> cuVector;
				std::vector<const CompactUnwindEntry64 *> cuPtrVector;
				std::vector<std::vector<const CompactUnwindEntry64 *>::const_iterator>
				pageBounds;
				MergedOutputSection *compactUnwindSection = nullptr;
				uint64_t level2PagesOffset = 0;
				uint64_t unwindInfoSize = 0;
				};

				#define UNWIND_INFO_COMMON_ENCODINGS_MAX 127

				#define UNWIND_INFO_SECOND_LEVEL_PAGE_SIZE 4096
				#define UNWIND_INFO_REGULAR_SECOND_LEVEL_ENTRIES_MAX \
				((UNWIND_INFO_SECOND_LEVEL_PAGE_SIZE - \
				sizeof(unwind_info_regular_second_level_page_header)) / \
				sizeof(unwind_info_regular_second_level_entry))
				#define UNWIND_INFO_COMPRESSED_SECOND_LEVEL_ENTRIES_MAX \
				((UNWIND_INFO_SECOND_LEVEL_PAGE_SIZE - \
				sizeof(unwind_info_compressed_second_level_page_header)) / \
				sizeof(uint32_t))

				#define UNWIND_INFO_COMPRESSED_ENTRY_FUNC_OFFSET_BITS 24
				#define UNWIND_INFO_COMPRESSED_ENTRY_FUNC_OFFSET_MASK \
				UNWIND_INFO_COMPRESSED_ENTRY_FUNC_OFFSET(~0)

				} // namespace macho
				} // namespace lld

				#endif
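The arithmetic behind the page-capacity macros in this header can be checked with a small sketch. The structs below are local stand-ins that are assumed to mirror the second-level page headers in libunwind's `compact_unwind_encoding.h`; they, the `k`-prefixed constants, and `packCompressedEntry` are hypothetical names, and the packing layout (encoding index in the top 8 bits, function offset in the low 24) should be verified against the real header.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Assumed layouts; check against compact_unwind_encoding.h before relying
// on them.
struct RegularPageHeader {
  uint32_t kind;
  uint16_t entryPageOffset;
  uint16_t entryCount;
};
struct RegularEntry {
  uint32_t functionOffset;
  uint32_t encoding;
};
struct CompressedPageHeader {
  uint32_t kind;
  uint16_t entryPageOffset;
  uint16_t entryCount;
  uint16_t encodingsPageOffset;
  uint16_t encodingsCount;
};

constexpr size_t kPageSize = 4096;
constexpr size_t kRegularMax =
    (kPageSize - sizeof(RegularPageHeader)) / sizeof(RegularEntry);
constexpr size_t kCompressedMax =
    (kPageSize - sizeof(CompressedPageHeader)) / sizeof(uint32_t);

// (4096 - 8) / 8 and (4096 - 12) / 4 respectively.
static_assert(kRegularMax == 511, "regular page capacity");
static_assert(kCompressedMax == 1021, "compressed page capacity");

constexpr unsigned kFuncOffsetBits = 24;
constexpr uint32_t kFuncOffsetMask = (uint32_t(1) << kFuncOffsetBits) - 1;

// Pack an 8-bit encoding index and a 24-bit function offset into one
// 32-bit compressed entry (assumed layout, see the note above).
constexpr uint32_t packCompressedEntry(uint32_t encodingIndex,
                                       uint32_t funcOffset) {
  return (encodingIndex << kFuncOffsetBits) | (funcOffset & kFuncOffsetMask);
}
```

The 1021 figure matches the per-page limit quoted in the review thread, and the 24-bit offset field is what forces a page to be cut short when function addresses span 2^24 bytes or more.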

lld/MachO/UnwindInfoSection.cpp

This file was added.

				//===- UnwindInfoSection.cpp ----------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "UnwindInfoSection.h"
				#include "Config.h"
				#include "InputSection.h"
				#include "MergedOutputSection.h"
				#include "OutputSection.h"
				#include "OutputSegment.h"
				#include "Symbols.h"
				#include "SyntheticSections.h"
				#include "Target.h"

				#include "lld/Common/ErrorHandler.h"
				#include "llvm/ADT/SmallVector.h"
				#include "llvm/BinaryFormat/MachO.h"

				using namespace llvm;
				using namespace llvm::MachO;
				using namespace lld;
				using namespace lld::macho;

				// Compact Unwind format is a Mach-O evolution of DWARF Unwind that
				// optimizes space and exception-time lookup. Most DWARF unwind
				// entries can be replaced with Compact Unwind entries, but the ones
				// that cannot are retained in DWARF form.
				//
				// This comment will address macro-level organization of the pre-link
				// and post-link compact unwind tables. For micro-level organization
				// pertaining to the bitfield layout of the 32-bit compact unwind
				// entries, see libunwind/include/mach-o/compact_unwind_encoding.h
				//
				// Important clarifying factoids:
				//
				// * __LD,__compact_unwind is the compact unwind format for compiler
				// output and linker input. It is never a final output. It could be
				// an intermediate output with the `-r` option which retains relocs.
				//
				// * __TEXT,__unwind_info is the compact unwind format for final
				// linker output. It is never an input.
				//
				// * __TEXT,__eh_frame is the DWARF format for both linker input and output.
				//
				// * __TEXT,__unwind_info entries are divided into 4 KiB pages (2nd
				// level) by ascending address, and the pages are referenced by an
				// index (1st level) in the section header.
				//
				// * Following the headers in __TEXT,__unwind_info, the bulk of the
				// section contains a vector of compact unwind entries
				// `{functionOffset, encoding}` sorted by ascending `functionOffset`.
				// Adjacent entries with the same encoding can be folded to great
				// advantage, achieving a 3-order-of-magnitude reduction in the
				// number of entries.
				//
				// * The __TEXT,__unwind_info format can accommodate up to 127 unique
				// encodings for the space-efficient compressed format. In practice,
				// fewer than a dozen unique encodings are used by C++ programs of
				// all sizes. Therefore, we don't even bother implementing the regular
				// non-compressed format. Time will tell if anyone in the field ever
				// overflows the 127-encodings limit.

				// TODO(gkm): prune __eh_frame entries superseded by __unwind_info
				// TODO(gkm): how do we align the 2nd-level pages?

				UnwindInfoSection::UnwindInfoSection()
				: SyntheticSection(segment_names::text, section_names::unwindInfo) {}

				bool UnwindInfoSection::isNeeded() const {
				return (compactUnwindSection != nullptr);
				}

				// Scan the __LD,__compact_unwind entries and compute the space needs of
				// __TEXT,__unwind_info and __TEXT,__eh_frame

				void UnwindInfoSection::finalize() {
				if (compactUnwindSection == nullptr)
				return;

				// At this point, the address space for __TEXT,__text has been
				// assigned, so we can relocate the __LD,__compact_unwind entries
				// into a temporary buffer. Relocation is necessary in order to sort
				// the CU entries by function address. Sorting is necessary so that
				// we can fold adjacent CU entries with identical
				// encoding+personality+lsda. Folding is necessary because it reduces
				// the number of CU entries by as much as 3 orders of magnitude!
				compactUnwindSection->finalize();
				assert(compactUnwindSection->getSize() % sizeof(CompactUnwindEntry64) == 0);
				size_t cuCount =
				compactUnwindSection->getSize() / sizeof(CompactUnwindEntry64);
				cuVector.resize(cuCount);
				// Relocate all __LD,__compact_unwind entries
				compactUnwindSection->writeTo(reinterpret_cast<uint8_t *>(cuVector.data()));

				// Rather than sort & fold the 32-byte entries directly, we create a
				// vector of pointers to entries and sort & fold that instead.
				cuPtrVector.reserve(cuCount);
				for (const auto &cuEntry : cuVector)
				cuPtrVector.emplace_back(&cuEntry);
				std::sort(cuPtrVector.begin(), cuPtrVector.end(),
				[](const CompactUnwindEntry64 *a, const CompactUnwindEntry64 *b) {
				return a->functionAddress < b->functionAddress;
				});

				// Fold adjacent entries with matching encoding+personality+lsda
				// We use three iterators on the same cuPtrVector to fold in-situ:
				// (1) `foldBegin` is the first of a potential sequence of matching entries
				// (2) `foldEnd` is the first non-matching entry after `foldBegin`.
				// The semi-open interval [ foldBegin .. foldEnd ) contains a range
				// of entries that can be folded into a single entry and written to ...
				// (3) `foldWrite`
				auto foldWrite = cuPtrVector.begin();
				for (auto foldBegin = cuPtrVector.begin(); foldBegin < cuPtrVector.end();) {
				auto foldEnd = foldBegin;
				while (++foldEnd < cuPtrVector.end() &&
				(*foldBegin)->encoding == (*foldEnd)->encoding &&
				(*foldBegin)->personality == (*foldEnd)->personality &&
				(*foldBegin)->lsda == (*foldEnd)->lsda)
				;
				*foldWrite++ = *foldBegin;
				foldBegin = foldEnd;
				}
				cuPtrVector.erase(foldWrite, cuPtrVector.end());

				// Count frequencies of the folded encodings
				llvm::DenseMap<compact_unwind_encoding_t, size_t> encodingFrequencies;
				for (auto cuPtrEntry : cuPtrVector)
				encodingFrequencies[cuPtrEntry->encoding]++;
				if (encodingFrequencies.size() > UNWIND_INFO_COMMON_ENCODINGS_MAX)
				error("TODO(gkm): handle common encodings table overflow");

				// Make a table of encodings, sorted by descending frequency
				for (const auto &frequency : encodingFrequencies)
				commonEncodings.emplace_back(frequency);
				std::sort(commonEncodings.begin(), commonEncodings.end(),
				[](const std::pair<compact_unwind_encoding_t, size_t> &a,
				const std::pair<compact_unwind_encoding_t, size_t> &b) {
				if (a.second == b.second)
				// When frequencies match, secondarily sort on encoding
				// to maintain parity with validate-unwind-info.py
				return a.first > b.first;
				return a.second > b.second;
				});

				// Split folded encodings into pages, limited by capacity of a page
				// and the 24-bit range of function offset
				//
				// Record the page splits as a vector of iterators on cuPtrVector
				// such that successive elements form a semi-open interval. E.g.,
				// page X's bounds are thus: [ pageBounds[X] .. pageBounds[X+1] )
				//
				// Note that pageBounds.size() is one greater than the number of
				// pages, and pageBounds.back() holds the sentinel cuPtrVector.cend()
				pageBounds.push_back(cuPtrVector.cbegin());
				// TODO(gkm): cut 1st page entries short to accommodate section headers ???
				CompactUnwindEntry64 cuEntryKey;
				for (size_t i = 0;;) {
				// Limit the search to entries that can fit within a 4 KiB page.
				const auto pageBegin = pageBounds[0] + i;
				const auto pageMax =
				pageBounds[0] +
				std::min(i + UNWIND_INFO_COMPRESSED_SECOND_LEVEL_ENTRIES_MAX,
				cuPtrVector.size());
				// Exclude entries with functionOffset that would overflow 24 bits
				cuEntryKey.functionAddress = (*pageBegin)->functionAddress +
				UNWIND_INFO_COMPRESSED_ENTRY_FUNC_OFFSET_MASK;
				const auto pageBreak = std::lower_bound(
				pageBegin, pageMax, &cuEntryKey,
				[](const CompactUnwindEntry64 *a, const CompactUnwindEntry64 *b) {
				return a->functionAddress < b->functionAddress;
				});
				pageBounds.push_back(pageBreak);
				if (pageBreak == cuPtrVector.cend())
				break;
				i = pageBreak - cuPtrVector.cbegin();
				}

				// compute size of __TEXT,__unwind_info section
				level2PagesOffset =
				sizeof(unwind_info_section_header) +
				commonEncodings.size() * sizeof(uint32_t) +
				personalities.size() * sizeof(uint32_t) +
				pageBounds.size() * sizeof(unwind_info_section_header_index_entry) +
				lsdaEntries.size() * sizeof(unwind_info_section_header_lsda_index_entry);
				unwindInfoSize = level2PagesOffset +
				(pageBounds.size() - 1) *
				sizeof(unwind_info_compressed_second_level_page_header) +
				cuPtrVector.size() * sizeof(uint32_t);
				}

				// All inputs are relocated and output addresses are known, so write!

				void UnwindInfoSection::writeTo(uint8_t *buf) const {
				// section header
				auto *uip = reinterpret_cast<unwind_info_section_header *>(buf);
				uip->version = 1;
				uip->commonEncodingsArraySectionOffset = sizeof(unwind_info_section_header);
				uip->commonEncodingsArrayCount = commonEncodings.size();
				uip->personalityArraySectionOffset =
				uip->commonEncodingsArraySectionOffset +
				(uip->commonEncodingsArrayCount * sizeof(uint32_t));
				uip->personalityArrayCount = personalities.size();
				uip->indexSectionOffset = uip->personalityArraySectionOffset +
				(uip->personalityArrayCount * sizeof(uint32_t));
				uip->indexCount = pageBounds.size();

				// Common encodings
				auto *i32p = reinterpret_cast<uint32_t *>(&uip[1]);
				for (const auto &encoding : commonEncodings)
				*i32p++ = encoding.first;

				// Personalities
				for (const auto &personality : personalities)
				*i32p++ = personality;

				// Level-1 index
				uint32_t lsdaOffset =
				uip->indexSectionOffset +
				uip->indexCount * sizeof(unwind_info_section_header_index_entry);
				uint64_t l2PagesOffset = level2PagesOffset;
				auto *iep = reinterpret_cast<unwind_info_section_header_index_entry *>(i32p);
				for (size_t i = 0; i < pageBounds.size() - 1; i++) {
				iep->functionOffset = (*pageBounds[i])->functionAddress;
				iep->secondLevelPagesSectionOffset = l2PagesOffset;
				iep->lsdaIndexArraySectionOffset = lsdaOffset;
				iep++;
				// TODO(gkm): pad to 4 KiB page boundary ???
				size_t entryCount = pageBounds[i + 1] - pageBounds[i];
				uint64_t pageSize =
				sizeof(unwind_info_compressed_second_level_page_header) +
				entryCount * sizeof(uint32_t);
				l2PagesOffset += pageSize;
				}
				// Level-1 sentinel
				const CompactUnwindEntry64 &cuEnd = cuVector.back();
				iep->functionOffset = cuEnd.functionAddress + cuEnd.functionLength;
				iep->secondLevelPagesSectionOffset = 0;
				iep->lsdaIndexArraySectionOffset = lsdaOffset;
				iep++;

				// LSDAs
				auto *lep =
				reinterpret_cast<unwind_info_section_header_lsda_index_entry *>(iep);
				for (const auto &lsda : lsdaEntries) {
				lep->functionOffset = lsda.functionOffset;
				lep->lsdaOffset = lsda.lsdaOffset;
				lep++; // advance so that each LSDA entry gets its own slot
				}

				// Create a map from encoding to common-encoding-table index.
				// Compact encoding entries use 7 bits to index the common-encoding table.
				size_t i = 0;
				llvm::DenseMap<compact_unwind_encoding_t, size_t> commonEncodingIndexes;
				for (const auto &encoding : commonEncodings)
				commonEncodingIndexes[encoding.first] = i++;

				// Level-2 pages
				auto *p2p =
				reinterpret_cast<unwind_info_compressed_second_level_page_header *>(lep);
				for (size_t i = 0; i < pageBounds.size() - 1; i++) {
				p2p->kind = UNWIND_SECOND_LEVEL_COMPRESSED;
				p2p->entryPageOffset =
				sizeof(unwind_info_compressed_second_level_page_header);
				p2p->entryCount = pageBounds[i + 1] - pageBounds[i];
				p2p->encodingsPageOffset =
				p2p->entryPageOffset + p2p->entryCount * sizeof(uint32_t);
				p2p->encodingsCount = 0;
				auto *ep = reinterpret_cast<uint32_t *>(&p2p[1]);
				auto cuPtrVectorIt = pageBounds[i];
				uintptr_t functionAddressBase = (*cuPtrVectorIt)->functionAddress;
				while (cuPtrVectorIt < pageBounds[i + 1]) {
				const CompactUnwindEntry64 *cuep = *cuPtrVectorIt++;
				size_t cueIndex = commonEncodingIndexes.lookup(cuep->encoding);
				*ep++ = ((cueIndex << UNWIND_INFO_COMPRESSED_ENTRY_FUNC_OFFSET_BITS) |
				(cuep->functionAddress - functionAddressBase));
				}
				p2p =
				reinterpret_cast<unwind_info_compressed_second_level_page_header *>(ep);
				}
				assert(getSize() ==
				static_cast<size_t>((reinterpret_cast<uint8_t *>(p2p) - buf)));
				}
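
The `finalize()` steps above (sort, fold, split into pages) and the entry packing in `writeTo()` can be sketched compactly in Python. This is a minimal illustration under stated assumptions, not the lld implementation: `CuEntry`, `fold_adjacent`, `split_pages`, and `pack_entry` are hypothetical names, the 12-byte page header size is assumed, and the remaining constants are derived from the 4 KiB page size, the 24-bit function offset, and the 127-encoding limit quoted in the comments.

```python
from bisect import bisect_left
from collections import namedtuple

# Hypothetical stand-in for CompactUnwindEntry64; only the fields used here.
CuEntry = namedtuple("CuEntry", "function_address encoding personality lsda")

PAGE_SIZE = 4096            # 4 KiB second-level page
PAGE_HEADER_SIZE = 12       # assumed size of the compressed page header
ENTRY_SIZE = 4              # one packed 32-bit entry
MAX_ENTRIES_PER_PAGE = (PAGE_SIZE - PAGE_HEADER_SIZE) // ENTRY_SIZE
FUNC_OFFSET_BITS = 24
FUNC_OFFSET_MASK = (1 << FUNC_OFFSET_BITS) - 1
COMMON_ENCODINGS_MAX = 127


def fold_adjacent(entries):
    """Sort by address, then keep only the first entry of each run whose
    encoding, personality, and LSDA all match."""
    folded = []
    for entry in sorted(entries, key=lambda e: e.function_address):
        if folded and folded[-1][1:] == entry[1:]:
            continue  # same encoding+personality+lsda as the run's first entry
        folded.append(entry)
    return folded


def split_pages(addresses):
    """Return page bounds as indexes into the sorted `addresses` list.
    Page X covers addresses[bounds[X]:bounds[X + 1]]."""
    bounds = [0]
    while bounds[-1] < len(addresses):
        begin = bounds[-1]
        limit = min(begin + MAX_ENTRIES_PER_PAGE, len(addresses))
        # First index whose address is >= page base + mask, mirroring the
        # std::lower_bound call in the diff.
        key = addresses[begin] + FUNC_OFFSET_MASK
        bounds.append(bisect_left(addresses, key, begin, limit))
    return bounds


def pack_entry(encoding_index, function_address, page_base):
    """Pack a common-encoding index and a page-relative offset into 32 bits."""
    offset = function_address - page_base
    assert 0 <= offset <= FUNC_OFFSET_MASK
    assert 0 <= encoding_index < COMMON_ENCODINGS_MAX
    return (encoding_index << FUNC_OFFSET_BITS) | offset
```

As in the C++ code, the page bounds list is one element longer than the number of pages, and its last element is the end sentinel.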

lld/MachO/Writer.cpp

Show All 11 Lines
#include "InputSection.h"		#include "InputSection.h"
#include "MergedOutputSection.h"		#include "MergedOutputSection.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"
		#include "UnwindInfoSection.h"

#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
#include "llvm/Config/llvm-config.h"		#include "llvm/Config/llvm-config.h"
#include "llvm/Support/LEB128.h"		#include "llvm/Support/LEB128.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
Show All 24 Lines	public:
void run();		void run();

std::unique_ptr<FileOutputBuffer> &buffer;		std::unique_ptr<FileOutputBuffer> &buffer;
uint64_t addr = 0;		uint64_t addr = 0;
uint64_t fileOff = 0;		uint64_t fileOff = 0;
MachHeaderSection *header = nullptr;		MachHeaderSection *header = nullptr;
StringTableSection *stringTableSection = nullptr;		StringTableSection *stringTableSection = nullptr;
SymtabSection *symtabSection = nullptr;		SymtabSection *symtabSection = nullptr;
		UnwindInfoSection *unwindInfoSection = nullptr;
};		};

// LC_DYLD_INFO_ONLY stores the offsets of symbol import/export information.		// LC_DYLD_INFO_ONLY stores the offsets of symbol import/export information.
class LCDyldInfo : public LoadCommand {		class LCDyldInfo : public LoadCommand {
public:		public:
LCDyldInfo(BindingSection *bindingSection,		LCDyldInfo(BindingSection *bindingSection,
WeakBindingSection *weakBindingSection,		WeakBindingSection *weakBindingSection,
LazyBindingSection *lazyBindingSection,		LazyBindingSection *lazyBindingSection,
▲ Show 20 Lines • Show All 334 Lines • ▼ Show 20 Lines

static int segmentOrder(OutputSegment *seg) {		static int segmentOrder(OutputSegment *seg) {
return StringSwitch<int>(seg->name)		return StringSwitch<int>(seg->name)
.Case(segment_names::pageZero, -2)		.Case(segment_names::pageZero, -2)
.Case(segment_names::text, -1)		.Case(segment_names::text, -1)
// Make sure __LINKEDIT is the last segment (i.e. all its hidden		// Make sure __LINKEDIT is the last segment (i.e. all its hidden
// sections must be ordered after other sections).		// sections must be ordered after other sections).
.Case(segment_names::linkEdit, std::numeric_limits<int>::max())		.Case(segment_names::linkEdit, std::numeric_limits<int>::max())
.Default(0);		.Default(0);
}		}
		int3: This seems a bit hacky. Do we really have to put the `__compact_unwind` sections in an OutputSegment? If we're not going to output them, could we just put all these InputSections in a vector contained in `UnwindInfoSection`? Also, we may want to support emitting object files at some point via `-r`, in which case we'll actually want to emit the `__LD` segment at a valid position.
		gkm (author): I agree that it could use some improvement. I will revisit. Behaviorally, `__LD,__compact_unwind` is a `MergedOutputSegment`, except with different timing and destination: we relocate & write it to a temp buffer from `Writer::assignAddresses()` rather than writing to the final link output from `Writer::writeSections()`. We need to relocate it fully in order to compute the size of the `__TEXT,__unwind_info` section when laying out sections & binding to addresses. You are quite right regarding `-r`, which makes `__LD,__compact_unwind` a completely normal `MergedOutputSegment`.
		int3: I think it might be worthwhile to duplicate some of the code in `InputSection::writeTo` to specialize it for the compact unwind case. The current `writeTo` code assumes that both the source and target sections referenced by a relocation have their final output addresses determined. But this is not true for the compact unwind section -- it doesn't have a valid output address. It just happens to work because there are no pcrel relocations involved. We should really error out if we see a pcrel relocation while relocating `__compact_unwind`. Creating a separate compact unwind relocating method would free us from having to create `Output{Segments, Sections}` that we discard later. I'm hoping we can retain the invariant where we add OutputSections to OutputSegments only after we have determined that they are needed.
		gkm (author): I still owe you an answer for this one ...
		gkm (author): I believe this latest rev works well and isn't so ugly. I retain the normal flow for gathering input sections and attaching them to `MergedOutputSection` for `__LD`, but I exclude it from `outputSegments` (without `-r`) so further downstream processing doesn't happen. I do insert `__LD` into `nameToOutputSegment[]` so that the synthetic `UnwindInfoSection::finalize()` can find it for processing. That seems minimally invasive and doesn't need a subclass. All evidence I have seen shows that reloc types in `__LD,__compact_unwind` are always absolute and never PC-relative. The only output address that must be determined is `lld`'s VMA. `OutputSection::writeTo()` accepts a pointer to a buffer, which is normally mapped to the linker's output file. However, when relocating `__compact_unwind`, the buffer is the internal `std::vector<CompactUnwindEntry64>::data()` rather than the output file.

static int sectionOrder(OutputSection *osec) {		static int sectionOrder(OutputSection *osec) {
StringRef segname = osec->parent->name;		StringRef segname = osec->parent->name;
// Sections are uniquely identified by their segment + section name.		// Sections are uniquely identified by their segment + section name.
if (segname == segment_names::text) {		if (segname == segment_names::text) {
if (osec->name == section_names::header)		return StringSwitch<int>(osec->name)
return -1;		.Case(section_names::header, -1)
		.Case(section_names::unwindInfo, std::numeric_limits<int>::max() - 1)
		.Case(section_names::ehFrame, std::numeric_limits<int>::max())
		int3: how about using `INT_MAX - 1` here instead, to cover the unlikely case where we have more than 100 sections? ld64 seems to do that
		.Default(0);
} else if (segname == segment_names::linkEdit) {		} else if (segname == segment_names::linkEdit) {
return StringSwitch<int>(osec->name)		return StringSwitch<int>(osec->name)
.Case(section_names::binding, -6)		.Case(section_names::binding, -6)
.Case(section_names::weakBinding, -5)		.Case(section_names::weakBinding, -5)
.Case(section_names::lazyBinding, -4)		.Case(section_names::lazyBinding, -4)
.Case(section_names::export_, -3)		.Case(section_names::export_, -3)
.Case(section_names::symbolTable, -2)		.Case(section_names::symbolTable, -2)
.Case(section_names::stringTable, -1)		.Case(section_names::stringTable, -1)
Show All 40 Lines	for (auto *osec : seg->getSections()) {
}		}
}		}
}		}
}		}

void Writer::createOutputSections() {		void Writer::createOutputSections() {
// First, create hidden sections		// First, create hidden sections
stringTableSection = make<StringTableSection>();		stringTableSection = make<StringTableSection>();
		unwindInfoSection = make<UnwindInfoSection>(); // TODO(gkm): only when no -r
symtabSection = make<SymtabSection>(*stringTableSection);		symtabSection = make<SymtabSection>(*stringTableSection);

switch (config->outputType) {		switch (config->outputType) {
case MH_EXECUTE:		case MH_EXECUTE:
make<PageZeroSection>();		make<PageZeroSection>();
break;		break;
case MH_DYLIB:		case MH_DYLIB:
break;		break;
Show All 10 Lines	for (InputSection *isec : inputSections) {
if (osec == nullptr)		if (osec == nullptr)
osec = make<MergedOutputSection>(isec->name);		osec = make<MergedOutputSection>(isec->name);
osec->mergeInput(isec);		osec->mergeInput(isec);
}		}

for (const auto &it : mergedOutputSections) {		for (const auto &it : mergedOutputSections) {
StringRef segname = it.first.first;		StringRef segname = it.first.first;
MergedOutputSection *osec = it.second;		MergedOutputSection *osec = it.second;
		if (unwindInfoSection && segname == segment_names::ld) {
		assert(osec->name == section_names::compactUnwind);
		unwindInfoSection->setCompactUnwindSection(osec);
		} else
getOrCreateOutputSegment(segname)->addOutputSection(osec);		getOrCreateOutputSegment(segname)->addOutputSection(osec);
}		}

for (SyntheticSection *ssec : syntheticSections) {		for (SyntheticSection *ssec : syntheticSections) {
auto it = mergedOutputSections.find({ssec->segname, ssec->name});		auto it = mergedOutputSections.find({ssec->segname, ssec->name});
if (it == mergedOutputSections.end()) {		if (it == mergedOutputSections.end()) {
if (ssec->isNeeded())		if (ssec->isNeeded())
getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);		getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);
} else {		} else {
▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	void macho::createSyntheticSections() {
in.lazyBinding = make<LazyBindingSection>();		in.lazyBinding = make<LazyBindingSection>();
in.exports = make<ExportSection>();		in.exports = make<ExportSection>();
in.got = make<GotSection>();		in.got = make<GotSection>();
in.tlvPointers = make<TlvPointerSection>();		in.tlvPointers = make<TlvPointerSection>();
in.lazyPointers = make<LazyPointerSection>();		in.lazyPointers = make<LazyPointerSection>();
in.stubs = make<StubsSection>();		in.stubs = make<StubsSection>();
in.stubHelper = make<StubHelperSection>();		in.stubHelper = make<StubHelperSection>();
in.imageLoaderCache = make<ImageLoaderCacheSection>();		in.imageLoaderCache = make<ImageLoaderCacheSection>();
}		}
		int3: nit: Synthetic sections that only need to be used within `Writer` -- like `SymtabSection` -- can be a member of `Writer` instead of `InStruct`.
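
The `__TEXT` branch of `sectionOrder()` in this file pins the header first and pushes `__unwind_info` and `__eh_frame` to the end, with every other section sharing key 0. A rough Python model of that ordering follows. It is only a sketch: `section_order` and `TEXT_SECTION_ORDER` are hypothetical names, the literal section-name strings are assumptions about what the `section_names::` constants hold, and Python's stable `sorted` stands in for whatever sort lld applies.

```python
INT_MAX = 2**31 - 1  # stand-in for std::numeric_limits<int>::max()

# Assumed string values for section_names::header, unwindInfo, and ehFrame.
TEXT_SECTION_ORDER = {
    "__mach_header": -1,
    "__unwind_info": INT_MAX - 1,
    "__eh_frame": INT_MAX,
}


def section_order(name):
    """Sort key for a section in __TEXT; unlisted sections share key 0."""
    return TEXT_SECTION_ORDER.get(name, 0)


names = ["__eh_frame", "__text", "__unwind_info", "__mach_header", "__stubs"]
ordered = sorted(names, key=section_order)
```

Using the extreme values rather than small constants means the two unwind sections stay last no matter how many ordinary sections are present.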

lld/test/MachO/compact-unwind.test

This file was added.

				# REQUIRES: x86

				# FIXME(gkm): This test is fast on a Release tree, and slow (~10s) on
				# a Debug tree mostly because of llvm-mc. Is there a way to prefer the
				# fast installed llvm-mc rather than the slow one in our Debug tree?

				int3: My understanding of the need for generate-cfi-funcs.py is that some test cases need to be very large in order to exercise the edge cases in the implementation. In addition to providing that functionality, this script also serves as a fuzzer by generating random test cases. Would it be possible for the unit test to instead enumerate a small number of cases that cover all important code paths, in order that we run fewer test cases?
				gkm (author): It is already a single test case, made deterministic by passing `--seed` so that `check-lld-macho` runs are consistent. The multiplicity that might be reduced is the number of functions generated for that one test case. However, since I need to test boundary conditions around breaking 1021-entry pages, my options are limited. If there is a way to prefer an installed `llvm-mc` over the locally built one, that would solve it.
				# If headers and offsets are proper, then ...
				#
				# 1) llvm-objdump will not crash, and exit with good status
				#
				# 2) Summary encodings from the input object will match
				# those from the linked output
				#
				# 3) Encodings & symbols from the input object will match
				# those from the linked output

				# RUN: %python %S/tools/generate-cfi-funcs.py --seed=johnnyapple >%t.s
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin19.0.0 -o %t.o %t.s
				int3: maybe change this seed to something that's not company-specific
				# RUN: lld -flavor darwinnew -Z -L%S/Inputs/MacOSX.sdk/usr/lib -lSystem -o %t %t.o
				# RUN: llvm-objdump --unwind-info --syms %t %t.o >%t.dump
				# RUN: %python %S/tools/validate-unwind-info.py %t.dump
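
The test stays reproducible because the generator script is seeded with a fixed string. The snippet below is a small sketch of that property, not part of the test: `sample_sizes` is a hypothetical helper, and the size range simply reuses the initial `func_size_low` and `func_size_high` values from generate-cfi-funcs.py.

```python
import random


def sample_sizes(seed, count=5):
    """Draw `count` pseudo-random function sizes from a string-seeded generator."""
    rng = random.Random(seed)  # private generator; global state is untouched
    return [rng.randint(0x10, 0x100) * 0x10 for _ in range(count)]


first = sample_sizes("some-seed")
second = sample_sizes("some-seed")
```

Both calls return the same list, which is what lets `check-lld-macho` compare the same generated input on every run.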

lld/test/MachO/tools/generate-cfi-funcs.py

This file was added.

Property	Old Value	New Value
File Mode	null	100755

				#!/usr/bin/env python

				"""Generate skeletal functions with a variety of .cfi_ directives.
				The purpose is to produce object-file test inputs to lld with a
				variety of compact unwind encodings.
				"""
				import random
				import argparse
				import string
				from math import factorial
				from itertools import permutations

				lsda_n = 0
				lsda_odds = 0.0
				func_size_low = 0x10
				func_size_high = 0x100
				saved_regs = ["%r15", "%r14", "%r13", "%r12", "%rbx"]
				saved_regs_combined = list(list(permutations(saved_regs, i))
				for i in range(0,6))

				def print_function(name: str):
				thakis: llvm still supports python2.7 for now.
				thakis: I went ahead and made the scripts 2.7-compatible in e22a4fd59de668af1cb943e23a6f4bfc93090e0f. Once we drop 2.7 support, it's hopefully easy to revert. Sorry for the trouble!
				global lsda_odds
				have_lsda = (random.random() < lsda_odds)
				frame_size = random.randint(4, 64) * 16
				frame_offset = -random.randint(0, (frame_size // 16 - 4)) * 16
				reg_count = random.randint(0, 4)
				reg_combo = random.randint(0, factorial(reg_count) - 1)
				regs_saved = saved_regs_combined[reg_count][reg_combo]
				global func_size_low, func_size_high
				func_size = random.randint(func_size_low, func_size_high) * 0x10
				func_size_high += 1
				if func_size_high % 0x10 == 0:
				func_size_low += 1

				print(f"""\
				### {name} regs={reg_count} frame={frame_size} lsda={have_lsda} size={func_size}
				.section __TEXT,__text,regular,pure_instructions
				.p2align 4, 0x90
				.globl {name}
				{name}:
				.cfi_startproc""")
				if have_lsda:
				global lsda_n
				lsda_n += 1
				print(f"""\
				.cfi_personality 155, ___gxx_personality_v0
				.cfi_lsda 16, Lexception{lsda_n}""")
				print(f"""\
				pushq %rbp
				.cfi_def_cfa_offset {frame_size}
				.cfi_offset %rbp, {frame_offset+(6*8)}
				movq %rsp, %rbp
				.cfi_def_cfa_register %rbp""")
				for i in range(reg_count):
				print(f".cfi_offset {regs_saved[i]}, {frame_offset+(i*8)}")
				print(f"""\
				.fill {func_size - 6}
				popq %rbp
				retq
				.cfi_endproc
				""")

				if have_lsda:
				print(f"""\
				.section __TEXT,__gcc_except_tab
				.p2align 2
				Lexception{lsda_n}:
				.space 0x10
				""")
				return func_size

				def random_seed():
				"""Generate a seed that can easily be passed back in via --seed=STRING"""
				return ''.join(random.choice(string.ascii_lowercase) for i in range(10))

				def main():
				parser = argparse.ArgumentParser(
				description=__doc__,
				epilog="""\
				Function sizes begin small then monotonically increase. The goal is
				to produce early pages that are full and later pages that are less
				than full, in order to test handling for both cases. Full pages
				contain the maximum of 1021 compact unwind entries for a total page
				size = 4 KiB.

				Use --pages=N or --functions=N to control the size of the output.
				Default is --pages=2, meaning produce at least two full pages of
				compact unwind entries, plus some more. The calculation is sloppy.
				""")
				parser.add_argument('--seed', type=str, default=random_seed(),
				help='Seed the random number generator')
				parser.add_argument('--pages', type=int, default=2,
				help='Number of compact-unwind pages')
				parser.add_argument('--functions', type=int, default=None,
				help='Number of functions to generate')
				parser.add_argument('--encodings', type=int, default=127,
				help='Maximum number of unique unwind encodings (default = 127)')
				parser.add_argument('--lsda', type=int, default=0,
				help='Percentage of functions with personality & LSDA (default = 0)')
				args = parser.parse_args()
				random.seed(args.seed)
				p2align = 14
				global lsda_odds
				lsda_odds = args.lsda / 100.0

				print(f"""\
				### seed={args.seed} lsda={lsda_odds} p2align={p2align}
				.section __TEXT,__text,regular,pure_instructions
				.p2align {p2align}, 0x90
				""")

				size = 0
				base = (1 << p2align)
				if args.functions:
				for n in range(args.functions):
				size += print_function(f"x{size+base:08x}")
				else:
				while size < (args.pages << 24):
				size += print_function(f"x{size+base:08x}")

				print(f"""\
				.section __TEXT,__text,regular,pure_instructions
				.globl _main
				.p2align 4, 0x90
				_main:
				retq

				.p2align 4, 0x90
				___gxx_personality_v0:
				retq
				""")


				if __name__ == '__main__':
				main()
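
One detail of the generator worth spelling out is how `saved_regs_combined` is indexed. `permutations(saved_regs, k)` yields 5!/(5-k)! ordered selections, while `print_function` draws an index below `factorial(reg_count)`. The sketch below just counts both quantities; `arrangement_counts` is a hypothetical helper, and the reading that only a prefix of the available arrangements can be selected is my interpretation of the script rather than something stated in the review.

```python
from itertools import permutations
from math import factorial

saved_regs = ["%r15", "%r14", "%r13", "%r12", "%rbx"]


def arrangement_counts(regs):
    """Map k to the number of ordered selections of k registers from `regs`."""
    return {k: len(list(permutations(regs, k))) for k in range(len(regs) + 1)}


counts = arrangement_counts(saved_regs)
# Largest index the script can draw for each reg_count in 0..4.
max_index = {k: factorial(k) - 1 for k in range(5)}
```

Because `factorial(k)` never exceeds the number of arrangements for `k` in 0..4, the index is always in range.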

lld/test/MachO/tools/validate-unwind-info.py

This file was added.

Property	Old Value	New Value
File Mode	null	100755

				#!/usr/bin/env python

				"""Validate compact unwind info by cross checking the llvm-objdump
				reports of the input object file vs final linked output.
				"""
				import sys
				import argparse
				import re
				from pprint import pprint

				def main():
				hex = r"[a-f\d]"
				hex8 = hex + "{8}"

				parser = argparse.ArgumentParser(description=__doc__)
				parser.add_argument('files', metavar='FILES', nargs='*',
				help='output of (llvm-objdump --unwind-info --syms) for object file(s) plus final linker output')
				parser.add_argument('--debug', action='store_true')
				args = parser.parse_args()

				if args.files:
				objdump_string = ''.join([open(f).read() for f in args.files])
				else:
				objdump_string = sys.stdin.read()

				object_encodings_list = [(symbol, encoding, personality, lsda)
				for symbol, encoding, personality, lsda in
				re.findall(rf"start:\s+0x{hex}+\s+(\w+)\s+" +
				rf"length:\s+0x{hex}+\s+" +
				rf"compact encoding:\s+0x({hex}+)(?:\s+" +
				rf"personality function:\s+0x({hex}+)\s+\w+\s+" +
				rf"LSDA:\s+0x({hex}+)\s+\w+(?: \+ 0x{hex}+)?)?",
				objdump_string, re.DOTALL)]
				object_encodings_map = {symbol:encoding
				for symbol, encoding, _, _ in object_encodings_list}
				if not object_encodings_map:
				sys.exit("no object encodings found in input")
				int3: JFYI, the parens are unnecessary, but I'm fine if you prefer them

				program_symbols_map = {address:symbol
				for address, symbol in
				re.findall(rf"^{hex8}({hex8}) g\s+F __TEXT,__text (x\1)$",
				objdump_string, re.MULTILINE)}
				if not program_symbols_map:
				sys.exit("no program symbols found in input")

				program_common_encodings = (
				re.findall(rf"^\s+encoding\[\d+\]: 0x({hex}+)$",
				objdump_string, re.MULTILINE))
				if not program_common_encodings:
				sys.exit("no common encodings found in input")

				program_encodings_map = {program_symbols_map[address]:encoding
				for address, encoding in
				re.findall(rf"^\s+\[\d+\]: function offset=0x({hex}+), " +
				rf"encoding\[\d+\]=0x({hex}+)$",
				objdump_string, re.MULTILINE)}
				if not program_encodings_map:
				sys.exit("no program encodings found in input")

				# Fold adjacent entries from the object file that have matching encodings
				# TODO(gkm) add check for personality+lsda
				encoding0 = 0
				for symbol in sorted(object_encodings_map):
				encoding = object_encodings_map[symbol]
				fold = (encoding == encoding0)
				if fold:
				del object_encodings_map[symbol]
				if args.debug:
				print(f"{'delete' if fold else 'retain'} {symbol} with {encoding}")
				int3: I don't suppose you meant to leave `and False` in?
				encoding0 = encoding

				if program_encodings_map != object_encodings_map:
				if args.debug:
				pprint(f"program encodings map:\n{program_encodings_map}")
				pprint(f"object encodings map:\n{object_encodings_map}")
				sys.exit("encoding maps differ")

				# Count frequency of object-file folded encodings
				# and compare with the program-file common encodings table
				encoding_frequency_map = {}
				for _, encoding in object_encodings_map.items():
				encoding_frequency_map[encoding] = 1 + encoding_frequency_map.get(encoding, 0)
				encoding_frequencies = [x for x in
				sorted(encoding_frequency_map,
				key=lambda x: (encoding_frequency_map.get(x), x),
				reverse=True)]

				if program_common_encodings != encoding_frequencies:
				if args.debug:
				pprint(f"program common encodings:\n{program_common_encodings}")
				int3: this seems unnecessary given that we'll be exiting anyway when this function returns
				gkm (author): True, but since I explicitly `sys.exit("... diagnostic ...")` on error, for symmetry I chose to explicitly `sys.exit()` on success. I dislike falling through the floor of main, despite language guarantees, and since `sys.exit()` accepts a string arg on error, I prefer that to using numeric `return STATUS`.
				pprint(f"object encoding frequencies:\n{encoding_frequencies}")
				sys.exit("encoding frequencies differ")


				if __name__ == '__main__':
				main()
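
The validator's final check depends on ranking encodings the same way the linker does: by descending frequency, with ties broken by descending encoding value. A minimal sketch of that ranking is below. `rank_encodings` is a hypothetical name, and integers are used for the encodings, whereas the script above compares the hex strings captured by its regular expressions.

```python
from collections import Counter


def rank_encodings(folded_encodings):
    """Order unique encodings by descending frequency, then descending value."""
    freq = Counter(folded_encodings)
    return sorted(freq, key=lambda enc: (freq[enc], enc), reverse=True)


ranking = rank_encodings([0x2, 0x1, 0x2, 0x3, 0x1])
```

Here `0x2` and `0x1` both occur twice, so the tie-break places `0x2` first and the single `0x3` last.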

[lld-macho] create __TEXT,__unwind_info from __LD,__compact_unwind (Closed, Public)

Revision Contents

Diff 292938

lld/MachO/CMakeLists.txt

lld/MachO/OutputSegment.h

lld/MachO/SyntheticSections.h

lld/MachO/UnwindInfoSection.h

lld/MachO/UnwindInfoSection.cpp

lld/MachO/Writer.cpp

lld/test/MachO/compact-unwind.test

lld/test/MachO/tools/generate-cfi-funcs.py

lld/test/MachO/tools/validate-unwind-info.py
