This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/MachO/
-
MachO/
-
Arch/
-
ARM.cpp
-
ARM64Common.h
-
ARM64Common.cpp
-
X86_64.cpp
-
Driver.h
-
Driver.cpp
-
DriverUtils.cpp
-
InputFiles.h
-
InputFiles.cpp
-
LTO.h
-
LTO.cpp
-
ObjC.h
-
ObjC.cpp
-
Target.h
-
llvm/
-
include/llvm/
-
llvm/
-
LTO/
-
Caching.h
-
Support/
-
MemoryBuffer.h
-
MemoryBufferRef.h
-
lib/
-
LTO/
-
Caching.cpp
-
Object/
-
Archive.cpp
-
Support/
-
MemoryBuffer.cpp
1
MemoryBufferRef.cpp

Differential D106213

[lld-macho] Enable copy-on-write for input buffers
AbandonedPublic

Authored by int3 on Jul 16 2021, 11:41 PM.

Download Raw Diff

Details

Reviewers

gkm

Group Reviewers

Restricted Project

Summary

The Mach-O format makes extensive use of embedded addends,
particularly for x86_64. This makes ICF's job a bit more difficult:
sections that are otherwise semantically identical may have different
raw bytes due to the embedded addends. Hashing and comparing the raw
bytes naively means that we miss folding opportunities.

This diff canonicalizes these sections by writing over those embedded
addends with zeros. In order to make this possible, the our previously
read-only MemoryBuffers need to be made into copy-on-write.

This requires some awkward casts, but I'm not sure there's a better way to do
things...

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	2,910 ms	x64 debian > libarcher.critical::critical.c
	3,040 ms	x64 debian > libarcher.parallel::parallel-firstprivate.c
	2,840 ms	x64 debian > libarcher.parallel::parallel-simple2.c
	2,750 ms	x64 debian > libarcher.races::critical-unrelated.c
	2,750 ms	x64 debian > libarcher.races::lock-nested-unrelated.c
		View Full Test Results (16 Failed)

Event Timeline

int3 created this revision.Jul 16 2021, 11:41 PM

Herald added a reviewer: gkm. · View Herald TranscriptJul 16 2021, 11:41 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: ormris, dexonsmith, pengfei and 2 others. · View Herald Transcript

int3 requested review of this revision.Jul 16 2021, 11:41 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 16 2021, 11:41 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B114667: Diff 359537.Jul 17 2021, 12:32 AM

How much of a difference does ICF'ing these make?

Right now, pretty small. (0.3% for the internal build I'm testing, and 0.002% for chromium_framework.) But I'm building upon this to apply ICF to functions that have unwind info, since CompactUnwindEntries use unsigned section relocations -- which embed their addends -- to refer to functions. This has a much larger impact -- 3.2% for the internal build, about doubling the impact that ICF currently has. Not sure about chromium_framework yet, as my implementation is currently crashing.

OTOH, we might want CompactUnwindEntries to eventually use a special-cased subclass of InputSection for performance reasons, and that subclass could just ignore the functionAddress pointer when doing hashing / equality comparison. So deduping code with embedded addends is not a strict prerequisite. However, it does make the initial implementation of unwind ICF simpler, and moreover the code complexity here is largely self-contained -- unlike adding a new subclass, we don't have to add extra handling code throughout the linker.

Looks pretty good to me, other than the comment about the casting.

llvm/lib/Support/MemoryBufferRef.cpp
21	Is there any way to prevent casting if this was called on a non-writeable buffer? I was thinking of some flag based approach that is set for writeable buffer and have that be determine whether casting can be done. That way, this can be a bit safer than having to let the developer have context about whether this is safe or not.

Any idea what ld64 does in this case? Might be worth checking (should be testable, I think - give it a really big input section that it needs to canonicalize/fold, and see if that produces more total memory usage than if the section is small?) - and if it doesn't have the same negative effect on memory usage, maybe that's a hint that there's some other lower-(memory)-cost solution to the problem?

In D106213#2905606, @dblaikie wrote:

Any idea what ld64 does in this case? Might be worth checking (should be testable, I think - give it a really big input section that it needs to canonicalize/fold, and see if that produces more total memory usage than if the section is small?) - and if it doesn't have the same negative effect on memory usage, maybe that's a hint that there's some other lower-(memory)-cost solution to the problem?

I believe ld64 takes a different approach: it treats the fixups as-if-zero when comparing/hashing content, without sacrificing the mmap optimization.

I am not able to get ld64 to dedup even the simplest of functions, let alone more complicated cases like these. Or perhaps I'm missing something in how ld64 is supposed to be invoked... this is what I've tried:

(base) ~/tmp: cat test.s
.text
.weak_definition _foo
.weak_definition _bar
.weak_definition _baz

_foo:
  ret

_bar:
  ret

_baz:
  ret

.subsections_via_symbols

(base) ~/tmp: llvm-mc -filetype=obj -triple=x86_64-apple-darwin19.0.0 test.s > test.o
(base) ~/tmp: ld test.o -o test -dylib
(base) ~/tmp: llvm-objdump -d test

test:	file format mach-o 64-bit x86-64

Disassembly of section __TEXT,__text:

0000000000003fb5 <_foo>:
    3fb5: c3                           	retq

0000000000003fb6 <_bar>:
    3fb6: c3                           	retq

0000000000003fb7 <_baz>:
    3fb7: c3                           	retq

In D106213#2889390, @int3 wrote:

Right now, pretty small. (0.3% for the internal build I'm testing, and 0.002% for chromium_framework.) But I'm building upon this to apply ICF to functions that have unwind info, since CompactUnwindEntries use unsigned section relocations -- which embed their addends -- to refer to functions. This has a much larger impact -- 3.2% for the internal build, about doubling the impact that ICF currently has. Not sure about chromium_framework yet, as my implementation is currently crashing.

How much size win do you need to catch up to ld64?

(It's probably clear that I'm asking because this seems like a somewhat ugly change conceptually, and I'm wondering if it's worth it.)

In D106213#2905976, @int3 wrote:

I am not able to get ld64 to dedup even the simplest of functions, let alone more complicated cases like these.

It's not safe to dedup symbols in general, since it's not always known if their addresses are compared.

Mach-O can be marked "autohide" or .weak_def_can_be_hidden, something LLVM sets on symbols when GlobalValue::canBeOmittedFromSymbolTable (see https://github.com/llvm/llvm-project/blob/main/llvm/lib/IR/Globals.cpp#L311 or grep for MCSA_WeakDefAutoPrivate) -- that is, when a symbol is linkonce_odr and unnamed_addr. GlobalVariables that are also const can just be local_unnamed_addr. For C++ code, this should be true for most inlines and templates.

ld64 reuses this autohide flag to trigger dedup only when it's safe.

int3 planned changes to this revision.Aug 26 2021, 12:16 PM

int3 mentioned this in D109946: [lld-macho] Teach ICF to dedup functions with identical unwind info.Sep 16 2021, 8:38 PM

ormris removed a subscriber: ormris.Jan 24 2022, 11:44 AM

int3 abandoned this revision.Mar 16 2022, 2:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 16 2022, 2:33 PM

Revision Contents

Path

Size

lld/

MachO/

Arch/

4 lines

2 lines

4 lines

5 lines

6 lines

32 lines

6 lines

18 lines

45 lines

2 lines

15 lines

2 lines

4 lines

2 lines

llvm/

include/

llvm/

LTO/

Caching.h

4 lines

Support/

MemoryBuffer.h

11 lines

MemoryBufferRef.h

20 lines

lib/

LTO/

Caching.cpp

17 lines

Object/

Archive.cpp

6 lines

Support/

MemoryBuffer.cpp

18 lines

MemoryBufferRef.cpp

10 lines

Diff 359537

lld/MachO/Arch/ARM.cpp

	Show All 21 Lines
	using namespace lld;			using namespace lld;
	using namespace lld::macho;			using namespace lld::macho;

	namespace {			namespace {

	struct ARM : TargetInfo {			struct ARM : TargetInfo {
	ARM(uint32_t cpuSubtype);			ARM(uint32_t cpuSubtype);

	int64_t getEmbeddedAddend(MemoryBufferRef, uint64_t offset,			int64_t getEmbeddedAddend(WritableMemoryBufferRef, uint64_t offset,
	const relocation_info) const override;			const relocation_info) const override;
	void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,			void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
	uint64_t pc) const override;			uint64_t pc) const override;

	void writeStub(uint8_t *buf, const Symbol &) const override;			void writeStub(uint8_t *buf, const Symbol &) const override;
	void writeStubHelperHeader(uint8_t *buf) const override;			void writeStubHelperHeader(uint8_t *buf) const override;
	void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,			void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,
	uint64_t entryAddr) const override;			uint64_t entryAddr) const override;
	Show All 21 Lines
	#undef B			#undef B
	}};			}};
	assert(type < relocAttrsArray.size() && "invalid relocation type");			assert(type < relocAttrsArray.size() && "invalid relocation type");
	if (type >= relocAttrsArray.size())			if (type >= relocAttrsArray.size())
	return invalidRelocAttrs;			return invalidRelocAttrs;
	return relocAttrsArray[type];			return relocAttrsArray[type];
	}			}

	int64_t ARM::getEmbeddedAddend(MemoryBufferRef mb, uint64_t offset,			int64_t ARM::getEmbeddedAddend(WritableMemoryBufferRef mb, uint64_t offset,
	relocation_info rel) const {			relocation_info rel) const {
	// FIXME: implement this			// FIXME: implement this
	return 0;			return 0;
	}			}

	template <int N> using BitfieldFlag = Bitfield::Element<bool, N, 1>;			template <int N> using BitfieldFlag = Bitfield::Element<bool, N, 1>;

	// ARM BL encoding:			// ARM BL encoding:
	▲ Show 20 Lines • Show All 96 Lines • Show Last 20 Lines

lld/MachO/Arch/ARM64Common.h

	Show All 16 Lines
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	struct ARM64Common : TargetInfo {			struct ARM64Common : TargetInfo {
	template <class LP> ARM64Common(LP lp) : TargetInfo(lp) {}			template <class LP> ARM64Common(LP lp) : TargetInfo(lp) {}

	int64_t getEmbeddedAddend(MemoryBufferRef, uint64_t offset,			int64_t getEmbeddedAddend(llvm::WritableMemoryBufferRef, uint64_t offset,
	const llvm::MachO::relocation_info) const override;			const llvm::MachO::relocation_info) const override;
	void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,			void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
	uint64_t pc) const override;			uint64_t pc) const override;

	void relaxGotLoad(uint8_t *loc, uint8_t type) const override;			void relaxGotLoad(uint8_t *loc, uint8_t type) const override;
	uint64_t getPageSize() const override { return 16 * 1024; }			uint64_t getPageSize() const override { return 16 * 1024; }
	};			};

	▲ Show 20 Lines • Show All 111 Lines • Show Last 20 Lines

lld/MachO/Arch/ARM64Common.cpp

	//===- ARM64Common.cpp ----------------------------------------------------===//			//===- ARM64Common.cpp ----------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Arch/ARM64Common.h"			#include "Arch/ARM64Common.h"

	#include "lld/Common/ErrorHandler.h"			#include "lld/Common/ErrorHandler.h"
	#include "llvm/Support/Endian.h"			#include "llvm/Support/Endian.h"

				using namespace llvm;
	using namespace llvm::MachO;			using namespace llvm::MachO;
	using namespace llvm::support::endian;			using namespace llvm::support::endian;
	using namespace lld;			using namespace lld;
	using namespace lld::macho;			using namespace lld::macho;

	int64_t ARM64Common::getEmbeddedAddend(MemoryBufferRef mb, uint64_t offset,			int64_t ARM64Common::getEmbeddedAddend(WritableMemoryBufferRef mb,
				uint64_t offset,
	const relocation_info rel) const {			const relocation_info rel) const {
	if (rel.r_type != ARM64_RELOC_UNSIGNED &&			if (rel.r_type != ARM64_RELOC_UNSIGNED &&
	rel.r_type != ARM64_RELOC_SUBTRACTOR) {			rel.r_type != ARM64_RELOC_SUBTRACTOR) {
	// All other reloc types should use the ADDEND relocation to store their			// All other reloc types should use the ADDEND relocation to store their
	// addends.			// addends.
	// TODO(gkm): extract embedded addend just so we can assert that it is 0			// TODO(gkm): extract embedded addend just so we can assert that it is 0
	return 0;			return 0;
	}			}
	▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

lld/MachO/Arch/X86_64.cpp

Show All 9 Lines
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"

#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"

		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace llvm::support::endian;		using namespace llvm::support::endian;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

namespace {		namespace {

struct X86_64 : TargetInfo {		struct X86_64 : TargetInfo {
X86_64();		X86_64();

int64_t getEmbeddedAddend(MemoryBufferRef, uint64_t offset,		int64_t getEmbeddedAddend(WritableMemoryBufferRef, uint64_t offset,
const relocation_info) const override;		const relocation_info) const override;
void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,		void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
uint64_t relocVA) const override;		uint64_t relocVA) const override;

void writeStub(uint8_t *buf, const Symbol &) const override;		void writeStub(uint8_t *buf, const Symbol &) const override;
void writeStubHelperHeader(uint8_t *buf) const override;		void writeStubHelperHeader(uint8_t *buf) const override;
void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,		void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,
uint64_t entryAddr) const override;		uint64_t entryAddr) const override;
Show All 35 Lines	case X86_64_RELOC_SIGNED_2:
return 2;		return 2;
case X86_64_RELOC_SIGNED_4:		case X86_64_RELOC_SIGNED_4:
return 4;		return 4;
default:		default:
return 0;		return 0;
}		}
}		}

int64_t X86_64::getEmbeddedAddend(MemoryBufferRef mb, uint64_t offset,		int64_t X86_64::getEmbeddedAddend(WritableMemoryBufferRef mb, uint64_t offset,
relocation_info rel) const {		relocation_info rel) const {
auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());		auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());
const uint8_t *loc = buf + offset + rel.r_address;		const uint8_t *loc = buf + offset + rel.r_address;

switch (rel.r_length) {		switch (rel.r_length) {
case 2:		case 2:
return static_cast<int32_t>(read32le(loc)) + pcrelOffset(rel.r_type);		return static_cast<int32_t>(read32le(loc)) + pcrelOffset(rel.r_type);
case 3:		case 3:
▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

lld/MachO/Driver.h

	Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines

	void parseLCLinkerOption(InputFile *, unsigned argc, StringRef data);			void parseLCLinkerOption(InputFile *, unsigned argc, StringRef data);

	std::string createResponseFile(const llvm::opt::InputArgList &args);			std::string createResponseFile(const llvm::opt::InputArgList &args);

	// Check for both libfoo.dylib and libfoo.tbd (in that order).			// Check for both libfoo.dylib and libfoo.tbd (in that order).
	llvm::Optional<std::string> resolveDylibPath(llvm::StringRef path);			llvm::Optional<std::string> resolveDylibPath(llvm::StringRef path);

	DylibFile loadDylib(llvm::MemoryBufferRef mbref, DylibFile umbrella = nullptr,			DylibFile *loadDylib(llvm::WritableMemoryBufferRef mbref,
				DylibFile *umbrella = nullptr,
	bool isBundleLoader = false);			bool isBundleLoader = false);

	// Search for all possible combinations of `{root}/{name}.{extension}`.			// Search for all possible combinations of `{root}/{name}.{extension}`.
	// If \p extensions are not specified, then just search for `{root}/{name}`.			// If \p extensions are not specified, then just search for `{root}/{name}`.
	llvm::Optional<llvm::StringRef>			llvm::Optional<llvm::StringRef>
	findPathCombination(const llvm::Twine &name,			findPathCombination(const llvm::Twine &name,
	const std::vector<llvm::StringRef> &roots,			const std::vector<llvm::StringRef> &roots,
	ArrayRef<llvm::StringRef> extensions = {""});			ArrayRef<llvm::StringRef> extensions = {""});

	// If -syslibroot is specified, absolute paths to non-object files may be			// If -syslibroot is specified, absolute paths to non-object files may be
	// rerooted.			// rerooted.
	llvm::StringRef rerootPath(llvm::StringRef path);			llvm::StringRef rerootPath(llvm::StringRef path);

	llvm::Optional<InputFile *> loadArchiveMember(MemoryBufferRef, uint32_t modTime,			llvm::Optional<InputFile *> loadArchiveMember(llvm::WritableMemoryBufferRef,
				uint32_t modTime,
	StringRef archiveName,			StringRef archiveName,
	bool objCOnly);			bool objCOnly);

	uint32_t getModTime(llvm::StringRef path);			uint32_t getModTime(llvm::StringRef path);

	void printArchiveMemberLoad(StringRef reason, const InputFile *);			void printArchiveMemberLoad(StringRef reason, const InputFile *);

	// Map simulator platforms to their underlying device platform.			// Map simulator platforms to their underlying device platform.
	▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

lld/MachO/Driver.cpp

Show First 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	case OPT_max_relative_cache_size_lto:
break;		break;
}		}
}		}
return CHECK(parseCachePruningPolicy(ltoPolicy), "invalid LTO cache policy");		return CHECK(parseCachePruningPolicy(ltoPolicy), "invalid LTO cache policy");
}		}

namespace {		namespace {
struct ArchiveMember {		struct ArchiveMember {
MemoryBufferRef mbref;		WritableMemoryBufferRef mbref;
uint32_t modTime;		uint32_t modTime;
};		};
} // namespace		} // namespace

// Returns slices of MB by parsing MB as an archive file.		// Returns slices of MB by parsing MB as an archive file.
// Each slice consists of a member file in the archive.		// Each slice consists of a member file in the archive.
static std::vector<ArchiveMember> getArchiveMembers(MemoryBufferRef mb) {		static std::vector<ArchiveMember>
		getArchiveMembers(WritableMemoryBufferRef mb) {
std::unique_ptr<Archive> file =		std::unique_ptr<Archive> file =
CHECK(Archive::create(mb),		CHECK(Archive::create(mb),
mb.getBufferIdentifier() + ": failed to parse archive");		mb.getBufferIdentifier() + ": failed to parse archive");
Archive *archive = file.get();		Archive *archive = file.get();
make<std::unique_ptr<Archive>>(std::move(file)); // take ownership		make<std::unique_ptr<Archive>>(std::move(file)); // take ownership

std::vector<ArchiveMember> v;		std::vector<ArchiveMember> v;
Error err = Error::success();		Error err = Error::success();

// Thin archives refer to .o files, so --reproduce needs the .o files too.		// Thin archives refer to .o files, so --reproduce needs the .o files too.
bool addToTar = archive->isThin() && tar;		bool addToTar = archive->isThin() && tar;

for (const Archive::Child &c : archive->children(err)) {		for (const Archive::Child &c : archive->children(err)) {
MemoryBufferRef mbref =		WritableMemoryBufferRef mbref =
CHECK(c.getMemoryBufferRef(),		CHECK(c.getMemoryBufferRef(),
mb.getBufferIdentifier() +		mb.getBufferIdentifier() +
": could not get the buffer for a child of the archive");		": could not get the buffer for a child of the archive")
		.castToWritable();
if (addToTar)		if (addToTar)
tar->append(relativeToRoot(check(c.getFullName())), mbref.getBuffer());		tar->append(relativeToRoot(check(c.getFullName())), mbref.getBuffer());
uint32_t modTime = toTimeT(		uint32_t modTime = toTimeT(
CHECK(c.getLastModified(), mb.getBufferIdentifier() +		CHECK(c.getLastModified(), mb.getBufferIdentifier() +
": could not get the modification "		": could not get the modification "
"time for a child of the archive"));		"time for a child of the archive"));
v.push_back({mbref, modTime});		v.push_back({mbref, modTime});
}		}
if (err)		if (err)
fatal(mb.getBufferIdentifier() +		fatal(mb.getBufferIdentifier() +
": Archive::children failed: " + toString(std::move(err)));		": Archive::children failed: " + toString(std::move(err)));

return v;		return v;
}		}

static DenseMap<StringRef, ArchiveFile *> loadedArchives;		static DenseMap<StringRef, ArchiveFile *> loadedArchives;

static InputFile *addFile(StringRef path, bool forceLoadArchive,		static InputFile *addFile(StringRef path, bool forceLoadArchive,
bool isExplicit = true, bool isBundleLoader = false) {		bool isExplicit = true, bool isBundleLoader = false) {
Optional<MemoryBufferRef> buffer = readFile(path);		Optional<WritableMemoryBufferRef> buffer = readFile(path);
if (!buffer)		if (!buffer)
return nullptr;		return nullptr;
MemoryBufferRef mbref = *buffer;		WritableMemoryBufferRef mbref = *buffer;
InputFile *newFile = nullptr;		InputFile *newFile = nullptr;

file_magic magic = identify_magic(mbref.getBuffer());		file_magic magic = identify_magic(mbref.getBuffer());
switch (magic) {		switch (magic) {
case file_magic::archive: {		case file_magic::archive: {
// Avoid loading archives twice. If the archives are being force-loaded,		// Avoid loading archives twice. If the archives are being force-loaded,
// loading them twice would create duplicate symbol errors. In the		// loading them twice would create duplicate symbol errors. In the
// non-force-loading case, this is just a minor performance optimization.		// non-force-loading case, this is just a minor performance optimization.
// We don't take a reference to cachedFile here because the		// We don't take a reference to cachedFile here because the
// loadArchiveMember() call below may recursively call addFile() and		// loadArchiveMember() call below may recursively call addFile() and
// invalidate this reference.		// invalidate this reference.
if (ArchiveFile *cachedFile = loadedArchives[path])		if (ArchiveFile *cachedFile = loadedArchives[path])
return cachedFile;		return cachedFile;

std::unique_ptr<object::Archive> file = CHECK(		std::unique_ptr<object::Archive> file = CHECK(
object::Archive::create(mbref), path + ": failed to parse archive");		object::Archive::create(mbref), path + ": failed to parse archive");

if (!file->isEmpty() && !file->hasSymbolTable())		if (!file->isEmpty() && !file->hasSymbolTable())
error(path + ": archive has no index; run ranlib to add one");		error(path + ": archive has no index; run ranlib to add one");

if (config->allLoad \|\| forceLoadArchive) {		if (config->allLoad \|\| forceLoadArchive) {
if (Optional<MemoryBufferRef> buffer = readFile(path)) {		if (Optional<WritableMemoryBufferRef> buffer = readFile(path)) {
for (const ArchiveMember &member : getArchiveMembers(*buffer)) {		for (const ArchiveMember &member : getArchiveMembers(*buffer)) {
if (Optional<InputFile *> file = loadArchiveMember(		if (Optional<InputFile *> file = loadArchiveMember(
member.mbref, member.modTime, path, /objCOnly=/false)) {		member.mbref, member.modTime, path, /objCOnly=/false)) {
inputFiles.insert(*file);		inputFiles.insert(*file);
printArchiveMemberLoad(		printArchiveMemberLoad(
(forceLoadArchive ? "-force_load" : "-all_load"),		(forceLoadArchive ? "-force_load" : "-all_load"),
inputFiles.back());		inputFiles.back());
}		}
}		}
}		}
} else if (config->forceLoadObjC) {		} else if (config->forceLoadObjC) {
for (const object::Archive::Symbol &sym : file->symbols())		for (const object::Archive::Symbol &sym : file->symbols())
if (sym.getName().startswith(objc::klass))		if (sym.getName().startswith(objc::klass))
symtab->addUndefined(sym.getName(), /file=/nullptr,		symtab->addUndefined(sym.getName(), /file=/nullptr,
/isWeakRef=/false);		/isWeakRef=/false);

// TODO: no need to look for ObjC sections for a given archive member if		// TODO: no need to look for ObjC sections for a given archive member if
// we already found that it contains an ObjC symbol. We should also		// we already found that it contains an ObjC symbol. We should also
// consider creating a LazyObjFile class in order to avoid double-loading		// consider creating a LazyObjFile class in order to avoid double-loading
// these files here and below (as part of the ArchiveFile).		// these files here and below (as part of the ArchiveFile).
if (Optional<MemoryBufferRef> buffer = readFile(path)) {		if (Optional<WritableMemoryBufferRef> buffer = readFile(path)) {
for (const ArchiveMember &member : getArchiveMembers(*buffer)) {		for (const ArchiveMember &member : getArchiveMembers(*buffer)) {
if (Optional<InputFile *> file = loadArchiveMember(		if (Optional<InputFile *> file = loadArchiveMember(
member.mbref, member.modTime, path, /objCOnly=/true)) {		member.mbref, member.modTime, path, /objCOnly=/true)) {
inputFiles.insert(*file);		inputFiles.insert(*file);
printArchiveMemberLoad("-ObjC", inputFiles.back());		printArchiveMemberLoad("-ObjC", inputFiles.back());
}		}
}		}
}		}
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines	case OPT_framework:
break;		break;
default:		default:
error(arg->getSpelling() + " is not allowed in LC_LINKER_OPTION");		error(arg->getSpelling() + " is not allowed in LC_LINKER_OPTION");
}		}
}		}
}		}

static void addFileList(StringRef path) {		static void addFileList(StringRef path) {
Optional<MemoryBufferRef> buffer = readFile(path);		Optional<WritableMemoryBufferRef> buffer = readFile(path);
if (!buffer)		if (!buffer)
return;		return;
MemoryBufferRef mbref = *buffer;		WritableMemoryBufferRef mbref = *buffer;
for (StringRef path : args::getLines(mbref))		for (StringRef path : args::getLines(mbref))
addFile(rerootPath(path), /forceLoadArchive=/false);		addFile(rerootPath(path), /forceLoadArchive=/false);
}		}

// An order file has one entry per line, in the following format:		// An order file has one entry per line, in the following format:
//		//
// <cpu>:<object file>:<symbol name>		// <cpu>:<object file>:<symbol name>
//		//
// <cpu> and <object file> are optional. If not specified, then that entry		// <cpu> and <object file> are optional. If not specified, then that entry
// matches any symbol of that name. Parsing this format is not quite		// matches any symbol of that name. Parsing this format is not quite
// straightforward because the symbol name itself can contain colons, so when		// straightforward because the symbol name itself can contain colons, so when
// encountering a colon, we consider the preceding characters to decide if it		// encountering a colon, we consider the preceding characters to decide if it
// can be a valid CPU type or file path.		// can be a valid CPU type or file path.
//		//
// If a symbol is matched by multiple entries, then it takes the lowest-ordered		// If a symbol is matched by multiple entries, then it takes the lowest-ordered
// entry (the one nearest to the front of the list.)		// entry (the one nearest to the front of the list.)
//		//
// The file can also have line comments that start with '#'.		// The file can also have line comments that start with '#'.
static void parseOrderFile(StringRef path) {		static void parseOrderFile(StringRef path) {
Optional<MemoryBufferRef> buffer = readFile(path);		Optional<WritableMemoryBufferRef> buffer = readFile(path);
if (!buffer) {		if (!buffer) {
error("Could not read order file at " + path);		error("Could not read order file at " + path);
return;		return;
}		}

MemoryBufferRef mbref = *buffer;		WritableMemoryBufferRef mbref = *buffer;
size_t priority = std::numeric_limits<size_t>::max();		size_t priority = std::numeric_limits<size_t>::max();
for (StringRef line : args::getLines(mbref)) {		for (StringRef line : args::getLines(mbref)) {
StringRef objectFile, symbol;		StringRef objectFile, symbol;
line = line.take_until([](char c) { return c == '#'; }); // ignore comments		line = line.take_until([](char c) { return c == '#'; }); // ignore comments
line = line.ltrim();		line = line.ltrim();

CPUType cpuType = StringSwitch<CPUType>(line)		CPUType cpuType = StringSwitch<CPUType>(line)
.StartsWith("i386:", CPU_TYPE_I386)		.StartsWith("i386:", CPU_TYPE_I386)
▲ Show 20 Lines • Show All 472 Lines • ▼ Show 20 Lines
static void handleSymbolPatterns(InputArgList &args,		static void handleSymbolPatterns(InputArgList &args,
SymbolPatterns &symbolPatterns,		SymbolPatterns &symbolPatterns,
unsigned singleOptionCode,		unsigned singleOptionCode,
unsigned listFileOptionCode) {		unsigned listFileOptionCode) {
for (const Arg *arg : args.filtered(singleOptionCode))		for (const Arg *arg : args.filtered(singleOptionCode))
symbolPatterns.insert(arg->getValue());		symbolPatterns.insert(arg->getValue());
for (const Arg *arg : args.filtered(listFileOptionCode)) {		for (const Arg *arg : args.filtered(listFileOptionCode)) {
StringRef path = arg->getValue();		StringRef path = arg->getValue();
Optional<MemoryBufferRef> buffer = readFile(path);		Optional<WritableMemoryBufferRef> buffer = readFile(path);
if (!buffer) {		if (!buffer) {
error("Could not read symbol file: " + path);		error("Could not read symbol file: " + path);
continue;		continue;
}		}
MemoryBufferRef mbref = *buffer;		WritableMemoryBufferRef mbref = *buffer;
for (StringRef line : args::getLines(mbref)) {		for (StringRef line : args::getLines(mbref)) {
line = line.take_until([](char c) { return c == '#'; }).trim();		line = line.take_until([](char c) { return c == '#'; }).trim();
if (!line.empty())		if (!line.empty())
symbolPatterns.insert(line);		symbolPatterns.insert(line);
}		}
}		}
}		}

▲ Show 20 Lines • Show All 485 Lines • ▼ Show 20 Lines	if (!config->exportedSymbols.empty()) {
if (config->unexportedSymbols.match(defined->getName()))		if (config->unexportedSymbols.match(defined->getName()))
defined->privateExtern = true;		defined->privateExtern = true;
}		}

for (const Arg *arg : args.filtered(OPT_sectcreate)) {		for (const Arg *arg : args.filtered(OPT_sectcreate)) {
StringRef segName = arg->getValue(0);		StringRef segName = arg->getValue(0);
StringRef sectName = arg->getValue(1);		StringRef sectName = arg->getValue(1);
StringRef fileName = arg->getValue(2);		StringRef fileName = arg->getValue(2);
Optional<MemoryBufferRef> buffer = readFile(fileName);		Optional<WritableMemoryBufferRef> buffer = readFile(fileName);
if (buffer)		if (buffer)
inputFiles.insert(make<OpaqueFile>(*buffer, segName, sectName));		inputFiles.insert(make<OpaqueFile>(*buffer, segName, sectName));
}		}

gatherInputSections();		gatherInputSections();

if (config->deadStrip)		if (config->deadStrip)
markLive();		markLive();
Show All 32 Lines

lld/MachO/DriverUtils.cpp

Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	case OPT_reproduce:
break;		break;
case OPT_INPUT:		case OPT_INPUT:
os << quote(rewriteInputPath(arg->getValue())) << "\n";		os << quote(rewriteInputPath(arg->getValue())) << "\n";
break;		break;
case OPT_o:		case OPT_o:
os << "-o " << quote(path::filename(arg->getValue())) << "\n";		os << "-o " << quote(path::filename(arg->getValue())) << "\n";
break;		break;
case OPT_filelist:		case OPT_filelist:
if (Optional<MemoryBufferRef> buffer = readFile(arg->getValue()))		if (Optional<WritableMemoryBufferRef> buffer = readFile(arg->getValue()))
for (StringRef path : args::getLines(*buffer))		for (StringRef path : args::getLines(*buffer))
os << quote(rewriteInputPath(path)) << "\n";		os << quote(rewriteInputPath(path)) << "\n";
break;		break;
case OPT_force_load:		case OPT_force_load:
case OPT_weak_library:		case OPT_weak_library:
os << arg->getSpelling() << " "		os << arg->getSpelling() << " "
<< quote(rewriteInputPath(arg->getValue())) << "\n";		<< quote(rewriteInputPath(arg->getValue())) << "\n";
break;		break;
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	if (tbdExists)
return std::string(tbdPath);		return std::string(tbdPath);
return {};		return {};
}		}

// It's not uncommon to have multiple attempts to load a single dylib,		// It's not uncommon to have multiple attempts to load a single dylib,
// especially if it's a commonly re-exported core library.		// especially if it's a commonly re-exported core library.
static DenseMap<CachedHashStringRef, DylibFile *> loadedDylibs;		static DenseMap<CachedHashStringRef, DylibFile *> loadedDylibs;

DylibFile macho::loadDylib(MemoryBufferRef mbref, DylibFile umbrella,		DylibFile macho::loadDylib(WritableMemoryBufferRef mbref, DylibFile umbrella,
bool isBundleLoader) {		bool isBundleLoader) {
CachedHashStringRef path(mbref.getBufferIdentifier());		CachedHashStringRef path(mbref.getBufferIdentifier());
DylibFile *&file = loadedDylibs[path];		DylibFile *&file = loadedDylibs[path];
if (file)		if (file)
return file;		return file;

DylibFile *newFile;		DylibFile *newFile;
file_magic magic = identify_magic(mbref.getBuffer());		file_magic magic = identify_magic(mbref.getBuffer());
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	StringRef macho::rerootPath(StringRef path) {

if (Optional<StringRef> rerootedPath =		if (Optional<StringRef> rerootedPath =
findPathCombination(path, config->systemLibraryRoots))		findPathCombination(path, config->systemLibraryRoots))
return *rerootedPath;		return *rerootedPath;

return path;		return path;
}		}

Optional<InputFile *> macho::loadArchiveMember(MemoryBufferRef mb,		Optional<InputFile *> macho::loadArchiveMember(WritableMemoryBufferRef mb,
uint32_t modTime,		uint32_t modTime,
StringRef archiveName,		StringRef archiveName,
bool objCOnly) {		bool objCOnly) {
if (config->zeroModTime)		if (config->zeroModTime)
modTime = 0;		modTime = 0;

switch (identify_magic(mb.getBuffer())) {		switch (identify_magic(mb.getBuffer())) {
case file_magic::macho_object:		case file_magic::macho_object:
▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

lld/MachO/InputFiles.h

Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	enum Kind {
ArchiveKind,		ArchiveKind,
BitcodeKind,		BitcodeKind,
};		};

virtual ~InputFile() = default;		virtual ~InputFile() = default;
Kind kind() const { return fileKind; }		Kind kind() const { return fileKind; }
StringRef getName() const { return name; }		StringRef getName() const { return name; }

MemoryBufferRef mb;		llvm::WritableMemoryBufferRef mb;

std::vector<Symbol *> symbols;		std::vector<Symbol *> symbols;
std::vector<SubsectionMap> subsections;		std::vector<SubsectionMap> subsections;
// Provides an easy way to sort InputFiles deterministically.		// Provides an easy way to sort InputFiles deterministically.
const int id;		const int id;

// If not empty, this stores the name of the archive containing this file.		// If not empty, this stores the name of the archive containing this file.
// We use this string for creating error messages.		// We use this string for creating error messages.
std::string archiveName;		std::string archiveName;

protected:		protected:
InputFile(Kind kind, MemoryBufferRef mb)		InputFile(Kind kind, llvm::WritableMemoryBufferRef mb)
: mb(mb), id(idCount++), fileKind(kind), name(mb.getBufferIdentifier()) {}		: mb(mb), id(idCount++), fileKind(kind), name(mb.getBufferIdentifier()) {}

InputFile(Kind, const llvm::MachO::InterfaceFile &);		InputFile(Kind, const llvm::MachO::InterfaceFile &);

private:		private:
const Kind fileKind;		const Kind fileKind;
const StringRef name;		const StringRef name;

static int idCount;		static int idCount;
};		};

// .o file		// .o file
class ObjFile final : public InputFile {		class ObjFile final : public InputFile {
public:		public:
ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName);		ObjFile(llvm::WritableMemoryBufferRef mb, uint32_t modTime,
		StringRef archiveName);
static bool classof(const InputFile *f) { return f->kind() == ObjKind; }		static bool classof(const InputFile *f) { return f->kind() == ObjKind; }

llvm::DWARFUnit *compileUnit = nullptr;		llvm::DWARFUnit *compileUnit = nullptr;
const uint32_t modTime;		const uint32_t modTime;
std::vector<ConcatInputSection *> debugSections;		std::vector<ConcatInputSection *> debugSections;
ArrayRef<llvm::MachO::data_in_code_entry> dataInCodeEntries;		ArrayRef<llvm::MachO::data_in_code_entry> dataInCodeEntries;

private:		private:
Show All 10 Lines	void parseRelocations(ArrayRef<Section> sectionHeaders, const Section &,
SubsectionMap &);		SubsectionMap &);
void parseDebugInfo();		void parseDebugInfo();
void parseDataInCode();		void parseDataInCode();
};		};

// command-line -sectcreate file		// command-line -sectcreate file
class OpaqueFile final : public InputFile {		class OpaqueFile final : public InputFile {
public:		public:
OpaqueFile(MemoryBufferRef mb, StringRef segName, StringRef sectName);		OpaqueFile(llvm::WritableMemoryBufferRef mb, StringRef segName,
		StringRef sectName);
static bool classof(const InputFile *f) { return f->kind() == OpaqueKind; }		static bool classof(const InputFile *f) { return f->kind() == OpaqueKind; }
};		};

// .dylib or .tbd file		// .dylib or .tbd file
class DylibFile final : public InputFile {		class DylibFile final : public InputFile {
public:		public:
// Mach-O dylibs can re-export other dylibs as sub-libraries, meaning that the		// Mach-O dylibs can re-export other dylibs as sub-libraries, meaning that the
// symbols in those sub-libraries will be available under the umbrella		// symbols in those sub-libraries will be available under the umbrella
// library's namespace. Those sub-libraries can also have their own		// library's namespace. Those sub-libraries can also have their own
// re-exports. When loading a re-exported dylib, `umbrella` should be set to		// re-exports. When loading a re-exported dylib, `umbrella` should be set to
// the root dylib to ensure symbols in the child library are correctly bound		// the root dylib to ensure symbols in the child library are correctly bound
// to the root. On the other hand, if a dylib is being directly loaded		// to the root. On the other hand, if a dylib is being directly loaded
// (through an -lfoo flag), then `umbrella` should be a nullptr.		// (through an -lfoo flag), then `umbrella` should be a nullptr.
explicit DylibFile(MemoryBufferRef mb, DylibFile *umbrella,		explicit DylibFile(llvm::WritableMemoryBufferRef mb, DylibFile *umbrella,
bool isBundleLoader = false);		bool isBundleLoader = false);
explicit DylibFile(const llvm::MachO::InterfaceFile &interface,		explicit DylibFile(const llvm::MachO::InterfaceFile &interface,
DylibFile *umbrella = nullptr,		DylibFile *umbrella = nullptr,
bool isBundleLoader = false);		bool isBundleLoader = false);

void parseLoadCommands(MemoryBufferRef mb);		void parseLoadCommands(llvm::WritableMemoryBufferRef mb);
void parseReexports(const llvm::MachO::InterfaceFile &interface);		void parseReexports(const llvm::MachO::InterfaceFile &interface);

static bool classof(const InputFile *f) { return f->kind() == DylibKind; }		static bool classof(const InputFile *f) { return f->kind() == DylibKind; }

StringRef installName;		StringRef installName;
DylibFile *exportingFile = nullptr;		DylibFile *exportingFile = nullptr;
DylibFile *umbrella;		DylibFile *umbrella;
SmallVector<StringRef, 2> rpaths;		SmallVector<StringRef, 2> rpaths;
Show All 35 Lines	private:
std::unique_ptr<llvm::object::Archive> file;		std::unique_ptr<llvm::object::Archive> file;
// Keep track of children fetched from the archive by tracking		// Keep track of children fetched from the archive by tracking
// which address offsets have been fetched already.		// which address offsets have been fetched already.
llvm::DenseSet<uint64_t> seen;		llvm::DenseSet<uint64_t> seen;
};		};

class BitcodeFile final : public InputFile {		class BitcodeFile final : public InputFile {
public:		public:
explicit BitcodeFile(MemoryBufferRef mb);		explicit BitcodeFile(llvm::WritableMemoryBufferRef mb);
static bool classof(const InputFile *f) { return f->kind() == BitcodeKind; }		static bool classof(const InputFile *f) { return f->kind() == BitcodeKind; }

std::unique_ptr<llvm::lto::InputFile> obj;		std::unique_ptr<llvm::lto::InputFile> obj;
};		};

extern llvm::SetVector<InputFile *> inputFiles;		extern llvm::SetVector<InputFile *> inputFiles;

llvm::Optional<MemoryBufferRef> readFile(StringRef path);		llvm::Optional<llvm::WritableMemoryBufferRef> readFile(StringRef path);

namespace detail {		namespace detail {

template <class CommandType, class... Types>		template <class CommandType, class... Types>
std::vector<const CommandType *>		std::vector<const CommandType *>
findCommands(const void *anyHdr, size_t maxCommands, Types... types) {		findCommands(const void *anyHdr, size_t maxCommands, Types... types) {
std::vector<const CommandType *> cmds;		std::vector<const CommandType *> cmds;
std::initializer_list<uint32_t> typesList{types...};		std::initializer_list<uint32_t> typesList{types...};
Show All 37 Lines

lld/MachO/InputFiles.cpp

Show First 20 Lines • Show All 168 Lines • ▼ Show 20 Lines	if (it->minimum > config->platformInfo.minimum)
warn(toString(input) + " has version " + it->minimum.getAsString() +		warn(toString(input) + " has version " + it->minimum.getAsString() +
", which is newer than target minimum of " +		", which is newer than target minimum of " +
config->platformInfo.minimum.getAsString());		config->platformInfo.minimum.getAsString());

return true;		return true;
}		}

// Open a given file path and return it as a memory-mapped file.		// Open a given file path and return it as a memory-mapped file.
Optional<MemoryBufferRef> macho::readFile(StringRef path) {		Optional<WritableMemoryBufferRef> macho::readFile(StringRef path) {
ErrorOr<std::unique_ptr<MemoryBuffer>> mbOrErr = MemoryBuffer::getFile(path);		ErrorOr<std::unique_ptr<WritableMemoryBuffer>> mbOrErr =
		WritableMemoryBuffer::getFile(path);
if (std::error_code ec = mbOrErr.getError()) {		if (std::error_code ec = mbOrErr.getError()) {
error("cannot open " + path + ": " + ec.message());		error("cannot open " + path + ": " + ec.message());
return None;		return None;
}		}

std::unique_ptr<MemoryBuffer> &mb = *mbOrErr;		std::unique_ptr<WritableMemoryBuffer> &mb = *mbOrErr;
MemoryBufferRef mbref = mb->getMemBufferRef();		WritableMemoryBufferRef mbref = mb->getMemBufferRef().castToWritable();
make<std::unique_ptr<MemoryBuffer>>(std::move(mb)); // take mb ownership		make<std::unique_ptr<MemoryBuffer>>(std::move(mb)); // take mb ownership

// If this is a regular non-fat file, return it.		// If this is a regular non-fat file, return it.
const char *buf = mbref.getBufferStart();		char *buf = mbref.getBufferStart();
const auto hdr = reinterpret_cast<const fat_header >(buf);		const auto hdr = reinterpret_cast<const fat_header >(buf);
if (mbref.getBufferSize() < sizeof(uint32_t) \|\|		if (mbref.getBufferSize() < sizeof(uint32_t) \|\|
read32be(&hdr->magic) != FAT_MAGIC) {		read32be(&hdr->magic) != FAT_MAGIC) {
if (tar)		if (tar)
tar->append(relativeToRoot(path), mbref.getBuffer());		tar->append(relativeToRoot(path), mbref.getBuffer());
return mbref;		return mbref;
}		}

// Object files and archive files may be fat files, which contain multiple		// Object files and archive files may be fat files, which contain multiple
// real files for different CPU ISAs. Here, we search for a file that matches		// real files for different CPU ISAs. Here, we search for a file that matches
// with the current link target and returns it as a MemoryBufferRef.		// with the current link target and returns it as a WritableMemoryBufferRef.
const auto arch = reinterpret_cast<const fat_arch >(buf + sizeof(*hdr));		const auto arch = reinterpret_cast<const fat_arch >(buf + sizeof(*hdr));

for (uint32_t i = 0, n = read32be(&hdr->nfat_arch); i < n; ++i) {		for (uint32_t i = 0, n = read32be(&hdr->nfat_arch); i < n; ++i) {
if (reinterpret_cast<const char *>(arch + i + 1) >		if (reinterpret_cast<const char *>(arch + i + 1) >
buf + mbref.getBufferSize()) {		buf + mbref.getBufferSize()) {
error(path + ": fat_arch struct extends beyond end of file");		error(path + ": fat_arch struct extends beyond end of file");
return None;		return None;
}		}

if (read32be(&arch[i].cputype) != static_cast<uint32_t>(target->cpuType) \|\|		if (read32be(&arch[i].cputype) != static_cast<uint32_t>(target->cpuType) \|\|
read32be(&arch[i].cpusubtype) != target->cpuSubtype)		read32be(&arch[i].cpusubtype) != target->cpuSubtype)
continue;		continue;

uint32_t offset = read32be(&arch[i].offset);		uint32_t offset = read32be(&arch[i].offset);
uint32_t size = read32be(&arch[i].size);		uint32_t size = read32be(&arch[i].size);
if (offset + size > mbref.getBufferSize())		if (offset + size > mbref.getBufferSize())
error(path + ": slice extends beyond end of file");		error(path + ": slice extends beyond end of file");
if (tar)		if (tar)
tar->append(relativeToRoot(path), mbref.getBuffer());		tar->append(relativeToRoot(path), mbref.getBuffer());
return MemoryBufferRef(StringRef(buf + offset, size), path.copy(bAlloc));		return WritableMemoryBufferRef(MutableArrayRef<char>(buf + offset, size),
		path.copy(bAlloc));
}		}

error("unable to find matching architecture in " + path);		error("unable to find matching architecture in " + path);
return None;		return None;
}		}

InputFile::InputFile(Kind kind, const InterfaceFile &interface)		InputFile::InputFile(Kind kind, const InterfaceFile &interface)
: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}		: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}
▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	error(message("has width " + std::to_string(1 << rel.r_length) +
" bytes"));		" bytes"));
}		}
return valid;		return valid;
}		}

template <class Section>		template <class Section>
void ObjFile::parseRelocations(ArrayRef<Section> sectionHeaders,		void ObjFile::parseRelocations(ArrayRef<Section> sectionHeaders,
const Section &sec, SubsectionMap &subsecMap) {		const Section &sec, SubsectionMap &subsecMap) {
auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());		auto buf = reinterpret_cast<uint8_t >(mb.getBufferStart());
ArrayRef<relocation_info> relInfos(		ArrayRef<relocation_info> relInfos(
reinterpret_cast<const relocation_info *>(buf + sec.reloff), sec.nreloc);		reinterpret_cast<const relocation_info *>(buf + sec.reloff), sec.nreloc);

auto subsecIt = subsecMap.rbegin();		auto subsecIt = subsecMap.rbegin();
for (size_t i = 0; i < relInfos.size(); i++) {		for (size_t i = 0; i < relInfos.size(); i++) {
// Paired relocations serve as Mach-O's method for attaching a		// Paired relocations serve as Mach-O's method for attaching a
// supplemental datum to a primary relocation record. ELF does not		// supplemental datum to a primary relocation record. ELF does not
// need them because the *_RELOC_RELA records contain the extra		// need them because the *_RELOC_RELA records contain the extra
Show All 31 Lines	for (size_t i = 0; i < relInfos.size(); i++) {
}		}
assert(i < relInfos.size());		assert(i < relInfos.size());
if (!validateRelocationInfo(this, sec, relInfo))		if (!validateRelocationInfo(this, sec, relInfo))
continue;		continue;

bool isSubtrahend =		bool isSubtrahend =
target->hasAttr(relInfo.r_type, RelocAttrBits::SUBTRAHEND);		target->hasAttr(relInfo.r_type, RelocAttrBits::SUBTRAHEND);
int64_t embeddedAddend = target->getEmbeddedAddend(mb, sec.offset, relInfo);		int64_t embeddedAddend = target->getEmbeddedAddend(mb, sec.offset, relInfo);
		// If we will be running ICF, zero out the embedded addend. This makes it
		// easier for ICF to compare just the non-relocatable section data.
		if (config->icfLevel != ICFLevel::none && embeddedAddend != 0)
		writeAddress(reinterpret_cast<uint8_t *>(mb.getBufferStart()) +
		sec.offset + relInfo.r_address,
		0, relInfo.r_length);
assert(!(embeddedAddend && pairedAddend));		assert(!(embeddedAddend && pairedAddend));
int64_t totalAddend = pairedAddend + embeddedAddend;		int64_t totalAddend = pairedAddend + embeddedAddend;
Reloc r;		Reloc r;
r.type = relInfo.r_type;		r.type = relInfo.r_type;
r.pcrel = relInfo.r_pcrel;		r.pcrel = relInfo.r_pcrel;
r.length = relInfo.r_length;		r.length = relInfo.r_length;
r.offset = relInfo.r_address;		r.offset = relInfo.r_address;
if (relInfo.r_extern) {		if (relInfo.r_extern) {
▲ Show 20 Lines • Show All 294 Lines • ▼ Show 20 Lines	for (size_t j = 0; j < symbolIndices.size(); ++j) {
// emulating that behavior.		// emulating that behavior.
nextIsec->align = MinAlign(sectionAlign, sym.n_value);		nextIsec->align = MinAlign(sectionAlign, sym.n_value);
subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});		subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});
subsecEntry = subsecMap.back();		subsecEntry = subsecMap.back();
}		}
}		}
}		}

OpaqueFile::OpaqueFile(MemoryBufferRef mb, StringRef segName,		OpaqueFile::OpaqueFile(WritableMemoryBufferRef mb, StringRef segName,
StringRef sectName)		StringRef sectName)
: InputFile(OpaqueKind, mb) {		: InputFile(OpaqueKind, mb) {
const auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());		const auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());
ArrayRef<uint8_t> data = {buf, mb.getBufferSize()};		ArrayRef<uint8_t> data = {buf, mb.getBufferSize()};
ConcatInputSection *isec =		ConcatInputSection *isec =
make<ConcatInputSection>(segName.take_front(16), sectName.take_front(16),		make<ConcatInputSection>(segName.take_front(16), sectName.take_front(16),
/file=/this, data);		/file=/this, data);
isec->live = true;		isec->live = true;
subsections.push_back({{0, isec}});		subsections.push_back({{0, isec}});
}		}

ObjFile::ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName)		ObjFile::ObjFile(WritableMemoryBufferRef mb, uint32_t modTime,
		StringRef archiveName)
: InputFile(ObjKind, mb), modTime(modTime) {		: InputFile(ObjKind, mb), modTime(modTime) {
this->archiveName = std::string(archiveName);		this->archiveName = std::string(archiveName);
if (target->wordSize == 8)		if (target->wordSize == 8)
parse<LP64>();		parse<LP64>();
else		else
parse<ILP32>();		parse<ILP32>();
}		}

▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	void ObjFile::parseDataInCode() {
assert(is_sorted(dataInCodeEntries, [](const data_in_code_entry &lhs,		assert(is_sorted(dataInCodeEntries, [](const data_in_code_entry &lhs,
const data_in_code_entry &rhs) {		const data_in_code_entry &rhs) {
return lhs.offset < rhs.offset;		return lhs.offset < rhs.offset;
}));		}));
}		}

// The path can point to either a dylib or a .tbd file.		// The path can point to either a dylib or a .tbd file.
static DylibFile loadDylib(StringRef path, DylibFile umbrella) {		static DylibFile loadDylib(StringRef path, DylibFile umbrella) {
Optional<MemoryBufferRef> mbref = readFile(path);		Optional<WritableMemoryBufferRef> mbref = readFile(path);
if (!mbref) {		if (!mbref) {
error("could not read dylib file at " + path);		error("could not read dylib file at " + path);
return nullptr;		return nullptr;
}		}
return loadDylib(*mbref, umbrella);		return loadDylib(*mbref, umbrella);
}		}

// TBD files are parsed into a series of TAPI documents (InterfaceFiles), with		// TBD files are parsed into a series of TAPI documents (InterfaceFiles), with
▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines

static void loadReexport(StringRef path, DylibFile *umbrella,		static void loadReexport(StringRef path, DylibFile *umbrella,
const InterfaceFile *currentTopLevelTapi) {		const InterfaceFile *currentTopLevelTapi) {
DylibFile *reexport = findDylib(path, umbrella, currentTopLevelTapi);		DylibFile *reexport = findDylib(path, umbrella, currentTopLevelTapi);
if (!reexport)		if (!reexport)
error("unable to locate re-export with install name " + path);		error("unable to locate re-export with install name " + path);
}		}

DylibFile::DylibFile(MemoryBufferRef mb, DylibFile *umbrella,		DylibFile::DylibFile(WritableMemoryBufferRef mb, DylibFile *umbrella,
bool isBundleLoader)		bool isBundleLoader)
: InputFile(DylibKind, mb), refState(RefState::Unreferenced),		: InputFile(DylibKind, mb), refState(RefState::Unreferenced),
isBundleLoader(isBundleLoader) {		isBundleLoader(isBundleLoader) {
assert(!isBundleLoader \|\| !umbrella);		assert(!isBundleLoader \|\| !umbrella);
if (umbrella == nullptr)		if (umbrella == nullptr)
umbrella = this;		umbrella = this;
this->umbrella = umbrella;		this->umbrella = umbrella;

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	parseTrie(buf + c->export_off, c->export_size,
isWeakDef, isTlv));		isWeakDef, isTlv));
});		});
} else {		} else {
error("LC_DYLD_INFO_ONLY not found in " + toString(this));		error("LC_DYLD_INFO_ONLY not found in " + toString(this));
return;		return;
}		}
}		}

void DylibFile::parseLoadCommands(MemoryBufferRef mb) {		void DylibFile::parseLoadCommands(WritableMemoryBufferRef mb) {
auto hdr = reinterpret_cast<const mach_header >(mb.getBufferStart());		auto hdr = reinterpret_cast<const mach_header >(mb.getBufferStart());
const uint8_t p = reinterpret_cast<const uint8_t >(mb.getBufferStart()) +		const uint8_t p = reinterpret_cast<const uint8_t >(mb.getBufferStart()) +
target->headerSize;		target->headerSize;
for (uint32_t i = 0, n = hdr->ncmds; i < n; ++i) {		for (uint32_t i = 0, n = hdr->ncmds; i < n; ++i) {
auto cmd = reinterpret_cast<const load_command >(p);		auto cmd = reinterpret_cast<const load_command >(p);
p += cmd->cmdsize;		p += cmd->cmdsize;

if (!(hdr->flags & MH_NO_REEXPORTED_DYLIBS) &&		if (!(hdr->flags & MH_NO_REEXPORTED_DYLIBS) &&
▲ Show 20 Lines • Show All 183 Lines • ▼ Show 20 Lines
}		}

void DylibFile::checkAppExtensionSafety(bool dylibIsAppExtensionSafe) const {		void DylibFile::checkAppExtensionSafety(bool dylibIsAppExtensionSafe) const {
if (config->applicationExtension && !dylibIsAppExtensionSafe)		if (config->applicationExtension && !dylibIsAppExtensionSafe)
warn("using '-application_extension' with unsafe dylib: " + toString(this));		warn("using '-application_extension' with unsafe dylib: " + toString(this));
}		}

ArchiveFile::ArchiveFile(std::unique_ptr<object::Archive> &&f)		ArchiveFile::ArchiveFile(std::unique_ptr<object::Archive> &&f)
: InputFile(ArchiveKind, f->getMemoryBufferRef()), file(std::move(f)) {		: InputFile(ArchiveKind, f->getMemoryBufferRef().castToWritable()),
		file(std::move(f)) {
for (const object::Archive::Symbol &sym : file->symbols())		for (const object::Archive::Symbol &sym : file->symbols())
symtab->addLazy(sym.getName(), this, sym);		symtab->addLazy(sym.getName(), this, sym);
}		}

void ArchiveFile::fetch(const object::Archive::Symbol &sym) {		void ArchiveFile::fetch(const object::Archive::Symbol &sym) {
object::Archive::Child c =		object::Archive::Child c =
CHECK(sym.getMember(), toString(this) +		CHECK(sym.getMember(), toString(this) +
": could not get the member for symbol " +		": could not get the member for symbol " +
toMachOString(sym));		toMachOString(sym));

if (!seen.insert(c.getChildOffset()).second)		if (!seen.insert(c.getChildOffset()).second)
return;		return;

MemoryBufferRef mb =		WritableMemoryBufferRef mb =
CHECK(c.getMemoryBufferRef(),		CHECK(c.getMemoryBufferRef(),
toString(this) +		toString(this) +
": could not get the buffer for the member defining symbol " +		": could not get the buffer for the member defining symbol " +
toMachOString(sym));		toMachOString(sym))
		.castToWritable();

if (tar && c.getParent()->isThin())		if (tar && c.getParent()->isThin())
tar->append(relativeToRoot(CHECK(c.getFullName(), this)), mb.getBuffer());		tar->append(relativeToRoot(CHECK(c.getFullName(), this)), mb.getBuffer());

uint32_t modTime = toTimeT(		uint32_t modTime = toTimeT(
CHECK(c.getLastModified(), toString(this) +		CHECK(c.getLastModified(), toString(this) +
": could not get the modification time "		": could not get the modification time "
"for the member defining symbol " +		"for the member defining symbol " +
Show All 39 Lines	static macho::Symbol *createBitcodeSymbol(const lto::InputFile::Symbol &objSym,

return symtab->addDefined(name, &file, /isec=/nullptr, /value=/0,		return symtab->addDefined(name, &file, /isec=/nullptr, /value=/0,
/size=/0, objSym.isWeak(), isPrivateExtern,		/size=/0, objSym.isWeak(), isPrivateExtern,
/isThumb=/false,		/isThumb=/false,
/isReferencedDynamically=/false,		/isReferencedDynamically=/false,
/noDeadStrip=/false);		/noDeadStrip=/false);
}		}

BitcodeFile::BitcodeFile(MemoryBufferRef mbref)		BitcodeFile::BitcodeFile(WritableMemoryBufferRef mbref)
: InputFile(BitcodeKind, mbref) {		: InputFile(BitcodeKind, mbref) {
obj = check(lto::InputFile::create(mbref));		obj = check(lto::InputFile::create(mbref));

// Convert LTO Symbols to LLD Symbols in order to perform resolution. The		// Convert LTO Symbols to LLD Symbols in order to perform resolution. The
// "winning" symbol will then be marked as Prevailing at LTO compilation		// "winning" symbol will then be marked as Prevailing at LTO compilation
// time.		// time.
for (const lto::InputFile::Symbol &objSym : obj->symbols())		for (const lto::InputFile::Symbol &objSym : obj->symbols())
symbols.push_back(createBitcodeSymbol(objSym, *this));		symbols.push_back(createBitcodeSymbol(objSym, *this));
}		}

template void ObjFile::parse<LP64>();		template void ObjFile::parse<LP64>();

lld/MachO/LTO.h

Show All 30 Lines	public:
BitcodeCompiler();		BitcodeCompiler();

void add(BitcodeFile &f);		void add(BitcodeFile &f);
std::vector<ObjFile *> compile();		std::vector<ObjFile *> compile();

private:		private:
std::unique_ptr<llvm::lto::LTO> ltoObj;		std::unique_ptr<llvm::lto::LTO> ltoObj;
std::vector<llvm::SmallString<0>> buf;		std::vector<llvm::SmallString<0>> buf;
std::vector<std::unique_ptr<llvm::MemoryBuffer>> files;		std::vector<std::unique_ptr<llvm::WritableMemoryBuffer>> files;
};		};

} // namespace macho		} // namespace macho
} // namespace lld		} // namespace lld

#endif		#endif

lld/MachO/LTO.cpp

Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	std::vector<ObjFile *> BitcodeCompiler::compile() {
buf.resize(maxTasks);		buf.resize(maxTasks);
files.resize(maxTasks);		files.resize(maxTasks);

// The -cache_path_lto option specifies the path to a directory in which		// The -cache_path_lto option specifies the path to a directory in which
// to cache native object files for ThinLTO incremental builds. If a path was		// to cache native object files for ThinLTO incremental builds. If a path was
// specified, configure LTO to use it as the cache directory.		// specified, configure LTO to use it as the cache directory.
lto::NativeObjectCache cache;		lto::NativeObjectCache cache;
if (!config->thinLTOCacheDir.empty())		if (!config->thinLTOCacheDir.empty())
cache = check(		cache = check(lto::localCache(
lto::localCache(config->thinLTOCacheDir,		config->thinLTOCacheDir,
[&](size_t task, std::unique_ptr<MemoryBuffer> mb) {		[&](size_t task, std::unique_ptr<WritableMemoryBuffer> mb) {
files[task] = std::move(mb);		files[task] = std::move(mb);
}));		}));

checkError(ltoObj->run(		checkError(ltoObj->run(
[&](size_t task) {		[&](size_t task) {
return std::make_unique<lto::NativeObjectStream>(		return std::make_unique<lto::NativeObjectStream>(
std::make_unique<raw_svector_ostream>(buf[task]));		std::make_unique<raw_svector_ostream>(buf[task]));
},		},
cache));		cache));

Show All 20 Lines	if (!config->ltoObjPath.empty()) {
filePath = config->ltoObjPath;		filePath = config->ltoObjPath;
path::append(filePath, Twine(i) + "." +		path::append(filePath, Twine(i) + "." +
getArchitectureName(config->arch()) +		getArchitectureName(config->arch()) +
".lto.o");		".lto.o");
saveBuffer(buf[i], filePath);		saveBuffer(buf[i], filePath);
modTime = getModTime(filePath);		modTime = getModTime(filePath);
}		}
ret.push_back(make<ObjFile>(		ret.push_back(make<ObjFile>(
MemoryBufferRef(buf[i], saver.save(filePath.str())), modTime, ""));		WritableMemoryBufferRef(buf[i], saver.save(filePath.str())), modTime,
		""));
}		}
for (std::unique_ptr<MemoryBuffer> &file : files)		for (std::unique_ptr<WritableMemoryBuffer> &file : files)
if (file)		if (file)
ret.push_back(make<ObjFile>(*file, 0, ""));		ret.push_back(make<ObjFile>(*file, 0, ""));
return ret;		return ret;
}		}

lld/MachO/ObjC.h

	Show All 17 Lines

	constexpr const char klass[] = "_OBJC_CLASS_$_";			constexpr const char klass[] = "_OBJC_CLASS_$_";
	constexpr const char metaclass[] = "_OBJC_METACLASS_$_";			constexpr const char metaclass[] = "_OBJC_METACLASS_$_";
	constexpr const char ehtype[] = "_OBJC_EHTYPE_$_";			constexpr const char ehtype[] = "_OBJC_EHTYPE_$_";
	constexpr const char ivar[] = "_OBJC_IVAR_$_";			constexpr const char ivar[] = "_OBJC_IVAR_$_";

	} // namespace objc			} // namespace objc

	bool hasObjCSection(llvm::MemoryBufferRef);			bool hasObjCSection(llvm::WritableMemoryBufferRef);

	} // namespace macho			} // namespace macho
	} // namespace lld			} // namespace lld

	#endif			#endif

lld/MachO/ObjC.cpp

Show All 13 Lines

#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"

using namespace llvm;		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

template <class LP> static bool hasObjCSection(MemoryBufferRef mb) {		template <class LP> static bool hasObjCSection(WritableMemoryBufferRef mb) {
using Section = typename LP::section;		using Section = typename LP::section;

auto *hdr =		auto *hdr =
reinterpret_cast<const typename LP::mach_header *>(mb.getBufferStart());		reinterpret_cast<const typename LP::mach_header *>(mb.getBufferStart());
if (hdr->magic != LP::magic)		if (hdr->magic != LP::magic)
return false;		return false;

if (const auto *c =		if (const auto *c =
Show All 10 Lines	for (const Section &sec : sectionHeaders) {
sectname == section_names::swift)) {		sectname == section_names::swift)) {
return true;		return true;
}		}
}		}
}		}
return false;		return false;
}		}

bool macho::hasObjCSection(MemoryBufferRef mb) {		bool macho::hasObjCSection(WritableMemoryBufferRef mb) {
if (target->wordSize == 8)		if (target->wordSize == 8)
return ::hasObjCSection<LP64>(mb);		return ::hasObjCSection<LP64>(mb);
else		else
return ::hasObjCSection<ILP32>(mb);		return ::hasObjCSection<ILP32>(mb);
}		}

lld/MachO/Target.h

Show All 37 Lines	template <class LP> TargetInfo(LP) {
headerSize = sizeof(typename LP::mach_header);		headerSize = sizeof(typename LP::mach_header);
wordSize = LP::wordSize;		wordSize = LP::wordSize;
}		}

virtual ~TargetInfo() = default;		virtual ~TargetInfo() = default;

// Validate the relocation structure and get its addend.		// Validate the relocation structure and get its addend.
virtual int64_t		virtual int64_t
getEmbeddedAddend(llvm::MemoryBufferRef, uint64_t offset,		getEmbeddedAddend(llvm::WritableMemoryBufferRef, uint64_t offset,
const llvm::MachO::relocation_info) const = 0;		const llvm::MachO::relocation_info) const = 0;
virtual void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,		virtual void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
uint64_t relocVA) const = 0;		uint64_t relocVA) const = 0;

// Write code for lazy binding. See the comments on StubsSection for more		// Write code for lazy binding. See the comments on StubsSection for more
// details.		// details.
virtual void writeStub(uint8_t *buf, const Symbol &) const = 0;		virtual void writeStub(uint8_t *buf, const Symbol &) const = 0;
virtual void writeStubHelperHeader(uint8_t *buf) const = 0;		virtual void writeStubHelperHeader(uint8_t *buf) const = 0;
▲ Show 20 Lines • Show All 88 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/Caching.h

	Show All 17 Lines

	namespace llvm {			namespace llvm {
	namespace lto {			namespace lto {

	/// This type defines the callback to add a pre-existing native object file			/// This type defines the callback to add a pre-existing native object file
	/// (e.g. in a cache).			/// (e.g. in a cache).
	///			///
	/// Buffer callbacks must be thread safe.			/// Buffer callbacks must be thread safe.
	using AddBufferFn =			using AddBufferFn = std::function<void(
	std::function<void(unsigned Task, std::unique_ptr<MemoryBuffer> MB)>;			unsigned Task, std::unique_ptr<WritableMemoryBuffer> MB)>;

	/// Create a local file system cache which uses the given cache directory and			/// Create a local file system cache which uses the given cache directory and
	/// file callback. This function also creates the cache directory if it does not			/// file callback. This function also creates the cache directory if it does not
	/// already exist.			/// already exist.
	Expected<NativeObjectCache> localCache(StringRef CacheDirectoryPath,			Expected<NativeObjectCache> localCache(StringRef CacheDirectoryPath,
	AddBufferFn AddBuffer);			AddBufferFn AddBuffer);

	} // namespace lto			} // namespace lto
	} // namespace llvm			} // namespace llvm

	#endif			#endif

llvm/include/llvm/Support/MemoryBuffer.h

Show First 20 Lines • Show All 194 Lines • ▼ Show 20 Lines	public:
getNewUninitMemBuffer(size_t Size, const Twine &BufferName = "");		getNewUninitMemBuffer(size_t Size, const Twine &BufferName = "");

/// Allocate a new zero-initialized MemoryBuffer of the specified size. Note		/// Allocate a new zero-initialized MemoryBuffer of the specified size. Note
/// that the caller need not initialize the memory allocated by this method.		/// that the caller need not initialize the memory allocated by this method.
/// The memory is owned by the MemoryBuffer object.		/// The memory is owned by the MemoryBuffer object.
static std::unique_ptr<WritableMemoryBuffer>		static std::unique_ptr<WritableMemoryBuffer>
getNewMemBuffer(size_t Size, const Twine &BufferName = "");		getNewMemBuffer(size_t Size, const Twine &BufferName = "");

		/// Open the specified memory range as a MemoryBuffer, copying the contents
		/// and taking ownership of it. InputData does not have to be null terminated.
		static std::unique_ptr<WritableMemoryBuffer>
		getMemBufferCopy(StringRef InputData, const Twine &BufferName = "");

		static ErrorOr<std::unique_ptr<WritableMemoryBuffer>>
		getOpenFile(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
		bool RequiresNullTerminator = true, bool IsVolatile = false);

private:		private:
// Hide these base class factory function so one can't write		// Hide these base class factory function so one can't write
// WritableMemoryBuffer::getXXX()		// WritableMemoryBuffer::getXXX()
// and be surprised that he got a read-only Buffer.		// and be surprised that he got a read-only Buffer.
using MemoryBuffer::getFileAsStream;		using MemoryBuffer::getFileAsStream;
using MemoryBuffer::getFileOrSTDIN;		using MemoryBuffer::getFileOrSTDIN;
using MemoryBuffer::getMemBuffer;		using MemoryBuffer::getMemBuffer;
using MemoryBuffer::getMemBufferCopy;
using MemoryBuffer::getOpenFile;
using MemoryBuffer::getOpenFileSlice;		using MemoryBuffer::getOpenFileSlice;
using MemoryBuffer::getSTDIN;		using MemoryBuffer::getSTDIN;
};		};

/// This class is an extension of MemoryBuffer, which allows write access to		/// This class is an extension of MemoryBuffer, which allows write access to
/// the underlying contents and committing those changes to the original source.		/// the underlying contents and committing those changes to the original source.
/// It only supports creation methods that are guaranteed to produce a writable		/// It only supports creation methods that are guaranteed to produce a writable
/// buffer. For example, mapping a file read-only is not supported.		/// buffer. For example, mapping a file read-only is not supported.
▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

llvm/include/llvm/Support/MemoryBufferRef.h

	//===- MemoryBufferRef.h - Memory Buffer Reference --------------- C++ --===//			//===- MemoryBufferRef.h - Memory Buffer Reference --------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines the MemoryBuffer interface.			// This file defines the MemoryBuffer interface.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_SUPPORT_MEMORYBUFFERREF_H			#ifndef LLVM_SUPPORT_MEMORYBUFFERREF_H
	#define LLVM_SUPPORT_MEMORYBUFFERREF_H			#define LLVM_SUPPORT_MEMORYBUFFERREF_H

				#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"

	namespace llvm {			namespace llvm {

	class MemoryBuffer;			class MemoryBuffer;
				class WritableMemoryBuffer;
				class WritableMemoryBufferRef;

	class MemoryBufferRef {			class MemoryBufferRef {
				protected:
	StringRef Buffer;			StringRef Buffer;
	StringRef Identifier;			StringRef Identifier;

	public:			public:
	MemoryBufferRef() = default;			MemoryBufferRef() = default;
	MemoryBufferRef(const MemoryBuffer &Buffer);			MemoryBufferRef(const MemoryBuffer &Buffer);
	MemoryBufferRef(StringRef Buffer, StringRef Identifier)			MemoryBufferRef(StringRef Buffer, StringRef Identifier)
	: Buffer(Buffer), Identifier(Identifier) {}			: Buffer(Buffer), Identifier(Identifier) {}

	StringRef getBuffer() const { return Buffer; }			StringRef getBuffer() const { return Buffer; }
	StringRef getBufferIdentifier() const { return Identifier; }			StringRef getBufferIdentifier() const { return Identifier; }

	const char *getBufferStart() const { return Buffer.begin(); }			const char *getBufferStart() const { return Buffer.begin(); }
	const char *getBufferEnd() const { return Buffer.end(); }			const char *getBufferEnd() const { return Buffer.end(); }
	size_t getBufferSize() const { return Buffer.size(); }			size_t getBufferSize() const { return Buffer.size(); }
				// This operation is safe iff this MemoryBufferRef was initialized with a
				// writable buffer.
				WritableMemoryBufferRef castToWritable() const;

	/// Check pointer identity (not value) of identifier and data.			/// Check pointer identity (not value) of identifier and data.
	friend bool operator==(const MemoryBufferRef &LHS,			friend bool operator==(const MemoryBufferRef &LHS,
	const MemoryBufferRef &RHS) {			const MemoryBufferRef &RHS) {
	return LHS.Buffer.begin() == RHS.Buffer.begin() &&			return LHS.Buffer.begin() == RHS.Buffer.begin() &&
	LHS.Buffer.end() == RHS.Buffer.end() &&			LHS.Buffer.end() == RHS.Buffer.end() &&
	LHS.Identifier.begin() == RHS.Identifier.begin() &&			LHS.Identifier.begin() == RHS.Identifier.begin() &&
	LHS.Identifier.end() == RHS.Identifier.end();			LHS.Identifier.end() == RHS.Identifier.end();
	}			}

	friend bool operator!=(const MemoryBufferRef &LHS,			friend bool operator!=(const MemoryBufferRef &LHS,
	const MemoryBufferRef &RHS) {			const MemoryBufferRef &RHS) {
	return !(LHS == RHS);			return !(LHS == RHS);
	}			}
	};			};

				class WritableMemoryBufferRef : public MemoryBufferRef {
				public:
				WritableMemoryBufferRef() = default;
				WritableMemoryBufferRef(const WritableMemoryBuffer &Buffer);
				WritableMemoryBufferRef(MutableArrayRef<char> Buffer, StringRef Identifier)
				: MemoryBufferRef({Buffer.data(), Buffer.size()}, Identifier) {}

				// const_cast is well-defined here, because the underlying buffer is
				// guaranteed to have been initialized with a mutable buffer.
				char getBufferStart() const { return const_cast<char >(Buffer.begin()); }
				char getBufferEnd() const { return const_cast<char >(Buffer.end()); }
				};

	} // namespace llvm			} // namespace llvm

	#endif // LLVM_SUPPORT_MEMORYBUFFERREF_H			#endif // LLVM_SUPPORT_MEMORYBUFFERREF_H

llvm/lib/LTO/Caching.cpp

Show All 38 Lines	return [=](unsigned Task, StringRef Key) -> AddStreamFn {
SmallString<64> EntryPath;		SmallString<64> EntryPath;
sys::path::append(EntryPath, CacheDirectoryPath, "llvmcache-" + Key);		sys::path::append(EntryPath, CacheDirectoryPath, "llvmcache-" + Key);
// First, see if we have a cache hit.		// First, see if we have a cache hit.
SmallString<64> ResultPath;		SmallString<64> ResultPath;
Expected<sys::fs::file_t> FDOrErr = sys::fs::openNativeFileForRead(		Expected<sys::fs::file_t> FDOrErr = sys::fs::openNativeFileForRead(
Twine(EntryPath), sys::fs::OF_UpdateAtime, &ResultPath);		Twine(EntryPath), sys::fs::OF_UpdateAtime, &ResultPath);
std::error_code EC;		std::error_code EC;
if (FDOrErr) {		if (FDOrErr) {
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<WritableMemoryBuffer>> MBOrErr =
MemoryBuffer::getOpenFile(*FDOrErr, EntryPath,		WritableMemoryBuffer::getOpenFile(*FDOrErr, EntryPath,
/FileSize=/-1,		/FileSize=/-1,
/RequiresNullTerminator=/false);		/RequiresNullTerminator=/false);
sys::fs::closeFile(*FDOrErr);		sys::fs::closeFile(*FDOrErr);
if (MBOrErr) {		if (MBOrErr) {
AddBuffer(Task, std::move(*MBOrErr));		AddBuffer(Task, std::move(*MBOrErr));
return AddStreamFn();		return AddStreamFn();
}		}
EC = MBOrErr.getError();		EC = MBOrErr.getError();
} else {		} else {
EC = errorToErrorCode(FDOrErr.takeError());		EC = errorToErrorCode(FDOrErr.takeError());
Show All 24 Lines	struct CacheStream : NativeObjectStream {
TempFile(std::move(TempFile)), EntryPath(std::move(EntryPath)),		TempFile(std::move(TempFile)), EntryPath(std::move(EntryPath)),
Task(Task) {}		Task(Task) {}

~CacheStream() {		~CacheStream() {
// Make sure the stream is closed before committing it.		// Make sure the stream is closed before committing it.
OS.reset();		OS.reset();

// Open the file first to avoid racing with a cache pruner.		// Open the file first to avoid racing with a cache pruner.
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<WritableMemoryBuffer>> MBOrErr =
MemoryBuffer::getOpenFile(		WritableMemoryBuffer::getOpenFile(
sys::fs::convertFDToNativeFile(TempFile.FD), TempFile.TmpName,		sys::fs::convertFDToNativeFile(TempFile.FD), TempFile.TmpName,
/FileSize=/-1, /RequiresNullTerminator=/false);		/FileSize=/-1, /RequiresNullTerminator=/false);
if (!MBOrErr)		if (!MBOrErr)
report_fatal_error(Twine("Failed to open new cache file ") +		report_fatal_error(Twine("Failed to open new cache file ") +
TempFile.TmpName + ": " +		TempFile.TmpName + ": " +
MBOrErr.getError().message() + "\n");		MBOrErr.getError().message() + "\n");

// On POSIX systems, this will atomically replace the destination if		// On POSIX systems, this will atomically replace the destination if
// it already exists. We try to emulate this on Windows, but this may		// it already exists. We try to emulate this on Windows, but this may
// fail with a permission denied error (for example, if the destination		// fail with a permission denied error (for example, if the destination
// is currently opened by another process that does not give us the		// is currently opened by another process that does not give us the
// sharing permissions we need). Since the existing file should be		// sharing permissions we need). Since the existing file should be
// semantically equivalent to the one we are trying to write, we give		// semantically equivalent to the one we are trying to write, we give
// AddBuffer a copy of the bytes we wrote in that case. We do this		// AddBuffer a copy of the bytes we wrote in that case. We do this
// instead of just using the existing file, because the pruner might		// instead of just using the existing file, because the pruner might
// delete the file before we get a chance to use it.		// delete the file before we get a chance to use it.
Error E = TempFile.keep(EntryPath);		Error E = TempFile.keep(EntryPath);
E = handleErrors(std::move(E), [&](const ECError &E) -> Error {		E = handleErrors(std::move(E), [&](const ECError &E) -> Error {
std::error_code EC = E.convertToErrorCode();		std::error_code EC = E.convertToErrorCode();
if (EC != errc::permission_denied)		if (EC != errc::permission_denied)
return errorCodeToError(EC);		return errorCodeToError(EC);

auto MBCopy = MemoryBuffer::getMemBufferCopy((*MBOrErr)->getBuffer(),		MutableArrayRef<char> buf = (*MBOrErr)->getBuffer();
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'buf' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'buf' [readability-identifier-naming]…
EntryPath);		auto MBCopy = WritableMemoryBuffer::getMemBufferCopy(
		{buf.data(), buf.size()}, EntryPath);
MBOrErr = std::move(MBCopy);		MBOrErr = std::move(MBCopy);

// FIXME: should we consume the discard error?		// FIXME: should we consume the discard error?
consumeError(TempFile.discard());		consumeError(TempFile.discard());

return Error::success();		return Error::success();
});		});

Show All 27 Lines

llvm/lib/Object/Archive.cpp

Show First 20 Lines • Show All 448 Lines • ▼ Show 20 Lines	if (!isThin) {
if (!Size)		if (!Size)
return Size.takeError();		return Size.takeError();
return StringRef(Data.data() + StartOfFile, Size.get());		return StringRef(Data.data() + StartOfFile, Size.get());
}		}
Expected<std::string> FullNameOrErr = getFullName();		Expected<std::string> FullNameOrErr = getFullName();
if (!FullNameOrErr)		if (!FullNameOrErr)
return FullNameOrErr.takeError();		return FullNameOrErr.takeError();
const std::string &FullName = *FullNameOrErr;		const std::string &FullName = *FullNameOrErr;
ErrorOr<std::unique_ptr<MemoryBuffer>> Buf = MemoryBuffer::getFile(FullName);		// Create a writable buffer in case our caller wants to cast our return value
		// into a WritableMemoryBuffer. (Such a cast would be safe iff this Archive
		// was initialized with a WritableMemoryBuffer too.)
		ErrorOr<std::unique_ptr<MemoryBuffer>> Buf =
		WritableMemoryBuffer::getFile(FullName);
if (std::error_code EC = Buf.getError())		if (std::error_code EC = Buf.getError())
return errorCodeToError(EC);		return errorCodeToError(EC);
Parent->ThinBuffers.push_back(std::move(*Buf));		Parent->ThinBuffers.push_back(std::move(*Buf));
return Parent->ThinBuffers.back()->getBuffer();		return Parent->ThinBuffers.back()->getBuffer();
}		}

Expected<Archive::Child> Archive::Child::getNext() const {		Expected<Archive::Child> Archive::Child::getNext() const {
size_t SpaceToSkip = Data.size();		size_t SpaceToSkip = Data.size();
▲ Show 20 Lines • Show All 537 Lines • Show Last 20 Lines

llvm/lib/Support/MemoryBuffer.cpp

Show First 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	getMemBufferCopyImpl(StringRef InputData, const Twine &BufferName) {
memcpy(Buf->getBufferStart(), InputData.data(), InputData.size());		memcpy(Buf->getBufferStart(), InputData.data(), InputData.size());
return std::move(Buf);		return std::move(Buf);
}		}

std::unique_ptr<MemoryBuffer>		std::unique_ptr<MemoryBuffer>
MemoryBuffer::getMemBufferCopy(StringRef InputData, const Twine &BufferName) {		MemoryBuffer::getMemBufferCopy(StringRef InputData, const Twine &BufferName) {
auto Buf = getMemBufferCopyImpl(InputData, BufferName);		auto Buf = getMemBufferCopyImpl(InputData, BufferName);
if (Buf)		if (Buf)
		return std::unique_ptr<MemoryBuffer>(std::move(*Buf));
		return nullptr;
		}

		std::unique_ptr<WritableMemoryBuffer>
		WritableMemoryBuffer::getMemBufferCopy(StringRef InputData,
		const Twine &BufferName) {
		auto Buf = getMemBufferCopyImpl(InputData, BufferName);
		if (Buf)
return std::move(*Buf);		return std::move(*Buf);
return nullptr;		return nullptr;
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getFileOrSTDIN(const Twine &Filename, bool IsText,		MemoryBuffer::getFileOrSTDIN(const Twine &Filename, bool IsText,
bool RequiresNullTerminator) {		bool RequiresNullTerminator) {
SmallString<256> NameBuf;		SmallString<256> NameBuf;
▲ Show 20 Lines • Show All 354 Lines • ▼ Show 20 Lines

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getOpenFile(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,		MemoryBuffer::getOpenFile(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
bool RequiresNullTerminator, bool IsVolatile) {		bool RequiresNullTerminator, bool IsVolatile) {
return getOpenFileImpl<MemoryBuffer>(FD, Filename, FileSize, FileSize, 0,		return getOpenFileImpl<MemoryBuffer>(FD, Filename, FileSize, FileSize, 0,
RequiresNullTerminator, IsVolatile);		RequiresNullTerminator, IsVolatile);
}		}

		ErrorOr<std::unique_ptr<WritableMemoryBuffer>>
		WritableMemoryBuffer::getOpenFile(sys::fs::file_t FD, const Twine &Filename,
		uint64_t FileSize,
		bool RequiresNullTerminator,
		bool IsVolatile) {
		return getOpenFileImpl<WritableMemoryBuffer>(
		FD, Filename, FileSize, FileSize, 0, RequiresNullTerminator, IsVolatile);
		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getOpenFileSlice(sys::fs::file_t FD, const Twine &Filename, uint64_t MapSize,		MemoryBuffer::getOpenFileSlice(sys::fs::file_t FD, const Twine &Filename, uint64_t MapSize,
int64_t Offset, bool IsVolatile) {		int64_t Offset, bool IsVolatile) {
assert(MapSize != uint64_t(-1));		assert(MapSize != uint64_t(-1));
return getOpenFileImpl<MemoryBuffer>(FD, Filename, -1, MapSize, Offset, false,		return getOpenFileImpl<MemoryBuffer>(FD, Filename, -1, MapSize, Offset, false,
IsVolatile);		IsVolatile);
}		}

Show All 30 Lines

llvm/lib/Support/MemoryBufferRef.cpp

	Show All 11 Lines

	#include "llvm/Support/MemoryBufferRef.h"			#include "llvm/Support/MemoryBufferRef.h"
	#include "llvm/Support/MemoryBuffer.h"			#include "llvm/Support/MemoryBuffer.h"

	using namespace llvm;			using namespace llvm;

	MemoryBufferRef::MemoryBufferRef(const MemoryBuffer &Buffer)			MemoryBufferRef::MemoryBufferRef(const MemoryBuffer &Buffer)
	: Buffer(Buffer.getBuffer()), Identifier(Buffer.getBufferIdentifier()) {}			: Buffer(Buffer.getBuffer()), Identifier(Buffer.getBufferIdentifier()) {}

				WritableMemoryBufferRef MemoryBufferRef::castToWritable() const {
				thevinsterUnsubmitted Not Done Reply Inline Actions Is there any way to prevent casting if this was called on a non-writeable buffer? I was thinking of some flag based approach that is set for writeable buffer and have that be determine whether casting can be done. That way, this can be a bit safer than having to let the developer have context about whether this is safe or not. thevinster: Is there any way to prevent casting if this was called on a non-writeable buffer? I was…
				return WritableMemoryBufferRef(
				MutableArrayRef<char>(const_cast<char *>(Buffer.data()), Buffer.size()),
				Identifier);
				}

				WritableMemoryBufferRef::WritableMemoryBufferRef(
				const WritableMemoryBuffer &Buffer)
				: MemoryBufferRef(Buffer) {}

This is an archive of the discontinued LLVM Phabricator instance.

[lld-macho] Enable copy-on-write for input buffersAbandonedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 359537

lld/MachO/Arch/ARM.cpp

lld/MachO/Arch/ARM64Common.h

lld/MachO/Arch/ARM64Common.cpp

lld/MachO/Arch/X86_64.cpp

lld/MachO/Driver.h

lld/MachO/Driver.cpp

lld/MachO/DriverUtils.cpp

lld/MachO/InputFiles.h

lld/MachO/InputFiles.cpp

lld/MachO/LTO.h

lld/MachO/LTO.cpp

lld/MachO/ObjC.h

lld/MachO/ObjC.cpp

lld/MachO/Target.h

llvm/include/llvm/LTO/Caching.h

llvm/include/llvm/Support/MemoryBuffer.h

llvm/include/llvm/Support/MemoryBufferRef.h

llvm/lib/LTO/Caching.cpp

llvm/lib/Object/Archive.cpp

llvm/lib/Support/MemoryBuffer.cpp

llvm/lib/Support/MemoryBufferRef.cpp

[lld-macho] Enable copy-on-write for input buffers
AbandonedPublic