This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/
-
MachO/
1
Driver.cpp
-
InputFiles.h
3/6
InputFiles.cpp
-
test/MachO/
-
MachO/
-
lc-linker-option.ll
-
objc.s

Differential D140592

[lld-macho] Skip re-loading archive if already loaded
AbandonedPublic

Authored by thevinster on Dec 22 2022, 4:43 PM.

Download Raw Diff

Details

Reviewers

None

Group Reviewers

Restricted Project

Summary

When an archive is loaded via an LC_LINKER_OPTION, loading it again will
cause duplicate symbols. This check has been added here. However,
this didn't handle the case where the archive paths are different. For example,
we could be loading a framework and that same framework can be bundled
in another static library. If we are force loading those objects via ObjC, we will
end up with duplicate symbols.

Here, add another check that hashes on the contents and skips loading an
archive's children if the module has already been loaded. This requires making
seen a global property instead of a class property so it can properly track the
different archives being loaded. This also matches ld64's behavior.

While I'm here, make the output test executables be unique so it's easier to debug.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

thevinster created this revision.Dec 22 2022, 4:43 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptDec 22 2022, 4:43 PM

Herald added a reviewer: Restricted Project. · View Herald Transcript

thevinster edited the summary of this revision. (Show Details)Dec 22 2022, 5:00 PM

thevinster edited the summary of this revision. (Show Details)

thevinster published this revision for review.Dec 22 2022, 5:02 PM

thevinster edited the summary of this revision. (Show Details)

Herald added a project: Restricted Project. · View Herald TranscriptDec 22 2022, 5:02 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B204692: Diff 484990.Dec 22 2022, 5:27 PM

thakis added a subscriber: thakis.Dec 22 2022, 6:45 PM

thakis added inline comments.

lld/MachO/Driver.cpp
327
lld/MachO/InputFiles.cpp
2110	We're doing this based on buffer contents? Where in ld64's source does it do that? That seems like very surprising semantics (and kinda bad for perf).

thevinster added inline comments.Dec 27 2022, 12:22 AM

lld/MachO/InputFiles.cpp
2110	I don't know if there will be any significant perf regressions. I'll double check that on my end first before saying anything else. From what I can tell, it looks like ld64 hashes based on the contents of the Entry class. I agree that these are surprising semantics (especially the behavior of force loading two of the same object files from an archive), but this strange behavior is also implicitly depended upon by some of our builds. Whether or not this is the correct implementation, I'm merely trying to follow ld64's behavior. As far as implementation is concerned, this is the path of least resistance that'll solve our current duplicate symbol issue. I'm open to suggestions on how else this can be done (I've tried approaches where I only load the symbol if it hasn't been loaded but that ran into many edge cases).

thevinster added inline comments.Dec 27 2022, 4:58 PM

lld/MachO/InputFiles.cpp

2110

Benchmarking chromium framework,

           base           diff           difference (95% CI)
sys_time   2.050 ± 0.028  2.037 ± 0.027  [  -1.2% ..   -0.0%]
user_time  4.550 ± 0.040  4.589 ± 0.040  [  +0.5% ..   +1.2%]
wall_time  7.054 ± 0.067  7.084 ± 0.067  [  +0.0% ..   +0.8%]
samples    45             42

This is a very slight regression, and personally, I think it's worth the hit if it means us being even more compatible with ld64's behaviors.

int3 added a subscriber: int3.Dec 27 2022, 6:32 PM

int3 added inline comments.

lld/MachO/InputFiles.cpp
2110	Do you have a code pointer to where the `Entry` class is getting hashed in ld64?

Imho it'd be nicer to add this to the "list of places where lld is different from ld64". From what I can tell, this isn't a _huge_ problem in practice (we haven't needed it until now), and there's some value in both sensible semantics and in having behavior that's consistent with the other llds too.

In D140592#4018461, @thakis wrote:

Imho it'd be nicer to add this to the "list of places where lld is different from ld64". From what I can tell, this isn't a _huge_ problem in practice (we haven't needed it until now), and there's some value in both sensible semantics and in having behavior that's consistent with the other llds too.

I hear you on the sensible semantics, but I disagree that we need to have behavior that's consistent with the other lld ports. From what I can see, there are many things that the Mach-O port has introduced that differs from ELF and COFF ports (e.g. string alignment, async map writing, etc.) that at this point, I would even consider all three linkers to be distinct. Plus, it deviates from the goal of lld being a "drop-in" replacement. While I don't have strong feelings of whether this specific behavior needs to be implemented in lld, I'm curious to know where you draw the line when a behavior should deviate from ld64. In regards to whether this is a huge problem in practice, I don't think any of us here can make great arguments since everyone focuses on making their builds work. In your case, maybe this isn't a "huge problem", but in our case, it is.

lld/MachO/InputFiles.cpp
2110	If I'm understanding correctly, I believe this method is where the check happens whether an archive member has already been loaded. In particular, `_instantiatedEntries` (https://github.com/apple-oss-distributions/ld64/blob/main/src/ld/parsers/archive_file.cpp#L121) is a map of an Entry to the MemberState.

So I'm kinda with @thakis on this. I would prefer we avoid hashing contents (assuming this is what ld64 is actually doing...)

I'm curious to know where you draw the line when a behavior should deviate from ld64

@thakis would probably say that he doesn't like the async map file stuff either ;)

IMO the async map file stuff is different because

it's a strict implementation detail -- there's no observable change aside from perf
though the perf impact of hashing archives in the chromium_framework build is small, this seems like the kind of thing that could surprise us down the line. hashing entire files is in general Not Cheap, and it seems quite possible that we'll one day encounter a large archive member that causes perf issues. We *could* try to hack around it by introducing a flag such as --no-hash-archive-member=, but that's super ugly

Or in other words, the async map file has an ugly implementation, but the final behavior is easy enough to reason about / build on top of. Hashing archive contents has a relatively simple implementation, but the resulting behavior feels off...

lld/MachO/InputFiles.cpp
2110	`_instantiatedEntries` is a vanilla `std::map` w/o a custom comparator (https://github.com/apple-oss-distributions/ld64/blob/main/src/ld/parsers/archive_file.cpp#L121), so I don't think it's hashing the contents of `Entry`, just the pointer value itself

I'm happy to commit to the idea that hashing on the contents is the wrong implementation. But, do people think this specific behavior is one that should still be fixed within LLD (with another implementation) or be fixed directly in the codebase (assuming this is possible)?

So I played around with the test a bit, creating a has-objc-symbol-2.s file that is identical except with a section named has_objc_symbol2 instead of has_objc_symbol. There is no dup symbol error either, demonstrating that ld64 isn't hashing contents (at least not the full contents).

Anyway that investigation jogged my memory -- I think this the same issue as https://github.com/llvm/llvm-project/issues/50817#issuecomment-981045478. As that comment indicates, I'd prefer if we fixed the issue in the builds, if that's possible.

We should also document that behavioral difference so we don't forget about it again heh

Re-reading that, it seems to be the same issue (though our case is because of swift autolinking instead of symbols defined in the same archive). I think we should re-open that PR and track it for future references. Seeing that it could be more widespread, we might want to prioritize it.

Revision Contents

Path

Size

lld/

MachO/

Driver.cpp

2 lines

InputFiles.h

5 lines

InputFiles.cpp

13 lines

test/

MachO/

lc-linker-option.ll

33 lines

objc.s

36 lines

Diff 484990

lld/MachO/Driver.cpp

Show First 20 Lines • Show All 318 Lines • ▼ Show 20 Lines

if ((isCommandLineLoad && config->allLoad) ||

break;

case LoadType::CommandLineForce:

reason = "-force_load";

break;

case LoadType::CommandLine:

reason = "-all_load";

break;

}

if (Error e = file->fetch(c, reason))

if (Error e = file->fetch(c, reason, true))

thakisUnsubmitted

Not Done

break;

}

- if (Error e = file->fetch(c, reason, true))

+ if (Error e = file->fetch(c, reason, /*skipCache=*/true))

error(toString(file) + ": " + reason +

thakis:

error(toString(file) + ": " + reason +

" failed to load archive member: " + toString(std::move(e)));

}

if (e)

error(toString(file) +

": Archive::children failed: " + toString(std::move(e)));

}

} else if (isCommandLineLoad && config->forceLoadObjC) {

▲ Show 20 Lines • Show All 1,572 Lines • Show Last 20 Lines

lld/MachO/InputFiles.h

	Show First 20 Lines • Show All 276 Lines • ▼ Show 20 Lines
	class ArchiveFile final : public InputFile {			class ArchiveFile final : public InputFile {
	public:			public:
	explicit ArchiveFile(std::unique_ptr<llvm::object::Archive> &&file,			explicit ArchiveFile(std::unique_ptr<llvm::object::Archive> &&file,
	bool forceHidden);			bool forceHidden);
	void addLazySymbols();			void addLazySymbols();
	void fetch(const llvm::object::Archive::Symbol &);			void fetch(const llvm::object::Archive::Symbol &);
	// LLD normally doesn't use Error for error-handling, but the underlying			// LLD normally doesn't use Error for error-handling, but the underlying
	// Archive library does, so this is the cleanest way to wrap it.			// Archive library does, so this is the cleanest way to wrap it.
	Error fetch(const llvm::object::Archive::Child &, StringRef reason);			Error fetch(const llvm::object::Archive::Child &, StringRef reason, bool skipCache = false);
	const llvm::object::Archive &getArchive() const { return *file; };			const llvm::object::Archive &getArchive() const { return *file; };
	static bool classof(const InputFile *f) { return f->kind() == ArchiveKind; }			static bool classof(const InputFile *f) { return f->kind() == ArchiveKind; }

	private:			private:
	std::unique_ptr<llvm::object::Archive> file;			std::unique_ptr<llvm::object::Archive> file;
	// Keep track of children fetched from the archive by tracking
	// which address offsets have been fetched already.
	llvm::DenseSet<uint64_t> seen;
	// Load all symbols with hidden visibility (-load_hidden).			// Load all symbols with hidden visibility (-load_hidden).
	bool forceHidden;			bool forceHidden;
	};			};

	class BitcodeFile final : public InputFile {			class BitcodeFile final : public InputFile {
	public:			public:
	explicit BitcodeFile(MemoryBufferRef mb, StringRef archiveName,			explicit BitcodeFile(MemoryBufferRef mb, StringRef archiveName,
	uint64_t offsetInArchive, bool lazy = false,			uint64_t offsetInArchive, bool lazy = false,
	▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

lld/MachO/InputFiles.cpp

Show First 20 Lines • Show All 2,089 Lines • ▼ Show 20 Lines	return make<BitcodeFile>(mb, archiveName, offsetInArchive, /lazy=/false,
forceHidden);		forceHidden);
default:		default:
return createStringError(inconvertibleErrorCode(),		return createStringError(inconvertibleErrorCode(),
mb.getBufferIdentifier() +		mb.getBufferIdentifier() +
" has unhandled file type");		" has unhandled file type");
}		}
}		}

Error ArchiveFile::fetch(const object::Archive::Child &c, StringRef reason) {		// Globally keep track of children fetched from the archives by tracking
if (!seen.insert(c.getChildOffset()).second)		// the contents that have already been fetched (e.g. Frameworks fetched
return Error::success();		// from LC_LINKER_OPTION should not be re-fetched from archives even with
		// ObjC flag enabled irregardless whether it comes from a framework or a
		// static library).
		static llvm::DenseSet<StringRef> seen;

		Error ArchiveFile::fetch(const object::Archive::Child &c, StringRef reason, bool skipCache) {
Expected<MemoryBufferRef> mb = c.getMemoryBufferRef();		Expected<MemoryBufferRef> mb = c.getMemoryBufferRef();
if (!mb)		if (!mb)
return mb.takeError();		return mb.takeError();

		if (!skipCache && !seen.insert(mb->getBuffer()).second)
		thakisUnsubmitted Not Done Reply Inline Actions We're doing this based on buffer contents? Where in ld64's source does it do that? That seems like very surprising semantics (and kinda bad for perf). thakis: We're doing this based on buffer contents? Where in ld64's source does it do that? That seems…
		thevinsterAuthorUnsubmitted Done Reply Inline Actions I don't know if there will be any significant perf regressions. I'll double check that on my end first before saying anything else. From what I can tell, it looks like ld64 hashes based on the contents of the Entry class. I agree that these are surprising semantics (especially the behavior of force loading two of the same object files from an archive), but this strange behavior is also implicitly depended upon by some of our builds. Whether or not this is the correct implementation, I'm merely trying to follow ld64's behavior. As far as implementation is concerned, this is the path of least resistance that'll solve our current duplicate symbol issue. I'm open to suggestions on how else this can be done (I've tried approaches where I only load the symbol if it hasn't been loaded but that ran into many edge cases). thevinster: I don't know if there will be any significant perf regressions. I'll double check that on my…
		thevinsterAuthorUnsubmitted Done Reply Inline Actions Benchmarking chromium framework, base diff difference (95% CI) sys_time 2.050 ± 0.028 2.037 ± 0.027 [ -1.2% .. -0.0%] user_time 4.550 ± 0.040 4.589 ± 0.040 [ +0.5% .. +1.2%] wall_time 7.054 ± 0.067 7.084 ± 0.067 [ +0.0% .. +0.8%] samples 45 42 This is a very slight regression, and personally, I think it's worth the hit if it means us being even more compatible with ld64's behaviors. thevinster: Benchmarking chromium framework, ``` base diff difference (95%…
		int3Unsubmitted Not Done Reply Inline Actions Do you have a code pointer to where the `Entry` class is getting hashed in ld64? int3: Do you have a code pointer to where the `Entry` class is getting hashed in ld64?
		thevinsterAuthorUnsubmitted Done Reply Inline Actions If I'm understanding correctly, I believe this method is where the check happens whether an archive member has already been loaded. In particular, `_instantiatedEntries` (https://github.com/apple-oss-distributions/ld64/blob/main/src/ld/parsers/archive_file.cpp#L121) is a map of an Entry to the MemberState. thevinster: If I'm understanding correctly, I believe [[ https://github.com/apple-oss…
		int3Unsubmitted Not Done Reply Inline Actions `_instantiatedEntries` is a vanilla `std::map` w/o a custom comparator (https://github.com/apple-oss-distributions/ld64/blob/main/src/ld/parsers/archive_file.cpp#L121), so I don't think it's hashing the contents of `Entry`, just the pointer value itself int3: `_instantiatedEntries` is a vanilla `std::map` w/o a custom comparator (https://github.
		return Error::success();

// Thin archives refer to .o files, so --reproduce needs the .o files too.		// Thin archives refer to .o files, so --reproduce needs the .o files too.
if (tar && c.getParent()->isThin())		if (tar && c.getParent()->isThin())
tar->append(relativeToRoot(CHECK(c.getFullName(), this)), mb->getBuffer());		tar->append(relativeToRoot(CHECK(c.getFullName(), this)), mb->getBuffer());

Expected<TimePoint<std::chrono::seconds>> modTime = c.getLastModified();		Expected<TimePoint<std::chrono::seconds>> modTime = c.getLastModified();
if (!modTime)		if (!modTime)
return modTime.takeError();		return modTime.takeError();

▲ Show 20 Lines • Show All 127 Lines • Show Last 20 Lines

lld/test/MachO/lc-linker-option.ll

	Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	; RUN: llvm-ar rcs %t/Foo.framework/Foo %t/foo.o			; RUN: llvm-ar rcs %t/Foo.framework/Foo %t/foo.o
	; RUN: llc %t/load-framework-foo.ll -o %t/load-framework-foo.o -filetype=obj			; RUN: llc %t/load-framework-foo.ll -o %t/load-framework-foo.o -filetype=obj
	; RUN: llc %t/load-framework-undefined-symbol.ll -o %t/load-framework-undefined-symbol.o -filetype=obj			; RUN: llc %t/load-framework-undefined-symbol.ll -o %t/load-framework-undefined-symbol.o -filetype=obj
	; RUN: llc %t/load-missing.ll -o %t/load-missing.o -filetype=obj			; RUN: llc %t/load-missing.ll -o %t/load-missing.o -filetype=obj
	; RUN: llc %t/main.ll -o %t/main.o -filetype=obj			; RUN: llc %t/main.ll -o %t/main.o -filetype=obj
	; RUN: %lld %t/load-framework-foo.o %t/main.o -o %t/main -F%t			; RUN: %lld %t/load-framework-foo.o %t/main.o -o %t/main -F%t
	; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS			; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS

				;; Identical contents from different archive paths should not fail the build.
				; RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/refs-foo.s -o %t/refs-foo.o
				; RUN: llvm-ar rcs %t/libFoo.a %t/foo.o
				; RUN: %lld -ObjC %t/refs-foo.o %t/main.o -o %t/no-dup-syms -lFoo -F%t -L%t
				; RUN: llvm-objdump --section-headers --syms %t/no-dup-syms \| FileCheck %s --check-prefix=NO-DUP

				; NO-DUP: Sections:
				; NO-DUP-NEXT: Idx Name Size VMA Type
				; NO-DUP-NEXT: 0 __text {{.*}} TEXT
				; NO-DUP-NEXT: 1 __swift {{.*}} DATA
				; NO-DUP-NEXT: 2 __unwind_info {{.*}} DATA
				; NO-DUP-NEXT: 3 __eh_frame {{.*}} DATA
				; NO-DUP-NEXT: 4 __objc_classrefs {{.*}} DATA
				; NO-DUP-NEXT: 5 __objc_data {{.*}} DATA
				; NO-DUP-EMPTY:
				; NO-DUP-NEXT: SYMBOL TABLE:
				; NO-DUP-NEXT: g F __TEXT,__text _main
				; NO-DUP-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass
				; NO-DUP-NEXT: g O __TEXT,__swift _$s6swifty1aSivp
				; NO-DUP-NEXT: g F __TEXT,__text __mh_execute_header
				; NO-DUP-NEXT: UND dyld_stub_binder

	;; Make sure -all_load and -ObjC have no effect on libraries loaded via			;; Make sure -all_load and -ObjC have no effect on libraries loaded via
	;; LC_LINKER_OPTION flags.			;; LC_LINKER_OPTION flags.
	; RUN: llc %t/load-library-foo.ll -o %t/load-library-foo.o -filetype=obj			; RUN: llc %t/load-library-foo.ll -o %t/load-library-foo.o -filetype=obj
	; RUN: llvm-ar rcs %t/libfoo.a %t/foo.o			; RUN: llvm-ar rcs %t/libfoo.a %t/foo.o
	; RUN: %lld -all_load -ObjC %t/load-framework-foo.o %t/load-library-foo.o \			; RUN: %lld -all_load -ObjC %t/load-framework-foo.o %t/load-library-foo.o \
	; RUN: %t/main.o -o %t/main -F%t -L%t			; RUN: %t/main.o -o %t/main -F%t -L%t
	; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS			; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS

	;; Note that _OBJC_CLASS_$_TestClass is not included here.			;; Note that _OBJC_CLASS_$_TestClass is not included here.
	; SYMS: SYMBOL TABLE:			; SYMS: SYMBOL TABLE:
	; SYMS-NEXT: g F __TEXT,__text _main			; SYMS-NEXT: g F __TEXT,__text _main
	; SYMS-NEXT: g F __TEXT,__text __mh_execute_header			; SYMS-NEXT: g F __TEXT,__text __mh_execute_header
	; SYMS-NEXT: UND dyld_stub_binder			; SYMS-NEXT: UND dyld_stub_binder
	; SYMS-EMPTY:			; SYMS-EMPTY:

	;; Make sure -all_load has effect when libraries are loaded via LC_LINKER_OPTION flags and explicitly passed as well			;; Make sure -all_load has effect when libraries are loaded via LC_LINKER_OPTION flags and explicitly passed as well
	; RUN: %lld -all_load %t/load-framework-foo.o %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -lfoo			; RUN: %lld -all_load %t/load-framework-foo.o %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -lfoo
	; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_ALL_LOAD			; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_ALL_LOAD

	;; Note that _OBJC_CLASS_$_TestClass is included here.			;; Note that _OBJC_CLASS_$_TestClass is included here.
	; SYMS_ALL_LOAD: SYMBOL TABLE:			; SYMS_ALL_LOAD: SYMBOL TABLE:
	; SYMS_ALL_LOAD-NEXT: g F __TEXT,__text _main			; SYMS_ALL_LOAD-NEXT: g F __TEXT,__text _main
				; SYMS_ALL_LOAD-NEXT: g O __TEXT,__swift _$s6swifty1aSivp
	; SYMS_ALL_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass			; SYMS_ALL_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass
	; SYMS_ALL_LOAD-NEXT: g F __TEXT,__text __mh_execute_header			; SYMS_ALL_LOAD-NEXT: g F __TEXT,__text __mh_execute_header
	; SYMS_ALL_LOAD-NEXT: UND dyld_stub_binder			; SYMS_ALL_LOAD-NEXT: UND dyld_stub_binder
	; SYMS_ALL_LOAD-EMPTY:			; SYMS_ALL_LOAD-EMPTY:

	;; Make sure -force_load has effect when libraries are loaded via LC_LINKER_OPTION flags and explicitly passed as well			;; Make sure -force_load has effect when libraries are loaded via LC_LINKER_OPTION flags and explicitly passed as well
	; RUN: %lld %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -force_load %t/libfoo.a			; RUN: %lld %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -force_load %t/libfoo.a
	; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_FORCE_LOAD			; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_FORCE_LOAD

	;; Note that _OBJC_CLASS_$_TestClass is included here.			;; Note that _OBJC_CLASS_$_TestClass is included here.
	; SYMS_FORCE_LOAD: SYMBOL TABLE:			; SYMS_FORCE_LOAD: SYMBOL TABLE:
	; SYMS_FORCE_LOAD-NEXT: g F __TEXT,__text _main			; SYMS_FORCE_LOAD-NEXT: g F __TEXT,__text _main
				; SYMS_FORCE_LOAD-NEXT: g O __TEXT,__swift _$s6swifty1aSivp
	; SYMS_FORCE_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass			; SYMS_FORCE_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass
	; SYMS_FORCE_LOAD-NEXT: g F __TEXT,__text __mh_execute_header			; SYMS_FORCE_LOAD-NEXT: g F __TEXT,__text __mh_execute_header
	; SYMS_FORCE_LOAD-NEXT: UND dyld_stub_binder			; SYMS_FORCE_LOAD-NEXT: UND dyld_stub_binder
	; SYMS_FORCE_LOAD-EMPTY:			; SYMS_FORCE_LOAD-EMPTY:

	;; Make sure -ObjC has effect when frameworks are loaded via LC_LINKER_OPTION flags and explicitly passed as well			;; Make sure -ObjC has effect when frameworks are loaded via LC_LINKER_OPTION flags and explicitly passed as well
	; RUN: %lld -ObjC %t/load-framework-foo.o %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -framework Foo			; RUN: %lld -ObjC %t/load-framework-foo.o %t/load-library-foo.o %t/main.o -o %t/main -F%t -L%t -framework Foo
	; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_OBJC_LOAD			; RUN: llvm-objdump --macho --syms %t/main \| FileCheck %s --check-prefix=SYMS_OBJC_LOAD

	;; Note that _OBJC_CLASS_$_TestClass is included here.			;; Note that _OBJC_CLASS_$_TestClass is included here.
	; SYMS_OBJC_LOAD: SYMBOL TABLE:			; SYMS_OBJC_LOAD: SYMBOL TABLE:
	; SYMS_OBJC_LOAD-NEXT: g F __TEXT,__text _main			; SYMS_OBJC_LOAD-NEXT: g F __TEXT,__text _main
				; SYMS_OBJC_LOAD-NEXT: g O __TEXT,__swift _$s6swifty1aSivp
	; SYMS_OBJC_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass			; SYMS_OBJC_LOAD-NEXT: g O __DATA,__objc_data _OBJC_CLASS_$_TestClass
	; SYMS_OBJC_LOAD-NEXT: g F __TEXT,__text __mh_execute_header			; SYMS_OBJC_LOAD-NEXT: g F __TEXT,__text __mh_execute_header
	; SYMS_OBJC_LOAD-NEXT: UND dyld_stub_binder			; SYMS_OBJC_LOAD-NEXT: UND dyld_stub_binder
	; SYMS_OBJC_LOAD-EMPTY:			; SYMS_OBJC_LOAD-EMPTY:

	;; Make sure that frameworks containing object files or bitcode instead of			;; Make sure that frameworks containing object files or bitcode instead of
	;; dylibs or archives do not cause duplicate symbol errors			;; dylibs or archives do not cause duplicate symbol errors
	; RUN: mkdir -p %t/Foo.framework			; RUN: mkdir -p %t/Foo.framework
	▲ Show 20 Lines • Show All 127 Lines • ▼ Show 20 Lines
	!llvm.linker.options = !{!0}			!llvm.linker.options = !{!0}

	;--- foo.ll			;--- foo.ll
	target triple = "x86_64-apple-macosx10.15.0"			target triple = "x86_64-apple-macosx10.15.0"
	target datalayout = "e-m:o-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:o-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"

	%struct._class_t = type {}			%struct._class_t = type {}
	@"OBJC_CLASS_$_TestClass" = global %struct._class_t {}, section "__DATA, __objc_data", align 8			@"OBJC_CLASS_$_TestClass" = global %struct._class_t {}, section "__DATA, __objc_data", align 8

				;; Loading special sections with -ObjC should not report duplicate symbols
				%TSi = type <{ i64 }>
				@"$s6swifty1aSivp" = global %TSi zeroinitializer, section "__TEXT, __swift", align 8

				;--- refs-foo.s
				.section __DATA,__objc_classrefs
				.quad _OBJC_CLASS_$_TestClass

lld/test/MachO/objc.s

	Show All 9 Lines
	## Make sure we don't mis-parse a 32-bit file as 64-bit			## Make sure we don't mis-parse a 32-bit file as 64-bit
	# RUN: llvm-mc -filetype=obj -triple=armv7-apple-watchos %t/no-objc.s -o %t/wrong-arch.o			# RUN: llvm-mc -filetype=obj -triple=armv7-apple-watchos %t/no-objc.s -o %t/wrong-arch.o
	# RUN: llvm-ar rcs %t/libHasSomeObjC.a %t/no-objc.o %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o			# RUN: llvm-ar rcs %t/libHasSomeObjC.a %t/no-objc.o %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o
	# RUN: llvm-ar rcs %t/libHasSomeObjC2.a %t/no-objc.o %t/has-objc-symbol-and-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o			# RUN: llvm-ar rcs %t/libHasSomeObjC2.a %t/no-objc.o %t/has-objc-symbol-and-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o

	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o

	# RUN: %lld -lSystem %t/test.o -o %t/test -L%t -lHasSomeObjC -ObjC			# RUN: %lld -lSystem %t/test.o -o %t/test -L%t -lHasSomeObjC -ObjC
	# RUN: llvm-objdump --section-headers --syms %t/test \| FileCheck %s --check-prefix=OBJC			# RUN: llvm-objdump --section-headers --syms %t/test \| FileCheck %s --check-prefixes=OBJC

	# RUN: %lld -lSystem %t/test.o -o %t/test -L%t -lHasSomeObjC2 -ObjC			# RUN: %lld -lSystem %t/test.o -o %t/test2 -L%t -lHasSomeObjC2 -ObjC
	# RUN: llvm-objdump --section-headers --syms %t/test \| FileCheck %s --check-prefix=OBJC			# RUN: llvm-objdump --section-headers --syms %t/test2 \| FileCheck %s --check-prefixes=OBJC

	# RUN: %lld -lSystem %t/test.o -o %t/test --start-lib %t/no-objc.o %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o --end-lib -ObjC			# RUN: %lld -lSystem %t/test.o -o %t/test3 --start-lib %t/no-objc.o %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/has-swift-proto.o %t/wrong-arch.o --end-lib -ObjC
	# RUN: llvm-objdump --section-headers --syms %t/test \| FileCheck %s --check-prefix=OBJC			# RUN: llvm-objdump --section-headers --syms %t/test3 \| FileCheck %s --check-prefixes=OBJC

	# OBJC: Sections:			# OBJC: Sections:
	# OBJC-NEXT: Idx Name Size VMA Type			# OBJC-NEXT: Idx Name Size VMA Type
	# OBJC-NEXT: 0 __text {{.*}} TEXT			# OBJC-NEXT: 0 __text {{.*}} TEXT
	# OBJC-NEXT: 1 __swift {{.*}} DATA			# OBJC-NEXT: 1 __swift {{.*}} DATA
	# OBJC-NEXT: 2 __swift5_fieldmd{{.*}} DATA			# OBJC-NEXT: 2 __swift5_fieldmd{{.*}} DATA
	# OBJC-NEXT: 3 __objc_catlist {{.*}} DATA			# OBJC-NEXT: 3 __objc_catlist {{.*}} DATA
	# OBJC-NEXT: 4 has_objc_symbol {{.*}} DATA			# OBJC-NEXT: 4 has_objc_symbol {{.*}} DATA
	# OBJC-EMPTY:			# OBJC-EMPTY:
	# OBJC-NEXT: SYMBOL TABLE:			# OBJC-NEXT: SYMBOL TABLE:
				# OBJC-DAG: l O __DATA,has_objc_symbol _objc_label
	# OBJC-DAG: g F __TEXT,__text _main			# OBJC-DAG: g F __TEXT,__text _main
	# OBJC-DAG: g F __TEXT,__text _OBJC_CLASS_$_MyObject			# OBJC-DAG: g F __TEXT,__text _OBJC_CLASS_$_MyObject
	# OBJC-DAG: g O __TEXT,__swift5_fieldmd $s7somelib4Blah_pMF			# OBJC-DAG: g O __TEXT,__swift5_fieldmd $s7somelib4Blah_pMF

	# RUN: %lld -lSystem %t/test.o -o %t/test -L%t -lHasSomeObjC			# RUN: %lld -lSystem %t/test.o -o %t/test4 -L%t -lHasSomeObjC
	# RUN: llvm-objdump --section-headers --syms %t/test \| FileCheck %s --check-prefix=NO-OBJC			# RUN: llvm-objdump --section-headers --syms %t/test4 \| FileCheck %s --check-prefix=NO-OBJC

	# NO-OBJC: Sections:			# NO-OBJC: Sections:
	# NO-OBJC-NEXT: Idx Name Size VMA Type			# NO-OBJC-NEXT: Idx Name Size VMA Type
	# NO-OBJC-NEXT: 0 __text {{.*}} TEXT			# NO-OBJC-NEXT: 0 __text {{.*}} TEXT
	# NO-OBJC-EMPTY:			# NO-OBJC-EMPTY:
	# NO-OBJC-NEXT: SYMBOL TABLE:			# NO-OBJC-NEXT: SYMBOL TABLE:
	# NO-OBJC-NEXT: g F __TEXT,__text _main			# NO-OBJC-NEXT: g F __TEXT,__text _main
	# NO-OBJC-NEXT: g F __TEXT,__text __mh_execute_header			# NO-OBJC-NEXT: g F __TEXT,__text __mh_execute_header
	Show All 15 Lines
	# RUN: -lHasSomeObjC 2>&1 \| FileCheck %s --check-prefix=DUP-ERROR			# RUN: -lHasSomeObjC 2>&1 \| FileCheck %s --check-prefix=DUP-ERROR
	# DUP-ERROR: error: duplicate symbol: _has_dup			# DUP-ERROR: error: duplicate symbol: _has_dup

	## TODO: Load has-objc-symbol.o prior to symbol resolution to match the archive behavior.			## TODO: Load has-objc-symbol.o prior to symbol resolution to match the archive behavior.
	# RUN: not %lld -dylib %t/refs-dup.o %t/refs-objc.o -o %t/refs-dup --start-lib %t/no-objc.o \			# RUN: not %lld -dylib %t/refs-dup.o %t/refs-objc.o -o %t/refs-dup --start-lib %t/no-objc.o \
	# RUN: %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/wrong-arch.o --end-lib \			# RUN: %t/has-objc-symbol.o %t/has-objc-category.o %t/has-swift.o %t/wrong-arch.o --end-lib \
	# RUN: -ObjC 2>&1 \| FileCheck %s --check-prefix=DUP-ERROR			# RUN: -ObjC 2>&1 \| FileCheck %s --check-prefix=DUP-ERROR

				## When two identical object files containing ObjC symbols are within the same archive, ld64 "dedups"
				## instead of reporting duplicate syms when -ObjC is enabled.
				# RUN: llvm-ar rcs %t/libHasDuplicateObjC.a %t/has-objc-symbol.o %t/has-objc-symbol.o
				# RUN: %lld -lSystem %t/test.o -o %t/dup-objc-object-syms -L%t -lHasDuplicateObjC -ObjC
				# RUN: llvm-objdump --section-headers --syms %t/dup-objc-object-syms \| FileCheck %s --check-prefix=DUP-OBJECT-ARCHIVE

				# DUP-OBJECT-ARCHIVE: Sections:
				# DUP-OBJECT-ARCHIVE-NEXT: Idx Name Size VMA Type
				# DUP-OBJECT-ARCHIVE-NEXT: 0 __text {{.*}} TEXT
				# DUP-OBJECT-ARCHIVE-NEXT: 1 has_objc_symbol {{.*}} DATA
				# DUP-OBJECT-ARCHIVE-EMPTY:
				# DUP-OBJECT-ARCHIVE-NEXT: SYMBOL TABLE:
				# DUP-OBJECT-ARCHIVE-NEXT: l O __DATA,has_objc_symbol _objc_label
				# DUP-OBJECT-ARCHIVE-NEXT: g F __TEXT,__text _main
				# DUP-OBJECT-ARCHIVE-NEXT: g F __TEXT,__text _OBJC_CLASS_$_MyObject
				# DUP-OBJECT-ARCHIVE-NEXT: g O __DATA,has_objc_symbol _has_dup
				# DUP-OBJECT-ARCHIVE-NEXT: g F __TEXT,__text __mh_execute_header
				# DUP-OBJECT-ARCHIVE-NEXT: UND dyld_stub_binder

	#--- has-objc-symbol.s			#--- has-objc-symbol.s
	.globl _OBJC_CLASS_$_MyObject, _has_dup			.globl _OBJC_CLASS_$_MyObject, _has_dup
	_OBJC_CLASS_$_MyObject:			_OBJC_CLASS_$_MyObject:

	.section __DATA,has_objc_symbol			.section __DATA,has_objc_symbol
	_has_dup:			_has_dup:
				_objc_label:

	#--- has-objc-category.s			#--- has-objc-category.s
	.section __DATA,__objc_catlist			.section __DATA,__objc_catlist
	.quad 0x1234			.quad 0x1234

	#--- has-objc-symbol-and-category.s			#--- has-objc-symbol-and-category.s
	## Make sure we load this archive member exactly once (i.e. no duplicate symbol			## Make sure we load this archive member exactly once (i.e. no duplicate symbol
	## error).			## error).
	.globl _OBJC_CLASS_$_MyObject, _has_dup			.globl _OBJC_CLASS_$_MyObject, _has_dup
	_OBJC_CLASS_$_MyObject:			_OBJC_CLASS_$_MyObject:

	.section __DATA,has_objc_symbol			.section __DATA,has_objc_symbol
	_has_dup:			_has_dup:
				_objc_label:

	.section __DATA,__objc_catlist			.section __DATA,__objc_catlist
	.quad 0x1234			.quad 0x1234

	#--- has-swift.s			#--- has-swift.s
	.section __TEXT,__swift			.section __TEXT,__swift
	.quad 0x1234			.quad 0x1234

	Show All 26 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[lld-macho] Skip re-loading archive if already loadedAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 484990

lld/MachO/Driver.cpp

lld/MachO/InputFiles.h

lld/MachO/InputFiles.cpp

lld/test/MachO/lc-linker-option.ll

lld/test/MachO/objc.s

[lld-macho] Skip re-loading archive if already loaded
AbandonedPublic