This is an archive of the discontinued LLVM Phabricator instance.


Differential D131090

[WIP] First draft of -ffat-lto-object related changes: front-end additions, duplication and repurposing -fembed-bitcode related code.
Draft · Public

Authored by arda on Aug 3 2022, 12:23 PM.


This is a draft revision that has not yet been submitted for review.

Details

Reviewers

phosek
paulkirth
sfertile

Summary

LLVM users currently have to choose at build-time whether to generate
object code or bitcode. Only having these options can be limiting since
translation units can often be shared between many targets. These different
targets may have different requirements in terms of compile time or
performance. For instance, the benefits of LTO might be worth the overhead for
main targets, whereas this might not be the case for tests.

GCC allows GIMPLE bytecode to be saved alongside object code if the
-ffat-lto-objects option is passed. This “fat” object format allows one to
build one set of fat objects which could be used for targets with different
requirements since the decision to use LTO can be made at link time.

Our goal is to implement support for emitting bitcode alongside object code in
LLVM and use that support to implement the -ffat-lto-objects option in Clang.
Doing so would bring Clang to parity with GCC. We will initially focus on ELF,
but this support could be extended to other formats later.
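
To make the idea concrete, here is a minimal sketch of a fat object as a container that carries both forms, with the choice between them deferred to link time. `FatObject`, `LinkMode`, and `selectLinkInput` are hypothetical names used only for illustration; they are not part of this patch or of LLVM.

```cpp
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical model of a "fat" object: one file carrying both forms.
struct FatObject {
  std::vector<std::uint8_t> MachineCode; // ordinary object code
  std::vector<std::uint8_t> Bitcode;     // IR kept alongside it for LTO
};

enum class LinkMode { Regular, LTO };

// The producer emits both payloads once; each consumer picks one at link time.
const std::vector<std::uint8_t> &selectLinkInput(const FatObject &Obj,
                                                 LinkMode Mode) {
  if (Mode == LinkMode::LTO) {
    if (Obj.Bitcode.empty())
      throw std::runtime_error("object has no embedded bitcode for LTO");
    return Obj.Bitcode;
  }
  return Obj.MachineCode;
}
```

In this model a test target would call `selectLinkInput(Obj, LinkMode::Regular)` and a release target `selectLinkInput(Obj, LinkMode::LTO)` on the same set of objects.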

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time         Test
3,680 ms     x64 debian > Clang.Driver::clang_f_opts.c
60,040 ms    x64 debian > MLIR.Examples/standalone::test.toy

Event Timeline

arda created this revision. · Aug 3 2022, 12:23 PM

Herald added a project: Restricted Project. · Aug 3 2022, 12:23 PM

Herald added subscribers: ormris, hiraditya, inglorion.

arda added a reviewer: paulkirth. · Aug 3 2022, 12:25 PM

Harbormaster completed remote builds in B179084: Diff 449737. · Aug 3 2022, 1:21 PM

Remove the unrelated format changes.

Harbormaster completed remote builds in B179136: Diff 449794. · Aug 3 2022, 5:03 PM

Can we get the proposed LLD changes as well? Separate patches are probably best.

clang/lib/CodeGen/BackendUtil.cpp
196	I'd drop this, since it's unrelated. If you still want to fix it, you can submit a separate change.
clang/lib/Driver/ToolChains/Clang.cpp
7120	I think these can go away, right?
llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
4959	Can you make the error a bit more specific? Like let them know this is a "Fat LTO" problem? I'm not sure how well the issue will be surfaced as is. There's been some discussion re: the use of unreachable. Not all of it applies here, especially in a prototype, but it's a point to consider as work develops. See https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587 for more context. I know this is a copy-paste from existing code, but a fatal error might be the better choice. What do you think?
5080	Does this just do what embed bitcode does? My understanding was that LTO generated IR is emitted into object files differently than the embed bitcode scenario. I'm basing that on the discussion at https://discourse.llvm.org/t/rfc-ffat-lto-objects-support/63977

I'd also like to see tests for this.

arda marked 2 inline comments as done. · Aug 5 2022, 1:58 PM

arda added inline comments.

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
4959	From that discussion, it seems like unreachable should not be used in this case. I agree that fatal error might be a better choice here. Thanks!
5080	From my research, -fembed-bitcode was originally used for recompilation purposes, not for LTO. Since our scope is limited to ThinLTO, this prototype currently is used to embed ThinLTO bitcode into the FatLTO object as follows: clang -c -ffat-lto-objects -o fatLTO.o -x ir thinLTO.bc

Addressed Paul's first comments.

Harbormaster completed remote builds in B179601: Diff 450402. · Aug 5 2022, 4:16 PM

paulkirth added inline comments. · Aug 10 2022, 9:19 AM

clang/include/clang/Driver/ToolChain.h
572	This file mostly uses `toolchain`, so let's try to be consistent. We don't need to worry about other uses in this patch.
llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
5080	Looking at this a second time, I'm not sure I follow your meaning here. What I meant was that the way EmbedBitcode worked was not `TheRightThing` according to that discussion. It appears as though your change just copies its logic. So what happens if I want to compile a C file and get a fat-LTO object? When do the LTO passes get added to the pipeline? I don't mean the big whole-program optimization stuff that gets invoked at link time; I mean the module-level passes and summary generation. It's not clear to me when those will run after invoking clang on a source file. Can you elaborate on how this is intended to work? I seem to be missing something.

phosek added inline comments. · Aug 15 2022, 6:46 PM

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
4965	Since this section is LLVM specific, I think it should use an `.llvm` prefix, so the name could be `.llvm.lto`. This is similar to GCC which uses `.gcc.lto_` prefixed section names.
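
The two review threads on `getSectionNameForBitcodeForFatLTO` (how to report unsupported formats, and what to call the section) can be pictured with a small standalone sketch. This is not the patch's code: `ObjectFormat`, `fatLTOSectionName`, and `fatLTODiagnostic` are hypothetical stand-ins that avoid LLVM headers, and returning a null pointer so the caller can produce a FatLTO-specific message is only one alternative to aborting, not something the reviewers settled on. The `.fatlto` name is the one used in this diff; `.llvm.lto` is the name suggested in the comment above.

```cpp
#include <string>

// Simplified stand-in for llvm::Triple::ObjectFormatType (assumption).
enum class ObjectFormat { ELF, MachO, COFF, Wasm, Unknown };

// Returns the section name for embedded FatLTO bitcode, or nullptr when the
// object format is not supported, leaving the error message to the caller.
const char *fatLTOSectionName(ObjectFormat F) {
  switch (F) {
  case ObjectFormat::ELF:
  case ObjectFormat::Unknown:
    return ".fatlto"; // name in this diff; the review suggests ".llvm.lto"
  case ObjectFormat::MachO:
  case ObjectFormat::COFF:
  case ObjectFormat::Wasm:
    return nullptr;
  }
  return nullptr; // not reached for valid enumerators
}

// Builds a FatLTO-specific message; empty when the format is supported.
std::string fatLTODiagnostic(ObjectFormat F, const std::string &FormatName) {
  if (fatLTOSectionName(F))
    return "";
  return "-ffat-lto-objects is not yet supported for " + FormatName;
}
```

A caller would check `fatLTODiagnostic(Format, Name)` before embedding and surface a non-empty result through whatever diagnostics mechanism it has.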

arda mentioned this in D131618: [clang][llvm][lld] FatLTO Prototype. · Aug 17 2022, 10:04 AM

tstellar added a subscriber: tstellar. · Aug 17 2022, 12:17 PM

tstellar removed a subscriber: tstellar.

sfertile added a reviewer: sfertile. · Aug 19 2022, 9:45 AM

hubert.reinterpretcast added a subscriber: hubert.reinterpretcast. · Aug 19 2022, 12:03 PM

The changes in this patch have been moved here: https://reviews.llvm.org/D131618

Revision Contents

Path                                            Size
clang/include/clang/Basic/CodeGenOptions.def    1 line
clang/include/clang/CodeGen/BackendUtil.h       3 lines
clang/include/clang/Driver/Options.td           6 lines
clang/include/clang/Driver/ToolChain.h          3 lines
clang/lib/CodeGen/BackendUtil.cpp               11 lines
clang/lib/CodeGen/CodeGenAction.cpp             3 lines
clang/lib/Driver/Driver.cpp                     5 lines
clang/lib/Driver/ToolChains/Clang.cpp           3 lines
llvm/include/llvm/Bitcode/BitcodeWriter.h       1 line
llvm/lib/Bitcode/Writer/BitcodeWriter.cpp       96 lines

Diff 450402

clang/include/clang/Basic/CodeGenOptions.def

@@ ... @@
 CODEGENOPT(LessPreciseFPMAD , 1, 0) ///< Enable less precise MAD instructions to
                                     ///< be generated.
 CODEGENOPT(PrepareForLTO , 1, 0) ///< Set when -flto is enabled on the
                                  ///< compile step.
 CODEGENOPT(PrepareForThinLTO , 1, 0) ///< Set when -flto=thin is enabled on the
                                      ///< compile step.
 CODEGENOPT(LTOUnit, 1, 0) ///< Emit IR to support LTO unit features (CFI, whole
                           ///< program vtable opt).
+CODEGENOPT(FatLTO, 1, 0) ///< Set when -ffat-lto-objects is enabled.
 CODEGENOPT(EnableSplitLTOUnit, 1, 0) ///< Enable LTO unit splitting to support
                                      /// CFI and traditional whole program
                                      /// devirtualization that require whole
                                      /// program IR support.
 CODEGENOPT(IncrementalLinkerCompatible, 1, 0) ///< Emit an object file which can
                                               ///< be used with an incremental
                                               ///< linker.
 CODEGENOPT(MergeAllConstants , 1, 1) ///< Merge identical constants.
@@ ... @@

clang/include/clang/CodeGen/BackendUtil.h

@@ ... @@ void EmitBackendOutput(DiagnosticsEngine &Diags, const HeaderSearchOptions &,
                          const CodeGenOptions &CGOpts,
                          const TargetOptions &TOpts, const LangOptions &LOpts,
                          StringRef TDesc, llvm::Module *M, BackendAction Action,
                          std::unique_ptr<raw_pwrite_stream> OS);

   void EmbedBitcode(llvm::Module *M, const CodeGenOptions &CGOpts,
                     llvm::MemoryBufferRef Buf);

+  void EmbedFatLTO(llvm::Module *M, const CodeGenOptions &CGOpts,
+                   llvm::MemoryBufferRef Buf);

   void EmbedObject(llvm::Module *M, const CodeGenOptions &CGOpts,
                    DiagnosticsEngine &Diags);
 }

 #endif

clang/include/clang/Driver/Options.td

@@ ... @@
 def flto_EQ : Joined<["-"], "flto=">, Flags<[CoreOption, CC1Option]>, Group<f_Group>,
   HelpText<"Set LTO mode">, Values<"thin,full">;
 def flto_EQ_jobserver : Flag<["-"], "flto=jobserver">, Group<f_Group>,
   Alias<flto_EQ>, AliasArgs<["full"]>, HelpText<"Enable LTO in 'full' mode">;
 def flto_EQ_auto : Flag<["-"], "flto=auto">, Group<f_Group>,
   Alias<flto_EQ>, AliasArgs<["full"]>, HelpText<"Enable LTO in 'full' mode">;
 def flto : Flag<["-"], "flto">, Flags<[CoreOption, CC1Option]>, Group<f_Group>,
   Alias<flto_EQ>, AliasArgs<["full"]>, HelpText<"Enable LTO in 'full' mode">;
+defm fat_lto_objects : BoolFOption<"fat-lto-objects",
+  CodeGenOpts<"FatLTO">, DefaultFalse,
+  PosFlag<SetTrue, [], "Enable fat LTO object support">,
+  NegFlag<SetFalse, [], "Disable fat LTO object support">,
+  BothFlags<[CC1Option], "">>;
 def fno_lto : Flag<["-"], "fno-lto">, Flags<[CoreOption, CC1Option]>, Group<f_Group>,
   HelpText<"Disable LTO mode (default)">;
 def foffload_lto_EQ : Joined<["-"], "foffload-lto=">, Flags<[CoreOption]>, Group<f_Group>,
   HelpText<"Set LTO mode for offload compilation">, Values<"thin,full">;
 def foffload_lto : Flag<["-"], "foffload-lto">, Flags<[CoreOption]>, Group<f_Group>,
   Alias<foffload_lto_EQ>, AliasArgs<["full"]>, HelpText<"Enable LTO in 'full' mode for offload compilation">;
 def fno_offload_lto : Flag<["-"], "fno-offload-lto">, Flags<[CoreOption]>, Group<f_Group>,
   HelpText<"Disable LTO mode (default) for offload compilation">;
@@ ... @@
 // ignore it for now to avoid breaking builds that use it.
 def fdiagnostics_show_location_EQ : Joined<["-"], "fdiagnostics-show-location=">, Group<clang_ignored_f_Group>;

 defm fcheck_new : BooleanFFlag<"check-new">, Group<clang_ignored_f_Group>;
 defm caller_saves : BooleanFFlag<"caller-saves">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm reorder_blocks : BooleanFFlag<"reorder-blocks">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm branch_count_reg : BooleanFFlag<"branch-count-reg">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm default_inline : BooleanFFlag<"default-inline">, Group<clang_ignored_gcc_optimization_f_Group>;
-defm fat_lto_objects : BooleanFFlag<"fat-lto-objects">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm float_store : BooleanFFlag<"float-store">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm friend_injection : BooleanFFlag<"friend-injection">, Group<clang_ignored_f_Group>;
 defm function_attribute_list : BooleanFFlag<"function-attribute-list">, Group<clang_ignored_f_Group>;
 defm gcse : BooleanFFlag<"gcse">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm gcse_after_reload: BooleanFFlag<"gcse-after-reload">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm gcse_las: BooleanFFlag<"gcse-las">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm gcse_sm: BooleanFFlag<"gcse-sm">, Group<clang_ignored_gcc_optimization_f_Group>;
 defm gnu : BooleanFFlag<"gnu">, Group<clang_ignored_f_Group>;
@@ ... @@

clang/include/clang/Driver/ToolChain.h

@@ ... @@ public:
   /// GetExceptionModel - Return the tool chain exception model.
   virtual llvm::ExceptionHandling
   GetExceptionModel(const llvm::opt::ArgList &Args) const;
   /// SupportsEmbeddedBitcode - Does this tool chain support embedded bitcode.
   virtual bool SupportsEmbeddedBitcode() const { return false; }
+  /// SupportsFatLTO - Does this tool chain support fat LTO objects.
[paulkirth, Unsubmitted, Not Done] Suggested edit:
     virtual bool SupportsEmbeddedBitcode() const { return false; }
   - /// SupportsFatLTO - Does this tool chain support fat LTO objects.
   + /// SupportsFatLTO - Does this toolchain support fat LTO objects.
     virtual bool SupportsFatLTO() const { return true; }
This file mostly uses `toolchain`, so let's try to be consistent. We don't need to worry about other uses in this patch.
+  virtual bool SupportsFatLTO() const { return true; }
   /// getThreadModel() - Which thread model does this target use?
   virtual std::string getThreadModel() const { return "posix"; }
   /// isThreadModelSupported() - Does this target support a thread model?
   virtual bool isThreadModelSupported(const StringRef Model) const;
   virtual std::string getMultiarchTriple(const Driver &D,
                                          const llvm::Triple &TargetTriple,
@@ ... @@

clang/lib/CodeGen/BackendUtil.cpp

@@ ... @@ public:
   }

   std::unique_ptr<TargetMachine> TM;

   // Emit output using the new pass manager for the optimization pipeline.
   void EmitAssembly(BackendAction Action,
                     std::unique_ptr<raw_pwrite_stream> OS);
 };
 }
[paulkirth, Unsubmitted, Done] I'd drop this, since it's unrelated. If you still want to fix it, you can submit a separate change.

 static SanitizerCoverageOptions
 getSancovOptsFromCGOpts(const CodeGenOptions &CGOpts) {
   SanitizerCoverageOptions Opts;
   Opts.CoverageType =
       static_cast<SanitizerCoverageOptions::Type>(CGOpts.SanitizeCoverageType);
   Opts.IndirectCalls = CGOpts.SanitizeCoverageIndirectCalls;
   Opts.TraceBB = CGOpts.SanitizeCoverageTraceBB;
@@ ... @@ if (AsmHelper.TM) {
     }
   }
 }

 // With -fembed-bitcode, save a copy of the llvm IR as data in the
 // __LLVM,__bitcode section.
 void clang::EmbedBitcode(llvm::Module *M, const CodeGenOptions &CGOpts,
                          llvm::MemoryBufferRef Buf) {
+  if (CGOpts.FatLTO)
+    return;
   if (CGOpts.getEmbedBitcode() == CodeGenOptions::Embed_Off)
     return;
   llvm::embedBitcodeInModule(
       *M, Buf, CGOpts.getEmbedBitcode() != CodeGenOptions::Embed_Marker,
       CGOpts.getEmbedBitcode() != CodeGenOptions::Embed_Bitcode,
       CGOpts.CmdArgs);
 }

+// With -ffat-lto-objects, save a copy of the llvm IR as data in the
+// .fatlto section.
+void clang::EmbedFatLTO(llvm::Module *M, const CodeGenOptions &CGOpts,
+                        llvm::MemoryBufferRef Buf) {
+  if (!CGOpts.FatLTO)
+    return;
+  llvm::embedBitcodeInFatObject(*M, Buf);
+}

 void clang::EmbedObject(llvm::Module *M, const CodeGenOptions &CGOpts,
                         DiagnosticsEngine &Diags) {
   if (CGOpts.OffloadObjects.empty())
     return;

   for (StringRef OffloadObject : CGOpts.OffloadObjects) {
     llvm::ErrorOr<std::unique_ptr<llvm::MemoryBuffer>> ObjectOrErr =
         llvm::MemoryBuffer::getFileOrSTDIN(OffloadObject);
@@ ... @@
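
CodeGenAction.cpp calls both `EmbedBitcode` and `EmbedFatLTO` unconditionally, so the early returns in this file are what make the two paths mutually exclusive. The standalone sketch below restates that dispatch as a single decision. `EmbedOptions`, `EmbedKind`, and `chooseEmbedding` are hypothetical names for illustration, and the `EmbedBitcode` boolean is a simplification of the patch's `getEmbedBitcode() != Embed_Off` check.

```cpp
// Hypothetical, simplified view of the two CodeGenOptions fields involved.
struct EmbedOptions {
  bool FatLTO = false;       // models CGOpts.FatLTO
  bool EmbedBitcode = false; // models getEmbedBitcode() != Embed_Off
};

enum class EmbedKind { None, Bitcode, FatLTO };

// Mirrors the guards in the diff: FatLTO takes priority, because
// EmbedBitcode returns early whenever FatLTO is set.
EmbedKind chooseEmbedding(const EmbedOptions &O) {
  if (O.FatLTO)
    return EmbedKind::FatLTO;
  if (O.EmbedBitcode)
    return EmbedKind::Bitcode;
  return EmbedKind::None;
}
```

Note that with both flags set, only the FatLTO path runs in this model, which matches the order of the guards shown in the diff.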

clang/lib/CodeGen/CodeGenAction.cpp

@@ ... @@ void HandleTranslationUnit(ASTContext &C) override {
       // things in ASTContext used after this point and null out the
       // ASTContext, but too many various parts of the ASTContext are still
       // used in various parts.
       C.cleanup();
       C.getAllocator().Reset();
     }

     EmbedBitcode(getModule(), CodeGenOpts, llvm::MemoryBufferRef());
+    EmbedFatLTO(getModule(), CodeGenOpts, llvm::MemoryBufferRef());

     EmitBackendOutput(Diags, HeaderSearchOpts, CodeGenOpts, TargetOpts,
                       LangOpts, C.getTargetInfo().getDataLayoutString(),
                       getModule(), Action, std::move(AsmOutStream));

     Ctx.setDiagnosticHandler(std::move(OldDiagnosticHandler));

     if (OptRecordFile)
@@ ... @@ void CodeGenAction::ExecuteAction() {
     if (TheModule->getTargetTriple() != TargetOpts.Triple) {
       Diagnostics.Report(SourceLocation(), diag::warn_fe_override_module)
           << TargetOpts.Triple;
       TheModule->setTargetTriple(TargetOpts.Triple);
     }

     EmbedObject(TheModule.get(), CodeGenOpts, Diagnostics);
     EmbedBitcode(TheModule.get(), CodeGenOpts, *MainFile);
+    EmbedFatLTO(TheModule.get(), CodeGenOpts, *MainFile);

     LLVMContext &Ctx = TheModule->getContext();

     // Restore any diagnostic handler previously set before returning from this
     // function.
     struct RAII {
       LLVMContext &Ctx;
       std::unique_ptr<DiagnosticHandler> PrevHandler = Ctx.getDiagnosticHandler();
@@ ... @@

clang/lib/Driver/Driver.cpp

@@ ... @@ unsigned Model = llvm::StringSwitch<unsigned>(Name)
                          .Default(~0U);
     if (Model == ~0U) {
       Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args)
                                                 << Name;
     } else
       BitcodeEmbed = static_cast<BitcodeEmbedMode>(Model);
   }

+  // Process -ffat-lto-objects flag.
+  if (Arg *A = Args.getLastArg(options::OPT_ffat_lto_objects)) {
+    BitcodeEmbed = EmbedBitcode;
+  }

   // Remove existing compilation database so that each job can append to it.
   if (Arg *A = Args.getLastArg(options::OPT_MJ))
     llvm::sys::fs::remove(A->getValue());

   // Setting up the jobs for some precompile cases depends on whether we are
   // treating them as PCH, implicit modules or C++20 ones.
   // TODO: inferring the mode like this seems fragile (it meets the objective
   // of not requiring anything new for operation, however).
@@ ... @@

clang/lib/Driver/ToolChains/Clang.cpp

@@ ... @@ if (A->getOption().matches(options::OPT_fglobal_isel)) {
       else
         D.Diag(diag::warn_drv_global_isel_incomplete_opt);
     }
   } else {
     CmdArgs.push_back("-global-isel=0");
   }
 }

+if (Args.hasArg(options::OPT_ffat_lto_objects))
[paulkirth, Unsubmitted, Done] I think these can go away, right?
+  CmdArgs.push_back("-ffat-lto-objects");

 if (Args.hasArg(options::OPT_forder_file_instrumentation)) {
   CmdArgs.push_back("-forder-file-instrumentation");
   // Enable order file instrumentation when ThinLTO is not on. When ThinLTO is
   // on, we need to pass these flags as linker flags and that will be handled
   // outside of the compiler.
   if (!IsUsingLTO) {
     CmdArgs.push_back("-mllvm");
     CmdArgs.push_back("-enable-order-file-instrumentation");
@@ ... @@

llvm/include/llvm/Bitcode/BitcodeWriter.h

@@ ... @@ class raw_ostream;
   /// is not bitcode (i.e. if it's invalid data or even textual LLVM assembly).
   /// If EmbedCmdline is set, the command line is also exported in
   /// the corresponding section (__LLVM,_cmdline / .llvmcmd) - even if CmdArgs
   /// were empty.
   void embedBitcodeInModule(Module &M, MemoryBufferRef Buf, bool EmbedBitcode,
                             bool EmbedCmdline,
                             const std::vector<uint8_t> &CmdArgs);

+  void embedBitcodeInFatObject(Module &M, MemoryBufferRef Buf);
 } // end namespace llvm

 #endif // LLVM_BITCODE_BITCODEWRITER_H
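
The doc comment in this header distinguishes bitcode from invalid data or textual LLVM assembly, and the new function in BitcodeWriter.cpp tests its buffer with `isBitcode` before deciding how to proceed. As a rough illustration of what such a test involves, the sketch below checks for the two magic numbers that LLVM bitcode files start with: the raw magic `'B' 'C' 0xC0 0xDE` and the wrapper magic `0x0B17C0DE` stored little-endian. The `looksLike...` function names are hypothetical; this is a standalone approximation, not LLVM's implementation.

```cpp
#include <cstddef>
#include <cstdint>

// True if the buffer starts with the raw bitcode magic 'B' 'C' 0xC0 0xDE.
bool looksLikeRawBitcode(const std::uint8_t *Data, std::size_t Size) {
  return Size >= 4 && Data[0] == 'B' && Data[1] == 'C' && Data[2] == 0xC0 &&
         Data[3] == 0xDE;
}

// True if the buffer starts with the wrapper magic 0x0B17C0DE, which is
// laid out in memory as DE C0 17 0B.
bool looksLikeBitcodeWrapper(const std::uint8_t *Data, std::size_t Size) {
  return Size >= 4 && Data[0] == 0xDE && Data[1] == 0xC0 && Data[2] == 0x17 &&
         Data[3] == 0x0B;
}

// Either form counts as bitcode; textual IR or arbitrary data does not.
bool looksLikeBitcode(const std::uint8_t *Data, std::size_t Size) {
  return looksLikeRawBitcode(Data, Size) || looksLikeBitcodeWrapper(Data, Size);
}
```

A buffer holding textual IR (for example one beginning with `; ModuleID`) fails both checks, which is the case the header comment calls out.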

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

@@ ... @@ case Triple::XCOFF:
     break;
   case Triple::DXContainer:
     llvm_unreachable("DXC is not yet implemented");
     break;
   }
   llvm_unreachable("Unimplemented ObjectFormatType");
 }

+static const char *getSectionNameForBitcodeForFatLTO(const Triple &T) {
+  switch (T.getObjectFormat()) {
+  case Triple::MachO:
+    llvm::report_fatal_error("MachO is not yet implemented for FatLTO");
[paulkirth, Unsubmitted, Not Done] Can you make the error a bit more specific? Like let them know this is a "Fat LTO" problem? I'm not sure how well the issue will be surfaced as is. There's been some discussion re: the use of unreachable. Not all of it applies here, especially in a prototype, but it's a point to consider as work develops. See https://discourse.llvm.org/t/llvm-unreachable-is-widely-misused/60587 for more context. I know this is a copy-paste from existing code, but a fatal error might be the better choice. What do you think?
[arda, Author, Unsubmitted, Done] From that discussion, it seems like unreachable should not be used in this case. I agree that fatal error might be a better choice here. Thanks!
+    break;
+  case Triple::COFF:
+    llvm::report_fatal_error("COFF is not yet implemented for FatLTO");
+    break;
+  case Triple::ELF:
+    return ".fatlto";
[phosek, Unsubmitted, Not Done] Since this section is LLVM specific, I think it should use an `.llvm` prefix, so the name could be `.llvm.lto`. This is similar to GCC which uses `.gcc.lto_` prefixed section names.
+  case Triple::Wasm:
+    llvm::report_fatal_error("Wasm is not yet implemented for FatLTO");
+    break;
+  case Triple::UnknownObjectFormat:
+    return ".fatlto";
+  case Triple::GOFF:
+    llvm::report_fatal_error("GOFF is not yet implemented for FatLTO");
+    break;
+  case Triple::SPIRV:
+    llvm::report_fatal_error("SPIRV is not yet implemented for FatLTO");
+    break;
+  case Triple::XCOFF:
+    llvm::report_fatal_error("XCOFF is not yet implemented for FatLTO");
+    break;
+  case Triple::DXContainer:
+    llvm::report_fatal_error("DXContainer is not yet implemented for FatLTO");
+    break;
+  }
+  llvm_unreachable("Unimplemented ObjectFormatType");
+}

 void llvm::embedBitcodeInModule(llvm::Module &M, llvm::MemoryBufferRef Buf,
                                 bool EmbedBitcode, bool EmbedCmdline,
                                 const std::vector<uint8_t> &CmdArgs) {
   // Save llvm.compiler.used and remove it.
   SmallVector<Constant *, 2> UsedArray;
   SmallVector<GlobalValue *, 4> UsedGlobals;
   Type *UsedElementType = Type::getInt8Ty(M.getContext())->getPointerTo(0);
   GlobalVariable *Used = collectUsedGlobalVariables(M, UsedGlobals, true);
@@ ... @@ void llvm::embedBitcodeInModule(llvm::Module &M, llvm::MemoryBufferRef Buf,

   // Recreate llvm.compiler.used.
   ArrayType *ATy = ArrayType::get(UsedElementType, UsedArray.size());
   auto *NewUsed = new GlobalVariable(
       M, ATy, false, llvm::GlobalValue::AppendingLinkage,
       llvm::ConstantArray::get(ATy, UsedArray), "llvm.compiler.used");
   NewUsed->setSection("llvm.metadata");
 }

void llvm::embedBitcodeInFatObject(llvm::Module &M, llvm::MemoryBufferRef Buf) {

paulkirth (inline comment, Unsubmitted, Not Done):
Does this just do what embed bitcode does? My understanding was that LTO generated IR is emitted into object files differently than the embed bitcode scenario. I'm basing that on the discussion at https://discourse.llvm.org/t/rfc-ffat-lto-objects-support/63977

arda (author, inline comment, Unsubmitted, Done):
From my research, -fembed-bitcode was originally used for recompilation purposes, not for LTO. Since our scope is limited to ThinLTO, this prototype currently is used to embed ThinLTO bitcode into the FatLTO object as follows:
    clang -c -ffat-lto-objects -o fatLTO.o -x ir thinLTO.bc

paulkirth (inline comment, Unsubmitted, Not Done):
Looking at this a second time, I'm not sure I follow your meaning here. What I meant was that the way EmbedBitcode worked was not `TheRightThing` according to that discussion. It appears as though your change just copies its logic. So what if I want to compile a C file and get a fat-LTO object? When do the LTO passes get added to the pipeline? I don't mean the big whole program optimization stuff that gets invoked at link-time, I mean the module level passes and summary generation. It's not clear to me when those will run after invoking clang on a source file. Can you elaborate on how this is intended to work? I seem to be missing something.

  // Save llvm.compiler.used and remove it.
  SmallVector<Constant *, 2> UsedArray;
  SmallVector<GlobalValue *, 4> UsedGlobals;
  Type *UsedElementType = Type::getInt8Ty(M.getContext())->getPointerTo(0);
  GlobalVariable *Used = collectUsedGlobalVariables(M, UsedGlobals, true);
  for (auto *GV : UsedGlobals) {
    if (GV->getName() != "llvm.embedded.module" &&
        GV->getName() != "llvm.cmdline")
      UsedArray.push_back(
          ConstantExpr::getPointerBitCastOrAddrSpaceCast(GV, UsedElementType));
  }
  if (Used)
    Used->eraseFromParent();

  // Embed the bitcode for the llvm module.
  std::string Data;
  ArrayRef<uint8_t> ModuleData;
  Triple T(M.getTargetTriple());

  if (Buf.getBufferSize() == 0 ||
      !isBitcode((const unsigned char *)Buf.getBufferStart(),
                 (const unsigned char *)Buf.getBufferEnd())) {
    // If the input is LLVM Assembly, bitcode is produced by serializing
    // the module. Use-lists order need to be preserved in this case.
    llvm::raw_string_ostream OS(Data);
    llvm::WriteBitcodeToFile(M, OS, /* ShouldPreserveUseListOrder */ true);
    ModuleData =
        ArrayRef<uint8_t>((const uint8_t *)OS.str().data(), OS.str().size());
  } else
    // If the input is LLVM bitcode, write the input byte stream directly.
    ModuleData = ArrayRef<uint8_t>((const uint8_t *)Buf.getBufferStart(),
                                   Buf.getBufferSize());
  llvm::Constant *ModuleConstant =
      llvm::ConstantDataArray::get(M.getContext(), ModuleData);
  llvm::GlobalVariable *GV = new llvm::GlobalVariable(
      M, ModuleConstant->getType(), true, llvm::GlobalValue::PrivateLinkage,
      ModuleConstant);
  GV->setSection(getSectionNameForBitcodeForFatLTO(T));
  // Set alignment to 1 to prevent padding between two contributions from input
  // sections after linking.
  GV->setAlignment(Align(1));
  UsedArray.push_back(
      ConstantExpr::getPointerBitCastOrAddrSpaceCast(GV, UsedElementType));
  if (llvm::GlobalVariable *Old =
          M.getGlobalVariable("llvm.embedded.module", true)) {
    assert(Old->hasZeroLiveUses() &&
           "llvm.embedded.module can only be used once in llvm.compiler.used");
    GV->takeName(Old);
    Old->eraseFromParent();
  } else {
    GV->setName("llvm.embedded.module");
  }

  if (UsedArray.empty())
    return;

  // Recreate llvm.compiler.used.
  ArrayType *ATy = ArrayType::get(UsedElementType, UsedArray.size());
  auto *NewUsed = new GlobalVariable(
      M, ATy, false, llvm::GlobalValue::AppendingLinkage,
      llvm::ConstantArray::get(ATy, UsedArray), "llvm.compiler.used");
  NewUsed->setSection("llvm.metadata");
}
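
The function above chooses between two sources for the embedded payload: if the input buffer is empty or is not bitcode, the module is re-serialized; otherwise the input bytes are copied verbatim. A minimal standalone sketch of that decision follows. It assumes the check comes down to the two bitcode magic numbers (raw bitcode 'B' 'C' 0xC0 0xDE and the wrapper header 0x0B17C0DE stored little-endian). `looksLikeRawBitcode`, `looksLikeBitcodeWrapper`, `PayloadSource`, and `choosePayload` are hypothetical names for illustration, not LLVM API, and the explicit four-byte length check is an addition of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a raw-bitcode check: 'B' 'C' 0xC0 0xDE.
inline bool looksLikeRawBitcode(const uint8_t *Begin, std::size_t Size) {
  return Size >= 4 && Begin[0] == 'B' && Begin[1] == 'C' &&
         Begin[2] == 0xC0 && Begin[3] == 0xDE;
}

// Hypothetical stand-in for a wrapper-header check: 0x0B17C0DE, little-endian.
inline bool looksLikeBitcodeWrapper(const uint8_t *Begin, std::size_t Size) {
  return Size >= 4 && Begin[0] == 0xDE && Begin[1] == 0xC0 &&
         Begin[2] == 0x17 && Begin[3] == 0x0B;
}

enum class PayloadSource { ReserializeModule, CopyInputBytes };

// Mirrors the branch structure in the diff: an empty or non-bitcode input
// (for example textual IR) means the module must be serialized again;
// a bitcode input can be embedded byte for byte.
inline PayloadSource choosePayload(const std::vector<uint8_t> &Buf) {
  const bool IsBitcode = looksLikeRawBitcode(Buf.data(), Buf.size()) ||
                         looksLikeBitcodeWrapper(Buf.data(), Buf.size());
  if (Buf.empty() || !IsBitcode)
    return PayloadSource::ReserializeModule;
  return PayloadSource::CopyInputBytes;
}
```

This only models where the bytes come from. It does not address the reviewer's question of when the module-level LTO passes and summary generation run when compiling from a source file.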

Diff 450402
