This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/LTO/
-
llvm/
-
LTO/
-
Config.h
-
lib/LTO/
-
LTO/
-
LTOBackend.cpp

Differential D31100

[LTO] Allow client to skip code gen
AbandonedPublic

Authored by tejohnson on Mar 17 2017, 2:01 PM.

Download Raw Diff

Details

Reviewers

pcc

Summary

This is useful when invoking the ThinLTO backend via clang, which
already has handling (immediately after the existing early return
when invoking the ThinLTO backend) for setting up the requested code
generation.

We noticed that when invoking the thinBackend via clang (for the
distributed build case) that flags like -ffunction-sections and
-emit-llvm were not having the intended effect. This could have been
fixed by setting up the TargetOptions and the CodeGenFileType in the LTO
Config, but since clang already has handling for all of this, it is
straightforward to just let it do the handling.

Diff Detail

Build Status

Buildable 5196
Build 5196: arc lint + arc unit

Event Timeline

tejohnson created this revision.Mar 17 2017, 2:01 PM

Harbormaster completed remote builds in B4851: Diff 92197.Mar 17 2017, 2:01 PM

Herald added a subscriber: mehdi_amini. · View Herald TranscriptMar 17 2017, 2:01 PM

tejohnson added a child revision: D31101: [ThinLTO] Use clang's existing code gen handling for ThinLTO backends.Mar 17 2017, 2:02 PM

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.
I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

In D31100#704263, @mehdi_amini wrote:

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.

Can you clarify the concern? We do pass down -ffunction-sections correctly in the non-distributed case, e.g. via the gold plugin. The gold plugin which is only handling LTO doesn't have any built in code gen handling (anymore with the new LTO API).

I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

How so?

In D31100#704273, @tejohnson wrote:

In D31100#704263, @mehdi_amini wrote:

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.

Can you clarify the concern? We do pass down -ffunction-sections correctly in the non-distributed case, e.g. via the gold plugin. The gold plugin which is only handling LTO doesn't have any built in code gen handling (anymore with the new LTO API).

OK, then how do we pass down -ffunction-sections to the LTO API in the non-distributed case? I expect the same flow to happen here.

I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

How so?

The Target setup can take many options in the way it is created. Having the exact same flow and API used in both cases helps figuring out where any discrepancy comes from (it is also less moving pieces).

In D31100#704279, @mehdi_amini wrote:

In D31100#704273, @tejohnson wrote:

In D31100#704263, @mehdi_amini wrote:

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.

Can you clarify the concern? We do pass down -ffunction-sections correctly in the non-distributed case, e.g. via the gold plugin. The gold plugin which is only handling LTO doesn't have any built in code gen handling (anymore with the new LTO API).

OK, then how do we pass down -ffunction-sections to the LTO API in the non-distributed case? I expect the same flow to happen here.

See AddGoldPlugin in clang for where we translate it into the equivalent llvm internal option passed to the plugin. It eventually gets used to init the target options in the Config in the plugin.

I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

How so?

The Target setup can take many options in the way it is created. Having the exact same flow and API used in both cases helps figuring out where any discrepancy comes from (it is also less moving pieces).

We're more limited by what gets passed to the plugin and how, e.g. in the function sections case.

In D31100#704314, @tejohnson wrote:

In D31100#704279, @mehdi_amini wrote:

In D31100#704273, @tejohnson wrote:

In D31100#704263, @mehdi_amini wrote:

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.

Can you clarify the concern? We do pass down -ffunction-sections correctly in the non-distributed case, e.g. via the gold plugin. The gold plugin which is only handling LTO doesn't have any built in code gen handling (anymore with the new LTO API).

OK, then how do we pass down -ffunction-sections to the LTO API in the non-distributed case? I expect the same flow to happen here.

See AddGoldPlugin in clang for where we translate it into the equivalent llvm internal option passed to the plugin. It eventually gets used to init the target options in the Config in the plugin.

Yes saw that, this is what I was expecting "somehow".

We're setting this up as a side-channel using cl::ParseCommandLineOptions(NumOpts, &options::extra[0]); in gold-plugin.cpp (oh boy I don't like the backend in general, and this is no exception).

And then: Conf.Options = InitTargetOptionsFromCodeGenFlags(); (Conf is lto::Config here, and Conf.Options is TargetOptions)

So I rather get clang to initialize the TargetOptions in the LTO config. I think it should be easy to extract EmitAssemblyHelper::CreateTargetOptions out of EmitAssemblyHelper::CreateTargetMachine and reuse it for this purpose.

I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

How so?

The Target setup can take many options in the way it is created. Having the exact same flow and API used in both cases helps figuring out where any discrepancy comes from (it is also less moving pieces).

We're more limited by what gets passed to the plugin and how, e.g. in the function sections case.

Not sure what you mean here, I suspect this is why I'm against using a different flow: we're not supposed to be passing flags differently one case from the other.

In D31100#704352, @mehdi_amini wrote:

In D31100#704314, @tejohnson wrote:

In D31100#704279, @mehdi_amini wrote:

In D31100#704273, @tejohnson wrote:

In D31100#704263, @mehdi_amini wrote:

We noticed that when invoking the thinBackend via clang (for the distributed build case) that flags like -ffunction-sections and -emit-llvm were not having the intended effect. This could have been fixed by setting up the TargetOptions and the CodeGenFileType in the LTO Config, but since clang already has handling for all of this, it is straightforward to just let it do the handling.

Well, I'm not convinced it is the right thing to do. It is the most straightforward thing to do for the distributed case, but it creates a discrepancy with the non-distributed case, and it won't allow to honor -ffunction-sections this way.

Can you clarify the concern? We do pass down -ffunction-sections correctly in the non-distributed case, e.g. via the gold plugin. The gold plugin which is only handling LTO doesn't have any built in code gen handling (anymore with the new LTO API).

OK, then how do we pass down -ffunction-sections to the LTO API in the non-distributed case? I expect the same flow to happen here.

See AddGoldPlugin in clang for where we translate it into the equivalent llvm internal option passed to the plugin. It eventually gets used to init the target options in the Config in the plugin.

Yes saw that, this is what I was expecting "somehow".

We're setting this up as a side-channel using cl::ParseCommandLineOptions(NumOpts, &options::extra[0]); in gold-plugin.cpp (oh boy I don't like the backend in general, and this is no exception).

And then: Conf.Options = InitTargetOptionsFromCodeGenFlags(); (Conf is lto::Config here, and Conf.Options is TargetOptions)

So I rather get clang to initialize the TargetOptions in the LTO config. I think it should be easy to extract EmitAssemblyHelper::CreateTargetOptions out of EmitAssemblyHelper::CreateTargetMachine and reuse it for this purpose.

That is doable, but I don't see the advantage of duplicating the logic in EmitAssemblyHelper (e.g. CreateTargetMachine), just so we can invoke the codegen through LTO, since we have to be able to do this outside LTO in EmitAssemblyHelper in the non-LTO case in clang. We have to set up TargetOptions in either case, see below.

I also believe that it won't help to guarantee that wether you're using a distributed build or not you get the same binary.

How so?

The Target setup can take many options in the way it is created. Having the exact same flow and API used in both cases helps figuring out where any discrepancy comes from (it is also less moving pieces).

We're more limited by what gets passed to the plugin and how, e.g. in the function sections case.

Not sure what you mean here, I suspect this is why I'm against using a different flow: we're not supposed to be passing flags differently one case from the other.

There's not much choice here I think. In the case of doing a ThinLTO backend via clang, this information is passed in memory in the CodeGenOptions data structure. That is used to fill in the TargetOptions data structure in EmitAssemblyHelper::CreateTargetMachine.

In the in-process gold case, it needs to be passed from clang into the gold plugin somehow, which can't be done directly via the CodeGenOptions data structure, so we pass internal options. Then the gold plugin is responsible for getting those to init the TargetOptions (as you note above it does that via InitTargetOptionsFromCodeGenFlags).

So regardless of whether the clang invocation of the ThinLTO backends uses LTO to do the code gen, and sets up the TargetOptions struct within lto::Config, or just uses the existing EmitAssemblyHelper which sets up the TargetOptions there, the setup of TargetOptions is up to the LTO client.

In D31100#704477, @tejohnson wrote:

So I rather get clang to initialize the TargetOptions in the LTO config. I think it should be easy to extract EmitAssemblyHelper::CreateTargetOptions out of EmitAssemblyHelper::CreateTargetMachine and reuse it for this purpose.

That is doable, but I don't see the advantage of duplicating the logic in EmitAssemblyHelper (e.g. CreateTargetMachine), just so we can invoke the codegen through LTO, since we have to be able to do this outside LTO in EmitAssemblyHelper in the non-LTO case in clang. We have to set up TargetOptions in either case, see below.

Not clear which duplicating logic you're referring to right now? I'm talking about *reusing* the logic (OK I may miss something because I haven't looked deep enough).

Not sure what you mean here, I suspect this is why I'm against using a different flow: we're not supposed to be passing flags differently one case from the other.

There's not much choice here I think. In the case of doing a ThinLTO backend via clang, this information is passed in memory in the CodeGenOptions data structure. That is used to fill in the TargetOptions data structure in EmitAssemblyHelper::CreateTargetMachine.

In the in-process gold case, it needs to be passed from clang into the gold plugin somehow, which can't be done directly via the CodeGenOptions data structure, so we pass internal options. Then the gold plugin is responsible for getting those to init the TargetOptions (as you note above it does that via InitTargetOptionsFromCodeGenFlags).

So regardless of whether the clang invocation of the ThinLTO backends uses LTO to do the code gen, and sets up the TargetOptions struct within lto::Config, or just uses the existing EmitAssemblyHelper which sets up the TargetOptions there, the setup of TargetOptions is up to the LTO client.

What I'm loooking at is to have the guarantee that by dumping (serializing) the lto::Config, you get all the information to reproduce. Ultimately I'd like to include this as part of "save-temps" and be able to reload the config to replay a backend. Having a single API all the way helps to setup something like this.

In D31100#704580, @mehdi_amini wrote:

In D31100#704477, @tejohnson wrote:

So I rather get clang to initialize the TargetOptions in the LTO config. I think it should be easy to extract EmitAssemblyHelper::CreateTargetOptions out of EmitAssemblyHelper::CreateTargetMachine and reuse it for this purpose.

That is doable, but I don't see the advantage of duplicating the logic in EmitAssemblyHelper (e.g. CreateTargetMachine), just so we can invoke the codegen through LTO, since we have to be able to do this outside LTO in EmitAssemblyHelper in the non-LTO case in clang. We have to set up TargetOptions in either case, see below.

Not clear which duplicating logic you're referring to right now? I'm talking about *reusing* the logic (OK I may miss something because I haven't looked deep enough).

I figured it'll be easier to figure what I had in mind by implementing it: https://reviews.llvm.org/D31114

tejohnson mentioned this in D31114: Refactor `initTargetOptions` out of `EmitAssemblyHelper::CreateTargetMachine` and use it to initialize TargetOptions for ThinLTO Backends.Mar 18 2017, 11:33 AM

Add support for allowing LTO client to emit LLVM IR assembly or bitcode.

Subsumed by D31534

Revision Contents

Path

Size

include/

llvm/

LTO/

Config.h

10 lines

lib/

LTO/

LTOBackend.cpp

33 lines

Diff 93534

include/llvm/LTO/Config.h

Show All 40 Lines	struct Config {
std::vector<std::string> MAttrs;		std::vector<std::string> MAttrs;
Reloc::Model RelocModel = Reloc::PIC_;		Reloc::Model RelocModel = Reloc::PIC_;
CodeModel::Model CodeModel = CodeModel::Default;		CodeModel::Model CodeModel = CodeModel::Default;
CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;		CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;
TargetMachine::CodeGenFileType CGFileType = TargetMachine::CGFT_ObjectFile;		TargetMachine::CodeGenFileType CGFileType = TargetMachine::CGFT_ObjectFile;
unsigned OptLevel = 2;		unsigned OptLevel = 2;
bool DisableVerify = false;		bool DisableVerify = false;

		enum EmitIRType {
		EIRT_CodeGen, /// Do CodeGen and emit CGFileType
		EIRT_LL, /// Emit human-readable LLVM assembly
		EIRT_BC, /// Emit LLVM bitcode files
		EIRT_Nothing, /// Don't emit anything (benchmarking mode)
		};

		/// Type of IR file to output, defaults to none (do codegen).
		EmitIRType IROutputType = EIRT_CodeGen;

/// Disable entirely the optimizer, including importing for ThinLTO		/// Disable entirely the optimizer, including importing for ThinLTO
bool CodeGenOnly = false;		bool CodeGenOnly = false;

/// If this field is set, the set of passes run in the middle-end optimizer		/// If this field is set, the set of passes run in the middle-end optimizer
/// will be the one specified by the string. Only works with the new pass		/// will be the one specified by the string. Only works with the new pass
/// manager as the old one doesn't have this ability.		/// manager as the old one doesn't have this ability.
std::string OptPipeline;		std::string OptPipeline;

▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

lib/LTO/LTOBackend.cpp

Show All 15 Lines

#include "llvm/LTO/LTOBackend.h"		#include "llvm/LTO/LTOBackend.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/CGSCCPassManager.h"		#include "llvm/Analysis/CGSCCPassManager.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Bitcode/BitcodeReader.h"		#include "llvm/Bitcode/BitcodeReader.h"
#include "llvm/Bitcode/BitcodeWriter.h"		#include "llvm/Bitcode/BitcodeWriter.h"
		#include "llvm/Bitcode/BitcodeWriterPass.h"
		#include "llvm/IR/IRPrintingPasses.h"
#include "llvm/IR/LegacyPassManager.h"		#include "llvm/IR/LegacyPassManager.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"
#include "llvm/IR/Verifier.h"		#include "llvm/IR/Verifier.h"
#include "llvm/LTO/LTO.h"		#include "llvm/LTO/LTO.h"
#include "llvm/LTO/legacy/UpdateCompilerUsed.h"		#include "llvm/LTO/legacy/UpdateCompilerUsed.h"
#include "llvm/MC/SubtargetFeature.h"		#include "llvm/MC/SubtargetFeature.h"
#include "llvm/Passes/PassBuilder.h"		#include "llvm/Passes/PassBuilder.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
▲ Show 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	bool opt(Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,
return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);		return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);
}		}

void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,		void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,
unsigned Task, Module &Mod) {		unsigned Task, Module &Mod) {
if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))		if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))
return;		return;

		switch (Conf.IROutputType) {
		case Config::EIRT_Nothing:
		break;

		case Config::EIRT_BC: {
		auto Stream = AddStream(Task);
		legacy::PassManager PerModulePasses;
		PerModulePasses.add(createBitcodeWriterPass(*Stream->OS));
		PerModulePasses.run(Mod);
		} break;

		case Config::EIRT_LL: {
		auto Stream = AddStream(Task);
		legacy::PassManager PerModulePasses;
		PerModulePasses.add(createPrintModulePass(*Stream->OS));
		PerModulePasses.run(Mod);
		} break;

		case Config::EIRT_CodeGen: {
auto Stream = AddStream(Task);		auto Stream = AddStream(Task);
legacy::PassManager CodeGenPasses;		legacy::PassManager CodeGenPasses;
if (TM->addPassesToEmitFile(CodeGenPasses, *Stream->OS, Conf.CGFileType))		if (TM->addPassesToEmitFile(CodeGenPasses, *Stream->OS, Conf.CGFileType))
report_fatal_error("Failed to setup codegen");		report_fatal_error("Failed to setup codegen");
CodeGenPasses.run(Mod);		CodeGenPasses.run(Mod);
		} break;
		}
}		}

void splitCodeGen(Config &C, TargetMachine *TM, AddStreamFn AddStream,		void splitCodeGen(Config &C, TargetMachine *TM, AddStreamFn AddStream,
unsigned ParallelCodeGenParallelismLevel,		unsigned ParallelCodeGenParallelismLevel,
std::unique_ptr<Module> Mod) {		std::unique_ptr<Module> Mod) {
ThreadPool CodegenThreadPool(ParallelCodeGenParallelismLevel);		ThreadPool CodegenThreadPool(ParallelCodeGenParallelismLevel);
unsigned ThreadCount = 0;		unsigned ThreadCount = 0;
const Target *T = &TM->getTarget();		const Target *T = &TM->getTarget();
▲ Show 20 Lines • Show All 176 Lines • Show Last 20 Lines