This is an archive of the discontinued LLVM Phabricator instance.

Size LTO (1/3): Standardizing the use of OptimizationLevel
Needs ReviewPublic

Authored by rcorcs on Jun 4 2020, 10:53 PM.

Download Raw Diff

Details

Reviewers

hiraditya
• espindola
MaskRay
tejohnson
steven_wu
mtrofin
hjyamauchi

Summary

This patch is the first in the sequence of three patches for supporting size optimization with LTO. The planned patches are:
1: Standardizing the use of OptimizationLevel across pass builders, which includes both SpeedupLevel and SizeLevel.
2: Enable the support for -Os and -Oz for LTO in lld.
3: Tune the LTO pipeline for size optimization.

Since we already have a class that describes both speed and size levels of optimization, I believe it is a good idea to use across the code base when defining optimization levels.
In the next patch, instead of adding a SizeLevel variable for the LTO configuration, I'll be able to simply use this OptimizationLevel variable.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rcorcs created this revision.Jun 4 2020, 10:53 PM

Herald added a reviewer: • espindola. · View Herald TranscriptJun 4 2020, 10:53 PM

Herald added a reviewer: MaskRay. · View Herald Transcript

Herald added projects: Restricted Project, Restricted Project. · View Herald Transcript

Herald added subscribers: cfe-commits, msifontes, jurahul and 24 others. · View Herald Transcript

Harbormaster failed remote builds in B59195: Diff 268668!Jun 4 2020, 11:25 PM

This patch is the first in the sequence of three patches for supporting size optimization with LTO.

Can you start by describing the problem you're trying to solve and the overall approach, including the end-to-end user-interface?

LLVM already has the class PassBuilder::OptimizationLevel that encapsulates the logic of both speed and size optimization levels. This class already checks which values for SpeedLevel and SizeLevel are valid.

However, other parts of the code define two separate variables to describe speed and size optimization levels with their semantic specified either in comments or code. Note that when SizeLevel!=0, OptLevel (or SpeedLevel) is usually expected to be 2, that is, their values are interdependent.

From my understanding, ideally, LLVM would use the same OptimizationLevel encapsulation everywhere. If the same encapsulation is used everywhere we can avoid conversions and guarantee
that they always have the same semantics.

For example, the class PassManagerBuilder defines these two separate variables with their semantics in comments:

/// The Optimization Level - Specify the basic optimization level.
///    0 = -O0, 1 = -O1, 2 = -O2, 3 = -O3 
unsigned OptLevel;

/// SizeLevel - How much we're optimizing for size.
///    0 = none, 1 = -Os, 2 = -Oz
unsigned SizeLevel;

On the other hand, the class ThinLTOCodeGenerator defines the semantics of OptLevel in code:

/// IR optimization level: from 0 to 3.
void setOptLevel(unsigned NewOptLevel) {
  OptLevel = (NewOptLevel > 3) ? 3 : NewOptLevel;
}

This patch standardizes the use of OptimizationLevel across PassBuilder, PassManagerBuilder, LTO configuration, and LTO code generators. Even with this patch, further work is still needed to standardize the use of OptimizationLevel across LLVM.

If reviewers think that this patch is touching in too many files, I could try to focus it only on the LTO related files, converting OptimizationLevel back to two separate values when necessary.

fhahn added reviewers: tejohnson, steven_wu.Jun 5 2020, 8:29 AM

In D81223#2076420, @rcorcs wrote:

This patch standardizes the use of OptimizationLevel across PassBuilder, PassManagerBuilder, LTO configuration, and LTO code generators. Even with this patch, further work is still needed to standardize the use of OptimizationLevel across LLVM.

The distinction of size level for LTO isn't obvious to me, this is why I'm asking some clarification here.
In general with LTO the Os/Oz frontend flags will trigger the addition of function attributes, however the Os/Oz flags aren't impacting the optimizer pipeline during LTO (so they basically have no effect during LTO and get mapped to O2 directly).

The way I see it, with size level for LTO, we could have a different LTO optimization pipeline for size or runtime performance. For example, we could have a different tuning for inlining, vectorization, etc. We could also use the size level to automatically enable optimizations such as HotColdSplitting, MergeFunctions, etc., instead of relying on specific enabling flags. We could also have other size-specific optimizations in the future, such as MergeSimilarFunctions (https://reviews.llvm.org/D52896).

I believe that function attributes for size are useful for optimizing cold functions that have been outlined by HotColdSplitting, for example. However, an ideal size level LTO would involve a different optimization pipeline and also a different tuning of those optimizations.

For example, when optimizing for size, we could disable loop vectorization and have SLP optimizing based on the code-size cost model. We could also have MergeFunctions enabled with Os and both MergeFunctions and MergeSimilarFunctions enabled with Oz. A similar logic could be applied to other optimizations, such as Inlining, HotColdSplitting, etc.

tejohnson added reviewers: mtrofin, hjyamauchi.Jun 11 2020, 8:41 AM

Sorry I haven't had a chance to review this yet. Added @mtrofin who recently added the OptimizationLevel class that this is leveraging, and @yamauchi who has also been looking at size optimizations in llvm.

At a first glance it isn't clear to me how much of this is NFC related refactoring/cleanup vs behavior change. If it has both, it would be helpful to split into an NFC patch first and then a follow on with actual behavior changes. Can you clarify?

mtrofin added inline comments.Jun 11 2020, 9:04 AM

llvm/include/llvm/IR/PassManager.h
413	I think this change - moving OptimizationLevel out - should be in its own patch, to avoid noise.
llvm/include/llvm/LTO/legacy/ThinLTOCodeGenerator.h
22	It's unfortunate we now need to pull pass management into places that didn't have that dependency. IIUC, the goal of this overall effort includes piping though the full user-requested optimization parameters (i.e. both speed and size). Given the likely diversity of the consumers, it may make sense to move OptimizationLevel in its own header?
llvm/tools/opt/CMakeLists.txt
20	Nit: make this change separately, and since it's just a style change, it can probably be just submitted with no review.

In D81223#2087660, @rcorcs wrote:

The way I see it, with size level for LTO, we could have a different LTO optimization pipeline for size or runtime performance.

So this is the important point to settle before going on with any patch: this isn't how LTO is setup today.

For example, we could have a different tuning for inlining, vectorization, etc.

All these are covered by the function attributes already.

We could also use the size level to automatically enable optimizations such as HotColdSplitting, MergeFunctions, etc., instead of relying on specific enabling flags. We could also have other size-specific optimizations in the future, such as MergeSimilarFunctions (https://reviews.llvm.org/D52896).

All these could be in the LTO pipeline and driven by the attribute as well.

I believe that function attributes for size are useful for optimizing cold functions that have been outlined by HotColdSplitting, for example.

The attribute is added by the frontend and can change per translation-unit / per function though.

aykevl mentioned this in D72404: [ThinLTO/FullLTO] Support Os and Oz.Feb 9 2022, 5:40 AM

aykevl mentioned this in D119342: [LoopRotate] Don't rotate loops when the minsize attribute is present.Feb 9 2022, 7:26 AM

Revision Contents

Path

Size

clang/

lib/

CodeGen/

BackendUtil.cpp

37 lines

lld/

COFF/

LTO.cpp

2 lines

ELF/

LTO.cpp

6 lines

wasm/

LTO.cpp

2 lines

llvm/

examples/

Bye/

Bye.cpp

2 lines

include/

llvm/

IR/

PassManager.h

111 lines

LTO/

Config.h

2 lines

legacy/

LTOCodeGenerator.h

5 lines

ThinLTOCodeGenerator.h

11 lines

Passes/

PassBuilder.h

110 lines

Transforms/

IPO/

PassManagerBuilder.h

10 lines

lib/

IR/

PassManager.cpp

19 lines

LTO/

LTO.cpp

7 lines

LTOBackend.cpp

23 lines

LTOCodeGenerator.cpp

4 lines

ThinLTOCodeGenerator.cpp

6 lines

Passes/

PassBuilder.cpp

25 lines

Transforms/

IPO/

PassManagerBuilder.cpp

86 lines

tools/

bugpoint/

bugpoint.cpp

3 lines

llvm-lto/

llvm-lto.cpp

2 lines

llvm-lto2/

llvm-lto2.cpp

6 lines

lto/

lto.cpp

4 lines

opt/

CMakeLists.txt

2 lines

NewPMDriver.cpp

14 lines

opt.cpp

5 lines

mlir/

lib/

ExecutionEngine/

OptUtils.cpp

3 lines

Diff 268668

clang/lib/CodeGen/BackendUtil.cpp

Show First 20 Lines • Show All 180 Lines • ▼ Show 20 Lines
private:		private:
const Triple &TargetTriple;		const Triple &TargetTriple;
const CodeGenOptions &CGOpts;		const CodeGenOptions &CGOpts;
const LangOptions &LangOpts;		const LangOptions &LangOpts;
};		};
}		}

static void addObjCARCAPElimPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {		static void addObjCARCAPElimPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {
if (Builder.OptLevel > 0)		if (Builder.OptLevel.getSpeedupLevel() > 0)
PM.add(createObjCARCAPElimPass());		PM.add(createObjCARCAPElimPass());
}		}

static void addObjCARCExpandPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {		static void addObjCARCExpandPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {
if (Builder.OptLevel > 0)		if (Builder.OptLevel.getSpeedupLevel() > 0)
PM.add(createObjCARCExpandPass());		PM.add(createObjCARCExpandPass());
}		}

static void addObjCARCOptPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {		static void addObjCARCOptPass(const PassManagerBuilder &Builder, PassManagerBase &PM) {
if (Builder.OptLevel > 0)		if (Builder.OptLevel.getSpeedupLevel() > 0)
PM.add(createObjCARCOptPass());		PM.add(createObjCARCOptPass());
}		}

static void addAddDiscriminatorsPass(const PassManagerBuilder &Builder,		static void addAddDiscriminatorsPass(const PassManagerBuilder &Builder,
legacy::PassManagerBase &PM) {		legacy::PassManagerBase &PM) {
PM.add(createAddDiscriminatorsPass());		PM.add(createAddDiscriminatorsPass());
}		}

▲ Show 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	static void addGeneralOptsForMemorySanitizer(const PassManagerBuilder &Builder,
int TrackOrigins = CGOpts.SanitizeMemoryTrackOrigins;		int TrackOrigins = CGOpts.SanitizeMemoryTrackOrigins;
bool Recover = CGOpts.SanitizeRecover.has(SanitizerKind::Memory);		bool Recover = CGOpts.SanitizeRecover.has(SanitizerKind::Memory);
PM.add(createMemorySanitizerLegacyPassPass(		PM.add(createMemorySanitizerLegacyPassPass(
MemorySanitizerOptions{TrackOrigins, Recover, CompileKernel}));		MemorySanitizerOptions{TrackOrigins, Recover, CompileKernel}));

// MemorySanitizer inserts complex instrumentation that mostly follows		// MemorySanitizer inserts complex instrumentation that mostly follows
// the logic of the original code, but operates on "shadow" values.		// the logic of the original code, but operates on "shadow" values.
// It can benefit from re-running some general purpose optimization passes.		// It can benefit from re-running some general purpose optimization passes.
if (Builder.OptLevel > 0) {		if (Builder.OptLevel.getSpeedupLevel() > 0) {
PM.add(createEarlyCSEPass());		PM.add(createEarlyCSEPass());
PM.add(createReassociatePass());		PM.add(createReassociatePass());
PM.add(createLICMPass());		PM.add(createLICMPass());
PM.add(createGVNPass());		PM.add(createGVNPass());
PM.add(createInstructionCombiningPass());		PM.add(createInstructionCombiningPass());
PM.add(createDeadStoreEliminationPass());		PM.add(createDeadStoreEliminationPass());
}		}
}		}
▲ Show 20 Lines • Show All 277 Lines • ▼ Show 20 Lines	if (CodeGenOpts.OptimizationLevel <= 1) {
// because profile annotation will happen again in ThinLTO backend, and we		// because profile annotation will happen again in ThinLTO backend, and we
// want the IR of the hot path to match the profile.		// want the IR of the hot path to match the profile.
PMBuilder.Inliner = createFunctionInliningPass(		PMBuilder.Inliner = createFunctionInliningPass(
CodeGenOpts.OptimizationLevel, CodeGenOpts.OptimizeSize,		CodeGenOpts.OptimizationLevel, CodeGenOpts.OptimizeSize,
(!CodeGenOpts.SampleProfileFile.empty() &&		(!CodeGenOpts.SampleProfileFile.empty() &&
CodeGenOpts.PrepareForThinLTO));		CodeGenOpts.PrepareForThinLTO));
}		}

PMBuilder.OptLevel = CodeGenOpts.OptimizationLevel;		PMBuilder.OptLevel = {CodeGenOpts.OptimizationLevel,CodeGenOpts.OptimizeSize};
PMBuilder.SizeLevel = CodeGenOpts.OptimizeSize;
PMBuilder.SLPVectorize = CodeGenOpts.VectorizeSLP;		PMBuilder.SLPVectorize = CodeGenOpts.VectorizeSLP;
PMBuilder.LoopVectorize = CodeGenOpts.VectorizeLoop;		PMBuilder.LoopVectorize = CodeGenOpts.VectorizeLoop;

PMBuilder.DisableUnrollLoops = !CodeGenOpts.UnrollLoops;		PMBuilder.DisableUnrollLoops = !CodeGenOpts.UnrollLoops;
// Loop interleaving in the loop vectorizer has historically been set to be		// Loop interleaving in the loop vectorizer has historically been set to be
// enabled when loop unrolling is enabled.		// enabled when loop unrolling is enabled.
PMBuilder.LoopsInterleaved = CodeGenOpts.UnrollLoops;		PMBuilder.LoopsInterleaved = CodeGenOpts.UnrollLoops;
PMBuilder.MergeFunctions = CodeGenOpts.MergeFunctions;		PMBuilder.MergeFunctions = CodeGenOpts.MergeFunctions;
▲ Show 20 Lines • Show All 335 Lines • ▼ Show 20 Lines	void EmitAssemblyHelper::EmitAssembly(BackendAction Action,
}		}

if (ThinLinkOS)		if (ThinLinkOS)
ThinLinkOS->keep();		ThinLinkOS->keep();
if (DwoOS)		if (DwoOS)
DwoOS->keep();		DwoOS->keep();
}		}

static PassBuilder::OptimizationLevel mapToLevel(const CodeGenOptions &Opts) {		static OptimizationLevel mapToLevel(const CodeGenOptions &Opts) {
switch (Opts.OptimizationLevel) {		switch (Opts.OptimizationLevel) {
default:		default:
llvm_unreachable("Invalid optimization level!");		llvm_unreachable("Invalid optimization level!");

case 1:		case 1:
return PassBuilder::OptimizationLevel::O1;		return OptimizationLevel::O1;

case 2:		case 2:
switch (Opts.OptimizeSize) {		switch (Opts.OptimizeSize) {
default:		default:
llvm_unreachable("Invalid optimization level for size!");		llvm_unreachable("Invalid optimization level for size!");

case 0:		case 0:
return PassBuilder::OptimizationLevel::O2;		return OptimizationLevel::O2;

case 1:		case 1:
return PassBuilder::OptimizationLevel::Os;		return OptimizationLevel::Os;

case 2:		case 2:
return PassBuilder::OptimizationLevel::Oz;		return OptimizationLevel::Oz;
}		}

case 3:		case 3:
return PassBuilder::OptimizationLevel::O3;		return OptimizationLevel::O3;
}		}
}		}

static void addCoroutinePassesAtO0(ModulePassManager &MPM,		static void addCoroutinePassesAtO0(ModulePassManager &MPM,
const LangOptions &LangOpts,		const LangOptions &LangOpts,
const CodeGenOptions &CodeGenOpts) {		const CodeGenOptions &CodeGenOpts) {
if (!LangOpts.Coroutines)		if (!LangOpts.Coroutines)
return;		return;
▲ Show 20 Lines • Show All 237 Lines • ▼ Show 20 Lines	if (CodeGenOpts.OptimizationLevel == 0) {
// Lastly, add semantically necessary passes for LTO.		// Lastly, add semantically necessary passes for LTO.
if (IsLTO \|\| IsThinLTO) {		if (IsLTO \|\| IsThinLTO) {
MPM.addPass(CanonicalizeAliasesPass());		MPM.addPass(CanonicalizeAliasesPass());
MPM.addPass(NameAnonGlobalPass());		MPM.addPass(NameAnonGlobalPass());
}		}
} else {		} else {
// Map our optimization levels into one of the distinct levels used to		// Map our optimization levels into one of the distinct levels used to
// configure the pipeline.		// configure the pipeline.
PassBuilder::OptimizationLevel Level = mapToLevel(CodeGenOpts);		OptimizationLevel Level = mapToLevel(CodeGenOpts);

// If we reached here with a non-empty index file name, then the index		// If we reached here with a non-empty index file name, then the index
// file was empty and we are not performing ThinLTO backend compilation		// file was empty and we are not performing ThinLTO backend compilation
// (used in testing in a distributed build environment). Drop any the type		// (used in testing in a distributed build environment). Drop any the type
// test assume sequences inserted for whole program vtables so that		// test assume sequences inserted for whole program vtables so that
// codegen doesn't complain.		// codegen doesn't complain.
if (!CodeGenOpts.ThinLTOIndexFile.empty())		if (!CodeGenOpts.ThinLTOIndexFile.empty())
PB.registerPipelineStartEPCallback([](ModulePassManager &MPM) {		PB.registerPipelineStartEPCallback([](ModulePassManager &MPM) {
MPM.addPass(LowerTypeTestsPass(/ExportSummary=/nullptr,		MPM.addPass(LowerTypeTestsPass(/ExportSummary=/nullptr,
/ImportSummary=/nullptr,		/ImportSummary=/nullptr,
/DropTypeTests=/true));		/DropTypeTests=/true));
});		});

PB.registerPipelineStartEPCallback([](ModulePassManager &MPM) {		PB.registerPipelineStartEPCallback([](ModulePassManager &MPM) {
MPM.addPass(createModuleToFunctionPassAdaptor(		MPM.addPass(createModuleToFunctionPassAdaptor(
EntryExitInstrumenterPass(/PostInlining=/false)));		EntryExitInstrumenterPass(/PostInlining=/false)));
});		});

// Register callbacks to schedule sanitizer passes at the appropriate part of		// Register callbacks to schedule sanitizer passes at the appropriate part of
// the pipeline.		// the pipeline.
// FIXME: either handle asan/the remaining sanitizers or error out		// FIXME: either handle asan/the remaining sanitizers or error out
if (LangOpts.Sanitize.has(SanitizerKind::LocalBounds))		if (LangOpts.Sanitize.has(SanitizerKind::LocalBounds))
PB.registerScalarOptimizerLateEPCallback(		PB.registerScalarOptimizerLateEPCallback(
[](FunctionPassManager &FPM, PassBuilder::OptimizationLevel Level) {		[](FunctionPassManager &FPM, OptimizationLevel Level) {
FPM.addPass(BoundsCheckingPass());		FPM.addPass(BoundsCheckingPass());
});		});

if (CodeGenOpts.SanitizeCoverageType \|\|		if (CodeGenOpts.SanitizeCoverageType \|\|
CodeGenOpts.SanitizeCoverageIndirectCalls \|\|		CodeGenOpts.SanitizeCoverageIndirectCalls \|\|
CodeGenOpts.SanitizeCoverageTraceCmp) {		CodeGenOpts.SanitizeCoverageTraceCmp) {
PB.registerOptimizerLastEPCallback(		PB.registerOptimizerLastEPCallback(
[this](ModulePassManager &MPM,		[this](ModulePassManager &MPM,
PassBuilder::OptimizationLevel Level) {		OptimizationLevel Level) {
auto SancovOpts = getSancovOptsFromCGOpts(CodeGenOpts);		auto SancovOpts = getSancovOptsFromCGOpts(CodeGenOpts);
MPM.addPass(ModuleSanitizerCoveragePass(		MPM.addPass(ModuleSanitizerCoveragePass(
SancovOpts, CodeGenOpts.SanitizeCoverageWhitelistFiles,		SancovOpts, CodeGenOpts.SanitizeCoverageWhitelistFiles,
CodeGenOpts.SanitizeCoverageBlacklistFiles));		CodeGenOpts.SanitizeCoverageBlacklistFiles));
});		});
}		}

if (LangOpts.Sanitize.has(SanitizerKind::Memory)) {		if (LangOpts.Sanitize.has(SanitizerKind::Memory)) {
int TrackOrigins = CodeGenOpts.SanitizeMemoryTrackOrigins;		int TrackOrigins = CodeGenOpts.SanitizeMemoryTrackOrigins;
bool Recover = CodeGenOpts.SanitizeRecover.has(SanitizerKind::Memory);		bool Recover = CodeGenOpts.SanitizeRecover.has(SanitizerKind::Memory);
PB.registerPipelineStartEPCallback(		PB.registerPipelineStartEPCallback(
[TrackOrigins, Recover](ModulePassManager &MPM) {		[TrackOrigins, Recover](ModulePassManager &MPM) {
MPM.addPass(MemorySanitizerPass({TrackOrigins, Recover, false}));		MPM.addPass(MemorySanitizerPass({TrackOrigins, Recover, false}));
});		});
PB.registerOptimizerLastEPCallback(		PB.registerOptimizerLastEPCallback(
[TrackOrigins, Recover](ModulePassManager &MPM,		[TrackOrigins, Recover](ModulePassManager &MPM,
PassBuilder::OptimizationLevel Level) {		OptimizationLevel Level) {
MPM.addPass(createModuleToFunctionPassAdaptor(		MPM.addPass(createModuleToFunctionPassAdaptor(
MemorySanitizerPass({TrackOrigins, Recover, false})));		MemorySanitizerPass({TrackOrigins, Recover, false})));
});		});
}		}
if (LangOpts.Sanitize.has(SanitizerKind::Thread)) {		if (LangOpts.Sanitize.has(SanitizerKind::Thread)) {
PB.registerPipelineStartEPCallback(		PB.registerPipelineStartEPCallback(
[](ModulePassManager &MPM) { MPM.addPass(ThreadSanitizerPass()); });		[](ModulePassManager &MPM) { MPM.addPass(ThreadSanitizerPass()); });
PB.registerOptimizerLastEPCallback(		PB.registerOptimizerLastEPCallback(
[](ModulePassManager &MPM, PassBuilder::OptimizationLevel Level) {		[](ModulePassManager &MPM, OptimizationLevel Level) {
MPM.addPass(		MPM.addPass(
createModuleToFunctionPassAdaptor(ThreadSanitizerPass()));		createModuleToFunctionPassAdaptor(ThreadSanitizerPass()));
});		});
}		}
if (LangOpts.Sanitize.has(SanitizerKind::Address)) {		if (LangOpts.Sanitize.has(SanitizerKind::Address)) {
PB.registerPipelineStartEPCallback([&](ModulePassManager &MPM) {		PB.registerPipelineStartEPCallback([&](ModulePassManager &MPM) {
MPM.addPass(		MPM.addPass(
RequireAnalysisPass<ASanGlobalsMetadataAnalysis, Module>());		RequireAnalysisPass<ASanGlobalsMetadataAnalysis, Module>());
});		});
bool Recover = CodeGenOpts.SanitizeRecover.has(SanitizerKind::Address);		bool Recover = CodeGenOpts.SanitizeRecover.has(SanitizerKind::Address);
bool UseAfterScope = CodeGenOpts.SanitizeAddressUseAfterScope;		bool UseAfterScope = CodeGenOpts.SanitizeAddressUseAfterScope;
PB.registerOptimizerLastEPCallback(		PB.registerOptimizerLastEPCallback(
[Recover, UseAfterScope](ModulePassManager &MPM,		[Recover, UseAfterScope](ModulePassManager &MPM,
PassBuilder::OptimizationLevel Level) {		OptimizationLevel Level) {
MPM.addPass(		MPM.addPass(
createModuleToFunctionPassAdaptor(AddressSanitizerPass(		createModuleToFunctionPassAdaptor(AddressSanitizerPass(
/CompileKernel=/false, Recover, UseAfterScope)));		/CompileKernel=/false, Recover, UseAfterScope)));
});		});
bool ModuleUseAfterScope = asanUseGlobalsGC(TargetTriple, CodeGenOpts);		bool ModuleUseAfterScope = asanUseGlobalsGC(TargetTriple, CodeGenOpts);
bool UseOdrIndicator = CodeGenOpts.SanitizeAddressUseOdrIndicator;		bool UseOdrIndicator = CodeGenOpts.SanitizeAddressUseOdrIndicator;
PB.registerPipelineStartEPCallback(		PB.registerPipelineStartEPCallback(
[Recover, ModuleUseAfterScope,		[Recover, ModuleUseAfterScope,
▲ Show 20 Lines • Show All 227 Lines • ▼ Show 20 Lines	if (Error E = Conf.addSaveTemps(CGOpts.SaveTempsFilePrefix + ".",
});		});
}		}
}		}
Conf.CPU = TOpts.CPU;		Conf.CPU = TOpts.CPU;
Conf.CodeModel = getCodeModel(CGOpts);		Conf.CodeModel = getCodeModel(CGOpts);
Conf.MAttrs = TOpts.Features;		Conf.MAttrs = TOpts.Features;
Conf.RelocModel = CGOpts.RelocationModel;		Conf.RelocModel = CGOpts.RelocationModel;
Conf.CGOptLevel = getCGOptLevel(CGOpts);		Conf.CGOptLevel = getCGOptLevel(CGOpts);
Conf.OptLevel = CGOpts.OptimizationLevel;		Conf.OptLevel = {CGOpts.OptimizationLevel, 0};
initTargetOptions(Diags, Conf.Options, CGOpts, TOpts, LOpts, HeaderOpts);		initTargetOptions(Diags, Conf.Options, CGOpts, TOpts, LOpts, HeaderOpts);
Conf.SampleProfile = std::move(SampleProfile);		Conf.SampleProfile = std::move(SampleProfile);
Conf.PTO.LoopUnrolling = CGOpts.UnrollLoops;		Conf.PTO.LoopUnrolling = CGOpts.UnrollLoops;
// For historical reasons, loop interleaving is set to mirror setting for loop		// For historical reasons, loop interleaving is set to mirror setting for loop
// unrolling.		// unrolling.
Conf.PTO.LoopInterleaving = CGOpts.UnrollLoops;		Conf.PTO.LoopInterleaving = CGOpts.UnrollLoops;
Conf.PTO.LoopVectorization = CGOpts.VectorizeLoop;		Conf.PTO.LoopVectorization = CGOpts.VectorizeLoop;
Conf.PTO.SLPVectorization = CGOpts.VectorizeSLP;		Conf.PTO.SLPVectorization = CGOpts.VectorizeSLP;
▲ Show 20 Lines • Show All 130 Lines • Show Last 20 Lines

lld/COFF/LTO.cpp

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	static lto::Config createConfig() {
// compact code, and because there are also known code generation bugs when		// compact code, and because there are also known code generation bugs when
// using the PIC model (see PR34306).		// using the PIC model (see PR34306).
if (config->machine == COFF::IMAGE_FILE_MACHINE_I386)		if (config->machine == COFF::IMAGE_FILE_MACHINE_I386)
c.RelocModel = Reloc::Static;		c.RelocModel = Reloc::Static;
else		else
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
c.DisableVerify = true;		c.DisableVerify = true;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = {config->ltoo, 0};
c.CPU = getCPUStr();		c.CPU = getCPUStr();
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);
c.AlwaysEmitRegularLTOObj = !config->ltoObjPath.empty();		c.AlwaysEmitRegularLTOObj = !config->ltoObjPath.empty();

if (config->saveTemps)		if (config->saveTemps)
checkError(c.addSaveTemps(std::string(config->outputFile) + ".",		checkError(c.addSaveTemps(std::string(config->outputFile) + ".",
/UseInputModulePath/ true));		/UseInputModulePath/ true));
▲ Show 20 Lines • Show All 132 Lines • Show Last 20 Lines

lld/ELF/LTO.cpp

Show First 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	static lto::Config createConfig() {
else if (config->isPic)		else if (config->isPic)
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
else		else
c.RelocModel = Reloc::Static;		c.RelocModel = Reloc::Static;

c.CodeModel = getCodeModelFromCMModel();		c.CodeModel = getCodeModelFromCMModel();
c.DisableVerify = config->disableVerify;		c.DisableVerify = config->disableVerify;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = {config->ltoo, 0};
c.CPU = getCPUStr();		c.CPU = getCPUStr();
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);

c.PTO.LoopVectorization = c.OptLevel > 1;		c.PTO.LoopVectorization = c.OptLevel.getSpeedupLevel() > 1;
c.PTO.SLPVectorization = c.OptLevel > 1;		c.PTO.SLPVectorization = c.OptLevel.getSpeedupLevel() > 1;

// Set up a custom pipeline if we've been asked to.		// Set up a custom pipeline if we've been asked to.
c.OptPipeline = std::string(config->ltoNewPmPasses);		c.OptPipeline = std::string(config->ltoNewPmPasses);
c.AAPipeline = std::string(config->ltoAAPipeline);		c.AAPipeline = std::string(config->ltoAAPipeline);

// Set up optimization remarks if we've been asked to.		// Set up optimization remarks if we've been asked to.
c.RemarksFilename = std::string(config->optRemarksFilename);		c.RemarksFilename = std::string(config->optRemarksFilename);
c.RemarksPasses = std::string(config->optRemarksPasses);		c.RemarksPasses = std::string(config->optRemarksPasses);
▲ Show 20 Lines • Show All 222 Lines • Show Last 20 Lines

lld/wasm/LTO.cpp

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	static std::unique_ptr<lto::LTO> createLTO() {
c.Options = initTargetOptionsFromCodeGenFlags();		c.Options = initTargetOptionsFromCodeGenFlags();

// Always emit a section per function/data with LTO.		// Always emit a section per function/data with LTO.
c.Options.FunctionSections = true;		c.Options.FunctionSections = true;
c.Options.DataSections = true;		c.Options.DataSections = true;

c.DisableVerify = config->disableVerify;		c.DisableVerify = config->disableVerify;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = {config->ltoo, 0};
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);

if (config->relocatable)		if (config->relocatable)
c.RelocModel = None;		c.RelocModel = None;
else if (config->isPic)		else if (config->isPic)
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
else		else
▲ Show 20 Lines • Show All 107 Lines • Show Last 20 Lines

llvm/examples/Bye/Bye.cpp

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	[](const llvm::PassManagerBuilder &Builder,
llvm::legacy::PassManagerBase &PM) { PM.add(new LegacyBye()); });		llvm::legacy::PassManagerBase &PM) { PM.add(new LegacyBye()); });

/* New PM Registration */		/* New PM Registration */
llvm::PassPluginLibraryInfo getByePluginInfo() {		llvm::PassPluginLibraryInfo getByePluginInfo() {
return {LLVM_PLUGIN_API_VERSION, "Bye", LLVM_VERSION_STRING,		return {LLVM_PLUGIN_API_VERSION, "Bye", LLVM_VERSION_STRING,
[](PassBuilder &PB) {		[](PassBuilder &PB) {
PB.registerVectorizerStartEPCallback(		PB.registerVectorizerStartEPCallback(
[](llvm::FunctionPassManager &PM,		[](llvm::FunctionPassManager &PM,
llvm::PassBuilder::OptimizationLevel Level) {		llvm::OptimizationLevel Level) {
PM.addPass(Bye());		PM.addPass(Bye());
});		});
PB.registerPipelineParsingCallback(		PB.registerPipelineParsingCallback(
[](StringRef Name, llvm::FunctionPassManager &PM,		[](StringRef Name, llvm::FunctionPassManager &PM,
ArrayRef<llvm::PassBuilder::PipelineElement>) {		ArrayRef<llvm::PassBuilder::PipelineElement>) {
if (Name == "goodbye") {		if (Name == "goodbye") {
PM.addPass(Bye());		PM.addPass(Bye());
return true;		return true;
Show All 12 Lines

llvm/include/llvm/IR/PassManager.h

Show First 20 Lines • Show All 404 Lines • ▼ Show 20 Lines	struct AnalysisInfoMixin : PassInfoMixin<DerivedT> {
/// configuration, this mixin can provide the static key as well.		/// configuration, this mixin can provide the static key as well.
static AnalysisKey *ID() {		static AnalysisKey *ID() {
static_assert(std::is_base_of<AnalysisInfoMixin, DerivedT>::value,		static_assert(std::is_base_of<AnalysisInfoMixin, DerivedT>::value,
"Must pass the derived type as the template argument!");		"Must pass the derived type as the template argument!");
return &DerivedT::Key;		return &DerivedT::Key;
}		}
};		};

		/// LLVM-provided high-level optimization levels.
		mtrofinUnsubmitted Not Done Reply Inline Actions I think this change - moving OptimizationLevel out - should be in its own patch, to avoid noise. mtrofin: I think this change - moving OptimizationLevel out - should be in its own patch, to avoid noise.
		///
		/// This enumerates the LLVM-provided high-level optimization levels. Each
		/// level has a specific goal and rationale.
		class OptimizationLevel final {
		unsigned SpeedLevel = 2;
		unsigned SizeLevel = 0;
		public:
		OptimizationLevel() = default;

		OptimizationLevel(unsigned SpeedLevel, unsigned SizeLevel)
		: SpeedLevel(SpeedLevel), SizeLevel(SizeLevel) {
		// Check that only valid combinations are passed.
		assert(SpeedLevel <= 3 &&
		"Optimization level for speed should be 0, 1, 2, or 3");
		assert(SizeLevel <= 2 &&
		"Optimization level for size should be 0, 1, or 2");
		assert((SizeLevel == 0 \|\| SpeedLevel == 2) &&
		"Optimize for size should be encoded with speedup level == 2");
		}

		/// Disable as many optimizations as possible. This doesn't completely
		/// disable the optimizer in all cases, for example always_inline functions
		/// can be required to be inlined for correctness.
		static const OptimizationLevel O0;

		/// Optimize quickly without destroying debuggability.
		///
		/// This level is tuned to produce a result from the optimizer as quickly
		/// as possible and to avoid destroying debuggability. This tends to result
		/// in a very good development mode where the compiled code will be
		/// immediately executed as part of testing. As a consequence, where
		/// possible, we would like to produce efficient-to-execute code, but not
		/// if it significantly slows down compilation or would prevent even basic
		/// debugging of the resulting binary.
		///
		/// As an example, complex loop transformations such as versioning,
		/// vectorization, or fusion don't make sense here due to the degree to
		/// which the executed code differs from the source code, and the compile
		/// time cost.
		static const OptimizationLevel O1;
		/// Optimize for fast execution as much as possible without triggering
		/// significant incremental compile time or code size growth.
		///
		/// The key idea is that optimizations at this level should "pay for
		/// themselves". So if an optimization increases compile time by 5% or
		/// increases code size by 5% for a particular benchmark, that benchmark
		/// should also be one which sees a 5% runtime improvement. If the compile
		/// time or code size penalties happen on average across a diverse range of
		/// LLVM users' benchmarks, then the improvements should as well.
		///
		/// And no matter what, the compile time needs to not grow superlinearly
		/// with the size of input to LLVM so that users can control the runtime of
		/// the optimizer in this mode.
		///
		/// This is expected to be a good default optimization level for the vast
		/// majority of users.
		static const OptimizationLevel O2;
		/// Optimize for fast execution as much as possible.
		///
		/// This mode is significantly more aggressive in trading off compile time
		/// and code size to get execution time improvements. The core idea is that
		/// this mode should include any optimization that helps execution time on
		/// balance across a diverse collection of benchmarks, even if it increases
		/// code size or compile time for some benchmarks without corresponding
		/// improvements to execution time.
		///
		/// Despite being willing to trade more compile time off to get improved
		/// execution time, this mode still tries to avoid superlinear growth in
		/// order to make even significantly slower compile times at least scale
		/// reasonably. This does not preclude very substantial constant factor
		/// costs though.
		static const OptimizationLevel O3;
		/// Similar to \c O2 but tries to optimize for small code size instead of
		/// fast execution without triggering significant incremental execution
		/// time slowdowns.
		///
		/// The logic here is exactly the same as \c O2, but with code size and
		/// execution time metrics swapped.
		///
		/// A consequence of the different core goal is that this should in general
		/// produce substantially smaller executables that still run in
		/// a reasonable amount of time.
		static const OptimizationLevel Os;
		/// A very specialized mode that will optimize for code size at any and all
		/// costs.
		///
		/// This is useful primarily when there are absolute size limitations and
		/// any effort taken to reduce the size is worth it regardless of the
		/// execution time impact. You should expect this level to produce rather
		/// slow, but very small, code.
		static const OptimizationLevel Oz;

		bool isOptimizingForSpeed() const {
		return SizeLevel == 0 && SpeedLevel > 0;
		}

		bool isOptimizingForSize() const { return SizeLevel > 0; }

		bool operator==(const OptimizationLevel &Other) const {
		return SizeLevel == Other.SizeLevel && SpeedLevel == Other.SpeedLevel;
		}
		bool operator!=(const OptimizationLevel &Other) const {
		return SizeLevel != Other.SizeLevel \|\| SpeedLevel != Other.SpeedLevel;
		}

		unsigned getSpeedupLevel() const { return SpeedLevel; }

		unsigned getSizeLevel() const { return SizeLevel; }
		};

namespace detail {		namespace detail {

/// Actual unpacker of extra arguments in getAnalysisResult,		/// Actual unpacker of extra arguments in getAnalysisResult,
/// passes only those tuple arguments that are mentioned in index_sequence.		/// passes only those tuple arguments that are mentioned in index_sequence.
template <typename PassT, typename IRUnitT, typename AnalysisManagerT,		template <typename PassT, typename IRUnitT, typename AnalysisManagerT,
typename... ArgTs, size_t... Ns>		typename... ArgTs, size_t... Ns>
typename PassT::Result		typename PassT::Result
getAnalysisResultUnpackTuple(AnalysisManagerT &AM, IRUnitT &IR,		getAnalysisResultUnpackTuple(AnalysisManagerT &AM, IRUnitT &IR,
▲ Show 20 Lines • Show All 960 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/Config.h

Show All 40 Lines	struct Config {
std::string CPU;		std::string CPU;
TargetOptions Options;		TargetOptions Options;
std::vector<std::string> MAttrs;		std::vector<std::string> MAttrs;
std::vector<std::string> PassPlugins;		std::vector<std::string> PassPlugins;
Optional<Reloc::Model> RelocModel = Reloc::PIC_;		Optional<Reloc::Model> RelocModel = Reloc::PIC_;
Optional<CodeModel::Model> CodeModel = None;		Optional<CodeModel::Model> CodeModel = None;
CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;		CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;
CodeGenFileType CGFileType = CGFT_ObjectFile;		CodeGenFileType CGFileType = CGFT_ObjectFile;
unsigned OptLevel = 2;		OptimizationLevel OptLevel = OptimizationLevel::O2;
bool DisableVerify = false;		bool DisableVerify = false;

/// Use the new pass manager		/// Use the new pass manager
bool UseNewPM = false;		bool UseNewPM = false;

/// Flag to indicate that the optimizer should not assume builtins are present		/// Flag to indicate that the optimizer should not assume builtins are present
/// on the target.		/// on the target.
bool Freestanding = false;		bool Freestanding = false;
▲ Show 20 Lines • Show All 203 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/legacy/LTOCodeGenerator.h

Show All 35 Lines

#include "llvm-c/lto.h"		#include "llvm-c/lto.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
#include "llvm/IR/GlobalValue.h"		#include "llvm/IR/GlobalValue.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
		#include "llvm/IR/PassManager.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Support/ToolOutputFile.h"		#include "llvm/Support/ToolOutputFile.h"
#include "llvm/Target/TargetMachine.h"		#include "llvm/Target/TargetMachine.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"
#include <string>		#include <string>
#include <vector>		#include <vector>

Show All 37 Lines	struct LTOCodeGenerator {
void setCodePICModel(Optional<Reloc::Model> Model) { RelocModel = Model; }		void setCodePICModel(Optional<Reloc::Model> Model) { RelocModel = Model; }

/// Set the file type to be emitted (assembly or object code).		/// Set the file type to be emitted (assembly or object code).
/// The default is CGFT_ObjectFile.		/// The default is CGFT_ObjectFile.
void setFileType(CodeGenFileType FT) { FileType = FT; }		void setFileType(CodeGenFileType FT) { FileType = FT; }

void setCpu(StringRef MCpu) { this->MCpu = std::string(MCpu); }		void setCpu(StringRef MCpu) { this->MCpu = std::string(MCpu); }
void setAttr(StringRef MAttr) { this->MAttr = std::string(MAttr); }		void setAttr(StringRef MAttr) { this->MAttr = std::string(MAttr); }
void setOptLevel(unsigned OptLevel);		void setOptLevel(OptimizationLevel OptLevel);

void setShouldInternalize(bool Value) { ShouldInternalize = Value; }		void setShouldInternalize(bool Value) { ShouldInternalize = Value; }
void setShouldEmbedUselists(bool Value) { ShouldEmbedUselists = Value; }		void setShouldEmbedUselists(bool Value) { ShouldEmbedUselists = Value; }

/// Restore linkage of globals		/// Restore linkage of globals
///		///
/// When set, the linkage of globals will be restored prior to code		/// When set, the linkage of globals will be restored prior to code
/// generation. That is, a global symbol that had external linkage prior to		/// generation. That is, a global symbol that had external linkage prior to
▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	private:
std::string FeatureStr;		std::string FeatureStr;
std::string MCpu;		std::string MCpu;
std::string MAttr;		std::string MAttr;
std::string NativeObjectPath;		std::string NativeObjectPath;
TargetOptions Options;		TargetOptions Options;
CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;		CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;
const Target *MArch = nullptr;		const Target *MArch = nullptr;
std::string TripleStr;		std::string TripleStr;
unsigned OptLevel = 2;		OptimizationLevel OptLevel = OptimizationLevel::O2;
lto_diagnostic_handler_t DiagHandler = nullptr;		lto_diagnostic_handler_t DiagHandler = nullptr;
void *DiagContext = nullptr;		void *DiagContext = nullptr;
bool ShouldInternalize = EnableLTOInternalization;		bool ShouldInternalize = EnableLTOInternalization;
bool ShouldEmbedUselists = false;		bool ShouldEmbedUselists = false;
bool ShouldRestoreGlobalsLinkage = false;		bool ShouldRestoreGlobalsLinkage = false;
CodeGenFileType FileType = CGFT_ObjectFile;		CodeGenFileType FileType = CGFT_ObjectFile;
std::unique_ptr<ToolOutputFile> DiagnosticOutputFile;		std::unique_ptr<ToolOutputFile> DiagnosticOutputFile;
bool Freestanding = false;		bool Freestanding = false;
std::unique_ptr<ToolOutputFile> StatsFile = nullptr;		std::unique_ptr<ToolOutputFile> StatsFile = nullptr;
};		};
}		}
#endif		#endif

llvm/include/llvm/LTO/legacy/ThinLTOCodeGenerator.h

Show All 13 Lines

#ifndef LLVM_LTO_THINLTOCODEGENERATOR_H		#ifndef LLVM_LTO_THINLTOCODEGENERATOR_H
#define LLVM_LTO_THINLTOCODEGENERATOR_H		#define LLVM_LTO_THINLTOCODEGENERATOR_H

#include "llvm-c/lto.h"		#include "llvm-c/lto.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
#include "llvm/ADT/Triple.h"		#include "llvm/ADT/Triple.h"
#include "llvm/IR/ModuleSummaryIndex.h"		#include "llvm/IR/ModuleSummaryIndex.h"
		#include "llvm/IR/PassManager.h"
		mtrofinUnsubmitted Not Done Reply Inline Actions It's unfortunate we now need to pull pass management into places that didn't have that dependency. IIUC, the goal of this overall effort includes piping though the full user-requested optimization parameters (i.e. both speed and size). Given the likely diversity of the consumers, it may make sense to move OptimizationLevel in its own header? mtrofin: It's unfortunate we now need to pull pass management into places that didn't have that…
#include "llvm/LTO/LTO.h"		#include "llvm/LTO/LTO.h"
#include "llvm/Support/CachePruning.h"		#include "llvm/Support/CachePruning.h"
#include "llvm/Support/CodeGen.h"		#include "llvm/Support/CodeGen.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"

#include <string>		#include <string>

▲ Show 20 Lines • Show All 186 Lines • ▼ Show 20 Lines	void setCodePICModel(Optional<Reloc::Model> Model) {
TMBuilder.RelocModel = Model;		TMBuilder.RelocModel = Model;
}		}

/// CodeGen optimization level		/// CodeGen optimization level
void setCodeGenOptLevel(CodeGenOpt::Level CGOptLevel) {		void setCodeGenOptLevel(CodeGenOpt::Level CGOptLevel) {
TMBuilder.CGOptLevel = CGOptLevel;		TMBuilder.CGOptLevel = CGOptLevel;
}		}

/// IR optimization level: from 0 to 3.		/// IR optimization level
void setOptLevel(unsigned NewOptLevel) {		void setOptLevel(OptimizationLevel NewOptLevel) {
OptLevel = (NewOptLevel > 3) ? 3 : NewOptLevel;		OptLevel = NewOptLevel;
}		}

/// Disable CodeGen, only run the stages till codegen and stop. The output		/// Disable CodeGen, only run the stages till codegen and stop. The output
/// will be bitcode.		/// will be bitcode.
void disableCodeGen(bool Disable) { DisableCodeGen = Disable; }		void disableCodeGen(bool Disable) { DisableCodeGen = Disable; }

/// Perform CodeGen only: disable all other stages.		/// Perform CodeGen only: disable all other stages.
void setCodeGenOnly(bool CGOnly) { CodeGenOnly = CGOnly; }		void setCodeGenOnly(bool CGOnly) { CodeGenOnly = CGOnly; }
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	private:
/// Flag to indicate that only the CodeGen will be performed, no cross-module		/// Flag to indicate that only the CodeGen will be performed, no cross-module
/// importing or optimization.		/// importing or optimization.
bool CodeGenOnly = false;		bool CodeGenOnly = false;

/// Flag to indicate that the optimizer should not assume builtins are present		/// Flag to indicate that the optimizer should not assume builtins are present
/// on the target.		/// on the target.
bool Freestanding = false;		bool Freestanding = false;

/// IR Optimization Level [0-3].		/// IR Optimization Level.
unsigned OptLevel = 3;		OptimizationLevel OptLevel = OptimizationLevel::O3;
};		};
}		}
#endif		#endif

llvm/include/llvm/Passes/PassBuilder.h

Show First 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	enum class ThinLTOPhase {
/// No ThinLTO behavior needed.		/// No ThinLTO behavior needed.
None,		None,
/// ThinLTO prelink (summary) phase.		/// ThinLTO prelink (summary) phase.
PreLink,		PreLink,
/// ThinLTO postlink (backend compile) phase.		/// ThinLTO postlink (backend compile) phase.
PostLink		PostLink
};		};

/// LLVM-provided high-level optimization levels.
///
/// This enumerates the LLVM-provided high-level optimization levels. Each
/// level has a specific goal and rationale.
class OptimizationLevel final {
unsigned SpeedLevel = 2;
unsigned SizeLevel = 0;
OptimizationLevel(unsigned SpeedLevel, unsigned SizeLevel)
: SpeedLevel(SpeedLevel), SizeLevel(SizeLevel) {
// Check that only valid combinations are passed.
assert(SpeedLevel <= 3 &&
"Optimization level for speed should be 0, 1, 2, or 3");
assert(SizeLevel <= 2 &&
"Optimization level for size should be 0, 1, or 2");
assert((SizeLevel == 0 \|\| SpeedLevel == 2) &&
"Optimize for size should be encoded with speedup level == 2");
}

public:
OptimizationLevel() = default;
/// Disable as many optimizations as possible. This doesn't completely
/// disable the optimizer in all cases, for example always_inline functions
/// can be required to be inlined for correctness.
static const OptimizationLevel O0;

/// Optimize quickly without destroying debuggability.
///
/// This level is tuned to produce a result from the optimizer as quickly
/// as possible and to avoid destroying debuggability. This tends to result
/// in a very good development mode where the compiled code will be
/// immediately executed as part of testing. As a consequence, where
/// possible, we would like to produce efficient-to-execute code, but not
/// if it significantly slows down compilation or would prevent even basic
/// debugging of the resulting binary.
///
/// As an example, complex loop transformations such as versioning,
/// vectorization, or fusion don't make sense here due to the degree to
/// which the executed code differs from the source code, and the compile
/// time cost.
static const OptimizationLevel O1;
/// Optimize for fast execution as much as possible without triggering
/// significant incremental compile time or code size growth.
///
/// The key idea is that optimizations at this level should "pay for
/// themselves". So if an optimization increases compile time by 5% or
/// increases code size by 5% for a particular benchmark, that benchmark
/// should also be one which sees a 5% runtime improvement. If the compile
/// time or code size penalties happen on average across a diverse range of
/// LLVM users' benchmarks, then the improvements should as well.
///
/// And no matter what, the compile time needs to not grow superlinearly
/// with the size of input to LLVM so that users can control the runtime of
/// the optimizer in this mode.
///
/// This is expected to be a good default optimization level for the vast
/// majority of users.
static const OptimizationLevel O2;
/// Optimize for fast execution as much as possible.
///
/// This mode is significantly more aggressive in trading off compile time
/// and code size to get execution time improvements. The core idea is that
/// this mode should include any optimization that helps execution time on
/// balance across a diverse collection of benchmarks, even if it increases
/// code size or compile time for some benchmarks without corresponding
/// improvements to execution time.
///
/// Despite being willing to trade more compile time off to get improved
/// execution time, this mode still tries to avoid superlinear growth in
/// order to make even significantly slower compile times at least scale
/// reasonably. This does not preclude very substantial constant factor
/// costs though.
static const OptimizationLevel O3;
/// Similar to \c O2 but tries to optimize for small code size instead of
/// fast execution without triggering significant incremental execution
/// time slowdowns.
///
/// The logic here is exactly the same as \c O2, but with code size and
/// execution time metrics swapped.
///
/// A consequence of the different core goal is that this should in general
/// produce substantially smaller executables that still run in
/// a reasonable amount of time.
static const OptimizationLevel Os;
/// A very specialized mode that will optimize for code size at any and all
/// costs.
///
/// This is useful primarily when there are absolute size limitations and
/// any effort taken to reduce the size is worth it regardless of the
/// execution time impact. You should expect this level to produce rather
/// slow, but very small, code.
static const OptimizationLevel Oz;

bool isOptimizingForSpeed() const {
return SizeLevel == 0 && SpeedLevel > 0;
}

bool isOptimizingForSize() const { return SizeLevel > 0; }

bool operator==(const OptimizationLevel &Other) const {
return SizeLevel == Other.SizeLevel && SpeedLevel == Other.SpeedLevel;
}
bool operator!=(const OptimizationLevel &Other) const {
return SizeLevel != Other.SizeLevel \|\| SpeedLevel != Other.SpeedLevel;
}

unsigned getSpeedupLevel() const { return SpeedLevel; }

unsigned getSizeLevel() const { return SizeLevel; }
};

explicit PassBuilder(TargetMachine *TM = nullptr,		explicit PassBuilder(TargetMachine *TM = nullptr,
PipelineTuningOptions PTO = PipelineTuningOptions(),		PipelineTuningOptions PTO = PipelineTuningOptions(),
Optional<PGOOptions> PGOOpt = None,		Optional<PGOOptions> PGOOpt = None,
PassInstrumentationCallbacks *PIC = nullptr)		PassInstrumentationCallbacks *PIC = nullptr)
: TM(TM), PTO(PTO), PGOOpt(PGOOpt), PIC(PIC) {}		: TM(TM), PTO(PTO), PGOOpt(PGOOpt), PIC(PIC) {}

/// Cross register the analysis managers through their proxies.		/// Cross register the analysis managers through their proxies.
///		///
▲ Show 20 Lines • Show All 551 Lines • Show Last 20 Lines

llvm/include/llvm/Transforms/IPO/PassManagerBuilder.h

Show All 9 Lines
// "standard" optimization sequence suitable for languages like C and C++.		// "standard" optimization sequence suitable for languages like C and C++.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TRANSFORMS_IPO_PASSMANAGERBUILDER_H		#ifndef LLVM_TRANSFORMS_IPO_PASSMANAGERBUILDER_H
#define LLVM_TRANSFORMS_IPO_PASSMANAGERBUILDER_H		#define LLVM_TRANSFORMS_IPO_PASSMANAGERBUILDER_H

#include "llvm-c/Transforms/PassManagerBuilder.h"		#include "llvm-c/Transforms/PassManagerBuilder.h"
		#include "llvm/IR/PassManager.h"
#include <functional>		#include <functional>
#include <memory>		#include <memory>
#include <string>		#include <string>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {
class ModuleSummaryIndex;		class ModuleSummaryIndex;
class Pass;		class Pass;
class TargetLibraryInfoImpl;		class TargetLibraryInfoImpl;
class TargetMachine;		class TargetMachine;

// The old pass manager infrastructure is hidden in a legacy namespace now.		// The old pass manager infrastructure is hidden in a legacy namespace now.
namespace legacy {		namespace legacy {
class FunctionPassManager;		class FunctionPassManager;
class PassManagerBase;		class PassManagerBase;
}		}

/// PassManagerBuilder - This class is used to set up a standard optimization		/// PassManagerBuilder - This class is used to set up a standard optimization
/// sequence for languages like C and C++, allowing some APIs to customize the		/// sequence for languages like C and C++, allowing some APIs to customize the
/// pass sequence in various ways. A simple example of using it would be:		/// pass sequence in various ways. A simple example of using it would be:
///		///
/// PassManagerBuilder Builder;		/// PassManagerBuilder Builder;
/// Builder.OptLevel = 2;		/// Builder.OptLevel = OptimizationLevel::O2;
/// Builder.populateFunctionPassManager(FPM);		/// Builder.populateFunctionPassManager(FPM);
/// Builder.populateModulePassManager(MPM);		/// Builder.populateModulePassManager(MPM);
///		///
/// In addition to setting up the basic passes, PassManagerBuilder allows		/// In addition to setting up the basic passes, PassManagerBuilder allows
/// frontends to vend a plugin API, where plugins are allowed to add extensions		/// frontends to vend a plugin API, where plugins are allowed to add extensions
/// to the default pass manager. They do this by specifying where in the pass		/// to the default pass manager. They do this by specifying where in the pass
/// pipeline they want to be added, along with a callback function that adds		/// pipeline they want to be added, along with a callback function that adds
/// the pass(es). For example, a plugin that wanted to add a loop optimization		/// the pass(es). For example, a plugin that wanted to add a loop optimization
▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines	enum ExtensionPointTy {

/// EP_FullLinkTimeOptimizationLast - This extensions point allow adding		/// EP_FullLinkTimeOptimizationLast - This extensions point allow adding
/// passes that		/// passes that
/// run at Link Time, after Full Link Time Optimization.		/// run at Link Time, after Full Link Time Optimization.
EP_FullLinkTimeOptimizationLast,		EP_FullLinkTimeOptimizationLast,
};		};

/// The Optimization Level - Specify the basic optimization level.		/// The Optimization Level - Specify the basic optimization level.
/// 0 = -O0, 1 = -O1, 2 = -O2, 3 = -O3		OptimizationLevel OptLevel;
unsigned OptLevel;

/// SizeLevel - How much we're optimizing for size.
/// 0 = none, 1 = -Os, 2 = -Oz
unsigned SizeLevel;

/// LibraryInfo - Specifies information about the runtime library for the		/// LibraryInfo - Specifies information about the runtime library for the
/// optimizer. If this is non-null, it is added to both the function and		/// optimizer. If this is non-null, it is added to both the function and
/// per-module pass pipeline.		/// per-module pass pipeline.
TargetLibraryInfoImpl *LibraryInfo;		TargetLibraryInfoImpl *LibraryInfo;

/// Inliner - Specifies the inliner to use. If this is non-null, it is		/// Inliner - Specifies the inliner to use. If this is non-null, it is
/// added to the per-module passes.		/// added to the per-module passes.
▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

llvm/lib/IR/PassManager.cpp

Show First 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	bool FunctionAnalysisManagerModuleProxy::Result::invalidate(
// Return false to indicate that this result is still a valid proxy.		// Return false to indicate that this result is still a valid proxy.
return false;		return false;
}		}
}		}

AnalysisSetKey CFGAnalyses::SetKey;		AnalysisSetKey CFGAnalyses::SetKey;

AnalysisSetKey PreservedAnalyses::AllAnalysesKey;		AnalysisSetKey PreservedAnalyses::AllAnalysesKey;

		const OptimizationLevel OptimizationLevel::O0 = {
		/SpeedLevel/ 0,
		/SizeLevel/ 0};
		const OptimizationLevel OptimizationLevel::O1 = {
		/SpeedLevel/ 1,
		/SizeLevel/ 0};
		const OptimizationLevel OptimizationLevel::O2 = {
		/SpeedLevel/ 2,
		/SizeLevel/ 0};
		const OptimizationLevel OptimizationLevel::O3 = {
		/SpeedLevel/ 3,
		/SizeLevel/ 0};
		const OptimizationLevel OptimizationLevel::Os = {
		/SpeedLevel/ 2,
		/SizeLevel/ 1};
		const OptimizationLevel OptimizationLevel::Oz = {
		/SpeedLevel/ 2,
		/SizeLevel/ 2};

llvm/lib/LTO/LTO.cpp

Show First 20 Lines • Show All 121 Lines • ▼ Show 20 Lines	#endif
else		else
AddUnsigned(-1);		AddUnsigned(-1);
if (Conf.CodeModel)		if (Conf.CodeModel)
AddUnsigned(*Conf.CodeModel);		AddUnsigned(*Conf.CodeModel);
else		else
AddUnsigned(-1);		AddUnsigned(-1);
AddUnsigned(Conf.CGOptLevel);		AddUnsigned(Conf.CGOptLevel);
AddUnsigned(Conf.CGFileType);		AddUnsigned(Conf.CGFileType);
AddUnsigned(Conf.OptLevel);		AddUnsigned(Conf.OptLevel.getSpeedupLevel());
		AddUnsigned(Conf.OptLevel.getSizeLevel());
AddUnsigned(Conf.UseNewPM);		AddUnsigned(Conf.UseNewPM);
AddUnsigned(Conf.Freestanding);		AddUnsigned(Conf.Freestanding);
AddString(Conf.OptPipeline);		AddString(Conf.OptPipeline);
AddString(Conf.AAPipeline);		AddString(Conf.AAPipeline);
AddString(Conf.OverrideTriple);		AddString(Conf.OverrideTriple);
AddString(Conf.DefaultTriple);		AddString(Conf.DefaultTriple);
AddString(Conf.DwoDir);		AddString(Conf.DwoDir);

▲ Show 20 Lines • Show All 780 Lines • ▼ Show 20 Lines	Error LTO::run(AddStreamFn AddStream, NativeObjectCache Cache) {

auto isPrevailing = [&](GlobalValue::GUID G) {		auto isPrevailing = [&](GlobalValue::GUID G) {
auto It = GUIDPrevailingResolutions.find(G);		auto It = GUIDPrevailingResolutions.find(G);
if (It == GUIDPrevailingResolutions.end())		if (It == GUIDPrevailingResolutions.end())
return PrevailingType::Unknown;		return PrevailingType::Unknown;
return It->second;		return It->second;
};		};
computeDeadSymbolsWithConstProp(ThinLTO.CombinedIndex, GUIDPreservedSymbols,		computeDeadSymbolsWithConstProp(ThinLTO.CombinedIndex, GUIDPreservedSymbols,
isPrevailing, Conf.OptLevel > 0);		isPrevailing, Conf.OptLevel != OptimizationLevel::O0);

// Setup output file to emit statistics.		// Setup output file to emit statistics.
auto StatsFileOrErr = setupStatsFile(Conf.StatsFile);		auto StatsFileOrErr = setupStatsFile(Conf.StatsFile);
if (!StatsFileOrErr)		if (!StatsFileOrErr)
return StatsFileOrErr.takeError();		return StatsFileOrErr.takeError();
std::unique_ptr<ToolOutputFile> StatsFile = std::move(StatsFileOrErr.get());		std::unique_ptr<ToolOutputFile> StatsFile = std::move(StatsFileOrErr.get());

Error Result = runRegularLTO(AddStream);		Error Result = runRegularLTO(AddStream);
▲ Show 20 Lines • Show All 398 Lines • ▼ Show 20 Lines	Error LTO::runThinLTO(AddStreamFn AddStream, NativeObjectCache Cache,

// Perform index-based WPD. This will return immediately if there are		// Perform index-based WPD. This will return immediately if there are
// no index entries in the typeIdMetadata map (e.g. if we are instead		// no index entries in the typeIdMetadata map (e.g. if we are instead
// performing IR-based WPD in hybrid regular/thin LTO mode).		// performing IR-based WPD in hybrid regular/thin LTO mode).
std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;		std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;
runWholeProgramDevirtOnIndex(ThinLTO.CombinedIndex, ExportedGUIDs,		runWholeProgramDevirtOnIndex(ThinLTO.CombinedIndex, ExportedGUIDs,
LocalWPDTargetsMap);		LocalWPDTargetsMap);

if (Conf.OptLevel > 0)		if (Conf.OptLevel != OptimizationLevel::O0)
ComputeCrossModuleImport(ThinLTO.CombinedIndex, ModuleToDefinedGVSummaries,		ComputeCrossModuleImport(ThinLTO.CombinedIndex, ModuleToDefinedGVSummaries,
ImportLists, ExportLists);		ImportLists, ExportLists);

// Figure out which symbols need to be internalized. This also needs to happen		// Figure out which symbols need to be internalized. This also needs to happen
// at -O0 because summary-based DCE is implemented using internalization, and		// at -O0 because summary-based DCE is implemented using internalization, and
// we must apply DCE consistently with the full LTO module in order to avoid		// we must apply DCE consistently with the full LTO module in order to avoid
// undefined references during the final link.		// undefined references during the final link.
for (auto &Res : GlobalResolutions) {		for (auto &Res : GlobalResolutions) {
▲ Show 20 Lines • Show All 100 Lines • Show Last 20 Lines

llvm/lib/LTO/LTOBackend.cpp

Show First 20 Lines • Show All 174 Lines • ▼ Show 20 Lines	else
CodeModel = M.getCodeModel();		CodeModel = M.getCodeModel();

return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(		return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(
TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,		TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,
CodeModel, Conf.CGOptLevel));		CodeModel, Conf.CGOptLevel));
}		}

static void runNewPMPasses(const Config &Conf, Module &Mod, TargetMachine *TM,		static void runNewPMPasses(const Config &Conf, Module &Mod, TargetMachine *TM,
unsigned OptLevel, bool IsThinLTO,		bool IsThinLTO,
ModuleSummaryIndex *ExportSummary,		ModuleSummaryIndex *ExportSummary,
const ModuleSummaryIndex *ImportSummary) {		const ModuleSummaryIndex *ImportSummary) {
Optional<PGOOptions> PGOOpt;		Optional<PGOOptions> PGOOpt;
if (!Conf.SampleProfile.empty())		if (!Conf.SampleProfile.empty())
PGOOpt = PGOOptions(Conf.SampleProfile, "", Conf.ProfileRemapping,		PGOOpt = PGOOptions(Conf.SampleProfile, "", Conf.ProfileRemapping,
PGOOptions::SampleUse, PGOOptions::NoCSAction, true);		PGOOptions::SampleUse, PGOOptions::NoCSAction, true);
else if (Conf.RunCSIRInstr) {		else if (Conf.RunCSIRInstr) {
PGOOpt = PGOOptions("", Conf.CSIRProfile, Conf.ProfileRemapping,		PGOOpt = PGOOptions("", Conf.CSIRProfile, Conf.ProfileRemapping,
Show All 28 Lines	static void runNewPMPasses(const Config &Conf, Module &Mod, TargetMachine *TM,
PB.registerCGSCCAnalyses(CGAM);		PB.registerCGSCCAnalyses(CGAM);
PB.registerFunctionAnalyses(FAM);		PB.registerFunctionAnalyses(FAM);
PB.registerLoopAnalyses(LAM);		PB.registerLoopAnalyses(LAM);
PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);		PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);

ModulePassManager MPM(Conf.DebugPassManager);		ModulePassManager MPM(Conf.DebugPassManager);
// FIXME (davide): verify the input.		// FIXME (davide): verify the input.

PassBuilder::OptimizationLevel OL;		OptimizationLevel OL = Conf.OptLevel;

switch (OptLevel) {
default:
llvm_unreachable("Invalid optimization level");
case 0:
OL = PassBuilder::OptimizationLevel::O0;
break;
case 1:
OL = PassBuilder::OptimizationLevel::O1;
break;
case 2:
OL = PassBuilder::OptimizationLevel::O2;
break;
case 3:
OL = PassBuilder::OptimizationLevel::O3;
break;
}

if (IsThinLTO)		if (IsThinLTO)
MPM = PB.buildThinLTODefaultPipeline(OL, Conf.DebugPassManager,		MPM = PB.buildThinLTODefaultPipeline(OL, Conf.DebugPassManager,
ImportSummary);		ImportSummary);
else		else
MPM = PB.buildLTODefaultPipeline(OL, Conf.DebugPassManager, ExportSummary);		MPM = PB.buildLTODefaultPipeline(OL, Conf.DebugPassManager, ExportSummary);
MPM.run(Mod, MAM);		MPM.run(Mod, MAM);

▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines
bool opt(const Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,		bool opt(const Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,
bool IsThinLTO, ModuleSummaryIndex *ExportSummary,		bool IsThinLTO, ModuleSummaryIndex *ExportSummary,
const ModuleSummaryIndex *ImportSummary) {		const ModuleSummaryIndex *ImportSummary) {
// FIXME: Plumb the combined index into the new pass manager.		// FIXME: Plumb the combined index into the new pass manager.
if (!Conf.OptPipeline.empty())		if (!Conf.OptPipeline.empty())
runNewPMCustomPasses(Conf, Mod, TM, Conf.OptPipeline, Conf.AAPipeline,		runNewPMCustomPasses(Conf, Mod, TM, Conf.OptPipeline, Conf.AAPipeline,
Conf.DisableVerify);		Conf.DisableVerify);
else if (Conf.UseNewPM)		else if (Conf.UseNewPM)
runNewPMPasses(Conf, Mod, TM, Conf.OptLevel, IsThinLTO, ExportSummary,		runNewPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary,
ImportSummary);		ImportSummary);
else		else
runOldPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);		runOldPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);
return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);		return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);
}		}

static cl::opt<bool> EmbedBitcode(		static cl::opt<bool> EmbedBitcode(
"lto-embed-bitcode", cl::init(false),		"lto-embed-bitcode", cl::init(false),
▲ Show 20 Lines • Show All 253 Lines • Show Last 20 Lines

llvm/lib/LTO/LTOCodeGenerator.cpp

Show First 20 Lines • Show All 195 Lines • ▼ Show 20 Lines	void LTOCodeGenerator::setDebugInfo(lto_debug_model Debug) {

case LTO_DEBUG_MODEL_DWARF:		case LTO_DEBUG_MODEL_DWARF:
EmitDwarfDebugInfo = true;		EmitDwarfDebugInfo = true;
return;		return;
}		}
llvm_unreachable("Unknown debug format!");		llvm_unreachable("Unknown debug format!");
}		}

void LTOCodeGenerator::setOptLevel(unsigned Level) {		void LTOCodeGenerator::setOptLevel(OptimizationLevel Level) {
OptLevel = Level;		OptLevel = Level;
switch (OptLevel) {		switch (OptLevel.getSpeedupLevel()) {
case 0:		case 0:
CGOptLevel = CodeGenOpt::None;		CGOptLevel = CodeGenOpt::None;
return;		return;
case 1:		case 1:
CGOptLevel = CodeGenOpt::Less;		CGOptLevel = CodeGenOpt::Less;
return;		return;
case 2:		case 2:
CGOptLevel = CodeGenOpt::Default;		CGOptLevel = CodeGenOpt::Default;
▲ Show 20 Lines • Show All 515 Lines • Show Last 20 Lines

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

Show First 20 Lines • Show All 224 Lines • ▼ Show 20 Lines	if (!Result) {
});		});
report_fatal_error("importFunctions failed");		report_fatal_error("importFunctions failed");
}		}
// Verify again after cross-importing.		// Verify again after cross-importing.
verifyLoadedModule(TheModule);		verifyLoadedModule(TheModule);
}		}

static void optimizeModule(Module &TheModule, TargetMachine &TM,		static void optimizeModule(Module &TheModule, TargetMachine &TM,
unsigned OptLevel, bool Freestanding,		OptimizationLevel OptLevel, bool Freestanding,
ModuleSummaryIndex *Index) {		ModuleSummaryIndex *Index) {
// Populate the PassManager		// Populate the PassManager
PassManagerBuilder PMB;		PassManagerBuilder PMB;
PMB.LibraryInfo = new TargetLibraryInfoImpl(TM.getTargetTriple());		PMB.LibraryInfo = new TargetLibraryInfoImpl(TM.getTargetTriple());
if (Freestanding)		if (Freestanding)
PMB.LibraryInfo->disableAllFunctions();		PMB.LibraryInfo->disableAllFunctions();
PMB.Inliner = createFunctionInliningPass();		PMB.Inliner = createFunctionInliningPass();
// FIXME: should get it from the bitcode?		// FIXME: should get it from the bitcode?
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	public:
// Create a cache entry. This compute a unique hash for the Module considering		// Create a cache entry. This compute a unique hash for the Module considering
// the current list of export/import, and offer an interface to query to		// the current list of export/import, and offer an interface to query to
// access the content in the cache.		// access the content in the cache.
ModuleCacheEntry(		ModuleCacheEntry(
StringRef CachePath, const ModuleSummaryIndex &Index, StringRef ModuleID,		StringRef CachePath, const ModuleSummaryIndex &Index, StringRef ModuleID,
const FunctionImporter::ImportMapTy &ImportList,		const FunctionImporter::ImportMapTy &ImportList,
const FunctionImporter::ExportSetTy &ExportList,		const FunctionImporter::ExportSetTy &ExportList,
const std::map<GlobalValue::GUID, GlobalValue::LinkageTypes> &ResolvedODR,		const std::map<GlobalValue::GUID, GlobalValue::LinkageTypes> &ResolvedODR,
const GVSummaryMapTy &DefinedGVSummaries, unsigned OptLevel,		const GVSummaryMapTy &DefinedGVSummaries, OptimizationLevel OptLevel,
bool Freestanding, const TargetMachineBuilder &TMBuilder) {		bool Freestanding, const TargetMachineBuilder &TMBuilder) {
if (CachePath.empty())		if (CachePath.empty())
return;		return;

if (!Index.modulePaths().count(ModuleID))		if (!Index.modulePaths().count(ModuleID))
// The module does not have an entry, it can't have a hash at all		// The module does not have an entry, it can't have a hash at all
return;		return;

▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines
ProcessThinLTOModule(Module &TheModule, ModuleSummaryIndex &Index,		ProcessThinLTOModule(Module &TheModule, ModuleSummaryIndex &Index,
StringMap<lto::InputFile *> &ModuleMap, TargetMachine &TM,		StringMap<lto::InputFile *> &ModuleMap, TargetMachine &TM,
const FunctionImporter::ImportMapTy &ImportList,		const FunctionImporter::ImportMapTy &ImportList,
const FunctionImporter::ExportSetTy &ExportList,		const FunctionImporter::ExportSetTy &ExportList,
const DenseSet<GlobalValue::GUID> &GUIDPreservedSymbols,		const DenseSet<GlobalValue::GUID> &GUIDPreservedSymbols,
const GVSummaryMapTy &DefinedGlobals,		const GVSummaryMapTy &DefinedGlobals,
const ThinLTOCodeGenerator::CachingOptions &CacheOptions,		const ThinLTOCodeGenerator::CachingOptions &CacheOptions,
bool DisableCodeGen, StringRef SaveTempsDir,		bool DisableCodeGen, StringRef SaveTempsDir,
bool Freestanding, unsigned OptLevel, unsigned count) {		bool Freestanding, OptimizationLevel OptLevel, unsigned count) {

// "Benchmark"-like optimization: single-source case		// "Benchmark"-like optimization: single-source case
bool SingleModule = (ModuleMap.size() == 1);		bool SingleModule = (ModuleMap.size() == 1);

// When linking an ELF shared object, dso_local should be dropped. We		// When linking an ELF shared object, dso_local should be dropped. We
// conservatively do this for -fpic.		// conservatively do this for -fpic.
bool ClearDSOLocalOnDeclarations =		bool ClearDSOLocalOnDeclarations =
TM.getTargetTriple().isOSBinFormatELF() &&		TM.getTargetTriple().isOSBinFormatELF() &&
▲ Show 20 Lines • Show All 739 Lines • Show Last 20 Lines

llvm/lib/Passes/PassBuilder.cpp

Show First 20 Lines • Show All 274 Lines • ▼ Show 20 Lines
extern cl::opt<bool> EnableHotColdSplit;		extern cl::opt<bool> EnableHotColdSplit;
extern cl::opt<bool> EnableOrderFileInstrumentation;		extern cl::opt<bool> EnableOrderFileInstrumentation;

extern cl::opt<bool> FlattenedProfileUsed;		extern cl::opt<bool> FlattenedProfileUsed;

extern cl::opt<AttributorRunOption> AttributorRun;		extern cl::opt<AttributorRunOption> AttributorRun;
extern cl::opt<bool> EnableKnowledgeRetention;		extern cl::opt<bool> EnableKnowledgeRetention;

const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::O0 = {
/SpeedLevel/ 0,
/SizeLevel/ 0};
const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::O1 = {
/SpeedLevel/ 1,
/SizeLevel/ 0};
const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::O2 = {
/SpeedLevel/ 2,
/SizeLevel/ 0};
const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::O3 = {
/SpeedLevel/ 3,
/SizeLevel/ 0};
const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::Os = {
/SpeedLevel/ 2,
/SizeLevel/ 1};
const PassBuilder::OptimizationLevel PassBuilder::OptimizationLevel::Oz = {
/SpeedLevel/ 2,
/SizeLevel/ 2};

namespace {		namespace {

/// No-op module pass which does nothing.		/// No-op module pass which does nothing.
struct NoOpModulePass {		struct NoOpModulePass {
PreservedAnalyses run(Module &M, ModuleAnalysisManager &) {		PreservedAnalyses run(Module &M, ModuleAnalysisManager &) {
return PreservedAnalyses::all();		return PreservedAnalyses::all();
}		}
static StringRef name() { return "NoOpModulePass"; }		static StringRef name() { return "NoOpModulePass"; }
▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines
AnalysisKey NoOpModuleAnalysis::Key;		AnalysisKey NoOpModuleAnalysis::Key;
AnalysisKey NoOpCGSCCAnalysis::Key;		AnalysisKey NoOpCGSCCAnalysis::Key;
AnalysisKey NoOpFunctionAnalysis::Key;		AnalysisKey NoOpFunctionAnalysis::Key;
AnalysisKey NoOpLoopAnalysis::Key;		AnalysisKey NoOpLoopAnalysis::Key;

} // End anonymous namespace.		} // End anonymous namespace.

void PassBuilder::invokePeepholeEPCallbacks(		void PassBuilder::invokePeepholeEPCallbacks(
FunctionPassManager &FPM, PassBuilder::OptimizationLevel Level) {		FunctionPassManager &FPM, OptimizationLevel Level) {
for (auto &C : PeepholeEPCallbacks)		for (auto &C : PeepholeEPCallbacks)
C(FPM, Level);		C(FPM, Level);
}		}

void PassBuilder::registerModuleAnalyses(ModuleAnalysisManager &MAM) {		void PassBuilder::registerModuleAnalyses(ModuleAnalysisManager &MAM) {
#define MODULE_ANALYSIS(NAME, CREATE_PASS) \		#define MODULE_ANALYSIS(NAME, CREATE_PASS) \
MAM.registerPass([&] { return CREATE_PASS; });		MAM.registerPass([&] { return CREATE_PASS; });
#include "PassRegistry.def"		#include "PassRegistry.def"
▲ Show 20 Lines • Show All 326 Lines • ▼ Show 20 Lines	if (EnableCHR && Level == OptimizationLevel::O3 && PGOOpt &&
(PGOOpt->Action == PGOOptions::IRUse \|\|		(PGOOpt->Action == PGOOptions::IRUse \|\|
PGOOpt->Action == PGOOptions::SampleUse))		PGOOpt->Action == PGOOptions::SampleUse))
FPM.addPass(ControlHeightReductionPass());		FPM.addPass(ControlHeightReductionPass());

return FPM;		return FPM;
}		}

void PassBuilder::addPGOInstrPasses(ModulePassManager &MPM, bool DebugLogging,		void PassBuilder::addPGOInstrPasses(ModulePassManager &MPM, bool DebugLogging,
PassBuilder::OptimizationLevel Level,		OptimizationLevel Level,
bool RunProfileGen, bool IsCS,		bool RunProfileGen, bool IsCS,
std::string ProfileFile,		std::string ProfileFile,
std::string ProfileRemappingFile) {		std::string ProfileRemappingFile) {
assert(Level != OptimizationLevel::O0 && "Not expecting O0 here!");		assert(Level != OptimizationLevel::O0 && "Not expecting O0 here!");
// Generally running simplification passes and the inliner with an high		// Generally running simplification passes and the inliner with an high
// threshold results in smaller executables, but there may be cases where		// threshold results in smaller executables, but there may be cases where
// the size grows, so let's be conservative here and skip this simplification		// the size grows, so let's be conservative here and skip this simplification
// at -Os/Oz. We will not do this inline for context sensistive PGO (when		// at -Os/Oz. We will not do this inline for context sensistive PGO (when
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	if (!ProfileFile.empty())
Options.InstrProfileOutput = ProfileFile;		Options.InstrProfileOutput = ProfileFile;
// Do not do counter promotion at O0.		// Do not do counter promotion at O0.
Options.DoCounterPromotion = false;		Options.DoCounterPromotion = false;
Options.UseBFIInPromotion = IsCS;		Options.UseBFIInPromotion = IsCS;
MPM.addPass(InstrProfiling(Options, IsCS));		MPM.addPass(InstrProfiling(Options, IsCS));
}		}

static InlineParams		static InlineParams
getInlineParamsFromOptLevel(PassBuilder::OptimizationLevel Level) {		getInlineParamsFromOptLevel(OptimizationLevel Level) {
return getInlineParams(Level.getSpeedupLevel(), Level.getSizeLevel());		return getInlineParams(Level.getSpeedupLevel(), Level.getSizeLevel());
}		}

ModuleInlinerWrapperPass		ModuleInlinerWrapperPass
PassBuilder::buildInlinerPipeline(OptimizationLevel Level, ThinLTOPhase Phase,		PassBuilder::buildInlinerPipeline(OptimizationLevel Level, ThinLTOPhase Phase,
bool DebugLogging) {		bool DebugLogging) {
InlineParams IP = getInlineParamsFromOptLevel(Level);		InlineParams IP = getInlineParamsFromOptLevel(Level);
if (Phase == PassBuilder::ThinLTOPhase::PreLink && PGOOpt &&		if (Phase == PassBuilder::ThinLTOPhase::PreLink && PGOOpt &&
▲ Show 20 Lines • Show All 1,804 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

Show First 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	cl::values(clEnumValN(AttributorRunOption::ALL, "all",
clEnumValN(AttributorRunOption::CGSCC, "cgscc",		clEnumValN(AttributorRunOption::CGSCC, "cgscc",
"enable call graph SCC attributor runs"),		"enable call graph SCC attributor runs"),
clEnumValN(AttributorRunOption::NONE, "none",		clEnumValN(AttributorRunOption::NONE, "none",
"disable attributor runs")));		"disable attributor runs")));

extern cl::opt<bool> EnableKnowledgeRetention;		extern cl::opt<bool> EnableKnowledgeRetention;

PassManagerBuilder::PassManagerBuilder() {		PassManagerBuilder::PassManagerBuilder() {
OptLevel = 2;		OptLevel = OptimizationLevel::O2;
SizeLevel = 0;
LibraryInfo = nullptr;		LibraryInfo = nullptr;
Inliner = nullptr;		Inliner = nullptr;
DisableUnrollLoops = false;		DisableUnrollLoops = false;
SLPVectorize = false;		SLPVectorize = false;
LoopVectorize = true;		LoopVectorize = true;
LoopsInterleaved = true;		LoopsInterleaved = true;
RerollLoops = RunLoopRerolling;		RerollLoops = RunLoopRerolling;
NewGVN = RunNewGVN;		NewGVN = RunNewGVN;
▲ Show 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	void PassManagerBuilder::populateFunctionPassManager(
legacy::FunctionPassManager &FPM) {		legacy::FunctionPassManager &FPM) {
addExtensionsToPM(EP_EarlyAsPossible, FPM);		addExtensionsToPM(EP_EarlyAsPossible, FPM);
FPM.add(createEntryExitInstrumenterPass());		FPM.add(createEntryExitInstrumenterPass());

// Add LibraryInfo if we have some.		// Add LibraryInfo if we have some.
if (LibraryInfo)		if (LibraryInfo)
FPM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));		FPM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));

if (OptLevel == 0) return;		if (OptLevel == OptimizationLevel::O0) return;

addInitialAliasAnalysisPasses(FPM);		addInitialAliasAnalysisPasses(FPM);

FPM.add(createCFGSimplificationPass());		FPM.add(createCFGSimplificationPass());
FPM.add(createSROAPass());		FPM.add(createSROAPass());
FPM.add(createEarlyCSEPass());		FPM.add(createEarlyCSEPass());
FPM.add(createLowerExpectIntrinsicPass());		FPM.add(createLowerExpectIntrinsicPass());
}		}

// Do PGO instrumentation generation or use pass as the option specified.		// Do PGO instrumentation generation or use pass as the option specified.
void PassManagerBuilder::addPGOInstrPasses(legacy::PassManagerBase &MPM,		void PassManagerBuilder::addPGOInstrPasses(legacy::PassManagerBase &MPM,
bool IsCS = false) {		bool IsCS = false) {
if (IsCS) {		if (IsCS) {
if (!EnablePGOCSInstrGen && !EnablePGOCSInstrUse)		if (!EnablePGOCSInstrGen && !EnablePGOCSInstrUse)
return;		return;
} else if (!EnablePGOInstrGen && PGOInstrUse.empty() && PGOSampleUse.empty())		} else if (!EnablePGOInstrGen && PGOInstrUse.empty() && PGOSampleUse.empty())
return;		return;

// Perform the preinline and cleanup passes for O1 and above.		// Perform the preinline and cleanup passes for O1 and above.
// And avoid doing them if optimizing for size.		// And avoid doing them if optimizing for size.
// We will not do this inline for context sensitive PGO (when IsCS is true).		// We will not do this inline for context sensitive PGO (when IsCS is true).
if (OptLevel > 0 && SizeLevel == 0 && !DisablePreInliner &&		if (OptLevel.getSpeedupLevel() > 0 && OptLevel.getSizeLevel() == 0 &&
PGOSampleUse.empty() && !IsCS) {		!DisablePreInliner && PGOSampleUse.empty() && !IsCS) {
// Create preinline pass. We construct an InlineParams object and specify		// Create preinline pass. We construct an InlineParams object and specify
// the threshold here to avoid the command line options of the regular		// the threshold here to avoid the command line options of the regular
// inliner to influence pre-inlining. The only fields of InlineParams we		// inliner to influence pre-inlining. The only fields of InlineParams we
// care about are DefaultThreshold and HintThreshold.		// care about are DefaultThreshold and HintThreshold.
InlineParams IP;		InlineParams IP;
IP.DefaultThreshold = PreInlineThreshold;		IP.DefaultThreshold = PreInlineThreshold;
// FIXME: The hint threshold has the same value used by the regular inliner.		// FIXME: The hint threshold has the same value used by the regular inliner.
// This should probably be lowered after performance testing.		// This should probably be lowered after performance testing.
Show All 17 Lines	if ((EnablePGOInstrGen && !IsCS) \|\| (EnablePGOCSInstrGen && IsCS)) {
MPM.add(createLoopRotatePass());		MPM.add(createLoopRotatePass());
MPM.add(createInstrProfilingLegacyPass(Options, IsCS));		MPM.add(createInstrProfilingLegacyPass(Options, IsCS));
}		}
if (!PGOInstrUse.empty())		if (!PGOInstrUse.empty())
MPM.add(createPGOInstrumentationUseLegacyPass(PGOInstrUse, IsCS));		MPM.add(createPGOInstrumentationUseLegacyPass(PGOInstrUse, IsCS));
// Indirect call promotion that promotes intra-module targets only.		// Indirect call promotion that promotes intra-module targets only.
// For ThinLTO this is done earlier due to interactions with globalopt		// For ThinLTO this is done earlier due to interactions with globalopt
// for imported functions. We don't run this at -O0.		// for imported functions. We don't run this at -O0.
if (OptLevel > 0 && !IsCS)		if (OptLevel.getSpeedupLevel() > 0 && !IsCS)
MPM.add(		MPM.add(
createPGOIndirectCallPromotionLegacyPass(false, !PGOSampleUse.empty()));		createPGOIndirectCallPromotionLegacyPass(false, !PGOSampleUse.empty()));
}		}
void PassManagerBuilder::addFunctionSimplificationPasses(		void PassManagerBuilder::addFunctionSimplificationPasses(
legacy::PassManagerBase &MPM) {		legacy::PassManagerBase &MPM) {
// Start of function pass.		// Start of function pass.
// Break up aggregate allocas, using SSAUpdater.		// Break up aggregate allocas, using SSAUpdater.
assert(OptLevel >= 1 && "Calling function optimizer with no optimization level!");		assert(OptLevel.getSpeedupLevel() >= 1 &&
		"Calling function optimizer with no optimization level!");
MPM.add(createSROAPass());		MPM.add(createSROAPass());
MPM.add(createEarlyCSEPass(true /* Enable mem-ssa. */)); // Catch trivial redundancies		MPM.add(createEarlyCSEPass(true /* Enable mem-ssa. */)); // Catch trivial redundancies
if (EnableKnowledgeRetention)		if (EnableKnowledgeRetention)
MPM.add(createAssumeSimplifyPass());		MPM.add(createAssumeSimplifyPass());

if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
if (EnableGVNHoist)		if (EnableGVNHoist)
MPM.add(createGVNHoistPass());		MPM.add(createGVNHoistPass());
if (EnableGVNSink) {		if (EnableGVNSink) {
MPM.add(createGVNSinkPass());		MPM.add(createGVNSinkPass());
MPM.add(createCFGSimplificationPass());		MPM.add(createCFGSimplificationPass());
}		}
}		}

if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
// Speculative execution if the target has divergent branches; otherwise nop.		// Speculative execution if the target has divergent branches; otherwise nop.
MPM.add(createSpeculativeExecutionIfHasBranchDivergencePass());		MPM.add(createSpeculativeExecutionIfHasBranchDivergencePass());

MPM.add(createJumpThreadingPass()); // Thread jumps.		MPM.add(createJumpThreadingPass()); // Thread jumps.
MPM.add(createCorrelatedValuePropagationPass()); // Propagate conditionals		MPM.add(createCorrelatedValuePropagationPass()); // Propagate conditionals
}		}
MPM.add(createCFGSimplificationPass()); // Merge & remove BBs		MPM.add(createCFGSimplificationPass()); // Merge & remove BBs
// Combine silly seq's		// Combine silly seq's
if (OptLevel > 2)		if (OptLevel.getSpeedupLevel() > 2)
MPM.add(createAggressiveInstCombinerPass());		MPM.add(createAggressiveInstCombinerPass());
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
if (SizeLevel == 0 && !DisableLibCallsShrinkWrap)		if (OptLevel.getSizeLevel() == 0 && !DisableLibCallsShrinkWrap)
MPM.add(createLibCallsShrinkWrapPass());		MPM.add(createLibCallsShrinkWrapPass());
addExtensionsToPM(EP_Peephole, MPM);		addExtensionsToPM(EP_Peephole, MPM);

// Optimize memory intrinsic calls based on the profiled size information.		// Optimize memory intrinsic calls based on the profiled size information.
if (SizeLevel == 0)		if (OptLevel.getSizeLevel() == 0)
MPM.add(createPGOMemOPSizeOptLegacyPass());		MPM.add(createPGOMemOPSizeOptLegacyPass());

// TODO: Investigate the cost/benefit of tail call elimination on debugging.		// TODO: Investigate the cost/benefit of tail call elimination on debugging.
if (OptLevel > 1)		if (OptLevel.getSpeedupLevel() > 1)
MPM.add(createTailCallEliminationPass()); // Eliminate tail calls		MPM.add(createTailCallEliminationPass()); // Eliminate tail calls
MPM.add(createCFGSimplificationPass()); // Merge & remove BBs		MPM.add(createCFGSimplificationPass()); // Merge & remove BBs
MPM.add(createReassociatePass()); // Reassociate expressions		MPM.add(createReassociatePass()); // Reassociate expressions

// Begin the loop pass pipeline.		// Begin the loop pass pipeline.
if (EnableSimpleLoopUnswitch) {		if (EnableSimpleLoopUnswitch) {
// The simple loop unswitch pass relies on separate cleanup passes. Schedule		// The simple loop unswitch pass relies on separate cleanup passes. Schedule
// them first so when we re-process a loop they run before other loop		// them first so when we re-process a loop they run before other loop
// passes.		// passes.
MPM.add(createLoopInstSimplifyPass());		MPM.add(createLoopInstSimplifyPass());
MPM.add(createLoopSimplifyCFGPass());		MPM.add(createLoopSimplifyCFGPass());
}		}
// Rotate Loop - disable header duplication at -Oz		// Rotate Loop - disable header duplication at -Oz
MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1));		MPM.add(createLoopRotatePass(OptLevel.getSizeLevel() == 2 ? 0 : -1));
// TODO: Investigate promotion cap for O1.		// TODO: Investigate promotion cap for O1.
MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));		MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
if (EnableSimpleLoopUnswitch)		if (EnableSimpleLoopUnswitch)
MPM.add(createSimpleLoopUnswitchLegacyPass());		MPM.add(createSimpleLoopUnswitchLegacyPass());
else		else
MPM.add(createLoopUnswitchPass(SizeLevel \|\| OptLevel < 3, DivergentTarget));		MPM.add(createLoopUnswitchPass(OptLevel.getSizeLevel() \|\|
		OptLevel.getSpeedupLevel() < 3, DivergentTarget));
// FIXME: We break the loop pass pipeline here in order to do full		// FIXME: We break the loop pass pipeline here in order to do full
// simplify-cfg. Eventually loop-simplifycfg should be enhanced to replace the		// simplify-cfg. Eventually loop-simplifycfg should be enhanced to replace the
// need for this.		// need for this.
MPM.add(createCFGSimplificationPass());		MPM.add(createCFGSimplificationPass());
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
// We resume loop passes creating a second loop pipeline here.		// We resume loop passes creating a second loop pipeline here.
MPM.add(createIndVarSimplifyPass()); // Canonicalize indvars		MPM.add(createIndVarSimplifyPass()); // Canonicalize indvars
MPM.add(createLoopIdiomPass()); // Recognize idioms like memset.		MPM.add(createLoopIdiomPass()); // Recognize idioms like memset.
addExtensionsToPM(EP_LateLoopOptimizations, MPM);		addExtensionsToPM(EP_LateLoopOptimizations, MPM);
MPM.add(createLoopDeletionPass()); // Delete dead loops		MPM.add(createLoopDeletionPass()); // Delete dead loops

if (EnableLoopInterchange)		if (EnableLoopInterchange)
MPM.add(createLoopInterchangePass()); // Interchange loops		MPM.add(createLoopInterchangePass()); // Interchange loops

// Unroll small loops		// Unroll small loops
MPM.add(createSimpleLoopUnrollPass(OptLevel, DisableUnrollLoops,		MPM.add(createSimpleLoopUnrollPass(OptLevel.getSpeedupLevel(), DisableUnrollLoops,
ForgetAllSCEVInLoopUnroll));		ForgetAllSCEVInLoopUnroll));
addExtensionsToPM(EP_LoopOptimizerEnd, MPM);		addExtensionsToPM(EP_LoopOptimizerEnd, MPM);
// This ends the loop pass pipelines.		// This ends the loop pass pipelines.

if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
MPM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds		MPM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds
MPM.add(NewGVN ? createNewGVNPass()		MPM.add(NewGVN ? createNewGVNPass()
: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies		: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies
}		}
MPM.add(createMemCpyOptPass()); // Remove memcpy / form memset		MPM.add(createMemCpyOptPass()); // Remove memcpy / form memset
MPM.add(createSCCPPass()); // Constant prop with SCCP		MPM.add(createSCCPPass()); // Constant prop with SCCP

// Delete dead bit computations (instcombine runs after to fold away the dead		// Delete dead bit computations (instcombine runs after to fold away the dead
// computations, and then ADCE will run later to exploit any new DCE		// computations, and then ADCE will run later to exploit any new DCE
// opportunities that creates).		// opportunities that creates).
MPM.add(createBitTrackingDCEPass()); // Delete dead bit computations		MPM.add(createBitTrackingDCEPass()); // Delete dead bit computations

// Run instcombine after redundancy elimination to exploit opportunities		// Run instcombine after redundancy elimination to exploit opportunities
// opened up by them.		// opened up by them.
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
addExtensionsToPM(EP_Peephole, MPM);		addExtensionsToPM(EP_Peephole, MPM);
if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
MPM.add(createJumpThreadingPass()); // Thread jumps		MPM.add(createJumpThreadingPass()); // Thread jumps
MPM.add(createCorrelatedValuePropagationPass());		MPM.add(createCorrelatedValuePropagationPass());
MPM.add(createDeadStoreEliminationPass()); // Delete dead stores		MPM.add(createDeadStoreEliminationPass()); // Delete dead stores
MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));		MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
}		}

addExtensionsToPM(EP_ScalarOptimizerLate, MPM);		addExtensionsToPM(EP_ScalarOptimizerLate, MPM);

if (RerollLoops)		if (RerollLoops)
MPM.add(createLoopRerollPass());		MPM.add(createLoopRerollPass());

// TODO: Investigate if this is too expensive at O1.		// TODO: Investigate if this is too expensive at O1.
MPM.add(createAggressiveDCEPass()); // Delete dead instructions		MPM.add(createAggressiveDCEPass()); // Delete dead instructions
MPM.add(createCFGSimplificationPass()); // Merge & remove BBs		MPM.add(createCFGSimplificationPass()); // Merge & remove BBs
// Clean up after everything.		// Clean up after everything.
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
addExtensionsToPM(EP_Peephole, MPM);		addExtensionsToPM(EP_Peephole, MPM);

if (EnableCHR && OptLevel >= 3 &&		if (EnableCHR && OptLevel.getSpeedupLevel() >= 3 &&
(!PGOInstrUse.empty() \|\| !PGOSampleUse.empty() \|\| EnablePGOCSInstrGen))		(!PGOInstrUse.empty() \|\| !PGOSampleUse.empty() \|\| EnablePGOCSInstrGen))
MPM.add(createControlHeightReductionLegacyPass());		MPM.add(createControlHeightReductionLegacyPass());
}		}

void PassManagerBuilder::populateModulePassManager(		void PassManagerBuilder::populateModulePassManager(
legacy::PassManagerBase &MPM) {		legacy::PassManagerBase &MPM) {
// Whether this is a default or *LTO pre-link pipeline. The FullLTO post-link		// Whether this is a default or *LTO pre-link pipeline. The FullLTO post-link
// is handled separately, so just check this is not the ThinLTO post-link.		// is handled separately, so just check this is not the ThinLTO post-link.
bool DefaultOrPreLinkPipeline = !PerformThinLTO;		bool DefaultOrPreLinkPipeline = !PerformThinLTO;

if (!PGOSampleUse.empty()) {		if (!PGOSampleUse.empty()) {
MPM.add(createPruneEHPass());		MPM.add(createPruneEHPass());
// In ThinLTO mode, when flattened profile is used, all the available		// In ThinLTO mode, when flattened profile is used, all the available
// profile information will be annotated in PreLink phase so there is		// profile information will be annotated in PreLink phase so there is
// no need to load the profile again in PostLink.		// no need to load the profile again in PostLink.
if (!(FlattenedProfileUsed && PerformThinLTO))		if (!(FlattenedProfileUsed && PerformThinLTO))
MPM.add(createSampleProfileLoaderPass(PGOSampleUse));		MPM.add(createSampleProfileLoaderPass(PGOSampleUse));
}		}

// Allow forcing function attributes as a debugging and tuning aid.		// Allow forcing function attributes as a debugging and tuning aid.
MPM.add(createForceFunctionAttrsLegacyPass());		MPM.add(createForceFunctionAttrsLegacyPass());

// If all optimizations are disabled, just run the always-inline pass and,		// If all optimizations are disabled, just run the always-inline pass and,
// if enabled, the function merging pass.		// if enabled, the function merging pass.
if (OptLevel == 0) {		if (OptLevel.getSpeedupLevel() == 0) {
addPGOInstrPasses(MPM);		addPGOInstrPasses(MPM);
if (Inliner) {		if (Inliner) {
MPM.add(Inliner);		MPM.add(Inliner);
Inliner = nullptr;		Inliner = nullptr;
}		}

// FIXME: The BarrierNoopPass is a HACK! The inliner pass above implicitly		// FIXME: The BarrierNoopPass is a HACK! The inliner pass above implicitly
// creates a CGSCC pass manager, but we don't want to add extensions into		// creates a CGSCC pass manager, but we don't want to add extensions into
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	void PassManagerBuilder::populateModulePassManager(
MPM.add(createInferFunctionAttrsLegacyPass());		MPM.add(createInferFunctionAttrsLegacyPass());

// Infer attributes on declarations, call sites, arguments, etc.		// Infer attributes on declarations, call sites, arguments, etc.
if (AttributorRun & AttributorRunOption::MODULE)		if (AttributorRun & AttributorRunOption::MODULE)
MPM.add(createAttributorLegacyPass());		MPM.add(createAttributorLegacyPass());

addExtensionsToPM(EP_ModuleOptimizerEarly, MPM);		addExtensionsToPM(EP_ModuleOptimizerEarly, MPM);

if (OptLevel > 2)		if (OptLevel.getSpeedupLevel() > 2)
MPM.add(createCallSiteSplittingPass());		MPM.add(createCallSiteSplittingPass());

MPM.add(createIPSCCPPass()); // IP SCCP		MPM.add(createIPSCCPPass()); // IP SCCP
MPM.add(createCalledValuePropagationPass());		MPM.add(createCalledValuePropagationPass());

MPM.add(createGlobalOptimizerPass()); // Optimize out global vars		MPM.add(createGlobalOptimizerPass()); // Optimize out global vars
// Promote any localized global vars.		// Promote any localized global vars.
MPM.add(createPromoteMemoryToRegisterPass());		MPM.add(createPromoteMemoryToRegisterPass());
Show All 32 Lines	void PassManagerBuilder::populateModulePassManager(
}		}

// Infer attributes on declarations, call sites, arguments, etc. for an SCC.		// Infer attributes on declarations, call sites, arguments, etc. for an SCC.
if (AttributorRun & AttributorRunOption::CGSCC)		if (AttributorRun & AttributorRunOption::CGSCC)
MPM.add(createAttributorCGSCCLegacyPass());		MPM.add(createAttributorCGSCCLegacyPass());

// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if		// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if
// there are no OpenMP runtime calls present in the module.		// there are no OpenMP runtime calls present in the module.
if (OptLevel > 1)		if (OptLevel.getSpeedupLevel() > 1)
MPM.add(createOpenMPOptLegacyPass());		MPM.add(createOpenMPOptLegacyPass());

MPM.add(createPostOrderFunctionAttrsLegacyPass());		MPM.add(createPostOrderFunctionAttrsLegacyPass());
if (OptLevel > 2)		if (OptLevel.getSpeedupLevel() > 2)
MPM.add(createArgumentPromotionPass()); // Scalarize uninlined fn args		MPM.add(createArgumentPromotionPass()); // Scalarize uninlined fn args

addExtensionsToPM(EP_CGSCCOptimizerLate, MPM);		addExtensionsToPM(EP_CGSCCOptimizerLate, MPM);
addFunctionSimplificationPasses(MPM);		addFunctionSimplificationPasses(MPM);

// FIXME: This is a HACK! The inliner pass above implicitly creates a CGSCC		// FIXME: This is a HACK! The inliner pass above implicitly creates a CGSCC
// pass manager that we are specifically trying to avoid. To prevent this		// pass manager that we are specifically trying to avoid. To prevent this
// we must insert a no-op module pass to reset the pass manager.		// we must insert a no-op module pass to reset the pass manager.
MPM.add(createBarrierNoopPass());		MPM.add(createBarrierNoopPass());

if (RunPartialInlining)		if (RunPartialInlining)
MPM.add(createPartialInliningPass());		MPM.add(createPartialInliningPass());

if (OptLevel > 1 && !PrepareForLTO && !PrepareForThinLTO)		if (OptLevel.getSpeedupLevel() > 1 && !PrepareForLTO && !PrepareForThinLTO)
// Remove avail extern fns and globals definitions if we aren't		// Remove avail extern fns and globals definitions if we aren't
// compiling an object file for later LTO. For LTO we want to preserve		// compiling an object file for later LTO. For LTO we want to preserve
// these so they are eligible for inlining at link-time. Note if they		// these so they are eligible for inlining at link-time. Note if they
// are unreferenced they will be removed by GlobalDCE later, so		// are unreferenced they will be removed by GlobalDCE later, so
// this only impacts referenced available externally globals.		// this only impacts referenced available externally globals.
// Eventually they will be suppressed during codegen, but eliminating		// Eventually they will be suppressed during codegen, but eliminating
// here enables more opportunity for GlobalDCE as it may make		// here enables more opportunity for GlobalDCE as it may make
// globals referenced by available external functions dead		// globals referenced by available external functions dead
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	if (EnableMatrix) {
MPM.add(createEarlyCSEPass(false));		MPM.add(createEarlyCSEPass(false));
}		}

addExtensionsToPM(EP_VectorizerStart, MPM);		addExtensionsToPM(EP_VectorizerStart, MPM);

// Re-rotate loops in all our loop nests. These may have fallout out of		// Re-rotate loops in all our loop nests. These may have fallout out of
// rotated form due to GVN or other transformations, and the vectorizer relies		// rotated form due to GVN or other transformations, and the vectorizer relies
// on the rotated form. Disable header duplication at -Oz.		// on the rotated form. Disable header duplication at -Oz.
MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1));		MPM.add(createLoopRotatePass(OptLevel.getSizeLevel() == 2 ? 0 : -1));

// Distribute loops to allow partial vectorization. I.e. isolate dependences		// Distribute loops to allow partial vectorization. I.e. isolate dependences
// into separate loop that would otherwise inhibit vectorization. This is		// into separate loop that would otherwise inhibit vectorization. This is
// currently only performed for loops marked with the metadata		// currently only performed for loops marked with the metadata
// llvm.loop.distribute=true or when -enable-loop-distribute is specified.		// llvm.loop.distribute=true or when -enable-loop-distribute is specified.
MPM.add(createLoopDistributePass());		MPM.add(createLoopDistributePass());

MPM.add(createLoopVectorizePass(!LoopsInterleaved, !LoopVectorize));		MPM.add(createLoopVectorizePass(!LoopsInterleaved, !LoopVectorize));

// Eliminate loads by forwarding stores from the previous iteration to loads		// Eliminate loads by forwarding stores from the previous iteration to loads
// of the current iteration.		// of the current iteration.
MPM.add(createLoopLoadEliminationPass());		MPM.add(createLoopLoadEliminationPass());

// FIXME: Because of #pragma vectorize enable, the passes below are always		// FIXME: Because of #pragma vectorize enable, the passes below are always
// inserted in the pipeline, even when the vectorizer doesn't run (ex. when		// inserted in the pipeline, even when the vectorizer doesn't run (ex. when
// on -O1 and no #pragma is found). Would be good to have these two passes		// on -O1 and no #pragma is found). Would be good to have these two passes
// as function calls, so that we can only pass them when the vectorizer		// as function calls, so that we can only pass them when the vectorizer
// changed the code.		// changed the code.
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
if (OptLevel > 1 && ExtraVectorizerPasses) {		if (OptLevel.getSpeedupLevel() > 1 && ExtraVectorizerPasses) {
// At higher optimization levels, try to clean up any runtime overlap and		// At higher optimization levels, try to clean up any runtime overlap and
// alignment checks inserted by the vectorizer. We want to track correllated		// alignment checks inserted by the vectorizer. We want to track correllated
// runtime checks for two inner loops in the same outer loop, fold any		// runtime checks for two inner loops in the same outer loop, fold any
// common computations, hoist loop-invariant aspects out of any outer loop,		// common computations, hoist loop-invariant aspects out of any outer loop,
// and unswitch the runtime checks if possible. Once hoisted, we may have		// and unswitch the runtime checks if possible. Once hoisted, we may have
// dead (or speculatable) control flows or more combining opportunities.		// dead (or speculatable) control flows or more combining opportunities.
MPM.add(createCorrelatedValuePropagationPass());		MPM.add(createCorrelatedValuePropagationPass());
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));		MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
MPM.add(createLoopUnswitchPass(SizeLevel \|\| OptLevel < 3, DivergentTarget));		MPM.add(createLoopUnswitchPass(OptLevel.getSizeLevel() \|\|
		OptLevel.getSpeedupLevel() < 3, DivergentTarget));
MPM.add(createCFGSimplificationPass());		MPM.add(createCFGSimplificationPass());
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());
}		}

// Cleanup after loop vectorization, etc. Simplification passes like CVP and		// Cleanup after loop vectorization, etc. Simplification passes like CVP and
// GVN, loop transforms, and others have already run, so it's now better to		// GVN, loop transforms, and others have already run, so it's now better to
// convert to more optimized IR using more aggressive simplify CFG options.		// convert to more optimized IR using more aggressive simplify CFG options.
// The extra sinking transform can create larger basic blocks, so do this		// The extra sinking transform can create larger basic blocks, so do this
// before SLP vectorization.		// before SLP vectorization.
MPM.add(createCFGSimplificationPass(1, true, true, false, true));		MPM.add(createCFGSimplificationPass(1, true, true, false, true));

if (SLPVectorize) {		if (SLPVectorize) {
MPM.add(createSLPVectorizerPass()); // Vectorize parallel scalar chains.		MPM.add(createSLPVectorizerPass()); // Vectorize parallel scalar chains.
if (OptLevel > 1 && ExtraVectorizerPasses) {		if (OptLevel.getSpeedupLevel() > 1 && ExtraVectorizerPasses) {
MPM.add(createEarlyCSEPass());		MPM.add(createEarlyCSEPass());
}		}
}		}

// Enhance/cleanup vector code.		// Enhance/cleanup vector code.
MPM.add(createVectorCombinePass());		MPM.add(createVectorCombinePass());

addExtensionsToPM(EP_Peephole, MPM);		addExtensionsToPM(EP_Peephole, MPM);
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());

if (EnableUnrollAndJam && !DisableUnrollLoops) {		if (EnableUnrollAndJam && !DisableUnrollLoops) {
// Unroll and Jam. We do this before unroll but need to be in a separate		// Unroll and Jam. We do this before unroll but need to be in a separate
// loop pass manager in order for the outer loop to be processed by		// loop pass manager in order for the outer loop to be processed by
// unroll and jam before the inner loop is unrolled.		// unroll and jam before the inner loop is unrolled.
MPM.add(createLoopUnrollAndJamPass(OptLevel));		MPM.add(createLoopUnrollAndJamPass(OptLevel.getSpeedupLevel()));
}		}

// Unroll small loops		// Unroll small loops
MPM.add(createLoopUnrollPass(OptLevel, DisableUnrollLoops,		MPM.add(createLoopUnrollPass(OptLevel.getSpeedupLevel(), DisableUnrollLoops,
ForgetAllSCEVInLoopUnroll));		ForgetAllSCEVInLoopUnroll));

if (!DisableUnrollLoops) {		if (!DisableUnrollLoops) {
// LoopUnroll may generate some redundency to cleanup.		// LoopUnroll may generate some redundency to cleanup.
MPM.add(createInstructionCombiningPass());		MPM.add(createInstructionCombiningPass());

// Runtime unrolling will introduce runtime check in loop prologue. If the		// Runtime unrolling will introduce runtime check in loop prologue. If the
// unrolled loop is a inner loop, then the prologue will be inside the		// unrolled loop is a inner loop, then the prologue will be inside the
// outer loop. LICM pass can help to promote the runtime check out if the		// outer loop. LICM pass can help to promote the runtime check out if the
// checked value is loop invariant.		// checked value is loop invariant.
MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));		MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
}		}

MPM.add(createWarnMissedTransformationsPass());		MPM.add(createWarnMissedTransformationsPass());

// After vectorization and unrolling, assume intrinsics may tell us more		// After vectorization and unrolling, assume intrinsics may tell us more
// about pointer alignments.		// about pointer alignments.
MPM.add(createAlignmentFromAssumptionsPass());		MPM.add(createAlignmentFromAssumptionsPass());

// FIXME: We shouldn't bother with this anymore.		// FIXME: We shouldn't bother with this anymore.
MPM.add(createStripDeadPrototypesPass()); // Get rid of dead prototypes		MPM.add(createStripDeadPrototypesPass()); // Get rid of dead prototypes

// GlobalOpt already deletes dead functions and globals, at -O2 try a		// GlobalOpt already deletes dead functions and globals, at -O2 try a
// late pass of GlobalDCE. It is capable of deleting dead cycles.		// late pass of GlobalDCE. It is capable of deleting dead cycles.
if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
MPM.add(createGlobalDCEPass()); // Remove dead fns and globals.		MPM.add(createGlobalDCEPass()); // Remove dead fns and globals.
MPM.add(createConstantMergePass()); // Merge dup global constants		MPM.add(createConstantMergePass()); // Merge dup global constants
}		}

// See comment in the new PM for justification of scheduling splitting at		// See comment in the new PM for justification of scheduling splitting at
// this stage (\ref buildModuleSimplificationPipeline).		// this stage (\ref buildModuleSimplificationPipeline).
if (EnableHotColdSplit && !(PrepareForLTO \|\| PrepareForThinLTO))		if (EnableHotColdSplit && !(PrepareForLTO \|\| PrepareForThinLTO))
MPM.add(createHotColdSplittingPass());		MPM.add(createHotColdSplittingPass());
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	void PassManagerBuilder::addLTOOptimizationPasses(legacy::PassManagerBase &PM) {
addInitialAliasAnalysisPasses(PM);		addInitialAliasAnalysisPasses(PM);

// Allow forcing function attributes as a debugging and tuning aid.		// Allow forcing function attributes as a debugging and tuning aid.
PM.add(createForceFunctionAttrsLegacyPass());		PM.add(createForceFunctionAttrsLegacyPass());

// Infer attributes about declarations if possible.		// Infer attributes about declarations if possible.
PM.add(createInferFunctionAttrsLegacyPass());		PM.add(createInferFunctionAttrsLegacyPass());

if (OptLevel > 1) {		if (OptLevel.getSpeedupLevel() > 1) {
// Split call-site with more constrained arguments.		// Split call-site with more constrained arguments.
PM.add(createCallSiteSplittingPass());		PM.add(createCallSiteSplittingPass());

// Indirect call promotion. This should promote all the targets that are		// Indirect call promotion. This should promote all the targets that are
// left by the earlier promotion pass that promotes intra-module targets.		// left by the earlier promotion pass that promotes intra-module targets.
// This two-step promotion is to save the compile time. For LTO, it should		// This two-step promotion is to save the compile time. For LTO, it should
// produce the same result as if we only do promotion here.		// produce the same result as if we only do promotion here.
PM.add(		PM.add(
Show All 22 Lines	void PassManagerBuilder::addLTOOptimizationPasses(legacy::PassManagerBase &PM) {
// improve the quality of generated code when virtual constant propagation or		// improve the quality of generated code when virtual constant propagation or
// control flow integrity are enabled.		// control flow integrity are enabled.
PM.add(createGlobalSplitPass());		PM.add(createGlobalSplitPass());

// Apply whole-program devirtualization and virtual constant propagation.		// Apply whole-program devirtualization and virtual constant propagation.
PM.add(createWholeProgramDevirtPass(ExportSummary, nullptr));		PM.add(createWholeProgramDevirtPass(ExportSummary, nullptr));

// That's all we need at opt level 1.		// That's all we need at opt level 1.
if (OptLevel == 1)		if (OptLevel.getSpeedupLevel() == 1)
return;		return;

// Now that we internalized some globals, see if we can hack on them!		// Now that we internalized some globals, see if we can hack on them!
PM.add(createGlobalOptimizerPass());		PM.add(createGlobalOptimizerPass());
// Promote any localized global vars.		// Promote any localized global vars.
PM.add(createPromoteMemoryToRegisterPass());		PM.add(createPromoteMemoryToRegisterPass());

// Linking modules together can lead to duplicated global constants, only		// Linking modules together can lead to duplicated global constants, only
// keep one copy of each constant.		// keep one copy of each constant.
PM.add(createConstantMergePass());		PM.add(createConstantMergePass());

// Remove unused arguments from functions.		// Remove unused arguments from functions.
PM.add(createDeadArgEliminationPass());		PM.add(createDeadArgEliminationPass());

// Reduce the code after globalopt and ipsccp. Both can open up significant		// Reduce the code after globalopt and ipsccp. Both can open up significant
// simplification opportunities, and both can propagate functions through		// simplification opportunities, and both can propagate functions through
// function pointers. When this happens, we often have to resolve varargs		// function pointers. When this happens, we often have to resolve varargs
// calls, etc, so let instcombine do this.		// calls, etc, so let instcombine do this.
if (OptLevel > 2)		if (OptLevel.getSpeedupLevel() > 2)
PM.add(createAggressiveInstCombinerPass());		PM.add(createAggressiveInstCombinerPass());
PM.add(createInstructionCombiningPass());		PM.add(createInstructionCombiningPass());
addExtensionsToPM(EP_Peephole, PM);		addExtensionsToPM(EP_Peephole, PM);

// Inline small functions		// Inline small functions
bool RunInliner = Inliner;		bool RunInliner = Inliner;
if (RunInliner) {		if (RunInliner) {
PM.add(Inliner);		PM.add(Inliner);
Inliner = nullptr;		Inliner = nullptr;
}		}

PM.add(createPruneEHPass()); // Remove dead EH info.		PM.add(createPruneEHPass()); // Remove dead EH info.

// CSFDO instrumentation and use pass.		// CSFDO instrumentation and use pass.
addPGOInstrPasses(PM, /* IsCS */ true);		addPGOInstrPasses(PM, /* IsCS */ true);

// Infer attributes on declarations, call sites, arguments, etc. for an SCC.		// Infer attributes on declarations, call sites, arguments, etc. for an SCC.
if (AttributorRun & AttributorRunOption::CGSCC)		if (AttributorRun & AttributorRunOption::CGSCC)
PM.add(createAttributorCGSCCLegacyPass());		PM.add(createAttributorCGSCCLegacyPass());

// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if		// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if
// there are no OpenMP runtime calls present in the module.		// there are no OpenMP runtime calls present in the module.
if (OptLevel > 1)		if (OptLevel.getSpeedupLevel() > 1)
PM.add(createOpenMPOptLegacyPass());		PM.add(createOpenMPOptLegacyPass());

// Optimize globals again if we ran the inliner.		// Optimize globals again if we ran the inliner.
if (RunInliner)		if (RunInliner)
PM.add(createGlobalOptimizerPass());		PM.add(createGlobalOptimizerPass());
PM.add(createGlobalDCEPass()); // Remove dead functions.		PM.add(createGlobalDCEPass()); // Remove dead functions.

// If we didn't decide to inline a function, check to see if we can		// If we didn't decide to inline a function, check to see if we can
// transform it to pass arguments by value instead of by reference.		// transform it to pass arguments by value instead of by reference.
PM.add(createArgumentPromotionPass());		PM.add(createArgumentPromotionPass());

// The IPO passes may leave cruft around. Clean up after them.		// The IPO passes may leave cruft around. Clean up after them.
PM.add(createInstructionCombiningPass());		PM.add(createInstructionCombiningPass());
addExtensionsToPM(EP_Peephole, PM);		addExtensionsToPM(EP_Peephole, PM);
PM.add(createJumpThreadingPass());		PM.add(createJumpThreadingPass());

// Break up allocas		// Break up allocas
PM.add(createSROAPass());		PM.add(createSROAPass());

// LTO provides additional opportunities for tailcall elimination due to		// LTO provides additional opportunities for tailcall elimination due to
// link-time inlining, and visibility of nocapture attribute.		// link-time inlining, and visibility of nocapture attribute.
if (OptLevel > 1)		if (OptLevel.getSpeedupLevel() > 1)
PM.add(createTailCallEliminationPass());		PM.add(createTailCallEliminationPass());

// Infer attributes on declarations, call sites, arguments, etc.		// Infer attributes on declarations, call sites, arguments, etc.
PM.add(createPostOrderFunctionAttrsLegacyPass()); // Add nocapture.		PM.add(createPostOrderFunctionAttrsLegacyPass()); // Add nocapture.
// Run a few AA driven optimizations here and now, to cleanup the code.		// Run a few AA driven optimizations here and now, to cleanup the code.
PM.add(createGlobalsAAWrapperPass()); // IP alias analysis.		PM.add(createGlobalsAAWrapperPass()); // IP alias analysis.

PM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));		PM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
PM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds.		PM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds.
PM.add(NewGVN ? createNewGVNPass()		PM.add(NewGVN ? createNewGVNPass()
: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies.		: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies.
PM.add(createMemCpyOptPass()); // Remove dead memcpys.		PM.add(createMemCpyOptPass()); // Remove dead memcpys.

// Nuke dead stores.		// Nuke dead stores.
PM.add(createDeadStoreEliminationPass());		PM.add(createDeadStoreEliminationPass());

// More loops are countable; try to optimize them.		// More loops are countable; try to optimize them.
PM.add(createIndVarSimplifyPass());		PM.add(createIndVarSimplifyPass());
PM.add(createLoopDeletionPass());		PM.add(createLoopDeletionPass());
if (EnableLoopInterchange)		if (EnableLoopInterchange)
PM.add(createLoopInterchangePass());		PM.add(createLoopInterchangePass());

// Unroll small loops		// Unroll small loops
PM.add(createSimpleLoopUnrollPass(OptLevel, DisableUnrollLoops,		PM.add(createSimpleLoopUnrollPass(OptLevel.getSpeedupLevel(), DisableUnrollLoops,
ForgetAllSCEVInLoopUnroll));		ForgetAllSCEVInLoopUnroll));
PM.add(createLoopVectorizePass(true, !LoopVectorize));		PM.add(createLoopVectorizePass(true, !LoopVectorize));
// The vectorizer may have significantly shortened a loop body; unroll again.		// The vectorizer may have significantly shortened a loop body; unroll again.
PM.add(createLoopUnrollPass(OptLevel, DisableUnrollLoops,		PM.add(createLoopUnrollPass(OptLevel.getSpeedupLevel(), DisableUnrollLoops,
ForgetAllSCEVInLoopUnroll));		ForgetAllSCEVInLoopUnroll));

PM.add(createWarnMissedTransformationsPass());		PM.add(createWarnMissedTransformationsPass());

// Now that we've optimized loops (in particular loop induction variables),		// Now that we've optimized loops (in particular loop induction variables),
// we may have exposed more scalar opportunities. Run parts of the scalar		// we may have exposed more scalar opportunities. Run parts of the scalar
// optimizer again at this point.		// optimizer again at this point.
PM.add(createInstructionCombiningPass()); // Initial cleanup		PM.add(createInstructionCombiningPass()); // Initial cleanup
▲ Show 20 Lines • Show All 78 Lines • ▼ Show 20 Lines	void PassManagerBuilder::populateLTOPassManager(legacy::PassManagerBase &PM) {
if (LibraryInfo)		if (LibraryInfo)
PM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));		PM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));

if (VerifyInput)		if (VerifyInput)
PM.add(createVerifierPass());		PM.add(createVerifierPass());

addExtensionsToPM(EP_FullLinkTimeOptimizationEarly, PM);		addExtensionsToPM(EP_FullLinkTimeOptimizationEarly, PM);

if (OptLevel != 0)		if (OptLevel != OptimizationLevel::O0)
addLTOOptimizationPasses(PM);		addLTOOptimizationPasses(PM);
else {		else {
// The whole-program-devirt pass needs to run at -O0 because only it knows		// The whole-program-devirt pass needs to run at -O0 because only it knows
// about the llvm.type.checked.load intrinsic: it needs to both lower the		// about the llvm.type.checked.load intrinsic: it needs to both lower the
// intrinsic itself and handle it in the summary.		// intrinsic itself and handle it in the summary.
PM.add(createWholeProgramDevirtPass(ExportSummary, nullptr));		PM.add(createWholeProgramDevirtPass(ExportSummary, nullptr));
}		}

// Create a function that performs CFI checks for cross-DSO calls with targets		// Create a function that performs CFI checks for cross-DSO calls with targets
// in the current module.		// in the current module.
PM.add(createCrossDSOCFIPass());		PM.add(createCrossDSOCFIPass());

// Lower type metadata and the type.test intrinsic. This pass supports Clang's		// Lower type metadata and the type.test intrinsic. This pass supports Clang's
// control flow integrity mechanisms (-fsanitize=cfi*) and needs to run at		// control flow integrity mechanisms (-fsanitize=cfi*) and needs to run at
// link time if CFI is enabled. The pass does nothing if CFI is disabled.		// link time if CFI is enabled. The pass does nothing if CFI is disabled.
PM.add(createLowerTypeTestsPass(ExportSummary, nullptr));		PM.add(createLowerTypeTestsPass(ExportSummary, nullptr));

if (OptLevel != 0)		if (OptLevel != OptimizationLevel::O0)
addLateLTOOptimizationPasses(PM);		addLateLTOOptimizationPasses(PM);

addExtensionsToPM(EP_FullLinkTimeOptimizationLast, PM);		addExtensionsToPM(EP_FullLinkTimeOptimizationLast, PM);

if (VerifyOutput)		if (VerifyOutput)
PM.add(createVerifierPass());		PM.add(createVerifierPass());
}		}

LLVMPassManagerBuilderRef LLVMPassManagerBuilderCreate() {		LLVMPassManagerBuilderRef LLVMPassManagerBuilderCreate() {
PassManagerBuilder *PMB = new PassManagerBuilder();		PassManagerBuilder *PMB = new PassManagerBuilder();
return wrap(PMB);		return wrap(PMB);
}		}

void LLVMPassManagerBuilderDispose(LLVMPassManagerBuilderRef PMB) {		void LLVMPassManagerBuilderDispose(LLVMPassManagerBuilderRef PMB) {
PassManagerBuilder *Builder = unwrap(PMB);		PassManagerBuilder *Builder = unwrap(PMB);
delete Builder;		delete Builder;
}		}

void		void
LLVMPassManagerBuilderSetOptLevel(LLVMPassManagerBuilderRef PMB,		LLVMPassManagerBuilderSetOptLevel(LLVMPassManagerBuilderRef PMB,
unsigned OptLevel) {		unsigned OptLevel) {
PassManagerBuilder *Builder = unwrap(PMB);		PassManagerBuilder *Builder = unwrap(PMB);
Builder->OptLevel = OptLevel;		Builder->OptLevel = {OptLevel, 0};
}		}

void		void
LLVMPassManagerBuilderSetSizeLevel(LLVMPassManagerBuilderRef PMB,		LLVMPassManagerBuilderSetSizeLevel(LLVMPassManagerBuilderRef PMB,
unsigned SizeLevel) {		unsigned SizeLevel) {
PassManagerBuilder *Builder = unwrap(PMB);		PassManagerBuilder *Builder = unwrap(PMB);
Builder->SizeLevel = SizeLevel;		Builder->OptLevel = {2, SizeLevel};
}		}

void		void
LLVMPassManagerBuilderSetDisableUnitAtATime(LLVMPassManagerBuilderRef PMB,		LLVMPassManagerBuilderSetDisableUnitAtATime(LLVMPassManagerBuilderRef PMB,
LLVMBool Value) {		LLVMBool Value) {
// NOTE: The DisableUnitAtATime switch has been removed.		// NOTE: The DisableUnitAtATime switch has been removed.
}		}

▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/tools/bugpoint/bugpoint.cpp

	Show First 20 Lines • Show All 117 Lines • ▼ Show 20 Lines
	// This routine adds optimization passes based on selected optimization level,			// This routine adds optimization passes based on selected optimization level,
	// OptLevel.			// OptLevel.
	//			//
	// OptLevel - Optimization Level			// OptLevel - Optimization Level
	static void AddOptimizationPasses(legacy::FunctionPassManager &FPM,			static void AddOptimizationPasses(legacy::FunctionPassManager &FPM,
	unsigned OptLevel,			unsigned OptLevel,
	unsigned SizeLevel) {			unsigned SizeLevel) {
	PassManagerBuilder Builder;			PassManagerBuilder Builder;
	Builder.OptLevel = OptLevel;			Builder.OptLevel = {OptLevel, SizeLevel};
	Builder.SizeLevel = SizeLevel;

	if (OptLevel > 1)			if (OptLevel > 1)
	Builder.Inliner = createFunctionInliningPass(OptLevel, SizeLevel, false);			Builder.Inliner = createFunctionInliningPass(OptLevel, SizeLevel, false);
	else			else
	Builder.Inliner = createAlwaysInlinerLegacyPass();			Builder.Inliner = createAlwaysInlinerLegacyPass();

	Builder.populateFunctionPassManager(FPM);			Builder.populateFunctionPassManager(FPM);
	Builder.populateModulePassManager(FPM);			Builder.populateModulePassManager(FPM);
	▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

llvm/tools/llvm-lto/llvm-lto.cpp

Show First 20 Lines • Show All 1,011 Lines • ▼ Show 20 Lines	int main(int argc, char **argv) {

// Add all the dso symbols to the table of symbols to expose.		// Add all the dso symbols to the table of symbols to expose.
for (unsigned i = 0; i < KeptDSOSyms.size(); ++i)		for (unsigned i = 0; i < KeptDSOSyms.size(); ++i)
CodeGen.addMustPreserveSymbol(KeptDSOSyms[i]);		CodeGen.addMustPreserveSymbol(KeptDSOSyms[i]);

// Set cpu and attrs strings for the default target/subtarget.		// Set cpu and attrs strings for the default target/subtarget.
CodeGen.setCpu(codegen::getMCPU().c_str());		CodeGen.setCpu(codegen::getMCPU().c_str());

CodeGen.setOptLevel(OptLevel - '0');		CodeGen.setOptLevel({OptLevel - '0', 0});

auto MAttrs = codegen::getMAttrs();		auto MAttrs = codegen::getMAttrs();
if (!MAttrs.empty()) {		if (!MAttrs.empty()) {
std::string attrs = join(MAttrs, ",");		std::string attrs = join(MAttrs, ",");
CodeGen.setAttr(attrs);		CodeGen.setAttr(attrs);
}		}

if (auto FT = codegen::getExplicitFileType())		if (auto FT = codegen::getExplicitFileType())
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/tools/llvm-lto2/llvm-lto2.cpp

Show First 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	static int run(int argc, char **argv) {
Conf.SampleProfile = SamplePGOFile;		Conf.SampleProfile = SamplePGOFile;
Conf.CSIRProfile = CSPGOFile;		Conf.CSIRProfile = CSPGOFile;
Conf.RunCSIRInstr = RunCSIRInstr;		Conf.RunCSIRInstr = RunCSIRInstr;

// Run a custom pipeline, if asked for.		// Run a custom pipeline, if asked for.
Conf.OptPipeline = OptPipeline;		Conf.OptPipeline = OptPipeline;
Conf.AAPipeline = AAPipeline;		Conf.AAPipeline = AAPipeline;

Conf.OptLevel = OptLevel - '0';		Conf.OptLevel = {OptLevel - '0', 0};
Conf.UseNewPM = UseNewPM;		Conf.UseNewPM = UseNewPM;
for (auto &PluginFN : PassPlugins)		for (auto &PluginFN : PassPlugins)
Conf.PassPlugins.push_back(PluginFN);		Conf.PassPlugins.push_back(PluginFN);
switch (CGOptLevel) {		switch (CGOptLevel) {
case '0':		case '0':
Conf.CGOptLevel = CodeGenOpt::None;		Conf.CGOptLevel = CodeGenOpt::None;
break;		break;
case '1':		case '1':
Show All 11 Lines	static int run(int argc, char **argv) {
}		}

if (auto FT = codegen::getExplicitFileType())		if (auto FT = codegen::getExplicitFileType())
Conf.CGFileType = FT.getValue();		Conf.CGFileType = FT.getValue();

Conf.OverrideTriple = OverrideTriple;		Conf.OverrideTriple = OverrideTriple;
Conf.DefaultTriple = DefaultTriple;		Conf.DefaultTriple = DefaultTriple;
Conf.StatsFile = StatsFile;		Conf.StatsFile = StatsFile;
Conf.PTO.LoopVectorization = Conf.OptLevel > 1;		Conf.PTO.LoopVectorization = Conf.OptLevel.getSpeedupLevel() > 1;
Conf.PTO.SLPVectorization = Conf.OptLevel > 1;		Conf.PTO.SLPVectorization = Conf.OptLevel.getSpeedupLevel() > 1;

ThinBackend Backend;		ThinBackend Backend;
if (ThinLTODistributedIndexes)		if (ThinLTODistributedIndexes)
Backend = createWriteIndexesThinBackend(/* OldPrefix */ "",		Backend = createWriteIndexesThinBackend(/* OldPrefix */ "",
/* NewPrefix */ "",		/* NewPrefix */ "",
/* ShouldEmitImportsFiles */ true,		/* ShouldEmitImportsFiles */ true,
/* LinkedObjectsFile */ nullptr,		/* LinkedObjectsFile */ nullptr,
/* OnWrite */ {});		/* OnWrite */ {});
▲ Show 20 Lines • Show All 174 Lines • Show Last 20 Lines

llvm/tools/lto/lto.cpp

Show First 20 Lines • Show All 160 Lines • ▼ Show 20 Lines	static void lto_add_attrs(lto_code_gen_t cg) {
auto MAttrs = codegen::getMAttrs();		auto MAttrs = codegen::getMAttrs();
if (!MAttrs.empty()) {		if (!MAttrs.empty()) {
std::string attrs = join(MAttrs, ",");		std::string attrs = join(MAttrs, ",");
CG->setAttr(attrs);		CG->setAttr(attrs);
}		}

if (OptLevel < '0' \|\| OptLevel > '3')		if (OptLevel < '0' \|\| OptLevel > '3')
report_fatal_error("Optimization level must be between 0 and 3");		report_fatal_error("Optimization level must be between 0 and 3");
CG->setOptLevel(OptLevel - '0');		CG->setOptLevel({OptLevel - '0', 0});
CG->setFreestanding(EnableFreestanding);		CG->setFreestanding(EnableFreestanding);
}		}

extern const char* lto_get_version() {		extern const char* lto_get_version() {
return LTOCodeGenerator::getVersionString();		return LTOCodeGenerator::getVersionString();
}		}

const char* lto_get_error_message() {		const char* lto_get_error_message() {
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	thinlto_code_gen_t thinlto_create_codegen(void) {
lto_initialize();		lto_initialize();
ThinLTOCodeGenerator *CodeGen = new ThinLTOCodeGenerator();		ThinLTOCodeGenerator *CodeGen = new ThinLTOCodeGenerator();
CodeGen->setTargetOptions(codegen::InitTargetOptionsFromCodeGenFlags());		CodeGen->setTargetOptions(codegen::InitTargetOptionsFromCodeGenFlags());
CodeGen->setFreestanding(EnableFreestanding);		CodeGen->setFreestanding(EnableFreestanding);

if (OptLevel.getNumOccurrences()) {		if (OptLevel.getNumOccurrences()) {
if (OptLevel < '0' \|\| OptLevel > '3')		if (OptLevel < '0' \|\| OptLevel > '3')
report_fatal_error("Optimization level must be between 0 and 3");		report_fatal_error("Optimization level must be between 0 and 3");
CodeGen->setOptLevel(OptLevel - '0');		CodeGen->setOptLevel({OptLevel - '0', 0});
switch (OptLevel) {		switch (OptLevel) {
case '0':		case '0':
CodeGen->setCodeGenOptLevel(CodeGenOpt::None);		CodeGen->setCodeGenOptLevel(CodeGenOpt::None);
break;		break;
case '1':		case '1':
CodeGen->setCodeGenOptLevel(CodeGenOpt::Less);		CodeGen->setCodeGenOptLevel(CodeGenOpt::Less);
break;		break;
case '2':		case '2':
▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

llvm/tools/opt/CMakeLists.txt

Show All 11 Lines	set(LLVM_LINK_COMPONENTS
Coroutines		Coroutines
Extensions		Extensions
IPO		IPO
IRReader		IRReader
InstCombine		InstCombine
Instrumentation		Instrumentation
MC		MC
ObjCARCOpts		ObjCARCOpts
		Passes
		mtrofinUnsubmitted Not Done Reply Inline Actions Nit: make this change separately, and since it's just a style change, it can probably be just submitted with no review. mtrofin: Nit: make this change separately, and since it's just a style change, it can probably be just…
Remarks		Remarks
ScalarOpts		ScalarOpts
Support		Support
Target		Target
TransformUtils		TransformUtils
Vectorize		Vectorize
Passes
)		)

add_llvm_tool(opt		add_llvm_tool(opt
AnalysisWrappers.cpp		AnalysisWrappers.cpp
BreakpointPrinter.cpp		BreakpointPrinter.cpp
GraphPrinters.cpp		GraphPrinters.cpp
NewPMDriver.cpp		NewPMDriver.cpp
PassPrinters.cpp		PassPrinters.cpp
Show All 12 Lines

llvm/tools/opt/NewPMDriver.cpp

	Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines

	/// If one of the EPPipeline command line options was given, register callbacks			/// If one of the EPPipeline command line options was given, register callbacks
	/// for parsing and inserting the given pipeline			/// for parsing and inserting the given pipeline
	static void registerEPCallbacks(PassBuilder &PB, bool VerifyEachPass,			static void registerEPCallbacks(PassBuilder &PB, bool VerifyEachPass,
	bool DebugLogging) {			bool DebugLogging) {
	if (tryParsePipelineText<FunctionPassManager>(PB, PeepholeEPPipeline))			if (tryParsePipelineText<FunctionPassManager>(PB, PeepholeEPPipeline))
	PB.registerPeepholeEPCallback(			PB.registerPeepholeEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	FunctionPassManager &PM, PassBuilder::OptimizationLevel Level) {			FunctionPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse PeepholeEP pipeline: ");			ExitOnError Err("Unable to parse PeepholeEP pipeline: ");
	Err(PB.parsePassPipeline(PM, PeepholeEPPipeline, VerifyEachPass,			Err(PB.parsePassPipeline(PM, PeepholeEPPipeline, VerifyEachPass,
	DebugLogging));			DebugLogging));
	});			});
	if (tryParsePipelineText<LoopPassManager>(PB,			if (tryParsePipelineText<LoopPassManager>(PB,
	LateLoopOptimizationsEPPipeline))			LateLoopOptimizationsEPPipeline))
	PB.registerLateLoopOptimizationsEPCallback(			PB.registerLateLoopOptimizationsEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	LoopPassManager &PM, PassBuilder::OptimizationLevel Level) {			LoopPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse LateLoopOptimizationsEP pipeline: ");			ExitOnError Err("Unable to parse LateLoopOptimizationsEP pipeline: ");
	Err(PB.parsePassPipeline(PM, LateLoopOptimizationsEPPipeline,			Err(PB.parsePassPipeline(PM, LateLoopOptimizationsEPPipeline,
	VerifyEachPass, DebugLogging));			VerifyEachPass, DebugLogging));
	});			});
	if (tryParsePipelineText<LoopPassManager>(PB, LoopOptimizerEndEPPipeline))			if (tryParsePipelineText<LoopPassManager>(PB, LoopOptimizerEndEPPipeline))
	PB.registerLoopOptimizerEndEPCallback(			PB.registerLoopOptimizerEndEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	LoopPassManager &PM, PassBuilder::OptimizationLevel Level) {			LoopPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse LoopOptimizerEndEP pipeline: ");			ExitOnError Err("Unable to parse LoopOptimizerEndEP pipeline: ");
	Err(PB.parsePassPipeline(PM, LoopOptimizerEndEPPipeline,			Err(PB.parsePassPipeline(PM, LoopOptimizerEndEPPipeline,
	VerifyEachPass, DebugLogging));			VerifyEachPass, DebugLogging));
	});			});
	if (tryParsePipelineText<FunctionPassManager>(PB,			if (tryParsePipelineText<FunctionPassManager>(PB,
	ScalarOptimizerLateEPPipeline))			ScalarOptimizerLateEPPipeline))
	PB.registerScalarOptimizerLateEPCallback(			PB.registerScalarOptimizerLateEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	FunctionPassManager &PM, PassBuilder::OptimizationLevel Level) {			FunctionPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse ScalarOptimizerLateEP pipeline: ");			ExitOnError Err("Unable to parse ScalarOptimizerLateEP pipeline: ");
	Err(PB.parsePassPipeline(PM, ScalarOptimizerLateEPPipeline,			Err(PB.parsePassPipeline(PM, ScalarOptimizerLateEPPipeline,
	VerifyEachPass, DebugLogging));			VerifyEachPass, DebugLogging));
	});			});
	if (tryParsePipelineText<CGSCCPassManager>(PB, CGSCCOptimizerLateEPPipeline))			if (tryParsePipelineText<CGSCCPassManager>(PB, CGSCCOptimizerLateEPPipeline))
	PB.registerCGSCCOptimizerLateEPCallback(			PB.registerCGSCCOptimizerLateEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	CGSCCPassManager &PM, PassBuilder::OptimizationLevel Level) {			CGSCCPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse CGSCCOptimizerLateEP pipeline: ");			ExitOnError Err("Unable to parse CGSCCOptimizerLateEP pipeline: ");
	Err(PB.parsePassPipeline(PM, CGSCCOptimizerLateEPPipeline,			Err(PB.parsePassPipeline(PM, CGSCCOptimizerLateEPPipeline,
	VerifyEachPass, DebugLogging));			VerifyEachPass, DebugLogging));
	});			});
	if (tryParsePipelineText<FunctionPassManager>(PB, VectorizerStartEPPipeline))			if (tryParsePipelineText<FunctionPassManager>(PB, VectorizerStartEPPipeline))
	PB.registerVectorizerStartEPCallback(			PB.registerVectorizerStartEPCallback(
	[&PB, VerifyEachPass, DebugLogging](			[&PB, VerifyEachPass, DebugLogging](
	FunctionPassManager &PM, PassBuilder::OptimizationLevel Level) {			FunctionPassManager &PM, OptimizationLevel Level) {
	ExitOnError Err("Unable to parse VectorizerStartEP pipeline: ");			ExitOnError Err("Unable to parse VectorizerStartEP pipeline: ");
	Err(PB.parsePassPipeline(PM, VectorizerStartEPPipeline,			Err(PB.parsePassPipeline(PM, VectorizerStartEPPipeline,
	VerifyEachPass, DebugLogging));			VerifyEachPass, DebugLogging));
	});			});
	if (tryParsePipelineText<ModulePassManager>(PB, PipelineStartEPPipeline))			if (tryParsePipelineText<ModulePassManager>(PB, PipelineStartEPPipeline))
	PB.registerPipelineStartEPCallback(			PB.registerPipelineStartEPCallback(
	[&PB, VerifyEachPass, DebugLogging](ModulePassManager &PM) {			[&PB, VerifyEachPass, DebugLogging](ModulePassManager &PM) {
	ExitOnError Err("Unable to parse PipelineStartEP pipeline: ");			ExitOnError Err("Unable to parse PipelineStartEP pipeline: ");
	Err(PB.parsePassPipeline(PM, PipelineStartEPPipeline, VerifyEachPass,			Err(PB.parsePassPipeline(PM, PipelineStartEPPipeline, VerifyEachPass,
	DebugLogging));			DebugLogging));
	});			});
	if (tryParsePipelineText<FunctionPassManager>(PB, OptimizerLastEPPipeline))			if (tryParsePipelineText<FunctionPassManager>(PB, OptimizerLastEPPipeline))
	PB.registerOptimizerLastEPCallback(			PB.registerOptimizerLastEPCallback(
	[&PB, VerifyEachPass, DebugLogging](ModulePassManager &PM,			[&PB, VerifyEachPass, DebugLogging](ModulePassManager &PM,
	PassBuilder::OptimizationLevel) {			OptimizationLevel) {
	ExitOnError Err("Unable to parse OptimizerLastEP pipeline: ");			ExitOnError Err("Unable to parse OptimizerLastEP pipeline: ");
	Err(PB.parsePassPipeline(PM, OptimizerLastEPPipeline, VerifyEachPass,			Err(PB.parsePassPipeline(PM, OptimizerLastEPPipeline, VerifyEachPass,
	DebugLogging));			DebugLogging));
	});			});
	}			}

	#define HANDLE_EXTENSION(Ext) \			#define HANDLE_EXTENSION(Ext) \
	llvm::PassPluginLibraryInfo get##Ext##PluginInfo();			llvm::PassPluginLibraryInfo get##Ext##PluginInfo();
	▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

llvm/tools/opt/opt.cpp

Show First 20 Lines • Show All 387 Lines • ▼ Show 20 Lines
static void AddOptimizationPasses(legacy::PassManagerBase &MPM,		static void AddOptimizationPasses(legacy::PassManagerBase &MPM,
legacy::FunctionPassManager &FPM,		legacy::FunctionPassManager &FPM,
TargetMachine *TM, unsigned OptLevel,		TargetMachine *TM, unsigned OptLevel,
unsigned SizeLevel) {		unsigned SizeLevel) {
if (!NoVerify \|\| VerifyEach)		if (!NoVerify \|\| VerifyEach)
FPM.add(createVerifierPass()); // Verify that input is correct		FPM.add(createVerifierPass()); // Verify that input is correct

PassManagerBuilder Builder;		PassManagerBuilder Builder;
Builder.OptLevel = OptLevel;		Builder.OptLevel = {OptLevel, SizeLevel};
Builder.SizeLevel = SizeLevel;

if (DisableInline) {		if (DisableInline) {
// No inlining pass		// No inlining pass
} else if (OptLevel > 1) {		} else if (OptLevel > 1) {
Builder.Inliner = createFunctionInliningPass(OptLevel, SizeLevel, false);		Builder.Inliner = createFunctionInliningPass(OptLevel, SizeLevel, false);
} else {		} else {
Builder.Inliner = createAlwaysInlinerLegacyPass();		Builder.Inliner = createAlwaysInlinerLegacyPass();
}		}
Show All 39 Lines	static void AddOptimizationPasses(legacy::PassManagerBase &MPM,
Builder.populateFunctionPassManager(FPM);		Builder.populateFunctionPassManager(FPM);
Builder.populateModulePassManager(MPM);		Builder.populateModulePassManager(MPM);
}		}

static void AddStandardLinkPasses(legacy::PassManagerBase &PM) {		static void AddStandardLinkPasses(legacy::PassManagerBase &PM) {
PassManagerBuilder Builder;		PassManagerBuilder Builder;
Builder.VerifyInput = true;		Builder.VerifyInput = true;
if (DisableOptimizations)		if (DisableOptimizations)
Builder.OptLevel = 0;		Builder.OptLevel = OptimizationLevel::O0;

if (!DisableInline)		if (!DisableInline)
Builder.Inliner = createFunctionInliningPass();		Builder.Inliner = createFunctionInliningPass();
Builder.populateLTOPassManager(PM);		Builder.populateLTOPassManager(PM);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// CodeGen-related helper functions.		// CodeGen-related helper functions.
▲ Show 20 Lines • Show All 565 Lines • Show Last 20 Lines

mlir/lib/ExecutionEngine/OptUtils.cpp

	Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines

	// Populate pass managers according to the optimization and size levels.			// Populate pass managers according to the optimization and size levels.
	// This behaves similarly to LLVM opt.			// This behaves similarly to LLVM opt.
	static void populatePassManagers(llvm::legacy::PassManager &modulePM,			static void populatePassManagers(llvm::legacy::PassManager &modulePM,
	llvm::legacy::FunctionPassManager &funcPM,			llvm::legacy::FunctionPassManager &funcPM,
	unsigned optLevel, unsigned sizeLevel,			unsigned optLevel, unsigned sizeLevel,
	llvm::TargetMachine *targetMachine) {			llvm::TargetMachine *targetMachine) {
	llvm::PassManagerBuilder builder;			llvm::PassManagerBuilder builder;
	builder.OptLevel = optLevel;			builder.OptLevel = {optLevel, sizeLevel};
	builder.SizeLevel = sizeLevel;
	builder.Inliner = llvm::createFunctionInliningPass(			builder.Inliner = llvm::createFunctionInliningPass(
	optLevel, sizeLevel, /DisableInlineHotCallSite=/false);			optLevel, sizeLevel, /DisableInlineHotCallSite=/false);
	builder.LoopVectorize = optLevel > 1 && sizeLevel < 2;			builder.LoopVectorize = optLevel > 1 && sizeLevel < 2;
	builder.SLPVectorize = optLevel > 1 && sizeLevel < 2;			builder.SLPVectorize = optLevel > 1 && sizeLevel < 2;
	builder.DisableUnrollLoops = (optLevel == 0);			builder.DisableUnrollLoops = (optLevel == 0);

	if (targetMachine) {			if (targetMachine) {
	// Add pass to initialize TTI for this specific target. Otherwise, TTI will			// Add pass to initialize TTI for this specific target. Otherwise, TTI will
	▲ Show 20 Lines • Show All 65 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Size LTO (1/3): Standardizing the use of OptimizationLevel Needs ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 268668

clang/lib/CodeGen/BackendUtil.cpp

lld/COFF/LTO.cpp

lld/ELF/LTO.cpp

lld/wasm/LTO.cpp

llvm/examples/Bye/Bye.cpp

llvm/include/llvm/IR/PassManager.h

llvm/include/llvm/LTO/Config.h

llvm/include/llvm/LTO/legacy/LTOCodeGenerator.h

llvm/include/llvm/LTO/legacy/ThinLTOCodeGenerator.h

llvm/include/llvm/Passes/PassBuilder.h

llvm/include/llvm/Transforms/IPO/PassManagerBuilder.h

llvm/lib/IR/PassManager.cpp

llvm/lib/LTO/LTO.cpp

llvm/lib/LTO/LTOBackend.cpp

llvm/lib/LTO/LTOCodeGenerator.cpp

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

llvm/lib/Passes/PassBuilder.cpp

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

llvm/tools/bugpoint/bugpoint.cpp

llvm/tools/llvm-lto/llvm-lto.cpp

llvm/tools/llvm-lto2/llvm-lto2.cpp

llvm/tools/lto/lto.cpp

llvm/tools/opt/CMakeLists.txt

llvm/tools/opt/NewPMDriver.cpp

llvm/tools/opt/opt.cpp

mlir/lib/ExecutionEngine/OptUtils.cpp

Size LTO (1/3): Standardizing the use of OptimizationLevel
Needs ReviewPublic