This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/
-
COFF/
-
Config.h
-
Driver.cpp
-
LTO.cpp
-
ELF/
-
Config.h
-
Driver.cpp
-
LTO.cpp
-
test/
-
COFF/
-
lto-opt-level.ll
-
ELF/lto/
-
lto/
-
opt-level.ll
-
wasm/lto/
-
lto/
-
opt-level.ll
-
wasm/
-
Config.h
-
Driver.cpp
-
LTO.cpp
-
llvm/
-
include/llvm/LTO/
-
llvm/
-
LTO/
-
Config.h
-
LTO.h
-
lib/LTO/
-
LTO/
-
LTO.cpp
-
LTOBackend.cpp
-
test/tools/
-
tools/
-
gold/X86/
-
X86/
-
opt-level.ll
-
lto/
-
opt-level.ll
-
tools/
-
gold/
-
gold-plugin.cpp
-
llvm-lto2/
-
llvm-lto2.cpp

Differential D72404

[ThinLTO/FullLTO] Support Os and Oz
Needs ReviewPublic

Authored by ychen on Jan 8 2020, 10:35 AM.

Download Raw Diff

Details

Reviewers

pcc
steven_wu
tejohnson
mehdi_amini
• espindola
MaskRay

Summary

When optnone is not present, add optsize for Os and minsize for Oz.
Pass Os/Oz to (old/new PM) LTO pipeline setup.
Ran SPEC2017 & test-suite MultiSource to confirm the size reduction are comparable with NonLTO.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	900 ms	lld.ELF::Unknown Unit Message ("")

Event Timeline

ychen created this revision.Jan 8 2020, 10:35 AM

Herald added a reviewer: • espindola. · View Herald TranscriptJan 8 2020, 10:35 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, dang, dexonsmith and 8 others. · View Herald Transcript

ychen retitled this revision from [ThinLTO] Support Os and Oz to [WIP][ThinLTO] Support Os and Oz.Jan 8 2020, 10:35 AM

This is going in the wrong direction IMO. We already have a way of specifying the size optimization level, which is to specify -Os or -Oz at compile time. Why is that not sufficient?

This revision now requires changes to proceed.Jan 8 2020, 10:47 AM

ychen edited the summary of this revision. (Show Details)Jan 8 2020, 10:47 AM

ormris added a subscriber: ormris.Jan 8 2020, 10:50 AM

In D72404#1810489, @pcc wrote:

This is going in the wrong direction IMO. We already have a way of specifying the size optimization level, which is to specify -Os or -Oz at compile time. Why is that not sufficient?

Thanks for the quick feedback, @pcc. :-) I saw your comments somewhere on this.
I don't know why is this changing directions. It only gives the user choice to optimize for size at link stage.
They still have the freedom the do this at compile stage.

From user perspective, if O1, O2 etc. are valid options for linker stage,
I don't see a reason Os and Oz can not be used or not do the right thing.

With ThinLTO, many decisions are postponed til linker stage to have more flexibility. I guess this is one of them?

Unit tests: fail. 60542 tests passed, 1 failed and 726 were skipped.

failed: lld.wasm/lto/opt-level.ll

clang-tidy: fail. Please fix clang-tidy findings.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43518: Diff 236860!Jan 8 2020, 11:18 AM

fix test

fix test

Unit tests: unknown.

clang-tidy: unknown.

clang-format: unknown.

Build artifacts: diff.json, console-log.txt

Harbormaster failed remote builds in B43532: Diff 236907!Jan 8 2020, 2:33 PM

Unit tests: fail. 60542 tests passed, 1 failed and 726 were skipped.

failed: lld.ELF/note.s

clang-tidy: fail. Please fix clang-tidy findings.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43533: Diff 236908!Jan 8 2020, 3:10 PM

ychen removed a subscriber: merge_guards_bot.Jan 8 2020, 3:27 PM

when optnone is not present, add optsize for Os and minsize for Oz
add Os Oz function attribute test.
add Os Oz pipeline test

Herald added a project: Restricted Project. · View Herald TranscriptJan 14 2020, 10:15 AM

Herald added subscribers: cfe-commits, jfb. · View Herald Transcript

fix a typo

Rebase

Unit tests: fail. 60543 tests passed, 1 failed and 726 were skipped.

failed: Clang.CodeGen/thinlto-debug-pm.c

clang-tidy: unknown.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43970: Diff 238028!Jan 14 2020, 11:23 AM

Unit tests: fail. 60543 tests passed, 1 failed and 726 were skipped.

failed: Clang.CodeGen/thinlto-debug-pm.c

clang-tidy: unknown.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43972: Diff 238034!Jan 14 2020, 11:32 AM

Unit tests: fail. 61849 tests passed, 1 failed and 781 were skipped.

failed: Clang.CodeGen/thinlto-debug-pm.c

clang-tidy: unknown.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43978: Diff 238042!Jan 14 2020, 11:42 AM

MaskRay added subscribers: mehdi_amini, tejohnson.Jan 14 2020, 11:51 AM

MaskRay added a subscriber: tycho.

rebase & clang-format

ychen retitled this revision from [WIP][ThinLTO] Support Os and Oz to [ThinLTO/FullLTO] Support Os and Oz.Jan 14 2020, 12:39 PM

ychen edited the summary of this revision. (Show Details)

ychen edited reviewers, added: steven_wu, tejohnson, mehdi_amini; removed: • espindola.

Herald added a reviewer: • espindola. · View Herald TranscriptJan 14 2020, 12:39 PM

Unit tests: fail. 61858 tests passed, 1 failed and 781 were skipped.

failed: Clang.CodeGen/thinlto-debug-pm.c

clang-tidy: unknown.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster failed remote builds in B43988: Diff 238071!Jan 14 2020, 12:59 PM

In D72404#1820461, @merge_guards_bot wrote:
Unit tests: fail. 61858 tests passed, 1 failed and 781 were skipped.
failed: Clang.CodeGen/thinlto-debug-pm.c

I think that failure can be fixed with something like this:

diff --git a/clang/test/CodeGen/thinlto-debug-pm.c b/clang/test/CodeGen/thinlto-debug-pm.c
index 5f449d493af..185a8c8fb8b 100644
--- a/clang/test/CodeGen/thinlto-debug-pm.c
+++ b/clang/test/CodeGen/thinlto-debug-pm.c
@@ -13,17 +13,17 @@
 // O0123sz-NEWPM: Running analysis: PassInstrumentationAnalysis
 // O0123sz-NEWPM: Starting llvm::Module pass manager run.
 // O0123sz-NEWPM: Running pass: WholeProgramDevirtPass
-// O0123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O0123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O0123sz-NEWPM: Running pass: LowerTypeTestsPass
 // O0123sz-NEWPM: Invalidating all non-preserved analyses for:
-// O0123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O0123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ForceFunctionAttrsPass
 // O123sz-NEWPM: Running pass: PassManager<llvm::Module>
 // O123sz-NEWPM: Starting llvm::Module pass manager run.
 // O123sz-NEWPM: Running pass: PGOIndirectCallPromotion
 // O123sz-NEWPM: Running analysis: ProfileSummaryAnalysis
 // O123sz-NEWPM: Running pass: InferFunctionAttrsPass
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ModuleToFunctionPassAdaptor<llvm::PassManager<llvm::Function> >
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on foo
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
@@ -41,9 +41,9 @@
 // O123sz-NEWPM: Running pass: CalledValuePropagationPass
 // O123sz-NEWPM: Running pass: GlobalOptPass
 // O123sz-NEWPM: Invalidating all non-preserved analyses for:
-// O123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ModuleToFunctionPassAdaptor<llvm::PromotePass>
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running analysis: DominatorTreeAnalysis on foo
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on foo
 // O123sz-NEWPM: Running analysis: AssumptionAnalysis on foo
@@ -57,20 +57,20 @@
 // O123sz-NEWPM: Running analysis: BasicAA on foo
 // O123sz-NEWPM: Running analysis: ScopedNoAliasAA on foo
 // O123sz-NEWPM: Running analysis: TypeBasedAA on foo
-// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::AnalysisManager<llvm::Module>, llvm::Function> on foo
+// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::ModuleAnalysisManager, llvm::Function> on foo
 // O123sz-NEWPM: Running pass: SimplifyCFGPass on foo
 // O123sz-NEWPM: Running analysis: TargetIRAnalysis on foo
 // O123sz-NEWPM: Finished llvm::Function pass manager run.
-// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::GlobalsAA, llvm::Module>
+// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::GlobalsAA, llvm::Module, llvm::AnalysisManager<llvm::Module>>
 // O123sz-NEWPM: Running analysis: GlobalsAA
 // O123sz-NEWPM: Running analysis: CallGraphAnalysis
-// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::ProfileSummaryAnalysis, llvm::Module>
-// O123sz-NEWPM: Running pass: ModuleToPostOrderCGSCCPassAdaptor<llvm::DevirtSCCRepeatedPass<llvm::PassManager<llvm::LazyCallGraph::SCC, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&> > >
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::Module>
+// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::ProfileSummaryAnalysis, llvm::Module, llvm::AnalysisManager<llvm::Module>>
+// O123sz-NEWPM: Running pass: ModuleToPostOrderCGSCCPassAdaptor<llvm::DevirtSCCRepeatedPass<llvm::PassManager<LazyCallGraph::SCC, llvm::CGSCCAnalysisManager, llvm::LazyCallGraph &, llvm::CGSCCUpdateResult &> > >
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::CGSCCAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running analysis: LazyCallGraphAnalysis
 // O123sz-NEWPM: Running analysis: FunctionAnalysisManagerCGSCCProxy on (foo)
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on (foo)
-// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::AnalysisManager<llvm::Module>, llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&> on (foo)
+// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::ModuleAnalysisManager, LazyCallGraph::SCC, llvm::LazyCallGraph &>
 // O123sz-NEWPM: Starting CGSCC pass manager run.
 // O123sz-NEWPM: Running pass: InlinerPass on (foo)
 // O123sz-NEWPM: Running pass: PostOrderFunctionAttrsPass on (foo)
@@ -91,8 +91,8 @@
 // O23sz-NEWPM: Running pass: TailCallElimPass on foo
 // O123sz-NEWPM: Running pass: SimplifyCFGPass on foo
 // O123sz-NEWPM: Running pass: ReassociatePass on foo
-// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::OptimizationRemarkEmitterAnalysis, llvm::Function> on foo
-// O123sz-NEWPM: Running pass: FunctionToLoopPassAdaptor<llvm::PassManager<llvm::Loop, llvm::AnalysisManager<llvm::Loop, llvm::LoopStandardAnalysisResults&>, llvm::LoopStandardAnalysisResults&, llvm::LPMUpdater&> > on foo
+// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::OptimizationRemarkEmitterAnalysis, llvm::Function, llvm::AnalysisManager<llvm::Function>>
+// O123sz-NEWPM: Running pass: FunctionToLoopPassAdaptor<llvm::PassManager<llvm::Loop, llvm::LoopAnalysisManager, llvm::LoopStandardAnalysisResults &, llvm::LPMUpdater &> >
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: LoopSimplifyPass on foo
 // O123sz-NEWPM: Running analysis: LoopAnalysis on foo
@@ -100,7 +100,7 @@
 // O123sz-NEWPM: Finished llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: SimplifyCFGPass on foo
 // O123sz-NEWPM: Running pass: InstCombinePass on foo
-// O123sz-NEWPM: Running pass: FunctionToLoopPassAdaptor<llvm::PassManager<llvm::Loop, llvm::AnalysisManager<llvm::Loop, llvm::LoopStandardAnalysisResults&>, llvm::LoopStandardAnalysisResults&, llvm::LPMUpdater&> > on foo
+// O123sz-NEWPM: Running pass: FunctionToLoopPassAdaptor<llvm::PassManager<llvm::Loop, llvm::LoopAnalysisManager, llvm::LoopStandardAnalysisResults &, llvm::LPMUpdater &> >
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: LoopSimplifyPass on foo
 // O123sz-NEWPM: Running pass: LCSSAPass on foo
@@ -137,7 +137,7 @@
 // O123sz-NEWPM: Running pass: GlobalDCEPass
 // O123sz-NEWPM: Running pass: EliminateAvailableExternallyPass
 // O123sz-NEWPM: Running pass: ReversePostOrderFunctionAttrsPass
-// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::GlobalsAA, llvm::Module>
+// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::GlobalsAA, llvm::Module, llvm::AnalysisManager<llvm::Module>>
 // O123sz-NEWPM: Running pass: ModuleToFunctionPassAdaptor<llvm::PassManager<llvm::Function> >
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: Float2IntPass on foo
@@ -149,7 +149,7 @@
 // O123sz-NEWPM: Finished llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: LoopDistributePass on foo
 // O123sz-NEWPM: Running analysis: ScalarEvolutionAnalysis on foo
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Loop, llvm::LoopStandardAnalysisResults&>, llvm::Function> on foo
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::LoopAnalysisManager, llvm::Function>
 // O123sz-NEWPM: Running pass: LoopVectorizePass on foo
 // O123sz-NEWPM: Running analysis: BlockFrequencyAnalysis on foo
 // O123sz-NEWPM: Running analysis: BranchProbabilityAnalysis on foo
@@ -160,7 +160,7 @@
 // O123sz-NEWPM: Running pass: LoopUnrollPass on foo
 // O123sz-NEWPM: Running pass: WarnMissedTransformationsPass on foo
 // O123sz-NEWPM: Running pass: InstCombinePass on foo
-// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::OptimizationRemarkEmitterAnalysis, llvm::Function> on foo
+// O123sz-NEWPM: Running pass: RequireAnalysisPass<llvm::OptimizationRemarkEmitterAnalysis, llvm::Function, llvm::AnalysisManager<llvm::Function>> on foo
 // O123sz-NEWPM: Running pass: FunctionToLoopPassAdaptor<llvm::LICMPass> on foo
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
 // O123sz-NEWPM: Running pass: LoopSimplifyPass on foo

EDIT: Had to make more changes than I thought. Above passes the test.

In D72404#1820628, @tycho wrote:

In D72404#1820461, @merge_guards_bot wrote:
Unit tests: fail. 61858 tests passed, 1 failed and 781 were skipped.
failed: Clang.CodeGen/thinlto-debug-pm.c

I think that failure can be fixed with something like this:

diff --git a/clang/test/CodeGen/thinlto-debug-pm.c b/clang/test/CodeGen/thinlto-debug-pm.c
index 5f449d493af..9d6d69afd13 100644
--- a/clang/test/CodeGen/thinlto-debug-pm.c
+++ b/clang/test/CodeGen/thinlto-debug-pm.c
@@ -13,17 +13,17 @@
 // O0123sz-NEWPM: Running analysis: PassInstrumentationAnalysis
 // O0123sz-NEWPM: Starting llvm::Module pass manager run.
 // O0123sz-NEWPM: Running pass: WholeProgramDevirtPass
-// O0123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O0123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O0123sz-NEWPM: Running pass: LowerTypeTestsPass
 // O0123sz-NEWPM: Invalidating all non-preserved analyses for:
-// O0123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O0123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ForceFunctionAttrsPass
 // O123sz-NEWPM: Running pass: PassManager<llvm::Module>
 // O123sz-NEWPM: Starting llvm::Module pass manager run.
 // O123sz-NEWPM: Running pass: PGOIndirectCallPromotion
 // O123sz-NEWPM: Running analysis: ProfileSummaryAnalysis
 // O123sz-NEWPM: Running pass: InferFunctionAttrsPass
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ModuleToFunctionPassAdaptor<llvm::PassManager<llvm::Function> >
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on foo
 // O123sz-NEWPM: Starting llvm::Function pass manager run.
@@ -41,9 +41,9 @@
 // O123sz-NEWPM: Running pass: CalledValuePropagationPass
 // O123sz-NEWPM: Running pass: GlobalOptPass
 // O123sz-NEWPM: Invalidating all non-preserved analyses for:
-// O123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Invalidating analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running pass: ModuleToFunctionPassAdaptor<llvm::PromotePass>
-// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::AnalysisManager<llvm::Function>, llvm::Module>
+// O123sz-NEWPM: Running analysis: InnerAnalysisManagerProxy<llvm::FunctionAnalysisManager, llvm::Module>
 // O123sz-NEWPM: Running analysis: DominatorTreeAnalysis on foo
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on foo
 // O123sz-NEWPM: Running analysis: AssumptionAnalysis on foo
@@ -57,7 +57,7 @@
 // O123sz-NEWPM: Running analysis: BasicAA on foo
 // O123sz-NEWPM: Running analysis: ScopedNoAliasAA on foo
 // O123sz-NEWPM: Running analysis: TypeBasedAA on foo
-// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::AnalysisManager<llvm::Module>, llvm::Function> on foo
+// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::ModuleAnalysisManager, llvm::Function> on foo
 // O123sz-NEWPM: Running pass: SimplifyCFGPass on foo
 // O123sz-NEWPM: Running analysis: TargetIRAnalysis on foo
 // O123sz-NEWPM: Finished llvm::Function pass manager run.
@@ -70,7 +70,7 @@
 // O123sz-NEWPM: Running analysis: LazyCallGraphAnalysis
 // O123sz-NEWPM: Running analysis: FunctionAnalysisManagerCGSCCProxy on (foo)
 // O123sz-NEWPM: Running analysis: PassInstrumentationAnalysis on (foo)
-// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::AnalysisManager<llvm::Module>, llvm::LazyCallGraph::SCC, llvm::LazyCallG>
+// O123sz-NEWPM: Running analysis: OuterAnalysisManagerProxy<llvm::ModuleAnalysisManager, llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&> o>
 // O123sz-NEWPM: Starting CGSCC pass manager run.
 // O123sz-NEWPM: Running pass: InlinerPass on (foo)
 // O123sz-NEWPM: Running pass: PostOrderFunctionAttrsPass on (foo)

Thank you. It turns out that GCC&Clang disagree on __PRETTY_FUNCTION__ for type alias (using). will fix those.

fix test when clang is the host compiler

Harbormaster failed remote builds in B44016: Diff 238142!Jan 14 2020, 5:08 PM

Unit tests: pass. 61859 tests passed, 0 failed and 781 were skipped.

clang-tidy: unknown.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

dblaikie added a subscriber: dblaikie.Mar 26 2020, 6:20 PM

ychen mentioned this in D79818: [lld] Support size levels with (Thin)LTO.May 12 2020, 4:29 PM

I would be very interested in this patch, because for me to use ThinLTO without size regressions I need to set the optimization size level in the linker (PassManagerBuilder.SizeLevel etc).
This patch seems mostly correct to me, except for the function attributes. These function attributes (optsize, minsize) should IMHO be set in the frontend, not in the linker.

Apart from that, would it be reasonable to implement this in the form of -plugin-opt=Os and -plugin-opt=Oz? That is perhaps a cleaner UI (doesn't need a new flag!) and more cleanly maps to PassBuilder::Oz and the like.

Herald added a reviewer: MaskRay. · View Herald TranscriptFeb 8 2022, 4:28 AM

I just saw this. I know it is a good idea to have choice during link time for the pipeline configuration and from your benchmark it also has size impact on the output. I also feel like this is going in the wrong direction as if I have part of the object files built with -O3 and part built with -Oz, I need to make a choice of unoptimized part of the program.

Before I say yes or no to this patch, have you figured out what are the passes that causes the most size regression? Ideally, with function attributes on the function, it shouldn't be much size impact on the output.

In D72404#3304205, @aykevl wrote:

I would be very interested in this patch, because for me to use ThinLTO without size regressions I need to set the optimization size level in the linker (PassManagerBuilder.SizeLevel etc).

Why can't you build the object files with -Os in the first place instead of changing the optimization level between the compilation and the linking phases?

Apparently there is also another patch that tries to do something very similar: D81223.

In D72404#3305275, @mehdi_amini wrote:

Why can't you build the object files with -Os in the first place instead of changing the optimization level between the compilation and the linking phases?

I'm not changing the optimization level. The bitcode files are built with an equivalent of -Oz and have the optsize and minsize attributes. It's the optimization passes in the linker (ThinLTO) that don't respect these attributes.

In D72404#3305267, @steven_wu wrote:

Before I say yes or no to this patch, have you figured out what are the passes that causes the most size regression? Ideally, with function attributes on the function, it shouldn't be much size impact on the output.

Unfortunately, there is an impact. I did a quick test with a small program (around 4.7kB compiled code) and it looks like the LoopRotate pass is the main culprit. If MaxHeaderSize is set to 0 instead of -1, the code size regression is avoided. The following hacky patch avoids the code size increase:

diff --git a/llvm/lib/Transforms/IPO/PassManagerBuilder.cpp b/llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
index aa916345954d..99be1926cf34 100644
--- a/llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
+++ b/llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
@@ -450,7 +450,7 @@ void PassManagerBuilder::addFunctionSimplificationPasses(
   // TODO: Investigate promotion cap for O1.
   MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
   // Rotate Loop - disable header duplication at -Oz
-  MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1, PrepareForLTO));
+  MPM.add(createLoopRotatePass(0/*SizeLevel == 2 ? 0 : -1*/, PrepareForLTO));
   // TODO: Investigate promotion cap for O1.
   MPM.add(createLICMPass(LicmMssaOptCap, LicmMssaNoAccForPromotionCap));
   if (EnableSimpleLoopUnswitch)
@@ -917,7 +917,7 @@ void PassManagerBuilder::populateModulePassManager(
   // Re-rotate loops in all our loop nests. These may have fallout out of
   // rotated form due to GVN or other transformations, and the vectorizer relies
   // on the rotated form. Disable header duplication at -Oz.
-  MPM.add(createLoopRotatePass(SizeLevel == 2 ? 0 : -1, PrepareForLTO));
+  MPM.add(createLoopRotatePass(0 /*SizeLevel == 2 ? 0 : -1*/, PrepareForLTO));
 
   // Distribute loops to allow partial vectorization.  I.e. isolate dependences
   // into separate loop that would otherwise inhibit vectorization.  This is

So, should all passes just look at the optsize and minsize attributes instead of the SizeLevel? In other words, should PassManagerBuilder.SizeLevel be removed and should passes only look at function attributes instead of SizeLevel? Because at the moment, it's a weird mix of both. IMHO size level should either all go via function attributes or via a flag, not something in between as it is now.
Also, if size level is done via function attributes, why not optimization level? There is already optnone. I'm not saying that's better, but right now I don't see the logic in this whole system.

After some more testing on a larger amount of code (many small programs, together over 1MB in binary size), LoopRotate indeed seems to be the culprit. I'm now looking into a patch to LoopRotate to respect the optsize function attribute.

EDIT: see D119342.

aykevl mentioned this in D119342: [LoopRotate] Don't rotate loops when the minsize attribute is present.Feb 9 2022, 7:26 AM

In D72404#3307623, @aykevl wrote:

So, should all passes just look at the optsize and minsize attributes instead of the SizeLevel? In other words, should PassManagerBuilder.SizeLevel be removed and should passes only look at function attributes instead of SizeLevel? Because at the moment, it's a weird mix of both. IMHO size level should either all go via function attributes or via a flag, not something in between as it is now.

I agree, I don't know (other than history) why we couldn't move towards removing PassManagerBuilder.SizeLevel?

Also, if size level is done via function attributes, why not optimization level? There is already optnone. I'm not saying that's better, but right now I don't see the logic in this whole system.

Despite what gcc and clang exposes to their users, at the LLVM level we don't have a single dimension on which to put O1/O2/O3 compared to Os/Oz. These also may not make sense for every single compiler out-there: O1/O2/O3 for clang may not be the right pass pipeline for my proprietary shader compiler.
Another reason why O1/O2/O3 are not making much such to be in the IR is that the IR is intended to be stored and reloaded any time in the middle of pipeline. LTO is an example of this, so we don't really want to store the "list of pass to run" in the IR.
Finally, we don't want to (or we can't really...) teach passes about whether they should execute during O1/O2/O3, while optnone is just a "fuse" to disable them all. It is also convenient to have optnone as a function attribute because it allows to selectively disable the optimizer on a per function basis. On the other hand because of the nature of the pass pipeline, it can't be tweaked on a per function basis (what about Module passes?).

So O1/O2/O3 is somehow just like an alias for an arbitrary list of passes (not really arbitrary, it is a good "default" one for clang-style IR), on the other hand the optsize/minsize are driving heuristic and can be orthogonal to the pass pipeline / controlled on a per-function basis and made available to every pass: they convey an "optimization goal" that applies to every pass individually.

@mehdi_amini thanks for explaining! D119342 moves slightly closer to removing SizeLevel from the pass pipeline setup.

In other news, I found a workaround that can be used to avoid the size increase due to LoopRotate (until D119342 is merged). Basically, just pass the flags -mllvm --rotation-max-header-size=0 to ld.lld when compiling with -Oz.

In D72404#3310704, @aykevl wrote:

@mehdi_amini thanks for explaining! D119342 moves slightly closer to removing SizeLevel from the pass pipeline setup.

I left a comment on D119342 - I think that is the right way to go. As mentioned there, there is still some legacy handling of options passed down from the driver, but over time we've been trying to move things to use function attributes, for the reasons @mehdi_amini mentioned. To expand on one of the reasons Mehdi mentioned: using function attributes naturally handles LTO linking in the case where one file is compiled -flto -Os and another is compiled -flto -O2 the same way as if you compiled the two files with those different flags all the way down to native code without LTO. Using the approach in this pass you would be forced to pick either -Os or -O2 for both files at LTO link time.

MaskRay mentioned this in D113738: [LTO] Allow passing -Os/-Oz as the optimization level.Feb 14 2022, 9:59 AM

Revision Contents

Path

Size

lld/

COFF/

Config.h

1 line

Driver.cpp

4 lines

LTO.cpp

1 line

ELF/

Config.h

1 line

Driver.cpp

5 lines

LTO.cpp

1 line

test/

COFF/

lto-opt-level.ll

4 lines

ELF/

lto/

opt-level.ll

10 lines

wasm/

lto/

opt-level.ll

6 lines

wasm/

Config.h

1 line

Driver.cpp

5 lines

LTO.cpp

1 line

llvm/

include/

llvm/

LTO/

Config.h

1 line

LTO.h

24 lines

lib/

LTO/

LTO.cpp

39 lines

LTOBackend.cpp

69 lines

test/

tools/

gold/

X86/

opt-level.ll

16 lines

lto/

opt-level.ll

4 lines

tools/

gold/

gold-plugin.cpp

14 lines

llvm-lto2/

llvm-lto2.cpp

14 lines

Diff 236908

lld/COFF/Config.h

Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines	struct Configuration {
GuardCFLevel guardCF = GuardCFLevel::Off;		GuardCFLevel guardCF = GuardCFLevel::Off;

// Used for SafeSEH.		// Used for SafeSEH.
bool safeSEH = false;		bool safeSEH = false;
Symbol *sehTable = nullptr;		Symbol *sehTable = nullptr;
Symbol *sehCount = nullptr;		Symbol *sehCount = nullptr;

// Used for /opt:lldlto=N		// Used for /opt:lldlto=N
unsigned ltoo = 2;		unsigned ltoo = 2;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming]
		unsigned ltoos = 0;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming]

// Used for /opt:lldltojobs=N		// Used for /opt:lldltojobs=N
unsigned thinLTOJobs = 0;		unsigned thinLTOJobs = 0;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming]
// Used for /opt:lldltopartitions=N		// Used for /opt:lldltopartitions=N
unsigned ltoPartitions = 1;		unsigned ltoPartitions = 1;

// Used for /opt:lldltocache=path		// Used for /opt:lldltocache=path
StringRef ltoCache;		StringRef ltoCache;
// Used for /opt:lldltocachepolicy=policy		// Used for /opt:lldltocachepolicy=policy
llvm::CachePruningPolicy ltoCachePolicy;		llvm::CachePruningPolicy ltoCachePolicy;

▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

lld/COFF/Driver.cpp

Show First 20 Lines • Show All 1,399 Lines • ▼ Show 20 Lines	for (StringRef s : vec) {
icfLevel = 2;		icfLevel = 2;
} else if (s == "noicf") {		} else if (s == "noicf") {
icfLevel = 0;		icfLevel = 0;
} else if (s == "lldtailmerge") {		} else if (s == "lldtailmerge") {
tailMerge = 2;		tailMerge = 2;
} else if (s == "nolldtailmerge") {		} else if (s == "nolldtailmerge") {
tailMerge = 0;		tailMerge = 0;
} else if (s.startswith("lldlto=")) {		} else if (s.startswith("lldlto=")) {
StringRef optLevel = s.substr(7);		StringRef optLevel = s.substr(7);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'optLevel' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'optLevel' [readability-identifier-naming]
if (optLevel.getAsInteger(10, config->ltoo) \|\| config->ltoo > 3)		config->ltoo = CHECK(llvm::lto::getOptLevel(optLevel), "/opt:lldlto: ");
error("/opt:lldlto: invalid optimization level: " + optLevel);		config->ltoos = check(llvm::lto::getSizeLevel(optLevel));
} else if (s.startswith("lldltojobs=")) {		} else if (s.startswith("lldltojobs=")) {
StringRef jobs = s.substr(11);		StringRef jobs = s.substr(11);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'jobs' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'jobs' [readability-identifier-naming]
if (jobs.getAsInteger(10, config->thinLTOJobs) \|\|		if (jobs.getAsInteger(10, config->thinLTOJobs) \|\|
config->thinLTOJobs == 0)		config->thinLTOJobs == 0)
error("/opt:lldltojobs: invalid job count: " + jobs);		error("/opt:lldltojobs: invalid job count: " + jobs);
} else if (s.startswith("lldltopartitions=")) {		} else if (s.startswith("lldltopartitions=")) {
StringRef n = s.substr(17);		StringRef n = s.substr(17);
if (n.getAsInteger(10, config->ltoPartitions) \|\|		if (n.getAsInteger(10, config->ltoPartitions) \|\|
config->ltoPartitions == 0)		config->ltoPartitions == 0)
error("/opt:lldltopartitions: invalid partition count: " + n);		error("/opt:lldltopartitions: invalid partition count: " + n);
▲ Show 20 Lines • Show All 586 Lines • Show Last 20 Lines

lld/COFF/LTO.cpp

Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	static lto::Config createConfig() {
// using the PIC model (see PR34306).		// using the PIC model (see PR34306).
if (config->machine == COFF::IMAGE_FILE_MACHINE_I386)		if (config->machine == COFF::IMAGE_FILE_MACHINE_I386)
c.RelocModel = Reloc::Static;		c.RelocModel = Reloc::Static;
else		else
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
c.DisableVerify = true;		c.DisableVerify = true;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = config->ltoo;
		c.SizeLevel = config->ltoos;
c.CPU = getCPUStr();		c.CPU = getCPUStr();
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);

if (config->saveTemps)		if (config->saveTemps)
checkError(c.addSaveTemps(std::string(config->outputFile) + ".",		checkError(c.addSaveTemps(std::string(config->outputFile) + ".",
/UseInputModulePath/ true));		/UseInputModulePath/ true));
return c;		return c;
▲ Show 20 Lines • Show All 122 Lines • Show Last 20 Lines

lld/ELF/Config.h

Show First 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	struct Configuration {
BuildIdKind buildId = BuildIdKind::None;		BuildIdKind buildId = BuildIdKind::None;
SeparateSegmentKind zSeparate;		SeparateSegmentKind zSeparate;
ELFKind ekind = ELFNoneKind;		ELFKind ekind = ELFNoneKind;
uint16_t emachine = llvm::ELF::EM_NONE;		uint16_t emachine = llvm::ELF::EM_NONE;
llvm::Optional<uint64_t> imageBase;		llvm::Optional<uint64_t> imageBase;
uint64_t commonPageSize;		uint64_t commonPageSize;
uint64_t maxPageSize;		uint64_t maxPageSize;
uint64_t mipsGotSize;		uint64_t mipsGotSize;
uint64_t zStackSize;		uint64_t zStackSize;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'zStackSize' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'zStackSize' [readability-identifier-naming]
unsigned ltoPartitions;		unsigned ltoPartitions;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoPartitions' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoPartitions' [readability-identifier…
unsigned ltoo;		unsigned ltoo;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming]
		unsigned ltoos;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming]
unsigned optimize;		unsigned optimize;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'optimize' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'optimize' [readability-identifier-naming]
unsigned thinLTOJobs;		unsigned thinLTOJobs;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming]
int32_t splitStackAdjustSize;		int32_t splitStackAdjustSize;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'splitStackAdjustSize' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'splitStackAdjustSize' [readability…

// The following config options do not directly correspond to any		// The following config options do not directly correspond to any
// particular command line options.		// particular command line options.

// True if we need to pass through relocations in input files to the		// True if we need to pass through relocations in input files to the
// output file. Usually false because we consume relocations.		// output file. Usually false because we consume relocations.
bool copyRelocs;		bool copyRelocs;

▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

lld/ELF/Driver.cpp

Show First 20 Lines • Show All 889 Lines • ▼ Show 20 Lines	config->ignoreFunctionAddressEquality =
args.hasArg(OPT_ignore_function_address_equality);		args.hasArg(OPT_ignore_function_address_equality);
config->init = args.getLastArgValue(OPT_init, "_init");		config->init = args.getLastArgValue(OPT_init, "_init");
config->ltoAAPipeline = args.getLastArgValue(OPT_lto_aa_pipeline);		config->ltoAAPipeline = args.getLastArgValue(OPT_lto_aa_pipeline);
config->ltoCSProfileGenerate = args.hasArg(OPT_lto_cs_profile_generate);		config->ltoCSProfileGenerate = args.hasArg(OPT_lto_cs_profile_generate);
config->ltoCSProfileFile = args.getLastArgValue(OPT_lto_cs_profile_file);		config->ltoCSProfileFile = args.getLastArgValue(OPT_lto_cs_profile_file);
config->ltoDebugPassManager = args.hasArg(OPT_lto_debug_pass_manager);		config->ltoDebugPassManager = args.hasArg(OPT_lto_debug_pass_manager);
config->ltoNewPassManager = args.hasArg(OPT_lto_new_pass_manager);		config->ltoNewPassManager = args.hasArg(OPT_lto_new_pass_manager);
config->ltoNewPmPasses = args.getLastArgValue(OPT_lto_newpm_passes);		config->ltoNewPmPasses = args.getLastArgValue(OPT_lto_newpm_passes);
config->ltoo = args::getInteger(args, OPT_lto_O, 2);		config->ltoo = check(llvm::lto::getOptLevel(args.getLastArg(OPT_lto_O), 2));
		config->ltoos = check(llvm::lto::getSizeLevel(args.getLastArg(OPT_lto_O), 0));
config->ltoObjPath = args.getLastArgValue(OPT_lto_obj_path_eq);		config->ltoObjPath = args.getLastArgValue(OPT_lto_obj_path_eq);
config->ltoPartitions = args::getInteger(args, OPT_lto_partitions, 1);		config->ltoPartitions = args::getInteger(args, OPT_lto_partitions, 1);
config->ltoSampleProfile = args.getLastArgValue(OPT_lto_sample_profile);		config->ltoSampleProfile = args.getLastArgValue(OPT_lto_sample_profile);
config->mapFile = args.getLastArgValue(OPT_Map);		config->mapFile = args.getLastArgValue(OPT_Map);
config->mipsGotSize = args::getInteger(args, OPT_mips_got_size, 0xfff0);		config->mipsGotSize = args::getInteger(args, OPT_mips_got_size, 0xfff0);
config->mergeArmExidx =		config->mergeArmExidx =
args.hasFlag(OPT_merge_exidx_entries, OPT_no_merge_exidx_entries, true);		args.hasFlag(OPT_merge_exidx_entries, OPT_no_merge_exidx_entries, true);
config->mmapOutputFile =		config->mmapOutputFile =
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	static void readConfigs(opt::InputArgList &args) {
if (auto *arg = args.getLastArg(OPT_plugin_opt_mcpu_eq))		if (auto *arg = args.getLastArg(OPT_plugin_opt_mcpu_eq))
parseClangOption(saver.save("-mcpu=" + StringRef(arg->getValue())),		parseClangOption(saver.save("-mcpu=" + StringRef(arg->getValue())),
arg->getSpelling());		arg->getSpelling());

for (auto *arg : args.filtered(OPT_plugin_opt))		for (auto *arg : args.filtered(OPT_plugin_opt))
parseClangOption(arg->getValue(), arg->getSpelling());		parseClangOption(arg->getValue(), arg->getSpelling());

// Parse -mllvm options.		// Parse -mllvm options.
for (auto *arg : args.filtered(OPT_mllvm))		for (auto *arg : args.filtered(OPT_mllvm))
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'arg' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'arg' [readability-identifier-naming]
parseClangOption(arg->getValue(), arg->getSpelling());		parseClangOption(arg->getValue(), arg->getSpelling());

if (config->ltoo > 3)
error("invalid optimization level for LTO: " + Twine(config->ltoo));
if (config->ltoPartitions == 0)		if (config->ltoPartitions == 0)
error("--lto-partitions: number of threads must be > 0");		error("--lto-partitions: number of threads must be > 0");
if (config->thinLTOJobs == 0)		if (config->thinLTOJobs == 0)
error("--thinlto-jobs: number of threads must be > 0");		error("--thinlto-jobs: number of threads must be > 0");

if (config->splitStackAdjustSize < 0)		if (config->splitStackAdjustSize < 0)
error("--split-stack-adjust-size: size must be >= 0");		error("--split-stack-adjust-size: size must be >= 0");

▲ Show 20 Lines • Show All 996 Lines • Show Last 20 Lines

lld/ELF/LTO.cpp

Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	else if (config->isPic)
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
else		else
c.RelocModel = Reloc::Static;		c.RelocModel = Reloc::Static;

c.CodeModel = getCodeModelFromCMModel();		c.CodeModel = getCodeModelFromCMModel();
c.DisableVerify = config->disableVerify;		c.DisableVerify = config->disableVerify;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = config->ltoo;
		c.SizeLevel = config->ltoos;
c.CPU = getCPUStr();		c.CPU = getCPUStr();
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);

// Set up a custom pipeline if we've been asked to.		// Set up a custom pipeline if we've been asked to.
c.OptPipeline = config->ltoNewPmPasses;		c.OptPipeline = config->ltoNewPmPasses;
c.AAPipeline = config->ltoAAPipeline;		c.AAPipeline = config->ltoAAPipeline;

▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

lld/test/COFF/lto-opt-level.ll

	; REQUIRES: x86			; REQUIRES: x86
	; RUN: llvm-as -o %t.obj %s			; RUN: llvm-as -o %t.obj %s
	; RUN: lld-link /out:%t0.exe /entry:main /subsystem:console /opt:lldlto=0 /lldmap:%t0.map %t.obj			; RUN: lld-link /out:%t0.exe /entry:main /subsystem:console /opt:lldlto=0 /lldmap:%t0.map %t.obj
	; RUN: FileCheck --check-prefix=CHECK-O0 %s < %t0.map			; RUN: FileCheck --check-prefix=CHECK-O0 %s < %t0.map
	; RUN: lld-link /out:%t2.exe /entry:main /subsystem:console /opt:lldlto=2 /lldmap:%t2.map %t.obj			; RUN: lld-link /out:%t2.exe /entry:main /subsystem:console /opt:lldlto=2 /lldmap:%t2.map %t.obj
	; RUN: FileCheck --check-prefix=CHECK-O2 %s < %t2.map			; RUN: FileCheck --check-prefix=CHECK-O2 %s < %t2.map
				; RUN: lld-link /out:%ts.exe /entry:main /subsystem:console /opt:lldlto=s /lldmap:%ts.map %t.obj
				; RUN: FileCheck --check-prefix=CHECK-O2 %s < %ts.map
				; RUN: lld-link /out:%tz.exe /entry:main /subsystem:console /opt:lldlto=z /lldmap:%tz.map %t.obj
				; RUN: FileCheck --check-prefix=CHECK-O2 %s < %tz.map
	; RUN: lld-link /out:%t2a.exe /entry:main /subsystem:console /lldmap:%t2a.map %t.obj			; RUN: lld-link /out:%t2a.exe /entry:main /subsystem:console /lldmap:%t2a.map %t.obj
	; RUN: FileCheck --check-prefix=CHECK-O2 %s < %t2a.map			; RUN: FileCheck --check-prefix=CHECK-O2 %s < %t2a.map

	target datalayout = "e-m:w-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:w-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-pc-windows-msvc"			target triple = "x86_64-pc-windows-msvc"

	; CHECK-O0: foo			; CHECK-O0: foo
	; CHECK-O2-NOT: foo			; CHECK-O2-NOT: foo
	define internal void @foo() {			define internal void @foo() {
	ret void			ret void
	}			}

	define void @main() {			define void @main() {
	call void @foo()			call void @foo()
	ret void			ret void
	}			}

lld/test/ELF/lto/opt-level.ll

	; REQUIRES: x86			; REQUIRES: x86
	; RUN: llvm-as -o %t.o %s			; RUN: llvm-as -o %t.o %s
	; RUN: ld.lld -o %t0 -e main --lto-O0 %t.o			; RUN: ld.lld -o %t0 -e main --lto-O0 %t.o
	; RUN: llvm-nm %t0 \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: llvm-nm %t0 \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: ld.lld -o %t0 -e main --plugin-opt=O0 %t.o			; RUN: ld.lld -o %t0 -e main --plugin-opt=O0 %t.o
	; RUN: llvm-nm %t0 \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: llvm-nm %t0 \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: ld.lld -o %t2 -e main --lto-O2 %t.o			; RUN: ld.lld -o %t2 -e main --lto-O2 %t.o
	; RUN: llvm-nm %t2 \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-nm %t2 \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: ld.lld -o %ts -e main --lto-Os %t.o
				; RUN: llvm-nm %ts \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: ld.lld -o %tz -e main --lto-Oz %t.o
				; RUN: llvm-nm %tz \| FileCheck --check-prefix=CHECK-O2 %s
	; RUN: ld.lld -o %t2a -e main %t.o			; RUN: ld.lld -o %t2a -e main %t.o
	; RUN: llvm-nm %t2a \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-nm %t2a \| FileCheck --check-prefix=CHECK-O2 %s
	; RUN: ld.lld -o %t2 -e main %t.o --plugin-opt O2			; RUN: ld.lld -o %t2 -e main %t.o --plugin-opt O2
	; RUN: llvm-nm %t2 \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-nm %t2 \| FileCheck --check-prefix=CHECK-O2 %s

	; Reject invalid optimization levels.			; Reject invalid optimization levels.
	; RUN: not ld.lld -o %t3 -e main --lto-O6 %t.o 2>&1 \| \			; RUN: not ld.lld -o %t3 -e main --lto-O6 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALID1 %s			; RUN: FileCheck --check-prefix=INVALID1 %s
	; INVALID1: invalid optimization level for LTO: 6			; INVALID1: invalid optimization level for LTO: 6
	; RUN: not ld.lld -o %t3 -e main --plugin-opt=O6 %t.o 2>&1 \| \			; RUN: not ld.lld -o %t3 -e main --plugin-opt=O6 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALID1 %s			; RUN: FileCheck --check-prefix=INVALID1 %s
	; RUN: not ld.lld -o %t3 -e main --plugin-opt=Ofoo %t.o 2>&1 \| \			; RUN: not ld.lld -o %t3 -e main --plugin-opt=Ofoo %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALID2 %s			; RUN: FileCheck --check-prefix=INVALID2 %s
	; INVALID2: --plugin-opt=Ofoo: number expected, but got 'foo'			; INVALID2: invalid optimization level for LTO: foo

	; RUN: not ld.lld -o %t3 -e main --lto-O-1 %t.o 2>&1 \| \			; RUN: not ld.lld -o %t3 -e main --lto-O-1 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALIDNEGATIVE1 %s			; RUN: FileCheck --check-prefix=INVALIDNEGATIVE1 %s
	; INVALIDNEGATIVE1: invalid optimization level for LTO: 4294967295			; INVALIDNEGATIVE1: invalid optimization level for LTO: -1
	; RUN: not ld.lld -o %t3 -e main --plugin-opt=O-1 %t.o 2>&1 \| \			; RUN: not ld.lld -o %t3 -e main --plugin-opt=O-1 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALIDNEGATIVE2 %s			; RUN: FileCheck --check-prefix=INVALIDNEGATIVE2 %s
	; INVALIDNEGATIVE2: invalid optimization level for LTO: 4294967295			; INVALIDNEGATIVE2: invalid optimization level for LTO: -1

	target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK-O0: foo			; CHECK-O0: foo
	; CHECK-O2-NOT: foo			; CHECK-O2-NOT: foo
	define internal void @foo() {			define internal void @foo() {
	ret void			ret void
	}			}

	define void @main() {			define void @main() {
	call void @foo()			call void @foo()
	ret void			ret void
	}			}

lld/test/wasm/lto/opt-level.ll

	; RUN: llvm-as -o %t.o %s			; RUN: llvm-as -o %t.o %s
	; RUN: wasm-ld -o %t0 -e main --lto-O0 %t.o			; RUN: wasm-ld -o %t0 -e main --lto-O0 %t.o
	; RUN: obj2yaml %t0 \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: obj2yaml %t0 \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: wasm-ld -o %t2 -e main --lto-O2 %t.o			; RUN: wasm-ld -o %t2 -e main --lto-O2 %t.o
	; RUN: obj2yaml %t2 \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: obj2yaml %t2 \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: wasm-ld -o %ts -e main --lto-Os %t.o
				; RUN: obj2yaml %ts \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: wasm-ld -o %tz -e main --lto-Oz %t.o
				; RUN: obj2yaml %tz \| FileCheck --check-prefix=CHECK-O2 %s
	; RUN: wasm-ld -o %t2a -e main %t.o			; RUN: wasm-ld -o %t2a -e main %t.o
	; RUN: obj2yaml %t2a \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: obj2yaml %t2a \| FileCheck --check-prefix=CHECK-O2 %s

	; Reject invalid optimization levels.			; Reject invalid optimization levels.
	; RUN: not wasm-ld -o %t3 -e main --lto-O6 %t.o 2>&1 \| \			; RUN: not wasm-ld -o %t3 -e main --lto-O6 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALID %s			; RUN: FileCheck --check-prefix=INVALID %s
	; INVALID: invalid optimization level for LTO: 6			; INVALID: invalid optimization level for LTO: 6

	; RUN: not wasm-ld -o %t3 -m elf_x86_64 -e main --lto-O-1 %t.o 2>&1 \| \			; RUN: not wasm-ld -o %t3 -m elf_x86_64 -e main --lto-O-1 %t.o 2>&1 \| \
	; RUN: FileCheck --check-prefix=INVALIDNEGATIVE %s			; RUN: FileCheck --check-prefix=INVALIDNEGATIVE %s
	; INVALIDNEGATIVE: invalid optimization level for LTO: 4294967295			; INVALIDNEGATIVE: invalid optimization level for LTO: -1

	target datalayout = "e-m:e-p:32:32-i64:64-n32:64-S128"			target datalayout = "e-m:e-p:32:32-i64:64-n32:64-S128"
	target triple = "wasm32-unknown-unknown-wasm"			target triple = "wasm32-unknown-unknown-wasm"

	; CHECK-O0: Name: foo			; CHECK-O0: Name: foo
	; CHECK-O2-NOT: Name: foo			; CHECK-O2-NOT: Name: foo
	define internal void @foo() {			define internal void @foo() {
	ret void			ret void
	}			}

	define void @main() {			define void @main() {
	call void @foo()			call void @foo()
	ret void			ret void
	}			}

lld/wasm/Config.h

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	struct Configuration {
bool shared;		bool shared;
bool stripAll;		bool stripAll;
bool stripDebug;		bool stripDebug;
bool stackFirst;		bool stackFirst;
bool trace;		bool trace;
uint32_t globalBase;		uint32_t globalBase;
uint32_t initialMemory;		uint32_t initialMemory;
uint32_t maxMemory;		uint32_t maxMemory;
uint32_t zStackSize;		uint32_t zStackSize;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'zStackSize' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'zStackSize' [readability-identifier-naming]
unsigned ltoPartitions;		unsigned ltoPartitions;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoPartitions' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoPartitions' [readability-identifier…
unsigned ltoo;		unsigned ltoo;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoo' [readability-identifier-naming]
		unsigned ltoos;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'ltoos' [readability-identifier-naming]
unsigned optimize;		unsigned optimize;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'optimize' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'optimize' [readability-identifier-naming]
unsigned thinLTOJobs;		unsigned thinLTOJobs;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'thinLTOJobs' [readability-identifier-naming]

llvm::StringRef entry;		llvm::StringRef entry;
llvm::StringRef outputFile;		llvm::StringRef outputFile;
llvm::StringRef thinLTOCacheDir;		llvm::StringRef thinLTOCacheDir;

llvm::StringSet<> allowUndefinedSymbols;		llvm::StringSet<> allowUndefinedSymbols;
llvm::StringSet<> exportedSymbols;		llvm::StringSet<> exportedSymbols;
std::vector<llvm::StringRef> searchPaths;		std::vector<llvm::StringRef> searchPaths;
Show All 23 Lines

lld/wasm/Driver.cpp

Show First 20 Lines • Show All 315 Lines • ▼ Show 20 Lines	static void readConfigs(opt::InputArgList &args) {
config->exportAll = args.hasArg(OPT_export_all);		config->exportAll = args.hasArg(OPT_export_all);
config->exportTable = args.hasArg(OPT_export_table);		config->exportTable = args.hasArg(OPT_export_table);
config->growableTable = args.hasArg(OPT_growable_table);		config->growableTable = args.hasArg(OPT_growable_table);
errorHandler().fatalWarnings =		errorHandler().fatalWarnings =
args.hasFlag(OPT_fatal_warnings, OPT_no_fatal_warnings, false);		args.hasFlag(OPT_fatal_warnings, OPT_no_fatal_warnings, false);
config->importMemory = args.hasArg(OPT_import_memory);		config->importMemory = args.hasArg(OPT_import_memory);
config->sharedMemory = args.hasArg(OPT_shared_memory);		config->sharedMemory = args.hasArg(OPT_shared_memory);
config->importTable = args.hasArg(OPT_import_table);		config->importTable = args.hasArg(OPT_import_table);
config->ltoo = args::getInteger(args, OPT_lto_O, 2);		config->ltoo = check(llvm::lto::getOptLevel(args.getLastArg(OPT_lto_O), 2));
		config->ltoos = check(llvm::lto::getSizeLevel(args.getLastArg(OPT_lto_O), 0));
config->ltoPartitions = args::getInteger(args, OPT_lto_partitions, 1);		config->ltoPartitions = args::getInteger(args, OPT_lto_partitions, 1);
config->optimize = args::getInteger(args, OPT_O, 0);		config->optimize = args::getInteger(args, OPT_O, 0);
config->outputFile = args.getLastArgValue(OPT_o);		config->outputFile = args.getLastArgValue(OPT_o);
config->relocatable = args.hasArg(OPT_relocatable);		config->relocatable = args.hasArg(OPT_relocatable);
config->gcSections =		config->gcSections =
args.hasFlag(OPT_gc_sections, OPT_no_gc_sections, !config->relocatable);		args.hasFlag(OPT_gc_sections, OPT_no_gc_sections, !config->relocatable);
config->mergeDataSegments =		config->mergeDataSegments =
args.hasFlag(OPT_merge_data_segments, OPT_no_merge_data_segments,		args.hasFlag(OPT_merge_data_segments, OPT_no_merge_data_segments,
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines

// Some command line options or some combinations of them are not allowed.		// Some command line options or some combinations of them are not allowed.
// This function checks for such errors.		// This function checks for such errors.
static void checkOptions(opt::InputArgList &args) {		static void checkOptions(opt::InputArgList &args) {
if (!config->stripDebug && !config->stripAll && config->compressRelocations)		if (!config->stripDebug && !config->stripAll && config->compressRelocations)
error("--compress-relocations is incompatible with output debug"		error("--compress-relocations is incompatible with output debug"
" information. Please pass --strip-debug or --strip-all");		" information. Please pass --strip-debug or --strip-all");

if (config->ltoo > 3)
error("invalid optimization level for LTO: " + Twine(config->ltoo));
if (config->ltoPartitions == 0)		if (config->ltoPartitions == 0)
error("--lto-partitions: number of threads must be > 0");		error("--lto-partitions: number of threads must be > 0");
if (config->thinLTOJobs == 0)		if (config->thinLTOJobs == 0)
error("--thinlto-jobs: number of threads must be > 0");		error("--thinlto-jobs: number of threads must be > 0");

if (config->pie && config->shared)		if (config->pie && config->shared)
error("-shared and -pie may not be used together");		error("-shared and -pie may not be used together");

▲ Show 20 Lines • Show All 387 Lines • Show Last 20 Lines

lld/wasm/LTO.cpp

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	static std::unique_ptr<lto::LTO> createLTO() {

// Always emit a section per function/data with LTO.		// Always emit a section per function/data with LTO.
c.Options.FunctionSections = true;		c.Options.FunctionSections = true;
c.Options.DataSections = true;		c.Options.DataSections = true;

c.DisableVerify = config->disableVerify;		c.DisableVerify = config->disableVerify;
c.DiagHandler = diagnosticHandler;		c.DiagHandler = diagnosticHandler;
c.OptLevel = config->ltoo;		c.OptLevel = config->ltoo;
		c.SizeLevel = config->ltoos;
c.MAttrs = getMAttrs();		c.MAttrs = getMAttrs();
c.CGOptLevel = args::getCGOptLevel(config->ltoo);		c.CGOptLevel = args::getCGOptLevel(config->ltoo);

if (config->relocatable)		if (config->relocatable)
c.RelocModel = None;		c.RelocModel = None;
else if (config->isPic)		else if (config->isPic)
c.RelocModel = Reloc::PIC_;		c.RelocModel = Reloc::PIC_;
else		else
▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/Config.h

Show All 37 Lines	struct Config {
std::string CPU;		std::string CPU;
TargetOptions Options;		TargetOptions Options;
std::vector<std::string> MAttrs;		std::vector<std::string> MAttrs;
Optional<Reloc::Model> RelocModel = Reloc::PIC_;		Optional<Reloc::Model> RelocModel = Reloc::PIC_;
Optional<CodeModel::Model> CodeModel = None;		Optional<CodeModel::Model> CodeModel = None;
CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;		CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;
CodeGenFileType CGFileType = CGFT_ObjectFile;		CodeGenFileType CGFileType = CGFT_ObjectFile;
unsigned OptLevel = 2;		unsigned OptLevel = 2;
		unsigned SizeLevel = 0;
bool DisableVerify = false;		bool DisableVerify = false;

/// Use the new pass manager		/// Use the new pass manager
bool UseNewPM = false;		bool UseNewPM = false;

/// Flag to indicate that the optimizer should not assume builtins are present		/// Flag to indicate that the optimizer should not assume builtins are present
/// on the target.		/// on the target.
bool Freestanding = false;		bool Freestanding = false;
▲ Show 20 Lines • Show All 184 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/LTO.h

Show All 32 Lines
namespace llvm {		namespace llvm {

class BitcodeModule;		class BitcodeModule;
class Error;		class Error;
class LLVMContext;		class LLVMContext;
class MemoryBufferRef;		class MemoryBufferRef;
class Module;		class Module;
class Target;		class Target;
class raw_pwrite_stream;		class raw_pwrite_stream;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for class 'raw_pwrite_stream' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for class 'raw_pwrite_stream' [readability-identifier…
		namespace opt {
		class Arg;
		}
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: namespace 'opt' not terminated with a closing comment [llvm-namespace-comment] Lint: Pre-merge checks: clang-tidy: warning: namespace 'opt' not terminated with a closing comment [llvm-namespace…

/// Resolve linkage for prevailing symbols in the \p Index. Linkage changes		/// Resolve linkage for prevailing symbols in the \p Index. Linkage changes
/// recorded in the index and the ThinLTO backends must apply the changes to		/// recorded in the index and the ThinLTO backends must apply the changes to
/// the module via thinLTOResolvePrevailingInModule.		/// the module via thinLTOResolvePrevailingInModule.
///		///
/// This is done for correctness (if value exported, ensure we always		/// This is done for correctness (if value exported, ensure we always
/// emit a copy), and compile-time optimization (allow drop of duplicates).		/// emit a copy), and compile-time optimization (allow drop of duplicates).
void thinLTOResolvePrevailingInIndex(		void thinLTOResolvePrevailingInIndex(
▲ Show 20 Lines • Show All 395 Lines • ▼ Show 20 Lines	struct SymbolResolution {
/// The definition of this symbol is visible outside of the LTO unit.		/// The definition of this symbol is visible outside of the LTO unit.
unsigned VisibleToRegularObj : 1;		unsigned VisibleToRegularObj : 1;

/// Linker redefined version of the symbol which appeared in -wrap or -defsym		/// Linker redefined version of the symbol which appeared in -wrap or -defsym
/// linker option.		/// linker option.
unsigned LinkerRedefined : 1;		unsigned LinkerRedefined : 1;
};		};

		class LTOOLevelError : public ErrorInfo<LTOOLevelError> {
		public:
		static char ID;
		StringRef Level;

		LTOOLevelError(StringRef Level) : Level(Level) {}

		void log(raw_ostream &OS) const override {
		OS << "invalid optimization level for LTO: " << Level;
		}

		std::error_code convertToErrorCode() const override {
		return llvm::inconvertibleErrorCode();
		}
		};

		Expected<unsigned> getOptLevel(StringRef OArg);
		Expected<unsigned> getOptLevel(llvm::opt::Arg *OArg, unsigned Default);
		Expected<unsigned> getSizeLevel(StringRef OArg);
		Expected<unsigned> getSizeLevel(llvm::opt::Arg *OArg, unsigned Default);

} // namespace lto		} // namespace lto
} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/lib/LTO/LTO.cpp

Show All 23 Lines
#include "llvm/IR/LegacyPassManager.h"		#include "llvm/IR/LegacyPassManager.h"
#include "llvm/IR/Mangler.h"		#include "llvm/IR/Mangler.h"
#include "llvm/IR/Metadata.h"		#include "llvm/IR/Metadata.h"
#include "llvm/IR/RemarkStreamer.h"		#include "llvm/IR/RemarkStreamer.h"
#include "llvm/LTO/LTOBackend.h"		#include "llvm/LTO/LTOBackend.h"
#include "llvm/LTO/SummaryBasedOptimizations.h"		#include "llvm/LTO/SummaryBasedOptimizations.h"
#include "llvm/Linker/IRMover.h"		#include "llvm/Linker/IRMover.h"
#include "llvm/Object/IRObjectFile.h"		#include "llvm/Object/IRObjectFile.h"
		#include "llvm/Option/Arg.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/SHA1.h"		#include "llvm/Support/SHA1.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/TargetRegistry.h"		#include "llvm/Support/TargetRegistry.h"
▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	else
AddUnsigned(-1);		AddUnsigned(-1);
if (Conf.CodeModel)		if (Conf.CodeModel)
AddUnsigned(*Conf.CodeModel);		AddUnsigned(*Conf.CodeModel);
else		else
AddUnsigned(-1);		AddUnsigned(-1);
AddUnsigned(Conf.CGOptLevel);		AddUnsigned(Conf.CGOptLevel);
AddUnsigned(Conf.CGFileType);		AddUnsigned(Conf.CGFileType);
AddUnsigned(Conf.OptLevel);		AddUnsigned(Conf.OptLevel);
		AddUnsigned(Conf.SizeLevel);
AddUnsigned(Conf.UseNewPM);		AddUnsigned(Conf.UseNewPM);
AddUnsigned(Conf.Freestanding);		AddUnsigned(Conf.Freestanding);
AddString(Conf.OptPipeline);		AddString(Conf.OptPipeline);
AddString(Conf.AAPipeline);		AddString(Conf.AAPipeline);
AddString(Conf.OverrideTriple);		AddString(Conf.OverrideTriple);
AddString(Conf.DefaultTriple);		AddString(Conf.DefaultTriple);
AddString(Conf.DwoDir);		AddString(Conf.DwoDir);

▲ Show 20 Lines • Show All 1,263 Lines • ▼ Show 20 Lines	lto::setupStatsFile(StringRef StatsFilename) {
auto StatsFile =		auto StatsFile =
std::make_unique<ToolOutputFile>(StatsFilename, EC, sys::fs::OF_None);		std::make_unique<ToolOutputFile>(StatsFilename, EC, sys::fs::OF_None);
if (EC)		if (EC)
return errorCodeToError(EC);		return errorCodeToError(EC);

StatsFile->keep();		StatsFile->keep();
return std::move(StatsFile);		return std::move(StatsFile);
}		}

		char LTOOLevelError::ID;

		Expected<unsigned> lto::getOptLevel(StringRef S) {
		if (S == "s" \|\| S == "z" \|\| S.empty())
		return 2;

		if (S == "g")
		return 1;

		int Res;
		if (S.getAsInteger(10, Res) \|\| Res < 0 \|\| Res > 3)
		return make_error<LTOOLevelError>(S);
		return Res;
		}

		Expected<unsigned> lto::getOptLevel(opt::Arg *OArg, unsigned Default) {
		if (!OArg)
		return Default;
		return lto::getOptLevel(OArg->getValue());
		}

		Expected<unsigned> lto::getSizeLevel(StringRef S) {
		if (S[0] == 's')
		return 1;
		if (S[0] == 'z')
		return 2;
		if (S[0] >= '0' \|\| S[0] <= '3' )
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: logical expression is always true [misc-redundant-expression] clang-format: please reformat the code - if (S[0] >= '0' \|\| S[0] <= '3' ) + if (S[0] >= '0' \|\| S[0] <= '3') Lint: Pre-merge checks: clang-tidy: warning: logical expression is always true [misc-redundant-expression] clang-format…
		return 0;
		return make_error<LTOOLevelError>(S);
		}

		Expected<unsigned> lto::getSizeLevel(opt::Arg *OArg, unsigned Default) {
		if (!OArg)
		return Default;
		return lto::getSizeLevel(OArg->getValue());
		}

llvm/lib/LTO/LTOBackend.cpp

Show First 20 Lines • Show All 144 Lines • ▼ Show 20 Lines	createTargetMachine(Config &Conf, const Target *TheTarget, Module &M) {
else		else
CodeModel = M.getCodeModel();		CodeModel = M.getCodeModel();

return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(		return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(
TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,		TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,
CodeModel, Conf.CGOptLevel));		CodeModel, Conf.CGOptLevel));
}		}

		static PassBuilder::OptimizationLevel mapToLevel(const Config &Conf) {
		switch (Conf.OptLevel) {
		default:
		llvm_unreachable("Invalid optimization level!");

		case 0:
		return PassBuilder::O0;

		case 1:
		return PassBuilder::O1;

		case 2:
		switch (Conf.SizeLevel) {
		default:
		llvm_unreachable("Invalid optimization level for size!");

		case 0:
		return PassBuilder::O2;

		case 1:
		return PassBuilder::Os;

		case 2:
		return PassBuilder::Oz;
		}

		case 3:
		return PassBuilder::O3;
		}
		}

static void runNewPMPasses(Config &Conf, Module &Mod, TargetMachine *TM,		static void runNewPMPasses(Config &Conf, Module &Mod, TargetMachine *TM,
unsigned OptLevel, bool IsThinLTO,		bool IsThinLTO, ModuleSummaryIndex *ExportSummary,
ModuleSummaryIndex *ExportSummary,
const ModuleSummaryIndex *ImportSummary) {		const ModuleSummaryIndex *ImportSummary) {
Optional<PGOOptions> PGOOpt;		Optional<PGOOptions> PGOOpt;
if (!Conf.SampleProfile.empty())		if (!Conf.SampleProfile.empty())
PGOOpt = PGOOptions(Conf.SampleProfile, "", Conf.ProfileRemapping,		PGOOpt = PGOOptions(Conf.SampleProfile, "", Conf.ProfileRemapping,
PGOOptions::SampleUse, PGOOptions::NoCSAction, true);		PGOOptions::SampleUse, PGOOptions::NoCSAction, true);
else if (Conf.RunCSIRInstr) {		else if (Conf.RunCSIRInstr) {
PGOOpt = PGOOptions("", Conf.CSIRProfile, Conf.ProfileRemapping,		PGOOpt = PGOOptions("", Conf.CSIRProfile, Conf.ProfileRemapping,
PGOOptions::IRUse, PGOOptions::CSIRInstr);		PGOOptions::IRUse, PGOOptions::CSIRInstr);
Show All 25 Lines	static void runNewPMPasses(Config &Conf, Module &Mod, TargetMachine *TM,
PB.registerCGSCCAnalyses(CGAM);		PB.registerCGSCCAnalyses(CGAM);
PB.registerFunctionAnalyses(FAM);		PB.registerFunctionAnalyses(FAM);
PB.registerLoopAnalyses(LAM);		PB.registerLoopAnalyses(LAM);
PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);		PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);

ModulePassManager MPM(Conf.DebugPassManager);		ModulePassManager MPM(Conf.DebugPassManager);
// FIXME (davide): verify the input.		// FIXME (davide): verify the input.

PassBuilder::OptimizationLevel OL;		PassBuilder::OptimizationLevel OL = mapToLevel(Conf);

switch (OptLevel) {
default:
llvm_unreachable("Invalid optimization level");
case 0:
OL = PassBuilder::O0;
break;
case 1:
OL = PassBuilder::O1;
break;
case 2:
OL = PassBuilder::O2;
break;
case 3:
OL = PassBuilder::O3;
break;
}

if (IsThinLTO)		if (IsThinLTO)
MPM = PB.buildThinLTODefaultPipeline(OL, Conf.DebugPassManager,		MPM = PB.buildThinLTODefaultPipeline(OL, Conf.DebugPassManager,
ImportSummary);		ImportSummary);
else		else
MPM = PB.buildLTODefaultPipeline(OL, Conf.DebugPassManager, ExportSummary);		MPM = PB.buildLTODefaultPipeline(OL, Conf.DebugPassManager, ExportSummary);
MPM.run(Mod, MAM);		MPM.run(Mod, MAM);

▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	static void runOldPMPasses(Config &Conf, Module &Mod, TargetMachine *TM,
PMB.ImportSummary = ImportSummary;		PMB.ImportSummary = ImportSummary;
// Unconditionally verify input since it is not verified before this		// Unconditionally verify input since it is not verified before this
// point and has unknown origin.		// point and has unknown origin.
PMB.VerifyInput = true;		PMB.VerifyInput = true;
PMB.VerifyOutput = !Conf.DisableVerify;		PMB.VerifyOutput = !Conf.DisableVerify;
PMB.LoopVectorize = true;		PMB.LoopVectorize = true;
PMB.SLPVectorize = true;		PMB.SLPVectorize = true;
PMB.OptLevel = Conf.OptLevel;		PMB.OptLevel = Conf.OptLevel;
		PMB.SizeLevel = Conf.SizeLevel;
PMB.PGOSampleUse = Conf.SampleProfile;		PMB.PGOSampleUse = Conf.SampleProfile;
PMB.EnablePGOCSInstrGen = Conf.RunCSIRInstr;		PMB.EnablePGOCSInstrGen = Conf.RunCSIRInstr;
if (!Conf.RunCSIRInstr && !Conf.CSIRProfile.empty()) {		if (!Conf.RunCSIRInstr && !Conf.CSIRProfile.empty()) {
PMB.EnablePGOCSInstrUse = true;		PMB.EnablePGOCSInstrUse = true;
PMB.PGOInstrUse = Conf.CSIRProfile;		PMB.PGOInstrUse = Conf.CSIRProfile;
}		}
if (IsThinLTO)		if (IsThinLTO)
PMB.populateThinLTOPassManager(passes);		PMB.populateThinLTOPassManager(passes);
else		else
PMB.populateLTOPassManager(passes);		PMB.populateLTOPassManager(passes);
passes.run(Mod);		passes.run(Mod);
}		}

bool opt(Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,		bool optimize(Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,
bool IsThinLTO, ModuleSummaryIndex *ExportSummary,		bool IsThinLTO, ModuleSummaryIndex *ExportSummary,
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - bool IsThinLTO, ModuleSummaryIndex ExportSummary, - const ModuleSummaryIndex ImportSummary) { + bool IsThinLTO, ModuleSummaryIndex ExportSummary, + const ModuleSummaryIndex ImportSummary) { Lint: Pre-merge checks: clang-format: please reformat the code ``` - bool IsThinLTO, ModuleSummaryIndex…
const ModuleSummaryIndex *ImportSummary) {		const ModuleSummaryIndex *ImportSummary) {
// FIXME: Plumb the combined index into the new pass manager.		// FIXME: Plumb the combined index into the new pass manager.
if (!Conf.OptPipeline.empty())		if (!Conf.OptPipeline.empty())
runNewPMCustomPasses(Mod, TM, Conf.OptPipeline, Conf.AAPipeline,		runNewPMCustomPasses(Mod, TM, Conf.OptPipeline, Conf.AAPipeline,
Conf.DisableVerify);		Conf.DisableVerify);
else if (Conf.UseNewPM)		else {
runNewPMPasses(Conf, Mod, TM, Conf.OptLevel, IsThinLTO, ExportSummary,		auto runPasses = Conf.UseNewPM ? runNewPMPasses : runOldPMPasses;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'runPasses' [readability-identifier-naming] Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'runPasses' [readability-identifier-naming]
ImportSummary);		runPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);
else		}
runOldPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);
return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);		return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);
}		}

void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,		void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,
unsigned Task, Module &Mod) {		unsigned Task, Module &Mod) {
if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))		if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))
return;		return;

▲ Show 20 Lines • Show All 118 Lines • ▼ Show 20 Lines	Error lto::backend(Config &C, AddStreamFn AddStream,
auto DiagFileOrErr = lto::setupOptimizationRemarks(		auto DiagFileOrErr = lto::setupOptimizationRemarks(
Mod->getContext(), C.RemarksFilename, C.RemarksPasses, C.RemarksFormat,		Mod->getContext(), C.RemarksFilename, C.RemarksPasses, C.RemarksFormat,
C.RemarksWithHotness);		C.RemarksWithHotness);
if (!DiagFileOrErr)		if (!DiagFileOrErr)
return DiagFileOrErr.takeError();		return DiagFileOrErr.takeError();
auto DiagnosticOutputFile = std::move(*DiagFileOrErr);		auto DiagnosticOutputFile = std::move(*DiagFileOrErr);

if (!C.CodeGenOnly) {		if (!C.CodeGenOnly) {
if (!opt(C, TM.get(), 0, Mod, /IsThinLTO=*/false,		if (!optimize(C, TM.get(), 0, Mod, /IsThinLTO=*/false,
/ExportSummary=/&CombinedIndex, /ImportSummary=/nullptr))		/ExportSummary=/&CombinedIndex, /ImportSummary=/nullptr))
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /ExportSummary=/&CombinedIndex, /ImportSummary=/nullptr)) + /ExportSummary=/&CombinedIndex, /ImportSummary=/nullptr)) Lint: Pre-merge checks: clang-format: please reformat the code ``` - /ExportSummary=/&CombinedIndex…
return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));		return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));
}		}

if (ParallelCodeGenParallelismLevel == 1) {		if (ParallelCodeGenParallelismLevel == 1) {
codegen(C, TM.get(), AddStream, 0, *Mod);		codegen(C, TM.get(), AddStream, 0, *Mod);
} else {		} else {
splitCodeGen(C, TM.get(), AddStream, ParallelCodeGenParallelismLevel,		splitCodeGen(C, TM.get(), AddStream, ParallelCodeGenParallelismLevel,
std::move(Mod));		std::move(Mod));
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	Error lto::thinBackend(Config &Conf, unsigned Task, AddStreamFn AddStream,

FunctionImporter Importer(CombinedIndex, ModuleLoader);		FunctionImporter Importer(CombinedIndex, ModuleLoader);
if (Error Err = Importer.importFunctions(Mod, ImportList).takeError())		if (Error Err = Importer.importFunctions(Mod, ImportList).takeError())
return Err;		return Err;

if (Conf.PostImportModuleHook && !Conf.PostImportModuleHook(Task, Mod))		if (Conf.PostImportModuleHook && !Conf.PostImportModuleHook(Task, Mod))
return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));		return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));

if (!opt(Conf, TM.get(), Task, Mod, /IsThinLTO=/true,		if (!optimize(Conf, TM.get(), Task, Mod, /IsThinLTO=/true,
/ExportSummary=/nullptr, /ImportSummary=/&CombinedIndex))		/ExportSummary=/nullptr, /ImportSummary=/&CombinedIndex))
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /ExportSummary=/nullptr, /ImportSummary=/&CombinedIndex)) + /ExportSummary=/nullptr, /ImportSummary=/&CombinedIndex)) Lint: Pre-merge checks: clang-format: please reformat the code ``` - /ExportSummary=/nullptr…
return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));		return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));

codegen(Conf, TM.get(), AddStream, Task, Mod);		codegen(Conf, TM.get(), AddStream, Task, Mod);
return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));		return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));
}		}

llvm/test/tools/gold/X86/opt-level.ll

	; RUN: llvm-as -o %t.bc %s			; RUN: llvm-as -o %t.bc %s
	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 \			; RUN: -m elf_x86_64 \
	; RUN: -plugin-opt=O0 -r -o %t.o %t.bc			; RUN: -plugin-opt=O0 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 \			; RUN: -m elf_x86_64 \
	; RUN: -plugin-opt=O1 -r -o %t.o %t.bc			; RUN: -plugin-opt=O1 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O1 --check-prefix=CHECK-O1-OLDPM %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O1 --check-prefix=CHECK-O1-OLDPM %s
	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 \			; RUN: -m elf_x86_64 \
	; RUN: -plugin-opt=O2 -r -o %t.o %t.bc			; RUN: -plugin-opt=O2 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
				; RUN: -m elf_x86_64 \
				; RUN: -plugin-opt=Os -r -o %t.o %t.bc
				; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
				; RUN: -m elf_x86_64 \
				; RUN: -plugin-opt=Oz -r -o %t.o %t.bc
				; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s

	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \			; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \
	; RUN: -plugin-opt=O0 -r -o %t.o %t.bc			; RUN: -plugin-opt=O0 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \			; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \
	; RUN: -plugin-opt=O1 -r -o %t.o %t.bc			; RUN: -plugin-opt=O1 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O1 --check-prefix=CHECK-O1-NEWPM %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O1 --check-prefix=CHECK-O1-NEWPM %s
	; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \			; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
	; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \			; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \
	; RUN: -plugin-opt=O2 -r -o %t.o %t.bc			; RUN: -plugin-opt=O2 -r -o %t.o %t.bc
	; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
				; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \
				; RUN: -plugin-opt=Os -r -o %t.o %t.bc
				; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %gold -plugin %llvmshlibdir/LLVMgold%shlibext -plugin-opt=save-temps \
				; RUN: -m elf_x86_64 --plugin-opt=new-pass-manager \
				; RUN: -plugin-opt=Oz -r -o %t.o %t.bc
				; RUN: llvm-dis < %t.o.0.4.opt.bc -o - \| FileCheck --check-prefix=CHECK-O2 %s

	; CHECK-O0: define internal void @foo(			; CHECK-O0: define internal void @foo(
	; CHECK-O1: define internal void @foo(			; CHECK-O1: define internal void @foo(
	; CHECK-O2-NOT: define internal void @foo(			; CHECK-O2-NOT: define internal void @foo(

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	Show All 39 Lines

llvm/test/tools/lto/opt-level.ll

	; RUN: llvm-as %s -o %t.o			; RUN: llvm-as %s -o %t.o
	; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -O0 -o %t.dylib %t.o			; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -O0 -o %t.dylib %t.o
	; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O0 %s			; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O0 %s
	; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -O2 -o %t.dylib %t.o			; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -O2 -o %t.dylib %t.o
	; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O2 %s			; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -Os -o %t.dylib %t.o
				; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O2 %s
				; RUN: %ld64 -lto_library %llvmshlibdir/libLTO.dylib -arch x86_64 -dylib -mllvm -Oz -o %t.dylib %t.o
				; RUN: llvm-nm --no-llvm-bc %t.dylib \| FileCheck --check-prefix=CHECK-O2 %s

	target triple = "x86_64-apple-macosx10.8.0"			target triple = "x86_64-apple-macosx10.8.0"

	; CHECK-O0: t _f1			; CHECK-O0: t _f1
	; CHECK-O2-NOT: _f1			; CHECK-O2-NOT: _f1
	define internal void @f1() {			define internal void @f1() {
	ret void			ret void
	}			}

	; CHECK-O0: T _f2			; CHECK-O0: T _f2
	; CHECK-O2: T _f2			; CHECK-O2: T _f2
	define void @f2() {			define void @f2() {
	call void @f1()			call void @f1()
	ret void			ret void
	}			}

llvm/tools/gold/gold-plugin.cpp

Show All 25 Lines
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/TargetSelect.h"		#include "llvm/Support/TargetSelect.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <list>		#include <list>
#include <map>		#include <map>
#include <plugin-api.h>		#include <plugin-api.h>
		Lint: Pre-merge checks Inline Actions clang-tidy: error: 'plugin-api.h' file not found [clang-diagnostic-error] Lint: Pre-merge checks: clang-tidy: error: 'plugin-api.h' file not found [clang-diagnostic-error]
#include <string>		#include <string>
#include <system_error>		#include <system_error>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

// FIXME: remove this declaration when we stop maintaining Ubuntu Quantal and		// FIXME: remove this declaration when we stop maintaining Ubuntu Quantal and
// Precise and Debian Wheezy (binutils 2.23 is required)		// Precise and Debian Wheezy (binutils 2.23 is required)
#define LDPO_PIE 3		#define LDPO_PIE 3
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	enum OutputType {
OT_NORMAL,		OT_NORMAL,
OT_DISABLE,		OT_DISABLE,
OT_BC_ONLY,		OT_BC_ONLY,
OT_ASM_ONLY,		OT_ASM_ONLY,
OT_SAVE_TEMPS		OT_SAVE_TEMPS
};		};
static OutputType TheOutputType = OT_NORMAL;		static OutputType TheOutputType = OT_NORMAL;
static unsigned OptLevel = 2;		static unsigned OptLevel = 2;
		static unsigned SizeLevel = 0;
// Default parallelism of 0 used to indicate that user did not specify.		// Default parallelism of 0 used to indicate that user did not specify.
// Actual parallelism default value depends on implementation.		// Actual parallelism default value depends on implementation.
// Currently only affects ThinLTO, where the default is		// Currently only affects ThinLTO, where the default is
// llvm::heavyweight_hardware_concurrency.		// llvm::heavyweight_hardware_concurrency.
static unsigned Parallelism = 0;		static unsigned Parallelism = 0;
// Default regular LTO codegen parallelism (number of partitions).		// Default regular LTO codegen parallelism (number of partitions).
static unsigned ParallelCodeGenParallelismLevel = 1;		static unsigned ParallelCodeGenParallelismLevel = 1;
#ifdef NDEBUG		#ifdef NDEBUG
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines	if (opt.startswith("mcpu=")) {
if (thinlto_object_suffix_replace.find(';') == std::string::npos)		if (thinlto_object_suffix_replace.find(';') == std::string::npos)
message(LDPL_FATAL,		message(LDPL_FATAL,
"thinlto-object-suffix-replace expects 'old;new' format");		"thinlto-object-suffix-replace expects 'old;new' format");
} else if (opt.startswith("cache-dir=")) {		} else if (opt.startswith("cache-dir=")) {
cache_dir = opt.substr(strlen("cache-dir="));		cache_dir = opt.substr(strlen("cache-dir="));
} else if (opt.startswith("cache-policy=")) {		} else if (opt.startswith("cache-policy=")) {
cache_policy = opt.substr(strlen("cache-policy="));		cache_policy = opt.substr(strlen("cache-policy="));
} else if (opt.size() == 2 && opt[0] == 'O') {		} else if (opt.size() == 2 && opt[0] == 'O') {
if (opt[1] < '0' \|\| opt[1] > '3')		if (auto LevelOrErr = llvm::lto::getOptLevel(opt.substr(1, 1)))
message(LDPL_FATAL, "Optimization level must be between 0 and 3");		OptLevel = *LevelOrErr;
OptLevel = opt[1] - '0';		else
		message(LDPL_FATAL, toString(LevelOrErr.takeError()).c_str());

		if (auto LevelOrErr = llvm::lto::getSizeLevel(opt.substr(1, 1)))
		SizeLevel = *LevelOrErr;
		else
		message(LDPL_FATAL, toString(LevelOrErr.takeError()).c_str());
} else if (opt.startswith("jobs=")) {		} else if (opt.startswith("jobs=")) {
if (StringRef(opt_ + 5).getAsInteger(10, Parallelism))		if (StringRef(opt_ + 5).getAsInteger(10, Parallelism))
message(LDPL_FATAL, "Invalid parallelism level: %s", opt_ + 5);		message(LDPL_FATAL, "Invalid parallelism level: %s", opt_ + 5);
} else if (opt.startswith("lto-partitions=")) {		} else if (opt.startswith("lto-partitions=")) {
if (opt.substr(strlen("lto-partitions="))		if (opt.substr(strlen("lto-partitions="))
.getAsInteger(10, ParallelCodeGenParallelismLevel))		.getAsInteger(10, ParallelCodeGenParallelismLevel))
message(LDPL_FATAL, "Invalid codegen partition level: %s", opt_ + 5);		message(LDPL_FATAL, "Invalid codegen partition level: %s", opt_ + 5);
} else if (opt == "disable-verify") {		} else if (opt == "disable-verify") {
▲ Show 20 Lines • Show All 580 Lines • ▼ Show 20 Lines	if (DataSections.getNumOccurrences() == 0)
Conf.Options.DataSections = SplitSections;		Conf.Options.DataSections = SplitSections;

Conf.MAttrs = MAttrs;		Conf.MAttrs = MAttrs;
Conf.RelocModel = RelocationModel;		Conf.RelocModel = RelocationModel;
Conf.CodeModel = getCodeModel();		Conf.CodeModel = getCodeModel();
Conf.CGOptLevel = getCGOptLevel();		Conf.CGOptLevel = getCGOptLevel();
Conf.DisableVerify = options::DisableVerify;		Conf.DisableVerify = options::DisableVerify;
Conf.OptLevel = options::OptLevel;		Conf.OptLevel = options::OptLevel;
		Conf.SizeLevel = options::SizeLevel;
if (options::Parallelism)		if (options::Parallelism)
Backend = createInProcessThinBackend(options::Parallelism);		Backend = createInProcessThinBackend(options::Parallelism);
if (options::thinlto_index_only) {		if (options::thinlto_index_only) {
std::string OldPrefix, NewPrefix;		std::string OldPrefix, NewPrefix;
getThinLTOOldAndNewPrefix(OldPrefix, NewPrefix);		getThinLTOOldAndNewPrefix(OldPrefix, NewPrefix);
Backend = createWriteIndexesThinBackend(OldPrefix, NewPrefix,		Backend = createWriteIndexesThinBackend(OldPrefix, NewPrefix,
options::thinlto_emit_imports_files,		options::thinlto_emit_imports_files,
LinkedObjectsFile, OnIndexWrite);		LinkedObjectsFile, OnIndexWrite);
▲ Show 20 Lines • Show All 274 Lines • Show Last 20 Lines

llvm/tools/llvm-lto2/llvm-lto2.cpp

Show All 23 Lines
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/InitLLVM.h"		#include "llvm/Support/InitLLVM.h"
#include "llvm/Support/TargetSelect.h"		#include "llvm/Support/TargetSelect.h"
#include "llvm/Support/Threading.h"		#include "llvm/Support/Threading.h"

using namespace llvm;		using namespace llvm;
using namespace lto;		using namespace lto;

static cl::opt<char>		static cl::opt<std::string>
OptLevel("O", cl::desc("Optimization level. [-O0, -O1, -O2, or -O3] "		OptLevel("O",
"(default = '-O2')"),		cl::desc("Optimization level. [-O0, -O1, -O2, -Os, -Oz, or "
cl::Prefix, cl::ZeroOrMore, cl::init('2'));		"-O3] (default = '-O2')"),
		cl::Prefix, cl::ZeroOrMore, cl::init("2"));

static cl::opt<char> CGOptLevel(		static cl::opt<char> CGOptLevel(
"cg-opt-level",		"cg-opt-level",
cl::desc("Codegen optimization level (0, 1, 2 or 3, default = '2')"),		cl::desc("Codegen optimization level (0, 1, 2 or 3, default = '2')"),
cl::init('2'));		cl::init('2'));

static cl::list<std::string> InputFilenames(cl::Positional, cl::OneOrMore,		static cl::list<std::string> InputFilenames(cl::Positional, cl::OneOrMore,
cl::desc("<input bitcode files>"));		cl::desc("<input bitcode files>"));
▲ Show 20 Lines • Show All 195 Lines • ▼ Show 20 Lines	static int run(int argc, char **argv) {
Conf.SampleProfile = SamplePGOFile;		Conf.SampleProfile = SamplePGOFile;
Conf.CSIRProfile = CSPGOFile;		Conf.CSIRProfile = CSPGOFile;
Conf.RunCSIRInstr = RunCSIRInstr;		Conf.RunCSIRInstr = RunCSIRInstr;

// Run a custom pipeline, if asked for.		// Run a custom pipeline, if asked for.
Conf.OptPipeline = OptPipeline;		Conf.OptPipeline = OptPipeline;
Conf.AAPipeline = AAPipeline;		Conf.AAPipeline = AAPipeline;

Conf.OptLevel = OptLevel - '0';		Conf.OptLevel =
		check(llvm::lto::getOptLevel(OptLevel), "invalid optimization level");
		Conf.SizeLevel =
		check(llvm::lto::getSizeLevel(OptLevel), "invalid optimization level");
Conf.UseNewPM = UseNewPM;		Conf.UseNewPM = UseNewPM;
switch (CGOptLevel) {		switch (CGOptLevel) {
case '0':		case '0':
Conf.CGOptLevel = CodeGenOpt::None;		Conf.CGOptLevel = CodeGenOpt::None;
break;		break;
case '1':		case '1':
Conf.CGOptLevel = CodeGenOpt::Less;		Conf.CGOptLevel = CodeGenOpt::Less;
break;		break;
▲ Show 20 Lines • Show All 196 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[ThinLTO/FullLTO] Support Os and OzNeeds ReviewPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 236908

lld/COFF/Config.h

lld/COFF/Driver.cpp

lld/COFF/LTO.cpp

lld/ELF/Config.h

lld/ELF/Driver.cpp

lld/ELF/LTO.cpp

lld/test/COFF/lto-opt-level.ll

lld/test/ELF/lto/opt-level.ll

lld/test/wasm/lto/opt-level.ll

lld/wasm/Config.h

lld/wasm/Driver.cpp

lld/wasm/LTO.cpp

llvm/include/llvm/LTO/Config.h

llvm/include/llvm/LTO/LTO.h

llvm/lib/LTO/LTO.cpp

llvm/lib/LTO/LTOBackend.cpp

llvm/test/tools/gold/X86/opt-level.ll

llvm/test/tools/lto/opt-level.ll

llvm/tools/gold/gold-plugin.cpp

llvm/tools/llvm-lto2/llvm-lto2.cpp

[ThinLTO/FullLTO] Support Os and Oz
Needs ReviewPublic