This is an archive of the discontinued LLVM Phabricator instance.

llvm/lib/Transforms/Instrumentation/CGProfile.cpp
29	a static function does not need to be wrapped inside an unnamed namespace
104–109	wrap a struct/class in an unnamed namespace to prevent name collision

aeubanks added a subscriber: aeubanks.Jul 1 2020, 11:04 PM

Looks great, thanks! Just some minor comments.

llvm/include/llvm/InitializePasses.h
200	I think these declarations should be in alphabetic order.
llvm/include/llvm/Transforms/IPO.h
285	ultra nit: I would probably keep the empty line before the } here
llvm/lib/Transforms/Instrumentation/CGProfile.cpp
104–109	Might as well declare it 'final' while you're here.
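
For readers unfamiliar with the linkage nits above, here is a minimal sketch of the convention being asked for. It is not the code from CGProfile.cpp: `LegacyPassExample`, `addOne`, and `describeExamplePass` are hypothetical names invented for illustration, and no LLVM headers are used.

```cpp
#include <string>

namespace {
// A class defined in an unnamed namespace has internal linkage, so another
// translation unit can define its own LegacyPassExample without a clash.
// `final` states that nothing is expected to derive from it.
class LegacyPassExample final {
public:
  std::string name() const { return "cg-profile"; }
};
} // end anonymous namespace

// A free function only needs `static` to get internal linkage; wrapping it
// in the unnamed namespace as well would be redundant.
static int addOne(int X) { return X + 1; }

// Externally visible entry point, loosely analogous to a create*Pass()
// factory. Hypothetical; for illustration only.
std::string describeExamplePass() {
  LegacyPassExample P;
  return P.name() + ":" + std::to_string(addOne(1));
}
```

Only `describeExamplePass` is visible to other translation units; the class and the helper stay private to the file.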

hans added inline comments.Jul 2 2020, 2:55 AM

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
839	In the new pm this pass seems to be added conditionally based on some flag: `if (PTO.CallGraphProfile)`. Is it possible to do something similar here?
llvm/lib/Transforms/Instrumentation/CGProfile.cpp
133	Should it be "cg-profile" to match the npm pass name? Arthur has been working on making such pass names equal recently.

Address comments.

Herald added a project: Restricted Project.Jul 2 2020, 10:27 AM

Herald added a subscriber: cfe-commits.

update.

Harbormaster failed remote builds in B62713: Diff 275161!Jul 2 2020, 11:52 AM

Harbormaster failed remote builds in B62712: Diff 275160!

Enable CGProfilePass for opt with LPM by default, like opt with NPM.

MaskRay added subscribers: zhizhouy, void.Jul 5 2020, 12:37 PM

MaskRay added inline comments.

llvm/tools/opt/opt.cpp
281 ↗	(On Diff #275558)	If there is no strong need for tuning this, please delete the option and PassManagerBuilder::CallGraphProfile. I know that `-enable-npm-call-graph-profile` exists, but it seems like a temporary workaround to me. @zhizhouy @void (D62627) Is the option still used?

Harbormaster failed remote builds in B62936: Diff 275558!Jul 5 2020, 1:20 PM

zhizhouy added inline comments.Jul 5 2020, 2:35 PM

llvm/tools/opt/opt.cpp
281 ↗	(On Diff #275558)	Does GNU assembler recognize .cgprofile section now? I think we should keep this option as long as assemblers other than the integrated assembler are still in use.

MaskRay added inline comments.Jul 5 2020, 3:02 PM

llvm/tools/opt/opt.cpp
281 ↗	(On Diff #275558)	"Does GNU assembler recognize .cgprofile section now?" I don't think it will ever support this section. "I think we should keep this option as long as assemblers other than the integrated assembler are still in use." Can you give a link about the use case?

zhizhouy added inline comments.Jul 5 2020, 3:35 PM

llvm/tools/opt/opt.cpp
281 ↗	(On Diff #275558)	Here is a link to the effort to migrate the Linux kernel to the integrated assembler in ChromeOS: https://bugs.chromium.org/p/chromium/issues/detail?id=1020923 I think they will only migrate newer versions, so the old Linux kernels still use the GNU assembler, and all kernels are built with LLVM now.

MaskRay added inline comments.Jul 5 2020, 4:55 PM

llvm/tools/opt/opt.cpp
281 ↗	(On Diff #275558)	If we have to have an option, we should make both legacy pm and new pm use the same option. We should not create two cl::opt.

Adding Chandler and Alina here as well.

In general, I don't think that this is such a great idea. Being able to have this sort of thing work more reliably is one of the reasons for the new pass manager. I think I'd like to see this split out into an old versus new pass manager pass to avoid the difficulty of cleaning this up after we finish migrating llvm to the new pass manager. This also seems to add some technical debt around options and other enablement which is also less than ideal. Is this compelling to add right now versus finishing work migrating llvm completely to the new pass manager and removing the old one? From speaking with Alina I think that work should be done in a short while.

Thanks.

-eric

In D83013#2132070, @echristo wrote:

Adding Chandler and Alina here as well.

In general, I don't think that this is such a great idea. Being able to have this sort of thing work more reliably is one of the reasons for the new pass manager. I think I'd like to see this split out into an old versus new pass manager pass to avoid the difficulty of cleaning this up after we finish migrating llvm to the new pass manager. This also seems to add some technical debt around options and other enablement which is also less than ideal. Is this compelling to add right now versus finishing work migrating llvm completely to the new pass manager and removing the old one? From speaking with Alina I think that work should be done in a short while.

Thanks.

-eric

I don't think we're that close yet, probably at least a couple months out, there are lots of loose ends to be tied up. I'll make a post soon in llvm-dev (maybe first we can sync up again) about what I think needs to be done before the NPM switch.

In D83013#2132070, @echristo wrote:

Adding Chandler and Alina here as well.

In general, I don't think that this is such a great idea. Being able to have this sort of thing work more reliably is one of the reasons for the new pass manager. I think I'd like to see this split out into an old versus new pass manager pass to avoid the difficulty of cleaning this up after we finish migrating llvm to the new pass manager. This also seems to add some technical debt around options and other enablement which is also less than ideal. Is this compelling to add right now versus finishing work migrating llvm completely to the new pass manager and removing the old one? From speaking with Alina I think that work should be done in a short while.

Given how long the new pass manager has been in progress, we definitely don't want to block on enabling it. So yes, porting this pass to the current pass manager is compelling to do right now. I also don't see why it should be a big deal.

As for splitting it into separate passes, this patch technically does that, although it extracts and changes the core code a bit so it can be shared between the passes. I think that's how most passes have been adapted to work with both pass managers, no?

Delete enable-npm-call-graph-profile option for NPM, using enable-call-graph-profile for both LPM and NPM.

zequanwu marked an inline comment as done.Jul 6 2020, 10:49 AM

In D83013#2132088, @aeubanks wrote:

In D83013#2132070, @echristo wrote:

Adding Chandler and Alina here as well.

In general, I don't think that this is such a great idea. Being able to have this sort of thing work more reliably is one of the reasons for the new pass manager. I think I'd like to see this split out into an old versus new pass manager pass to avoid the difficulty of cleaning this up after we finish migrating llvm to the new pass manager. This also seems to add some technical debt around options and other enablement which is also less than ideal. Is this compelling to add right now versus finishing work migrating llvm completely to the new pass manager and removing the old one? From speaking with Alina I think that work should be done in a short while.

Thanks.

-eric

I don't think we're that close yet, probably at least a couple months out, there are lots of loose ends to be tied up. I'll make a post soon in llvm-dev (maybe first we can sync up again) about what I think needs to be done before the NPM switch.

+1 to sync up again and make progress towards the NPM switch.

I don't want to block this patch, but I do agree with Eric's point. We *really* want to focus more on the switch than invest into more LPM infra. Short term resolutions to unblock folks, with the best split possible, sure, keeping in mind they'll need to be cleaned up. But I don't want us to lose focus on tying up the remaining loose ends for the switch.
I think it's critical for LLVM's codebase health to focus on the NPM switch in the next couple of months.

Harbormaster completed remote builds in B63048: Diff 275763.Jul 6 2020, 11:18 AM

I don't want to block this patch, but I do agree with Eric's point. We *really* want to focus more on the switch than invest into more LPM infra. Short term resolutions to unblock folks, with the best split possible, sure, keeping in mind they'll need to be cleaned up.

Sounds good to me.

LGTM

nikic requested changes to this revision.Jul 7 2020, 1:42 AM

nikic added a subscriber: nikic.

nikic added inline comments.

llvm/test/Other/opt-O2-pipeline.ll
289	Is it possible to switch this pass to use LazyBPI / LazyBFA, only fetched if PGO is actually in use? PGO functionality that most people don't use adding expensive analysis passes like PDT should be avoided.

This revision now requires changes to proceed.Jul 7 2020, 1:42 AM

hans added inline comments.Jul 7 2020, 2:16 AM

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
170	Oh, just noticed: I think CallGraphProfile should be initialized along with the other flags here.
llvm/test/Other/opt-O2-pipeline.ll
289	I wonder if just switching to LazyBlockFrequencyInfo would help though. It looks to me like the CGProfile would request info about each function anyway. I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?

nikic added inline comments.Jul 7 2020, 8:52 AM

llvm/test/Other/opt-O2-pipeline.ll
289	"I wonder if just switching to LazyBlockFrequencyInfo would help though. It looks to me like the CGProfile would request info about each function anyway." It would only help if there is some way to only fetch the analysis conditionally. I believe many PGO passes use something like PSI.hasProfileSummary() or F.hasProfileData() for that. "I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?" Right, just disabling this by default in clang/opt would also work. For reference, the current compile-time numbers for this patch: https://llvm-compile-time-tracker.com/compare.php?from=516ff1d4baee28b1911737e47b42973567adf8ff&to=8df840660bb764b6653fcfd9ac7a72cc6adebde6&stat=instructions Not huge, but it adds up (some similar regressions have been introduced in LLVM 10).

zequanwu marked an inline comment as done.Jul 7 2020, 10:01 AM

zequanwu added inline comments.

llvm/test/Other/opt-O2-pipeline.ll
289	Do you mean disabling it just for LPM or both?

zequanwu marked an inline comment as done.Jul 7 2020, 11:32 AM

zequanwu added inline comments.

llvm/test/Other/opt-O2-pipeline.ll
289	"I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?" For Clang, I think a better fix is that `Opts.CallGraphProfile` should be based on both whether the integrated assembler is used and whether profile instrumentation is turned on. What do you think?

MaskRay added inline comments.Jul 7 2020, 12:19 PM

llvm/test/Other/opt-O2-pipeline.ll
289	I'd prefer not having `CallGraphProfile`: `-no-integrated-as -S` => no .cgprofile (.llvm_addrsig behaves this way); `-S` => .cgprofile.

zequanwu added inline comments.Jul 7 2020, 1:33 PM

llvm/test/Other/opt-O2-pipeline.ll
289	As discussed above, I think `CGProfilePass` should be disabled by default in clang unless `-no-integrated-as` is not given and `-fprofile-instrument-use-path=` is given. So, `Opts.CallGraphProfile` is a convenient switch for that.

Disable enable-call-graph-profile by default in opt.
Disable CGProfilePass by default in clang unless -no-integrated-as is not given and -fprofile-instrument-use-path= is given, as this pass only generates module metadata when profile data is given.
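
The enablement rule in this update can be sketched as a single predicate. This is only an illustration of the condition as worded above, not the code in the patch: `ExampleOptions` and `shouldRunCGProfile` are hypothetical names, and the two booleans stand in for the `-no-integrated-as` and `-fprofile-instrument-use-path=` inputs.

```cpp
// Hypothetical, simplified stand-in for the two inputs discussed above.
struct ExampleOptions {
  bool DisableIntegratedAS; // true when -no-integrated-as is given
  bool ProfileUseGiven;     // true when -fprofile-instrument-use-path= is given
};

// The pass is scheduled only when the integrated assembler is in use AND
// profile data is supplied; every other combination leaves it off.
bool shouldRunCGProfile(const ExampleOptions &Opts) {
  return !Opts.DisableIntegratedAS && Opts.ProfileUseGiven;
}
```

Of the four input combinations, only "integrated assembler, profile given" schedules the pass.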

Herald added subscribers: dexonsmith, steven_wu.Jul 7 2020, 2:25 PM

I still haven't seen a strong argument for keeping the command line option -enable-npm-call-graph-profile. Asked in D62627.

Opts.getProfileUse() != CodeGenOptions::ProfileNone in

Opts.CallGraphProfile = Opts.getProfileUse() != CodeGenOptions::ProfileNone &&
                          !Opts.DisableIntegratedAS;

is redundant. CGProfile.cpp is a no-op if no function provides getEntryFreq().

Harbormaster failed remote builds in B63294: Diff 276208!Jul 7 2020, 5:09 PM

In D83013#2137607, @MaskRay wrote:
Opts.getProfileUse() != CodeGenOptions::ProfileNone in
Opts.CallGraphProfile = Opts.getProfileUse() != CodeGenOptions::ProfileNone &&
                          !Opts.DisableIntegratedAS;
is redundant. CGProfile.cpp is a no-op if no function provides getEntryFreq().

It's a functional no-op, but it runs the BFI analysis, which as Nikita pointed out above adds some compile-time cost. Not scheduling the pass unless we're using profile info seems like a reasonable way to avoid that cost to me.

The alternative of using LazyBlockFrequencyInfoPass and checking PSI->hasProfileSummary() first would also work I guess. If you think that's cleaner, maybe that's the better way to go.

Remove "enable-call-graph-profile" option and enable CGProfilePass by default, unless -no-integrated-as is given in clang.
Use LazyBlockFrequencyInfoPass instead of BlockFrequencyInfoWrapperPass and check F.getEntryCount() before getting BFI to reduce cost.

The alternative of using LazyBlockFrequencyInfoPass and checking PSI->hasProfileSummary() first would also work I guess. If you think that's cleaner, maybe that's the better way to go.

PSI->hasProfileSummary() is not necessary for this pass, since it relies on the function entry count. So I check F.getEntryCount() before getting BFI.
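
The idea of checking the entry count before requesting BFI can be modelled without LLVM. The following is a sketch under stated assumptions rather than the pass itself: `ExampleFunction`, `ExampleAnalysis`, and `processModule` are hypothetical stand-ins, with the callback playing the role of a lazily computed per-function analysis.

```cpp
#include <cstdint>
#include <functional>
#include <optional>
#include <vector>

// Hypothetical stand-in for a function carrying an optional entry count.
struct ExampleFunction {
  std::optional<uint64_t> EntryCount;
};

// Hypothetical stand-in for an expensive per-function analysis result.
struct ExampleAnalysis {
  uint64_t EntryFreq;
};

// Visits every function, but only asks for the expensive analysis when the
// cheap entry-count check passes. Returns how many analyses were requested.
unsigned processModule(
    const std::vector<ExampleFunction> &Functions,
    const std::function<ExampleAnalysis(const ExampleFunction &)> &GetAnalysis) {
  unsigned Requested = 0;
  for (const ExampleFunction &F : Functions) {
    // Cheap check first: no entry count means no profile data to use.
    if (!F.EntryCount)
      continue;
    // Only now pay for the analysis.
    ExampleAnalysis A = GetAnalysis(F);
    (void)A; // A real pass would read frequencies from the result here.
    ++Requested;
  }
  return Requested;
}
```

In a module with no profile data the callback is never invoked, which is the property the compile-time discussion above is after.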

Harbormaster failed remote builds in B63469: Diff 276526!Jul 8 2020, 1:46 PM

In D83013#2139882, @zequanwu wrote:

The alternative of using LazyBlockFrequencyInfoPass and checking PSI->hasProfileSummary() first would also work I guess. If you think that's cleaner, maybe that's the better way to go.

PSI->hasProfileSummary() is not necessary for this pass, since it relies on the function entry count. So I check F.getEntryCount() before getting BFI.

Thanks. The last update looks good to me. I'll defer the approval to @nikic and folks who have expressed concerns about deleting legacy PM.

LG from my side.

New compile-time numbers: https://llvm-compile-time-tracker.com/compare.php?from=0b39d2d75275b80994dac06b7ad05031cbd09393&to=fd070b79e063fff2fad3cd4a467f64dfca83eb90&stat=instructions It's nearly neutral now.

llvm/test/CodeGen/AMDGPU/opt-pipeline.ll
285	This test is out of date.

In D83013#2142271, @nikic wrote:

New compile-time numbers: https://llvm-compile-time-tracker.com/compare.php?from=0b39d2d75275b80994dac06b7ad05031cbd09393&to=fd070b79e063fff2fad3cd4a467f64dfca83eb90&stat=instructions It's nearly neutral now.

Sounds great!

lgtm2 (with the test update Nikita mentioned)

Update test case.

rebase.

Harbormaster failed remote builds in B63629: Diff 276813!Jul 9 2020, 1:03 PM

This revision was not accepted when it landed; it landed in state Needs Review.Jul 9 2020, 1:04 PM

Closed by commit rGc92a8c0a0f68: [LPM] Port CGProfilePass from NPM to LPM (authored by zequanwu).

This revision was automatically updated to reflect the committed changes.

Some inline nits. I see you've already committed and that's fine - I still don't think we should do it, but we can delete it again soon :)

clang/lib/CodeGen/BackendUtil.cpp
623	Comment here as to why.
1150–1152	Comment here as to why.
1570–1572	Ditto :)
llvm/lib/Transforms/Instrumentation/CGProfile.cpp
63	Extra space? Did clang-format put this in?
65–69	Comment? What's the change for?

Harbormaster failed remote builds in B63627: Diff 276808!Jul 9 2020, 1:13 PM

This seems to have broken the build
http://45.33.8.238/linux/22500/step_7.txt

MaskRay added a reverting change: rGc025bdf25a59: Revert D83013 "[LPM] Port CGProfilePass from NPM to LPM".Jul 9 2020, 1:36 PM

MaskRay reopened this revision.Jul 9 2020, 1:42 PM

MaskRay requested changes to this revision.

This revision now requires changes to proceed.Jul 9 2020, 1:42 PM

Add comments and fix test failure in http://45.33.8.238/linux/22500/step_7.txt.

zequanwu marked 5 inline comments as done.Jul 9 2020, 2:16 PM

zequanwu added inline comments.

llvm/lib/Transforms/Instrumentation/CGProfile.cpp
63	Yes, `clang-format` put this in.

Harbormaster failed remote builds in B63641: Diff 276834!Jul 9 2020, 3:06 PM

Still lgtm. For what it's worth, I think you could have just re-committed with the fixes rather than uploading for review again.

This revision was not accepted when it landed; it landed in state Needs Review.Jul 10 2020, 9:05 AM

Closed by commit rG1fbb719470c6: [LPM] Port CGProfilePass from NPM to LPM (authored by zequanwu).

This revision was automatically updated to reflect the committed changes.

In D83013#2143470, @hans wrote:

Still lgtm. For what it's worth, I think you could have just re-committed with the fixes rather than uploading for review again.

Gotcha, thanks.

In D83013#2143470, @hans wrote:

Still lgtm. For what it's worth, I think you could have just re-committed with the fixes rather than uploading for review again.

This may be a difference of habits but I usually upload the last revision if it contains anything more than comment changes. The reviewed version might be read by posterity to get a quick overview about the patch. Browsing git log -p is not very convenient at times.

Revision Contents

Path	Size
clang/include/clang/Basic/CodeGenOptions.def	1 line
clang/lib/CodeGen/BackendUtil.cpp	11 lines
clang/lib/Frontend/CompilerInvocation.cpp	1 line
llvm/include/llvm/InitializePasses.h	1 line
llvm/include/llvm/Transforms/IPO.h	2 lines
llvm/include/llvm/Transforms/IPO/PassManagerBuilder.h	1 line
llvm/include/llvm/Transforms/Instrumentation/CGProfile.h	5 lines
llvm/lib/Passes/PassBuilder.cpp	6 lines
llvm/lib/Transforms/IPO/PassManagerBuilder.cpp	5 lines
llvm/lib/Transforms/Instrumentation/CGProfile.cpp	107 lines
llvm/lib/Transforms/Instrumentation/Instrumentation.cpp	1 line
llvm/test/CodeGen/AMDGPU/opt-pipeline.ll	18 lines
llvm/test/Instrumentation/cgprofile.ll	1 line
llvm/test/Other/ (three files, names not shown)	6 lines each

Diff 277065

clang/include/clang/Basic/CodeGenOptions.def

@@ ... @@ VALUE_CODEGENOPT(TimeTraceGranularity, 32, 500) ///< Minimum time granularity (in microseconds),
 ///< traced by time profiler
 CODEGENOPT(UnrollLoops , 1, 0) ///< Control whether loops are unrolled.
 CODEGENOPT(RerollLoops , 1, 0) ///< Control whether loops are rerolled.
 CODEGENOPT(NoUseJumpTables , 1, 0) ///< Set when -fno-jump-tables is enabled.
 CODEGENOPT(UnwindTables , 1, 0) ///< Emit unwind tables.
 CODEGENOPT(VectorizeLoop , 1, 0) ///< Run loop vectorizer.
 CODEGENOPT(VectorizeSLP , 1, 0) ///< Run SLP vectorizer.
 CODEGENOPT(ProfileSampleAccurate, 1, 0) ///< Sample profile is accurate.
-CODEGENOPT(CallGraphProfile , 1, 0) ///< Run call graph profile.

 /// Attempt to use register sized accesses to bit-fields in structures, when
 /// possible.
 CODEGENOPT(UseRegisterSizedBitfieldAccess , 1, 0)

 CODEGENOPT(VerifyModule , 1, 1) ///< Control whether the module should be run
 ///< through the LLVM Verifier.

clang/lib/CodeGen/BackendUtil.cpp

@@ ... @@ PMBuilder.Inliner = createFunctionInliningPass(
 (!CodeGenOpts.SampleProfileFile.empty() &&
 CodeGenOpts.PrepareForThinLTO));
 }

 PMBuilder.OptLevel = CodeGenOpts.OptimizationLevel;
 PMBuilder.SizeLevel = CodeGenOpts.OptimizeSize;
 PMBuilder.SLPVectorize = CodeGenOpts.VectorizeSLP;
 PMBuilder.LoopVectorize = CodeGenOpts.VectorizeLoop;
+// Only enable CGProfilePass when using integrated assembler, since
+// non-integrated assemblers don't recognize .cgprofile section.
+PMBuilder.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS;
[inline comment] echristo: Comment here as to why.

 PMBuilder.DisableUnrollLoops = !CodeGenOpts.UnrollLoops;
 // Loop interleaving in the loop vectorizer has historically been set to be
 // enabled when loop unrolling is enabled.
 PMBuilder.LoopsInterleaved = CodeGenOpts.UnrollLoops;
 PMBuilder.MergeFunctions = CodeGenOpts.MergeFunctions;
 PMBuilder.PrepareForThinLTO = CodeGenOpts.PrepareForThinLTO;
 PMBuilder.PrepareForLTO = CodeGenOpts.PrepareForLTO;
@@ ... @@ void EmitAssemblyHelper::EmitAssemblyWithNewPassManager(

 PipelineTuningOptions PTO;
 PTO.LoopUnrolling = CodeGenOpts.UnrollLoops;
 // For historical reasons, loop interleaving is set to mirror setting for loop
 // unrolling.
 PTO.LoopInterleaving = CodeGenOpts.UnrollLoops;
 PTO.LoopVectorization = CodeGenOpts.VectorizeLoop;
 PTO.SLPVectorization = CodeGenOpts.VectorizeSLP;
-PTO.CallGraphProfile = CodeGenOpts.CallGraphProfile;
+// Only enable CGProfilePass when using integrated assembler, since
+// non-integrated assemblers don't recognize .cgprofile section.
+PTO.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS;
[inline comment] echristo: Comment here as to why.
 PTO.Coroutines = LangOpts.Coroutines;

 PassInstrumentationCallbacks PIC;
 StandardInstrumentations SI;
 SI.registerCallbacks(PIC);
 PassBuilder PB(TM.get(), PTO, PGOOpt, &PIC);

 // Attempt to load pass plugins and register their callbacks with PB.
@@ ... @@ static void runThinLTOBackend(
 initTargetOptions(Diags, Conf.Options, CGOpts, TOpts, LOpts, HeaderOpts);
 Conf.SampleProfile = std::move(SampleProfile);
 Conf.PTO.LoopUnrolling = CGOpts.UnrollLoops;
 // For historical reasons, loop interleaving is set to mirror setting for loop
 // unrolling.
 Conf.PTO.LoopInterleaving = CGOpts.UnrollLoops;
 Conf.PTO.LoopVectorization = CGOpts.VectorizeLoop;
 Conf.PTO.SLPVectorization = CGOpts.VectorizeSLP;
-Conf.PTO.CallGraphProfile = CGOpts.CallGraphProfile;
+// Only enable CGProfilePass when using integrated assembler, since
+// non-integrated assemblers don't recognize .cgprofile section.
+Conf.PTO.CallGraphProfile = !CGOpts.DisableIntegratedAS;
[inline comment] echristo: Ditto :)

 // Context sensitive profile.
 if (CGOpts.hasProfileCSIRInstr()) {
 Conf.RunCSIRInstr = true;
 Conf.CSIRProfile = std::move(CGOpts.InstrProfileOutput);
 } else if (CGOpts.hasProfileCSIRUse()) {
 Conf.RunCSIRInstr = false;
 Conf.CSIRProfile = std::move(CGOpts.ProfileInstrumentUsePath);

clang/lib/Frontend/CompilerInvocation.cpp

@@ ... @@ static bool ParseCodeGenArgs(CodeGenOptions &Opts, ArgList &Args, InputKind IK,
 if (Opts.SimplifyLibCalls)
 getAllNoBuiltinFuncValues(Args, Opts.NoBuiltinFuncs);
 Opts.UnrollLoops =
 Args.hasFlag(OPT_funroll_loops, OPT_fno_unroll_loops,
 (Opts.OptimizationLevel > 1));
 Opts.RerollLoops = Args.hasArg(OPT_freroll_loops);

 Opts.DisableIntegratedAS = Args.hasArg(OPT_fno_integrated_as);
-Opts.CallGraphProfile = !Opts.DisableIntegratedAS;
 Opts.Autolink = !Args.hasArg(OPT_fno_autolink);
 Opts.SampleProfileFile =
 std::string(Args.getLastArgValue(OPT_fprofile_sample_use_EQ));
 Opts.DebugInfoForProfiling = Args.hasFlag(
 OPT_fdebug_info_for_profiling, OPT_fno_debug_info_for_profiling, false);
 Opts.DebugNameTable = static_cast<unsigned>(
 Args.hasArg(OPT_ggnu_pubnames)
 ? llvm::DICompileUnit::DebugNameTableKind::GNU

llvm/include/llvm/InitializePasses.h

@@ ... @@
 void initializeCFGPrinterLegacyPassPass(PassRegistry&);
 void initializeCFGSimplifyPassPass(PassRegistry&);
 void initializeCFGuardPass(PassRegistry&);
 void initializeCFGuardLongjmpPass(PassRegistry&);
 void initializeCFGViewerLegacyPassPass(PassRegistry&);
 void initializeCFIInstrInserterPass(PassRegistry&);
 void initializeCFLAndersAAWrapperPassPass(PassRegistry&);
 void initializeCFLSteensAAWrapperPassPass(PassRegistry&);
+void initializeCGProfileLegacyPassPass(PassRegistry &);
 void initializeCallGraphDOTPrinterPass(PassRegistry&);
 void initializeCallGraphPrinterLegacyPassPass(PassRegistry&);
 void initializeCallGraphViewerPass(PassRegistry&);
 void initializeCallGraphWrapperPassPass(PassRegistry&);
 void initializeCallSiteSplittingLegacyPassPass(PassRegistry&);
 void initializeCalledValuePropagationLegacyPassPass(PassRegistry &);
 void initializeCodeGenPreparePass(PassRegistry&);
 void initializeConstantHoistingLegacyPassPass(PassRegistry&);
@@ ... @@
 void initializeInferAddressSpacesPass(PassRegistry&);
 void initializeInferFunctionAttrsLegacyPassPass(PassRegistry&);
 void initializeInjectTLIMappingsLegacyPass(PassRegistry &);
 void initializeInlineCostAnalysisPass(PassRegistry&);
 void initializeInstCountPass(PassRegistry&);
 void initializeInstNamerPass(PassRegistry&);
 void initializeInstSimplifyLegacyPassPass(PassRegistry &);
 void initializeInstrProfilingLegacyPassPass(PassRegistry&);
 void initializeInstrOrderFileLegacyPassPass(PassRegistry&);
[inline comment] hans: I think these declarations should be in alphabetic order.
 void initializeInstructionCombiningPassPass(PassRegistry&);
 void initializeInstructionSelectPass(PassRegistry&);
 void initializeInterleavedAccessPass(PassRegistry&);
 void initializeInterleavedLoadCombinePass(PassRegistry &);
 void initializeInternalizeLegacyPassPass(PassRegistry&);
 void initializeIntervalPartitionPass(PassRegistry&);
 void initializeJumpThreadingPass(PassRegistry&);
 void initializeLCSSAVerificationPassPass(PassRegistry&);

llvm/include/llvm/Transforms/IPO.h

@@ ... @@
 // IR metadata to reflect the profile.
 ModulePass *createSampleProfileLoaderPass();
 ModulePass *createSampleProfileLoaderPass(StringRef Name);

 /// Write ThinLTO-ready bitcode to Str.
 ModulePass *createWriteThinLTOBitcodePass(raw_ostream &Str,
 raw_ostream *ThinLinkOS = nullptr);

+ModulePass *createCGProfileLegacyPass();
[inline comment] hans: ultra nit: I would probably keep the empty line before the } here
+
 } // End llvm namespace

 #endif

llvm/include/llvm/Transforms/IPO/PassManagerBuilder.h

Show First 20 Lines • Show All 150 Lines • ▼ Show 20 Lines	public:

/// The module summary index to use for importing information to the		/// The module summary index to use for importing information to the
/// thin LTO backends, for example for the CFI and devirtualization type		/// thin LTO backends, for example for the CFI and devirtualization type
/// tests.		/// tests.
const ModuleSummaryIndex *ImportSummary = nullptr;		const ModuleSummaryIndex *ImportSummary = nullptr;

bool DisableTailCalls;		bool DisableTailCalls;
bool DisableUnrollLoops;		bool DisableUnrollLoops;
		bool CallGraphProfile;
bool SLPVectorize;		bool SLPVectorize;
bool LoopVectorize;		bool LoopVectorize;
bool LoopsInterleaved;		bool LoopsInterleaved;
bool RerollLoops;		bool RerollLoops;
bool NewGVN;		bool NewGVN;
bool DisableGVNLoadPRE;		bool DisableGVNLoadPRE;
bool ForgetAllSCEVInLoopUnroll;		bool ForgetAllSCEVInLoopUnroll;
bool VerifyInput;		bool VerifyInput;

llvm/include/llvm/Transforms/Instrumentation/CGProfile.h

	Show All 13 Lines

	#include "llvm/ADT/MapVector.h"			#include "llvm/ADT/MapVector.h"
	#include "llvm/IR/PassManager.h"			#include "llvm/IR/PassManager.h"

	namespace llvm {			namespace llvm {
	class CGProfilePass : public PassInfoMixin<CGProfilePass> {			class CGProfilePass : public PassInfoMixin<CGProfilePass> {
	public:			public:
	PreservedAnalyses run(Module &M, ModuleAnalysisManager &AM);			PreservedAnalyses run(Module &M, ModuleAnalysisManager &AM);

	private:
	void addModuleFlags(
	Module &M,
	MapVector<std::pair<Function *, Function *>, uint64_t> &Counts) const;
	};			};
	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TRANSFORMS_CGPROFILE_H			#endif // LLVM_TRANSFORMS_CGPROFILE_H

llvm/lib/Passes/PassBuilder.cpp

Show First 20 Lines • Show All 242 Lines • ▼ Show 20 Lines	static const Regex DefaultAliasRegex(
"^(default|thinlto-pre-link|thinlto|lto-pre-link|lto)<(O[0123sz])>$");		"^(default|thinlto-pre-link|thinlto|lto-pre-link|lto)<(O[0123sz])>$");

// This option is used in simplifying testing SampleFDO optimizations for		// This option is used in simplifying testing SampleFDO optimizations for
// profile loading.		// profile loading.
static cl::opt<bool>		static cl::opt<bool>
EnableCHR("enable-chr-npm", cl::init(true), cl::Hidden,		EnableCHR("enable-chr-npm", cl::init(true), cl::Hidden,
cl::desc("Enable control height reduction optimization (CHR)"));		cl::desc("Enable control height reduction optimization (CHR)"));

static cl::opt<bool> EnableCallGraphProfile(
"enable-npm-call-graph-profile", cl::init(true), cl::Hidden,
cl::desc("Enable call graph profile pass for the new PM (default = on)"));

/// Flag to enable inline deferral during PGO.		/// Flag to enable inline deferral during PGO.
static cl::opt<bool>		static cl::opt<bool>
EnablePGOInlineDeferral("enable-npm-pgo-inline-deferral", cl::init(true),		EnablePGOInlineDeferral("enable-npm-pgo-inline-deferral", cl::init(true),
cl::Hidden,		cl::Hidden,
cl::desc("Enable inline deferral during PGO"));		cl::desc("Enable inline deferral during PGO"));

PipelineTuningOptions::PipelineTuningOptions() {		PipelineTuningOptions::PipelineTuningOptions() {
LoopInterleaving = true;		LoopInterleaving = true;
LoopVectorization = true;		LoopVectorization = true;
SLPVectorization = false;		SLPVectorization = false;
LoopUnrolling = true;		LoopUnrolling = true;
ForgetAllSCEVInLoopUnroll = ForgetSCEVInLoopUnroll;		ForgetAllSCEVInLoopUnroll = ForgetSCEVInLoopUnroll;
Coroutines = false;		Coroutines = false;
LicmMssaOptCap = SetLicmMssaOptCap;		LicmMssaOptCap = SetLicmMssaOptCap;
LicmMssaNoAccForPromotionCap = SetLicmMssaNoAccForPromotionCap;		LicmMssaNoAccForPromotionCap = SetLicmMssaNoAccForPromotionCap;
CallGraphProfile = EnableCallGraphProfile;		CallGraphProfile = true;
}		}

extern cl::opt<bool> EnableHotColdSplit;		extern cl::opt<bool> EnableHotColdSplit;
extern cl::opt<bool> EnableOrderFileInstrumentation;		extern cl::opt<bool> EnableOrderFileInstrumentation;

extern cl::opt<bool> FlattenedProfileUsed;		extern cl::opt<bool> FlattenedProfileUsed;

extern cl::opt<AttributorRunOption> AttributorRun;		extern cl::opt<AttributorRunOption> AttributorRun;

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

Show First 20 Lines • Show All 161 Lines • ▼ Show 20 Lines	cl::values(clEnumValN(AttributorRunOption::ALL, "all",
"enable module-wide attributor runs"),		"enable module-wide attributor runs"),
clEnumValN(AttributorRunOption::CGSCC, "cgscc",		clEnumValN(AttributorRunOption::CGSCC, "cgscc",
"enable call graph SCC attributor runs"),		"enable call graph SCC attributor runs"),
clEnumValN(AttributorRunOption::NONE, "none",		clEnumValN(AttributorRunOption::NONE, "none",
"disable attributor runs")));		"disable attributor runs")));

extern cl::opt<bool> EnableKnowledgeRetention;		extern cl::opt<bool> EnableKnowledgeRetention;

PassManagerBuilder::PassManagerBuilder() {		PassManagerBuilder::PassManagerBuilder() {
		hans: Oh, just noticed: I think CallGraphProfile should be initialized along with the other flags here.
OptLevel = 2;		OptLevel = 2;
SizeLevel = 0;		SizeLevel = 0;
LibraryInfo = nullptr;		LibraryInfo = nullptr;
Inliner = nullptr;		Inliner = nullptr;
DisableUnrollLoops = false;		DisableUnrollLoops = false;
SLPVectorize = false;		SLPVectorize = false;
LoopVectorize = true;		LoopVectorize = true;
LoopsInterleaved = true;		LoopsInterleaved = true;
Show All 11 Lines	PassManagerBuilder::PassManagerBuilder() {
EnablePGOCSInstrGen = false;		EnablePGOCSInstrGen = false;
EnablePGOCSInstrUse = false;		EnablePGOCSInstrUse = false;
PGOInstrGen = "";		PGOInstrGen = "";
PGOInstrUse = "";		PGOInstrUse = "";
PGOSampleUse = "";		PGOSampleUse = "";
PrepareForThinLTO = EnablePrepareForThinLTO;		PrepareForThinLTO = EnablePrepareForThinLTO;
PerformThinLTO = EnablePerformThinLTO;		PerformThinLTO = EnablePerformThinLTO;
DivergentTarget = false;		DivergentTarget = false;
		CallGraphProfile = true;
}		}

PassManagerBuilder::~PassManagerBuilder() {		PassManagerBuilder::~PassManagerBuilder() {
delete LibraryInfo;		delete LibraryInfo;
delete Inliner;		delete Inliner;
}		}

/// Set of global extensions, automatically added as part of the standard set.		/// Set of global extensions, automatically added as part of the standard set.
▲ Show 20 Lines • Show All 623 Lines • ▼ Show 20 Lines	void PassManagerBuilder::populateModulePassManager(
// See comment in the new PM for justification of scheduling splitting at		// See comment in the new PM for justification of scheduling splitting at
// this stage (\ref buildModuleSimplificationPipeline).		// this stage (\ref buildModuleSimplificationPipeline).
if (EnableHotColdSplit && !(PrepareForLTO || PrepareForThinLTO))		if (EnableHotColdSplit && !(PrepareForLTO || PrepareForThinLTO))
MPM.add(createHotColdSplittingPass());		MPM.add(createHotColdSplittingPass());

if (MergeFunctions)		if (MergeFunctions)
MPM.add(createMergeFunctionsPass());		MPM.add(createMergeFunctionsPass());

		// Add Module flag "CG Profile" based on Branch Frequency Information.
		if (CallGraphProfile)
		hans: In the new pm this pass seems to be added conditionally based on some flag: `if (PTO.CallGraphProfile)`. Is it possible to do something similar here?
		MPM.add(createCGProfileLegacyPass());

// LoopSink pass sinks instructions hoisted by LICM, which serves as a		// LoopSink pass sinks instructions hoisted by LICM, which serves as a
// canonicalization pass that enables other optimizations. As a result,		// canonicalization pass that enables other optimizations. As a result,
// LoopSink pass needs to be a very late IR pass to avoid undoing LICM		// LoopSink pass needs to be a very late IR pass to avoid undoing LICM
// result too early.		// result too early.
MPM.add(createLoopSinkPass());		MPM.add(createLoopSinkPass());
// Get rid of LCSSA nodes.		// Get rid of LCSSA nodes.
MPM.add(createInstSimplifyLegacyPass());		MPM.add(createInstSimplifyLegacyPass());
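
The inline comments on this file ask for two things: initialize `CallGraphProfile` together with the other flags in the constructor, and add the pass only when that flag is set. A minimal, self-contained sketch of that pattern follows. `ToyPassManager` and `ToyPassManagerBuilder` are hypothetical stand-ins that only record pass names; they are not the LLVM classes, and the pass names are used purely as labels.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a legacy pass manager: it only records names.
struct ToyPassManager {
  std::vector<std::string> Passes;
  void add(std::string Name) { Passes.push_back(std::move(Name)); }
};

// Hypothetical stand-in for a pass manager builder with tuning flags.
struct ToyPassManagerBuilder {
  bool MergeFunctions;
  bool CallGraphProfile;

  // All flags get their defaults in one place, so none is left uninitialized.
  ToyPassManagerBuilder() {
    MergeFunctions = false;
    CallGraphProfile = true;
  }

  void populateModulePassManager(ToyPassManager &MPM) const {
    if (MergeFunctions)
      MPM.add("mergefunc");
    // The profile pass is scheduled only when the client left the flag on.
    if (CallGraphProfile)
      MPM.add("cg-profile");
    MPM.add("loop-sink");
  }
};
```

A client that does not want the pass would set `CallGraphProfile = false` on the builder before populating the pass manager, which is the kind of control the review thread asks for.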


llvm/lib/Transforms/Instrumentation/CGProfile.cpp

//===-- CGProfile.cpp -----------------------------------------------------===//		//===-- CGProfile.cpp -----------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Transforms/Instrumentation/CGProfile.h"		#include "llvm/Transforms/Instrumentation/CGProfile.h"

#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/Analysis/BlockFrequencyInfo.h"		#include "llvm/Analysis/BlockFrequencyInfo.h"
		#include "llvm/Analysis/LazyBlockFrequencyInfo.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/MDBuilder.h"		#include "llvm/IR/MDBuilder.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"
		#include "llvm/InitializePasses.h"
#include "llvm/ProfileData/InstrProf.h"		#include "llvm/ProfileData/InstrProf.h"
		#include "llvm/Transforms/IPO.h"
#include "llvm/Transforms/Instrumentation.h"		#include "llvm/Transforms/Instrumentation.h"

#include <array>		#include <array>

using namespace llvm;		using namespace llvm;

PreservedAnalyses CGProfilePass::run(Module &M, ModuleAnalysisManager &MAM) {		static bool
		addModuleFlags(Module &M,
		MaskRay: a static function does not need to be wrapped inside an unnamed namespace
		MapVector<std::pair<Function *, Function *>, uint64_t> &Counts) {
		if (Counts.empty())
		return false;

		LLVMContext &Context = M.getContext();
		MDBuilder MDB(Context);
		std::vector<Metadata *> Nodes;

		for (auto E : Counts) {
		Metadata *Vals[] = {ValueAsMetadata::get(E.first.first),
		ValueAsMetadata::get(E.first.second),
		MDB.createConstant(ConstantInt::get(
		Type::getInt64Ty(Context), E.second))};
		Nodes.push_back(MDNode::get(Context, Vals));
		}

		M.addModuleFlag(Module::Append, "CG Profile", MDNode::get(Context, Nodes));
		return true;
		}

		static bool runCGProfilePass(
		Module &M, function_ref<BlockFrequencyInfo &(Function &)> GetBFI,
		function_ref<TargetTransformInfo &(Function &)> GetTTI, bool LazyBFI) {
MapVector<std::pair<Function *, Function *>, uint64_t> Counts;		MapVector<std::pair<Function *, Function *>, uint64_t> Counts;
FunctionAnalysisManager &FAM =
MAM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager();
InstrProfSymtab Symtab;		InstrProfSymtab Symtab;
auto UpdateCounts = [&](TargetTransformInfo &TTI, Function *F,		auto UpdateCounts = [&](TargetTransformInfo &TTI, Function *F,
Function *CalledF, uint64_t NewCount) {		Function *CalledF, uint64_t NewCount) {
if (!CalledF || !TTI.isLoweredToCall(CalledF))		if (!CalledF || !TTI.isLoweredToCall(CalledF))
return;		return;
uint64_t &Count = Counts[std::make_pair(F, CalledF)];		uint64_t &Count = Counts[std::make_pair(F, CalledF)];
Count = SaturatingAdd(Count, NewCount);		Count = SaturatingAdd(Count, NewCount);
};		};
// Ignore error here. Indirect calls are ignored if this fails.		// Ignore error here. Indirect calls are ignored if this fails.
(void)(bool)Symtab.create(M);		(void)(bool) Symtab.create(M);
		echristo: Extra space? Did clang-format put this in?
		zequanwu: Yes, `clang-format` put this in.
for (auto &F : M) {		for (auto &F : M) {
if (F.isDeclaration())		// Avoid extra cost of running passes for BFI when the function doesn't have
		// entry count. Since LazyBlockFrequencyInfoPass only exists in LPM, check
		// if using LazyBlockFrequencyInfoPass.
		// TODO: Remove LazyBFI when LazyBlockFrequencyInfoPass is available in NPM.
		if (F.isDeclaration() || (LazyBFI && !F.getEntryCount()))
		echristo: Comment? What's the change for?
continue;		continue;
auto &BFI = FAM.getResult<BlockFrequencyAnalysis>(F);		auto &BFI = GetBFI(F);
if (BFI.getEntryFreq() == 0)		if (BFI.getEntryFreq() == 0)
continue;		continue;
TargetTransformInfo &TTI = FAM.getResult<TargetIRAnalysis>(F);		TargetTransformInfo &TTI = GetTTI(F);
for (auto &BB : F) {		for (auto &BB : F) {
Optional<uint64_t> BBCount = BFI.getBlockProfileCount(&BB);		Optional<uint64_t> BBCount = BFI.getBlockProfileCount(&BB);
if (!BBCount)		if (!BBCount)
continue;		continue;
for (auto &I : BB) {		for (auto &I : BB) {
CallBase *CB = dyn_cast<CallBase>(&I);		CallBase *CB = dyn_cast<CallBase>(&I);
if (!CB)		if (!CB)
continue;		continue;
Show All 10 Lines	for (auto &BB : F) {
}		}
continue;		continue;
}		}
UpdateCounts(TTI, &F, CB->getCalledFunction(), *BBCount);		UpdateCounts(TTI, &F, CB->getCalledFunction(), *BBCount);
}		}
}		}
}		}

addModuleFlags(M, Counts);		return addModuleFlags(M, Counts);
		}

return PreservedAnalyses::all();		namespace {
		struct CGProfileLegacyPass final : public ModulePass {
		static char ID;
		CGProfileLegacyPass() : ModulePass(ID) {
		initializeCGProfileLegacyPassPass(*PassRegistry::getPassRegistry());
}		}
		MaskRay: wrap a struct/class in an unnamed namespace to prevent name collision
		hans: Might as well declare it 'final' while you're here.

void CGProfilePass::addModuleFlags(		void getAnalysisUsage(AnalysisUsage &AU) const override {
Module &M,		AU.setPreservesCFG();
MapVector<std::pair<Function *, Function *>, uint64_t> &Counts) const {		AU.addRequired<LazyBlockFrequencyInfoPass>();
if (Counts.empty())		AU.addRequired<TargetTransformInfoWrapperPass>();
return;		}

LLVMContext &Context = M.getContext();		bool runOnModule(Module &M) override {
MDBuilder MDB(Context);		auto GetBFI = [this](Function &F) -> BlockFrequencyInfo & {
std::vector<Metadata *> Nodes;		return this->getAnalysis<LazyBlockFrequencyInfoPass>(F).getBFI();
		};
		auto GetTTI = [this](Function &F) -> TargetTransformInfo & {
		return this->getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);
		};

for (auto E : Counts) {		return runCGProfilePass(M, GetBFI, GetTTI, true);
Metadata *Vals[] = {ValueAsMetadata::get(E.first.first),
ValueAsMetadata::get(E.first.second),
MDB.createConstant(ConstantInt::get(
Type::getInt64Ty(Context), E.second))};
Nodes.push_back(MDNode::get(Context, Vals));
}		}
		};

M.addModuleFlag(Module::Append, "CG Profile", MDNode::get(Context, Nodes));		} // namespace

		char CGProfileLegacyPass::ID = 0;

		INITIALIZE_PASS(CGProfileLegacyPass, "cg-profile", "Call Graph Profile", false,
		hans: Should it be "cg-profile" to match the npm pass name? Arthur has been working on making such pass names equal recently.
		false)

		ModulePass *llvm::createCGProfileLegacyPass() {
		return new CGProfileLegacyPass();
		}

		PreservedAnalyses CGProfilePass::run(Module &M, ModuleAnalysisManager &MAM) {
		FunctionAnalysisManager &FAM =
		MAM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager();
		auto GetBFI = [&FAM](Function &F) -> BlockFrequencyInfo & {
		return FAM.getResult<BlockFrequencyAnalysis>(F);
		};
		auto GetTTI = [&FAM](Function &F) -> TargetTransformInfo & {
		return FAM.getResult<TargetIRAnalysis>(F);
		};

		runCGProfilePass(M, GetBFI, GetTTI, false);

		return PreservedAnalyses::all();
}		}
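
The right-hand side of this diff moves the counting logic into one shared function that receives analysis results through callbacks, so both pass managers can reuse it, and it skips functions without an entry count when the analysis is lazy. The sketch below illustrates that shape under stated assumptions: `ToyFunction`, `ToyCall`, `EdgeCounts`, `saturatingAdd` and `runToyProfile` are hypothetical names, `std::function` stands in for `function_ref`, and `std::map` stands in for `MapVector`. It is not the pass itself.

```cpp
#include <cstdint>
#include <functional>
#include <limits>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins; none of these are LLVM types.
struct ToyCall {
  std::string Callee;
  uint64_t BlockCount; // profile count of the block containing the call
};
struct ToyFunction {
  std::string Name;
  bool HasEntryCount; // models "the function carries profile data"
  std::vector<ToyCall> Calls;
};
using EdgeCounts = std::map<std::pair<std::string, std::string>, uint64_t>;

// Add without wrapping around: clamp to the maximum value on overflow.
static uint64_t saturatingAdd(uint64_t A, uint64_t B) {
  const uint64_t Max = std::numeric_limits<uint64_t>::max();
  return (A > Max - B) ? Max : A + B;
}

// Shared core: the caller supplies the callback that fetches the analysis.
static bool
runToyProfile(const std::vector<ToyFunction> &Module,
              const std::function<bool(const ToyFunction &)> &FetchAnalysis,
              bool LazyAnalysis, EdgeCounts &Counts) {
  for (const ToyFunction &F : Module) {
    // With a lazy analysis, skip unprofiled functions before the callback
    // has a chance to trigger any computation.
    if (LazyAnalysis && !F.HasEntryCount)
      continue;
    if (!FetchAnalysis(F))
      continue;
    for (const ToyCall &C : F.Calls) {
      uint64_t &Count = Counts[{F.Name, C.Callee}];
      Count = saturatingAdd(Count, C.BlockCount);
    }
  }
  // Mirrors the "did we add anything" return value of the refactored helper.
  return !Counts.empty();
}
```

Each wrapper then only differs in the callback it passes and in the value of the lazy flag, which keeps the per-edge accumulation in one place.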

llvm/lib/Transforms/Instrumentation/Instrumentation.cpp

Show First 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	void llvm::initializeInstrumentation(PassRegistry &Registry) {
initializeModuleAddressSanitizerLegacyPassPass(Registry);		initializeModuleAddressSanitizerLegacyPassPass(Registry);
initializeBoundsCheckingLegacyPassPass(Registry);		initializeBoundsCheckingLegacyPassPass(Registry);
initializeControlHeightReductionLegacyPassPass(Registry);		initializeControlHeightReductionLegacyPassPass(Registry);
initializeGCOVProfilerLegacyPassPass(Registry);		initializeGCOVProfilerLegacyPassPass(Registry);
initializePGOInstrumentationGenLegacyPassPass(Registry);		initializePGOInstrumentationGenLegacyPassPass(Registry);
initializePGOInstrumentationUseLegacyPassPass(Registry);		initializePGOInstrumentationUseLegacyPassPass(Registry);
initializePGOIndirectCallPromotionLegacyPassPass(Registry);		initializePGOIndirectCallPromotionLegacyPassPass(Registry);
initializePGOMemOPSizeOptLegacyPassPass(Registry);		initializePGOMemOPSizeOptLegacyPassPass(Registry);
		initializeCGProfileLegacyPassPass(Registry);
initializeInstrOrderFileLegacyPassPass(Registry);		initializeInstrOrderFileLegacyPassPass(Registry);
initializeInstrProfilingLegacyPassPass(Registry);		initializeInstrProfilingLegacyPassPass(Registry);
initializeMemorySanitizerLegacyPassPass(Registry);		initializeMemorySanitizerLegacyPassPass(Registry);
initializeHWAddressSanitizerLegacyPassPass(Registry);		initializeHWAddressSanitizerLegacyPassPass(Registry);
initializeThreadSanitizerLegacyPassPass(Registry);		initializeThreadSanitizerLegacyPassPass(Registry);
initializeModuleSanitizerCoverageLegacyPassPass(Registry);		initializeModuleSanitizerCoverageLegacyPassPass(Registry);
initializeDataFlowSanitizerPass(Registry);		initializeDataFlowSanitizerPass(Registry);
}		}

/// LLVMInitializeInstrumentation - C binding for		/// LLVMInitializeInstrumentation - C binding for
/// initializeInstrumentation.		/// initializeInstrumentation.
void LLVMInitializeInstrumentation(LLVMPassRegistryRef R) {		void LLVMInitializeInstrumentation(LLVMPassRegistryRef R) {
initializeInstrumentation(*unwrap(R));		initializeInstrumentation(*unwrap(R));
}		}

llvm/test/CodeGen/AMDGPU/opt-pipeline.ll

	Show First 20 Lines • Show All 270 Lines • ▼ Show 20 Lines
	; GCN-O1-NEXT: Loop Pass Manager			; GCN-O1-NEXT: Loop Pass Manager
	; GCN-O1-NEXT: Loop Invariant Code Motion			; GCN-O1-NEXT: Loop Invariant Code Motion
	; GCN-O1-NEXT: Lazy Branch Probability Analysis			; GCN-O1-NEXT: Lazy Branch Probability Analysis
	; GCN-O1-NEXT: Lazy Block Frequency Analysis			; GCN-O1-NEXT: Lazy Block Frequency Analysis
	; GCN-O1-NEXT: Optimization Remark Emitter			; GCN-O1-NEXT: Optimization Remark Emitter
	; GCN-O1-NEXT: Warn about non-applied transformations			; GCN-O1-NEXT: Warn about non-applied transformations
	; GCN-O1-NEXT: Alignment from assumptions			; GCN-O1-NEXT: Alignment from assumptions
	; GCN-O1-NEXT: Strip Unused Function Prototypes			; GCN-O1-NEXT: Strip Unused Function Prototypes
				; GCN-O1-NEXT: Call Graph Profile
				; GCN-O1-NEXT: FunctionPass Manager
				; GCN-O1-NEXT: Dominator Tree Construction
				; GCN-O1-NEXT: Natural Loop Information
				; GCN-O1-NEXT: Lazy Branch Probability Analysis
				; GCN-O1-NEXT: Lazy Block Frequency Analysis
	; GCN-O1-NEXT: FunctionPass Manager			; GCN-O1-NEXT: FunctionPass Manager
				nikic: This test is out of date.
	; GCN-O1-NEXT: Dominator Tree Construction			; GCN-O1-NEXT: Dominator Tree Construction
	; GCN-O1-NEXT: Natural Loop Information			; GCN-O1-NEXT: Natural Loop Information
	; GCN-O1-NEXT: Post-Dominator Tree Construction			; GCN-O1-NEXT: Post-Dominator Tree Construction
	; GCN-O1-NEXT: Branch Probability Analysis			; GCN-O1-NEXT: Branch Probability Analysis
	; GCN-O1-NEXT: Block Frequency Analysis			; GCN-O1-NEXT: Block Frequency Analysis
	; GCN-O1-NEXT: Canonicalize natural loops			; GCN-O1-NEXT: Canonicalize natural loops
	; GCN-O1-NEXT: LCSSA Verifier			; GCN-O1-NEXT: LCSSA Verifier
	; GCN-O1-NEXT: Loop-Closed SSA Form Pass			; GCN-O1-NEXT: Loop-Closed SSA Form Pass
	▲ Show 20 Lines • Show All 330 Lines • ▼ Show 20 Lines
	; GCN-O2-NEXT: Lazy Branch Probability Analysis			; GCN-O2-NEXT: Lazy Branch Probability Analysis
	; GCN-O2-NEXT: Lazy Block Frequency Analysis			; GCN-O2-NEXT: Lazy Block Frequency Analysis
	; GCN-O2-NEXT: Optimization Remark Emitter			; GCN-O2-NEXT: Optimization Remark Emitter
	; GCN-O2-NEXT: Warn about non-applied transformations			; GCN-O2-NEXT: Warn about non-applied transformations
	; GCN-O2-NEXT: Alignment from assumptions			; GCN-O2-NEXT: Alignment from assumptions
	; GCN-O2-NEXT: Strip Unused Function Prototypes			; GCN-O2-NEXT: Strip Unused Function Prototypes
	; GCN-O2-NEXT: Dead Global Elimination			; GCN-O2-NEXT: Dead Global Elimination
	; GCN-O2-NEXT: Merge Duplicate Global Constants			; GCN-O2-NEXT: Merge Duplicate Global Constants
				; GCN-O2-NEXT: Call Graph Profile
				; GCN-O2-NEXT: FunctionPass Manager
				; GCN-O2-NEXT: Dominator Tree Construction
				; GCN-O2-NEXT: Natural Loop Information
				; GCN-O2-NEXT: Lazy Branch Probability Analysis
				; GCN-O2-NEXT: Lazy Block Frequency Analysis
	; GCN-O2-NEXT: FunctionPass Manager			; GCN-O2-NEXT: FunctionPass Manager
	; GCN-O2-NEXT: Dominator Tree Construction			; GCN-O2-NEXT: Dominator Tree Construction
	; GCN-O2-NEXT: Natural Loop Information			; GCN-O2-NEXT: Natural Loop Information
	; GCN-O2-NEXT: Post-Dominator Tree Construction			; GCN-O2-NEXT: Post-Dominator Tree Construction
	; GCN-O2-NEXT: Branch Probability Analysis			; GCN-O2-NEXT: Branch Probability Analysis
	; GCN-O2-NEXT: Block Frequency Analysis			; GCN-O2-NEXT: Block Frequency Analysis
	; GCN-O2-NEXT: Canonicalize natural loops			; GCN-O2-NEXT: Canonicalize natural loops
	; GCN-O2-NEXT: LCSSA Verifier			; GCN-O2-NEXT: LCSSA Verifier
	▲ Show 20 Lines • Show All 336 Lines • ▼ Show 20 Lines
	; GCN-O3-NEXT: Lazy Branch Probability Analysis			; GCN-O3-NEXT: Lazy Branch Probability Analysis
	; GCN-O3-NEXT: Lazy Block Frequency Analysis			; GCN-O3-NEXT: Lazy Block Frequency Analysis
	; GCN-O3-NEXT: Optimization Remark Emitter			; GCN-O3-NEXT: Optimization Remark Emitter
	; GCN-O3-NEXT: Warn about non-applied transformations			; GCN-O3-NEXT: Warn about non-applied transformations
	; GCN-O3-NEXT: Alignment from assumptions			; GCN-O3-NEXT: Alignment from assumptions
	; GCN-O3-NEXT: Strip Unused Function Prototypes			; GCN-O3-NEXT: Strip Unused Function Prototypes
	; GCN-O3-NEXT: Dead Global Elimination			; GCN-O3-NEXT: Dead Global Elimination
	; GCN-O3-NEXT: Merge Duplicate Global Constants			; GCN-O3-NEXT: Merge Duplicate Global Constants
				; GCN-O3-NEXT: Call Graph Profile
				; GCN-O3-NEXT: FunctionPass Manager
				; GCN-O3-NEXT: Dominator Tree Construction
				; GCN-O3-NEXT: Natural Loop Information
				; GCN-O3-NEXT: Lazy Branch Probability Analysis
				; GCN-O3-NEXT: Lazy Block Frequency Analysis
	; GCN-O3-NEXT: FunctionPass Manager			; GCN-O3-NEXT: FunctionPass Manager
	; GCN-O3-NEXT: Dominator Tree Construction			; GCN-O3-NEXT: Dominator Tree Construction
	; GCN-O3-NEXT: Natural Loop Information			; GCN-O3-NEXT: Natural Loop Information
	; GCN-O3-NEXT: Post-Dominator Tree Construction			; GCN-O3-NEXT: Post-Dominator Tree Construction
	; GCN-O3-NEXT: Branch Probability Analysis			; GCN-O3-NEXT: Branch Probability Analysis
	; GCN-O3-NEXT: Block Frequency Analysis			; GCN-O3-NEXT: Block Frequency Analysis
	; GCN-O3-NEXT: Canonicalize natural loops			; GCN-O3-NEXT: Canonicalize natural loops
	; GCN-O3-NEXT: LCSSA Verifier			; GCN-O3-NEXT: LCSSA Verifier

llvm/test/Instrumentation/cgprofile.ll

	; RUN: opt < %s -passes cg-profile -S | FileCheck %s			; RUN: opt < %s -passes cg-profile -S | FileCheck %s
				; RUN: opt < %s -cg-profile -S | FileCheck %s

	declare void @b()			declare void @b()

	define void @a() !prof !1 {			define void @a() !prof !1 {
	call void @b()			call void @b()
	ret void			ret void
	}			}


llvm/test/Other/new-pm-cgprofile.ll

This file was deleted.

	; RUN: opt -debug-pass-manager -passes='default<O2>' %s 2>&1 |FileCheck %s --check-prefixes=DEFAULT
	; RUN: opt -debug-pass-manager -passes='default<O2>' -enable-npm-call-graph-profile=0 %s 2>&1 |FileCheck %s --check-prefixes=OFF
	; RUN: opt -debug-pass-manager -passes='default<O2>' -enable-npm-call-graph-profile=1 %s 2>&1 |FileCheck %s --check-prefixes=ON
	;
	; DEFAULT: Running pass: CGProfilePass
	; OFF-NOT: Running pass: CGProfilePass
	; ON: Running pass: CGProfilePass

	define void @foo() {
	ret void
	}

llvm/test/Other/opt-O2-pipeline.ll

	Show First 20 Lines • Show All 274 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Lazy Branch Probability Analysis			; CHECK-NEXT: Lazy Branch Probability Analysis
	; CHECK-NEXT: Lazy Block Frequency Analysis			; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: Optimization Remark Emitter			; CHECK-NEXT: Optimization Remark Emitter
	; CHECK-NEXT: Warn about non-applied transformations			; CHECK-NEXT: Warn about non-applied transformations
	; CHECK-NEXT: Alignment from assumptions			; CHECK-NEXT: Alignment from assumptions
	; CHECK-NEXT: Strip Unused Function Prototypes			; CHECK-NEXT: Strip Unused Function Prototypes
	; CHECK-NEXT: Dead Global Elimination			; CHECK-NEXT: Dead Global Elimination
	; CHECK-NEXT: Merge Duplicate Global Constants			; CHECK-NEXT: Merge Duplicate Global Constants
				; CHECK-NEXT: Call Graph Profile
				; CHECK-NEXT: FunctionPass Manager
				; CHECK-NEXT: Dominator Tree Construction
				; CHECK-NEXT: Natural Loop Information
				; CHECK-NEXT: Lazy Branch Probability Analysis
				; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
				nikic: Is it possible to switch this pass to use LazyBPI / LazyBFA, only fetched if PGO is actually in use? PGO functionality that most people don't use adding expensive analysis passes like PDT should be avoided.
				hans: I wonder if just switching to LazyBlockFrequencyInfo would help though. It looks to me like the CGProfile would request info about each function anyway. I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?
				nikic:
				> I wonder if just switching to LazyBlockFrequencyInfo would help though. It looks to me like the CGProfile would request info about each function anyway.
				It would only help if there is some way to only fetch the analysis conditionally. I believe many PGO passes use something like PSI.hasProfileSummary() or F.hasProfileData() for that.
				> I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?
				Right, just disabling this by default in clang/opt would also work. For reference, the current compile-time numbers for this patch: https://llvm-compile-time-tracker.com/compare.php?from=516ff1d4baee28b1911737e47b42973567adf8ff&to=8df840660bb764b6653fcfd9ac7a72cc6adebde6&stat=instructions Not huge, but it adds up (some similar regressions have been introduced in LLVM 10).
				zequanwu: Do you mean disabling it just for LPM or both?
				zequanwu:
				> I was surprised to see that Clang sets Opts.CallGraphProfile solely based on whether the integrated assembler is used. Maybe a better fix is to only set that to true when a profile is actually being used?
				For Clang, I think a better fix is for `Opts.CallGraphProfile` to be based on both whether the integrated assembler is used and whether profile instrumentation is turned on. What do you think?
				MaskRay: I'd prefer not having `CallGraphProfile`
				* `-no-integrated-as -S` => no .cgprofile (.llvm_addrsig behaves this way)
				* `-S` -> .cgprofile
				zequanwu: As discussed above, I think `CGProfilePass` should be disabled by default in clang unless `-no-integrated-as` is not given and `-fprofile-instrument-use-path=` is given. So, `Opts.CallGraphProfile` is a convenient switch for that.
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Post-Dominator Tree Construction			; CHECK-NEXT: Post-Dominator Tree Construction
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Canonicalize natural loops			; CHECK-NEXT: Canonicalize natural loops
	; CHECK-NEXT: LCSSA Verifier			; CHECK-NEXT: LCSSA Verifier
	; CHECK-NEXT: Loop-Closed SSA Form Pass			; CHECK-NEXT: Loop-Closed SSA Form Pass
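
The condition proposed in the thread above (run the pass only when `-no-integrated-as` is not given and `-fprofile-instrument-use-path=` is given) can be sketched as a small predicate. This is only an illustration of the proposal as worded in the comments: `ToyCodeGenOptions` and `shouldRunCGProfile` are hypothetical names, not Clang's actual option struct or code.

```cpp
#include <string>

// Hypothetical stand-in for the relevant driver options; not Clang's type.
struct ToyCodeGenOptions {
  bool NoIntegratedAs = false;          // models -no-integrated-as
  std::string ProfileInstrumentUsePath; // models -fprofile-instrument-use-path=
};

// Enable the pass only when the integrated assembler is in use and a
// profile has actually been supplied.
inline bool shouldRunCGProfile(const ToyCodeGenOptions &Opts) {
  return !Opts.NoIntegratedAs && !Opts.ProfileInstrumentUsePath.empty();
}
```

Under this rule a plain optimized build without a profile would not schedule the pass, which is the compile-time concern raised by nikic.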

llvm/test/Other/opt-O3-pipeline.ll

	Show First 20 Lines • Show All 279 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Lazy Branch Probability Analysis			; CHECK-NEXT: Lazy Branch Probability Analysis
	; CHECK-NEXT: Lazy Block Frequency Analysis			; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: Optimization Remark Emitter			; CHECK-NEXT: Optimization Remark Emitter
	; CHECK-NEXT: Warn about non-applied transformations			; CHECK-NEXT: Warn about non-applied transformations
	; CHECK-NEXT: Alignment from assumptions			; CHECK-NEXT: Alignment from assumptions
	; CHECK-NEXT: Strip Unused Function Prototypes			; CHECK-NEXT: Strip Unused Function Prototypes
	; CHECK-NEXT: Dead Global Elimination			; CHECK-NEXT: Dead Global Elimination
	; CHECK-NEXT: Merge Duplicate Global Constants			; CHECK-NEXT: Merge Duplicate Global Constants
				; CHECK-NEXT: Call Graph Profile
				; CHECK-NEXT: FunctionPass Manager
				; CHECK-NEXT: Dominator Tree Construction
				; CHECK-NEXT: Natural Loop Information
				; CHECK-NEXT: Lazy Branch Probability Analysis
				; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Post-Dominator Tree Construction			; CHECK-NEXT: Post-Dominator Tree Construction
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Canonicalize natural loops			; CHECK-NEXT: Canonicalize natural loops
	; CHECK-NEXT: LCSSA Verifier			; CHECK-NEXT: LCSSA Verifier

llvm/test/Other/opt-Os-pipeline.ll

	; CHECK-NEXT: Lazy Branch Probability Analysis			; CHECK-NEXT: Lazy Branch Probability Analysis
	; CHECK-NEXT: Lazy Block Frequency Analysis			; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: Optimization Remark Emitter			; CHECK-NEXT: Optimization Remark Emitter
	; CHECK-NEXT: Warn about non-applied transformations			; CHECK-NEXT: Warn about non-applied transformations
	; CHECK-NEXT: Alignment from assumptions			; CHECK-NEXT: Alignment from assumptions
	; CHECK-NEXT: Strip Unused Function Prototypes			; CHECK-NEXT: Strip Unused Function Prototypes
	; CHECK-NEXT: Dead Global Elimination			; CHECK-NEXT: Dead Global Elimination
	; CHECK-NEXT: Merge Duplicate Global Constants			; CHECK-NEXT: Merge Duplicate Global Constants
				; CHECK-NEXT: Call Graph Profile
				; CHECK-NEXT: FunctionPass Manager
				; CHECK-NEXT: Dominator Tree Construction
				; CHECK-NEXT: Natural Loop Information
				; CHECK-NEXT: Lazy Branch Probability Analysis
				; CHECK-NEXT: Lazy Block Frequency Analysis
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Post-Dominator Tree Construction			; CHECK-NEXT: Post-Dominator Tree Construction
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Canonicalize natural loops			; CHECK-NEXT: Canonicalize natural loops
	; CHECK-NEXT: LCSSA Verifier			; CHECK-NEXT: LCSSA Verifier

[LPM] Port CGProfilePass from NPM to LPM
Closed · Public

Revision Contents

Diff 277065

clang/include/clang/Basic/CodeGenOptions.def

clang/lib/CodeGen/BackendUtil.cpp

clang/lib/Frontend/CompilerInvocation.cpp

llvm/include/llvm/InitializePasses.h

llvm/include/llvm/Transforms/IPO.h

llvm/include/llvm/Transforms/IPO/PassManagerBuilder.h

llvm/include/llvm/Transforms/Instrumentation/CGProfile.h

llvm/lib/Passes/PassBuilder.cpp

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

llvm/lib/Transforms/Instrumentation/CGProfile.cpp

llvm/lib/Transforms/Instrumentation/Instrumentation.cpp

llvm/test/CodeGen/AMDGPU/opt-pipeline.ll

llvm/test/Instrumentation/cgprofile.ll

llvm/test/Other/new-pm-cgprofile.ll

llvm/test/Other/opt-O2-pipeline.ll

llvm/test/Other/opt-O3-pipeline.ll

llvm/test/Other/opt-Os-pipeline.ll
