This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/CodeGen/
-
test/
-
CodeGen/
-
pseudo-probe-emit.c
-
llvm/lib/Passes/
-
lib/
-
Passes/
1/2
PassBuilder.cpp

Differential D109531

[CSSPGO] Enable pseudo probe instrumentation in O0 mode.
ClosedPublic

Authored by hoy on Sep 9 2021, 11:42 AM.

Download Raw Diff

Details

Reviewers

wenlei
wlei
wmi

Commits

rG299b5d420df1: [CSSPGO] Enable pseudo probe instrumentation in O0 mode.

Summary

Pseudo probe instrumentation was missing from O0 build. It is needed in cases where some source files are built in O0 while the others are built in optimize mode.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hoy created this revision.Sep 9 2021, 11:42 AM

Herald added subscribers: modimo, wenlei, hiraditya. · View Herald TranscriptSep 9 2021, 11:42 AM

hoy requested review of this revision.Sep 9 2021, 11:42 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptSep 9 2021, 11:42 AM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

hoy added reviewers: wenlei, wlei, wmi.Sep 9 2021, 11:43 AM

hoy added a subscriber: spupyrev.

LGTM, thanks!

This revision is now accepted and ready to land.Sep 9 2021, 12:18 PM

Harbormaster completed remote builds in B123277: Diff 371675.Sep 9 2021, 1:16 PM

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

In D109531#2992721, @hoy wrote:

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

The complain message is emitted in SampleProfileLoader::doInitialization. llvm will not run SampleProfileLoader pass for O0 module. Why there is the complain?

In D109531#2993394, @wmi wrote:

In D109531#2992721, @hoy wrote:

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

The complain message is emitted in SampleProfileLoader::doInitialization. llvm will not run SampleProfileLoader pass for O0 module. Why there is the complain?

Good question. It could happen in lto postlink which by default optimizes in -O2 mode. More specifically, with the following command, both cc1 and lld will run in default mode, which is -O0 for cc1 and -O2 for lld.

clang -flto 1.cpp -v -fuse-ld=lld

In D109531#2993394, @wmi wrote:

In D109531#2992721, @hoy wrote:

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

The complain message is emitted in SampleProfileLoader::doInitialization. llvm will not run SampleProfileLoader pass for O0 module. Why there is the complain?

We've encountered this exception while building an old version of gcc (8.3) with llvm-12. As Hongtao pointed out, they sometimes try to build targets with "-flto" but without "-O2/O3". Surely, the right "fix" would be to modify the gcc build scripts (which is possibly already done in later gcc releases); this workaround is to make sure we can also process "incorrect" builds.

In D109531#2993484, @hoy wrote:
In D109531#2993394, @wmi wrote:

In D109531#2992721, @hoy wrote:

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

The complain message is emitted in SampleProfileLoader::doInitialization. llvm will not run SampleProfileLoader pass for O0 module. Why there is the complain?

Good question. It could happen in lto postlink which by default optimizes in -O2 mode. More specifically, with the following command, both cc1 and lld will run in default mode, which is -O0 for cc1 and -O2 for lld.
clang -flto 1.cpp -v -fuse-ld=lld

I see. It seems a problem only exposed in monolithic lto. Could you add some comment before the change in PassBuilder.cpp?

In D109531#2994823, @wmi wrote:
In D109531#2993484, @hoy wrote:
In D109531#2993394, @wmi wrote:

In D109531#2992721, @hoy wrote:

In D109531#2992702, @wenlei wrote:

The change makes sense given instr PGO also happens for O0. But practically, if a file is being built with O0, do we care about its profile given we're not really optimizing it anyways? Functions from O0 modules are not supposed to be inlined into O1+ modules either.

We probably don't care about performance for O0 build. The change is for consistency, also makes the compiler happy which otherwise will complain about "Pseudo-probe-based profile requires SampleProfileProbePass" for O0 modules that don't have probes.

The complain message is emitted in SampleProfileLoader::doInitialization. llvm will not run SampleProfileLoader pass for O0 module. Why there is the complain?

Good question. It could happen in lto postlink which by default optimizes in -O2 mode. More specifically, with the following command, both cc1 and lld will run in default mode, which is -O0 for cc1 and -O2 for lld.
clang -flto 1.cpp -v -fuse-ld=lld
I see. It seems a problem only exposed in monolithic lto. Could you add some comment before the change in PassBuilder.cpp?

Sounds good, comment added. Actually this can also happen in thinlto. Having probe inserted in O0 mode allows users to switch between arbitrary setups.

Updating D109531: [CSSPGO] Enable pseudo probe instrumentation in O0 mode.

LGTM.

Harbormaster completed remote builds in B123462: Diff 371942.Sep 10 2021, 10:29 AM

More specifically, with the following command, both cc1 and lld will run in default mode, which is -O0 for cc1 and -O2 for lld.
clang -flto 1.cpp -v -fuse-ld=lld

I'm wondering is this the expected behavior or an oversight of pass pipeline setup? In what scenario would a O0 prelink + O2 postlink make sense?

Btw, just double check - the O2 here you mentioned is not LLD's O2 for linking, but actually postlink LLVM O2, right?

llvm/lib/Passes/PassBuilder.cpp
1930	Loading a sample profile in the postlink will require pseudo probe instrumentation in the prelink. Even with this change, is it still possible that prelink compile for some module actually doesn't have `-fpseudo-probe-for-profiling`, and it's on for LTO postlink? We could contrive such case, and it could happen in reality too, right? Would we have the same problem when trying to load profile for functions from modules without pseudo-probe in prelink?

hoy added inline comments.Sep 13 2021, 9:07 AM

llvm/lib/Passes/PassBuilder.cpp
1930	Yes, it could happen. The compiler will stop working too with the current implementation. We could change the error reporting to be a warning to make that pass. It is an error because we want to remind user if that's intentional. I think it's mostly user's responsibility to be clear if pseudo probe instrumentation is needed or not, especially when passing linker flags separately. The change being made here is to ensure it works when all flags are passed via CXX_FLAGS, such as clang -flto 1.cpp -v -fuse-ld=lld -fpseudo-probe-for-profiling -fprofile-sample-use=....

lgtm, thanks.

This revision was landed with ongoing or failed builds.Sep 14 2021, 6:13 PM

Closed by commit rG299b5d420df1: [CSSPGO] Enable pseudo probe instrumentation in O0 mode. (authored by hoy). · Explain Why

This revision was automatically updated to reflect the committed changes.

hoy added a commit: rG299b5d420df1: [CSSPGO] Enable pseudo probe instrumentation in O0 mode..

Revision Contents

Path

Size

clang/

test/

CodeGen/

pseudo-probe-emit.c

1 line

llvm/

lib/

Passes/

PassBuilder.cpp

7 lines

Diff 372606

clang/test/CodeGen/pseudo-probe-emit.c

				// RUN: %clang_cc1 -O0 -fno-legacy-pass-manager -fpseudo-probe-for-profiling -debug-info-kind=limited -emit-llvm -o - %s \| FileCheck %s
	// RUN: %clang_cc1 -O2 -fno-legacy-pass-manager -fpseudo-probe-for-profiling -debug-info-kind=limited -emit-llvm -o - %s \| FileCheck %s			// RUN: %clang_cc1 -O2 -fno-legacy-pass-manager -fpseudo-probe-for-profiling -debug-info-kind=limited -emit-llvm -o - %s \| FileCheck %s

	// Check the generation of pseudoprobe intrinsic call			// Check the generation of pseudoprobe intrinsic call

	void bar();			void bar();
	void go();			void go();

	void foo(int x) {			void foo(int x) {
	Show All 9 Lines

llvm/lib/Passes/PassBuilder.cpp

	Show First 20 Lines • Show All 1,918 Lines • ▼ Show 20 Lines

	ModulePassManager PassBuilder::buildO0DefaultPipeline(OptimizationLevel Level,			ModulePassManager PassBuilder::buildO0DefaultPipeline(OptimizationLevel Level,
	bool LTOPreLink) {			bool LTOPreLink) {
	assert(Level == OptimizationLevel::O0 &&			assert(Level == OptimizationLevel::O0 &&
	"buildO0DefaultPipeline should only be used with O0");			"buildO0DefaultPipeline should only be used with O0");

	ModulePassManager MPM;			ModulePassManager MPM;

				// Perform pseudo probe instrumentation in O0 mode. This is for the
				// consistency between different build modes. For example, a LTO build can be
				// mixed with an O0 prelink and an O2 postlink. Loading a sample profile in
				// the postlink will require pseudo probe instrumentation in the prelink.
				wenleiUnsubmitted Not Done Reply Inline Actions Loading a sample profile in the postlink will require pseudo probe instrumentation in the prelink. Even with this change, is it still possible that prelink compile for some module actually doesn't have `-fpseudo-probe-for-profiling`, and it's on for LTO postlink? We could contrive such case, and it could happen in reality too, right? Would we have the same problem when trying to load profile for functions from modules without pseudo-probe in prelink? wenlei: > Loading a sample profile in the postlink will require pseudo probe instrumentation in the…
				hoyAuthorUnsubmitted Done Reply Inline Actions Yes, it could happen. The compiler will stop working too with the current implementation. We could change the error reporting to be a warning to make that pass. It is an error because we want to remind user if that's intentional. I think it's mostly user's responsibility to be clear if pseudo probe instrumentation is needed or not, especially when passing linker flags separately. The change being made here is to ensure it works when all flags are passed via CXX_FLAGS, such as clang -flto 1.cpp -v -fuse-ld=lld -fpseudo-probe-for-profiling -fprofile-sample-use=.... hoy: Yes, it could happen. The compiler will stop working too with the current implementation. We…
				if (PGOOpt && PGOOpt->PseudoProbeForProfiling)
				MPM.addPass(SampleProfileProbePass(TM));

	if (PGOOpt && (PGOOpt->Action == PGOOptions::IRInstr \|\|			if (PGOOpt && (PGOOpt->Action == PGOOptions::IRInstr \|\|
	PGOOpt->Action == PGOOptions::IRUse))			PGOOpt->Action == PGOOptions::IRUse))
	addPGOInstrPassesForO0(			addPGOInstrPassesForO0(
	MPM,			MPM,
	/* RunProfileGen */ (PGOOpt->Action == PGOOptions::IRInstr),			/* RunProfileGen */ (PGOOpt->Action == PGOOptions::IRInstr),
	/* IsCS */ false, PGOOpt->ProfileFile, PGOOpt->ProfileRemappingFile);			/* IsCS */ false, PGOOpt->ProfileFile, PGOOpt->ProfileRemappingFile);

	for (auto &C : PipelineStartEPCallbacks)			for (auto &C : PipelineStartEPCallbacks)
	▲ Show 20 Lines • Show All 1,397 Lines • Show Last 20 Lines