This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/tools/clang-linker-wrapper/
-
tools/
-
clang-linker-wrapper/
-
ClangLinkerWrapper.cpp
-
llvm/
-
include/llvm/LTO/
-
llvm/
-
LTO/
-
Config.h
-
lib/LTO/
-
LTO/
-
LTOBackend.cpp

Differential D122133

[LTO] Add configuartion option to use default optimization pipeline
ClosedPublic

Authored by jhuber6 on Mar 21 2022, 6:27 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
tianshilei1992
JonChesterfield
pcc

Commits

rG5856f30b5ae0: [LTO] Add configuartion option to use default optimization pipeline

Summary

This patch adds a configuration option to simply use the default pass
pipeline in favor of the LTO-specific one. We observed some severe
performance penalties when uding device-side LTO for OpenMP offloading
applications caused by the LTO-pass pipeline. This is primarily because
OpenMP uses an LLVM bitcode library to implement a GPU runtime library.
In a standard compilation we link this bitcode library into each source
file and optimize it with the default pipeline. When performing LTO we
link it late with all the files, but the bitcode library never has the
regular optimization pipeline applied to it so we miss a few
optimizations just using the LTO pipeline to optimize it.

I'm not committed to this solution, but it's the easiest method to solve
this performance regression when using LTO without changing the
optimizatin pipeline for other users.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Mar 21 2022, 6:27 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 21 2022, 6:27 AM

Herald added subscribers: ormris, steven_wu, hiraditya, inglorion. · View Herald Transcript

jhuber6 requested review of this revision.Mar 21 2022, 6:27 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptMar 21 2022, 6:27 AM

Herald added subscribers: llvm-commits, cfe-commits, sstefan1. · View Herald Transcript

Harbormaster completed remote builds in B155368: Diff 416913.Mar 21 2022, 7:00 AM

I tested it locally, and it fixes the performance regression. Since it is internal flag, it will not break any existing code. I'm okay with the change. Not sure if we want to ask the gate keeper of LTOBackend.

This revision is now accepted and ready to land.Mar 21 2022, 6:59 PM

jhuber6 added a reviewer: pcc.Mar 21 2022, 7:55 PM

This revision was landed with ongoing or failed builds.Mar 22 2022, 6:28 AM

Closed by commit rG5856f30b5ae0: [LTO] Add configuartion option to use default optimization pipeline (authored by jhuber6). · Explain Why

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG5856f30b5ae0: [LTO] Add configuartion option to use default optimization pipeline.

Revision Contents

Path

Size

clang/

tools/

clang-linker-wrapper/

ClangLinkerWrapper.cpp

1 line

llvm/

include/

llvm/

LTO/

Config.h

3 lines

lib/

LTO/

LTOBackend.cpp

2 lines

Diff 417276

clang/tools/clang-linker-wrapper/ClangLinkerWrapper.cpp

	Show First 20 Lines • Show All 854 Lines • ▼ Show 20 Lines
	std::unique_ptr<lto::LTO> createLTO(			std::unique_ptr<lto::LTO> createLTO(
	const Triple &TheTriple, StringRef Arch, bool WholeProgram,			const Triple &TheTriple, StringRef Arch, bool WholeProgram,
	ModuleHook Hook = [](size_t, const Module &) { return true; }) {			ModuleHook Hook = [](size_t, const Module &) { return true; }) {
	lto::Config Conf;			lto::Config Conf;
	lto::ThinBackend Backend;			lto::ThinBackend Backend;
	// TODO: Handle index-only thin-LTO			// TODO: Handle index-only thin-LTO
	Backend = lto::createInProcessThinBackend(			Backend = lto::createInProcessThinBackend(
	llvm::heavyweight_hardware_concurrency(1));			llvm::heavyweight_hardware_concurrency(1));
				Conf.UseDefaultPipeline = true;

	Conf.CPU = Arch.str();			Conf.CPU = Arch.str();
	Conf.Options = codegen::InitTargetOptionsFromCodeGenFlags(TheTriple);			Conf.Options = codegen::InitTargetOptionsFromCodeGenFlags(TheTriple);

	Conf.MAttrs = getTargetFeatures(TheTriple);			Conf.MAttrs = getTargetFeatures(TheTriple);
	Conf.CGOptLevel = getCGOptLevel(OptLevel[1] - '0');			Conf.CGOptLevel = getCGOptLevel(OptLevel[1] - '0');
	Conf.OptLevel = OptLevel[1] - '0';			Conf.OptLevel = OptLevel[1] - '0';
	Conf.DefaultTriple = TheTriple.getTriple();			Conf.DefaultTriple = TheTriple.getTriple();
	▲ Show 20 Lines • Show All 468 Lines • Show Last 20 Lines

llvm/include/llvm/LTO/Config.h

Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	struct Config {
CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;		CodeGenOpt::Level CGOptLevel = CodeGenOpt::Default;
CodeGenFileType CGFileType = CGFT_ObjectFile;		CodeGenFileType CGFileType = CGFT_ObjectFile;
unsigned OptLevel = 2;		unsigned OptLevel = 2;
bool DisableVerify = false;		bool DisableVerify = false;

/// Use the new pass manager		/// Use the new pass manager
bool UseNewPM = LLVM_ENABLE_NEW_PASS_MANAGER;		bool UseNewPM = LLVM_ENABLE_NEW_PASS_MANAGER;

		/// Use the standard optimization pipeline.
		bool UseDefaultPipeline = false;

/// Flag to indicate that the optimizer should not assume builtins are present		/// Flag to indicate that the optimizer should not assume builtins are present
/// on the target.		/// on the target.
bool Freestanding = false;		bool Freestanding = false;

/// Disable entirely the optimizer, including importing for ThinLTO		/// Disable entirely the optimizer, including importing for ThinLTO
bool CodeGenOnly = false;		bool CodeGenOnly = false;

/// Run PGO context sensitive IR instrumentation.		/// Run PGO context sensitive IR instrumentation.
▲ Show 20 Lines • Show All 228 Lines • Show Last 20 Lines

llvm/lib/LTO/LTOBackend.cpp

Show First 20 Lines • Show All 292 Lines • ▼ Show 20 Lines	static void runNewPMPasses(const Config &Conf, Module &Mod, TargetMachine *TM,
}		}

// Parse a custom pipeline if asked to.		// Parse a custom pipeline if asked to.
if (!Conf.OptPipeline.empty()) {		if (!Conf.OptPipeline.empty()) {
if (auto Err = PB.parsePassPipeline(MPM, Conf.OptPipeline)) {		if (auto Err = PB.parsePassPipeline(MPM, Conf.OptPipeline)) {
report_fatal_error(Twine("unable to parse pass pipeline description '") +		report_fatal_error(Twine("unable to parse pass pipeline description '") +
Conf.OptPipeline + "': " + toString(std::move(Err)));		Conf.OptPipeline + "': " + toString(std::move(Err)));
}		}
		} else if (Conf.UseDefaultPipeline) {
		MPM.addPass(PB.buildPerModuleDefaultPipeline(OL));
} else if (IsThinLTO) {		} else if (IsThinLTO) {
MPM.addPass(PB.buildThinLTODefaultPipeline(OL, ImportSummary));		MPM.addPass(PB.buildThinLTODefaultPipeline(OL, ImportSummary));
} else {		} else {
MPM.addPass(PB.buildLTODefaultPipeline(OL, ExportSummary));		MPM.addPass(PB.buildLTODefaultPipeline(OL, ExportSummary));
}		}

if (!Conf.DisableVerify)		if (!Conf.DisableVerify)
MPM.addPass(VerifierPass());		MPM.addPass(VerifierPass());
▲ Show 20 Lines • Show All 413 Lines • Show Last 20 Lines