This is an archive of the discontinued LLVM Phabricator instance.

Update the new PM pipeline to make ICP aware if it is SamplePGO build.
ClosedPublic

Authored by danielcdh on Jul 29 2017, 3:02 PM.

Download Raw Diff

Details

Reviewers

chandlerc
tejohnson
davidxl

Commits

rG89d322601932: Update the new PM pipeline to make ICP aware if it is SamplePGO build.
rL309780: Update the new PM pipeline to make ICP aware if it is SamplePGO build.

Summary

In ThinLTO backend compile, OPTOptions are not set so that the ICP in ThinLTO backend does not know if it is a SamplePGO build, in which profile count needs to be annotated directly on call instructions. This patch cleaned up the PGOOptions handling logic and passes down PGOOptions to ThinLTO backend.

Diff Detail

Build Status

Buildable 8739
Build 8739: arc lint + arc unit

Event Timeline

danielcdh created this revision.Jul 29 2017, 3:02 PM

Herald added subscribers: mehdi_amini, sanjoy. · View Herald TranscriptJul 29 2017, 3:02 PM

Harbormaster completed remote builds in B8734: Diff 108808.Jul 29 2017, 3:03 PM

chandlerc added inline comments.Jul 29 2017, 3:57 PM

lib/Passes/PassBuilder.cpp
775–777	This is a very significant change in the PGO instrumentation pass ordering. Before, we would do non-trivial cleanup of the IR module before adding instrumentation passes. Now they get added before any of that cleanup. This includes thing such as the first (and primary) SSA formation pass, lowering @llvm.expect intrinsics into branch weight metadata, dead argument elimination, global-opt, etc. Was all of that intended? Is it really OK to instrument before those cleanups?

Instr PGO has its own set of simplification passes, but still I think we should keep this patch NFC for instrPGO. If there are performance numbers that justify the InstrPGO pipeline changes, it can be discussed/done in a separate thread. This one should be left as SamplePGO only change.

danielcdh added inline comments.Jul 29 2017, 4:23 PM

lib/Passes/PassBuilder.cpp
775–777	There are still some optimizations invoked in addPGOInstrPasses before the actual instrumentation: FunctionPassManager FPM; FPM.addPass(SROA()); FPM.addPass(EarlyCSEPass()); // Catch trivial redundancies. FPM.addPass(SimplifyCFGPass()); // Merge & remove basic blocks. FPM.addPass(InstCombinePass()); // Combine silly sequences. invokePeepholeEPCallbacks(FPM, Level); CGPipeline.addPass(createCGSCCToFunctionPassAdaptor(std::move(FPM))); MPM.addPass(createModuleToPostOrderCGSCCPassAdaptor(std::move(CGPipeline))); If we invoke this in the original location, SROA/EarlyCSE/SimplifyCFG will be invoked twice before entering PGO instr/use pass, seems redundant to me. I've also tested the instrumentation PGO on our largest benchmark, and did not see performance change. I know these EarlyFPM were to match the behavior of the legacy PM. But maybe it should not belong to buildModuleSimplificationPipeline, which is called by both regular pipeline, prelink, and thinlto default pipeline. At least thinlto default pipeline does not need these passes?

In D36052#825318, @davidxl wrote:

Instr PGO has its own set of simplification passes, but still I think we should keep this patch NFC for instrPGO. If there are performance numbers that justify the InstrPGO pipeline changes, it can be discussed/done in a separate thread. This one should be left as SamplePGO only change.

The tricky part is that PGOOptions was not passed in the ThinLTO backend before this patch. But after this patch, it will be passed in. However, in buildModuleSimplificationPipeline, it cannot know if this is the ThinLTO backend. As a result, we cannot disable instrumentation/annotation pass in there.

The potential solution could be:

add another boolean value in buildModuleSimplificationPipeline to indicate it's ThinLTO backend
refactor the code from the beginning of buildModuleSimplificationPipeline to the PGOOpt handling to a separate function, and call it separately in the caller of buildModuleSimplificationPipeline.

Any suggestions on how to move forward?

Thanks,
Dehao

In D36052#825321, @danielcdh wrote:

In D36052#825318, @davidxl wrote:

Instr PGO has its own set of simplification passes, but still I think we should keep this patch NFC for instrPGO. If there are performance numbers that justify the InstrPGO pipeline changes, it can be discussed/done in a separate thread. This one should be left as SamplePGO only change.

FWIW, I do agree with you Dehao that we seem to be deing redundant simplification, but as David says here we shouldn't change that behavior in this patch.

The tricky part is that PGOOptions was not passed in the ThinLTO backend before this patch. But after this patch, it will be passed in. However, in buildModuleSimplificationPipeline, it cannot know if this is the ThinLTO backend. As a result, we cannot disable instrumentation/annotation pass in there.

The potential solution could be:

add another boolean value in buildModuleSimplificationPipeline to indicate it's ThinLTO backend

refactor the code from the beginning of buildModuleSimplificationPipeline to the PGOOpt handling to a separate function, and call it separately in the caller of buildModuleSimplificationPipeline.

I actually think the second option might be interesting, but before we do that I think understanding how much redundancy there really is in the cleanup passes is important. That will help inform how to refactor these into separate components, or if there even *are* separate components. So I largely agree w/ David that something like #1 is the best path forward.

However, rather than adding another boolean value though, I think you should turn the current boolean into an enumeration for ThinLTO phase; "none" by default, and then with beth prelink and postlink values.

Note that I don't think we should call the phase "backend" or "codegen". Those are heavily overloaded in LLVM, so I think pre/post-link is a better terminology to use.

-Chandler

In D36052#825325, @chandlerc wrote:

In D36052#825321, @danielcdh wrote:

In D36052#825318, @davidxl wrote:

Instr PGO has its own set of simplification passes, but still I think we should keep this patch NFC for instrPGO. If there are performance numbers that justify the InstrPGO pipeline changes, it can be discussed/done in a separate thread. This one should be left as SamplePGO only change.

FWIW, I do agree with you Dehao that we seem to be deing redundant simplification, but as David says here we shouldn't change that behavior in this patch.

The tricky part is that PGOOptions was not passed in the ThinLTO backend before this patch. But after this patch, it will be passed in. However, in buildModuleSimplificationPipeline, it cannot know if this is the ThinLTO backend. As a result, we cannot disable instrumentation/annotation pass in there.

The potential solution could be:

add another boolean value in buildModuleSimplificationPipeline to indicate it's ThinLTO backend

refactor the code from the beginning of buildModuleSimplificationPipeline to the PGOOpt handling to a separate function, and call it separately in the caller of buildModuleSimplificationPipeline.

I actually think the second option might be interesting, but before we do that I think understanding how much redundancy there really is in the cleanup passes is important. That will help inform how to refactor these into separate components, or if there even *are* separate components. So I largely agree w/ David that something like #1 is the best path forward.

However, rather than adding another boolean value though, I think you should turn the current boolean into an enumeration for ThinLTO phase; "none" by default, and then with beth prelink and postlink values.

Note that I don't think we should call the phase "backend" or "codegen". Those are heavily overloaded in LLVM, so I think pre/post-link is a better terminology to use.

Thanks Chandler and David for the suggestion.

I've sent a separate patch https://reviews.llvm.org/D36053 to refactor the code. The patch is NFC, but I'd like to ask for your opinions on the naming/description to see if it's concise. Thanks in advance for comments.

rebase and make instrumentation PGO behavior unchanged.

Harbormaster completed remote builds in B8739: Diff 108816.Jul 29 2017, 10:32 PM

Looks awesome, LGTM.

This does move ICP to pre-cleanup passes for non-ThinLTO builds with sample PGO. I assume you've tested it and this is fine. (It also moves the sample profile loading, but that one seems completely innocuous to me.)

This revision is now accepted and ready to land.Jul 29 2017, 11:23 PM

In D36052#825377, @chandlerc wrote:

Looks awesome, LGTM.

This does move ICP to pre-cleanup passes for non-ThinLTO builds with sample PGO. I assume you've tested it and this is fine. (It also moves the sample profile loading, but that one seems completely innocuous to me.)

Yes, verified that this does not affect non-ThinLTO performance. commit the patch now.

danielcdh closed this revision.Aug 1 2017, 6:29 PM

Revision Contents

Path

Size

lib/

LTO/

LTOBackend.cpp

12 lines

Passes/

PassBuilder.cpp

46 lines

test/

LTO/

Resolution/

X86/

Inputs/

load-sample-prof-icp.prof

2 lines

load-sample-prof-icp.ll

46 lines

Diff 108816

lib/LTO/LTOBackend.cpp

Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	else
RelocModel =		RelocModel =
M.getPICLevel() == PICLevel::NotPIC ? Reloc::Static : Reloc::PIC_;		M.getPICLevel() == PICLevel::NotPIC ? Reloc::Static : Reloc::PIC_;

return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(		return std::unique_ptr<TargetMachine>(TheTarget->createTargetMachine(
TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,		TheTriple, Conf.CPU, Features.getString(), Conf.Options, RelocModel,
Conf.CodeModel, Conf.CGOptLevel));		Conf.CodeModel, Conf.CGOptLevel));
}		}

static void runNewPMPasses(Module &Mod, TargetMachine *TM, unsigned OptLevel,		static void runNewPMPasses(Config &Conf, Module &Mod, TargetMachine *TM,
bool IsThinLTO) {		unsigned OptLevel, bool IsThinLTO) {
PassBuilder PB(TM);		Optional<PGOOptions> PGOOpt;
		if (!Conf.SampleProfile.empty())
		PGOOpt = PGOOptions("", "", Conf.SampleProfile, false, true);

		PassBuilder PB(TM, PGOOpt);
AAManager AA;		AAManager AA;

// Parse a custom AA pipeline if asked to.		// Parse a custom AA pipeline if asked to.
assert(PB.parseAAPipeline(AA, "default"));		assert(PB.parseAAPipeline(AA, "default"));

LoopAnalysisManager LAM;		LoopAnalysisManager LAM;
FunctionAnalysisManager FAM;		FunctionAnalysisManager FAM;
CGSCCAnalysisManager CGAM;		CGSCCAnalysisManager CGAM;
▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines
bool opt(Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,		bool opt(Config &Conf, TargetMachine *TM, unsigned Task, Module &Mod,
bool IsThinLTO, ModuleSummaryIndex *ExportSummary,		bool IsThinLTO, ModuleSummaryIndex *ExportSummary,
const ModuleSummaryIndex *ImportSummary) {		const ModuleSummaryIndex *ImportSummary) {
// FIXME: Plumb the combined index into the new pass manager.		// FIXME: Plumb the combined index into the new pass manager.
if (!Conf.OptPipeline.empty())		if (!Conf.OptPipeline.empty())
runNewPMCustomPasses(Mod, TM, Conf.OptPipeline, Conf.AAPipeline,		runNewPMCustomPasses(Mod, TM, Conf.OptPipeline, Conf.AAPipeline,
Conf.DisableVerify);		Conf.DisableVerify);
else if (Conf.UseNewPM)		else if (Conf.UseNewPM)
runNewPMPasses(Mod, TM, Conf.OptLevel, IsThinLTO);		runNewPMPasses(Conf, Mod, TM, Conf.OptLevel, IsThinLTO);
else		else
runOldPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);		runOldPMPasses(Conf, Mod, TM, IsThinLTO, ExportSummary, ImportSummary);
return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);		return !Conf.PostOptModuleHook \|\| Conf.PostOptModuleHook(Task, Mod);
}		}

void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,		void codegen(Config &Conf, TargetMachine *TM, AddStreamFn AddStream,
unsigned Task, Module &Mod) {		unsigned Task, Module &Mod) {
if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))		if (Conf.PreCodeGenModuleHook && !Conf.PreCodeGenModuleHook(Task, Mod))
▲ Show 20 Lines • Show All 174 Lines • Show Last 20 Lines

lib/Passes/PassBuilder.cpp

Show First 20 Lines • Show All 565 Lines • ▼ Show 20 Lines	PassBuilder::buildModuleSimplificationPipeline(OptimizationLevel Level,
// optimizations.		// optimizations.
FunctionPassManager GlobalCleanupPM(DebugLogging);		FunctionPassManager GlobalCleanupPM(DebugLogging);
GlobalCleanupPM.addPass(InstCombinePass());		GlobalCleanupPM.addPass(InstCombinePass());
invokePeepholeEPCallbacks(GlobalCleanupPM, Level);		invokePeepholeEPCallbacks(GlobalCleanupPM, Level);

GlobalCleanupPM.addPass(SimplifyCFGPass());		GlobalCleanupPM.addPass(SimplifyCFGPass());
MPM.addPass(createModuleToFunctionPassAdaptor(std::move(GlobalCleanupPM)));		MPM.addPass(createModuleToFunctionPassAdaptor(std::move(GlobalCleanupPM)));

// Add all the requested passes for PGO, if requested.		// Add all the requested passes for instrumentation PGO, if requested.
if (PGOOpt) {		if (PGOOpt && Phase != ThinLTOPhase::PostLink &&
if (!PGOOpt->ProfileGenFile.empty() \|\| !PGOOpt->ProfileUseFile.empty())		(!PGOOpt->ProfileGenFile.empty() \|\| !PGOOpt->ProfileUseFile.empty())) {
// Instrumentation based PGO (gen and use)
addPGOInstrPasses(MPM, DebugLogging, Level, PGOOpt->RunProfileGen,		addPGOInstrPasses(MPM, DebugLogging, Level, PGOOpt->RunProfileGen,
PGOOpt->ProfileGenFile, PGOOpt->ProfileUseFile);		PGOOpt->ProfileGenFile, PGOOpt->ProfileUseFile);
else if (!PGOOpt->SampleProfileFile.empty())		MPM.addPass(PGOIndirectCallPromotion(false, false));
// SamplePGO use
MPM.addPass(SampleProfileLoaderPass(PGOOpt->SampleProfileFile));

// Indirect call promotion that promotes intra-module targes only.
// Do not enable it in PreLinkThinLTO phase during sample PGO because
// it changes IR to makes profile annotation in back compile inaccurate.
if ((Phase != ThinLTOPhase::PreLink && !PGOOpt->SampleProfileFile.empty())
\|\| !PGOOpt->ProfileUseFile.empty())
MPM.addPass(PGOIndirectCallPromotion(
false, PGOOpt && !PGOOpt->SampleProfileFile.empty()));
}		}

// Require the GlobalsAA analysis for the module so we can query it within		// Require the GlobalsAA analysis for the module so we can query it within
// the CGSCC pipeline.		// the CGSCC pipeline.
MPM.addPass(RequireAnalysisPass<GlobalsAA, Module>());		MPM.addPass(RequireAnalysisPass<GlobalsAA, Module>());

// Require the ProfileSummaryAnalysis for the module so we can query it within		// Require the ProfileSummaryAnalysis for the module so we can query it within
// the inliner pass.		// the inliner pass.
▲ Show 20 Lines • Show All 175 Lines • ▼ Show 20 Lines	PassBuilder::buildPerModuleDefaultPipeline(OptimizationLevel Level,
bool DebugLogging) {		bool DebugLogging) {
assert(Level != O0 && "Must request optimizations for the default pipeline!");		assert(Level != O0 && "Must request optimizations for the default pipeline!");

ModulePassManager MPM(DebugLogging);		ModulePassManager MPM(DebugLogging);

// Force any function attributes we want the rest of the pipeline to observe.		// Force any function attributes we want the rest of the pipeline to observe.
MPM.addPass(ForceFunctionAttrsPass());		MPM.addPass(ForceFunctionAttrsPass());

if (PGOOpt && PGOOpt->SamplePGOSupport)		if (PGOOpt && PGOOpt->SamplePGOSupport) {
MPM.addPass(createModuleToFunctionPassAdaptor(AddDiscriminatorsPass()));		MPM.addPass(createModuleToFunctionPassAdaptor(AddDiscriminatorsPass()));
		if (!PGOOpt->SampleProfileFile.empty()) {
		MPM.addPass(SampleProfileLoaderPass(PGOOpt->SampleProfileFile));
		MPM.addPass(PGOIndirectCallPromotion(false, true));
		}
		}
		chandlercUnsubmitted Not Done Reply Inline Actions This is a very significant change in the PGO instrumentation pass ordering. Before, we would do non-trivial cleanup of the IR module before adding instrumentation passes. Now they get added before any of that cleanup. This includes thing such as the first (and primary) SSA formation pass, lowering @llvm.expect intrinsics into branch weight metadata, dead argument elimination, global-opt, etc. Was all of that intended? Is it really OK to instrument before those cleanups? chandlerc: This is a very significant change in the PGO instrumentation pass ordering. Before, we would…
		danielcdhAuthorUnsubmitted Not Done Reply Inline Actions There are still some optimizations invoked in addPGOInstrPasses before the actual instrumentation: FunctionPassManager FPM; FPM.addPass(SROA()); FPM.addPass(EarlyCSEPass()); // Catch trivial redundancies. FPM.addPass(SimplifyCFGPass()); // Merge & remove basic blocks. FPM.addPass(InstCombinePass()); // Combine silly sequences. invokePeepholeEPCallbacks(FPM, Level); CGPipeline.addPass(createCGSCCToFunctionPassAdaptor(std::move(FPM))); MPM.addPass(createModuleToPostOrderCGSCCPassAdaptor(std::move(CGPipeline))); If we invoke this in the original location, SROA/EarlyCSE/SimplifyCFG will be invoked twice before entering PGO instr/use pass, seems redundant to me. I've also tested the instrumentation PGO on our largest benchmark, and did not see performance change. I know these EarlyFPM were to match the behavior of the legacy PM. But maybe it should not belong to buildModuleSimplificationPipeline, which is called by both regular pipeline, prelink, and thinlto default pipeline. At least thinlto default pipeline does not need these passes? danielcdh: There are still some optimizations invoked in addPGOInstrPasses before the actual…

// Add the core simplification pipeline.		// Add the core simplification pipeline.
MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::None,		MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::None,
DebugLogging));		DebugLogging));

// Now add the optimization pipeline.		// Now add the optimization pipeline.
MPM.addPass(buildModuleOptimizationPipeline(Level, DebugLogging));		MPM.addPass(buildModuleOptimizationPipeline(Level, DebugLogging));

return MPM;		return MPM;
}		}

ModulePassManager		ModulePassManager
PassBuilder::buildThinLTOPreLinkDefaultPipeline(OptimizationLevel Level,		PassBuilder::buildThinLTOPreLinkDefaultPipeline(OptimizationLevel Level,
bool DebugLogging) {		bool DebugLogging) {
assert(Level != O0 && "Must request optimizations for the default pipeline!");		assert(Level != O0 && "Must request optimizations for the default pipeline!");

ModulePassManager MPM(DebugLogging);		ModulePassManager MPM(DebugLogging);

// Force any function attributes we want the rest of the pipeline to observe.		// Force any function attributes we want the rest of the pipeline to observe.
MPM.addPass(ForceFunctionAttrsPass());		MPM.addPass(ForceFunctionAttrsPass());

if (PGOOpt && PGOOpt->SamplePGOSupport)		// Invoke the SamplePGO annotation pass for the first time to annotate
		// profile for functions in the current module to give ThinLink info
		// about module grouping.
		if (PGOOpt && PGOOpt->SamplePGOSupport) {
MPM.addPass(createModuleToFunctionPassAdaptor(AddDiscriminatorsPass()));		MPM.addPass(createModuleToFunctionPassAdaptor(AddDiscriminatorsPass()));
		if (!PGOOpt->ProfileUseFile.empty())
		MPM.addPass(SampleProfileLoaderPass(PGOOpt->SampleProfileFile));
		}

// If we are planning to perform ThinLTO later, we don't bloat the code with		// If we are planning to perform ThinLTO later, we don't bloat the code with
// unrolling/vectorization/... now. Just simplify the module as much as we		// unrolling/vectorization/... now. Just simplify the module as much as we
// can.		// can.
MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::PreLink,		MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::PreLink,
DebugLogging));		DebugLogging));

// Run partial inlining pass to partially inline functions that have		// Run partial inlining pass to partially inline functions that have
Show All 19 Lines	PassBuilder::buildThinLTODefaultPipeline(OptimizationLevel Level,
// When it's going to be hooked, enable WholeProgramDevirt and LowerTypeTest		// When it's going to be hooked, enable WholeProgramDevirt and LowerTypeTest
// here.		// here.

ModulePassManager MPM(DebugLogging);		ModulePassManager MPM(DebugLogging);

// Force any function attributes we want the rest of the pipeline to observe.		// Force any function attributes we want the rest of the pipeline to observe.
MPM.addPass(ForceFunctionAttrsPass());		MPM.addPass(ForceFunctionAttrsPass());

		// Invoke the SamplePGO annotation pass for the second time to annotate on
		// functions imported from other modules.
		if (PGOOpt && !PGOOpt->SampleProfileFile.empty())
		MPM.addPass(SampleProfileLoaderPass(PGOOpt->SampleProfileFile));

// During the ThinLTO backend phase we perform early indirect call promotion		// During the ThinLTO backend phase we perform early indirect call promotion
// here, before globalopt. Otherwise imported available_externally functions		// here, before globalopt. Otherwise imported available_externally functions
// look unreferenced and are removed.		// look unreferenced and are removed.
MPM.addPass(PGOIndirectCallPromotion(		MPM.addPass(PGOIndirectCallPromotion(
true /* InLTO */, PGOOpt && !PGOOpt->SampleProfileFile.empty() &&		true /* InLTO */, PGOOpt && !PGOOpt->SampleProfileFile.empty()));
!PGOOpt->ProfileUseFile.empty()));

// Add the core simplification pipeline.		// Add the core simplification pipeline.
MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::PostLink,		MPM.addPass(buildModuleSimplificationPipeline(Level, ThinLTOPhase::PostLink,
DebugLogging));		DebugLogging));

// Now add the optimization pipeline.		// Now add the optimization pipeline.
MPM.addPass(buildModuleOptimizationPipeline(Level, DebugLogging));		MPM.addPass(buildModuleOptimizationPipeline(Level, DebugLogging));

▲ Show 20 Lines • Show All 908 Lines • Show Last 20 Lines

test/LTO/Resolution/X86/Inputs/load-sample-prof-icp.prof

This file was added.

				test:1000:0
				1: 1000 bar:1000

test/LTO/Resolution/X86/load-sample-prof-icp.ll

This file was added.

				; Test that llvm-lto2 handles the -lto-sample-profile-file option and pass
				; down to the ICP correctly.
				;
				; RUN: opt -module-summary < %s -o %t.bc
				; RUN: llvm-lto2 run -o %t.out %t.bc -save-temps \
				; RUN: -r %t.bc,test,px -r %t.bc,bar,x \
				; RUN: -lto-sample-profile-file=%S/Inputs/load-sample-prof-icp.prof
				; RUN: llvm-dis %t.out.0.4.opt.bc -o - \| FileCheck %s
				; RUN: llvm-lto2 run -o %t.out %t.bc -save-temps \
				; RUN: -r %t.bc,test,px -r %t.bc,bar,x -use-new-pm \
				; RUN: -lto-sample-profile-file=%S/Inputs/load-sample-prof-icp.prof
				; RUN: llvm-dis %t.out.0.4.opt.bc -o - \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; CHECK-LABEL: @test
				; Checks that the call instruction is promoted to direct call and has
				; profile count annotated on the direct call.
				define void @test(void ()*) !dbg !7 {
				%2 = alloca void ()*
				store void ()* %0, void ()** %2
				%3 = load void (), void ()* %2
				; CHECK: call void @bar(),{{.*}}!prof
				call void %3(), !dbg !10
				ret void
				}

				declare void @bar() local_unnamed_addr

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4, !5}
				!llvm.ident = !{!6}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "clang version 6.0.0 ", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2)
				!1 = !DIFile(filename: "test.c", directory: "/tmp")
				!2 = !{}
				!3 = !{i32 2, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!5 = !{i32 1, !"wchar_size", i32 4}
				!6 = !{!"clang version 6.0.0 "}
				!7 = distinct !DISubprogram(name: "test", scope: !1, file: !1, line: 3, type: !8, isLocal: false, isDefinition: true, scopeLine: 3, isOptimized: true, unit: !0, variables: !2)
				!8 = !DISubroutineType(types: !9)
				!9 = !{null}
				!10 = !DILocation(line: 4, column: 5, scope: !7)
				!11 = !DILocation(line: 5, column: 1, scope: !7)