This is an archive of the discontinued LLVM Phabricator instance.

Differential D71027

Support repeated machine outlining
Closed Public

Authored by jinlin on Dec 4 2019, 10:29 AM.

Details

Reviewers

aschwaighofer
tellenbach
paquette

Commits

rG0d896278c81c: Support repeated machine outlining
rGab2dcff309f9: Support repeated machine outlining
rG1f93b162fc6b: Support repeated machine outlining

Summary

This change allows machine outlining to be applied N times, where N is specified by a compiler option. By default, N is 1. The motivation is that repeated machine outlining can further reduce code size. Please refer to the presentation "Improving Swift Binary Size via Link Time Optimization" from the 2019 LLVM Developers' Meeting.

Diff Detail

Event Timeline

jinlin created this revision. Dec 4 2019, 10:29 AM

Herald added a project: Restricted Project. Dec 4 2019, 10:30 AM

Herald added subscribers: llvm-commits, hiraditya.

tellenbach added a reviewer: tellenbach. Dec 5 2019, 6:16 PM

tellenbach added a subscriber: tellenbach.

Hi,

D69446 seems to introduce something similar. Are these patches related or is this just coincidence?

tellenbach added a reviewer: paquette. Dec 5 2019, 6:23 PM

In D71027#1772074, @tellenbach wrote:

Hi,

D69446 seems to introduce something similar. Are these patches related or is this just coincidence?

Thank you for pointing this out. It could be a coincidence. I presented this at the LLVM Developers' Meeting on 10/22/2019. I didn't realize someone had submitted D69446 on 10/25/2019.

In D71027, I have also fixed incorrect logic in maintaining the side-effect information for outlined functions at line 1295. This bug shows up during repeated machine outlining.

Please let me know if you have any questions.

Okay, thanks for clarifying.

In D71027#1772142, @jinlin wrote:

In D71027, I have also fixed incorrect logic in maintaining the side-effect information for outlined functions at line 1295. This bug shows up during repeated machine outlining.

Would it be possible to separate both patches? The fix can then be reviewed independently from repeated outlining itself.

tellenbach mentioned this in D69446: [llvm][MachineOutliner] Add support for repeating machine outliner N times. Dec 9 2019, 4:59 AM

jinlin updated this revision to Diff 232876. Dec 9 2019, 9:31 AM

In D71027#1774949, @tellenbach wrote:

Okay, thanks for clarifying.

In D71027#1772142, @jinlin wrote:

In D71027, I have also fixed incorrect logic in maintaining the side-effect information for outlined functions at line 1295. This bug shows up during repeated machine outlining.

Would it be possible to separate both patches? The fix can then be reviewed independently from repeated outlining itself.

Sure. I have updated the patch. Please note that the repeating machine outliner may not work correctly without fixing the logic at line 1265.

jinlin updated this revision to Diff 232899. Dec 9 2019, 10:58 AM

tellenbach mentioned this in D71217: Fix incorrect logic in maintaining the side-effect of compiler generated outliner functions. Dec 9 2019, 1:11 PM

In D71027#1775515, @jinlin wrote:

Sure. I have updated the patch. Please note that the repeating machine outliner may not work correctly without fixing the logic at line 1265.

Thanks! I've commented on D69446 to see if the revision is still active. So I suggest giving this some more time.

In D71027#1775926, @tellenbach wrote:

In D71027#1775515, @jinlin wrote:

Sure. I have updated the patch. Please note that the repeating machine outliner may not work correctly without fixing the logic at line 1265.

Thanks! I've commented on D69446 to see if the revision is still active. So I suggest giving this some more time.

Is there any update for D69446? I am writing the test case for D71217. The changes in D71027 have to be checked in first. Thanks.

In D71027#1862192, @jinlin wrote:

Is there any update for D69446? I am writing the test case for D71217. The changes in D71027 have to be checked in first. Thanks.

Unfortunately, no update so far. If it's okay with the other reviewers (@paquette @aschwaighofer), I suggest reviewing @jinlin's patches.

Update the test from IR file to MIR file.

Simplified the test case.

paquette added inline comments. Mar 10 2020, 2:42 PM

llvm/lib/CodeGen/MachineOutliner.cpp
1510–1512	I think that allowing the number of repeats to be 0 is unintuitive. Would it be possible to output an error when the number of repeats is less than 1 instead of silently changing it?
1515–1517	Can you add some debug output here that says why the outliner stopped running? E.g. "Stopped outlining because `I >= NumRepeats`" "Stopped outlining at iteration `I` because no changes were found" This debug output could then be used in a separate testcase showing that repeated outlining terminates as expected.
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
66–72	The IR instructions inside each of the IR functions here should not be necessary.

jinlin updated this revision to Diff 249551. Mar 10 2020, 10:07 PM

jinlin marked 6 inline comments as done.

jinlin added inline comments. Mar 11 2020, 8:32 AM

llvm/lib/CodeGen/MachineOutliner.cpp
1510–1512	Done.
1515–1517	Done.
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
66–72	Please let me know how to make it work. Initially I used those instructions to generate the first iteration machine outlining functions. If I simply remove them, it will not work.

jinlin updated this revision to Diff 249651. Mar 11 2020, 9:14 AM

jinlin updated this revision to Diff 249654. Mar 11 2020, 9:27 AM

jinlin marked an inline comment as done.

jinlin added inline comments.

llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
66–72	Now I understand what you mean. I have updated the test based on your suggestions. Thank you.

tellenbach added inline comments. Mar 11 2020, 4:46 PM

llvm/lib/CodeGen/MachineOutliner.cpp
102	Could we rename this to something like `machine-outline-repeat-counts` or `machine-outline-runs`? The term `outlining` itself is somewhat ambiguous and the prefix `machine` would be more consistent.
103	Same here: I suggest something like ...to apply machine outlining

tellenbach added inline comments. Mar 11 2020, 4:56 PM

llvm/lib/CodeGen/MachineOutliner.cpp
1510	Why is this indirection necessary?
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
2	Could you run this test with a higher number for `-outline-repeat-count` to verify that the outlining stops when no changes are found and nothing changes (besides the name of the outlined functions)?

jinlin updated this revision to Diff 249831. Mar 11 2020, 10:12 PM

jinlin marked 3 inline comments as done.

jinlin added inline comments.

llvm/lib/CodeGen/MachineOutliner.cpp
102	Done.
103	Done.
1510	I have removed this extra variable.

jinlin marked an inline comment as done. Mar 11 2020, 10:16 PM

jinlin added inline comments.

llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
2	I have updated the test case to verify that no further machine outlining is performed in iterations beyond the second.

LGTM, but let's wait a bit for other comments.

paquette added inline comments. Mar 13 2020, 12:08 PM

llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
2–3	I think that it would make sense to use different check prefixes for each level of outlining. That would make it easier to show explicitly what happens when you add extra iterations.
# RUN: llc -mtriple=aarch64--- -run-pass=prologepilog -run-pass=machine-outliner -machine-outline-runs=2 -verify-machineinstrs %s -o - | FileCheck %s --check-prefix TWO-RUNS
# RUN: llc -mtriple=aarch64--- -run-pass=prologepilog -run-pass=machine-outliner -machine-outline-runs=4 -verify-machineinstrs %s -o - | FileCheck %s --check-prefix FOUR-RUNS
Also you shouldn't need `--run-pass=prologepilog` here.
114–116	Can you check which instructions are in these functions?

jinlin updated this revision to Diff 250310. Mar 13 2020, 2:46 PM

jinlin marked 2 inline comments as done.

jinlin added inline comments.

llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
2–3	Done.
114–116	Done.

jinlin updated this revision to Diff 250311. Mar 13 2020, 2:54 PM

Thanks for updating this!

I have a couple nits, but I think this is pretty close to being ready. :)

llvm/lib/CodeGen/MachineOutliner.cpp
1512	Grammar nit: "Expect NumRepeat for machine outlining to be greater than or equal to 1"
1516	Nit: pull `I` into the loop: for (unsigned I = 0; I < NumRepeat; I++) { // ... }
1518–1519	Can we have a small testcase showing that this happens?
1524–1525	Can we have a small testcase showing that this happens?
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
140–141	Can you add a comment explaining why these functions are not expected?

jinlin updated this revision to Diff 250658. Mar 16 2020, 5:15 PM

jinlin marked 2 inline comments as done.

jinlin added inline comments.

llvm/lib/CodeGen/MachineOutliner.cpp
1512	Done.
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir
140–141	Done.

jinlin marked 3 inline comments as done. Mar 16 2020, 5:20 PM

jinlin added inline comments.

llvm/lib/CodeGen/MachineOutliner.cpp
1516	Done.
1518–1519	The prefix FOUR-RUN shows that the machine outliner stops at the 3rd iteration since no changes were found.
1524–1525	The prefix ONE-RUN shows that the machine outliner stops at the first iteration since NumRepeat is 1.
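
The two termination cases discussed in these inline comments can be illustrated with a minimal, standalone C++ sketch. It deliberately does not use LLVM: `RepeatResult`, `runRepeatedly`, and the `runOnce` callback are hypothetical names standing in for the pass's repeat loop, and the thrown exception is only an assumed stand-in for whatever error reporting the patch uses when the repeat count is less than 1.

```cpp
#include <functional>
#include <stdexcept>
#include <string>

// Hypothetical result type: records whether any run changed something,
// how many runs were attempted, and why the loop stopped.
struct RepeatResult {
  bool changed = false;
  unsigned iterationsRun = 0;
  std::string stopReason;
};

// Calls runOnce(I) for I = 0 .. numRepeats - 1 and stops early as soon as a
// run reports that it changed nothing.
RepeatResult runRepeatedly(unsigned numRepeats,
                           const std::function<bool(unsigned)> &runOnce) {
  // Reject a repeat count below 1 instead of silently adjusting it.
  if (numRepeats < 1)
    throw std::invalid_argument("Expect NumRepeat for machine outlining to be "
                                "greater than or equal to 1");

  RepeatResult result;
  for (unsigned i = 0; i < numRepeats; ++i) {
    ++result.iterationsRun;
    if (!runOnce(i)) {
      // Early exit: this run found nothing new to outline.
      result.stopReason = "Stopped outlining at iteration " +
                          std::to_string(i + 1) +
                          " because no changes were found";
      return result;
    }
    result.changed = true;
  }
  // Normal exit: every requested run made changes.
  result.stopReason = "Stopped outlining because the repeat limit of " +
                      std::to_string(numRepeats) + " was reached";
  return result;
}
```

With a callback that only reports changes for the first two runs, requesting four runs stops at the third one, which mirrors the FOUR-RUN case; requesting a single run stops because the limit is reached, which mirrors the ONE-RUN case.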

LGTM!

This revision is now accepted and ready to land. Mar 16 2020, 5:51 PM

In D71027#1925639, @paquette wrote:

LGTM!

Thank you Jessica and David for reviewing my changes. Appreciate your excellent suggestions.

Closed by commit rG1f93b162fc6b: Support repeated machine outlining (authored by jinlin). Mar 17 2020, 9:39 AM

This revision was automatically updated to reflect the committed changes.

jinlin updated this revision to Diff 250904. Mar 17 2020, 2:31 PM

I have made minor changes in test case llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir.

Testing Time: 128.22s

Expected Passes    : 36105
Expected Failures  : 163
Unsupported Tests  : 336

Revision Contents

Path	Size
llvm/lib/CodeGen/MachineOutliner.cpp	41 lines
llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir	271 lines

Diff 246034

llvm/lib/CodeGen/MachineOutliner.cpp

[90 lines hidden]
// functions. Since the outliner is confined to a single module (modulo LTO),		// functions. Since the outliner is confined to a single module (modulo LTO),
// this is off by default. It should, however, be the default behaviour in		// this is off by default. It should, however, be the default behaviour in
// LTO.		// LTO.
static cl::opt<bool> EnableLinkOnceODROutlining(		static cl::opt<bool> EnableLinkOnceODROutlining(
"enable-linkonceodr-outlining", cl::Hidden,		"enable-linkonceodr-outlining", cl::Hidden,
cl::desc("Enable the machine outliner on linkonceodr functions"),		cl::desc("Enable the machine outliner on linkonceodr functions"),
cl::init(false));		cl::init(false));

		// Set the number of times to repeatedly apply outlining.
		// Defaults to 1, but more repetitions can save additional size.
		static cl::opt<unsigned>
		NumRepeat("outline-repeat-count", cl::Hidden,
		cl::desc("The number of times to apply outlining"), cl::init(1));

namespace {		namespace {

/// Represents an undefined index in the suffix tree.		/// Represents an undefined index in the suffix tree.
const unsigned EmptyIdx = -1;		const unsigned EmptyIdx = -1;

/// A node in a suffix tree which represents a substring or suffix.		/// A node in a suffix tree which represents a substring or suffix.
///		///
/// Each node has either no children or at least two children, with the root		/// Each node has either no children or at least two children, with the root
[729 lines hidden]
struct MachineOutliner : public ModulePass {		struct MachineOutliner : public ModulePass {

static char ID;		static char ID;

/// Set to true if the outliner should consider functions with		/// Set to true if the outliner should consider functions with
/// linkonceodr linkage.		/// linkonceodr linkage.
bool OutlineFromLinkOnceODRs = false;		bool OutlineFromLinkOnceODRs = false;

		/// The current repeat number of machine outlining.
		unsigned OutlineRepeatedNum = 0;

/// Set to true if the outliner should run on all functions in the module		/// Set to true if the outliner should run on all functions in the module
/// considered safe for outlining.		/// considered safe for outlining.
/// Set to true by default for compatibility with llc's -run-pass option.		/// Set to true by default for compatibility with llc's -run-pass option.
/// Set when the pass is constructed in TargetPassConfig.		/// Set when the pass is constructed in TargetPassConfig.
bool RunOnAllFunctions = true;		bool RunOnAllFunctions = true;

StringRef getPassName() const override { return "Machine Outliner"; }		StringRef getPassName() const override { return "Machine Outliner"; }

[42 lines hidden]	struct MachineOutliner : public ModulePass {
bool outline(Module &M, std::vector<OutlinedFunction> &FunctionList,		bool outline(Module &M, std::vector<OutlinedFunction> &FunctionList,
InstructionMapper &Mapper, unsigned &OutlinedFunctionNum);		InstructionMapper &Mapper, unsigned &OutlinedFunctionNum);

/// Creates a function for \p OF and inserts it into the module.		/// Creates a function for \p OF and inserts it into the module.
MachineFunction *createOutlinedFunction(Module &M, OutlinedFunction &OF,		MachineFunction *createOutlinedFunction(Module &M, OutlinedFunction &OF,
InstructionMapper &Mapper,		InstructionMapper &Mapper,
unsigned Name);		unsigned Name);

/// Calls 'doOutline()'.		/// Calls runOnceOnModule NumRepeat times
bool runOnModule(Module &M) override;		bool runOnModule(Module &M) override;

		/// Calls 'doOutline()'.
		bool runOnceOnModule(Module &M, unsigned Iter);

/// Construct a suffix tree on the instructions in \p M and outline repeated		/// Construct a suffix tree on the instructions in \p M and outline repeated
/// strings from that tree.		/// strings from that tree.
bool doOutline(Module &M, unsigned &OutlinedFunctionNum);		bool doOutline(Module &M, unsigned &OutlinedFunctionNum);

/// Return a DISubprogram for OF if one exists, and null otherwise. Helper		/// Return a DISubprogram for OF if one exists, and null otherwise. Helper
/// function for remark emission.		/// function for remark emission.
DISubprogram *getSubprogramOrNull(const OutlinedFunction &OF) {		DISubprogram *getSubprogramOrNull(const OutlinedFunction &OF) {
for (const Candidate &C : OF.Candidates)		for (const Candidate &C : OF.Candidates)
[180 lines hidden]
}		}

MachineFunction *MachineOutliner::createOutlinedFunction(		MachineFunction *MachineOutliner::createOutlinedFunction(
Module &M, OutlinedFunction &OF, InstructionMapper &Mapper, unsigned Name) {		Module &M, OutlinedFunction &OF, InstructionMapper &Mapper, unsigned Name) {

// Create the function name. This should be unique.		// Create the function name. This should be unique.
// FIXME: We should have a better naming scheme. This should be stable,		// FIXME: We should have a better naming scheme. This should be stable,
// regardless of changes to the outliner's cost model/traversal order.		// regardless of changes to the outliner's cost model/traversal order.
std::string FunctionName = ("OUTLINED_FUNCTION_" + Twine(Name)).str();		std::string FunctionName;
		if (OutlineRepeatedNum > 0)
		FunctionName = ("OUTLINED_FUNCTION_" + Twine(OutlineRepeatedNum + 1) + "_" +
		Twine(Name))
		.str();
		else
		FunctionName = ("OUTLINED_FUNCTION_" + Twine(Name)).str();

// Create the function using an IR-level function.		// Create the function using an IR-level function.
LLVMContext &C = M.getContext();		LLVMContext &C = M.getContext();
Function *F = Function::Create(FunctionType::get(Type::getVoidTy(C), false),		Function *F = Function::Create(FunctionType::get(Type::getVoidTy(C), false),
Function::ExternalLinkage, FunctionName, M);		Function::ExternalLinkage, FunctionName, M);

// NOTE: If this is linkonceodr, then we can take advantage of linker deduping		// NOTE: If this is linkonceodr, then we can take advantage of linker deduping
// which gives us better results when we outline from linkonceodr functions.		// which gives us better results when we outline from linkonceodr functions.
[300 lines hidden]	MORE.emit([&]() {
FnCountAfter)		FnCountAfter)
<< "; Delta: "		<< "; Delta: "
<< DiagnosticInfoOptimizationBase::Argument("Delta", FnDelta);		<< DiagnosticInfoOptimizationBase::Argument("Delta", FnDelta);
return R;		return R;
});		});
}		}
}		}

bool MachineOutliner::runOnModule(Module &M) {		bool MachineOutliner::runOnceOnModule(Module &M, unsigned Iter) {
// Check if there's anything in the module. If it's empty, then there's		// Check if there's anything in the module. If it's empty, then there's
// nothing to outline.		// nothing to outline.
if (M.empty())		if (M.empty())
return false;		return false;

		OutlineRepeatedNum = Iter;

// Number to append to the current outlined function.		// Number to append to the current outlined function.
unsigned OutlinedFunctionNum = 0;		unsigned OutlinedFunctionNum = 0;

if (!doOutline(M, OutlinedFunctionNum))		if (!doOutline(M, OutlinedFunctionNum))
return false;		return false;
return true;		return true;
}		}

[47 lines hidden]	bool MachineOutliner::doOutline(Module &M, unsigned &OutlinedFunctionNum) {
// If we outlined something, we definitely changed the MI count of the		// If we outlined something, we definitely changed the MI count of the
// module. If we've asked for size remarks, then output them.		// module. If we've asked for size remarks, then output them.
// FIXME: This should be in the pass manager.		// FIXME: This should be in the pass manager.
if (ShouldEmitSizeRemarks && OutlinedSomething)		if (ShouldEmitSizeRemarks && OutlinedSomething)
emitInstrCountChangedRemark(M, MMI, FunctionToInstrCount);		emitInstrCountChangedRemark(M, MMI, FunctionToInstrCount);

return OutlinedSomething;		return OutlinedSomething;
}		}

		// Apply machine outlining for NumRepeat times.
		bool MachineOutliner::runOnModule(Module &M) {
		unsigned NumRepeats = NumRepeat;
		if (NumRepeats < 1)
		NumRepeats = 1;

		bool Changed = false;
		for (unsigned I = 0; I < NumRepeats; I++) {
		if (!runOnceOnModule(M, I))
		return Changed;
		Changed = true;
		}
		return Changed;
		}
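
The naming branch in the `createOutlinedFunction` hunk of this diff can be restated as a small standalone C++ sketch. The free function `outlinedFunctionName` is a hypothetical helper introduced only for illustration, and it uses `std::to_string` in place of `Twine`; it reflects the scheme shown in Diff 246034, which may not match the committed revision.

```cpp
#include <string>

// Hypothetical helper mirroring the branch in this diff: the first run
// (repeat index 0) keeps the original "OUTLINED_FUNCTION_<N>" form, while
// later runs insert the 1-based run number before the function number.
std::string outlinedFunctionName(unsigned outlineRepeatedNum, unsigned name) {
  if (outlineRepeatedNum > 0)
    return "OUTLINED_FUNCTION_" + std::to_string(outlineRepeatedNum + 1) +
           "_" + std::to_string(name);
  return "OUTLINED_FUNCTION_" + std::to_string(name);
}
```

Under this scheme, functions created by the second run carry names such as `OUTLINED_FUNCTION_2_0`, which is the pattern the test's `CHECK` line matches.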

llvm/test/CodeGen/AArch64/machine-outliner-iterative.mir

This file was added.

				# RUN: llc -mtriple=aarch64--- -run-pass=prologepilog -run-pass=machine-outliner -outline-repeat-count=2 -verify-machineinstrs %s -o - | FileCheck %s

				# Example of Repeated Instruction Sequence - Iterative Machine Outlining
				#
				# define void @"$s12"(...) {            define i64 @"$s5"(...) {              define void @"$s13"(...) {
				#   ...                                   ...                                   ...
				#   %8 = load i1, i1* %7                                                        %8 = load i1, i1* %7
				#   %9 = load i4, i4* %6                  %9 = load i4, i4* %6                  %9 = load i4, i4* %6
				#   store i4 %9, i4* %5                   store i4 %9, i4* %5                   store i4 %9, i4* %5
				#   ...                                   ...                                   ...
				# }                                     }                                     }
				#
				# After machine outliner (1st time)
				#
				# define void @"$s12"(...) {            define i64 @"$s5"(...) {              define void @"$s13"(...) {
				#   ...                                   ...                                   ...
				#   %8 = load i1, i1* %7                                                        %8 = load i1, i1* %7
				#   call void @outlined_function_1_1      call void @outlined_function_1_1      call void @outlined_function_1_1
				#   ...                                   ...                                   ...
				# }                                     }                                     }
				#
				# After machine outliner (2nd time)
				#
				# define void @"$s12"(...) {            define i64 @"$s5"(...) {              define void @"$s13"(...) {
				#   ...                                   ...                                   ...
				#   call void @outlined_function_2_1      call void @outlined_function_1_1      call void @outlined_function_2_1
				#   ...                                   ...                                   ...
				# }                                     }                                     }
				#
				# Check whether the machine outliner can find further outlining opportunities after machine
				# outlining has been performed.
				#
				--- |
				target triple = "aarch64-apple-darwin"

				%0 = type { %1*, i64 }
				%1 = type { i64 }
				%2 = type <{ %0, %3 }>
				%3 = type <{ %4 }>
				%4 = type <{ %5 }>
				%5 = type <{ %6* }>
				%6 = type opaque
				%7 = type <{ %0, %8, %3 }>
				%8 = type <{ %9, %9, %9, %9 }>
				%9 = type <{ %10 }>
				%10 = type <{ double }>
				declare %0* @widget(%0* returned) local_unnamed_addr
				declare void @foo(i8, [24 x i8], i64, i8*) local_unnamed_addr
				declare hidden swiftcc %2* @bar()
				define void @baz.14() {
				bb:
				%tmp3 = alloca [24 x i8], align 8
				%tmp37 = call swiftcc %7* @barney.16()
				%tmp57 = getelementptr inbounds %7, %7* %tmp37, i64 0, i32 2
				%tmp59 = bitcast %3* %tmp57 to i8*
				call void @foo(i8* nonnull %tmp59, [24 x i8]* nonnull %tmp3, i64 33, i8* null)
				%tmp602 = bitcast %7* %tmp37 to %0*
				%tmp61 = call %0* @widget(%0* returned %tmp602)
				ret void
				}
				define void @baz.15() {
				bb:
				%tmp4 = alloca [24 x i8], align 8
				%tmp11 = tail call swiftcc %2* @bar()
				%tmp38 = call swiftcc %7* @barney.16()
				%tmp101 = getelementptr inbounds %2, %2* %tmp11, i64 0, i32 1
				%tmp103 = bitcast %3* %tmp101 to i8*
				call void @foo(i8* nonnull %tmp103, [24 x i8]* nonnull %tmp4, i64 33, i8* null)
				%tmp1042 = bitcast %7* %tmp38 to %0*
				%tmp105 = call %0* @widget(%0* returned %tmp1042)
				ret void
				}
				define void @baz.16() {
				bb:
				%tmp6 = alloca [24 x i8], align 8
				%tmp39 = call swiftcc %7* @barney.16()
				%tmp178 = getelementptr inbounds %7, %7* %tmp39, i64 0, i32 2
				%tmp180 = bitcast %3* %tmp178 to i8*
				call void @foo(i8* nonnull %tmp180, [24 x i8]* nonnull %tmp6, i64 33, i8* null)
				%tmp1812 = bitcast %7* %tmp39 to %0*
				%tmp182 = call %0* @widget(%0* returned %tmp1812)
				ret void
				}
				declare hidden swiftcc %7* @barney.16() local_unnamed_addr

				...
				---
				name: baz.14
				alignment: 4
				exposesReturnsTwice: false
				legalized: false
				regBankSelected: false
				selected: false
				failedISel: false
				tracksRegLiveness: true
				hasWinCFI: false
				registers: []
				liveins: []
				frameInfo:
				isFrameAddressTaken: false
				isReturnAddressTaken: false
				hasStackMap: false
				hasPatchPoint: false
				stackSize: 0
				offsetAdjustment: 0
				maxAlignment: 8
				adjustsStack: true
				hasCalls: true
				stackProtector: ''
				maxCallFrameSize: 0
				cvBytesOfCalleeSavedRegisters: 0
				hasOpaqueSPAdjustment: false
				hasVAStart: false
				hasMustTailInVarArgFunc: false
				localFrameSize: 24
				savePoint: ''
				restorePoint: ''
				fixedStack: []
				stack:
				- { id: 0, name: tmp3, type: default, offset: 0, size: 24, alignment: 8,
				stack-id: default, callee-saved-register: '', callee-saved-restored: true,
				local-offset: -24, debug-info-variable: '', debug-info-expression: '',
				debug-info-location: '' }
				callSites: []
				constants: []
				machineFunctionInfo: {}
				body: |
				bb.0.bb:
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				BL @barney.16, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit-def $sp, implicit-def $x0
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				renamable $x19 = COPY $x0
				renamable $x0 = nuw ADDXri $x0, 48, 0
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x1 = ADDXri %stack.0.tmp3, 0, 0
				dead $w2 = MOVi32imm 33, implicit-def $x2
				$x3 = COPY $xzr
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit $x0, implicit killed $x1, implicit killed $x2, implicit killed $x3, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x0 = COPY killed renamable $x19
				BL @widget, csr_aarch64_aapcs_thisreturn, implicit-def dead $lr, implicit $sp, implicit $x0, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				RET_ReallyLR

				...
				---
				name: baz.15
				alignment: 4
				exposesReturnsTwice: false
				legalized: false
				regBankSelected: false
				selected: false
				failedISel: false
				tracksRegLiveness: true
				hasWinCFI: false
				registers: []
				liveins: []
				frameInfo:
				isFrameAddressTaken: false
				isReturnAddressTaken: false
				hasStackMap: false
				hasPatchPoint: false
				stackSize: 0
				offsetAdjustment: 0
				maxAlignment: 8
				adjustsStack: true
				hasCalls: true
				stackProtector: ''
				maxCallFrameSize: 0
				cvBytesOfCalleeSavedRegisters: 0
				hasOpaqueSPAdjustment: false
				hasVAStart: false
				hasMustTailInVarArgFunc: false
				localFrameSize: 24
				savePoint: ''
				restorePoint: ''
				fixedStack: []
				stack:
				- { id: 0, name: tmp4, type: default, offset: 0, size: 24, alignment: 8,
				stack-id: default, callee-saved-register: '', callee-saved-restored: true,
				local-offset: -24, debug-info-variable: '', debug-info-expression: '',
				debug-info-location: '' }
				callSites: []
				constants: []
				machineFunctionInfo: {}
				body: |
				bb.0.bb:
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				BL @bar, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit-def $sp, implicit-def $x0
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				renamable $x19 = COPY $x0
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				BL @barney.16, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit-def $sp, implicit-def $x0
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				renamable $x20 = COPY $x0
				renamable $x0 = nuw ADDXri killed renamable $x19, 16, 0
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x1 = ADDXri %stack.0.tmp4, 0, 0
				dead $w2 = MOVi32imm 33, implicit-def $x2
				$x3 = COPY $xzr
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit $x0, implicit killed $x1, implicit killed $x2, implicit killed $x3, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x0 = COPY killed renamable $x20
				BL @widget, csr_aarch64_aapcs_thisreturn, implicit-def dead $lr, implicit $sp, implicit $x0, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				RET_ReallyLR

				...
				---
				name: baz.16
				alignment: 4
				exposesReturnsTwice: false
				legalized: false
				regBankSelected: false
				selected: false
				failedISel: false
				tracksRegLiveness: true
				hasWinCFI: false
				registers: []
				liveins: []
				frameInfo:
				isFrameAddressTaken: false
				isReturnAddressTaken: false
				hasStackMap: false
				hasPatchPoint: false
				stackSize: 0
				offsetAdjustment: 0
				maxAlignment: 8
				adjustsStack: true
				hasCalls: true
				stackProtector: ''
				maxCallFrameSize: 0
				cvBytesOfCalleeSavedRegisters: 0
				hasOpaqueSPAdjustment: false
				hasVAStart: false
				hasMustTailInVarArgFunc: false
				localFrameSize: 24
				savePoint: ''
				restorePoint: ''
				fixedStack: []
				stack:
				- { id: 0, name: tmp6, type: default, offset: 0, size: 24, alignment: 8,
				stack-id: default, callee-saved-register: '', callee-saved-restored: true,
				local-offset: -24, debug-info-variable: '', debug-info-expression: '',
				debug-info-location: '' }
				callSites: []
				constants: []
				machineFunctionInfo: {}
				body: |
				bb.0.bb:
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				BL @barney.16, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit-def $sp, implicit-def $x0
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				renamable $x19 = COPY $x0
				renamable $x0 = nuw ADDXri $x0, 48, 0
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x1 = ADDXri %stack.0.tmp6, 0, 0
				dead $w2 = MOVi32imm 33, implicit-def $x2
				$x3 = COPY $xzr
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit $x0, implicit killed $x1, implicit killed $x2, implicit killed $x3, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				ADJCALLSTACKDOWN 0, 0, implicit-def dead $sp, implicit $sp
				$x0 = COPY killed renamable $x19
				BL @widget, csr_aarch64_aapcs_thisreturn, implicit-def dead $lr, implicit $sp, implicit $x0, implicit-def $sp
				ADJCALLSTACKUP 0, 0, implicit-def dead $sp, implicit $sp
				RET_ReallyLR

				...
				# CHECK: [[OUTLINED:OUTLINED_FUNCTION_2_[0-9]+]]
