This is an archive of the discontinued LLVM Phabricator instance.

[FuncSpec] Do not overestimate the specialization bonus for users inside loops.
ClosedPublic

Authored by labrinea on Oct 25 2022, 8:18 AM.

Download Raw Diff

Details

Reviewers

SjoerdMeijer
ChuanqiXu
momchil.velikov

Commits

rGdbeaf6baa2ad: [FuncSpec] Do not overestimate the specialization bonus for users inside loops.

Summary

When calculating the specialization bonus for a given function argument, we recursively traverse the chain of (certain) users, accumulating the instruction costs. Then we exponentially increase the bonus to account for loop nests. This is problematic for two reasons: (a) the users might not themselves be inside the loop nest, (b) if they are we are accounting for it multiple times. Instead we should be adjusting the bonus before traversing the user chain.

This reduces the instruction count for CTMark (newPM-O3) when Function Specialization is enabled without actually reducing the amount of specializations performed.

testname	delta % non-LTO	delta % LTO
ClamAV	-0.005	0.039
7zip	0.012	-0.007
tramp3d-v4	-0.013	-0.011
kimwitu++	-0.011	0.146
sqlite3	0.04	-0.445
mafft	0.006	0.011
lencod	-0.02	-0.023
SPASS	-0.006	-1.06
consumer-typeset	0.005	-2.644
Bullet	-0.015	-0.029
geomean	-0.001	-0.406

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

labrinea created this revision.Oct 25 2022, 8:18 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 25 2022, 8:18 AM

Herald added subscribers: snehasish, ormris, hiraditya. · View Herald Transcript

labrinea requested review of this revision.Oct 25 2022, 8:18 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 25 2022, 8:18 AM

labrinea retitled this revision from [FuncSpec] Do not overestime the specialization bonus for users inside loops. to [FuncSpec] Do not overestimate the specialization bonus for users inside loops..Oct 25 2022, 8:21 AM

Harbormaster completed remote builds in B194196: Diff 470494.Oct 25 2022, 10:25 AM

The reason makes sense and the data is overwhelming. LGTM!

This revision is now accepted and ready to land.Oct 26 2022, 7:32 PM

This revision was landed with ongoing or failed builds.Oct 27 2022, 7:40 AM

Closed by commit rGdbeaf6baa2ad: [FuncSpec] Do not overestimate the specialization bonus for users inside loops. (authored by labrinea). · Explain Why

This revision was automatically updated to reflect the committed changes.

labrinea added a commit: rGdbeaf6baa2ad: [FuncSpec] Do not overestimate the specialization bonus for users inside loops..

Revision Contents

Path

Size

llvm/

lib/

Transforms/

IPO/

FunctionSpecialization.cpp

7 lines

test/

Transforms/

FunctionSpecialization/

function-specialization-loop.ll

4 lines

Diff 471165

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp

Show First 20 Lines • Show All 568 Lines • ▼ Show 20 Lines	InstructionCost getUserBonus(User *U, llvm::TargetTransformInfo &TTI,
// Keep minimum possible cost for now so that it doesnt affect		// Keep minimum possible cost for now so that it doesnt affect
// specialization.		// specialization.
if (!I)		if (!I)
return std::numeric_limits<unsigned>::min();		return std::numeric_limits<unsigned>::min();

InstructionCost Cost =		InstructionCost Cost =
TTI.getInstructionCost(U, TargetTransformInfo::TCK_SizeAndLatency);		TTI.getInstructionCost(U, TargetTransformInfo::TCK_SizeAndLatency);

		// Increase the cost if it is inside the loop.
		unsigned LoopDepth = LI.getLoopDepth(I->getParent());
		Cost *= std::pow((double)AvgLoopIterationCount, LoopDepth);

// Traverse recursively if there are more uses.		// Traverse recursively if there are more uses.
// TODO: Any other instructions to be added here?		// TODO: Any other instructions to be added here?
if (I->mayReadFromMemory() \|\| I->isCast())		if (I->mayReadFromMemory() \|\| I->isCast())
for (auto *User : I->users())		for (auto *User : I->users())
Cost += getUserBonus(User, TTI, LI);		Cost += getUserBonus(User, TTI, LI);

// Increase the cost if it is inside the loop.
auto LoopDepth = LI.getLoopDepth(I->getParent());
Cost *= std::pow((double)AvgLoopIterationCount, LoopDepth);
return Cost;		return Cost;
}		}

/// Compute a bonus for replacing argument \p A with constant \p C.		/// Compute a bonus for replacing argument \p A with constant \p C.
InstructionCost getSpecializationBonus(Argument A, Constant C) {		InstructionCost getSpecializationBonus(Argument A, Constant C) {
Function *F = A->getParent();		Function *F = A->getParent();
DominatorTree DT(*F);		DominatorTree DT(*F);
LoopInfo LI(DT);		LoopInfo LI(DT);
▲ Show 20 Lines • Show All 381 Lines • Show Last 20 Lines

llvm/test/Transforms/FunctionSpecialization/function-specialization-loop.ll

	; RUN: opt -function-specialization -func-specialization-avg-iters-cost=3 -func-specialization-size-threshold=10 -S < %s \| FileCheck %s			; RUN: opt -function-specialization -func-specialization-avg-iters-cost=5 -func-specialization-size-threshold=10 -S < %s \| FileCheck %s

	; Check that the loop depth results in a larger specialization bonus.			; Check that the loop depth results in a larger specialization bonus.
	; CHECK: @foo.1(			; CHECK: @foo.1(
	; CHECK: @foo.2(			; CHECK: @foo.2(

	target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"

	@A = external dso_local constant i32, align 4			@A = external dso_local constant i32, align 4
	▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines

	if.else:			if.else:
	%call1 = call i32 @foo(i32 %y, i32* @B, i32* @D)			%call1 = call i32 @foo(i32 %y, i32* @B, i32* @D)
	br label %return			br label %return

	return:			return:
	%retval.0 = phi i32 [ %call, %if.then ], [ %call1, %if.else ]			%retval.0 = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
	ret i32 %retval.0			ret i32 %retval.0
	}			}
	No newline at end of file