Details

Reviewers

ChuanqiXu
SjoerdMeijer
labrinea

Commits

rG14384c96df0d: Recommit: [FuncSpec][NFC] Refactor finding specialisation opportunities
rGa8853924bd3c: [FuncSpec][NFC] Refactor finding specialisation opportunities

Summary

This patch reorders the traversal of function call sites and function
formal parameters to:

do the various argument feasibility checks (`isArgumentInteresting`) only once per argument, i.e. N-args checks instead of N-calls x N-args checks.

do hash table lookups only once per call site, i.e. N-calls lookups/inserts instead of N-calls x N-args lookups/inserts.
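
The effect of the reordering on map traffic can be illustrated with a toy model. This is only a sketch under stated assumptions, not the pass's code: `CallSite`, `Bindings`, `Stats`, `argsOutermost` and `callsOutermost` are hypothetical names, call sites are plain vectors of integers, and a negative value stands for a non-constant actual argument. In this model both shapes run the per-parameter check N-args times; the difference shows up in how often the call-site-keyed map is touched.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Toy model: a call site is a list of actual arguments, where a negative
// value stands for "not a constant". Nothing here is LLVM API.
using CallSite = std::vector<int>;
using Bindings = std::vector<std::pair<std::size_t, int>>; // (arg index, constant)

struct Stats {
  std::size_t ArgChecks = 0;  // per-parameter feasibility checks
  std::size_t MapLookups = 0; // lookups/inserts keyed by call site
};

// Parameters outermost: the map is consulted once per
// (parameter, call site) pair that carries a constant.
std::map<std::size_t, Bindings>
argsOutermost(const std::vector<CallSite> &Calls, std::size_t NumArgs,
              Stats &S) {
  std::map<std::size_t, Bindings> Result;
  for (std::size_t A = 0; A < NumArgs; ++A) {
    ++S.ArgChecks;
    for (std::size_t C = 0; C < Calls.size(); ++C) {
      assert(Calls[C].size() == NumArgs && "malformed call site");
      if (Calls[C][A] < 0)
        continue;
      ++S.MapLookups;
      Result[C].push_back({A, Calls[C][A]});
    }
  }
  return Result;
}

// Call sites outermost: filter parameters once, then visit each call site
// once and touch the map at most once per call site.
std::map<std::size_t, Bindings>
callsOutermost(const std::vector<CallSite> &Calls, std::size_t NumArgs,
               Stats &S) {
  std::vector<std::size_t> Args;
  for (std::size_t A = 0; A < NumArgs; ++A) {
    ++S.ArgChecks;
    Args.push_back(A); // every parameter qualifies in this toy model
  }
  std::map<std::size_t, Bindings> Result;
  for (std::size_t C = 0; C < Calls.size(); ++C) {
    assert(Calls[C].size() == NumArgs && "malformed call site");
    Bindings Found;
    for (std::size_t A : Args)
      if (Calls[C][A] >= 0)
        Found.push_back({A, Calls[C][A]});
    if (Found.empty())
      continue;
    ++S.MapLookups;
    Result.emplace(C, std::move(Found));
  }
  return Result;
}
```

Both functions produce the same bindings per call site; only the number of map operations differs.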

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

chill created this revision. · Oct 14 2022, 9:21 AM

Herald added a project: Restricted Project. · Oct 14 2022, 9:21 AM

Herald added subscribers: snehasish, ormris, hiraditya.

chill requested review of this revision. · Oct 14 2022, 9:21 AM

Herald added a project: Restricted Project. · Oct 14 2022, 9:21 AM

Herald added a subscriber: llvm-commits.

chill added a parent revision: D135893: [FuncSpec] Fix specialisation based on literals. · Oct 14 2022, 9:21 AM

chill added reviewers: ChuanqiXu, SjoerdMeijer, labrinea.

labrinea accepted this revision. · Oct 14 2022, 9:44 AM

labrinea added inline comments.

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
464–465	This confused me a bit and then I remembered it's a MapVector. LGTM!
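
For readers who trip over the same idiom: `Specializations.back()` can stand in for a second keyed lookup because a MapVector keeps its entries in insertion order, so the entry just inserted is the last one. The sketch below uses a simplified stand-in written in standard C++ rather than `llvm::MapVector` itself; `OrderedMap` and `accumulateGain` are hypothetical names, and call sites are modelled as strings. It relies on each key being inserted only once, which matches a loop that visits every call site once.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Minimal insertion-ordered map: a vector of entries plus an index from key
// to position. A simplified stand-in, not llvm::MapVector.
template <typename K, typename V> class OrderedMap {
  std::vector<std::pair<K, V>> Entries;
  std::unordered_map<K, std::size_t> Index;

public:
  // Returns the value for Key, appending a default-constructed entry if the
  // key has not been seen before.
  V &operator[](const K &Key) {
    auto Res = Index.emplace(Key, Entries.size());
    if (Res.second)
      Entries.emplace_back(Key, V());
    return Entries[Res.first->second].second;
  }

  // The most recently inserted entry.
  std::pair<K, V> &back() {
    assert(!Entries.empty() && "back() on an empty map");
    return Entries.back();
  }

  std::size_t size() const { return Entries.size(); }
};

// One keyed insert per call site; later updates go through back() and need
// no further lookup. Returns the accumulated gain, or 0 if nothing was added.
int accumulateGain(OrderedMap<std::string, int> &Specs,
                   const std::string &Call, int Cost,
                   const std::vector<int> &Bonuses) {
  bool Added = false;
  for (int Bonus : Bonuses) {
    if (!Added) {
      Specs[Call] = 0 - Cost; // the only keyed access for this call site
      Added = true;
    }
    Specs.back().second += Bonus;
  }
  return Added ? Specs.back().second : 0;
}
```

A call site without any constant argument never reaches the insert, so it leaves no entry behind.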

This revision is now accepted and ready to land. · Oct 14 2022, 9:44 AM

Harbormaster completed remote builds in B192195: Diff 467796. · Oct 14 2022, 10:11 AM

ChuanqiXu added inline comments. · Oct 16 2022, 7:16 PM

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
671–673	Why this check missed?

chill added inline comments. · Oct 17 2022, 1:35 AM

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
671–673	Oops, that's accidental. I'll put it back.

chill planned changes to this revision. · Oct 17 2022, 4:12 AM

Yeah, nice patch.

Just a few nits inlined when I was reading the patch.

According to my definition of NFC, this is not so NFC as things are done quite differently now. I appreciate the codegen should be the same, which is the definition of NFC that some use I believe.

Just out of curiosity, did you measure compile-time improvements with this, which I guess is the reason for doing this?

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
429	Nit: you can also drop these brackets I think
434	Nit: not sure the coding standard says anything about comparing against zero, but personally it's a bit verbose for me and I prefer `!Args.size()`.
671–673	Looks like we are missing a test case? :)
682	After reducing/changing the comments, the function name and comment are a bit out of sync now for me, because specialisation should not only be "possible" (the new comment), but also profitable, which is why "interesting" is in the function name. Perhaps restoring some words about the profitability explains the "interesting" part.

Unfortunately, this patch exposes a latent issue in the pass - one call site matching two or more specialisations - as can be seen in the failure of llvm/test/Transforms/FunctionSpecialization/identical-specializations.ll.

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
671–673	Ack!
682	In fact, the old comment was out of sync - neither the old `isArgumentInteresting` nor the old `getPossibleConstants` did any cost/benefit analysis.

chill added a parent revision: D136180: [FuncSpec] Compute specialisation gain even when forcing specialisation. · Oct 18 2022, 8:27 AM

chill removed a parent revision: D135893: [FuncSpec] Fix specialisation based on literals.

chill added inline comments. · Oct 18 2022, 9:54 AM

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp
671–673	Testcase added in D136184

chill updated this revision to Diff 468603. · Oct 18 2022, 9:58 AM

This revision is now accepted and ready to land. · Oct 18 2022, 9:58 AM

chill requested review of this revision. · Oct 18 2022, 9:58 AM

chill marked 6 inline comments as done.

In D135968#3861974, @SjoerdMeijer wrote:

According to my definition of NFC, this is not so NFC as things are done quite differently now. I appreciate the codegen should be the same, which is the definition of NFC that some use I believe.

AFAIK, the working notion of NFC in LLVM is "does not change any testcase", which is a bit of a compromise, of course. 😁

I think it can try specialisations of the same cost in a different order, which has the potential of changing the codegen; however, it does not violate any explicit guarantees we make in the pass.

Just out of curiosity, did you measure compile-time improvements with this, which I guess is the reason for doing this?

No, not measured yet, but planning to.

Harbormaster completed remote builds in B192788: Diff 468603. · Oct 18 2022, 11:15 AM

LGTM.

This revision is now accepted and ready to land. · Oct 18 2022, 7:17 PM

chill added a child revision: D136332: [FuncSpec][NFC] Avoid redundant computations of DominatorTree/LoopInfo. · Oct 20 2022, 3:56 AM

In D135968#3865784, @chill wrote:

Just out of curiosity, did you measure compile-time improvements with this, which I guess is the reason for doing this?

No, not measured yet, but planning to.

I did some measurements on sqlite3 (with the whole series ending in D136332), with `-O3` and with `-O3 -mllvm -enable-function-specialization`.
I got about 0.42% overall regression compared to not running the FunctionSpecialization pass and about 0.45% improvement when the pass is enabled in both compilers.
Comparing just the pass (via `clang ... -ftime-report 2>&1 | grep FunctionSpecialization`) shows about 2.2% improvement.

This revision was landed with ongoing or failed builds. · Oct 26 2022, 2:26 AM

Closed by commit rGa8853924bd3c: [FuncSpec][NFC] Refactor finding specialisation opportunities (authored by chill).

This revision was automatically updated to reflect the committed changes.

chill added a commit: rGa8853924bd3c: [FuncSpec][NFC] Refactor finding specialisation opportunities.

chill added a reverting change: rG2c8a4c6e620c: Revert "[FuncSpec][NFC] Refactor finding specialisation opportunities". · Oct 26 2022, 5:54 AM

chill reopened this revision. · Oct 26 2022, 5:57 AM

This revision is now accepted and ready to land. · Oct 26 2022, 5:57 AM

chill updated this revision to Diff 470835. · Oct 26 2022, 9:21 AM

Harbormaster completed remote builds in B194428: Diff 470835. · Oct 26 2022, 9:22 AM

This revision looks similar to 467796, which must be accidental. Can you update it as it was when it landed? (470754)

This revision now requires changes to proceed. · Oct 26 2022, 3:35 PM

chill edited parent revisions, added: D135893: [FuncSpec] Fix specialisation based on literals; removed: D136180: [FuncSpec] Compute specialisation gain even when forcing specialisation. · Oct 27 2022, 3:04 AM

chill updated this revision to Diff 471103. · Oct 27 2022, 3:07 AM

Harbormaster completed remote builds in B194608: Diff 471103. · Oct 27 2022, 3:08 AM

chill planned changes to this revision. · Oct 27 2022, 3:11 AM

chill updated this revision to Diff 471107. · Oct 27 2022, 3:32 AM

Harbormaster completed remote builds in B194613: Diff 471107. · Oct 27 2022, 3:32 AM

labrinea accepted this revision. · Oct 27 2022, 6:00 AM

This revision is now accepted and ready to land. · Oct 27 2022, 6:00 AM

This revision was landed with ongoing or failed builds. · Oct 28 2022, 3:27 AM

Closed by commit rG14384c96df0d: Recommit: [FuncSpec][NFC] Refactor finding specialisation opportunities (authored by chill).

This revision was automatically updated to reflect the committed changes.

chill added a commit: rG14384c96df0d: Recommit: [FuncSpec][NFC] Refactor finding specialisation opportunities.

Diff 471475

llvm/lib/Transforms/IPO/FunctionSpecialization.cpp

(309 lines not shown)	for (auto *F : Candidates) {
dbgs() << "FnSpecialization: Invalid specialization cost.\n");		dbgs() << "FnSpecialization: Invalid specialization cost.\n");
continue;		continue;
}		}

LLVM_DEBUG(dbgs() << "FnSpecialization: Specialization cost for "		LLVM_DEBUG(dbgs() << "FnSpecialization: Specialization cost for "
<< F->getName() << " is " << Cost << "\n");		<< F->getName() << " is " << Cost << "\n");

SmallVector<CallSpecBinding, 8> Specializations;		SmallVector<CallSpecBinding, 8> Specializations;
if (!calculateGains(F, Cost, Specializations)) {		if (!findSpecializations(F, Cost, Specializations)) {
LLVM_DEBUG(dbgs() << "FnSpecialization: No possible constants found\n");		LLVM_DEBUG(
		dbgs() << "FnSpecialization: No possible specializations found\n");
continue;		continue;
}		}

Changed = true;		Changed = true;
for (auto &Entry : Specializations)		for (auto &Entry : Specializations)
specializeFunction(F, Entry.second, WorkList);		specializeFunction(F, Entry.second, WorkList);
}		}

(88 lines not shown)	private:
}		}

/// This function decides whether it's worthwhile to specialize function		/// This function decides whether it's worthwhile to specialize function
/// \p F based on the known constant values its arguments can take on. It		/// \p F based on the known constant values its arguments can take on. It
/// only discovers potential specialization opportunities without actually		/// only discovers potential specialization opportunities without actually
/// applying them.		/// applying them.
///		///
/// \returns true if any specializations have been found.		/// \returns true if any specializations have been found.
bool calculateGains(Function *F, InstructionCost Cost,		bool findSpecializations(Function *F, InstructionCost Cost,
SmallVectorImpl<CallSpecBinding> &WorkList) {		SmallVectorImpl<CallSpecBinding> &WorkList) {
		// Get a list of interesting arguments.
		SmallVector<Argument *, 4> Args;
		for (Argument &Arg : F->args())
		SjoerdMeijer: Nit: you can also drop these brackets I think
		if (isArgumentInteresting(&Arg))
		Args.push_back(&Arg);

		if (!Args.size())
		return false;
		SjoerdMeijer: Nit: not sure the coding standard says anything about comparing against zero, but personally it's a bit verbose for me and I prefer `!Args.size()`.

		// Find all the call sites for the function.
SpecializationMap Specializations;		SpecializationMap Specializations;
// Determine if we should specialize the function based on the values the		for (User *U : F->users()) {
// argument can take on. If specialization is not profitable, we continue		if (!isa<CallInst>(U) && !isa<InvokeInst>(U))
// on to the next argument.		continue;
for (Argument &FormalArg : F->args()) {		auto &CS = *cast<CallBase>(U);
// Determine if this argument is interesting. If we know the argument can		// If the call site has attribute minsize set, that callsite won't be
// take on any constant values, they are collected in Constants.		// specialized.
SmallVector<CallArgBinding, 8> ActualArgs;		if (CS.hasFnAttr(Attribute::MinSize))
if (!isArgumentInteresting(&FormalArg, ActualArgs)) {		continue;
LLVM_DEBUG(dbgs() << "FnSpecialization: Argument "
<< FormalArg.getNameOrAsOperand()		// If the parent of the call site will never be executed, we don't need
<< " is not interesting\n");		// to worry about the passed value.
		if (!Solver.isBlockExecutable(CS.getParent()))
continue;		continue;

		// Examine arguments and create specialization candidates from call sites
		// with constant arguments.
		bool Added = false;
		for (Argument *A : Args) {
		Constant *C = getCandidateConstant(CS.getArgOperand(A->getArgNo()));
		if (!C)
		continue;

		if (!Added) {
		Specializations[&CS] = {{}, 0 - Cost};
		Added = true;
}		}

for (const auto &Entry : ActualArgs) {		SpecializationInfo &S = Specializations.back().second;
		labrinea: This confused me a bit and then I remembered it's a MapVector. LGTM!
CallBase *Call = Entry.first;		S.Gain += getSpecializationBonus(A, C);
Constant *ActualArg = Entry.second;		S.Args.push_back({A, C});

auto I = Specializations.insert({Call, SpecializationInfo()});
SpecializationInfo &S = I.first->second;

if (I.second)
S.Gain = 0 - Cost;
S.Gain += getSpecializationBonus(&FormalArg, ActualArg);
S.Args.push_back({&FormalArg, ActualArg});
}		}
		Added = false;
}		}

// Remove unprofitable specializations.		// Remove unprofitable specializations.
if (!ForceFunctionSpecialization)		if (!ForceFunctionSpecialization)
Specializations.remove_if(		Specializations.remove_if(
[](const auto &Entry) { return Entry.second.Gain <= 0; });		[](const auto &Entry) { return Entry.second.Gain <= 0; });

// Clear the MapVector and return the underlying vector.		// Clear the MapVector and return the underlying vector.
(188 lines not shown)	for (User *U : A->users()) {

LLVM_DEBUG(dbgs() << "FnSpecialization: Inlining bonus " << Bonus		LLVM_DEBUG(dbgs() << "FnSpecialization: Inlining bonus " << Bonus
<< " for user " << *U << "\n");		<< " for user " << *U << "\n");
}		}

return TotalCost + Bonus;		return TotalCost + Bonus;
}		}

/// Determine if we should specialize a function based on the incoming values		/// Determine if it is possible to specialise the function for constant values
/// of the given argument.		/// of the formal parameter \p A.
///		bool isArgumentInteresting(Argument *A) {
/// This function implements the goal-directed heuristic. It determines if
/// specializing the function based on the incoming values of argument \p A
/// would result in any significant optimization opportunities. If
/// optimization opportunities exist, the constant values of \p A on which to
/// specialize the function are collected in \p Constants.
///
/// \returns true if the function should be specialized on the given
/// argument.
bool isArgumentInteresting(Argument *A,
SmallVectorImpl<CallArgBinding> &Constants) {

// No point in specialization if the argument is unused.		// No point in specialization if the argument is unused.
if (A->user_empty())		if (A->user_empty())
return false;		return false;
ChuanqiXu: Why this check missed?
chill: Oops, that's accidental. I'll put it back.
SjoerdMeijer: Looks like we are missing a test case? :)
chill: Ack!
chill: Testcase added in D136184

// For now, don't attempt to specialize functions based on the values of		// For now, don't attempt to specialize functions based on the values of
// composite types.		// composite types.
		SjoerdMeijer: After reducing/changing the comments, the function name and comment are a bit out of sync now for me, because specialisation should not only be "possible" (the new comment), but also profitable, which is why "interesting" is in the function name. Perhaps restoring some words about the profitability explains the "interesting" part.
		chill: In fact, the old comment was out of sync - neither the old `isArgumentInteresting` nor the old `getPossibleConstants` did any cost/benefit analysis.
Type *ArgTy = A->getType() ;		Type *ArgTy = A->getType();
if (!ArgTy->isSingleValueType())		if (!ArgTy->isSingleValueType())
return false;		return false;

// Specialization of integer and floating point types needs to be explicitly enabled.		// Specialization of integer and floating point types needs to be explicitly
		// enabled.
if (!EnableSpecializationForLiteralConstant &&		if (!EnableSpecializationForLiteralConstant &&
(ArgTy->isIntegerTy() || ArgTy->isFloatingPointTy()))		(ArgTy->isIntegerTy() || ArgTy->isFloatingPointTy()))
return false;		return false;

// SCCP solver does not record an argument that will be constructed on		// SCCP solver does not record an argument that will be constructed on
// stack.		// stack.
if (A->hasByValAttr() && !A->getParent()->onlyReadsMemory())		if (A->hasByValAttr() && !A->getParent()->onlyReadsMemory())
return false;		return false;

// Check the lattice value and decide if we should attempt to specialize,		// Check the lattice value and decide if we should attempt to specialize,
// based on this argument. No point in specialization, if the lattice value		// based on this argument. No point in specialization, if the lattice value
// is already a constant.		// is already a constant.
const ValueLatticeElement &LV = Solver.getLatticeValueFor(A);		const ValueLatticeElement &LV = Solver.getLatticeValueFor(A);
if (LV.isUnknownOrUndef() || LV.isConstant() ||		if (LV.isUnknownOrUndef() || LV.isConstant() ||
(LV.isConstantRange() && LV.getConstantRange().isSingleElement())) {		(LV.isConstantRange() && LV.getConstantRange().isSingleElement())) {
LLVM_DEBUG(dbgs() << "FnSpecialization: Nothing to do, argument "		LLVM_DEBUG(dbgs() << "FnSpecialization: Nothing to do, argument "
<< A->getNameOrAsOperand() << " is already constant\n");		<< A->getNameOrAsOperand() << " is already constant\n");
return false;		return false;
}		}

// Collect the constant values that the argument can take on. If the
// argument can't take on any constant values, we aren't going to
// specialize the function. While it's possible to specialize the function
// based on non-constant arguments, there's likely not much benefit to
// constant propagation in doing so.
//
// TODO 1: currently it won't specialize if there are over the threshold of
// calls using the same argument, e.g foo(a) x 4 and foo(b) x 1, but it
// might be beneficial to take the occurrences into account in the cost
// model, so we would need to find the unique constants.
//
// TODO 2: this currently does not support constants, i.e. integer ranges.
//
getPossibleConstants(A, Constants);

if (Constants.empty())
return false;

LLVM_DEBUG(dbgs() << "FnSpecialization: Found interesting argument "
<< A->getNameOrAsOperand() << "\n");
return true;		return true;
}		}

/// Collect in \p Constants all the constant values that argument \p A can		/// Check if the value \p V (an actual argument) is a constant or can only
/// take on.		/// have a constant value. Return that constant.
void getPossibleConstants(Argument *A,		Constant *getCandidateConstant(Value *V) {
SmallVectorImpl<CallArgBinding> &Constants) {
Function *F = A->getParent();

// Iterate over all the call sites of the argument's parent function.
for (User *U : F->users()) {
if (!isa<CallInst>(U) && !isa<InvokeInst>(U))
continue;
auto &CS = *cast<CallBase>(U);
// If the call site has attribute minsize set, that callsite won't be
// specialized.
if (CS.hasFnAttr(Attribute::MinSize))
continue;

// If the parent of the call site will never be executed, we don't need
// to worry about the passed value.
if (!Solver.isBlockExecutable(CS.getParent()))
continue;

auto *V = CS.getArgOperand(A->getArgNo());
if (isa<PoisonValue>(V))		if (isa<PoisonValue>(V))
continue;		return nullptr;

// TrackValueOfGlobalVariable only tracks scalar global variables.		// TrackValueOfGlobalVariable only tracks scalar global variables.
if (auto *GV = dyn_cast<GlobalVariable>(V)) {		if (auto *GV = dyn_cast<GlobalVariable>(V)) {
// Check if we want to specialize on the address of non-constant		// Check if we want to specialize on the address of non-constant
// global values.		// global values.
if (!GV->isConstant() && !SpecializeOnAddresses)		if (!GV->isConstant() && !SpecializeOnAddresses)
continue;		return nullptr;

if (!GV->getValueType()->isSingleValueType())		if (!GV->getValueType()->isSingleValueType())
continue;		return nullptr;
}		}

// Select for possible specialisation arguments which are constants or		// Select for possible specialisation values that are constants or
// are deduced to be constants or constant ranges with a single element.		// are deduced to be constants or constant ranges with a single element.
Constant *C = dyn_cast<Constant>(V);		Constant *C = dyn_cast<Constant>(V);
if (!C) {		if (!C) {
const ValueLatticeElement &LV = Solver.getLatticeValueFor(V);		const ValueLatticeElement &LV = Solver.getLatticeValueFor(V);
if (LV.isConstant())		if (LV.isConstant())
C = LV.getConstant();		C = LV.getConstant();
else if (LV.isConstantRange() &&		else if (LV.isConstantRange() &&
LV.getConstantRange().isSingleElement()) {		LV.getConstantRange().isSingleElement()) {
assert(V->getType()->isIntegerTy() && "Non-integral constant range");		assert(V->getType()->isIntegerTy() && "Non-integral constant range");
C = Constant::getIntegerValue(		C = Constant::getIntegerValue(
V->getType(), *LV.getConstantRange().getSingleElement());		V->getType(), *LV.getConstantRange().getSingleElement());
} else		} else
continue;		return nullptr;
}		}

Constants.push_back({&CS, C});		LLVM_DEBUG(dbgs() << "FnSpecialization: Found interesting argument "
}		<< V->getNameOrAsOperand() << "\n");

		return C;
}		}

/// Rewrite calls to function \p F to call function \p Clone instead.		/// Rewrite calls to function \p F to call function \p Clone instead.
///		///
/// This function modifies calls to function \p F as long as the actual		/// This function modifies calls to function \p F as long as the actual
/// arguments match those in \p Args. Note that for recursive calls we		/// arguments match those in \p Args. Note that for recursive calls we
/// need to compare against the cloned formal arguments.		/// need to compare against the cloned formal arguments.
///		///
(189 lines not shown)
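
The gain bookkeeping visible in the diff (a specialisation starts at `0 - Cost`, gains one bonus per constant argument, and is dropped when its gain is not positive unless specialisation is forced) can be sketched in isolation. This is a simplified model with plain integers, not the pass's code: `Candidate`, `computeGain` and `keepProfitable` are hypothetical names, and the bonus values stand in for whatever `getSpecializationBonus` would return.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical stand-in for one call site's specialisation record.
struct Candidate {
  int CallId;
  int Gain;
};

// Gain starts at minus the specialisation cost and grows by one bonus per
// constant argument found at the call site.
int computeGain(int Cost, const std::vector<int> &Bonuses) {
  int Gain = 0 - Cost;
  for (int Bonus : Bonuses)
    Gain += Bonus;
  return Gain;
}

// Drops candidates whose gain is zero or negative, unless Force is set.
std::vector<Candidate> keepProfitable(std::vector<Candidate> Cands,
                                      bool Force) {
  if (!Force)
    Cands.erase(std::remove_if(Cands.begin(), Cands.end(),
                               [](const Candidate &C) { return C.Gain <= 0; }),
                Cands.end());
  return Cands;
}
```

Note that a gain of exactly zero is treated as unprofitable, matching the `Gain <= 0` predicate in the diff.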

This is an archive of the discontinued LLVM Phabricator instance.

[FuncSpec][NFC] Refactor finding specialisation opportunities
ClosedPublic
