This is an archive of the discontinued LLVM Phabricator instance.

Inline Cost improvement - GetElementPtr with constant operands
ClosedPublic

Authored by knaumov on Jun 2 2020, 12:37 PM.

Download Raw Diff

Details

Reviewers

apilipenko
davidxl
mtrofin

Commits

rGd48c7859fbb9: [InlineCost] GetElementPtr with constant operands
rG34fba68d8005: [InlineCost] GetElementPtr with constant operands

Summary

Currently, InlineCost doesn't simplify GetElementPtr with constant operands. In this patch, we are introducing this functionality.
The improvement is built on a similar lambda-structure as the other simplification of instructions with constant operands.

Diff Detail

Event Timeline

knaumov created this revision.Jun 2 2020, 12:37 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 2 2020, 12:37 PM

Herald added subscribers: llvm-commits, haicheng, hiraditya, eraman. · View Herald Transcript

What is the impact on the performance of the generated code?

Also, consider offering an opt-in flag.

llvm/lib/Analysis/InlineCost.cpp
1012	Code style: Index (not index)

Answering @mtrofin 's comments:

As such cases (GEPs from a constant address with constant operands) will be simplified by the pipeline to a constant, InlineCost should be able to see this to apply the correct cost to the instruction. As eventually, the instruction will turn to a constant, the cost is 0.

In D81026#2069569, @knaumov wrote:

Answering @mtrofin 's comments:

As such cases (GEPs from a constant address with constant operands) will be simplified by the pipeline to a constant, InlineCost should be able to see this to apply the correct cost to the instruction. As eventually, the instruction will turn to a constant, the cost is 0.

That sounds fine, my point is that having some run on benchmarks to quantify the benefit would help understand the value of this particular omission from the cost analysis (or at minimum show there's no regression, or understand where the regressions may be). It could shed important insights into what matters for the cost analysis, basically.

Having an optin/out flag would help teams quickly handle any unexpected regressions they may encounter in the field.

knaumov added parent revisions: D81024: InlineCostAnnotationPrinterPass - print constants to which instructions are simplified, D81016: Adding InlineCostAnnotationPrinterPass for Inline Cost Analysis.Jun 3 2020, 7:32 AM

In D81026#2069624, @mtrofin wrote:

That sounds fine, my point is that having some run on benchmarks to quantify the benefit would help understand the value of this particular omission from the cost analysis (or at minimum show there's no regression, or understand where the regressions may be). It could shed important insights into what matters for the cost analysis, basically.

The motivation for this change is the sequence generated by our downstream frontend. Essentially we are looking at something like this:

void foo(ID) {
  X = constant_table[ID];
  if (X == some constant) {
    ...
  }
}

This is a bit oversimplified, but when constant ID is passed into foo it can significantly reduce the cost of inlining. But in order to recognize this we need to recognize a gep of constant operands (the gep is used to compute the address of constant_table[ID]).

Teaching InlineCost to recognize geps of constant operands look like a generic enhancements. If you'd like we can do some performance verification on Clang.

In D81026#2071857, @apilipenko wrote:
In D81026#2069624, @mtrofin wrote:

That sounds fine, my point is that having some run on benchmarks to quantify the benefit would help understand the value of this particular omission from the cost analysis (or at minimum show there's no regression, or understand where the regressions may be). It could shed important insights into what matters for the cost analysis, basically.

The motivation for this change is the sequence generated by our downstream frontend. Essentially we are looking at something like this:
void foo(ID) {
  X = constant_table[ID];
  if (X == some constant) {
    ...
  }
}
This is a bit oversimplified, but when constant ID is passed into foo it can significantly reduce the cost of inlining. But in order to recognize this we need to recognize a gep of constant operands (the gep is used to compute the address of constant_table[ID]).

Teaching InlineCost to recognize geps of constant operands look like a generic enhancements. If you'd like we can do some performance verification on Clang.

If it's easy, it'd be nice to have. I was mostly concerned with staging it, i.e. having first an easy way to revert in the field, should there be a regression, and then removing that flag later - since adding a flag should be very straight forward.

Added flag for the change which is true by default

I have been struggling to collect the data @mtrofin has asked for to prove the usefulness of the patch. I will continue to do so, but meanwhile, I suggest accepting the change. Once I have gathered needed data, I will post new differential presenting the results and (most likely) deleting the flag.

In D81026#2094588, @knaumov wrote:

Added flag for the change which is true by default

I have been struggling to collect the data @mtrofin has asked for to prove the usefulness of the patch. I will continue to do so, but meanwhile, I suggest accepting the change. Once I have gathered needed data, I will post new differential presenting the results and (most likely) deleting the flag.

LGTM

This revision is now accepted and ready to land.Jun 15 2020, 7:33 PM

Closed by commit rG34fba68d8005: [InlineCost] GetElementPtr with constant operands (authored by knaumov). · Explain WhyJun 17 2020, 6:59 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

lib/

Analysis/

InlineCost.cpp

14 lines

test/

Transforms/

Inline/

gep_from_constant.ll

15 lines

Diff 270939

llvm/lib/Analysis/InlineCost.cpp

Show First 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	cl::desc("Compute the full inline cost of a call site even when the cost "
"exceeds the threshold."));		"exceeds the threshold."));

static cl::opt<bool> InlineCallerSupersetNoBuiltin(		static cl::opt<bool> InlineCallerSupersetNoBuiltin(
"inline-caller-superset-nobuiltin", cl::Hidden, cl::init(true),		"inline-caller-superset-nobuiltin", cl::Hidden, cl::init(true),
cl::ZeroOrMore,		cl::ZeroOrMore,
cl::desc("Allow inlining when caller has a superset of callee's nobuiltin "		cl::desc("Allow inlining when caller has a superset of callee's nobuiltin "
"attributes."));		"attributes."));

		static cl::opt<bool> DisableGEPConstOperand(
		"disable-gep-const-evaluation", cl::Hidden, cl::init(false),
		cl::desc("Disables evaluation of GetElementPtr with constant operands"));

namespace {		namespace {
class InlineCostCallAnalyzer;		class InlineCostCallAnalyzer;

// This struct is used to store information about inline cost of a		// This struct is used to store information about inline cost of a
// particular instruction		// particular instruction
struct InstructionCostDetail {		struct InstructionCostDetail {
int CostBefore = 0;		int CostBefore = 0;
int CostAfter = 0;		int CostAfter = 0;
▲ Show 20 Lines • Show All 877 Lines • ▼ Show 20 Lines	bool CallAnalyzer::visitGetElementPtr(GetElementPtrInst &I) {
// Lambda to check whether a GEP's indices are all constant.		// Lambda to check whether a GEP's indices are all constant.
auto IsGEPOffsetConstant = [&](GetElementPtrInst &GEP) {		auto IsGEPOffsetConstant = [&](GetElementPtrInst &GEP) {
for (User::op_iterator I = GEP.idx_begin(), E = GEP.idx_end(); I != E; ++I)		for (User::op_iterator I = GEP.idx_begin(), E = GEP.idx_end(); I != E; ++I)
if (!isa<Constant>(I) && !SimplifiedValues.lookup(I))		if (!isa<Constant>(I) && !SimplifiedValues.lookup(I))
return false;		return false;
return true;		return true;
};		};

		if (!DisableGEPConstOperand)
		if (simplifyInstruction(I, [&](SmallVectorImpl<Constant *> &COps) {
		SmallVector<Constant *, 2> Indices;
		mtrofinUnsubmitted Not Done Reply Inline Actions Code style: Index (not index) mtrofin: Code style: Index (not index)
		for (unsigned int Index = 1 ; Index < COps.size() ; ++Index)
		Indices.push_back(COps[Index]);
		return ConstantExpr::getGetElementPtr(I.getSourceElementType(), COps[0],
		Indices, I.isInBounds());
		}))
		return true;

if ((I.isInBounds() && canFoldInboundsGEP(I)) \|\| IsGEPOffsetConstant(I)) {		if ((I.isInBounds() && canFoldInboundsGEP(I)) \|\| IsGEPOffsetConstant(I)) {
if (SROAArg)		if (SROAArg)
SROAArgValues[&I] = SROAArg;		SROAArgValues[&I] = SROAArg;

// Constant GEPs are modeled as free.		// Constant GEPs are modeled as free.
return true;		return true;
}		}

▲ Show 20 Lines • Show All 1,540 Lines • Show Last 20 Lines

llvm/test/Transforms/Inline/gep_from_constant.ll

This file was added.

				; RUN: opt < %s -passes="print<inline-cost>" 2>&1 \| FileCheck %s

				; CHECK-LABEL: @foo
				; CHECK: cost before = {{.}}, cost after = {{.}}, threshold before = {{.}}, threshold after = {{.}}, cost delta = {{.}}, simplified to i8 addrspace(1)* getelementptr (i8 addrspace(1), i8 addrspace(1)* inttoptr (i64 754974720 to i8 addrspace(1)**), i64 5)

				define i8 addrspace(1)** @foo(i64 %0) {
				%2 = inttoptr i64 754974720 to i8 addrspace(1)**
				%3 = getelementptr i8 addrspace(1), i8 addrspace(1)* %2, i64 %0
				ret i8 addrspace(1)** %3
				}

				define i8 addrspace(1)** @main() {
				%1 = call i8 addrspace(1)** @foo(i64 5)
				ret i8 addrspace(1)** %1
				}