Details

Reviewers

Ayal
dorit
hsaito
samparker

Commits

rG401a324c5186: [LV] Refactor widenIntOrFpInduction. NFC.

Summary

This untangles some of the logic in widenIntOrFpInduction, which was quite necessary IMHO as the different cases were extremely difficult to follow and read. So I've replaced this with more straight-line code, which I would like to do first before making some other functional changes. Most of the time I commit NFC patches directly, but here I would like to get feedback as this is a little bit of a rewrite.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

SjoerdMeijer created this revision. Mar 24 2020, 4:35 AM

Herald added subscribers: rkruppe, hiraditya. Mar 24 2020, 4:35 AM

fhahn added a subscriber: fhahn. Mar 24 2020, 5:58 AM

fhahn added inline comments.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
1813	Instead of implicitly setting Step in the lambdas, would it be possible to have the closures return the step/IV they create and take it as argument if required?
1827–1828	Comment seems out of place now.
1851–1852	The comment seems a bit out of place now, should it be around line 1910?
1852–1854	it might be good to make the VF, Part and Step explicit parameters here (and potentially other places) to make things a bit more explicit at the call sites.

+1 to Florian's suggestions for lambda parameters and some explicit data flow.
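
The explicit data flow requested here can be illustrated outside LLVM. The following is a minimal sketch, assuming plain integers in place of `Value *` and a made-up doubling computation; `implicitStyle` and `explicitStyle` are hypothetical names, not code from the patch. It contrasts a closure that silently writes a captured variable with one that takes its input as a parameter and returns its result.

```cpp
#include <cassert>

// Hypothetical stand-in: an int plays the role of the step value.

// Implicit style: the closure writes to a captured local, so the call site
// "CreateStep()" does not show that Step is being defined.
int implicitStyle(int RawStep) {
  int Step = 0;
  auto CreateStep = [&]() { Step = RawStep * 2; };
  CreateStep();
  return Step;
}

// Explicit style: the input is a parameter and the result is the return
// value, so the data flow is visible where the closure is called.
int explicitStyle(int RawStep) {
  auto CreateStep = [](int Raw) -> int { return Raw * 2; };
  int Step = CreateStep(RawStep);
  return Step;
}
```

Both functions compute the same value; only the second makes the dependency on `RawStep` and the definition of `Step` readable at the call site.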

Thanks for taking a look @fhahn ! Comments addressed.

gilr added a subscriber: gilr. Mar 24 2020, 10:27 AM

Ayal added inline comments. Mar 25 2020, 9:57 AM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
1817–1826	Seems like it may be good to pass in ID.getStep() as an explicit parameter, as it forms the basis of the step being created.
1824	https://llvm.org/docs/CodingStandards.html#don-t-use-else-after-a-return
1832–1833	+1 about the first part of the original comment "If we haven't yet vectorized the induction variable..." seeming out of place here now. But the remaining part that explains what this code is doing, would help here to describe what this lambda is for, including that comment about truncation. Possibly also resurrecting the original comment about ScalarIV (changing "will be" to "is"): // The scalar value to broadcast. This will be derived from the canonical // induction variable.
1851–1852	+1 Some explanation what this lambda is for may be good here though. It basically creates the vector values from the scalar IV, in the absence of creating a vector IV.
1864	Is CreateScalarIVCode() really needed, instead of invoking Value *ScalarIV = CreateScalarIV(Step); CreateSplatIV(ScalarIV, Step); directly, as done below before calling buildScalarSteps()? (They are not folded together because CreateScalarIV() may change Step, right?)
1887	Can continue early-exiting by doing next if (!NeedsScalarIV) { createVectorIntOrFpInductionPHI(ID, Step, EntryVal); return; } cleaning up the checks for NeedsScalarIV below.
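
The early-exit structure suggested in these comments can be modelled in a few lines. This is only a sketch under stated assumptions: `widenPlan` is a hypothetical function, and the returned strings stand in for the code-generation calls visible in the diff (creating the scalar IV, splatting it, creating the vector induction phi, building scalar steps). It is meant to show the shape of the control flow, with no `else` after a `return`, not the real implementation.

```cpp
#include <cassert>
#include <string>

// Hypothetical model of the refactored control flow. Strings stand in for the
// code-generation calls named in the review; nothing here is LLVM API.
std::string widenPlan(unsigned VF, bool NeedsScalarIV, bool ShouldScalarize) {
  // Scalar loop (VF <= 1): create the scalar IV, splat it, and stop.
  if (VF <= 1)
    return "scalar-iv,splat";

  // No scalar users: a vector induction phi is all that is required.
  if (!NeedsScalarIV)
    return "vector-phi";

  // Scalar users exist, but the IV itself is still widened.
  if (!ShouldScalarize)
    return "vector-phi,scalar-iv,scalar-steps";

  // Fallback: no vector phi, so splat the scalar IV and build scalar steps.
  return "scalar-iv,splat,scalar-steps";
}
```

Each case returns as soon as it is handled, so the remaining code never needs to re-check an earlier condition.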

SjoerdMeijer marked 2 inline comments as done. Mar 25 2020, 1:23 PM

SjoerdMeijer added inline comments.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
1864	Ha, that's funny: no, we don't need it. Probably I stared too long at this code to notice this duplication and it is indeed the same as: Value *ScalarIV = CreateScalarIV(Step); CreateSplatIV(ScalarIV, Step); which was actually also my intention. You're exactly right that they are not folded together because CreateScalarIV() may change Step. I wanted to have a one-liner: CreateSplatIV(CreateScalarIV(Step), Step); but that's indeed not possible because Step may change.
1887	Ah, many thanks. That cleans up things even more and this function is becoming a real beauty with just straight-line code.
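
The reason the one-liner does not work can be sketched with plain integers. The order in which C++ evaluates function arguments is unspecified, so in `CreateSplatIV(CreateScalarIV(Step), Step)` the second argument could be read before or after the first call updates `Step`. The functions below are hypothetical models (integers stand in for `Value *`, and "truncation" is modelled as halving); they are not the lambdas from the patch.

```cpp
#include <cassert>
#include <utility>

// Hypothetical model: takes Step by reference and may update it, mirroring
// the truncation case discussed in the review.
int createScalarIV(int &Step, bool Trunc) {
  int ScalarIV = 100;
  if (Trunc) {
    ScalarIV /= 2;
    Step /= 2; // The caller's Step changes here.
  }
  return ScalarIV;
}

// Hypothetical model: consumes both values; here it just returns them.
std::pair<int, int> createSplatIV(int ScalarIV, int Step) {
  return {ScalarIV, Step};
}

std::pair<int, int> widen(int Step, bool Trunc) {
  // Two statements force the update of Step to happen before it is read.
  // Writing createSplatIV(createScalarIV(Step, Trunc), Step) instead would
  // leave it unspecified whether the old or the new Step is passed.
  int ScalarIV = createScalarIV(Step, Trunc);
  return createSplatIV(ScalarIV, Step);
}
```

Splitting the call into two statements costs one extra line and makes the dependency on the updated `Step` explicit.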

Many thanks for reviewing!
I think this has all comments addressed.

Ayal added inline comments. Mar 25 2020, 5:33 PM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
1822	There's one more ID.getStep() >> Step
1882–1883	Not sure where this first "If an induction ... doesn't need to be widened" sentence belongs now, but it doesn't fit here. The rest does fit before calling buildScalarStep(), which is also/firstly invoked above... better place it there?

replaced ID.getStep() -> Step
removed the sentence on line 1899 - 1900, and
moved the comments on lines 1900 - 1904 to 1889.

This looks good to me, thanks. The title should be more descriptive, e.g., "[LV] Refactor widenIntOrFpInduction, NFC".
Please wait a day or so to see if @fhahn or @samparker have further comments.

This revision is now accepted and ready to land. Mar 26 2020, 3:06 AM

Will do, and many thanks for all your reviews!

And just fyi, I've uploaded work-in-progress patch D76838 that builds on top of this refactoring.

Closed by commit rG401a324c5186: [LV] Refactor widenIntOrFpInduction. NFC. (authored by SjoerdMeijer). Mar 27 2020, 6:29 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. Mar 27 2020, 6:29 AM

Diff 253098

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp


assert((IV->getType()->isIntegerTy() || IV != OldInduction) &&		assert((IV->getType()->isIntegerTy() || IV != OldInduction) &&
"Primary induction variable must have an integer type");		"Primary induction variable must have an integer type");

auto II = Legal->getInductionVars().find(IV);		auto II = Legal->getInductionVars().find(IV);
assert(II != Legal->getInductionVars().end() && "IV is not an induction");		assert(II != Legal->getInductionVars().end() && "IV is not an induction");

auto ID = II->second;		auto ID = II->second;
assert(IV->getType() == ID.getStartValue()->getType() && "Types must match");		assert(IV->getType() == ID.getStartValue()->getType() && "Types must match");

// The scalar value to broadcast. This will be derived from the canonical
// induction variable.
Value *ScalarIV = nullptr;

// The value from the original loop to which we are mapping the new induction		// The value from the original loop to which we are mapping the new induction
// variable.		// variable.
Instruction *EntryVal = Trunc ? cast<Instruction>(Trunc) : IV;		Instruction *EntryVal = Trunc ? cast<Instruction>(Trunc) : IV;

// True if we have vectorized the induction variable.		auto &DL = OrigLoop->getHeader()->getModule()->getDataLayout();
		fhahn (inline comment): Instead of implicitly setting Step in the lambdas, would it be possible to have the closures return the step/IV they create and take it as argument if required?
auto VectorizedIV = false;

// Determine if we want a scalar version of the induction variable. This is
// true if the induction variable itself is not widened, or if it has at
// least one user in the loop that is not widened.
auto NeedsScalarIV = VF > 1 && needsScalarInduction(EntryVal);

// Generate code for the induction step. Note that induction steps are		// Generate code for the induction step. Note that induction steps are
// required to be loop-invariant		// required to be loop-invariant
assert(PSE.getSE()->isLoopInvariant(ID.getStep(), OrigLoop) &&		auto CreateStepValue = [&](const SCEV *Step) -> Value * {
		assert(PSE.getSE()->isLoopInvariant(Step, OrigLoop) &&
"Induction step should be loop invariant");		"Induction step should be loop invariant");
auto &DL = OrigLoop->getHeader()->getModule()->getDataLayout();
Value *Step = nullptr;
if (PSE.getSE()->isSCEVable(IV->getType())) {		if (PSE.getSE()->isSCEVable(IV->getType())) {
SCEVExpander Exp(*PSE.getSE(), DL, "induction");		SCEVExpander Exp(*PSE.getSE(), DL, "induction");
Step = Exp.expandCodeFor(ID.getStep(), ID.getStep()->getType(),		return Exp.expandCodeFor(Step, Step->getType(),
		Ayal (inline comment): There's one more ID.getStep() >> Step
LoopVectorPreHeader->getTerminator());		LoopVectorPreHeader->getTerminator());
} else {
Step = cast<SCEVUnknown>(ID.getStep())->getValue();
}

// Try to create a new independent vector induction variable. If we can't
// create the phi node, we will splat the scalar induction variable in each
// loop iteration.
if (VF > 1 && !shouldScalarizeInstruction(EntryVal)) {
createVectorIntOrFpInductionPHI(ID, Step, EntryVal);
VectorizedIV = true;
}		}
		Ayal (inline comment): https://llvm.org/docs/CodingStandards.html#don-t-use-else-after-a-return
		return cast<SCEVUnknown>(Step)->getValue();
		};
		Ayal (inline comment): Seems like it may be good to pass in ID.getStep() as an explicit parameter, as it forms the basis of the step being created.

// If we haven't yet vectorized the induction variable, or if we will create		// The scalar value to broadcast. This is derived from the canonical
		fhahn (inline comment): Comment seems out of place now.
// a scalar one, we need to define the scalar induction variable and step		// induction variable. If a truncation type is given, truncate the canonical
// values. If we were given a truncation type, truncate the canonical
// induction variable and step. Otherwise, derive these values from the		// induction variable and step. Otherwise, derive these values from the
// induction descriptor.		// induction descriptor.
if (!VectorizedIV || NeedsScalarIV) {		auto CreateScalarIV = [&](Value *&Step) -> Value * {
ScalarIV = Induction;		Value *ScalarIV = Induction;
		Ayal (inline comment): +1 about the first part of the original comment "If we haven't yet vectorized the induction variable..." seeming out of place here now. But the remaining part that explains what this code is doing, would help here to describe what this lambda is for, including that comment about truncation. Possibly also resurrecting the original comment about ScalarIV (changing "will be" to "is"): // The scalar value to broadcast. This will be derived from the canonical // induction variable.
if (IV != OldInduction) {		if (IV != OldInduction) {
ScalarIV = IV->getType()->isIntegerTy()		ScalarIV = IV->getType()->isIntegerTy()
? Builder.CreateSExtOrTrunc(Induction, IV->getType())		? Builder.CreateSExtOrTrunc(Induction, IV->getType())
: Builder.CreateCast(Instruction::SIToFP, Induction,		: Builder.CreateCast(Instruction::SIToFP, Induction,
IV->getType());		IV->getType());
ScalarIV = emitTransformedIndex(Builder, ScalarIV, PSE.getSE(), DL, ID);		ScalarIV = emitTransformedIndex(Builder, ScalarIV, PSE.getSE(), DL, ID);
ScalarIV->setName("offset.idx");		ScalarIV->setName("offset.idx");
}		}
if (Trunc) {		if (Trunc) {
auto *TruncType = cast<IntegerType>(Trunc->getType());		auto *TruncType = cast<IntegerType>(Trunc->getType());
assert(Step->getType()->isIntegerTy() &&		assert(Step->getType()->isIntegerTy() &&
"Truncation requires an integer step");		"Truncation requires an integer step");
ScalarIV = Builder.CreateTrunc(ScalarIV, TruncType);		ScalarIV = Builder.CreateTrunc(ScalarIV, TruncType);
Step = Builder.CreateTrunc(Step, TruncType);		Step = Builder.CreateTrunc(Step, TruncType);
}		}
}		return ScalarIV;
		};

// If we haven't yet vectorized the induction variable, splat the scalar		// Create the vector values from the scalar IV, in the absence of creating a
		fhahn (inline comment): The comment seems a bit out of place now, should it be around line 1910?
		Ayal (inline comment): +1 Some explanation what this lambda is for may be good here though. It basically creates the vector values from the scalar IV, in the absence of creating a vector IV.
// induction variable, and build the necessary step vectors.		// vector IV.
// TODO: Don't do it unless the vectorized IV is really required.		auto CreateSplatIV = [&](Value *ScalarIV, Value *Step) {
		fhahn (inline comment): It might be good to make the VF, Part and Step explicit parameters here (and potentially other places) to make things a bit more explicit at the call sites.
if (!VectorizedIV) {
Value *Broadcasted = getBroadcastInstrs(ScalarIV);		Value *Broadcasted = getBroadcastInstrs(ScalarIV);
for (unsigned Part = 0; Part < UF; ++Part) {		for (unsigned Part = 0; Part < UF; ++Part) {
Value *EntryPart =		Value *EntryPart =
getStepVector(Broadcasted, VF * Part, Step, ID.getInductionOpcode());		getStepVector(Broadcasted, VF * Part, Step, ID.getInductionOpcode());
VectorLoopValueMap.setVectorValue(EntryVal, Part, EntryPart);		VectorLoopValueMap.setVectorValue(EntryVal, Part, EntryPart);
if (Trunc)		if (Trunc)
addMetadata(EntryPart, Trunc);		addMetadata(EntryPart, Trunc);
recordVectorLoopValueForInductionCast(ID, EntryVal, EntryPart, Part);		recordVectorLoopValueForInductionCast(ID, EntryVal, EntryPart, Part);
}		}
		};
		Ayal (inline comment): Is CreateScalarIVCode() really needed, instead of invoking Value *ScalarIV = CreateScalarIV(Step); CreateSplatIV(ScalarIV, Step); directly, as done below before calling buildScalarSteps()? (They are not folded together because CreateScalarIV() may change Step, right?)
		SjoerdMeijer (author, inline comment, done): Ha, that's funny: no, we don't need it. Probably I stared too long at this code to notice this duplication and it is indeed the same as: Value *ScalarIV = CreateScalarIV(Step); CreateSplatIV(ScalarIV, Step); which was actually also my intention. You're exactly right that they are not folded together because CreateScalarIV() may change Step. I wanted to have a one-liner: CreateSplatIV(CreateScalarIV(Step), Step); but that's indeed not possible because Step may change.

		// Now do the actual transformations, and start with creating the step value.
		Value *Step = CreateStepValue(ID.getStep());
		if (VF <= 1) {
		Value *ScalarIV = CreateScalarIV(Step);
		CreateSplatIV(ScalarIV, Step);
		return;
		}

		// Determine if we want a scalar version of the induction variable. This is
		// true if the induction variable itself is not widened, or if it has at
		// least one user in the loop that is not widened.
		auto NeedsScalarIV = needsScalarInduction(EntryVal);
		if (!NeedsScalarIV) {
		createVectorIntOrFpInductionPHI(ID, Step, EntryVal);
		return;
		}

		// Try to create a new independent vector induction variable. If we can't
		Ayal (inline comment): Not sure where this first "If an induction ... doesn't need to be widened" sentence belongs now, but it doesn't fit here. The rest does fit before calling buildScalarStep(), which is also/firstly invoked above... better place it there?
		// create the phi node, we will splat the scalar induction variable in each
		// loop iteration.
		if (!shouldScalarizeInstruction(EntryVal)) {
		createVectorIntOrFpInductionPHI(ID, Step, EntryVal);
		Ayal (inline comment): Can continue early-exiting by doing next if (!NeedsScalarIV) { createVectorIntOrFpInductionPHI(ID, Step, EntryVal); return; } cleaning up the checks for NeedsScalarIV below.
		SjoerdMeijer (author, inline comment, done): Ah, many thanks. That cleans up things even more and this function is becoming a real beauty with just straight-line code.
		Value *ScalarIV = CreateScalarIV(Step);
		// Create scalar steps that can be used by instructions we will later
		// scalarize. Note that the addition of the scalar steps will not increase
		// the number of instructions in the loop in the common case prior to
		// InstCombine. We will be trading one vector extract for each scalar step.
		buildScalarSteps(ScalarIV, Step, EntryVal, ID);
		return;
}		}

// If an induction variable is only used for counting loop iterations or		// If we haven't yet vectorized the induction variable, splat the scalar
// calculating addresses, it doesn't need to be widened. Create scalar steps		// induction variable, and build the necessary step vectors.
// that can be used by instructions we will later scalarize. Note that the		// TODO: Don't do it unless the vectorized IV is really required.
// addition of the scalar steps will not increase the number of instructions		Value *ScalarIV = CreateScalarIV(Step);
// in the loop in the common case prior to InstCombine. We will be trading		CreateSplatIV(ScalarIV, Step);
// one vector extract for each scalar step.
if (NeedsScalarIV)
buildScalarSteps(ScalarIV, Step, EntryVal, ID);		buildScalarSteps(ScalarIV, Step, EntryVal, ID);
}		}

Value *InnerLoopVectorizer::getStepVector(Value *Val, int StartIdx, Value *Step,		Value *InnerLoopVectorizer::getStepVector(Value *Val, int StartIdx, Value *Step,
Instruction::BinaryOps BinOp) {		Instruction::BinaryOps BinOp) {
// Create and check the types.		// Create and check the types.
assert(Val->getType()->isVectorTy() && "Must be a vector");		assert(Val->getType()->isVectorTy() && "Must be a vector");
int VLen = Val->getType()->getVectorNumElements();		int VLen = Val->getType()->getVectorNumElements();


This is an archive of the discontinued LLVM Phabricator instance.

[LV] widenIntOrFpInduction. NFC.
ClosedPublic
