This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Vectorize/
-
Transforms/
-
Vectorize/
38/41
LoopVectorize.cpp
32/32
VPlan.h
4/5
VPlan.cpp
8/8
VPlanRecipes.cpp
16/16
VPlanTransforms.cpp
4/4
VPlanValue.h
-
test/Transforms/LoopVectorize/
-
Transforms/
-
LoopVectorize/
-
AArch64/
3/3
sve-tail-folding-forced.ll
-
widen-call-with-intrinsic-or-libfunc.ll
-
RISCV/
-
riscv-vector-reverse.ll
-
first-order-recurrence-chains-vplan.ll
-
first-order-recurrence-sink-replicate-region.ll
-
icmp-uniforms.ll
10/11
interleave-and-scalarize-only.ll
-
vplan-dot-printing.ll
-
vplan-printing.ll
-
vplan-sink-scalars-and-merge-vf1.ll
-
vplan-sink-scalars-and-merge.ll

Differential D133758

[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe.
ClosedPublic

Authored by fhahn on Sep 13 2022, 1:20 AM.

Download Raw Diff

Details

Reviewers

Ayal
gilr
rengolin

Commits

rG0fa666ecedc3: [VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe.

Summary

This patch splits off the logic to transform the canonical IV to a
a value for an induction with a different start and step. This
transformation only needs to be done once (independent of VF/UF) and
enables sinking of VPScalarIVStepsRecipe as follow-up.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision.Sep 13 2022, 1:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 13 2022, 1:20 AM

Herald added subscribers: tschuett, psnobl, rogfer01 and 2 others. · View Herald Transcript

fhahn requested review of this revision.Sep 13 2022, 1:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 13 2022, 1:20 AM

Herald added subscribers: • pcwang-thead, vkmr. · View Herald Transcript

fhahn added a child revision: D133760: [VPlan] Support sinking VPScalarIVStepsRecipe..Sep 13 2022, 1:23 AM

Harbormaster completed remote builds in B186317: Diff 459673.Sep 13 2022, 2:23 AM

Nice refactoring - potentially closing a gap between VPlan and original post-vectorization IR sink scalar operands?

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
8696	Can alternatively provide the desired constant 1 VPValue when asked to retrieve getStep()? (Such constant operands are analogous to Attributes in MLIR.) (Unlike the constant Start which VPlan::prepareToExecute() may replace later with a nonzero value, something worth fixing...)
9527	`assert(!State.Instance && "VPTransformedIVRecipe being replicated.");` ?
9537	Some things seem a bit confusing here, looking at the existing code: VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? The original code has both `ScalarIV` and `CanonicalIV` - are they not the same - one retrieves a Value per lane (0,0) and the other per part (0) - used only to check its type? Now `TransformedIV` is also "Scalar" (as in non-Vector) similar to `ScalarIV`. Perhaps instead of `Value ScalarIV = State.get(getCanonicalIV(), VPIteration(0, 0));` we should have `Value CanonicalIV = State.get(getCanonicalIV(), VPIteration(0, 0));` ? Perhaps instead of `TransformedIV` have `NonCanonicalIV`, `AffineIV` or `DerivedIV` - considering that the canonical IV is aka a "BasicIV"? Then rename `VPTransformedIVRecipe` accordingly? Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane.
9550	assert(TransformedIV != ScalarIV && "..."); ?
9569–9570	ditto: everything here is scalar IV. Perhaps BaseIV, FirstLaneScalarIV, or Start - reviving getStartValue() to wrap getOperand(0)?
9572	Is this the source for introducing a unit step as operand to Can[onical]IV?
9574	Have both producing recipes feed VPScalarIVStepsRecipe() with their step value as another operand, reviving getStepValue() to retrieve it?
llvm/lib/Transforms/Vectorize/VPlan.cpp
642–643	Suggest to add a FIXME to find a better way than this for VPlan to represent epilogue loop.
llvm/lib/Transforms/Vectorize/VPlan.h
1131	second >> last
1992	Worth clarifying which scalarized versions are actually generated.
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
405	Should this be a method of VPCanonicalIVPHIRecipe, checking if a given Start and Step match those of its own?
414	Suggest to first set `VPCanonicalIVPHIRecipe CanonicalIV = Plan.getCanonicalIV();` (or auto) to avoid casting below. Then set VPValue Start, Step to be either those of CanonicalIV or those of the new VPTransformedIVRecipe, to feed the new VPScalarIVStepsRecipe?

Address latest comments, thanks!

Herald added subscribers: frasercrmck, luismarques, apazos and 18 others. · View Herald TranscriptOct 18 2022, 11:17 AM

fhahn retitled this revision from [VPlan] Add VPTransformedIVRecipe, use for VPScalarIVStepsRecipe. to [VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe..Oct 18 2022, 11:20 AM

fhahn added inline comments.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
8696	Updated, thanks!
9527	Added, thanks!
9537	VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? Yes we could also use `VPlan->getCanonicalIV()`, but it might be easier to follow if modeled explicitly? The original code has both ScalarIV and CanonicalIV - are they not the same - one retrieves a Value per lane (0,0) and the other per part (0) - used only to check its type? Yep, that should be cleaner in the new code. Perhaps instead of Thanks, I updated the naming to use `CanonicalIV` and `DerivedIV`. I also renamed `VPTransformedIVRecipe` -> `VPDerivedIVRecipe` Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane. Good idea, I'll see about that separately.
9550	Thanks, added the assert.
9569–9570	Updated to use `BaseIV`. I kept `getOperand(0)` for now, as it seems like it may be confused with the different behavior of the existing `getStartValue()`.
9572	Yes but that's refactored now.
9574	I left things as they are for now, after refactoring `getSTepValue` in `VPCanonicalIVPHIrecipe`.
llvm/lib/Transforms/Vectorize/VPlan.cpp
642–643	Added, thanks!
llvm/lib/Transforms/Vectorize/VPlan.h
1131	That's not needed any longer.
1172	Should be gone now.
1992	Added an explanation, thanks!
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
405	Added a helper.
414	Updated to have a separate `CanIV` variable, thanks!

Harbormaster completed remote builds in B192805: Diff 468629.Oct 18 2022, 12:56 PM

Ahh, this brings up some further thoughts...

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9544	nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter?
9570	Ahh, operand 0 of VPScalarIVStepsRecipe is providing both the start value (directly - BaseIV) and the step (indirectly - by casting operand 0 into a recipe and asking it for its step). Better represent all def-use relations explicitly by passing VPValues (only) between recipes. This can be done by propagating Step instead of delegating it, e.g.: if SCEV is needed then have a common VPExpandSCEVRecipe (placed in the preheader) take care of generating the `VPValue *Step = vputils::getOrCreateVPValueForSCEVExpr(Plan, ID.getStep(), SE);` and feed it to both VPDerivedIVRecipe and VPScalarIVStepsRecipe. If SCEV is not used, have a Plan.getOrAddVPValue(1)) of the desired type feed these recipes? Sounds reasonable?
9578	This truncation of Step is needed only if fed directly from CanonicalIV, because the Step produced by DeriveIV should have the desired (value and) type, including a truncation at the end if needed, right? Alternatively, introduce a DeriveIV recipe also if only truncation of Step is needed, so as not to sink it into triangles? (Or have ScalarIV recipe take care of TruncToTy always, also for derived steps, relieving DeriveIV of doing so, though this would sink into triangles.)
llvm/lib/Transforms/Vectorize/VPlan.cpp
659	... or DerivedIV ...
llvm/lib/Transforms/Vectorize/VPlan.h
1172	nit: inlining can be applied separately.
1860	Check if this method can remain const if this recipe no longer needs to delegate its Step.
1865	Consider removing and feeding users directly with a constant 1 VPValue when needed upon their construction, to keep this recipe with a single value (Start) VPDef. Otherwise it can provide both Start and Step as parts of its multi-valued VPDef... (nit: const?)
1875	nit: extra space "and the" step is also the same (1), as type and start; or does the original canonical unit step become UF * VF?
1961	Worth clarifying the use of both Ty (type to cast to before conversion) and TruncToTy (type to cast to after conversion)?
2020	nit: 2nd, 1st, 3rd? Also clarify involved type casts?
2029	Return operand 0 inline, or outlined to avoid cast?
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1058	Also check for same type, as claimed at the interface?
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
400–401	nit: define `ID` and `TruncI` below slightly closer to first use?
404	`IVR` may be a confusing name, defined as a VPValue* rather than a Recipe. `IV` also stands for a recipe, conflicting with `IVR`. How about renaming `IVR` to something like `BaseIV`, and define it as a VPRecipeBase*? It stands for the recipe providing a single scalar value per iteration of vectorized & unrolled loop with the desired type/start/step values, as a Base on which to build scalar steps - a scalar value per lane and part. This is either the canonical IV recipe if suitable or a newly introduced derived IV recipe which transforms it. Perhaps also rename `IV` to `WidenIV`, and spell out `CanIV` to `CanonicalV`.
407	TruncTy here is the desired type of the resulting scalar steps; should it be supplied to isCanonical() along with ID in order to check if CanonicalIV is providing the suitable start/step/type for scalar users of `Phi`, or else a derived recipe is needed?
410	nit: (re)use `IVTy` nit: use `Can[onical]IV` directly instead of `IVR`.
413	nit: was there some helper to get Def as RecipeBase?
llvm/lib/Transforms/Vectorize/VPlanValue.h
103	Transformed or Derived? Lex order
362	Lex order
llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll
22	This does appear less descriptive than having SCALAR-STEPS depict the step it uses?
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	While we're here, `ir<false>`, `ir<true>` seem odd (and even ;-))

fhahn updated this revision to Diff 473523.Nov 6 2022, 4:49 PM

fhahn marked 13 inline comments as done.

Address latest comments, thanks!

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9544	nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() Done! TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter? It looks like this is only needed for the use here and not the other uses, so it seems simpler to keep to code here?
9570	Thanks, updated to keep adding the step to VPScalarIVStepsRecipe as well.
9578	at the moment, a derived IV recipe is created if only a truncate is needed, but the derived IV will only be truncated at the end, so it uses the wide step whereas for the steps recipe we need the truncated step. I think to unify this we would have to compute the derived IV with truncated steps, but that would be a bigger, unrelated change,
llvm/lib/Transforms/Vectorize/VPlan.cpp
659	added, thanks
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1058	updated, thanks!
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
400–401	done, thanks!
404	Updated, thanks! I kept the type as VPVale as this requires using getDef just once.
407	done, thanks!
410	done, thanks!
413	the patch has not landed yet (D136068)
llvm/lib/Transforms/Vectorize/VPlanValue.h
103	done thanks!
362	done thanks!
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	I guess that's because they are boolean values and the IR get printed as boolean literals by the IR printer. Do you think this is something that should be changed?

Harbormaster completed remote builds in B196372: Diff 473523.Nov 6 2022, 5:39 PM

ping :)

pong :-)

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9537	VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? Yes we could also use VPlan->getCanonicalIV(), but it might be easier to follow if modeled explicitly? Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane. Good idea, I'll see about that separately. Found a good place?
9544	TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter? It looks like this is only needed for the use here and not the other uses, so it seems simpler to keep to code here? Hmm, on the contrary - it appears all users cast the 2nd operand to match the type of the 4th operand before calling emitTransformedIndex(): VTC/"cast.vtc" casts VectorTripCount and AdditionalBypass.second to StepType, CMO/"cast.cmo" casts CountMinusOne to Step Type, PtrInd casts CanonicalIV to Step Type in order for Idx and GlobalIdx to be the desired type. Here `Step` is obtained from an operand via State, `Ty` is recorded in the recipe, and we eventually assert that the type of Step matches Ty. Seems overly complicated? Would it be simpler, here and also at the other callers, to feed emitTransformedIndex() or emitDerivedIndex() directly with the original IV and Step, and let it do the necessary casting of the former to match the type of the latter?
9569	Another type reconciliation worth taking care of by the callee instead of having it assert it? buildScalarSteps() in this case.
9569–9570	I kept getOperand(0) for now, as it seems like it may be confused with the different behavior of the existing getStartValue(). Not sure what the source of confusion is, but it may appear clearer to have either Value BaseIV = State.get(getOperand(0), VPIteration(0, 0)); Value Step = State.get(getOperand(1), VPIteration(0, 0)); or Value BaseIV = State.get(getBaseIV(), VPIteration(0, 0)); Value Step = State.get(getStepValue(), VPIteration(0, 0));
9578	Perhaps worth leaving a note behind about said bigger unrelated change.
llvm/lib/Transforms/Vectorize/VPlan.h
1860	Cannot return a const Type because ConstantInt::get() expects a non-const type, in VPCanonicalIVPHIRecipe::getStepValue()?
1865	Is VPCanonicalIVPHIRecipe::getStepValue() still needed?
1875	I mean, would the following rephrasing be better: `/// i.e., has the same start, step (of 1), and type as the canonical IV.` ?
1961	Worth removing Ty altogether from VPDerivedIVRecipe - Step provides the desired type.
1972	nit: Derived IV stands for producing Start + CanonicalIV * Step, so seems more natural to order its operands and dump them in this order? Possibly with + and * instead of separating commas, possibly along with type cast information.
1996	Reordering the operands will also simplify this documentation.
2005	Suffice to have VPValue *getBasicIVValue() const { return getOperand(0); } instead?
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1065	nit: can check `ConstantInt *Step = ID.getConstIntStepValue()` as in Loop::isCanonical(). nit: can assert ID.getInductionOpcode() == Instruction::Add.
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
396	nit: TruncI is used for its type only, TruncTy may better be called ResultTy, IVTy is hopefully unneeded if VPDerivedRecipe can be constructed w/o it; consider setting: Type IVTy = WideIV->getPHINode()->getType(); Type ResultTy = IVTy; Type TruncInstTy = nullptr; if (auto TruncI = WideIV->getTruncInst()) { TruncInstTy = TruncI->getType(); ResultTy = TruncInstTy; }
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	Oh well, the input IR is adding and subtracting true and false... The reason for having DERIVED-IV with canonical start `ir<false>` ==0 and step `ir<true>` ==1 is because of type expansion and/or truncation?
188	The reason for having DERIVED-IV with canonical start ir<0> and step ir<1> is because of type expansion and/or truncation? Perhaps worth dumping the distinct types, as this operation is effectively a cast.
189	nit: check the second (ir<1>) operand of SCALAR-STEPS, for completeness? It is in practice either +1 or -1 ...

fhahn mentioned this in rG12bb5535d270: [VPlan] Move cast codegen to emitTransformedIndex (NFCI)..Nov 26 2022, 2:47 PM

fhahn mentioned this in rGbf0bd85f9d82: [LV] Move trunc codegen to buildScalarSteps (NFCI)..Nov 26 2022, 3:50 PM

Thank you very much Ayal! Comments should be addressed, hopefully I didn't miss any.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9537	Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) I think `emitTransformedIndex` needs the canonical IV. Found a good place? I put up D138748 to add it to the VPHeaderPHIRecipe documentation.
9544	Simplified in 12bb5535d270, thanks!
9569	Refactored in bf0bd85f9d82 and removed here, thanks!
llvm/lib/Transforms/Vectorize/VPlan.h
1860	Unfortunately yes!
1865	It's not needed in the latest version, removed, thanks!
1961	Removed in the latest version, thanks!
1972	Updated, thanks!
1996	Updated, thanks!
2005	Updated, thanks!
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1065	Updated, thanks! I turns out the assertion triggered in some tests, added as an extra condition.
llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
396	Simplified, thanks!
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	Here the inputs are already truncated to `i1` before recipe construction so the recipe doesn't truncated itself.
188	Updated, thanks!
189	Done, thanks!

Harbormaster completed remote builds in B199633: Diff 478071.Nov 26 2022, 4:36 PM

Ayal added inline comments.Nov 27 2022, 8:15 AM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
2343	Above assert is now redundant. Can hoist the comment above and rephrase it, e.g., "Ensure scalar IV and step have the same integer type", or rather "Ensure step has the same type as that of scalar IV"?
9537	Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) I think `emitTransformedIndex` needs the canonical IV. Hmm, `emitTransformedIndex` just needs an "Index", i.e., an "IV" or "BasicIV", regardless if it is "the canonical" IV.
9543	nit: suffice to ask if TruncToTy != DerivedIV->getType() as the latter is never null?
9544	assert(Step->getType()->isIntegerTy()) belongs earlier if still needed here at all?
9569	Refactored in bf0bd85f9d82 and removed here, thanks! Good! Thanks!
llvm/lib/Transforms/Vectorize/VPlan.h
1875	above nit worth addressing?
1961	nit: it seems TruncToTy is always non-null, can simplify later check?
1969	CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per-part/per-lane, regardless if it is "the canonical" IV? TruncToTy >> ResultTy? I.e., always specifies the type of the result, never nullptr?
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1053	turns out the assertion triggered in some tests, added as an extra condition. Good! Worth a comment?
1055	steps-recipe?
llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll
22	Ah, the step it uses is explicit - it's ir<1>! What has been omitted is "start" (ir<0>), which is moved from ScalarIVSteps to DerivedIV, if needed.
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	Trying to clarify why DERIVED-IV is needed at all here (too), given that it starts at 0 (false) and bumps with a step of 1 (true)?

Address latest comments, thanks!

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
2343	Updated, thanks!
9537	Right, but VPDerivedIVRecipe will always use the canonical IV, at least to start with IIUC.
9543	Adjusted, thanks!
9544	I might be missing something, but I think before the patch we also only had this assert for the case we need to truncate, as this can only be done for integer types. The induction could also be a floating point IV in general here I think.
9578	Added to `buildScalarSteps`.
llvm/lib/Transforms/Vectorize/VPlan.h
1172	I think it was already inlined in current `main`, it just got re-formatted here. Undid the formatting change.
1875	Missed those earlier, should be adjusted, thanks!
1961	Adjusted, thanks!
1969	CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per-part/per-lane, regardless if it is "the canonical" IV? I kept it as CanonicalIV for now, as this is what all current clients use (also updated the constructor to require `VPCanonicalIVPHIRecipe`. I can change it if you would prefer, but can also do that if we ever lift the restriction. TruncToTy >> ResultTy? I.e., always specifies the type of the result, never nullptr? Updated, thanks!
llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
1053	Moved the check to the return and move the part about incrementing it by one there.
1055	Adjusted, thanks!
llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll
22	Ah I missed this comment earlier, Yes, DerivedIV will now take care of adjusting the start value, ScalarIVSteps just generate steps.
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	I think `DERIVED-IV` here is for `%d = phi i1 ...`, which has a different type than the canonical IV, but doesn't itself need truncating because the operands are already `i1`.

Harbormaster completed remote builds in B199665: Diff 478116.Nov 27 2022, 3:50 PM

This looks good to me, ship it!

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
2340	Is this TODO an NFC to simplify the code, w/o affecting the generated code?
9537	Agreed, VPDerivedIVRecipe is (currently) always fed the canonical IV, but it can compute a derived IV given any BaseIV, i.e., does not rely on it being The Canonical IV.
9544	Ahh, sorry, agreed. (Confused by same assert-guarding-trunc in buildScalarSteps() and thought one could check if ResultTy differs from Step->getType() instead of DerivedIV->getType() thereby asserting earlier before emitTransformedIndex(). But current code is fine.)
llvm/lib/Transforms/Vectorize/VPlan.cpp
659	worth adding in the error message as well
llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll
73–75	Ah, right; DERIVED-IV truncates CAN_IV to i1 before Mul & Add, which is not dumped-out like the truncation to ResultTy.

This revision is now accepted and ready to land.Nov 27 2022, 3:58 PM

This revision was landed with ongoing or failed builds.Nov 28 2022, 8:32 AM

Closed by commit rG0fa666ecedc3: [VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe. (authored by fhahn). · Explain Why

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rG0fa666ecedc3: [VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe..

fhahn added a reverting change: rGbf15f1e489aa: Revert "[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe.".Nov 28 2022, 2:43 PM

@fhahn one of our internal tests also hit the same assertion failure and I have a reduced testcase for it if it helps. I was going to file a bug for it, but since you have already reverted the change, I'll hold off for now. Let me know if you would like the testcase we found.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Vectorize/

57 lines

91 lines

4 lines

40 lines

37 lines

2 lines

test/

Transforms/

LoopVectorize/

AArch64/

sve-tail-folding-forced.ll

2 lines

widen-call-with-intrinsic-or-libfunc.ll

4 lines

RISCV/

riscv-vector-reverse.ll

22 lines

first-order-recurrence-chains-vplan.ll

4 lines

first-order-recurrence-sink-replicate-region.ll

13 lines

icmp-uniforms.ll

2 lines

interleave-and-scalarize-only.ll

23 lines

vplan-dot-printing.ll

2 lines

vplan-printing.ll

22 lines

vplan-sink-scalars-and-merge-vf1.ll

2 lines

vplan-sink-scalars-and-merge.ll

26 lines

Diff 473523

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,331 Lines • ▼ Show 20 Lines

/// Compute scalar induction steps. \p ScalarIV is the scalar induction		/// Compute scalar induction steps. \p ScalarIV is the scalar induction
/// variable on which to base the steps, \p Step is the size of the step.		/// variable on which to base the steps, \p Step is the size of the step.
static void buildScalarSteps(Value ScalarIV, Value Step,		static void buildScalarSteps(Value ScalarIV, Value Step,
const InductionDescriptor &ID, VPValue *Def,		const InductionDescriptor &ID, VPValue *Def,
VPTransformState &State) {		VPTransformState &State) {
IRBuilderBase &Builder = State.Builder;		IRBuilderBase &Builder = State.Builder;
// We shouldn't have to build scalar steps if we aren't vectorizing.		// We shouldn't have to build scalar steps if we aren't vectorizing.
// Get the value type and ensure it and the step have the same integer type.		// Get the value type and ensure it and the step have the same integer type.
		AyalUnsubmitted Not Done Reply Inline Actions Is this TODO an NFC to simplify the code, w/o affecting the generated code? Ayal: Is this TODO an NFC to simplify the code, w/o affecting the generated code?
Type *ScalarIVTy = ScalarIV->getType()->getScalarType();		Type *ScalarIVTy = ScalarIV->getType()->getScalarType();
assert(ScalarIVTy == Step->getType() &&		assert(ScalarIVTy == Step->getType() &&
"Val and Step should have the same type");		"Val and Step should have the same type");
		AyalUnsubmitted Done Reply Inline Actions Above assert is now redundant. Can hoist the comment above and rephrase it, e.g., "Ensure scalar IV and step have the same integer type", or rather "Ensure step has the same type as that of scalar IV"? Ayal: Above assert is now redundant. Can hoist the comment above and rephrase it, e.g., "Ensure…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!

// We build scalar steps for both integer and floating-point induction		// We build scalar steps for both integer and floating-point induction
// variables. Here, we determine the kind of arithmetic we will perform.		// variables. Here, we determine the kind of arithmetic we will perform.
Instruction::BinaryOps AddOp;		Instruction::BinaryOps AddOp;
Instruction::BinaryOps MulOp;		Instruction::BinaryOps MulOp;
if (ScalarIVTy->isIntegerTy()) {		if (ScalarIVTy->isIntegerTy()) {
AddOp = Instruction::Add;		AddOp = Instruction::Add;
MulOp = Instruction::Mul;		MulOp = Instruction::Mul;
▲ Show 20 Lines • Show All 6,336 Lines • ▼ Show 20 Lines

// Add the necessary canonical IV and branch recipes required to control the		// Add the necessary canonical IV and branch recipes required to control the
// loop.		// loop.
static void addCanonicalIVRecipes(VPlan &Plan, Type *IdxTy, DebugLoc DL,		static void addCanonicalIVRecipes(VPlan &Plan, Type *IdxTy, DebugLoc DL,
bool HasNUW,		bool HasNUW,
bool UseLaneMaskForLoopControlFlow) {		bool UseLaneMaskForLoopControlFlow) {
Value *StartIdx = ConstantInt::get(IdxTy, 0);		Value *StartIdx = ConstantInt::get(IdxTy, 0);
auto *StartV = Plan.getOrAddVPValue(StartIdx);		auto *StartV = Plan.getOrAddVPValue(StartIdx);

		AyalUnsubmitted Done Reply Inline Actions Can alternatively provide the desired constant 1 VPValue when asked to retrieve getStep()? (Such constant operands are analogous to Attributes in MLIR.) (Unlike the constant Start which VPlan::prepareToExecute() may replace later with a nonzero value, something worth fixing...) Ayal: Can alternatively provide the desired constant 1 VPValue when asked to retrieve getStep()?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!
// Add a VPCanonicalIVPHIRecipe starting at 0 to the header.		// Add a VPCanonicalIVPHIRecipe starting at 0 to the header.
auto *CanonicalIVPHI = new VPCanonicalIVPHIRecipe(StartV, DL);		auto *CanonicalIVPHI = new VPCanonicalIVPHIRecipe(StartV, DL);
VPRegionBlock *TopRegion = Plan.getVectorLoopRegion();		VPRegionBlock *TopRegion = Plan.getVectorLoopRegion();
VPBasicBlock *Header = TopRegion->getEntryBasicBlock();		VPBasicBlock *Header = TopRegion->getEntryBasicBlock();
Header->insert(CanonicalIVPHI, Header->begin());		Header->insert(CanonicalIVPHI, Header->begin());

// Add a CanonicalIVIncrement{NUW} VPInstruction to increment the scalar		// Add a CanonicalIVIncrement{NUW} VPInstruction to increment the scalar
// IV by VF * UF.		// IV by VF * UF.
▲ Show 20 Lines • Show All 814 Lines • ▼ Show 20 Lines	Value *GEP = State.Builder.CreateGEP(
State.Builder.CreateMul(		State.Builder.CreateMul(
StartOffset,		StartOffset,
State.Builder.CreateVectorSplat(State.VF, ScalarStepValue),		State.Builder.CreateVectorSplat(State.VF, ScalarStepValue),
"vector.gep"));		"vector.gep"));
State.set(this, GEP, Part);		State.set(this, GEP, Part);
}		}
}		}

void VPScalarIVStepsRecipe::execute(VPTransformState &State) {		void VPDerivedIVRecipe::execute(VPTransformState &State) {
		AyalUnsubmitted Done Reply Inline Actions `assert(!State.Instance && "VPTransformedIVRecipe being replicated.");` ? Ayal: `assert(!State.Instance && "VPTransformedIVRecipe being replicated.");` ?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Added, thanks! fhahn: Added, thanks!
assert(!State.Instance && "VPScalarIVStepsRecipe being replicated.");		assert(!State.Instance && "VPDerivedIVRecipe being replicated.");

// Fast-math-flags propagate from the original induction instruction.		// Fast-math-flags propagate from the original induction instruction.
IRBuilder<>::FastMathFlagGuard FMFG(State.Builder);		IRBuilder<>::FastMathFlagGuard FMFG(State.Builder);
if (IndDesc.getInductionBinOp() &&		if (IndDesc.getInductionBinOp() &&
isa<FPMathOperator>(IndDesc.getInductionBinOp()))		isa<FPMathOperator>(IndDesc.getInductionBinOp()))
State.Builder.setFastMathFlags(		State.Builder.setFastMathFlags(
IndDesc.getInductionBinOp()->getFastMathFlags());		IndDesc.getInductionBinOp()->getFastMathFlags());

Value *Step = State.get(getStepValue(), VPIteration(0, 0));		Value *Step = State.get(getStepValue(), VPIteration(0, 0));
		AyalUnsubmitted Done Reply Inline Actions Some things seem a bit confusing here, looking at the existing code: VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? The original code has both `ScalarIV` and `CanonicalIV` - are they not the same - one retrieves a Value per lane (0,0) and the other per part (0) - used only to check its type? Now `TransformedIV` is also "Scalar" (as in non-Vector) similar to `ScalarIV`. Perhaps instead of `Value ScalarIV = State.get(getCanonicalIV(), VPIteration(0, 0));` we should have `Value CanonicalIV = State.get(getCanonicalIV(), VPIteration(0, 0));` ? Perhaps instead of `TransformedIV` have `NonCanonicalIV`, `AffineIV` or `DerivedIV` - considering that the canonical IV is aka a "BasicIV"? Then rename `VPTransformedIVRecipe` accordingly? Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane. Ayal: Some things seem a bit confusing here, looking at the existing code: VPScalarIVStepsRecipe…
		fhahnAuthorUnsubmitted Done Reply Inline Actions VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? Yes we could also use `VPlan->getCanonicalIV()`, but it might be easier to follow if modeled explicitly? The original code has both ScalarIV and CanonicalIV - are they not the same - one retrieves a Value per lane (0,0) and the other per part (0) - used only to check its type? Yep, that should be cleaner in the new code. Perhaps instead of Thanks, I updated the naming to use `CanonicalIV` and `DerivedIV`. I also renamed `VPTransformedIVRecipe` -> `VPDerivedIVRecipe` Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane. Good idea, I'll see about that separately. fhahn: > VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does…
		AyalUnsubmitted Done Reply Inline Actions VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does the enclosing VPlan->getCanonicalIV()? Yes we could also use VPlan->getCanonicalIV(), but it might be easier to follow if modeled explicitly? Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) Would be good to explain somewhere all the IV recipes together: those representing a single scalar (canonical or not) across VF&UF, a single vector per part, a single scalar per lane. Good idea, I'll see about that separately. Found a good place? Ayal: >> VPScalarIVStepsRecipe::getCanonicalIV() presumably retrieves the same Recipe/VPValue as does…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) I think `emitTransformedIndex` needs the canonical IV. Found a good place? I put up D138748 to add it to the VPHeaderPHIRecipe documentation. fhahn: Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if…
		AyalUnsubmitted Done Reply Inline Actions Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant if its Canonical or not - can simply refer to it as getBasicIV() or getIV()? (When there's a need to call isCanonical() then the CanonicalIV is needed, but that is the case in optimizeInductions() rather than here in DerivedIV.) I think `emitTransformedIndex` needs the canonical IV. Hmm, `emitTransformedIndex` just needs an "Index", i.e., an "IV" or "BasicIV", regardless if it is "the canonical" IV. Ayal: > Modeling the canonical IV as an explicit operand is fine. In fact, it then seems irrelevant…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Right, but VPDerivedIVRecipe will always use the canonical IV, at least to start with IIUC. fhahn: Right, but VPDerivedIVRecipe will always use the canonical IV, at least to start with IIUC.
		AyalUnsubmitted Not Done Reply Inline Actions Agreed, VPDerivedIVRecipe is (currently) always fed the canonical IV, but it can compute a derived IV given any BaseIV, i.e., does not rely on it being The Canonical IV. Ayal: Agreed, VPDerivedIVRecipe is (currently) always fed the canonical IV, but it can compute a…
auto CreateScalarIV = [&](Value &Step) -> Value {		Value *CanonicalIV = State.get(getCanonicalIV(), VPIteration(0, 0));
Value *ScalarIV = State.get(getCanonicalIV(), VPIteration(0, 0));		Value *DerivedIV =
auto *CanonicalIV = State.get(getParent()->getPlan()->getCanonicalIV(), 0);
if (!isCanonical() \|\| CanonicalIV->getType() != Ty) {
ScalarIV =
Ty->isIntegerTy()		Ty->isIntegerTy()
? State.Builder.CreateSExtOrTrunc(ScalarIV, Ty)		? State.Builder.CreateSExtOrTrunc(CanonicalIV, Ty)
: State.Builder.CreateCast(Instruction::SIToFP, ScalarIV, Ty);		: State.Builder.CreateCast(Instruction::SIToFP, CanonicalIV, Ty);
ScalarIV = emitTransformedIndex(State.Builder, ScalarIV,		DerivedIV =
		AyalUnsubmitted Done Reply Inline Actions nit: suffice to ask if TruncToTy != DerivedIV->getType() as the latter is never null? Ayal: nit: suffice to ask if TruncToTy != DerivedIV->getType() as the latter is never null?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Adjusted, thanks! fhahn: Adjusted, thanks!
getStartValue()->getLiveInIRValue(), Step,		emitTransformedIndex(State.Builder, DerivedIV,
		AyalUnsubmitted Done Reply Inline Actions nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter? Ayal: nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() TODO: have…
		fhahnAuthorUnsubmitted Done Reply Inline Actions nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() Done! TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter? It looks like this is only needed for the use here and not the other uses, so it seems simpler to keep to code here? fhahn: > nit: can also rename this original emitTransformedIndex() - emitDerivedIndex() Done! > TODO…
		AyalUnsubmitted Done Reply Inline Actions TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty instead of asserting it's the same as that of its 4th "Step" parameter? It looks like this is only needed for the use here and not the other uses, so it seems simpler to keep to code here? Hmm, on the contrary - it appears all users cast the 2nd operand to match the type of the 4th operand before calling emitTransformedIndex(): VTC/"cast.vtc" casts VectorTripCount and AdditionalBypass.second to StepType, CMO/"cast.cmo" casts CountMinusOne to Step Type, PtrInd casts CanonicalIV to Step Type in order for Idx and GlobalIdx to be the desired type. Here `Step` is obtained from an operand via State, `Ty` is recorded in the recipe, and we eventually assert that the type of Step matches Ty. Seems overly complicated? Would it be simpler, here and also at the other callers, to feed emitTransformedIndex() or emitDerivedIndex() directly with the original IV and Step, and let it do the necessary casting of the former to match the type of the latter? Ayal: >> TODO: have emitTransformedIndex() also take care of casting its 2nd "Index" parameter to Ty…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Simplified in 12bb5535d270, thanks! fhahn: Simplified in 12bb5535d270, thanks!
		AyalUnsubmitted Done Reply Inline Actions assert(Step->getType()->isIntegerTy()) belongs earlier if still needed here at all? Ayal: assert(Step->getType()->isIntegerTy()) belongs earlier if still needed here at all?
		fhahnAuthorUnsubmitted Done Reply Inline Actions I might be missing something, but I think before the patch we also only had this assert for the case we need to truncate, as this can only be done for integer types. The induction could also be a floating point IV in general here I think. fhahn: I might be missing something, but I think before the patch we also only had this assert for the…
		AyalUnsubmitted Not Done Reply Inline Actions Ahh, sorry, agreed. (Confused by same assert-guarding-trunc in buildScalarSteps() and thought one could check if ResultTy differs from Step->getType() instead of DerivedIV->getType() thereby asserting earlier before emitTransformedIndex(). But current code is fine.) Ayal: Ahh, sorry, agreed. (Confused by same assert-guarding-trunc in buildScalarSteps() and thought…
IndDesc);		getStartValue()->getLiveInIRValue(), Step, IndDesc);
ScalarIV->setName("offset.idx");		DerivedIV->setName("offset.idx");
}
if (TruncToTy) {		if (TruncToTy) {
assert(Step->getType()->isIntegerTy() &&		assert(Step->getType()->isIntegerTy() &&
"Truncation requires an integer step");		"Truncation requires an integer step");
ScalarIV = State.Builder.CreateTrunc(ScalarIV, TruncToTy);		DerivedIV = State.Builder.CreateTrunc(DerivedIV, TruncToTy);
		AyalUnsubmitted Done Reply Inline Actions assert(TransformedIV != ScalarIV && "..."); ? Ayal: assert(TransformedIV != ScalarIV && "..."); ?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Thanks, added the assert. fhahn: Thanks, added the assert.
Step = State.Builder.CreateTrunc(Step, TruncToTy);
}		}
return ScalarIV;		assert(DerivedIV != CanonicalIV && "IV didn't need transforming?");
};
		State.set(this, DerivedIV, VPIteration(0, 0));
		}

		void VPScalarIVStepsRecipe::execute(VPTransformState &State) {
		assert(!State.Instance && "VPScalarIVStepsRecipe being replicated.");

		// Fast-math-flags propagate from the original induction instruction.
		IRBuilder<>::FastMathFlagGuard FMFG(State.Builder);
		if (IndDesc.getInductionBinOp() &&
		isa<FPMathOperator>(IndDesc.getInductionBinOp()))
		State.Builder.setFastMathFlags(
		IndDesc.getInductionBinOp()->getFastMathFlags());

		Value *BaseIV = State.get(getOperand(0), VPIteration(0, 0));
		Value *Step = State.get(getStepValue(), VPIteration(0, 0));
		if (Step->getType() != BaseIV->getType())
		AyalUnsubmitted Done Reply Inline Actions Another type reconciliation worth taking care of by the callee instead of having it assert it? buildScalarSteps() in this case. Ayal: Another type reconciliation worth taking care of by the callee instead of having it assert it?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Refactored in bf0bd85f9d82 and removed here, thanks! fhahn: Refactored in bf0bd85f9d82 and removed here, thanks!
		AyalUnsubmitted Done Reply Inline Actions Refactored in bf0bd85f9d82 and removed here, thanks! Good! Thanks! Ayal: > Refactored in bf0bd85f9d82 and removed here, thanks! Good! Thanks!
		Step = State.Builder.CreateTrunc(Step, BaseIV->getType());
		AyalUnsubmitted Done Reply Inline Actions ditto: everything here is scalar IV. Perhaps BaseIV, FirstLaneScalarIV, or Start - reviving getStartValue() to wrap getOperand(0)? Ayal: ditto: everything here is scalar IV. Perhaps BaseIV, FirstLaneScalarIV, or Start - reviving…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated to use `BaseIV`. I kept `getOperand(0)` for now, as it seems like it may be confused with the different behavior of the existing `getStartValue()`. fhahn: Updated to use `BaseIV`. I kept `getOperand(0)` for now, as it seems like it may be confused…
		AyalUnsubmitted Done Reply Inline Actions I kept getOperand(0) for now, as it seems like it may be confused with the different behavior of the existing getStartValue(). Not sure what the source of confusion is, but it may appear clearer to have either Value BaseIV = State.get(getOperand(0), VPIteration(0, 0)); Value Step = State.get(getOperand(1), VPIteration(0, 0)); or Value BaseIV = State.get(getBaseIV(), VPIteration(0, 0)); Value Step = State.get(getStepValue(), VPIteration(0, 0)); Ayal: > I kept getOperand(0) for now, as it seems like it may be confused with the different behavior…
		AyalUnsubmitted Done Reply Inline Actions Ahh, operand 0 of VPScalarIVStepsRecipe is providing both the start value (directly - BaseIV) and the step (indirectly - by casting operand 0 into a recipe and asking it for its step). Better represent all def-use relations explicitly by passing VPValues (only) between recipes. This can be done by propagating Step instead of delegating it, e.g.: if SCEV is needed then have a common VPExpandSCEVRecipe (placed in the preheader) take care of generating the `VPValue Step = vputils::getOrCreateVPValueForSCEVExpr(Plan, ID.getStep(), SE);` and feed it to both VPDerivedIVRecipe and VPScalarIVStepsRecipe. If SCEV is not used, have a Plan.getOrAddVPValue(1)) of the desired type feed these recipes? Sounds reasonable? Ayal:* Ahh, operand 0 of VPScalarIVStepsRecipe is providing both the start value (directly - BaseIV)…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Thanks, updated to keep adding the step to VPScalarIVStepsRecipe as well. fhahn: Thanks, updated to keep adding the step to VPScalarIVStepsRecipe as well.

Value *ScalarIV = CreateScalarIV(Step);		buildScalarSteps(BaseIV, Step, IndDesc, this, State);
		AyalUnsubmitted Done Reply Inline Actions Is this the source for introducing a unit step as operand to Can[onical]IV? Ayal: Is this the source for introducing a unit step as operand to Can[onical]IV?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Yes but that's refactored now. fhahn: Yes but that's refactored now.
buildScalarSteps(ScalarIV, Step, IndDesc, this, State);
}		}

		AyalUnsubmitted Done Reply Inline Actions Have both producing recipes feed VPScalarIVStepsRecipe() with their step value as another operand, reviving getStepValue() to retrieve it? Ayal: Have both producing recipes feed VPScalarIVStepsRecipe() with their step value as another…
		fhahnAuthorUnsubmitted Done Reply Inline Actions I left things as they are for now, after refactoring `getSTepValue` in `VPCanonicalIVPHIrecipe`. fhahn: I left things as they are for now, after refactoring `getSTepValue` in `VPCanonicalIVPHIrecipe`.
void VPInterleaveRecipe::execute(VPTransformState &State) {		void VPInterleaveRecipe::execute(VPTransformState &State) {
assert(!State.Instance && "Interleave group being replicated.");		assert(!State.Instance && "Interleave group being replicated.");
State.ILV->vectorizeInterleaveGroup(IG, definedValues(), State, getAddr(),		State.ILV->vectorizeInterleaveGroup(IG, definedValues(), State, getAddr(),
getStoredValues(), getMask());		getStoredValues(), getMask());
		AyalUnsubmitted Done Reply Inline Actions This truncation of Step is needed only if fed directly from CanonicalIV, because the Step produced by DeriveIV should have the desired (value and) type, including a truncation at the end if needed, right? Alternatively, introduce a DeriveIV recipe also if only truncation of Step is needed, so as not to sink it into triangles? (Or have ScalarIV recipe take care of TruncToTy always, also for derived steps, relieving DeriveIV of doing so, though this would sink into triangles.) Ayal: This truncation of Step is needed only if fed directly from CanonicalIV, because the Step…
		fhahnAuthorUnsubmitted Done Reply Inline Actions at the moment, a derived IV recipe is created if only a truncate is needed, but the derived IV will only be truncated at the end, so it uses the wide step whereas for the steps recipe we need the truncated step. I think to unify this we would have to compute the derived IV with truncated steps, but that would be a bigger, unrelated change, fhahn: at the moment, a derived IV recipe is created if only a truncate is needed, but the derived IV…
		AyalUnsubmitted Done Reply Inline Actions Perhaps worth leaving a note behind about said bigger unrelated change. Ayal: Perhaps worth leaving a note behind about said bigger unrelated change.
		fhahnAuthorUnsubmitted Done Reply Inline Actions Added to `buildScalarSteps`. fhahn: Added to `buildScalarSteps`.
}		}

void VPReductionRecipe::execute(VPTransformState &State) {		void VPReductionRecipe::execute(VPTransformState &State) {
assert(!State.Instance && "Reduction being replicated.");		assert(!State.Instance && "Reduction being replicated.");
Value *PrevInChain = State.get(getChainOp(), 0);		Value *PrevInChain = State.get(getChainOp(), 0);
RecurKind Kind = RdxDesc->getRecurrenceKind();		RecurKind Kind = RdxDesc->getRecurrenceKind();
bool IsOrdered = State.ILV->useOrderedReductions(*RdxDesc);		bool IsOrdered = State.ILV->useOrderedReductions(*RdxDesc);
// Propagate the fast-math flags carried by the underlying instruction.		// Propagate the fast-math flags carried by the underlying instruction.
▲ Show 20 Lines • Show All 1,088 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlan.h

Show First 20 Lines • Show All 1,122 Lines • ▼ Show 20 Lines	#endif

/// Returns true if a vector phi needs to be created for the induction.		/// Returns true if a vector phi needs to be created for the induction.
bool needsVectorIV() const { return NeedsVectorIV; }		bool needsVectorIV() const { return NeedsVectorIV; }
};		};

/// A pure virtual base class for all recipes modeling header phis, including		/// A pure virtual base class for all recipes modeling header phis, including
/// phis for first order recurrences, pointer inductions and reductions. The		/// phis for first order recurrences, pointer inductions and reductions. The
/// start value is the first operand of the recipe and the incoming value from		/// start value is the first operand of the recipe and the incoming value from
/// the backedge is the second operand.		/// the backedge is the second operand.
		AyalUnsubmitted Done Reply Inline Actions second >> last Ayal: second >> last
		fhahnAuthorUnsubmitted Done Reply Inline Actions That's not needed any longer. fhahn: That's not needed any longer.
class VPHeaderPHIRecipe : public VPRecipeBase, public VPValue {		class VPHeaderPHIRecipe : public VPRecipeBase, public VPValue {
protected:		protected:
VPHeaderPHIRecipe(unsigned char VPVID, unsigned char VPDefID, PHINode *Phi,		VPHeaderPHIRecipe(unsigned char VPVID, unsigned char VPDefID, PHINode *Phi,
VPValue *Start = nullptr)		VPValue *Start = nullptr)
: VPRecipeBase(VPDefID, {}), VPValue(VPVID, Phi, this) {		: VPRecipeBase(VPDefID, {}), VPValue(VPVID, Phi, this) {
if (Start)		if (Start)
addOperand(Start);		addOperand(Start);
}		}
Show All 24 Lines	#endif
VPValue *getStartValue() {		VPValue *getStartValue() {
return getNumOperands() == 0 ? nullptr : getOperand(0);		return getNumOperands() == 0 ? nullptr : getOperand(0);
}		}
VPValue *getStartValue() const {		VPValue *getStartValue() const {
return getNumOperands() == 0 ? nullptr : getOperand(0);		return getNumOperands() == 0 ? nullptr : getOperand(0);
}		}

/// Returns the incoming value from the loop backedge.		/// Returns the incoming value from the loop backedge.
VPValue *getBackedgeValue() {		VPValue *getBackedgeValue() { return getOperand(1); }
		fhahnAuthorUnsubmitted Done Reply Inline Actions Should be gone now. fhahn: Should be gone now.
		AyalUnsubmitted Done Reply Inline Actions nit: inlining can be applied separately. Ayal: nit: inlining can be applied separately.
		fhahnAuthorUnsubmitted Done Reply Inline Actions I think it was already inlined in current `main`, it just got re-formatted here. Undid the formatting change. fhahn: I think it was already inlined in current `main`, it just got re-formatted here. Undid the…
return getOperand(1);
}

/// Returns the backedge value as a recipe. The backedge value is guaranteed		/// Returns the backedge value as a recipe. The backedge value is guaranteed
/// to be a recipe.		/// to be a recipe.
VPRecipeBase *getBackedgeRecipe() {		VPRecipeBase *getBackedgeRecipe() {
return cast<VPRecipeBase>(getBackedgeValue()->getDef());		return cast<VPRecipeBase>(getBackedgeValue()->getDef());
}		}
};		};

▲ Show 20 Lines • Show All 671 Lines • ▼ Show 20 Lines

#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
/// Print the recipe.		/// Print the recipe.
void print(raw_ostream &O, const Twine &Indent,		void print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const override;		VPSlotTracker &SlotTracker) const override;
#endif		#endif

/// Returns the scalar type of the induction.		/// Returns the scalar type of the induction.
const Type *getScalarType() const {		Type *getScalarType() const {
		AyalUnsubmitted Done Reply Inline Actions Check if this method can remain const if this recipe no longer needs to delegate its Step. Ayal: Check if this method can remain const if this recipe no longer needs to delegate its Step.
		AyalUnsubmitted Done Reply Inline Actions Cannot return a const Type because ConstantInt::get() expects a non-const type, in VPCanonicalIVPHIRecipe::getStepValue()? Ayal: Cannot return a const Type because ConstantInt::get() expects a non-const type, in…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Unfortunately yes! fhahn: Unfortunately yes!
return getOperand(0)->getLiveInIRValue()->getType();		return getOperand(0)->getLiveInIRValue()->getType();
}		}

		/// Returns a VPValue for the step (constant 1) of the induction.
		VPValue *getStepValue();
		AyalUnsubmitted Done Reply Inline Actions Consider removing and feeding users directly with a constant 1 VPValue when needed upon their construction, to keep this recipe with a single value (Start) VPDef. Otherwise it can provide both Start and Step as parts of its multi-valued VPDef... (nit: const?) Ayal: Consider removing and feeding users directly with a constant 1 VPValue when needed upon their…
		AyalUnsubmitted Done Reply Inline Actions Is VPCanonicalIVPHIRecipe::getStepValue() still needed? Ayal: Is VPCanonicalIVPHIRecipe::getStepValue() still needed?
		fhahnAuthorUnsubmitted Done Reply Inline Actions It's not needed in the latest version, removed, thanks! fhahn: It's not needed in the latest version, removed, thanks!

/// Returns true if the recipe only uses the first lane of operand \p Op.		/// Returns true if the recipe only uses the first lane of operand \p Op.
bool onlyFirstLaneUsed(const VPValue *Op) const override {		bool onlyFirstLaneUsed(const VPValue *Op) const override {
assert(is_contained(operands(), Op) &&		assert(is_contained(operands(), Op) &&
"Op must be an operand of the recipe");		"Op must be an operand of the recipe");
return true;		return true;
}		}

		/// Check if the induction described by \p ID is canonical, i.e. has a step of
		/// 1 and the same type and start as the canonical IV.
		AyalUnsubmitted Done Reply Inline Actions nit: extra space "and the" step is also the same (1), as type and start; or does the original canonical unit step become UF * VF? Ayal: nit: extra space "and the" step is also the same (1), as type and start; or does the original…
		AyalUnsubmitted Done Reply Inline Actions I mean, would the following rephrasing be better: `/// i.e., has the same start, step (of 1), and type as the canonical IV.` ? Ayal: I mean, would the following rephrasing be better: `/// i.e., has the same start, step (of 1)…
		AyalUnsubmitted Done Reply Inline Actions above nit worth addressing? Ayal: above nit worth addressing?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Missed those earlier, should be adjusted, thanks! fhahn: Missed those earlier, should be adjusted, thanks!
		bool isCanonical(const InductionDescriptor &ID, Type *Ty) const;
};		};

/// A recipe for generating the active lane mask for the vector loop that is		/// A recipe for generating the active lane mask for the vector loop that is
/// used to predicate the vector operations.		/// used to predicate the vector operations.
/// TODO: It would be good to use the existing VPWidenPHIRecipe instead and		/// TODO: It would be good to use the existing VPWidenPHIRecipe instead and
/// remove VPActiveLaneMaskPHIRecipe.		/// remove VPActiveLaneMaskPHIRecipe.
class VPActiveLaneMaskPHIRecipe : public VPHeaderPHIRecipe {		class VPActiveLaneMaskPHIRecipe : public VPHeaderPHIRecipe {
DebugLoc DL;		DebugLoc DL;
▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	#endif

/// Returns the scalar type of the induction.		/// Returns the scalar type of the induction.
const Type *getScalarType() const {		const Type *getScalarType() const {
return cast<VPCanonicalIVPHIRecipe>(getOperand(0)->getDef())		return cast<VPCanonicalIVPHIRecipe>(getOperand(0)->getDef())
->getScalarType();		->getScalarType();
}		}
};		};

/// A recipe for handling phi nodes of integer and floating-point inductions,		/// A recipe for converting the canonical IV value to the corresponding value of
/// producing their scalar values.		/// an IV with different start and step values.
class VPScalarIVStepsRecipe : public VPRecipeBase, public VPValue {		class VPDerivedIVRecipe : public VPRecipeBase, public VPValue {
/// Scalar type to use for the generated values.		/// Scalar type to use for the generated values.
Type *Ty;		Type *Ty;
		AyalUnsubmitted Done Reply Inline Actions Worth clarifying the use of both Ty (type to cast to before conversion) and TruncToTy (type to cast to after conversion)? Ayal: Worth clarifying the use of both Ty (type to cast to before conversion) and TruncToTy (type to…
		AyalUnsubmitted Done Reply Inline Actions Worth removing Ty altogether from VPDerivedIVRecipe - Step provides the desired type. Ayal: Worth removing Ty altogether from VPDerivedIVRecipe - Step provides the desired type.
		fhahnAuthorUnsubmitted Done Reply Inline Actions Removed in the latest version, thanks! fhahn: Removed in the latest version, thanks!
		AyalUnsubmitted Done Reply Inline Actions nit: it seems TruncToTy is always non-null, can simplify later check? Ayal: nit: it seems TruncToTy is always non-null, can simplify later check?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Adjusted, thanks! fhahn: Adjusted, thanks!
/// If not nullptr, truncate the generated values to TruncToTy.		/// If not nullptr, truncate the generated values to TruncToTy.
Type *TruncToTy;		Type *TruncToTy;

		/// Induction descriptor for the induction the canonical IV is transformed to.
const InductionDescriptor &IndDesc;		const InductionDescriptor &IndDesc;

public:		public:
VPScalarIVStepsRecipe(Type *Ty, const InductionDescriptor &IndDesc,		VPDerivedIVRecipe(Type *Ty, const InductionDescriptor &IndDesc,
		AyalUnsubmitted Done Reply Inline Actions CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per-part/per-lane, regardless if it is "the canonical" IV? TruncToTy >> ResultTy? I.e., always specifies the type of the result, never nullptr? Ayal: CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per…
		fhahnAuthorUnsubmitted Done Reply Inline Actions CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per-part/per-lane, regardless if it is "the canonical" IV? I kept it as CanonicalIV for now, as this is what all current clients use (also updated the constructor to require `VPCanonicalIVPHIRecipe`. I can change it if you would prefer, but can also do that if we ever lift the restriction. TruncToTy >> ResultTy? I.e., always specifies the type of the result, never nullptr? Updated, thanks! fhahn: > CanonicalIV >> BaseIV or IV? I.e., any Index that has a value per iteration rather than per…
VPValue CanonicalIV, VPValue Start, VPValue *Step,		VPValue CanonicalIV, VPValue Start, VPValue *Step,
Type *TruncToTy)		Type *TruncToTy)
: VPRecipeBase(VPScalarIVStepsSC, {CanonicalIV, Start, Step}),		: VPRecipeBase(VPDerivedIVSC, {CanonicalIV, Start, Step}),
		AyalUnsubmitted Done Reply Inline Actions nit: Derived IV stands for producing Start + CanonicalIV * Step, so seems more natural to order its operands and dump them in this order? Possibly with + and * instead of separating commas, possibly along with type cast information. Ayal: nit: Derived IV stands for producing Start + CanonicalIV * Step, so seems more natural to order…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!
VPValue(nullptr, this), Ty(Ty), TruncToTy(TruncToTy), IndDesc(IndDesc) {		VPValue(VPVDerivedIVSC, nullptr, this), Ty(Ty), TruncToTy(TruncToTy),
		IndDesc(IndDesc) {}

		~VPDerivedIVRecipe() override = default;

		/// Method to support type inquiry through isa, cast, and dyn_cast.
		static inline bool classof(const VPDef *D) {
		return D->getVPDefID() == VPRecipeBase::VPDerivedIVSC;
		}
		/// Extra classof implementations to allow directly casting from VPUser ->
		/// VPDerivedIVRecipe.
		static inline bool classof(const VPUser *U) {
		auto *R = dyn_cast<VPRecipeBase>(U);
		return R && R->getVPDefID() == VPRecipeBase::VPDerivedIVSC;
		}
		static inline bool classof(const VPRecipeBase *R) {
		return R->getVPDefID() == VPRecipeBase::VPDerivedIVSC;
		}
		static inline bool classof(const VPValue *V) {
		return V->getVPValueID() == VPValue::VPVDerivedIVSC;
		AyalUnsubmitted Done Reply Inline Actions Worth clarifying which scalarized versions are actually generated. Ayal: Worth clarifying which scalarized versions are actually generated.
		fhahnAuthorUnsubmitted Done Reply Inline Actions Added an explanation, thanks! fhahn: Added an explanation, thanks!
		}

		/// Generate the transformed value of the induction at offset StartValue (2.
		/// operand) + IV (1. operand) * StepValue (3, operand).
		AyalUnsubmitted Done Reply Inline Actions Reordering the operands will also simplify this documentation. Ayal: Reordering the operands will also simplify this documentation.
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!
		void execute(VPTransformState &State) override;

		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
		/// Print the recipe.
		void print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const override;
		#endif

		VPCanonicalIVPHIRecipe *getCanonicalIV() const;
		AyalUnsubmitted Done Reply Inline Actions Suffice to have VPValue getBasicIVValue() const { return getOperand(0); } instead? Ayal:* Suffice to have ``` VPValue *getBasicIVValue() const { return getOperand(0); } ``` instead?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!
		VPValue *getStartValue() const { return getOperand(1); }
		VPValue *getStepValue() const { return getOperand(2); }

		/// Returns true if the recipe only uses the first lane of operand \p Op.
		bool onlyFirstLaneUsed(const VPValue *Op) const override {
		assert(is_contained(operands(), Op) &&
		"Op must be an operand of the recipe");
		return true;
}		}
		};

		/// A recipe for handling phi nodes of integer and floating-point inductions,
		/// producing their scalar values.
		class VPScalarIVStepsRecipe : public VPRecipeBase, public VPValue {
		const InductionDescriptor &IndDesc;
		AyalUnsubmitted Done Reply Inline Actions nit: 2nd, 1st, 3rd? Also clarify involved type casts? Ayal: nit: 2nd, 1st, 3rd? Also clarify involved type casts?

		public:
		VPScalarIVStepsRecipe(const InductionDescriptor &IndDesc, VPValue *IV,
		VPValue *Step)
		: VPRecipeBase(VPScalarIVStepsSC, {IV, Step}), VPValue(nullptr, this),
		IndDesc(IndDesc) {}

~VPScalarIVStepsRecipe() override = default;		~VPScalarIVStepsRecipe() override = default;

		AyalUnsubmitted Done Reply Inline Actions Return operand 0 inline, or outlined to avoid cast? Ayal: Return operand 0 inline, or outlined to avoid cast?
/// Method to support type inquiry through isa, cast, and dyn_cast.		/// Method to support type inquiry through isa, cast, and dyn_cast.
static inline bool classof(const VPDef *D) {		static inline bool classof(const VPDef *D) {
return D->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;		return D->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;
}		}
/// Extra classof implementations to allow directly casting from VPUser ->		/// Extra classof implementations to allow directly casting from VPUser ->
/// VPScalarIVStepsRecipe.		/// VPScalarIVStepsRecipe.
static inline bool classof(const VPUser *U) {		static inline bool classof(const VPUser *U) {
auto *R = dyn_cast<VPRecipeBase>(U);		auto *R = dyn_cast<VPRecipeBase>(U);
return R && R->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;		return R && R->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;
}		}
static inline bool classof(const VPRecipeBase *R) {		static inline bool classof(const VPRecipeBase *R) {
return R->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;		return R->getVPDefID() == VPRecipeBase::VPScalarIVStepsSC;
}		}

/// Generate the scalarized versions of the phi node as needed by their users.		/// Generate the scalarized versions of the phi node as needed by their users.
void execute(VPTransformState &State) override;		void execute(VPTransformState &State) override;

#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
/// Print the recipe.		/// Print the recipe.
void print(raw_ostream &O, const Twine &Indent,		void print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const override;		VPSlotTracker &SlotTracker) const override;
#endif		#endif

/// Returns true if the induction is canonical, i.e. starting at 0 and		VPValue *getStepValue() const { return getOperand(1); }
/// incremented by UF * VF (= the original IV is incremented by 1).
bool isCanonical() const;

VPCanonicalIVPHIRecipe *getCanonicalIV() const;
VPValue *getStartValue() const { return getOperand(1); }
VPValue *getStepValue() const { return getOperand(2); }

/// Returns true if the recipe only uses the first lane of operand \p Op.		/// Returns true if the recipe only uses the first lane of operand \p Op.
bool onlyFirstLaneUsed(const VPValue *Op) const override {		bool onlyFirstLaneUsed(const VPValue *Op) const override {
assert(is_contained(operands(), Op) &&		assert(is_contained(operands(), Op) &&
"Op must be an operand of the recipe");		"Op must be an operand of the recipe");
return true;		return true;
}		}
};		};
▲ Show 20 Lines • Show All 1,064 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlan.cpp

Show First 20 Lines • Show All 633 Lines • ▼ Show 20 Lines	if (BackedgeTakenCount && BackedgeTakenCount->getNumUsers()) {
for (unsigned Part = 0, UF = State.UF; Part < UF; ++Part)		for (unsigned Part = 0, UF = State.UF; Part < UF; ++Part)
State.set(BackedgeTakenCount, VTCMO, Part);		State.set(BackedgeTakenCount, VTCMO, Part);
}		}

for (unsigned Part = 0, UF = State.UF; Part < UF; ++Part)		for (unsigned Part = 0, UF = State.UF; Part < UF; ++Part)
State.set(&VectorTripCount, VectorTripCountV, Part);		State.set(&VectorTripCount, VectorTripCountV, Part);

// When vectorizing the epilogue loop, the canonical induction start value		// When vectorizing the epilogue loop, the canonical induction start value
// needs to be changed from zero to the value after the main vector loop.		// needs to be changed from zero to the value after the main vector loop.
		// FIXME: Improve modeling for canonical IV start values in the epilogue loop.
		AyalUnsubmitted Done Reply Inline Actions Suggest to add a FIXME to find a better way than this for VPlan to represent epilogue loop. Ayal: Suggest to add a FIXME to find a better way than this for VPlan to represent epilogue loop.
		fhahnAuthorUnsubmitted Done Reply Inline Actions Added, thanks! fhahn: Added, thanks!
if (CanonicalIVStartValue) {		if (CanonicalIVStartValue) {
VPValue *VPV = getOrAddExternalDef(CanonicalIVStartValue);		VPValue *VPV = getOrAddExternalDef(CanonicalIVStartValue);
auto *IV = getCanonicalIV();		auto *IV = getCanonicalIV();
assert(all_of(IV->users(),		assert(all_of(IV->users(),
[](const VPUser *U) {		[](const VPUser *U) {
if (isa<VPScalarIVStepsRecipe>(U))		if (isa<VPScalarIVStepsRecipe>(U) \|\|
		isa<VPDerivedIVRecipe>(U))
return true;		return true;
auto *VPI = cast<VPInstruction>(U);		auto *VPI = cast<VPInstruction>(U);
return VPI->getOpcode() ==		return VPI->getOpcode() ==
VPInstruction::CanonicalIVIncrement \|\|		VPInstruction::CanonicalIVIncrement \|\|
VPI->getOpcode() ==		VPI->getOpcode() ==
VPInstruction::CanonicalIVIncrementNUW;		VPInstruction::CanonicalIVIncrementNUW;
}) &&		}) &&
"the canonical IV should only be used by its increments or "		"the canonical IV should only be used by its increments or "
"ScalarIVSteps when "		"ScalarIVSteps when "
		AyalUnsubmitted Done Reply Inline Actions ... or DerivedIV ... Ayal: ... or DerivedIV ...
		fhahnAuthorUnsubmitted Done Reply Inline Actions added, thanks fhahn: added, thanks
		AyalUnsubmitted Not Done Reply Inline Actions worth adding in the error message as well Ayal: worth adding in the error message as well
"resetting the start value");		"resetting the start value");
IV->setOperand(0, VPV);		IV->setOperand(0, VPV);
}		}
}		}

/// Generate the code inside the preheader and body of the vectorized loop.		/// Generate the code inside the preheader and body of the vectorized loop.
/// Assumes a single pre-header basic-block was created for this. Introduce		/// Assumes a single pre-header basic-block was created for this. Introduce
/// additional basic-blocks as needed, and fill them all.		/// additional basic-blocks as needed, and fill them all.
▲ Show 20 Lines • Show All 444 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp

Show First 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	bool VPRecipeBase::mayReadFromMemory() const {
}		}
default:		default:
return true;		return true;
}		}
}		}

bool VPRecipeBase::mayHaveSideEffects() const {		bool VPRecipeBase::mayHaveSideEffects() const {
switch (getVPDefID()) {		switch (getVPDefID()) {
		case VPDerivedIVSC:
case VPPredInstPHISC:		case VPPredInstPHISC:
return false;		return false;
case VPWidenIntOrFpInductionSC:		case VPWidenIntOrFpInductionSC:
case VPWidenPointerInductionSC:		case VPWidenPointerInductionSC:
case VPWidenCanonicalIVSC:		case VPWidenCanonicalIVSC:
case VPWidenPHISC:		case VPWidenPHISC:
case VPBlendSC:		case VPBlendSC:
case VPWidenSC:		case VPWidenSC:
▲ Show 20 Lines • Show All 593 Lines • ▼ Show 20 Lines
#endif		#endif

bool VPWidenIntOrFpInductionRecipe::isCanonical() const {		bool VPWidenIntOrFpInductionRecipe::isCanonical() const {
auto *StartC = dyn_cast<ConstantInt>(getStartValue()->getLiveInIRValue());		auto *StartC = dyn_cast<ConstantInt>(getStartValue()->getLiveInIRValue());
auto *StepC = dyn_cast<SCEVConstant>(getInductionDescriptor().getStep());		auto *StepC = dyn_cast<SCEVConstant>(getInductionDescriptor().getStep());
return StartC && StartC->isZero() && StepC && StepC->isOne();		return StartC && StartC->isZero() && StepC && StepC->isOne();
}		}

VPCanonicalIVPHIRecipe *VPScalarIVStepsRecipe::getCanonicalIV() const {		VPCanonicalIVPHIRecipe *VPDerivedIVRecipe::getCanonicalIV() const {
return cast<VPCanonicalIVPHIRecipe>(getOperand(0));		return cast<VPCanonicalIVPHIRecipe>(getOperand(0));
}		}

bool VPScalarIVStepsRecipe::isCanonical() const {		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
auto *CanIV = getCanonicalIV();		void VPDerivedIVRecipe::print(raw_ostream &O, const Twine &Indent,
// The start value of the steps-recipe must match the start value of the		VPSlotTracker &SlotTracker) const {
// canonical induction and it must step by 1.		O << Indent;
if (CanIV->getStartValue() != getStartValue())		printAsOperand(O, SlotTracker);
return false;		O << Indent << "= DERIVED-IV ";
auto *StepVPV = getStepValue();		printOperands(O, SlotTracker);
if (StepVPV->getDef())
return false;
auto *StepC = dyn_cast_or_null<ConstantInt>(StepVPV->getLiveInIRValue());
return StepC && StepC->isOne();
}		}
		#endif

#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
void VPScalarIVStepsRecipe::print(raw_ostream &O, const Twine &Indent,		void VPScalarIVStepsRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << Indent;		O << Indent;
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << Indent << "= SCALAR-STEPS ";		O << Indent << "= SCALAR-STEPS ";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
▲ Show 20 Lines • Show All 306 Lines • ▼ Show 20 Lines
void VPCanonicalIVPHIRecipe::print(raw_ostream &O, const Twine &Indent,		void VPCanonicalIVPHIRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << Indent << "EMIT ";		O << Indent << "EMIT ";
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = CANONICAL-INDUCTION";		O << " = CANONICAL-INDUCTION";
}		}
#endif		#endif

		VPValue *VPCanonicalIVPHIRecipe::getStepValue() {
		VPlan &Plan = *getParent()->getPlan();
		return Plan.getOrAddVPValue(ConstantInt::get(getScalarType(), 1));
		AyalUnsubmitted Done Reply Inline Actions turns out the assertion triggered in some tests, added as an extra condition. Good! Worth a comment? Ayal: > turns out the assertion triggered in some tests, added as an extra condition. Good! Worth a…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Moved the check to the return and move the part about incrementing it by one there. fhahn: Moved the check to the return and move the part about incrementing it by one there.
		}

		AyalUnsubmitted Done Reply Inline Actions steps-recipe? Ayal: steps-recipe?
		fhahnAuthorUnsubmitted Done Reply Inline Actions Adjusted, thanks! fhahn: Adjusted, thanks!
		bool VPCanonicalIVPHIRecipe::isCanonical(const InductionDescriptor &ID,
		Type *Ty) const {
		if (Ty != getScalarType())
		AyalUnsubmitted Done Reply Inline Actions Also check for same type, as claimed at the interface? Ayal: Also check for same type, as claimed at the interface?
		fhahnAuthorUnsubmitted Done Reply Inline Actions updated, thanks! fhahn: updated, thanks!
		return false;
		// The start value of the steps-recipe must match the start value of the
		// canonical induction and it must step by 1.
		if (getStartValue()->getLiveInIRValue() != ID.getStartValue())
		return false;

		auto *StepC = dyn_cast_or_null<SCEVConstant>(ID.getStep());
		AyalUnsubmitted Done Reply Inline Actions nit: can check `ConstantInt Step = ID.getConstIntStepValue()` as in Loop::isCanonical(). nit: can assert ID.getInductionOpcode() == Instruction::Add. Ayal:* nit: can check `ConstantInt *Step = ID.getConstIntStepValue()` as in Loop::isCanonical(). nit…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! I turns out the assertion triggered in some tests, added as an extra condition. fhahn: Updated, thanks! I turns out the assertion triggered in some tests, added as an extra condition.
		return StepC && StepC->isOne();
		}

bool VPWidenPointerInductionRecipe::onlyScalarsGenerated(ElementCount VF) {		bool VPWidenPointerInductionRecipe::onlyScalarsGenerated(ElementCount VF) {
return IsScalarAfterVectorization &&		return IsScalarAfterVectorization &&
(!VF.isScalable() \|\| vputils::onlyFirstLaneUsed(this));		(!VF.isScalable() \|\| vputils::onlyFirstLaneUsed(this));
}		}

#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
void VPWidenPointerInductionRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenPointerInductionRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
▲ Show 20 Lines • Show All 243 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

Show First 20 Lines • Show All 376 Lines • ▼ Show 20 Lines	void VPlanTransforms::removeDeadRecipes(VPlan &Plan) {
}		}
}		}

void VPlanTransforms::optimizeInductions(VPlan &Plan, ScalarEvolution &SE) {		void VPlanTransforms::optimizeInductions(VPlan &Plan, ScalarEvolution &SE) {
SmallVector<VPRecipeBase *> ToRemove;		SmallVector<VPRecipeBase *> ToRemove;
VPBasicBlock *HeaderVPBB = Plan.getVectorLoopRegion()->getEntryBasicBlock();		VPBasicBlock *HeaderVPBB = Plan.getVectorLoopRegion()->getEntryBasicBlock();
bool HasOnlyVectorVFs = !Plan.hasVF(ElementCount::getFixed(1));		bool HasOnlyVectorVFs = !Plan.hasVF(ElementCount::getFixed(1));
for (VPRecipeBase &Phi : HeaderVPBB->phis()) {		for (VPRecipeBase &Phi : HeaderVPBB->phis()) {
auto *IV = dyn_cast<VPWidenIntOrFpInductionRecipe>(&Phi);		auto *WideIV = dyn_cast<VPWidenIntOrFpInductionRecipe>(&Phi);
if (!IV)		if (!WideIV)
continue;		continue;
if (HasOnlyVectorVFs &&		if (HasOnlyVectorVFs && none_of(WideIV->users(), [WideIV](VPUser *U) {
none_of(IV->users(), [IV](VPUser *U) { return U->usesScalars(IV); }))		return U->usesScalars(WideIV);
		}))
continue;		continue;

const InductionDescriptor &ID = IV->getInductionDescriptor();		auto IP = HeaderVPBB->getFirstNonPhi();
		VPCanonicalIVPHIRecipe *CanonicalIV = Plan.getCanonicalIV();
		Type *IVTy = WideIV->getPHINode()->getType();
		Instruction *TruncI = WideIV->getTruncInst();
		AyalUnsubmitted Done Reply Inline Actions nit: TruncI is used for its type only, TruncTy may better be called ResultTy, IVTy is hopefully unneeded if VPDerivedRecipe can be constructed w/o it; consider setting: Type IVTy = WideIV->getPHINode()->getType(); Type ResultTy = IVTy; Type TruncInstTy = nullptr; if (auto TruncI = WideIV->getTruncInst()) { TruncInstTy = TruncI->getType(); ResultTy = TruncInstTy; } Ayal: nit: TruncI is used for its type only, TruncTy may better be called ResultTy, IVTy is hopefully…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Simplified, thanks! fhahn: Simplified, thanks!
		Type *TruncTy = TruncI ? TruncI->getType() : IVTy;
		const InductionDescriptor &ID = WideIV->getInductionDescriptor();
VPValue *Step =		VPValue *Step =
vputils::getOrCreateVPValueForSCEVExpr(Plan, ID.getStep(), SE);		vputils::getOrCreateVPValueForSCEVExpr(Plan, ID.getStep(), SE);
Instruction *TruncI = IV->getTruncInst();		VPValue *BaseIV = CanonicalIV;
		AyalUnsubmitted Done Reply Inline Actions nit: define `ID` and `TruncI` below slightly closer to first use? Ayal: nit: define `ID` and `TruncI` below slightly closer to first use?
		fhahnAuthorUnsubmitted Done Reply Inline Actions done, thanks! fhahn: done, thanks!
VPScalarIVStepsRecipe *Steps = new VPScalarIVStepsRecipe(		if (!CanonicalIV->isCanonical(ID, TruncTy)) {
IV->getPHINode()->getType(), ID, Plan.getCanonicalIV(),		BaseIV =
IV->getStartValue(), Step, TruncI ? TruncI->getType() : nullptr);		new VPDerivedIVRecipe(IVTy, ID, CanonicalIV, WideIV->getStartValue(),
		AyalUnsubmitted Done Reply Inline Actions `IVR` may be a confusing name, defined as a VPValue* rather than a Recipe. `IV` also stands for a recipe, conflicting with `IVR`. How about renaming `IVR` to something like `BaseIV`, and define it as a VPRecipeBase? It stands for the recipe providing a single scalar value per iteration of vectorized & unrolled loop with the desired type/start/step values, as a Base on which to build scalar steps - a scalar value per lane and part. This is either the canonical IV recipe if suitable or a newly introduced derived IV recipe which transforms it. Perhaps also rename `IV` to `WidenIV`, and spell out `CanIV` to `CanonicalV`. Ayal:* `IVR` may be a confusing name, defined as a VPValue* rather than a Recipe. `IV` also stands for…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! I kept the type as VPVale as this requires using getDef just once. fhahn: Updated, thanks! I kept the type as VPVale as this requires using getDef just once.
HeaderVPBB->insert(Steps, HeaderVPBB->getFirstNonPhi());		Step, TruncI ? TruncI->getType() : nullptr);
		AyalUnsubmitted Done Reply Inline Actions Should this be a method of VPCanonicalIVPHIRecipe, checking if a given Start and Step match those of its own? Ayal: Should this be a method of VPCanonicalIVPHIRecipe, checking if a given Start and Step match…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Added a helper. fhahn: Added a helper.
		HeaderVPBB->insert(cast<VPRecipeBase>(BaseIV->getDef()), IP);
		}
		AyalUnsubmitted Done Reply Inline Actions TruncTy here is the desired type of the resulting scalar steps; should it be supplied to isCanonical() along with ID in order to check if CanonicalIV is providing the suitable start/step/type for scalar users of `Phi`, or else a derived recipe is needed? Ayal: TruncTy here is the desired type of the resulting scalar steps; should it be supplied to…
		fhahnAuthorUnsubmitted Done Reply Inline Actions done, thanks! fhahn: done, thanks!

		VPScalarIVStepsRecipe *Steps = new VPScalarIVStepsRecipe(ID, BaseIV, Step);
		HeaderVPBB->insert(Steps, IP);
		AyalUnsubmitted Done Reply Inline Actions nit: (re)use `IVTy` nit: use `Can[onical]IV` directly instead of `IVR`. Ayal: nit: (re)use `IVTy` nit: use `Can[onical]IV` directly instead of `IVR`.
		fhahnAuthorUnsubmitted Done Reply Inline Actions done, thanks! fhahn: done, thanks!

// Update scalar users of IV to use Step instead. Use SetVector to ensure		// Update scalar users of IV to use Step instead. Use SetVector to ensure
// the list of users doesn't contain duplicates.		// the list of users doesn't contain duplicates.
		AyalUnsubmitted Done Reply Inline Actions nit: was there some helper to get Def as RecipeBase? Ayal: nit: was there some helper to get Def as RecipeBase?
		fhahnAuthorUnsubmitted Done Reply Inline Actions the patch has not landed yet (D136068) fhahn: the patch has not landed yet (D136068)
SetVector<VPUser *> Users(IV->user_begin(), IV->user_end());		SetVector<VPUser *> Users(WideIV->user_begin(), WideIV->user_end());
		AyalUnsubmitted Done Reply Inline Actions Suggest to first set `VPCanonicalIVPHIRecipe CanonicalIV = Plan.getCanonicalIV();` (or auto) to avoid casting below. Then set VPValue Start, Step to be either those of CanonicalIV or those of the new VPTransformedIVRecipe, to feed the new VPScalarIVStepsRecipe? Ayal: Suggest to first set `VPCanonicalIVPHIRecipe *CanonicalIV = Plan.getCanonicalIV();` (or auto)…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Updated to have a separate `CanIV` variable, thanks! fhahn: Updated to have a separate `CanIV` variable, thanks!
for (VPUser *U : Users) {		for (VPUser *U : Users) {
if (HasOnlyVectorVFs && !U->usesScalars(IV))		if (HasOnlyVectorVFs && !U->usesScalars(WideIV))
continue;		continue;
for (unsigned I = 0, E = U->getNumOperands(); I != E; I++) {		for (unsigned I = 0, E = U->getNumOperands(); I != E; I++) {
if (U->getOperand(I) != IV)		if (U->getOperand(I) != WideIV)
continue;		continue;
U->setOperand(I, Steps);		U->setOperand(I, Steps);
}		}
}		}
}		}
}		}

void VPlanTransforms::removeRedundantExpandSCEVRecipes(VPlan &Plan) {		void VPlanTransforms::removeRedundantExpandSCEVRecipes(VPlan &Plan) {
Show All 15 Lines

llvm/lib/Transforms/Vectorize/VPlanValue.h

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	public:
const Value *getUnderlyingValue() const { return UnderlyingVal; }		const Value *getUnderlyingValue() const { return UnderlyingVal; }

/// An enumeration for keeping track of the concrete subclass of VPValue that		/// An enumeration for keeping track of the concrete subclass of VPValue that
/// are actually instantiated. Values of this enumeration are kept in the		/// are actually instantiated. Values of this enumeration are kept in the
/// SubclassID field of the VPValue objects. They are used for concrete		/// SubclassID field of the VPValue objects. They are used for concrete
/// type identification.		/// type identification.
enum {		enum {
VPValueSC,		VPValueSC,
		VPVDerivedIVSC,
VPVInstructionSC,		VPVInstructionSC,
VPVMemoryInstructionSC,		VPVMemoryInstructionSC,
VPVReductionSC,		VPVReductionSC,
VPVReplicateSC,		VPVReplicateSC,
VPVWidenSC,		VPVWidenSC,
VPVWidenCallSC,		VPVWidenCallSC,
VPVWidenCanonicalIVSC,		VPVWidenCanonicalIVSC,
VPVWidenGEPSC,		VPVWidenGEPSC,
VPVWidenSelectSC,		VPVWidenSelectSC,

		AyalUnsubmitted Done Reply Inline Actions Transformed or Derived? Lex order Ayal: Transformed or Derived? Lex order
		fhahnAuthorUnsubmitted Done Reply Inline Actions done thanks! fhahn: done thanks!
// Phi-like VPValues. Need to be kept together.		// Phi-like VPValues. Need to be kept together.
VPVBlendSC,		VPVBlendSC,
VPVPredInstPHI,		VPVPredInstPHI,
// Header-phi recipes. Need to be kept together.		// Header-phi recipes. Need to be kept together.
VPVCanonicalIVPHISC,		VPVCanonicalIVPHISC,
VPVActiveLaneMaskPHISC,		VPVActiveLaneMaskPHISC,
VPVFirstOrderRecurrencePHISC,		VPVFirstOrderRecurrencePHISC,
VPVWidenPHISC,		VPVWidenPHISC,
▲ Show 20 Lines • Show All 235 Lines • ▼ Show 20 Lines

public:		public:
/// An enumeration for keeping track of the concrete subclass of VPRecipeBase		/// An enumeration for keeping track of the concrete subclass of VPRecipeBase
/// that is actually instantiated. Values of this enumeration are kept in the		/// that is actually instantiated. Values of this enumeration are kept in the
/// SubclassID field of the VPRecipeBase objects. They are used for concrete		/// SubclassID field of the VPRecipeBase objects. They are used for concrete
/// type identification.		/// type identification.
using VPRecipeTy = enum {		using VPRecipeTy = enum {
VPBranchOnMaskSC,		VPBranchOnMaskSC,
		VPDerivedIVSC,
VPExpandSCEVSC,		VPExpandSCEVSC,
VPInstructionSC,		VPInstructionSC,
VPInterleaveSC,		VPInterleaveSC,
VPReductionSC,		VPReductionSC,
VPReplicateSC,		VPReplicateSC,
VPScalarIVStepsSC,		VPScalarIVStepsSC,
VPWidenCallSC,		VPWidenCallSC,
		AyalUnsubmitted Done Reply Inline Actions Lex order Ayal: Lex order
		fhahnAuthorUnsubmitted Done Reply Inline Actions done thanks! fhahn: done thanks!
VPWidenCanonicalIVSC,		VPWidenCanonicalIVSC,
VPWidenGEPSC,		VPWidenGEPSC,
VPWidenMemoryInstructionSC,		VPWidenMemoryInstructionSC,
VPWidenSC,		VPWidenSC,
VPWidenSelectSC,		VPWidenSelectSC,

// Phi-like recipes. Need to be kept together.		// Phi-like recipes. Need to be kept together.
VPBlendSC,		VPBlendSC,
▲ Show 20 Lines • Show All 103 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll

	Show All 13 Lines
	; VPLANS-NEXT: EMIT vp<%2> = VF * Part + ir<0>			; VPLANS-NEXT: EMIT vp<%2> = VF * Part + ir<0>
	; VPLANS-NEXT: EMIT vp<%3> = active lane mask vp<%2> <badref>			; VPLANS-NEXT: EMIT vp<%3> = active lane mask vp<%2> <badref>
	; VPLANS-NEXT: Successor(s): vector loop			; VPLANS-NEXT: Successor(s): vector loop
	; VPLANS-EMPTY:			; VPLANS-EMPTY:
	; VPLANS-NEXT: <x1> vector loop: {			; VPLANS-NEXT: <x1> vector loop: {
	; VPLANS-NEXT: vector.body:			; VPLANS-NEXT: vector.body:
	; VPLANS-NEXT: EMIT vp<%4> = CANONICAL-INDUCTION			; VPLANS-NEXT: EMIT vp<%4> = CANONICAL-INDUCTION
	; VPLANS-NEXT: ACTIVE-LANE-MASK-PHI vp<%5> = phi vp<%3>, vp<%10>			; VPLANS-NEXT: ACTIVE-LANE-MASK-PHI vp<%5> = phi vp<%3>, vp<%10>
	; VPLANS-NEXT: vp<%6> = SCALAR-STEPS vp<%4>, ir<0>, ir<1>			; VPLANS-NEXT: vp<%6> = SCALAR-STEPS vp<%4>, ir<1>
				AyalUnsubmitted Done Reply Inline Actions This does appear less descriptive than having SCALAR-STEPS depict the step it uses? Ayal: This does appear less descriptive than having SCALAR-STEPS depict the step it uses?
				AyalUnsubmitted Done Reply Inline Actions Ah, the step it uses is explicit - it's ir<1>! What has been omitted is "start" (ir<0>), which is moved from ScalarIVSteps to DerivedIV, if needed. Ayal: Ah, the step it uses is explicit - it's ir<1>! What has been omitted is "start" (ir<0>), which…
				fhahnAuthorUnsubmitted Done Reply Inline Actions Ah I missed this comment earlier, Yes, DerivedIV will now take care of adjusting the start value, ScalarIVSteps just generate steps. fhahn: Ah I missed this comment earlier, Yes, DerivedIV will now take care of adjusting the start…
	; VPLANS-NEXT: CLONE ir<%gep> = getelementptr ir<%ptr>, vp<%6>			; VPLANS-NEXT: CLONE ir<%gep> = getelementptr ir<%ptr>, vp<%6>
	; VPLANS-NEXT: WIDEN store ir<%gep>, ir<%val>, vp<%5>			; VPLANS-NEXT: WIDEN store ir<%gep>, ir<%val>, vp<%5>
	; VPLANS-NEXT: EMIT vp<%8> = VF * UF + vp<%4>			; VPLANS-NEXT: EMIT vp<%8> = VF * UF + vp<%4>
	; VPLANS-NEXT: EMIT vp<%9> = VF * Part + vp<%8>			; VPLANS-NEXT: EMIT vp<%9> = VF * Part + vp<%8>
	; VPLANS-NEXT: EMIT vp<%10> = active lane mask vp<%9> <badref>			; VPLANS-NEXT: EMIT vp<%10> = active lane mask vp<%9> <badref>
	; VPLANS-NEXT: EMIT vp<%11> = not vp<%10>			; VPLANS-NEXT: EMIT vp<%11> = not vp<%10>
	; VPLANS-NEXT: EMIT branch-on-cond vp<%11>			; VPLANS-NEXT: EMIT branch-on-cond vp<%11>
	; VPLANS-NEXT: No successors			; VPLANS-NEXT: No successors
	▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/AArch64/widen-call-with-intrinsic-or-libfunc.ll

	Show All 9 Lines
	; CHECK-NEXT: Live-in vp<%1> = vector-trip-count			; CHECK-NEXT: Live-in vp<%1> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<%3> = SCALAR-STEPS vp<%2>, ir<0>, ir<1>			; CHECK-NEXT: vp<%3> = SCALAR-STEPS vp<%2>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<%3>			; CHECK-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<%3>
	; CHECK-NEXT: WIDEN ir<%l> = load ir<%gep.src>			; CHECK-NEXT: WIDEN ir<%l> = load ir<%gep.src>
	; CHECK-NEXT: WIDEN ir<%conv> = fpext ir<%l>			; CHECK-NEXT: WIDEN ir<%conv> = fpext ir<%l>
	; CHECK-NEXT: WIDEN-CALL ir<%s> = call @llvm.sin.f64(ir<%conv>) (using library function)			; CHECK-NEXT: WIDEN-CALL ir<%s> = call @llvm.sin.f64(ir<%conv>) (using library function)
	; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<%3>			; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<%3>
	; CHECK-NEXT: REPLICATE store ir<%s>, ir<%gep.dst>			; CHECK-NEXT: REPLICATE store ir<%s>, ir<%gep.dst>
	; CHECK-NEXT: EMIT vp<%10> = VF * UF +(nuw) vp<%2>			; CHECK-NEXT: EMIT vp<%10> = VF * UF +(nuw) vp<%2>
	; CHECK-NEXT: EMIT branch-on-count vp<%10> vp<%1>			; CHECK-NEXT: EMIT branch-on-count vp<%10> vp<%1>
	Show All 9 Lines
	; CHECK-NEXT: Live-in vp<%1> = vector-trip-count			; CHECK-NEXT: Live-in vp<%1> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<%3> = SCALAR-STEPS vp<%2>, ir<0>, ir<1>			; CHECK-NEXT: vp<%3> = SCALAR-STEPS vp<%2>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<%3>			; CHECK-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<%3>
	; CHECK-NEXT: WIDEN ir<%l> = load ir<%gep.src>			; CHECK-NEXT: WIDEN ir<%l> = load ir<%gep.src>
	; CHECK-NEXT: WIDEN ir<%conv> = fpext ir<%l>			; CHECK-NEXT: WIDEN ir<%conv> = fpext ir<%l>
	; CHECK-NEXT: WIDEN-CALL ir<%s> = call @llvm.sin.f64(ir<%conv>) (using vector intrinsic)			; CHECK-NEXT: WIDEN-CALL ir<%s> = call @llvm.sin.f64(ir<%conv>) (using vector intrinsic)
	; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<%3>			; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<%3>
	; CHECK-NEXT: REPLICATE store ir<%s>, ir<%gep.dst>			; CHECK-NEXT: REPLICATE store ir<%s>, ir<%gep.dst>
	; CHECK-NEXT: EMIT vp<%10> = VF * UF +(nuw) vp<%2>			; CHECK-NEXT: EMIT vp<%10> = VF * UF +(nuw) vp<%2>
	; CHECK-NEXT: EMIT branch-on-count vp<%10> vp<%1>			; CHECK-NEXT: EMIT branch-on-count vp<%10> vp<%1>
	▲ Show 20 Lines • Show All 54 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/RISCV/riscv-vector-reverse.ll

	Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: LV: Scalarizing: %cmp = icmp ugt i64 %indvars.iv, 1			; CHECK-NEXT: LV: Scalarizing: %cmp = icmp ugt i64 %indvars.iv, 1
	; CHECK-NEXT: LV: Scalarizing: %indvars.iv.next = add nsw i64 %indvars.iv, -1			; CHECK-NEXT: LV: Scalarizing: %indvars.iv.next = add nsw i64 %indvars.iv, -1
	; CHECK-NEXT: VPlan 'Initial VPlan for VF={vscale x 4},UF>=1' {			; CHECK-NEXT: VPlan 'Initial VPlan for VF={vscale x 4},UF>=1' {
	; CHECK-NEXT: Live-in vp<%2> = vector-trip-count			; CHECK-NEXT: Live-in vp<%2> = vector-trip-count
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK: <x1> vector loop: {			; CHECK: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%3> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<%4> = SCALAR-STEPS vp<%3>, ir<%n>, ir<-1>			; CHECK-NEXT: vp<[[TRANS_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<%n>, ir<-1>
	; CHECK-NEXT: CLONE ir<%i.0> = add vp<%4>, ir<-1>			; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[TRANS_IV]]>, ir<-1>
				; CHECK-NEXT: CLONE ir<%i.0> = add vp<[[SCALAR_STEPS]]>, ir<-1>
	; CHECK-NEXT: CLONE ir<%idxprom> = zext ir<%i.0>			; CHECK-NEXT: CLONE ir<%idxprom> = zext ir<%i.0>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%B>, ir<%idxprom>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%B>, ir<%idxprom>
	; CHECK-NEXT: WIDEN ir<%1> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%1> = load ir<%arrayidx>
	; CHECK-NEXT: WIDEN ir<%add9> = add ir<%1>, ir<1>			; CHECK-NEXT: WIDEN ir<%add9> = add ir<%1>, ir<1>
	; CHECK-NEXT: CLONE ir<%arrayidx3> = getelementptr ir<%A>, ir<%idxprom>			; CHECK-NEXT: CLONE ir<%arrayidx3> = getelementptr ir<%A>, ir<%idxprom>
	; CHECK-NEXT: WIDEN store ir<%arrayidx3>, ir<%add9>			; CHECK-NEXT: WIDEN store ir<%arrayidx3>, ir<%add9>
	; CHECK-NEXT: EMIT vp<%11> = VF * UF +(nuw) vp<%3>			; CHECK-NEXT: EMIT vp<[[IV_INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<%11> vp<%2>			; CHECK-NEXT: EMIT branch-on-count vp<[[IV_INC]]> vp<%2>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %indvars.iv = phi i64 [ %0, %for.body.preheader ], [ %indvars.iv.next, %for.body ]			; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %indvars.iv = phi i64 [ %0, %for.body.preheader ], [ %indvars.iv.next, %for.body ]
	; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %i.0.in8 = phi i32 [ %n, %for.body.preheader ], [ %i.0, %for.body ]			; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %i.0.in8 = phi i32 [ %n, %for.body.preheader ], [ %i.0, %for.body ]
	▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: LV: Scalarizing: %cmp = icmp ugt i64 %indvars.iv, 1			; CHECK-NEXT: LV: Scalarizing: %cmp = icmp ugt i64 %indvars.iv, 1
	; CHECK-NEXT: LV: Scalarizing: %indvars.iv.next = add nsw i64 %indvars.iv, -1			; CHECK-NEXT: LV: Scalarizing: %indvars.iv.next = add nsw i64 %indvars.iv, -1
	; CHECK-NEXT: VPlan 'Initial VPlan for VF={vscale x 4},UF>=1' {			; CHECK-NEXT: VPlan 'Initial VPlan for VF={vscale x 4},UF>=1' {
	; CHECK-NEXT: Live-in vp<%2> = vector-trip-count			; CHECK-NEXT: Live-in vp<%2> = vector-trip-count
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK: <x1> vector loop: {			; CHECK: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%3> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<%4> = SCALAR-STEPS vp<%3>, ir<%n>, ir<-1>			; CHECK-NEXT: vp<[[TRANS_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<%n>, ir<-1>
	; CHECK-NEXT: CLONE ir<%i.0> = add vp<%4>, ir<-1>			; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[TRANS_IV]]>, ir<-1>
				; CHECK-NEXT: CLONE ir<%i.0> = add vp<[[SCALAR_STEPS]]>, ir<-1>
	; CHECK-NEXT: CLONE ir<%idxprom> = zext ir<%i.0>			; CHECK-NEXT: CLONE ir<%idxprom> = zext ir<%i.0>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%B>, ir<%idxprom>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%B>, ir<%idxprom>
	; CHECK-NEXT: WIDEN ir<%1> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%1> = load ir<%arrayidx>
	; CHECK-NEXT: WIDEN ir<%conv1> = fadd ir<%1>, ir<1.000000e+00>			; CHECK-NEXT: WIDEN ir<%conv1> = fadd ir<%1>, ir<1.000000e+00>
	; CHECK-NEXT: CLONE ir<%arrayidx3> = getelementptr ir<%A>, ir<%idxprom>			; CHECK-NEXT: CLONE ir<%arrayidx3> = getelementptr ir<%A>, ir<%idxprom>
	; CHECK-NEXT: WIDEN store ir<%arrayidx3>, ir<%conv1>			; CHECK-NEXT: WIDEN store ir<%arrayidx3>, ir<%conv1>
	; CHECK-NEXT: EMIT vp<%11> = VF * UF +(nuw) vp<%3>			; CHECK-NEXT: EMIT vp<[[IV_INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<%11> vp<%2>			; CHECK-NEXT: EMIT branch-on-count vp<[[IV_INC]]> vp<%2>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %indvars.iv = phi i64 [ %0, %for.body.preheader ], [ %indvars.iv.next, %for.body ]			; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %indvars.iv = phi i64 [ %0, %for.body.preheader ], [ %indvars.iv.next, %for.body ]
	; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %i.0.in8 = phi i32 [ %n, %for.body.preheader ], [ %i.0, %for.body ]			; CHECK-NEXT: LV: Found an estimated cost of 1 for VF vscale x 4 For instruction: %i.0.in8 = phi i32 [ %n, %for.body.preheader ], [ %i.0, %for.body ]
	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/first-order-recurrence-chains-vplan.ll

	Show All 9 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.1> = phi ir<22>, ir<%for.1.next>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.1> = phi ir<22>, ir<%for.1.next>
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.2> = phi ir<33>, vp<%8>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.2> = phi ir<33>, vp<%8>
	; CHECK-NEXT: vp<%5> = SCALAR-STEPS vp<%2>, ir<0>, ir<1>			; CHECK-NEXT: vp<%5> = SCALAR-STEPS vp<%2>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep.ptr> = getelementptr ir<%ptr>, vp<%5>			; CHECK-NEXT: CLONE ir<%gep.ptr> = getelementptr ir<%ptr>, vp<%5>
	; CHECK-NEXT: WIDEN ir<%for.1.next> = load ir<%gep.ptr>			; CHECK-NEXT: WIDEN ir<%for.1.next> = load ir<%gep.ptr>
	; CHECK-NEXT: EMIT vp<%8> = first-order splice ir<%for.1> ir<%for.1.next>			; CHECK-NEXT: EMIT vp<%8> = first-order splice ir<%for.1> ir<%for.1.next>
	; CHECK-NEXT: EMIT vp<%9> = first-order splice ir<%for.2> vp<%8>			; CHECK-NEXT: EMIT vp<%9> = first-order splice ir<%for.2> vp<%8>
	; CHECK-NEXT: WIDEN ir<%add> = add vp<%8>, vp<%9>			; CHECK-NEXT: WIDEN ir<%add> = add vp<%8>, vp<%9>
	; CHECK-NEXT: WIDEN store ir<%gep.ptr>, ir<%add>			; CHECK-NEXT: WIDEN store ir<%gep.ptr>, ir<%add>
	; CHECK-NEXT: EMIT vp<%11> = VF * UF +(nuw) vp<%2>			; CHECK-NEXT: EMIT vp<%11> = VF * UF +(nuw) vp<%2>
	; CHECK-NEXT: EMIT branch-on-count vp<%11> vp<%1>			; CHECK-NEXT: EMIT branch-on-count vp<%11> vp<%1>
	Show All 33 Lines
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.1> = phi ir<22>, ir<%for.1.next>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.1> = phi ir<22>, ir<%for.1.next>
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.2> = phi ir<33>, vp<%9>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.2> = phi ir<33>, vp<%9>
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.3> = phi ir<33>, vp<%10>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for.3> = phi ir<33>, vp<%10>
	; CHECK-NEXT: vp<%6> = SCALAR-STEPS vp<%2>, ir<0>, ir<1>			; CHECK-NEXT: vp<%6> = SCALAR-STEPS vp<%2>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep.ptr> = getelementptr ir<%ptr>, vp<%6>			; CHECK-NEXT: CLONE ir<%gep.ptr> = getelementptr ir<%ptr>, vp<%6>
	; CHECK-NEXT: WIDEN ir<%for.1.next> = load ir<%gep.ptr>			; CHECK-NEXT: WIDEN ir<%for.1.next> = load ir<%gep.ptr>
	; CHECK-NEXT: EMIT vp<%9> = first-order splice ir<%for.1> ir<%for.1.next>			; CHECK-NEXT: EMIT vp<%9> = first-order splice ir<%for.1> ir<%for.1.next>
	; CHECK-NEXT: EMIT vp<%10> = first-order splice ir<%for.2> vp<%9>			; CHECK-NEXT: EMIT vp<%10> = first-order splice ir<%for.2> vp<%9>
	; CHECK-NEXT: EMIT vp<%11> = first-order splice ir<%for.3> vp<%10>			; CHECK-NEXT: EMIT vp<%11> = first-order splice ir<%for.3> vp<%10>
	; CHECK-NEXT: WIDEN ir<%add.1> = add vp<%9>, vp<%10>			; CHECK-NEXT: WIDEN ir<%add.1> = add vp<%9>, vp<%10>
	; CHECK-NEXT: WIDEN ir<%add.2> = add ir<%add.1>, vp<%11>			; CHECK-NEXT: WIDEN ir<%add.2> = add ir<%add.1>, vp<%11>
	; CHECK-NEXT: WIDEN store ir<%gep.ptr>, ir<%add.2>			; CHECK-NEXT: WIDEN store ir<%gep.ptr>, ir<%add.2>
	Show All 32 Lines

llvm/test/Transforms/LoopVectorize/first-order-recurrence-sink-replicate-region.ll

	Show All 15 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%0> = phi ir<0>, ir<%conv>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%0> = phi ir<0>, ir<%conv>
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.load: {			; CHECK-NEXT: <xVFxUF> pred.load: {
	; CHECK-NEXT: pred.load.entry:			; CHECK-NEXT: pred.load.entry:
	▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%recur> = phi ir<0>, ir<%recur.next>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%recur> = phi ir<0>, ir<%recur.next>
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: WIDEN ir<%recur.next> = sext ir<%y>			; CHECK-NEXT: WIDEN ir<%recur.next> = sext ir<%y>
	; CHECK-NEXT: EMIT vp<[[SPLICE:%.+]]> = first-order splice ir<%recur> ir<%recur.next>			; CHECK-NEXT: EMIT vp<[[SPLICE:%.+]]> = first-order splice ir<%recur> ir<%recur.next>
	; CHECK-NEXT: Successor(s): loop.0.split			; CHECK-NEXT: Successor(s): loop.0.split
	; CHECK-EMPTY:			; CHECK-EMPTY:
	▲ Show 20 Lines • Show All 135 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%0> = phi ir<0>, ir<%conv>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%0> = phi ir<0>, ir<%conv>
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: REPLICATE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>			; CHECK-NEXT: REPLICATE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.load: {			; CHECK-NEXT: <xVFxUF> pred.load: {
	▲ Show 20 Lines • Show All 90 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%recur> = phi ir<0>, ir<%recur.next>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%recur> = phi ir<0>, ir<%recur.next>
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): loop.1			; CHECK-NEXT: Successor(s): loop.1
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.1:			; CHECK-NEXT: loop.1:
	; CHECK-NEXT: WIDEN ir<%recur.next> = sext ir<%y>			; CHECK-NEXT: WIDEN ir<%recur.next> = sext ir<%y>
	▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%.pn> = phi ir<0>, ir<[[L:%.+]]>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%.pn> = phi ir<0>, ir<[[L:%.+]]>
	; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<2>, ir<1>			; CHECK-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<2>, ir<1>
				; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[WIDE_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[WIDE_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT vp<[[CMP:%.+]]> = icmp ule vp<[[WIDE_IV]]> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[CMP:%.+]]> = icmp ule vp<[[WIDE_IV]]> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: CLONE ir<[[L]]> = load ir<%src>			; CHECK-NEXT: CLONE ir<[[L]]> = load ir<%src>
	; CHECK-NEXT: EMIT vp<[[SPLICE:%.+]]> = first-order splice ir<%.pn> ir<[[L]]>			; CHECK-NEXT: EMIT vp<[[SPLICE:%.+]]> = first-order splice ir<%.pn> ir<[[L]]>
	; CHECK-NEXT: Successor(s): loop.0.split			; CHECK-NEXT: Successor(s): loop.0.split
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0.split:			; CHECK-NEXT: loop.0.split:
	; CHECK-NEXT: Successor(s): pred.store			; CHECK-NEXT: Successor(s): pred.store
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.store: {			; CHECK-NEXT: <xVFxUF> pred.store: {
	; CHECK-NEXT: pred.store.entry:			; CHECK-NEXT: pred.store.entry:
	; CHECK-NEXT: BRANCH-ON-MASK vp<[[CMP]]>			; CHECK-NEXT: BRANCH-ON-MASK vp<[[CMP]]>
	; CHECK-NEXT: Successor(s): pred.store.if, pred.store.continue			; CHECK-NEXT: Successor(s): pred.store.if, pred.store.continue
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: pred.store.if:			; CHECK-NEXT: pred.store.if:
	; CHECK-NEXT: REPLICATE ir<%val> = sdiv vp<[[SPLICE]]>, ir<%x>			; CHECK-NEXT: REPLICATE ir<%val> = sdiv vp<[[SPLICE]]>, ir<%x>
	; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<%5>			; CHECK-NEXT: REPLICATE ir<%gep.dst> = getelementptr ir<%dst>, vp<[[SCALAR_STEPS]]>
	; CHECK-NEXT: REPLICATE store ir<%val>, ir<%gep.dst>			; CHECK-NEXT: REPLICATE store ir<%val>, ir<%gep.dst>
	; CHECK-NEXT: Successor(s): pred.store.continue			; CHECK-NEXT: Successor(s): pred.store.continue
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: pred.store.continue:			; CHECK-NEXT: pred.store.continue:
	; CHECK-NEXT: PHI-PREDICATED-INSTRUCTION vp<[[P_VAL:%.+]]> = ir<%val>			; CHECK-NEXT: PHI-PREDICATED-INSTRUCTION vp<[[P_VAL:%.+]]> = ir<%val>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): loop.1			; CHECK-NEXT: Successor(s): loop.1
	Show All 29 Lines

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll

	Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[COND:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[COND:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: WIDEN ir<%cond0> = icmp ult ir<%iv>, ir<13>			; CHECK-NEXT: WIDEN ir<%cond0> = icmp ult ir<%iv>, ir<13>
	; CHECK-NEXT: WIDEN-SELECT ir<%s> = select ir<%cond0>, ir<10>, ir<20>			; CHECK-NEXT: WIDEN-SELECT ir<%s> = select ir<%cond0>, ir<10>, ir<20>
	; CHECK-NEXT: Successor(s): pred.store			; CHECK-NEXT: Successor(s): pred.store
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.store: {			; CHECK-NEXT: <xVFxUF> pred.store: {
	; CHECK-NEXT: pred.store.entry:			; CHECK-NEXT: pred.store.entry:
	; CHECK-NEXT: BRANCH-ON-MASK vp<[[COND]]>			; CHECK-NEXT: BRANCH-ON-MASK vp<[[COND]]>
	Show All 34 Lines

llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll

	; REQUIRES: asserts			; REQUIRES: asserts

	; RUN: opt -passes=loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -debug -disable-output %s 2>&1 \| FileCheck --check-prefix=DBG %s			; RUN: opt -passes=loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -debug -disable-output %s 2>&1 \| FileCheck --check-prefix=DBG %s
	; RUN: opt -passes=loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -S %s \| FileCheck %s			; RUN: opt -passes=loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -S %s \| FileCheck %s

	; DBG-LABEL: 'test_scalarize_call'			; DBG-LABEL: 'test_scalarize_call'
	; DBG: VPlan 'Initial VPlan for VF={1},UF>=1' {			; DBG: VPlan 'Initial VPlan for VF={1},UF>=1' {
	; DBG-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; DBG-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: vector.ph:			; DBG-NEXT: vector.ph:
	; DBG-NEXT: Successor(s): vector loop			; DBG-NEXT: Successor(s): vector loop
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: <x1> vector loop: {			; DBG-NEXT: <x1> vector loop: {
	; DBG-NEXT: vector.body:			; DBG-NEXT: vector.body:
	; DBG-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; DBG-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; DBG-NEXT: vp<[[IV_STEPS:%.]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<%start>, ir<1>			; DBG-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<%start>, ir<1>
				; DBG-NEXT: vp<[[IV_STEPS:%.]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<1>
	; DBG-NEXT: CLONE ir<%min> = call @llvm.smin.i32(vp<[[IV_STEPS]]>, ir<65535>)			; DBG-NEXT: CLONE ir<%min> = call @llvm.smin.i32(vp<[[IV_STEPS]]>, ir<65535>)
	; DBG-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%dst>, vp<[[IV_STEPS]]>			; DBG-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%dst>, vp<[[IV_STEPS]]>
	; DBG-NEXT: CLONE store ir<%min>, ir<%arrayidx>			; DBG-NEXT: CLONE store ir<%min>, ir<%arrayidx>
	; DBG-NEXT: EMIT vp<[[INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; DBG-NEXT: EMIT vp<[[INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; DBG-NEXT: EMIT branch-on-count vp<[[INC]]> vp<[[VEC_TC]]>			; DBG-NEXT: EMIT branch-on-count vp<[[INC]]> vp<[[VEC_TC]]>
	; DBG-NEXT: No successors			; DBG-NEXT: No successors
	; DBG-NEXT: }			; DBG-NEXT: }
	;			;
	Show All 39 Lines
	; DBG: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; DBG: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: vector.ph:			; DBG-NEXT: vector.ph:
	; DBG-NEXT: Successor(s): vector loop			; DBG-NEXT: Successor(s): vector loop
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: <x1> vector loop: {			; DBG-NEXT: <x1> vector loop: {
	; DBG-NEXT: vector.body:			; DBG-NEXT: vector.body:
	; DBG-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; DBG-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; DBG-NEXT: vp<[[STEPS1:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<false>, ir<true>			; DBG-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<false>, ir<true>
	; DBG-NEXT: vp<[[STEPS2:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; DBG-NEXT: vp<[[STEPS1:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<true>
				; DBG-NEXT: vp<[[STEPS2:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
				AyalUnsubmitted Done Reply Inline Actions While we're here, `ir<false>`, `ir<true>` seem odd (and even ;-)) Ayal: While we're here, `ir<false>`, `ir<true>` seem odd (and even ;-))
				fhahnAuthorUnsubmitted Done Reply Inline Actions I guess that's because they are boolean values and the IR get printed as boolean literals by the IR printer. Do you think this is something that should be changed? fhahn: I guess that's because they are boolean values and the IR get printed as boolean literals by…
				AyalUnsubmitted Done Reply Inline Actions Oh well, the input IR is adding and subtracting true and false... The reason for having DERIVED-IV with canonical start `ir<false>` ==0 and step `ir<true>` ==1 is because of type expansion and/or truncation? Ayal: Oh well, the input IR is adding and subtracting true and false... The reason for having…
				fhahnAuthorUnsubmitted Done Reply Inline Actions Here the inputs are already truncated to `i1` before recipe construction so the recipe doesn't truncated itself. fhahn: Here the inputs are already truncated to `i1` before recipe construction so the recipe doesn't…
				AyalUnsubmitted Done Reply Inline Actions Trying to clarify why DERIVED-IV is needed at all here (too), given that it starts at 0 (false) and bumps with a step of 1 (true)? Ayal: Trying to clarify why DERIVED-IV is needed at all here (too), given that it starts at 0 (false)…
				fhahnAuthorUnsubmitted Done Reply Inline Actions I think `DERIVED-IV` here is for `%d = phi i1 ...`, which has a different type than the canonical IV, but doesn't itself need truncating because the operands are already `i1`. fhahn: I think `DERIVED-IV` here is for `%d = phi i1 ...`, which has a different type than the…
				AyalUnsubmitted Not Done Reply Inline Actions Ah, right; DERIVED-IV truncates CAN_IV to i1 before Mul & Add, which is not dumped-out like the truncation to ResultTy. Ayal: Ah, right; DERIVED-IV truncates CAN_IV to i1 before Mul & Add, which is not dumped-out like the…
	; DBG-NEXT: Successor(s): cond.false			; DBG-NEXT: Successor(s): cond.false
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: cond.false:			; DBG-NEXT: cond.false:
	; DBG-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<[[STEPS2]]>			; DBG-NEXT: CLONE ir<%gep.src> = getelementptr ir<%src>, vp<[[STEPS2]]>
	; DBG-NEXT: CLONE ir<%gep.dst> = getelementptr ir<%dst>, vp<[[STEPS2]]>			; DBG-NEXT: CLONE ir<%gep.dst> = getelementptr ir<%dst>, vp<[[STEPS2]]>
	; DBG-NEXT: Successor(s): cond.false.0			; DBG-NEXT: Successor(s): cond.false.0
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: cond.false.0:			; DBG-NEXT: cond.false.0:
	▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines
	; DBG: VPlan 'Initial VPlan for VF={1},UF>=1' {			; DBG: VPlan 'Initial VPlan for VF={1},UF>=1' {
	; DBG-NEXT: Live-in vp<%1> = vector-trip-count			; DBG-NEXT: Live-in vp<%1> = vector-trip-count
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: vector.ph:			; DBG-NEXT: vector.ph:
	; DBG-NEXT: Successor(s): vector loop			; DBG-NEXT: Successor(s): vector loop
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: <x1> vector loop: {			; DBG-NEXT: <x1> vector loop: {
	; DBG-NEXT: vector.body:			; DBG-NEXT: vector.body:
	; DBG-NEXT: EMIT vp<%2> = CANONICAL-INDUCTION			; DBG-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; DBG-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for> = phi ir<0>, vp<%4>			; DBG-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for> = phi ir<0>, vp<[[SCALAR_STEPS:.+]]>
	; DBG-NEXT: vp<%4> = SCALAR-STEPS vp<%2>, ir<0>, ir<1>			; DBG-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<0>, ir<1>
				AyalUnsubmitted Done Reply Inline Actions The reason for having DERIVED-IV with canonical start ir<0> and step ir<1> is because of type expansion and/or truncation? Perhaps worth dumping the distinct types, as this operation is effectively a cast. Ayal: The reason for having DERIVED-IV with canonical start ir<0> and step ir<1> is because of type…
				fhahnAuthorUnsubmitted Done Reply Inline Actions Updated, thanks! fhahn: Updated, thanks!
	; DBG-NEXT: EMIT vp<%5> = first-order splice ir<%for> vp<%4>			; DBG-NEXT: vp<[[SCALAR_STEPS]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>
				AyalUnsubmitted Done Reply Inline Actions nit: check the second (ir<1>) operand of SCALAR-STEPS, for completeness? It is in practice either +1 or -1 ... Ayal: nit: check the second (ir<1>) operand of SCALAR-STEPS, for completeness? It is in practice…
				fhahnAuthorUnsubmitted Done Reply Inline Actions Done, thanks! fhahn: Done, thanks!
	; DBG-NEXT: CLONE store vp<%5>, ir<%dst>			; DBG-NEXT: EMIT vp<[[SPLICE:%.+]]> = first-order splice ir<%for> vp<[[SCALAR_STEPS]]>
	; DBG-NEXT: EMIT vp<%7> = VF * UF +(nuw) vp<%2>			; DBG-NEXT: CLONE store vp<[[SPLICE]]>, ir<%dst>
	; DBG-NEXT: EMIT branch-on-count vp<%7> vp<%1>			; DBG-NEXT: EMIT vp<[[IV_INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
				; DBG-NEXT: EMIT branch-on-count vp<[[IV_INC]]> vp<%1>
	; DBG-NEXT: No successors			; DBG-NEXT: No successors
	; DBG-NEXT: }			; DBG-NEXT: }
	; DBG-NEXT: Successor(s): middle.block			; DBG-NEXT: Successor(s): middle.block
	; DBG-EMPTY:			; DBG-EMPTY:
	; DBG-NEXT: middle.block:			; DBG-NEXT: middle.block:
	; DBG-NEXT: No successors			; DBG-NEXT: No successors
	; DBG-NEXT: }			; DBG-NEXT: }

	Show All 31 Lines

llvm/test/Transforms/LoopVectorize/vplan-dot-printing.ll

	Show All 17 Lines
	; CHECK-NEXT: ]			; CHECK-NEXT: ]
	; CHECK-NEXT: N0 -> N1 [ label="" lhead=cluster_N2]			; CHECK-NEXT: N0 -> N1 [ label="" lhead=cluster_N2]
	; CHECK-NEXT: subgraph cluster_N2 {			; CHECK-NEXT: subgraph cluster_N2 {
	; CHECK-NEXT: fontname=Courier			; CHECK-NEXT: fontname=Courier
	; CHECK-NEXT: label="\<x1\> vector loop"			; CHECK-NEXT: label="\<x1\> vector loop"
	; CHECK-NEXT: N1 [label =			; CHECK-NEXT: N1 [label =
	; CHECK-NEXT: "vector.body:\l" +			; CHECK-NEXT: "vector.body:\l" +
	; CHECK-NEXT: " EMIT vp\<[[CAN_IV:%.+]]\> = CANONICAL-INDUCTION\l" +			; CHECK-NEXT: " EMIT vp\<[[CAN_IV:%.+]]\> = CANONICAL-INDUCTION\l" +
	; CHECK-NEXT: " vp\<[[STEPS:%.+]]\> = SCALAR-STEPS vp\<[[CAN_IV]]\>, ir\<0\>, ir\<1\>\l" +			; CHECK-NEXT: " vp\<[[STEPS:%.+]]\> = SCALAR-STEPS vp\<[[CAN_IV]]\>, ir\<1\>\l" +
	; CHECK-NEXT: " CLONE ir\<%arrayidx\> = getelementptr ir\<%y\>, vp\<[[STEPS]]\>\l" +			; CHECK-NEXT: " CLONE ir\<%arrayidx\> = getelementptr ir\<%y\>, vp\<[[STEPS]]\>\l" +
	; CHECK-NEXT: " WIDEN ir\<%lv\> = load ir\<%arrayidx\>\l" +			; CHECK-NEXT: " WIDEN ir\<%lv\> = load ir\<%arrayidx\>\l" +
	; CHECK-NEXT: " WIDEN-CALL ir\<%call\> = call @llvm.sqrt.f32(ir\<%lv\>) (using vector intrinsic)\l" +			; CHECK-NEXT: " WIDEN-CALL ir\<%call\> = call @llvm.sqrt.f32(ir\<%lv\>) (using vector intrinsic)\l" +
	; CHECK-NEXT: " CLONE ir\<%arrayidx2\> = getelementptr ir\<%x\>, vp\<[[STEPS]]\>\l" +			; CHECK-NEXT: " CLONE ir\<%arrayidx2\> = getelementptr ir\<%x\>, vp\<[[STEPS]]\>\l" +
	; CHECK-NEXT: " WIDEN store ir\<%arrayidx2\>, ir\<%call\>\l" +			; CHECK-NEXT: " WIDEN store ir\<%arrayidx2\>, ir\<%call\>\l" +
	; CHECK-NEXT: " EMIT vp\<[[CAN_IV_NEXT:%.+]]\> = VF * UF +(nuw) vp\<[[CAN_IV]]\>\l" +			; CHECK-NEXT: " EMIT vp\<[[CAN_IV_NEXT:%.+]]\> = VF * UF +(nuw) vp\<[[CAN_IV]]\>\l" +
	; CHECK-NEXT: " EMIT branch-on-count vp\<[[CAN_IV_NEXT]]\> vp\<{{.+}}\>\l" +			; CHECK-NEXT: " EMIT branch-on-count vp\<[[CAN_IV_NEXT]]\> vp\<{{.+}}\>\l" +
	; CHECK-NEXT: "No successors\l"			; CHECK-NEXT: "No successors\l"
	Show All 22 Lines

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

	Show All 11 Lines
	; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: WIDEN-CALL ir<%call> = call @llvm.sqrt.f32(ir<%lv>)			; CHECK-NEXT: WIDEN-CALL ir<%call> = call @llvm.sqrt.f32(ir<%lv>)
	; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%call>			; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%call>
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	Show All 30 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %iv.next, 0, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %iv.next, 0, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: WIDEN-GEP Inv[Var] ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>			; CHECK-NEXT: WIDEN-GEP Inv[Var] ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>
	; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: WIDEN ir<%cmp> = icmp eq ir<%arrayidx>, ir<%z>			; CHECK-NEXT: WIDEN ir<%cmp> = icmp eq ir<%arrayidx>, ir<%z>
	; CHECK-NEXT: WIDEN-SELECT ir<%sel> = select ir<%cmp>, ir<1.000000e+01>, ir<2.000000e+01>			; CHECK-NEXT: WIDEN-SELECT ir<%sel> = select ir<%cmp>, ir<1.000000e+01>, ir<2.000000e+01>
	; CHECK-NEXT: WIDEN ir<%add> = fadd ir<%lv>, ir<%sel>			; CHECK-NEXT: WIDEN ir<%add> = fadd ir<%lv>, ir<%sel>
	; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%add>			; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%add>
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	Show All 34 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%red> = phi ir<0.000000e+00>, ir<%red.next>			; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%red> = phi ir<0.000000e+00>, ir<%red.next>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: REDUCE ir<%red.next> = ir<%red> + fast reduce.fadd (ir<%lv>)			; CHECK-NEXT: REDUCE ir<%red.next> = ir<%red> + fast reduce.fadd (ir<%lv>)
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	Show All 28 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%red> = phi ir<0.000000e+00>, ir<%red.next>			; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%red> = phi ir<0.000000e+00>, ir<%red.next>
	; CHECK-NEXT: vp<[[IV:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[IV:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[IV]]>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, vp<[[IV]]>
	; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: REDUCE ir<%red.next> = ir<%red> + fast reduce.fadd (ir<%lv>) (with final reduction value stored in invariant address sank outside of loop)			; CHECK-NEXT: REDUCE ir<%red.next> = ir<%red> + fast reduce.fadd (ir<%lv>) (with final reduction value stored in invariant address sank outside of loop)
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	Show All 27 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %i = phi 0, %i.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %i = phi 0, %i.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: WIDEN ir<%cmp> = icmp ult ir<%i>, ir<5>			; CHECK-NEXT: WIDEN ir<%cmp> = icmp ult ir<%i>, ir<5>
	; CHECK-NEXT: Successor(s): if.then			; CHECK-NEXT: Successor(s): if.then
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: if.then:			; CHECK-NEXT: if.then:
	; CHECK-NEXT: Successor(s): pred.udiv			; CHECK-NEXT: Successor(s): pred.udiv
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.udiv: {			; CHECK-NEXT: <xVFxUF> pred.udiv: {
	; CHECK-NEXT: pred.udiv.entry:			; CHECK-NEXT: pred.udiv.entry:
	▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<4>			; CHECK-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<0>, ir<4>
				; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<4>
	; CHECK-NEXT: CLONE ir<%gep.AB.0> = getelementptr ir<@AB>, ir<0>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%gep.AB.0> = getelementptr ir<@AB>, ir<0>, vp<[[STEPS]]>
	; CHECK-NEXT: INTERLEAVE-GROUP with factor 4 at %AB.0, ir<%gep.AB.0>			; CHECK-NEXT: INTERLEAVE-GROUP with factor 4 at %AB.0, ir<%gep.AB.0>
	; CHECK-NEXT: ir<%AB.0> = load from index 0			; CHECK-NEXT: ir<%AB.0> = load from index 0
	; CHECK-NEXT: ir<%AB.1> = load from index 1			; CHECK-NEXT: ir<%AB.1> = load from index 1
	; CHECK-NEXT: ir<%AB.3> = load from index 3			; CHECK-NEXT: ir<%AB.3> = load from index 3
	; CHECK-NEXT: CLONE ir<%iv.plus.3> = add vp<[[STEPS]]>, ir<3>			; CHECK-NEXT: CLONE ir<%iv.plus.3> = add vp<[[STEPS]]>, ir<3>
	; CHECK-NEXT: WIDEN ir<%add> = add ir<%AB.0>, ir<%AB.1>			; CHECK-NEXT: WIDEN ir<%add> = add ir<%AB.0>, ir<%AB.1>
	; CHECK-NEXT: CLONE ir<%gep.CD.3> = getelementptr ir<@CD>, ir<0>, ir<%iv.plus.3>			; CHECK-NEXT: CLONE ir<%gep.CD.3> = getelementptr ir<@CD>, ir<0>, ir<%iv.plus.3>
	▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%sum.07> = phi ir<0.000000e+00>, ir<%muladd>			; CHECK-NEXT: WIDEN-REDUCTION-PHI ir<%sum.07> = phi ir<0.000000e+00>, ir<%muladd>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%a>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%a>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%l.a> = load ir<%arrayidx>			; CHECK-NEXT: WIDEN ir<%l.a> = load ir<%arrayidx>
	; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%b>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%b>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%l.b> = load ir<%arrayidx2>			; CHECK-NEXT: WIDEN ir<%l.b> = load ir<%arrayidx2>
	; CHECK-NEXT: EMIT vp<[[FMUL:%.+]]> = fmul nnan ninf nsz ir<%l.a> ir<%l.b>			; CHECK-NEXT: EMIT vp<[[FMUL:%.+]]> = fmul nnan ninf nsz ir<%l.a> ir<%l.b>
	; CHECK-NEXT: REDUCE ir<[[MULADD:%.+]]> = ir<%sum.07> + nnan ninf nsz reduce.fadd (vp<[[FMUL]]>)			; CHECK-NEXT: REDUCE ir<[[MULADD:%.+]]> = ir<%sum.07> + nnan ninf nsz reduce.fadd (vp<[[FMUL]]>)
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>
	Show All 32 Lines
	; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%isd> = getelementptr ir<%asd>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%isd> = getelementptr ir<%asd>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%lsd> = load ir<%isd>			; CHECK-NEXT: WIDEN ir<%lsd> = load ir<%isd>
	; CHECK-NEXT: WIDEN ir<%psd> = add ir<%lsd>, ir<23>			; CHECK-NEXT: WIDEN ir<%psd> = add ir<%lsd>, ir<23>
	; CHECK-NEXT: WIDEN ir<%cmp1> = icmp slt ir<%lsd>, ir<100>			; CHECK-NEXT: WIDEN ir<%cmp1> = icmp slt ir<%lsd>, ir<100>
	; CHECK-NEXT: Successor(s): check			; CHECK-NEXT: Successor(s): check
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: check:			; CHECK-NEXT: check:
	; CHECK-NEXT: WIDEN ir<%cmp2> = icmp sge ir<%lsd>, ir<200>			; CHECK-NEXT: WIDEN ir<%cmp2> = icmp sge ir<%lsd>, ir<200>
	▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION\l" +			; CHECK-NEXT: WIDEN-INDUCTION\l" +
	; CHECK-NEXT: " %iv = phi %iv.next, 0\l" +			; CHECK-NEXT: " %iv = phi %iv.next, 0\l" +
	; CHECK-NEXT: " ir<%v2>, vp<[[EXP_SCEV]]>			; CHECK-NEXT: " ir<%v2>, vp<[[EXP_SCEV]]>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, vp<[[EXP_SCEV]]>			; CHECK-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<0>, vp<[[EXP_SCEV]]>
				; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, vp<[[EXP_SCEV]]>
	; CHECK-NEXT: WIDEN ir<%v3> = add ir<%v2>, ir<1>			; CHECK-NEXT: WIDEN ir<%v3> = add ir<%v2>, ir<1>
	; CHECK-NEXT: REPLICATE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>			; CHECK-NEXT: REPLICATE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>
	; CHECK-NEXT: REPLICATE store ir<%v3>, ir<%gep>			; CHECK-NEXT: REPLICATE store ir<%v3>, ir<%gep>
	; CHECK-NEXT: EMIT vp<[[CAN_INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_INC:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_INC]]> vp<%0>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_INC]]> vp<%0>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	Show All 29 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%gep> = getelementptr ir<%ptr>, vp<[[STEPS]]>
	; CHECK-NEXT: WIDEN ir<%add> = add ir<%iv>, ir<%off>			; CHECK-NEXT: WIDEN ir<%add> = add ir<%iv>, ir<%off>
	; CHECK-NEXT: WIDEN store ir<%gep>, ir<0>			; CHECK-NEXT: WIDEN store ir<%gep>, ir<0>
	; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[CAN_IV_NEXT:%.+]]> = VF * UF +(nuw) vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>			; CHECK-NEXT: EMIT branch-on-count vp<[[CAN_IV_NEXT]]> vp<[[VEC_TC]]>
	; CHECK-NEXT: No successors			; CHECK-NEXT: No successors
	; CHECK-NEXT: }			; CHECK-NEXT: }
	; CHECK-NEXT: Successor(s): middle.block			; CHECK-NEXT: Successor(s): middle.block
	Show All 36 Lines

llvm/test/Transforms/LoopVectorize/vplan-sink-scalars-and-merge-vf1.ll

	Show All 10 Lines
	; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%tmp2> = getelementptr ir<%ptr>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%tmp2> = getelementptr ir<%ptr>, vp<[[STEPS]]>
	; CHECK-NEXT: CLONE ir<%tmp3> = load ir<%tmp2>			; CHECK-NEXT: CLONE ir<%tmp3> = load ir<%tmp2>
	; CHECK-NEXT: CLONE store ir<0>, ir<%tmp2>			; CHECK-NEXT: CLONE store ir<0>, ir<%tmp2>
	; CHECK-NEXT: CLONE ir<%tmp4> = zext ir<%tmp3>			; CHECK-NEXT: CLONE ir<%tmp4> = zext ir<%tmp3>
	; CHECK-NEXT: CLONE ir<%tmp5> = trunc ir<%tmp4>			; CHECK-NEXT: CLONE ir<%tmp5> = trunc ir<%tmp4>
	; CHECK-NEXT: Successor(s): if.then			; CHECK-NEXT: Successor(s): if.then

	; CHECK: if.then:			; CHECK: if.then:
	▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/vplan-sink-scalars-and-merge.ll

	Show All 16 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0

	; CHECK: loop.0:			; CHECK: loop.0:
	; CHECK-NEXT: Successor(s): pred.store			; CHECK-NEXT: Successor(s): pred.store

	; CHECK: <xVFxUF> pred.store: {			; CHECK: <xVFxUF> pred.store: {
	; CHECK-NEXT: pred.store.entry:			; CHECK-NEXT: pred.store.entry:
	▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load

	; CHECK: <xVFxUF> pred.load: {			; CHECK: <xVFxUF> pred.load: {
	; CHECK-NEXT: pred.load.entry:			; CHECK-NEXT: pred.load.entry:
	; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>			; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>
	; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue			; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue

	▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load

	; CHECK: <xVFxUF> pred.load: {			; CHECK: <xVFxUF> pred.load: {
	; CHECK-NEXT: pred.load.entry:			; CHECK-NEXT: pred.load.entry:
	; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>			; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>
	; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue			; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue

	▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 21, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 21, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<21>, ir<1>			; CHECK-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<21>, ir<1>
				; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[WIDE_CAN_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[WIDE_CAN_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule vp<[[WIDE_CAN_IV]]> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule vp<[[WIDE_CAN_IV]]> vp<[[BTC]]>
	; CHECK-NEXT: CLONE ir<%gep.A.uniform> = getelementptr ir<%A>, ir<0>			; CHECK-NEXT: CLONE ir<%gep.A.uniform> = getelementptr ir<%A>, ir<0>
	; CHECK-NEXT: CLONE ir<%lv> = load ir<%gep.A.uniform>			; CHECK-NEXT: CLONE ir<%lv> = load ir<%gep.A.uniform>
	; CHECK-NEXT: WIDEN ir<%cmp> = icmp ult ir<%iv>, ir<%k>			; CHECK-NEXT: WIDEN ir<%cmp> = icmp ult ir<%iv>, ir<%k>
	; CHECK-NEXT: Successor(s): loop.then			; CHECK-NEXT: Successor(s): loop.then
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.then:			; CHECK-NEXT: loop.then:
	▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: WIDEN ir<%c.1> = icmp ult ir<%iv>, ir<%j>			; CHECK-NEXT: WIDEN ir<%c.1> = icmp ult ir<%iv>, ir<%j>
	; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>			; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>
	; CHECK-NEXT: Successor(s): then.0			; CHECK-NEXT: Successor(s): then.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: then.0:			; CHECK-NEXT: then.0:
	; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1]]> ir<%c.1> ir<false>			; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1]]> ir<%c.1> ir<false>
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load
	▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>			; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>
	; CHECK-NEXT: WIDEN ir<%c.0> = icmp ult ir<%iv>, ir<%j>			; CHECK-NEXT: WIDEN ir<%c.0> = icmp ult ir<%iv>, ir<%j>
	; CHECK-NEXT: WIDEN ir<%c.1> = icmp ugt ir<%iv>, ir<%j>			; CHECK-NEXT: WIDEN ir<%c.1> = icmp ugt ir<%iv>, ir<%j>
	; CHECK-NEXT: Successor(s): then.0			; CHECK-NEXT: Successor(s): then.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: then.0:			; CHECK-NEXT: then.0:
	; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1]]> ir<%c.0> ir<false>			; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1]]> ir<%c.0> ir<false>
	▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK1:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>			; CHECK-NEXT: WIDEN ir<%mul> = mul ir<%iv>, ir<10>
	; CHECK-NEXT: WIDEN ir<%c.0> = icmp ult ir<%iv>, ir<%j>			; CHECK-NEXT: WIDEN ir<%c.0> = icmp ult ir<%iv>, ir<%j>
	; CHECK-NEXT: Successor(s): then.0			; CHECK-NEXT: Successor(s): then.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: then.0:			; CHECK-NEXT: then.0:
	; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1:%.+]]> ir<%c.0> ir<false>			; CHECK-NEXT: EMIT vp<[[MASK2:%.+]]> = select vp<[[MASK1:%.+]]> ir<%c.0> ir<false>
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load
	▲ Show 20 Lines • Show All 102 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>			; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): loop.1			; CHECK-NEXT: Successor(s): loop.1
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.1:			; CHECK-NEXT: loop.1:
	▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>			; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): loop.1			; CHECK-NEXT: Successor(s): loop.1
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.1:			; CHECK-NEXT: loop.1:
	▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi 0, %iv.next, ir<1>
	; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for> = phi ir<0>, vp<[[PRED:%.+]]>			; CHECK-NEXT: FIRST-ORDER-RECURRENCE-PHI ir<%for> = phi ir<0>, vp<[[PRED:%.+]]>
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule ir<%iv> vp<[[BTC]]>
	; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>			; CHECK-NEXT: REPLICATE ir<%gep.a> = getelementptr ir<@a>, ir<0>, vp<[[STEPS]]>
	; CHECK-NEXT: Successor(s): pred.load			; CHECK-NEXT: Successor(s): pred.load
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.load: {			; CHECK-NEXT: <xVFxUF> pred.load: {
	; CHECK-NEXT: pred.load.entry:			; CHECK-NEXT: pred.load.entry:
	; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>			; CHECK-NEXT: BRANCH-ON-MASK vp<[[MASK]]>
	; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue			; CHECK-NEXT: Successor(s): pred.load.if, pred.load.continue
	▲ Show 20 Lines • Show All 135 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count			; CHECK-NEXT: Live-in vp<[[VEC_TC:%.+]]> = vector-trip-count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<0>, ir<1>			; CHECK-NEXT: vp<[[STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<1>
	; CHECK-NEXT: CLONE ir<%gep> = getelementptr ir<%addr>, vp<[[STEPS]]>			; CHECK-NEXT: CLONE ir<%gep> = getelementptr ir<%addr>, vp<[[STEPS]]>
	; CHECK-NEXT: Successor(s): loop.body			; CHECK-NEXT: Successor(s): loop.body
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.body:			; CHECK-NEXT: loop.body:
	; CHECK-NEXT: WIDEN ir<%0> = load ir<%gep>			; CHECK-NEXT: WIDEN ir<%0> = load ir<%gep>
	; CHECK-NEXT: WIDEN ir<%pred> = fcmp oeq ir<%0>, ir<0.000000e+00>			; CHECK-NEXT: WIDEN ir<%pred> = fcmp oeq ir<%0>, ir<0.000000e+00>
	; CHECK-NEXT: Successor(s): then			; CHECK-NEXT: Successor(s): then
	; CHECK-EMPTY:			; CHECK-EMPTY:
	▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Live-in vp<[[BTC:%.+]]> = backedge-taken count			; CHECK-NEXT: Live-in vp<[[BTC:%.+]]> = backedge-taken count
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: vector.ph:			; CHECK-NEXT: vector.ph:
	; CHECK-NEXT: Successor(s): vector loop			; CHECK-NEXT: Successor(s): vector loop
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <x1> vector loop: {			; CHECK-NEXT: <x1> vector loop: {
	; CHECK-NEXT: vector.body:			; CHECK-NEXT: vector.body:
	; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION			; CHECK-NEXT: EMIT vp<[[CAN_IV:%.+]]> = CANONICAL-INDUCTION
	; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[CAN_IV]]>, ir<%n>, ir<-1>			; CHECK-NEXT: vp<[[DERIVED_IV:%.+]]> = DERIVED-IV vp<[[CAN_IV]]>, ir<%n>, ir<-1>
				; CHECK-NEXT: vp<[[SCALAR_STEPS:%.+]]> = SCALAR-STEPS vp<[[DERIVED_IV]]>, ir<-1>
	; CHECK-NEXT: EMIT vp<[[WIDE_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>			; CHECK-NEXT: EMIT vp<[[WIDE_IV:%.+]]> = WIDEN-CANONICAL-INDUCTION vp<[[CAN_IV]]>
	; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule vp<[[WIDE_IV]]> vp<[[BTC]]>			; CHECK-NEXT: EMIT vp<[[MASK:%.+]]> = icmp ule vp<[[WIDE_IV]]> vp<[[BTC]]>
	; CHECK-NEXT: Successor(s): loop.0			; CHECK-NEXT: Successor(s): loop.0
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: loop.0:			; CHECK-NEXT: loop.0:
	; CHECK-NEXT: Successor(s): pred.store			; CHECK-NEXT: Successor(s): pred.store
	; CHECK-EMPTY:			; CHECK-EMPTY:
	; CHECK-NEXT: <xVFxUF> pred.store: {			; CHECK-NEXT: <xVFxUF> pred.store: {
	▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 473523

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/VPlan.h

llvm/lib/Transforms/Vectorize/VPlan.cpp

llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

llvm/lib/Transforms/Vectorize/VPlanValue.h

llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll

llvm/test/Transforms/LoopVectorize/AArch64/widen-call-with-intrinsic-or-libfunc.ll

llvm/test/Transforms/LoopVectorize/RISCV/riscv-vector-reverse.ll

llvm/test/Transforms/LoopVectorize/first-order-recurrence-chains-vplan.ll

llvm/test/Transforms/LoopVectorize/first-order-recurrence-sink-replicate-region.ll

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll

llvm/test/Transforms/LoopVectorize/interleave-and-scalarize-only.ll

llvm/test/Transforms/LoopVectorize/vplan-dot-printing.ll

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

llvm/test/Transforms/LoopVectorize/vplan-sink-scalars-and-merge-vf1.ll

llvm/test/Transforms/LoopVectorize/vplan-sink-scalars-and-merge.ll

[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe.
ClosedPublic