Download Raw Diff

Details

Reviewers

Ayal
gilr
rengolin

Commits

rG3e58dd19dfa3: [LV] Move reduction PHI node fixup to VPlan::execute (NFC).

Summary

All information to fix-up the reduction phi nodes in the vectorized loop
is available in VPlan now. This patch moves the code to do so, to make
this clearer. Fixing up the loop exit value still relies on other
information and remains outside of VPlan for now.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision.Apr 8 2021, 8:03 AM

Herald added subscribers: tschuett, bmahjour, psnobl and 3 others. · View Herald TranscriptApr 8 2021, 8:03 AM

fhahn requested review of this revision.Apr 8 2021, 8:03 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 8 2021, 8:03 AM

Herald added a subscriber: vkmr. · View Herald Transcript

Harbormaster completed remote builds in B97735: Diff 336110.Apr 8 2021, 8:03 AM

fhahn mentioned this in D99294: [VPlan] Representing backedge def-use feeding reduction phis..Apr 8 2021, 8:09 AM

Ayal added inline comments.May 3 2021, 12:47 AM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9688	better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning, than during its execute? Admittedly goes back to D98435. There are 3 types of reductions: (1) parallel lanes: each lane computes an independent partial-sum; (2) parallel parts: each vector computes an independent partial sum, in-loop within vectors but not across parts, and (3) serial: in-loop across vectors and parts, the header phi operates on a single scalar. Would be good to explicitly indicate which type of reduction is chosen, in both ReductionRecipe(s) and the header phi; it effects their code-gen.
llvm/lib/Transforms/Vectorize/VPlan.cpp
63	Defining a method of LVP inside VPlan.cpp seems odd, at best. The use of a gating EnableStrictReductions knob (in contrast to a CM cost-based decision), belongs to Legal rather than CM or Planner? I.e., Legal should bail out from vectorizing a loop having an ordered reduction, when !EnableStrictReductions? Admittedly goes back to D98435.
892	Moving this to VPlan::execute is great! Can use getSingleVPValue(), getBackedgeValue(). May be worth explaining that Val is first set per-part supporting parallel partial sums, either vector/per-lane (1) or scalar/per-part (2), and then set to last-part supporting serial reduction (3). Admittedly goes back to D98435.

Rebased and updated the code to check whether to use ordered reductions during codegen to get the VPReductionRecipe and check isOrdered on the reduction descriptr. This builds on the code to determine whether a VPWidenPHI is part of an in-loop reduction.

Harbormaster completed remote builds in B103337: Diff 343863.May 8 2021, 1:54 PM

fhahn added inline comments.May 8 2021, 1:58 PM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
9688	better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning, than during its execute? Admittedly goes back to D98435. I think we already have all the required information available with the existing pieces. I updated the patch to check `isOrdered` on the recurrence descriptor of the VPReductionRecipe and a helper to get the VPReductionRecipe for a VPWidenPHIRecipe, if there's any.
llvm/lib/Transforms/Vectorize/VPlan.cpp
63	Agreed! I updated the code to check `isOrdered` on the reduction recurrence descriptor directly during code generation. To do so, I evolved the `isInLoopReduction` helper to an `getInLoopReduction` member that returns the `VPReductionRecipe`, if there's one.
892	I rebased the code, and it now uses getVPSingleValue/getBackedgeValue. I can add the additional comments separately or in this change.

fhahn mentioned this in D104989: [VPlan] Add VPReductionPHIRecipe (NFC)..Jun 27 2021, 3:08 AM

fhahn mentioned this in rG6c3451cd76cb: [VPlan] Add VPReductionPHIRecipe (NFC)..Jul 6 2021, 3:26 AM

Rebased after recent changes, should be much simpler now :)

Harbormaster completed remote builds in B117337: Diff 363313.Jul 31 2021, 1:11 PM

Ayal added inline comments.Aug 1 2021, 11:01 PM

llvm/lib/Transforms/Vectorize/VPlan.cpp
880	Place this next to hooking up incoming of FOR header phi's above, or even fuse both into a common loop over header phi's? The isOrdered case does behave quite similar to FOR phi's.

Merge FOR and reduction handling loops as suggested, thanks!

fhahn marked an inline comment as done.Aug 2 2021, 2:16 PM

fhahn added inline comments.

llvm/lib/Transforms/Vectorize/VPlan.cpp
880	done, thanks!

Harbormaster completed remote builds in B117522: Diff 363573.Aug 2 2021, 3:17 PM

Ayal added inline comments.Aug 3 2021, 4:20 PM

llvm/lib/Transforms/Vectorize/VPlan.cpp

823

The isOrdered case does behave quite similar to FOR phi's.

How about having a common treatment, e.g.,

for (VPRecipeBase &R : Header->phis()) {
  auto *PhiR = dyn_cast<VPWidenPHIRecipe>(&R);
  if (!PhiR)
    continue;
  // For first-order recurrences and in-order reduction phis, only a single part
  // is generated, which provides the last part from the previous iteration.
  // Otherwise all UF parts are generated.
  bool IsOrdered = (isa<VPFirstOrderRecurrencePHIRecipe>(&R)
                    || cast<VPReductionPHIRecipe>(&R)->isOrdered());
  unsigned LastPartForNewPhi = IsOrdered ? 1 : State->UF;
  for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) {
    Value *VecPhi = State->get(PhiR->getVPSingleValue(), Part);
      // Or suffice State->get(PhiR, Part)?
    Value *Val = State->get(PhiR->getBackedgeValue(),
                            IsOrdered ? State->UF - 1 : Part);
    cast<PHINode>(VecPhi)->addIncoming(Val, VectorLatchBB);
  }

Merge FOR & reduction phi handling as suggested, thanks!

fhahn added inline comments.Aug 5 2021, 11:03 AM

llvm/lib/Transforms/Vectorize/VPlan.cpp
823	Looks good, updated with a small re-naming. of `IsOrdered` -> `SinglePartNeeded` and a check to limit to FOR and reduction PHIs only.

Harbormaster completed remote builds in B118218: Diff 364552.Aug 5 2021, 11:40 AM

Thanks! This looks good to me ;-)

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
4309	nit: place above comment about start value where it belongs?

This revision is now accepted and ready to land.Aug 5 2021, 1:18 PM

fhahn marked an inline comment as done.Aug 6 2021, 12:17 AM

fhahn added inline comments.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
4309	I'll move it to `VPReductionPHIRecipe::execute`, which creates & sets the start value.

Closed by commit rG3e58dd19dfa3: [LV] Move reduction PHI node fixup to VPlan::execute (NFC). (authored by fhahn). · Explain WhyAug 6 2021, 12:30 AM

This revision was automatically updated to reflect the committed changes.

fhahn marked an inline comment as done.

fhahn added a commit: rG3e58dd19dfa3: [LV] Move reduction PHI node fixup to VPlan::execute (NFC)..

Diff 364712

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 588 Lines • ▼ Show 20 Lines	protected:

/// Handle all cross-iteration phis in the header.		/// Handle all cross-iteration phis in the header.
void fixCrossIterationPHIs(VPTransformState &State);		void fixCrossIterationPHIs(VPTransformState &State);

/// Create the exit value of first order recurrences in the middle block and		/// Create the exit value of first order recurrences in the middle block and
/// update their users.		/// update their users.
void fixFirstOrderRecurrence(VPWidenPHIRecipe *PhiR, VPTransformState &State);		void fixFirstOrderRecurrence(VPWidenPHIRecipe *PhiR, VPTransformState &State);

/// Fix a reduction cross-iteration phi. This is the second phase of		/// Create code for the loop exit value of the reduction.
/// vectorizing this phi node.
void fixReduction(VPReductionPHIRecipe *Phi, VPTransformState &State);		void fixReduction(VPReductionPHIRecipe *Phi, VPTransformState &State);

/// Clear NSW/NUW flags from reduction instructions if necessary.		/// Clear NSW/NUW flags from reduction instructions if necessary.
void clearReductionWrapFlags(const RecurrenceDescriptor &RdxDesc,		void clearReductionWrapFlags(const RecurrenceDescriptor &RdxDesc,
VPTransformState &State);		VPTransformState &State);

/// Fixup the LCSSA phi nodes in the unique exit block. This simply		/// Fixup the LCSSA phi nodes in the unique exit block. This simply
/// means we need to add the appropriate incoming value from the middle		/// means we need to add the appropriate incoming value from the middle
▲ Show 20 Lines • Show All 3,691 Lines • ▼ Show 20 Lines	void InnerLoopVectorizer::fixReduction(VPReductionPHIRecipe *PhiR,

VPValue *LoopExitInstDef = State.Plan->getVPValue(LoopExitInst);		VPValue *LoopExitInstDef = State.Plan->getVPValue(LoopExitInst);
// This is the vector-clone of the value that leaves the loop.		// This is the vector-clone of the value that leaves the loop.
Type *VecTy = State.get(LoopExitInstDef, 0)->getType();		Type *VecTy = State.get(LoopExitInstDef, 0)->getType();

// Wrap flags are in general invalid after vectorization, clear them.		// Wrap flags are in general invalid after vectorization, clear them.
clearReductionWrapFlags(RdxDesc, State);		clearReductionWrapFlags(RdxDesc, State);

// Fix the vector-loop phi.

// Reductions do not have to start at zero. They can start with
// any loop invariant values.
AyalUnsubmitted Done Reply Inline Actions nit: place above comment about start value where it belongs? Ayal: nit: place above comment about start value where it belongs?
fhahnAuthorUnsubmitted Done Reply Inline Actions I'll move it to `VPReductionPHIRecipe::execute`, which creates & sets the start value. fhahn: I'll move it to `VPReductionPHIRecipe::execute`, which creates & sets the start value.
BasicBlock *VectorLoopLatch = LI->getLoopFor(LoopVectorBody)->getLoopLatch();

unsigned LastPartForNewPhi = PhiR->isOrdered() ? 1 : UF;
for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) {
Value *VecRdxPhi = State.get(PhiR->getVPSingleValue(), Part);
Value *Val = State.get(PhiR->getBackedgeValue(), Part);
if (PhiR->isOrdered())
Val = State.get(PhiR->getBackedgeValue(), UF - 1);

cast<PHINode>(VecRdxPhi)->addIncoming(Val, VectorLoopLatch);
}

// Before each round, move the insertion point right between		// Before each round, move the insertion point right between
// the PHIs and the values we are going to write.		// the PHIs and the values we are going to write.
// This allows us to write both PHINodes and the extractelement		// This allows us to write both PHINodes and the extractelement
// instructions.		// instructions.
Builder.SetInsertPoint(&*LoopMiddleBlock->getFirstInsertionPt());		Builder.SetInsertPoint(&*LoopMiddleBlock->getFirstInsertionPt());

setDebugLocFromInst(LoopExitInst);		setDebugLocFromInst(LoopExitInst);

▲ Show 20 Lines • Show All 5,367 Lines • ▼ Show 20 Lines	State.ILV->vectorizeInterleaveGroup(IG, definedValues(), State, getAddr(),
getStoredValues(), getMask());		getStoredValues(), getMask());
}		}

void VPReductionRecipe::execute(VPTransformState &State) {		void VPReductionRecipe::execute(VPTransformState &State) {
assert(!State.Instance && "Reduction being replicated.");		assert(!State.Instance && "Reduction being replicated.");
Value *PrevInChain = State.get(getChainOp(), 0);		Value *PrevInChain = State.get(getChainOp(), 0);
for (unsigned Part = 0; Part < State.UF; ++Part) {		for (unsigned Part = 0; Part < State.UF; ++Part) {
RecurKind Kind = RdxDesc->getRecurrenceKind();		RecurKind Kind = RdxDesc->getRecurrenceKind();
bool IsOrdered = State.ILV->useOrderedReductions(*RdxDesc);		bool IsOrdered = State.ILV->useOrderedReductions(*RdxDesc);
		AyalUnsubmitted Not Done Reply Inline Actions better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning, than during its execute? Admittedly goes back to D98435. There are 3 types of reductions: (1) parallel lanes: each lane computes an independent partial-sum; (2) parallel parts: each vector computes an independent partial sum, in-loop within vectors but not across parts, and (3) serial: in-loop across vectors and parts, the header phi operates on a single scalar. Would be good to explicitly indicate which type of reduction is chosen, in both ReductionRecipe(s) and the header phi; it effects their code-gen. Ayal: better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning, than…
		fhahnAuthorUnsubmitted Done Reply Inline Actions better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning, than during its execute? Admittedly goes back to D98435. I think we already have all the required information available with the existing pieces. I updated the patch to check `isOrdered` on the recurrence descriptor of the VPReductionRecipe and a helper to get the VPReductionRecipe for a VPWidenPHIRecipe, if there's any. fhahn: > better turn on an IsOrdered bit inside VPReductionRecipe during its creation or planning…
Value *NewVecOp = State.get(getVecOp(), Part);		Value *NewVecOp = State.get(getVecOp(), Part);
if (VPValue *Cond = getCondOp()) {		if (VPValue *Cond = getCondOp()) {
Value *NewCond = State.get(Cond, Part);		Value *NewCond = State.get(Cond, Part);
VectorType *VecTy = cast<VectorType>(NewVecOp->getType());		VectorType *VecTy = cast<VectorType>(NewVecOp->getType());
Constant *Iden = RecurrenceDescriptor::getRecurrenceIdentity(		Constant *Iden = RecurrenceDescriptor::getRecurrenceIdentity(
Kind, VecTy->getElementType(), RdxDesc->getFastMathFlags());		Kind, VecTy->getElementType(), RdxDesc->getFastMathFlags());
Constant *IdenVec =		Constant *IdenVec =
ConstantVector::getSplat(VecTy->getElementCount(), Iden);		ConstantVector::getSplat(VecTy->getElementCount(), Iden);
▲ Show 20 Lines • Show All 839 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlan.cpp

Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	raw_ostream &llvm::operator<<(raw_ostream &OS, const VPValue &V) {
const VPInstruction *Instr = dyn_cast<VPInstruction>(&V);		const VPInstruction *Instr = dyn_cast<VPInstruction>(&V);
VPSlotTracker SlotTracker(		VPSlotTracker SlotTracker(
(Instr && Instr->getParent()) ? Instr->getParent()->getPlan() : nullptr);		(Instr && Instr->getParent()) ? Instr->getParent()->getPlan() : nullptr);
V.print(OS, SlotTracker);		V.print(OS, SlotTracker);
return OS;		return OS;
}		}
#endif		#endif

Value *VPLane::getAsRuntimeExpr(IRBuilder<> &Builder,		Value *VPLane::getAsRuntimeExpr(IRBuilder<> &Builder,
		AyalUnsubmitted Not Done Reply Inline Actions Defining a method of LVP inside VPlan.cpp seems odd, at best. The use of a gating EnableStrictReductions knob (in contrast to a CM cost-based decision), belongs to Legal rather than CM or Planner? I.e., Legal should bail out from vectorizing a loop having an ordered reduction, when !EnableStrictReductions? Admittedly goes back to D98435. Ayal: Defining a method of LVP inside VPlan.cpp seems odd, at best. The use of a gating…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Agreed! I updated the code to check `isOrdered` on the reduction recurrence descriptor directly during code generation. To do so, I evolved the `isInLoopReduction` helper to an `getInLoopReduction` member that returns the `VPReductionRecipe`, if there's one. fhahn: Agreed! I updated the code to check `isOrdered` on the reduction recurrence descriptor directly…
const ElementCount &VF) const {		const ElementCount &VF) const {
switch (LaneKind) {		switch (LaneKind) {
case VPLane::Kind::ScalableLast:		case VPLane::Kind::ScalableLast:
// Lane = RuntimeVF - VF.getKnownMinValue() + Lane		// Lane = RuntimeVF - VF.getKnownMinValue() + Lane
return Builder.CreateSub(getRuntimeVF(Builder, Builder.getInt32Ty(), VF),		return Builder.CreateSub(getRuntimeVF(Builder, Builder.getInt32Ty(), VF),
Builder.getInt32(VF.getKnownMinValue() - Lane));		Builder.getInt32(VF.getKnownMinValue() - Lane));
case VPLane::Kind::First:		case VPLane::Kind::First:
return Builder.getInt32(Lane);		return Builder.getInt32(Lane);
▲ Show 20 Lines • Show All 738 Lines • ▼ Show 20 Lines	void VPlan::execute(VPTransformState *State) {
// 2. Generate code in loop body.		// 2. Generate code in loop body.
State->CFG.PrevVPBB = nullptr;		State->CFG.PrevVPBB = nullptr;
State->CFG.PrevBB = VectorHeaderBB;		State->CFG.PrevBB = VectorHeaderBB;
State->CFG.LastBB = VectorLatchBB;		State->CFG.LastBB = VectorLatchBB;

for (VPBlockBase *Block : depth_first(Entry))		for (VPBlockBase *Block : depth_first(Entry))
Block->execute(State);		Block->execute(State);

// Fix the latch value of the first-order recurrences in the vector loop. Only		// Fix the latch value of reduction and first-order recurrences phis in the
// a single part is generated, regardless of the UF.		// vector loop.
VPBasicBlock *Header = Entry->getEntryBasicBlock();		VPBasicBlock *Header = Entry->getEntryBasicBlock();
for (VPRecipeBase &R : Header->phis()) {		for (VPRecipeBase &R : Header->phis()) {
if (auto *FOR = dyn_cast<VPFirstOrderRecurrencePHIRecipe>(&R)) {		auto *PhiR = dyn_cast<VPWidenPHIRecipe>(&R);
auto *VecPhi = cast<PHINode>(State->get(FOR, 0));		if (!PhiR \|\| !(isa<VPFirstOrderRecurrencePHIRecipe>(&R) \|\|
		AyalUnsubmitted Not Done Reply Inline Actions The isOrdered case does behave quite similar to FOR phi's. How about having a common treatment, e.g., for (VPRecipeBase &R : Header->phis()) { auto PhiR = dyn_cast<VPWidenPHIRecipe>(&R); if (!PhiR) continue; // For first-order recurrences and in-order reduction phis, only a single part // is generated, which provides the last part from the previous iteration. // Otherwise all UF parts are generated. bool IsOrdered = (isa<VPFirstOrderRecurrencePHIRecipe>(&R) \|\| cast<VPReductionPHIRecipe>(&R)->isOrdered()); unsigned LastPartForNewPhi = IsOrdered ? 1 : State->UF; for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) { Value VecPhi = State->get(PhiR->getVPSingleValue(), Part); // Or suffice State->get(PhiR, Part)? Value Val = State->get(PhiR->getBackedgeValue(), IsOrdered ? State->UF - 1 : Part); cast<PHINode>(VecPhi)->addIncoming(Val, VectorLatchBB); } ? Ayal:* > The isOrdered case does behave quite similar to FOR phi's. How about having a common…
		fhahnAuthorUnsubmitted Done Reply Inline Actions Looks good, updated with a small re-naming. of `IsOrdered` -> `SinglePartNeeded` and a check to limit to FOR and reduction PHIs only. fhahn: Looks good, updated with a small re-naming. of `IsOrdered` -> `SinglePartNeeded` and a check to…
		isa<VPReductionPHIRecipe>(&R)))
VPValue *PreviousDef = FOR->getBackedgeValue();		continue;
Value *Incoming = State->get(PreviousDef, State->UF - 1);		// For first-order recurrences and in-order reduction phis, only a single
VecPhi->addIncoming(Incoming, VectorLatchBB);		// part is generated, which provides the last part from the previous
		// iteration. Otherwise all UF parts are generated.
		bool SinglePartNeeded = isa<VPFirstOrderRecurrencePHIRecipe>(&R) \|\|
		cast<VPReductionPHIRecipe>(&R)->isOrdered();
		unsigned LastPartForNewPhi = SinglePartNeeded ? 1 : State->UF;
		for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) {
		Value *VecPhi = State->get(PhiR, Part);
		Value *Val = State->get(PhiR->getBackedgeValue(),
		SinglePartNeeded ? State->UF - 1 : Part);
		cast<PHINode>(VecPhi)->addIncoming(Val, VectorLatchBB);
}		}
}		}

// Setup branch terminator successors for VPBBs in VPBBsToFix based on		// Setup branch terminator successors for VPBBs in VPBBsToFix based on
// VPBB's successors.		// VPBB's successors.
for (auto VPBB : State->CFG.VPBBsToFix) {		for (auto VPBB : State->CFG.VPBBsToFix) {
assert(EnableVPlanNativePath &&		assert(EnableVPlanNativePath &&
"Unexpected VPBBsToFix in non VPlan-native path");		"Unexpected VPBBsToFix in non VPlan-native path");
Show All 27 Lines	void VPlan::execute(VPTransformState *State) {
assert(Merged && "Could not merge last basic block with latch.");		assert(Merged && "Could not merge last basic block with latch.");
VectorLatchBB = LastBB;		VectorLatchBB = LastBB;

// We do not attempt to preserve DT for outer loop vectorization currently.		// We do not attempt to preserve DT for outer loop vectorization currently.
if (!EnableVPlanNativePath)		if (!EnableVPlanNativePath)
updateDominatorTree(State->DT, VectorPreHeaderBB, VectorLatchBB,		updateDominatorTree(State->DT, VectorPreHeaderBB, VectorLatchBB,
L->getExitBlock());		L->getExitBlock());
}		}

		AyalUnsubmitted Done Reply Inline Actions Place this next to hooking up incoming of FOR header phi's above, or even fuse both into a common loop over header phi's? The isOrdered case does behave quite similar to FOR phi's. Ayal: Place this next to hooking up incoming of FOR header phi's above, or even fuse both into a…
		fhahnAuthorUnsubmitted Done Reply Inline Actions done, thanks! fhahn: done, thanks!
#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
LLVM_DUMP_METHOD		LLVM_DUMP_METHOD
void VPlan::print(raw_ostream &O) const {		void VPlan::print(raw_ostream &O) const {
VPSlotTracker SlotTracker(this);		VPSlotTracker SlotTracker(this);

O << "VPlan '" << Name << "' {";		O << "VPlan '" << Name << "' {";
for (const VPBlockBase *Block : depth_first(getEntry())) {		for (const VPBlockBase *Block : depth_first(getEntry())) {
O << '\n';		O << '\n';
Block->print(O, "", SlotTracker);		Block->print(O, "", SlotTracker);
}		}
O << "}\n";		O << "}\n";
}		}
		AyalUnsubmitted Not Done Reply Inline Actions Moving this to VPlan::execute is great! Can use getSingleVPValue(), getBackedgeValue(). May be worth explaining that Val is first set per-part supporting parallel partial sums, either vector/per-lane (1) or scalar/per-part (2), and then set to last-part supporting serial reduction (3). Admittedly goes back to D98435. Ayal: Moving this to VPlan::execute is great! Can use getSingleVPValue(), getBackedgeValue(). May be…
		fhahnAuthorUnsubmitted Done Reply Inline Actions I rebased the code, and it now uses getVPSingleValue/getBackedgeValue. I can add the additional comments separately or in this change. fhahn: I rebased the code, and it now uses getVPSingleValue/getBackedgeValue. I can add the additional…

LLVM_DUMP_METHOD		LLVM_DUMP_METHOD
void VPlan::printDOT(raw_ostream &O) const {		void VPlan::printDOT(raw_ostream &O) const {
VPlanPrinter Printer(O, *this);		VPlanPrinter Printer(O, *this);
Printer.dump();		Printer.dump();
}		}

LLVM_DUMP_METHOD		LLVM_DUMP_METHOD
▲ Show 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	void VPReductionPHIRecipe::execute(VPTransformState &State) {
assert(State.LI->getLoopFor(HeaderBB)->getHeader() == HeaderBB &&		assert(State.LI->getLoopFor(HeaderBB)->getHeader() == HeaderBB &&
"recipe must be in the vector loop header");		"recipe must be in the vector loop header");
unsigned LastPartForNewPhi = isOrdered() ? 1 : State.UF;		unsigned LastPartForNewPhi = isOrdered() ? 1 : State.UF;
for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) {		for (unsigned Part = 0; Part < LastPartForNewPhi; ++Part) {
Value *EntryPart =		Value *EntryPart =
PHINode::Create(VecTy, 2, "vec.phi", &*HeaderBB->getFirstInsertionPt());		PHINode::Create(VecTy, 2, "vec.phi", &*HeaderBB->getFirstInsertionPt());
State.set(this, EntryPart, Part);		State.set(this, EntryPart, Part);
}		}

		// Reductions do not have to start at zero. They can start with
		// any loop invariant values.
VPValue *StartVPV = getStartValue();		VPValue *StartVPV = getStartValue();
Value *StartV = StartVPV->getLiveInIRValue();		Value *StartV = StartVPV->getLiveInIRValue();

Value *Iden = nullptr;		Value *Iden = nullptr;
RecurKind RK = RdxDesc.getRecurrenceKind();		RecurKind RK = RdxDesc.getRecurrenceKind();
if (RecurrenceDescriptor::isMinMaxRecurrenceKind(RK)) {		if (RecurrenceDescriptor::isMinMaxRecurrenceKind(RK)) {
// MinMax reduction have the start value as their identify.		// MinMax reduction have the start value as their identify.
if (ScalarPHI) {		if (ScalarPHI) {
▲ Show 20 Lines • Show All 152 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[LV] Move reduction PHI node fixup to VPlan::execute (NFC).
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 364712

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/VPlan.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[LV] Move reduction PHI node fixup to VPlan::execute (NFC).ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 364712

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/VPlan.cpp

[LV] Move reduction PHI node fixup to VPlan::execute (NFC).
ClosedPublic