This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Vectorize/
-
Transforms/
-
Vectorize/
3/8
LoopVectorize.cpp
-
test/Transforms/LoopVectorize/
-
Transforms/
-
LoopVectorize/
-
RISCV/
-
divrem.ll
-
first-order-recurrence-sink-replicate-region.ll
-
first-order-recurrence.ll
-
if-pred-non-void.ll
1
induction.ll
1
pr44488-predication.ll
-
reduction-small-size.ll
-
vplan-printing.ll

Differential D132591

[LV] Use safe-divisor lowering for fixed vectors if profitable
ClosedPublic

Authored by reames on Aug 24 2022, 12:25 PM.

Download Raw Diff

Details

Reviewers

fhahn
david-arm

Commits

rG4c4c0d2c06de: [LV] Use safe-divisor lowering for fixed vectors if profitable

Summary

This extends the safe-divisor widening scheme recently added for scalable vectors to handle fixed vectors as well.

Two notes:

Due to how early this is used during costing, I couldn't figure out how to intersect this with the minimum bitwidth handling. So, in theory, we could chose to widen when a narrower scalarization would have been profitable.
This is mostly completion-ism from my side. I think it makes sense, but I'm not strongly motivated to push this currently. If there are any serious cost model concerns raised, I'm likely to abandon the patch.

Diff Detail

Event Timeline

reames created this revision.Aug 24 2022, 12:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 24 2022, 12:25 PM

Herald added subscribers: frasercrmck, luismarques, apazos and 21 others. · View Herald Transcript

reames requested review of this revision.Aug 24 2022, 12:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 24 2022, 12:25 PM

Herald added subscribers: alextsao1999, • pcwang-thead, MaskRay. · View Herald Transcript

Harbormaster completed remote builds in B183191: Diff 455327.Aug 24 2022, 2:17 PM

I like the idea of this patch - I imagine it's a bit similar to the setCostBasedWideningDecision function where we make a widening decision before doing the actual full loop costing. I guess if you had concerns about enabling this by default for all targets you could limit this to a RISCV TTI hook initially?

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
4519	This is just a minor point, but I wonder if ScalarizationCost is a better name, to distinguish from an actual scalar cost (i.e. VF=1)?
4551	In order to avoid creating the vector type twice here and below, perhaps create a local variable, i.e. Type VecTy = ToVectorTy(I->getType(), VF); ?
7154	It seems a bit unfortunate that we're calling getDivRemSpeculationCost twice here - once in this function and once in isScalarWithPredication. Would it be better to pull the switch statement out of isScalarWithPredication into a small, common function, i.e.: bool isDivRemScalarWithPredication(Instruction *I, ElementCount VF) { switch (ForceSafeDivisor) { case cl::BOU_UNSET: { const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF); return ScalarCost< SafeDivisorCost; } case cl::BOU_TRUE: return false; case cl::BOU_FALSE: return true; }; } Then isScalarWithPredication and this function can just call isDivRemScalarWithPredication. It also restricts the cost calculations to when ForceSafeDivisor is cl::BOU_UNSET.

In D132591#3760792, @david-arm wrote:

I guess if you had concerns about enabling this by default for all targets you could limit this to a RISCV TTI hook initially?

I have no concerns with simply enabling this. We have no evidence of problems, and we can revert if actual performance issues are revealed.

If you, or another reviewer, has concerns I'll simply drop the patch. As I said up front, I'm not strongly motivated to push this.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
4519	Will change.
4551	Was in the code being moved already, but sure, will change.
7154	You're misreading the code. We need to return a cost regardless of the value of the cl::opt. The only thing that changes is which one we return,

david-arm added inline comments.Sep 1 2022, 1:06 AM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

7154

My apologies, yes you're right. However, my previous comment still stands - we're calculating the costs twice, which feels inefficient. In that case, it's still possible to refactor this to get the costs just once. For example,

case Instruction::UDiv:
case Instruction::SDiv:
case Instruction::URem:
case Instruction::SRem:
  const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF);
  return isDivRemScalarWithPredication(ScalarCost, SafeDivisorCost) ? ScalarCost : SafeDivisorCost;

where

bool isDivRemScalarWithPredication(InstructionCost ScalarCost, InstructionCost  SafeDivisorCost) {
  switch (ForceSafeDivisor) {
  case cl::BOU_UNSET:
    return ScalarCost< SafeDivisorCost;
  case cl::BOU_TRUE:
    return false;
  case cl::BOU_FALSE:
    return true;
  };
}

and do something similar for the div/rem case in isScalarWithPredication.

Address review style comments.

Harbormaster completed remote builds in B185248: Diff 458207.Sep 6 2022, 11:12 AM

LGTM! Thanks for making the changes @reames.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
1442	nit: I realise I asked you to add this function, but could you add a brief comment above this function before landing the patch?

This revision is now accepted and ready to land.Sep 8 2022, 5:24 AM

I have no concerns with simply enabling this. We have no evidence of problems, and we can revert if actual performance issues are revealed.

Enabling this by default should be fine, AFAICT.

LGTM, thanks! I think it would be good to add -force-widen-divrem-via-safe-divisor=0 to induction.ll and pr44488-predication.ll to make sure they test the same as before.

llvm/test/Transforms/LoopVectorize/induction.ll
2036	I think the division here is used intentionally to force uses of an IV in predicated blocks and with this change we are now not checking for that scenario any longer. This should probably use `-force-widen-divrem-via-safe-divisor=0` to preserve the original behavior.
llvm/test/Transforms/LoopVectorize/pr44488-predication.ll
11	It looks like this test specially tests for a special case with branches and it should probably use `-force-widen-divrem-via-safe-divisor=0` .

This revision was landed with ongoing or failed builds.Sep 8 2022, 9:16 AM

Closed by commit rG4c4c0d2c06de: [LV] Use safe-divisor lowering for fixed vectors if profitable (authored by reames). · Explain Why

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG4c4c0d2c06de: [LV] Use safe-divisor lowering for fixed vectors if profitable.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Vectorize/

LoopVectorize.cpp

145 lines

test/

Transforms/

LoopVectorize/

RISCV/

divrem.ll

402 lines

first-order-recurrence-sink-replicate-region.ll

2 lines

first-order-recurrence.ll

6 lines

if-pred-non-void.ll

4 lines

induction.ll

776 lines

pr44488-predication.ll

34 lines

reduction-small-size.ll

51 lines

vplan-printing.ll

2 lines

Diff 455327

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 360 Lines • ▼ Show 20 Lines
cl::opt<bool> llvm::EnableLoopVectorization(		cl::opt<bool> llvm::EnableLoopVectorization(
"vectorize-loops", cl::init(true), cl::Hidden,		"vectorize-loops", cl::init(true), cl::Hidden,
cl::desc("Run the Loop vectorization passes"));		cl::desc("Run the Loop vectorization passes"));

cl::opt<bool> PrintVPlansInDotFormat(		cl::opt<bool> PrintVPlansInDotFormat(
"vplan-print-in-dot-format", cl::init(false), cl::Hidden,		"vplan-print-in-dot-format", cl::init(false), cl::Hidden,
cl::desc("Use dot format instead of plain text when dumping VPlans"));		cl::desc("Use dot format instead of plain text when dumping VPlans"));

		cl::opt<cl::boolOrDefault> ForceSafeDivisor(
		"force-widen-divrem-via-safe-divisor", cl::Hidden,
		cl::desc("Override cost based safe divisor widening for div/rem instructions"));

/// A helper function that returns true if the given type is irregular. The		/// A helper function that returns true if the given type is irregular. The
/// type is irregular if its allocated size doesn't equal the store size of an		/// type is irregular if its allocated size doesn't equal the store size of an
/// element of the corresponding vector type.		/// element of the corresponding vector type.
static bool hasIrregularType(Type *Ty, const DataLayout &DL) {		static bool hasIrregularType(Type *Ty, const DataLayout &DL) {
// Determine if an array of N elements of type Ty is "bitcast compatible"		// Determine if an array of N elements of type Ty is "bitcast compatible"
// with a <N x Ty> vector.		// with a <N x Ty> vector.
// This is only true if there is no padding between the array elements.		// This is only true if there is no padding between the array elements.
return DL.getTypeAllocSizeInBits(Ty) != DL.getTypeSizeInBits(Ty);		return DL.getTypeAllocSizeInBits(Ty) != DL.getTypeSizeInBits(Ty);
▲ Show 20 Lines • Show All 1,053 Lines • ▼ Show 20 Lines	public:
/// variables found for the given VF.		/// variables found for the given VF.
bool canVectorizeReductions(ElementCount VF) const {		bool canVectorizeReductions(ElementCount VF) const {
return (all_of(Legal->getReductionVars(), [&](auto &Reduction) -> bool {		return (all_of(Legal->getReductionVars(), [&](auto &Reduction) -> bool {
const RecurrenceDescriptor &RdxDesc = Reduction.second;		const RecurrenceDescriptor &RdxDesc = Reduction.second;
return TTI.isLegalToVectorizeReduction(RdxDesc, VF);		return TTI.isLegalToVectorizeReduction(RdxDesc, VF);
}));		}));
}		}

/// Returns true if \p I is an instruction which requires predication and		/// Returns true if \p I is an instruction which requires predication and
		david-armUnsubmitted Not Done Reply Inline Actions nit: I realise I asked you to add this function, but could you add a brief comment above this function before landing the patch? david-arm: nit: I realise I asked you to add this function, but could you add a brief comment above this…
/// for which our chosen predication strategy is scalarization (i.e. we		/// for which our chosen predication strategy is scalarization (i.e. we
/// don't have an alternate strategy such as masking available).		/// don't have an alternate strategy such as masking available).
/// \p VF is the vectorization factor that will be used to vectorize \p I.		/// \p VF is the vectorization factor that will be used to vectorize \p I.
bool isScalarWithPredication(Instruction *I, ElementCount VF) const;		bool isScalarWithPredication(Instruction *I, ElementCount VF) const;

/// Returns true if \p I is an instruction that needs to be predicated		/// Returns true if \p I is an instruction that needs to be predicated
/// at runtime. The result is independent of the predication mechanism.		/// at runtime. The result is independent of the predication mechanism.
/// Superset of instructions that return true for isScalarWithPredication.		/// Superset of instructions that return true for isScalarWithPredication.
bool isPredicatedInst(Instruction *I) const;		bool isPredicatedInst(Instruction *I) const;

		/// Return the costs for our two available strategies for lowering a
		/// div/rem operation which requires speculating at least one lane.
		/// First result is for scalarization (will be invalid for scalable
		/// vectors); second is for the safe-divisor strategy.
		std::pair<InstructionCost, InstructionCost>
		getDivRemSpeculationCost(Instruction *I,
		ElementCount VF) const;

/// Returns true if \p I is a memory instruction with consecutive memory		/// Returns true if \p I is a memory instruction with consecutive memory
/// access that can be widened.		/// access that can be widened.
bool		bool
memoryInstructionCanBeWidened(Instruction *I,		memoryInstructionCanBeWidened(Instruction *I,
ElementCount VF = ElementCount::getFixed(1));		ElementCount VF = ElementCount::getFixed(1));

/// Returns true if \p I is a memory instruction in an interleaved-group		/// Returns true if \p I is a memory instruction in an interleaved-group
/// of memory accesses that can be vectorized with wide vector loads/stores		/// of memory accesses that can be vectorized with wide vector loads/stores
▲ Show 20 Lines • Show All 2,973 Lines • ▼ Show 20 Lines	case Instruction::Store: {
return isa<LoadInst>(I) ? !(isLegalMaskedLoad(Ty, Ptr, Alignment) \|\|		return isa<LoadInst>(I) ? !(isLegalMaskedLoad(Ty, Ptr, Alignment) \|\|
TTI.isLegalMaskedGather(VTy, Alignment))		TTI.isLegalMaskedGather(VTy, Alignment))
: !(isLegalMaskedStore(Ty, Ptr, Alignment) \|\|		: !(isLegalMaskedStore(Ty, Ptr, Alignment) \|\|
TTI.isLegalMaskedScatter(VTy, Alignment));		TTI.isLegalMaskedScatter(VTy, Alignment));
}		}
case Instruction::UDiv:		case Instruction::UDiv:
case Instruction::SDiv:		case Instruction::SDiv:
case Instruction::SRem:		case Instruction::SRem:
case Instruction::URem:		case Instruction::URem: {
// We have the option to use the safe-divisor idiom to avoid predication.		// We have the option to use the safe-divisor idiom to avoid predication.
// At the moment this is only used for scalable (which legally can't		// The cost based decision here will always select safe-divisor for
// scalarize), but long term we want to make a cost based decision		// scalable vectors as scalarization isn't legal.
// for fixed length vectors as well.		const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF);
return !VF.isScalable();		switch (ForceSafeDivisor) {
		case cl::BOU_UNSET:
		return ScalarCost < SafeDivisorCost;
		case cl::BOU_TRUE:
		return false;
		case cl::BOU_FALSE:
		return true;
		};
		}
}		}
}		}

bool LoopVectorizationCostModel::isPredicatedInst(Instruction *I) const {		bool LoopVectorizationCostModel::isPredicatedInst(Instruction *I) const {
if (!blockNeedsPredicationForAnyReason(I->getParent()))		if (!blockNeedsPredicationForAnyReason(I->getParent()))
return false;		return false;

// Can we prove this instruction is safe to unconditionally execute?		// Can we prove this instruction is safe to unconditionally execute?
Show All 27 Lines	bool LoopVectorizationCostModel::isPredicatedInst(Instruction *I) const {
case Instruction::SRem:		case Instruction::SRem:
case Instruction::URem:		case Instruction::URem:
// TODO: We can use the loop-preheader as context point here and get		// TODO: We can use the loop-preheader as context point here and get
// context sensitive reasoning		// context sensitive reasoning
return !isSafeToSpeculativelyExecute(I);		return !isSafeToSpeculativelyExecute(I);
}		}
}		}

		std::pair<InstructionCost, InstructionCost>
		LoopVectorizationCostModel::getDivRemSpeculationCost(Instruction *I,
		ElementCount VF) const {
		assert(I->getOpcode() == Instruction::UDiv \|\|
		I->getOpcode() == Instruction::SDiv \|\|
		I->getOpcode() == Instruction::SRem \|\|
		I->getOpcode() == Instruction::URem);
		assert(!isSafeToSpeculativelyExecute(I));

		const TTI::TargetCostKind CostKind = TTI::TCK_RecipThroughput;

		// Scalarization isn't legal for scalable vector types
		InstructionCost ScalarCost = InstructionCost::getInvalid();
		david-armUnsubmitted Not Done Reply Inline Actions This is just a minor point, but I wonder if ScalarizationCost is a better name, to distinguish from an actual scalar cost (i.e. VF=1)? david-arm: This is just a minor point, but I wonder if ScalarizationCost is a better name, to distinguish…
		reamesAuthorUnsubmitted Done Reply Inline Actions Will change. reames: Will change.
		if (!VF.isScalable()) {
		// Get the scalarization cost and scale this amount by the probability of
		// executing the predicated block. If the instruction is not predicated,
		// we fall through to the next case.
		ScalarCost = 0;

		// These instructions have a non-void type, so account for the phi nodes
		// that we will create. This cost is likely to be zero. The phi node
		// cost, if any, should be scaled by the block probability because it
		// models a copy at the end of each predicated block.
		ScalarCost += VF.getKnownMinValue() *
		TTI.getCFInstrCost(Instruction::PHI, CostKind);

		// The cost of the non-predicated instruction.
		ScalarCost += VF.getKnownMinValue() *
		TTI.getArithmeticInstrCost(I->getOpcode(), I->getType(), CostKind);

		// The cost of insertelement and extractelement instructions needed for
		// scalarization.
		ScalarCost += getScalarizationOverhead(I, VF);

		// Scale the cost by the probability of executing the predicated blocks.
		// This assumes the predicated block for each vector lane is equally
		// likely.
		ScalarCost = ScalarCost / getReciprocalPredBlockProb();
		}
		InstructionCost SafeDivisorCost = 0;

		// The cost of the select guard to ensure all lanes are well defined
		// after we speculate above any internal control flow.
		SafeDivisorCost += TTI.getCmpSelInstrCost(
		Instruction::Select, ToVectorTy(I->getType(), VF),
		david-armUnsubmitted Not Done Reply Inline Actions In order to avoid creating the vector type twice here and below, perhaps create a local variable, i.e. Type VecTy = ToVectorTy(I->getType(), VF); ? david-arm: In order to avoid creating the vector type twice here and below, perhaps create a local…
		reamesAuthorUnsubmitted Done Reply Inline Actions Was in the code being moved already, but sure, will change. reames: Was in the code being moved already, but sure, will change.
		ToVectorTy(Type::getInt1Ty(I->getContext()), VF),
		CmpInst::BAD_ICMP_PREDICATE, CostKind);

		// Certain instructions can be cheaper to vectorize if they have a constant
		// second vector operand. One example of this are shifts on x86.
		Value *Op2 = I->getOperand(1);
		auto Op2Info = TTI.getOperandInfo(Op2);
		if (Op2Info.Kind == TargetTransformInfo::OK_AnyValue && Legal->isUniform(Op2))
		Op2Info.Kind = TargetTransformInfo::OK_UniformValue;

		SmallVector<const Value *, 4> Operands(I->operand_values());
		SafeDivisorCost += TTI.getArithmeticInstrCost(
		I->getOpcode(), ToVectorTy(I->getType(), VF), CostKind,
		{TargetTransformInfo::OK_AnyValue, TargetTransformInfo::OP_None},
		Op2Info, Operands, I);
		return {ScalarCost, SafeDivisorCost};
		}

bool LoopVectorizationCostModel::interleavedAccessCanBeWidened(		bool LoopVectorizationCostModel::interleavedAccessCanBeWidened(
Instruction *I, ElementCount VF) {		Instruction *I, ElementCount VF) {
assert(isAccessInterleaved(I) && "Expecting interleaved access.");		assert(isAccessInterleaved(I) && "Expecting interleaved access.");
assert(getWideningDecision(I, VF) == CM_Unknown &&		assert(getWideningDecision(I, VF) == CM_Unknown &&
"Decision should not be set yet.");		"Decision should not be set yet.");
auto *Group = getInterleavedAccessGroup(I);		auto *Group = getInterleavedAccessGroup(I);
assert(Group && "Must have a group.");		assert(Group && "Must have a group.");

▲ Show 20 Lines • Show All 2,567 Lines • ▼ Show 20 Lines	case Instruction::PHI: {

return TTI.getCFInstrCost(Instruction::PHI, CostKind);		return TTI.getCFInstrCost(Instruction::PHI, CostKind);
}		}
case Instruction::UDiv:		case Instruction::UDiv:
case Instruction::SDiv:		case Instruction::SDiv:
case Instruction::URem:		case Instruction::URem:
case Instruction::SRem:		case Instruction::SRem:
if (VF.isVector() && isPredicatedInst(I)) {		if (VF.isVector() && isPredicatedInst(I)) {
// If we're speculating lanes, we have two options - scalarization and		const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF);
// guarded widening.		return isScalarWithPredication(I, VF) ? ScalarCost : SafeDivisorCost;
		david-armUnsubmitted Not Done Reply Inline Actions It seems a bit unfortunate that we're calling getDivRemSpeculationCost twice here - once in this function and once in isScalarWithPredication. Would it be better to pull the switch statement out of isScalarWithPredication into a small, common function, i.e.: bool isDivRemScalarWithPredication(Instruction I, ElementCount VF) { switch (ForceSafeDivisor) { case cl::BOU_UNSET: { const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF); return ScalarCost< SafeDivisorCost; } case cl::BOU_TRUE: return false; case cl::BOU_FALSE: return true; }; } Then isScalarWithPredication and this function can just call isDivRemScalarWithPredication. It also restricts the cost calculations to when ForceSafeDivisor is cl::BOU_UNSET. david-arm:* It seems a bit unfortunate that we're calling getDivRemSpeculationCost twice here - once in…
		reamesAuthorUnsubmitted Done Reply Inline Actions You're misreading the code. We need to return a cost regardless of the value of the cl::opt. The only thing that changes is which one we return, reames: You're misreading the code. We need to return a cost regardless of the value of the cl::opt.
		david-armUnsubmitted Not Done Reply Inline Actions My apologies, yes you're right. However, my previous comment still stands - we're calculating the costs twice, which feels inefficient. In that case, it's still possible to refactor this to get the costs just once. For example, case Instruction::UDiv: case Instruction::SDiv: case Instruction::URem: case Instruction::SRem: const auto [ScalarCost, SafeDivisorCost] = getDivRemSpeculationCost(I, VF); return isDivRemScalarWithPredication(ScalarCost, SafeDivisorCost) ? ScalarCost : SafeDivisorCost; where bool isDivRemScalarWithPredication(InstructionCost ScalarCost, InstructionCost SafeDivisorCost) { switch (ForceSafeDivisor) { case cl::BOU_UNSET: return ScalarCost< SafeDivisorCost; case cl::BOU_TRUE: return false; case cl::BOU_FALSE: return true; }; } and do something similar for the div/rem case in isScalarWithPredication. david-arm: My apologies, yes you're right. However, my previous comment still stands - we're calculating…
if (isScalarWithPredication(I, VF)) {
// Get the scalarization cost and scale this amount by the probability of
// executing the predicated block. If the instruction is not predicated,
// we fall through to the next case.
InstructionCost Cost = 0;

// These instructions have a non-void type, so account for the phi nodes
// that we will create. This cost is likely to be zero. The phi node
// cost, if any, should be scaled by the block probability because it
// models a copy at the end of each predicated block.
Cost += VF.getKnownMinValue() *
TTI.getCFInstrCost(Instruction::PHI, CostKind);

// The cost of the non-predicated instruction.
Cost += VF.getKnownMinValue() *
TTI.getArithmeticInstrCost(I->getOpcode(), RetTy, CostKind);

// The cost of insertelement and extractelement instructions needed for
// scalarization.
Cost += getScalarizationOverhead(I, VF);

// Scale the cost by the probability of executing the predicated blocks.
// This assumes the predicated block for each vector lane is equally
// likely.
return Cost / getReciprocalPredBlockProb();
}
InstructionCost Cost = 0;

// The cost of the select guard to ensure all lanes are well defined
// after we speculate above any internal control flow.
Cost += TTI.getCmpSelInstrCost(
Instruction::Select, ToVectorTy(I->getType(), VF),
ToVectorTy(Type::getInt1Ty(I->getContext()), VF),
CmpInst::BAD_ICMP_PREDICATE, CostKind);

// Certain instructions can be cheaper to vectorize if they have a constant
// second vector operand. One example of this are shifts on x86.
Value *Op2 = I->getOperand(1);
auto Op2Info = TTI.getOperandInfo(Op2);
if (Op2Info.Kind == TargetTransformInfo::OK_AnyValue && Legal->isUniform(Op2))
Op2Info.Kind = TargetTransformInfo::OK_UniformValue;

SmallVector<const Value *, 4> Operands(I->operand_values());
Cost += TTI.getArithmeticInstrCost(
I->getOpcode(), VectorTy, CostKind,
{TargetTransformInfo::OK_AnyValue, TargetTransformInfo::OP_None},
Op2Info, Operands, I);
return Cost;
}		}
// We've proven all lanes safe to speculate, fall through.		// We've proven all lanes safe to speculate, fall through.
[[fallthrough]];		[[fallthrough]];
case Instruction::Add:		case Instruction::Add:
case Instruction::FAdd:		case Instruction::FAdd:
case Instruction::Sub:		case Instruction::Sub:
case Instruction::FSub:		case Instruction::FSub:
case Instruction::Mul:		case Instruction::Mul:
▲ Show 20 Lines • Show All 3,537 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/RISCV/divrem.ll

	Show First 20 Lines • Show All 480 Lines • ▼ Show 20 Lines
	; FIXED-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; FIXED-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; FIXED: vector.ph:			; FIXED: vector.ph:
	; FIXED-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i64> poison, i64 [[V:%.]], i32 0			; FIXED-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i64> poison, i64 [[V:%.]], i32 0
	; FIXED-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT]], <2 x i64> poison, <2 x i32> zeroinitializer			; FIXED-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT]], <2 x i64> poison, <2 x i32> zeroinitializer
	; FIXED-NEXT: [[BROADCAST_SPLATINSERT2:%.*]] = insertelement <2 x i64> poison, i64 [[V]], i32 0			; FIXED-NEXT: [[BROADCAST_SPLATINSERT2:%.*]] = insertelement <2 x i64> poison, i64 [[V]], i32 0
	; FIXED-NEXT: [[BROADCAST_SPLAT3:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT2]], <2 x i64> poison, <2 x i32> zeroinitializer			; FIXED-NEXT: [[BROADCAST_SPLAT3:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT2]], <2 x i64> poison, <2 x i32> zeroinitializer
	; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]			; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]
	; FIXED: vector.body:			; FIXED: vector.body:
	; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE9:%.*]] ]			; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0			; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0
	; FIXED-NEXT: [[TMP1:%.*]] = add i64 [[INDEX]], 2			; FIXED-NEXT: [[TMP1:%.*]] = add i64 [[INDEX]], 2
	; FIXED-NEXT: [[TMP2:%.]] = getelementptr inbounds i64, ptr [[A:%.]], i64 [[TMP0]]			; FIXED-NEXT: [[TMP2:%.]] = getelementptr inbounds i64, ptr [[A:%.]], i64 [[TMP0]]
	; FIXED-NEXT: [[TMP3:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[TMP1]]			; FIXED-NEXT: [[TMP3:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[TMP1]]
	; FIXED-NEXT: [[TMP4:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0			; FIXED-NEXT: [[TMP4:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0
	; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <2 x i64>, ptr [[TMP4]], align 8			; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <2 x i64>, ptr [[TMP4]], align 8
	; FIXED-NEXT: [[TMP5:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2			; FIXED-NEXT: [[TMP5:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2
	; FIXED-NEXT: [[WIDE_LOAD1:%.*]] = load <2 x i64>, ptr [[TMP5]], align 8			; FIXED-NEXT: [[WIDE_LOAD1:%.*]] = load <2 x i64>, ptr [[TMP5]], align 8
	; FIXED-NEXT: [[TMP6:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT]], zeroinitializer			; FIXED-NEXT: [[TMP6:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT]], zeroinitializer
	; FIXED-NEXT: [[TMP7:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT3]], zeroinitializer			; FIXED-NEXT: [[TMP7:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT3]], zeroinitializer
	; FIXED-NEXT: [[TMP8:%.*]] = extractelement <2 x i1> [[TMP6]], i32 0			; FIXED-NEXT: [[TMP8:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[BROADCAST_SPLAT]], <2 x i64> <i64 1, i64 1>
	; FIXED-NEXT: br i1 [[TMP8]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; FIXED-NEXT: [[TMP9:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[BROADCAST_SPLAT3]], <2 x i64> <i64 1, i64 1>
	; FIXED: pred.udiv.if:			; FIXED-NEXT: [[TMP10:%.*]] = udiv <2 x i64> [[WIDE_LOAD]], [[TMP8]]
	; FIXED-NEXT: [[TMP9:%.*]] = extractelement <2 x i64> [[WIDE_LOAD]], i32 0			; FIXED-NEXT: [[TMP11:%.*]] = udiv <2 x i64> [[WIDE_LOAD1]], [[TMP9]]
	; FIXED-NEXT: [[TMP10:%.*]] = udiv i64 [[TMP9]], [[V]]			; FIXED-NEXT: [[TMP12:%.*]] = xor <2 x i1> [[TMP6]], <i1 true, i1 true>
	; FIXED-NEXT: [[TMP11:%.*]] = insertelement <2 x i64> poison, i64 [[TMP10]], i32 0			; FIXED-NEXT: [[TMP13:%.*]] = xor <2 x i1> [[TMP7]], <i1 true, i1 true>
	; FIXED-NEXT: br label [[PRED_UDIV_CONTINUE]]			; FIXED-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[TMP10]], <2 x i64> [[WIDE_LOAD]]
	; FIXED: pred.udiv.continue:			; FIXED-NEXT: [[PREDPHI4:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[TMP11]], <2 x i64> [[WIDE_LOAD1]]
	; FIXED-NEXT: [[TMP12:%.*]] = phi <2 x i64> [ poison, [[VECTOR_BODY]] ], [ [[TMP11]], [[PRED_UDIV_IF]] ]			; FIXED-NEXT: store <2 x i64> [[PREDPHI]], ptr [[TMP4]], align 8
	; FIXED-NEXT: [[TMP13:%.*]] = extractelement <2 x i1> [[TMP6]], i32 1			; FIXED-NEXT: store <2 x i64> [[PREDPHI4]], ptr [[TMP5]], align 8
	; FIXED-NEXT: br i1 [[TMP13]], label [[PRED_UDIV_IF4:%.]], label [[PRED_UDIV_CONTINUE5:%.]]
	; FIXED: pred.udiv.if4:
	; FIXED-NEXT: [[TMP14:%.*]] = extractelement <2 x i64> [[WIDE_LOAD]], i32 1
	; FIXED-NEXT: [[TMP15:%.*]] = udiv i64 [[TMP14]], [[V]]
	; FIXED-NEXT: [[TMP16:%.*]] = insertelement <2 x i64> [[TMP12]], i64 [[TMP15]], i32 1
	; FIXED-NEXT: br label [[PRED_UDIV_CONTINUE5]]
	; FIXED: pred.udiv.continue5:
	; FIXED-NEXT: [[TMP17:%.*]] = phi <2 x i64> [ [[TMP12]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP16]], [[PRED_UDIV_IF4]] ]
	; FIXED-NEXT: [[TMP18:%.*]] = extractelement <2 x i1> [[TMP7]], i32 0
	; FIXED-NEXT: br i1 [[TMP18]], label [[PRED_UDIV_IF6:%.]], label [[PRED_UDIV_CONTINUE7:%.]]
	; FIXED: pred.udiv.if6:
	; FIXED-NEXT: [[TMP19:%.*]] = extractelement <2 x i64> [[WIDE_LOAD1]], i32 0
	; FIXED-NEXT: [[TMP20:%.*]] = udiv i64 [[TMP19]], [[V]]
	; FIXED-NEXT: [[TMP21:%.*]] = insertelement <2 x i64> poison, i64 [[TMP20]], i32 0
	; FIXED-NEXT: br label [[PRED_UDIV_CONTINUE7]]
	; FIXED: pred.udiv.continue7:
	; FIXED-NEXT: [[TMP22:%.*]] = phi <2 x i64> [ poison, [[PRED_UDIV_CONTINUE5]] ], [ [[TMP21]], [[PRED_UDIV_IF6]] ]
	; FIXED-NEXT: [[TMP23:%.*]] = extractelement <2 x i1> [[TMP7]], i32 1
	; FIXED-NEXT: br i1 [[TMP23]], label [[PRED_UDIV_IF8:%.*]], label [[PRED_UDIV_CONTINUE9]]
	; FIXED: pred.udiv.if8:
	; FIXED-NEXT: [[TMP24:%.*]] = extractelement <2 x i64> [[WIDE_LOAD1]], i32 1
	; FIXED-NEXT: [[TMP25:%.*]] = udiv i64 [[TMP24]], [[V]]
	; FIXED-NEXT: [[TMP26:%.*]] = insertelement <2 x i64> [[TMP22]], i64 [[TMP25]], i32 1
	; FIXED-NEXT: br label [[PRED_UDIV_CONTINUE9]]
	; FIXED: pred.udiv.continue9:
	; FIXED-NEXT: [[TMP27:%.*]] = phi <2 x i64> [ [[TMP22]], [[PRED_UDIV_CONTINUE7]] ], [ [[TMP26]], [[PRED_UDIV_IF8]] ]
	; FIXED-NEXT: [[TMP28:%.*]] = xor <2 x i1> [[TMP6]], <i1 true, i1 true>
	; FIXED-NEXT: [[TMP29:%.*]] = xor <2 x i1> [[TMP7]], <i1 true, i1 true>
	; FIXED-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[TMP17]], <2 x i64> [[WIDE_LOAD]]
	; FIXED-NEXT: [[PREDPHI10:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[TMP27]], <2 x i64> [[WIDE_LOAD1]]
	; FIXED-NEXT: [[TMP30:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0
	; FIXED-NEXT: store <2 x i64> [[PREDPHI]], ptr [[TMP30]], align 8
	; FIXED-NEXT: [[TMP31:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2
	; FIXED-NEXT: store <2 x i64> [[PREDPHI10]], ptr [[TMP31]], align 8
	; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 4			; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 4
	; FIXED-NEXT: [[TMP32:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024			; FIXED-NEXT: [[TMP14:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024
	; FIXED-NEXT: br i1 [[TMP32]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP10:![0-9]+]]			; FIXED-NEXT: br i1 [[TMP14]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP10:![0-9]+]]
	; FIXED: middle.block:			; FIXED: middle.block:
	; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024			; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024
	; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; FIXED: scalar.ph:			; FIXED: scalar.ph:
	; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; FIXED-NEXT: br label [[FOR_BODY:%.*]]			; FIXED-NEXT: br label [[FOR_BODY:%.*]]
	; FIXED: for.body:			; FIXED: for.body:
	; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]			; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]
	▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines
	; FIXED-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; FIXED-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; FIXED: vector.ph:			; FIXED: vector.ph:
	; FIXED-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i64> poison, i64 [[V:%.]], i32 0			; FIXED-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i64> poison, i64 [[V:%.]], i32 0
	; FIXED-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT]], <2 x i64> poison, <2 x i32> zeroinitializer			; FIXED-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT]], <2 x i64> poison, <2 x i32> zeroinitializer
	; FIXED-NEXT: [[BROADCAST_SPLATINSERT2:%.*]] = insertelement <2 x i64> poison, i64 [[V]], i32 0			; FIXED-NEXT: [[BROADCAST_SPLATINSERT2:%.*]] = insertelement <2 x i64> poison, i64 [[V]], i32 0
	; FIXED-NEXT: [[BROADCAST_SPLAT3:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT2]], <2 x i64> poison, <2 x i32> zeroinitializer			; FIXED-NEXT: [[BROADCAST_SPLAT3:%.*]] = shufflevector <2 x i64> [[BROADCAST_SPLATINSERT2]], <2 x i64> poison, <2 x i32> zeroinitializer
	; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]			; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]
	; FIXED: vector.body:			; FIXED: vector.body:
	; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_SDIV_CONTINUE9:%.*]] ]			; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0			; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0
	; FIXED-NEXT: [[TMP1:%.*]] = add i64 [[INDEX]], 2			; FIXED-NEXT: [[TMP1:%.*]] = add i64 [[INDEX]], 2
	; FIXED-NEXT: [[TMP2:%.]] = getelementptr inbounds i64, ptr [[A:%.]], i64 [[TMP0]]			; FIXED-NEXT: [[TMP2:%.]] = getelementptr inbounds i64, ptr [[A:%.]], i64 [[TMP0]]
	; FIXED-NEXT: [[TMP3:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[TMP1]]			; FIXED-NEXT: [[TMP3:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[TMP1]]
	; FIXED-NEXT: [[TMP4:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0			; FIXED-NEXT: [[TMP4:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0
	; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <2 x i64>, ptr [[TMP4]], align 8			; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <2 x i64>, ptr [[TMP4]], align 8
	; FIXED-NEXT: [[TMP5:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2			; FIXED-NEXT: [[TMP5:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2
	; FIXED-NEXT: [[WIDE_LOAD1:%.*]] = load <2 x i64>, ptr [[TMP5]], align 8			; FIXED-NEXT: [[WIDE_LOAD1:%.*]] = load <2 x i64>, ptr [[TMP5]], align 8
	; FIXED-NEXT: [[TMP6:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT]], zeroinitializer			; FIXED-NEXT: [[TMP6:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT]], zeroinitializer
	; FIXED-NEXT: [[TMP7:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT3]], zeroinitializer			; FIXED-NEXT: [[TMP7:%.*]] = icmp ne <2 x i64> [[BROADCAST_SPLAT3]], zeroinitializer
	; FIXED-NEXT: [[TMP8:%.*]] = extractelement <2 x i1> [[TMP6]], i32 0			; FIXED-NEXT: [[TMP8:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[BROADCAST_SPLAT]], <2 x i64> <i64 1, i64 1>
	; FIXED-NEXT: br i1 [[TMP8]], label [[PRED_SDIV_IF:%.]], label [[PRED_SDIV_CONTINUE:%.]]			; FIXED-NEXT: [[TMP9:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[BROADCAST_SPLAT3]], <2 x i64> <i64 1, i64 1>
	; FIXED: pred.sdiv.if:			; FIXED-NEXT: [[TMP10:%.*]] = sdiv <2 x i64> [[WIDE_LOAD]], [[TMP8]]
	; FIXED-NEXT: [[TMP9:%.*]] = extractelement <2 x i64> [[WIDE_LOAD]], i32 0			; FIXED-NEXT: [[TMP11:%.*]] = sdiv <2 x i64> [[WIDE_LOAD1]], [[TMP9]]
	; FIXED-NEXT: [[TMP10:%.*]] = sdiv i64 [[TMP9]], [[V]]			; FIXED-NEXT: [[TMP12:%.*]] = xor <2 x i1> [[TMP6]], <i1 true, i1 true>
	; FIXED-NEXT: [[TMP11:%.*]] = insertelement <2 x i64> poison, i64 [[TMP10]], i32 0			; FIXED-NEXT: [[TMP13:%.*]] = xor <2 x i1> [[TMP7]], <i1 true, i1 true>
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE]]			; FIXED-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[TMP10]], <2 x i64> [[WIDE_LOAD]]
	; FIXED: pred.sdiv.continue:			; FIXED-NEXT: [[PREDPHI4:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[TMP11]], <2 x i64> [[WIDE_LOAD1]]
	; FIXED-NEXT: [[TMP12:%.*]] = phi <2 x i64> [ poison, [[VECTOR_BODY]] ], [ [[TMP11]], [[PRED_SDIV_IF]] ]			; FIXED-NEXT: store <2 x i64> [[PREDPHI]], ptr [[TMP4]], align 8
	; FIXED-NEXT: [[TMP13:%.*]] = extractelement <2 x i1> [[TMP6]], i32 1			; FIXED-NEXT: store <2 x i64> [[PREDPHI4]], ptr [[TMP5]], align 8
	; FIXED-NEXT: br i1 [[TMP13]], label [[PRED_SDIV_IF4:%.]], label [[PRED_SDIV_CONTINUE5:%.]]
	; FIXED: pred.sdiv.if4:
	; FIXED-NEXT: [[TMP14:%.*]] = extractelement <2 x i64> [[WIDE_LOAD]], i32 1
	; FIXED-NEXT: [[TMP15:%.*]] = sdiv i64 [[TMP14]], [[V]]
	; FIXED-NEXT: [[TMP16:%.*]] = insertelement <2 x i64> [[TMP12]], i64 [[TMP15]], i32 1
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE5]]
	; FIXED: pred.sdiv.continue5:
	; FIXED-NEXT: [[TMP17:%.*]] = phi <2 x i64> [ [[TMP12]], [[PRED_SDIV_CONTINUE]] ], [ [[TMP16]], [[PRED_SDIV_IF4]] ]
	; FIXED-NEXT: [[TMP18:%.*]] = extractelement <2 x i1> [[TMP7]], i32 0
	; FIXED-NEXT: br i1 [[TMP18]], label [[PRED_SDIV_IF6:%.]], label [[PRED_SDIV_CONTINUE7:%.]]
	; FIXED: pred.sdiv.if6:
	; FIXED-NEXT: [[TMP19:%.*]] = extractelement <2 x i64> [[WIDE_LOAD1]], i32 0
	; FIXED-NEXT: [[TMP20:%.*]] = sdiv i64 [[TMP19]], [[V]]
	; FIXED-NEXT: [[TMP21:%.*]] = insertelement <2 x i64> poison, i64 [[TMP20]], i32 0
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE7]]
	; FIXED: pred.sdiv.continue7:
	; FIXED-NEXT: [[TMP22:%.*]] = phi <2 x i64> [ poison, [[PRED_SDIV_CONTINUE5]] ], [ [[TMP21]], [[PRED_SDIV_IF6]] ]
	; FIXED-NEXT: [[TMP23:%.*]] = extractelement <2 x i1> [[TMP7]], i32 1
	; FIXED-NEXT: br i1 [[TMP23]], label [[PRED_SDIV_IF8:%.*]], label [[PRED_SDIV_CONTINUE9]]
	; FIXED: pred.sdiv.if8:
	; FIXED-NEXT: [[TMP24:%.*]] = extractelement <2 x i64> [[WIDE_LOAD1]], i32 1
	; FIXED-NEXT: [[TMP25:%.*]] = sdiv i64 [[TMP24]], [[V]]
	; FIXED-NEXT: [[TMP26:%.*]] = insertelement <2 x i64> [[TMP22]], i64 [[TMP25]], i32 1
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE9]]
	; FIXED: pred.sdiv.continue9:
	; FIXED-NEXT: [[TMP27:%.*]] = phi <2 x i64> [ [[TMP22]], [[PRED_SDIV_CONTINUE7]] ], [ [[TMP26]], [[PRED_SDIV_IF8]] ]
	; FIXED-NEXT: [[TMP28:%.*]] = xor <2 x i1> [[TMP6]], <i1 true, i1 true>
	; FIXED-NEXT: [[TMP29:%.*]] = xor <2 x i1> [[TMP7]], <i1 true, i1 true>
	; FIXED-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP6]], <2 x i64> [[TMP17]], <2 x i64> [[WIDE_LOAD]]
	; FIXED-NEXT: [[PREDPHI10:%.*]] = select <2 x i1> [[TMP7]], <2 x i64> [[TMP27]], <2 x i64> [[WIDE_LOAD1]]
	; FIXED-NEXT: [[TMP30:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 0
	; FIXED-NEXT: store <2 x i64> [[PREDPHI]], ptr [[TMP30]], align 8
	; FIXED-NEXT: [[TMP31:%.*]] = getelementptr inbounds i64, ptr [[TMP2]], i32 2
	; FIXED-NEXT: store <2 x i64> [[PREDPHI10]], ptr [[TMP31]], align 8
	; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 4			; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 4
	; FIXED-NEXT: [[TMP32:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024			; FIXED-NEXT: [[TMP14:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024
	; FIXED-NEXT: br i1 [[TMP32]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP12:![0-9]+]]			; FIXED-NEXT: br i1 [[TMP14]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP12:![0-9]+]]
	; FIXED: middle.block:			; FIXED: middle.block:
	; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024			; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024
	; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; FIXED: scalar.ph:			; FIXED: scalar.ph:
	; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; FIXED-NEXT: br label [[FOR_BODY:%.*]]			; FIXED-NEXT: br label [[FOR_BODY:%.*]]
	; FIXED: for.body:			; FIXED: for.body:
	; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]			; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]
	▲ Show 20 Lines • Show All 362 Lines • ▼ Show 20 Lines
	; FIXED-LABEL: @predicated_sdiv_by_minus_one(			; FIXED-LABEL: @predicated_sdiv_by_minus_one(
	; FIXED-NEXT: iter.check:			; FIXED-NEXT: iter.check:
	; FIXED-NEXT: br i1 false, label [[VEC_EPILOG_SCALAR_PH:%.]], label [[VECTOR_MAIN_LOOP_ITER_CHECK:%.]]			; FIXED-NEXT: br i1 false, label [[VEC_EPILOG_SCALAR_PH:%.]], label [[VECTOR_MAIN_LOOP_ITER_CHECK:%.]]
	; FIXED: vector.main.loop.iter.check:			; FIXED: vector.main.loop.iter.check:
	; FIXED-NEXT: br i1 false, label [[VEC_EPILOG_PH:%.]], label [[VECTOR_PH:%.]]			; FIXED-NEXT: br i1 false, label [[VEC_EPILOG_PH:%.]], label [[VECTOR_PH:%.]]
	; FIXED: vector.ph:			; FIXED: vector.ph:
	; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]			; FIXED-NEXT: br label [[VECTOR_BODY:%.*]]
	; FIXED: vector.body:			; FIXED: vector.body:
	; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_SDIV_CONTINUE30:%.*]] ]			; FIXED-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0			; FIXED-NEXT: [[TMP0:%.*]] = add i64 [[INDEX]], 0
	; FIXED-NEXT: [[TMP1:%.]] = getelementptr inbounds i8, ptr [[A:%.]], i64 [[TMP0]]			; FIXED-NEXT: [[TMP1:%.*]] = add i64 [[INDEX]], 16
	; FIXED-NEXT: [[TMP2:%.*]] = getelementptr inbounds i8, ptr [[TMP1]], i32 0			; FIXED-NEXT: [[TMP2:%.]] = getelementptr inbounds i8, ptr [[A:%.]], i64 [[TMP0]]
	; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <16 x i8>, ptr [[TMP2]], align 1			; FIXED-NEXT: [[TMP3:%.*]] = getelementptr inbounds i8, ptr [[A]], i64 [[TMP1]]
	; FIXED-NEXT: [[TMP3:%.*]] = icmp ne <16 x i8> [[WIDE_LOAD]], <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>			; FIXED-NEXT: [[TMP4:%.*]] = getelementptr inbounds i8, ptr [[TMP2]], i32 0
	; FIXED-NEXT: [[TMP4:%.*]] = extractelement <16 x i1> [[TMP3]], i32 0			; FIXED-NEXT: [[WIDE_LOAD:%.*]] = load <16 x i8>, ptr [[TMP4]], align 1
	; FIXED-NEXT: br i1 [[TMP4]], label [[PRED_SDIV_IF:%.]], label [[PRED_SDIV_CONTINUE:%.]]			; FIXED-NEXT: [[TMP5:%.*]] = getelementptr inbounds i8, ptr [[TMP2]], i32 16
	; FIXED: pred.sdiv.if:			; FIXED-NEXT: [[WIDE_LOAD1:%.*]] = load <16 x i8>, ptr [[TMP5]], align 1
	; FIXED-NEXT: [[TMP5:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 0			; FIXED-NEXT: [[TMP6:%.*]] = icmp ne <16 x i8> [[WIDE_LOAD]], <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
	; FIXED-NEXT: [[TMP6:%.*]] = sdiv i8 [[TMP5]], -1			; FIXED-NEXT: [[TMP7:%.*]] = icmp ne <16 x i8> [[WIDE_LOAD1]], <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
	; FIXED-NEXT: [[TMP7:%.*]] = insertelement <16 x i8> poison, i8 [[TMP6]], i32 0			; FIXED-NEXT: [[TMP8:%.*]] = select <16 x i1> [[TMP6]], <16 x i8> <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>, <16 x i8> <i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1>
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE]]			; FIXED-NEXT: [[TMP9:%.*]] = select <16 x i1> [[TMP7]], <16 x i8> <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>, <16 x i8> <i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1>
	; FIXED: pred.sdiv.continue:			; FIXED-NEXT: [[TMP10:%.*]] = sdiv <16 x i8> [[WIDE_LOAD]], [[TMP8]]
	; FIXED-NEXT: [[TMP8:%.*]] = phi <16 x i8> [ poison, [[VECTOR_BODY]] ], [ [[TMP7]], [[PRED_SDIV_IF]] ]			; FIXED-NEXT: [[TMP11:%.*]] = sdiv <16 x i8> [[WIDE_LOAD1]], [[TMP9]]
	; FIXED-NEXT: [[TMP9:%.*]] = extractelement <16 x i1> [[TMP3]], i32 1			; FIXED-NEXT: [[TMP12:%.*]] = xor <16 x i1> [[TMP6]], <i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true>
	; FIXED-NEXT: br i1 [[TMP9]], label [[PRED_SDIV_IF1:%.]], label [[PRED_SDIV_CONTINUE2:%.]]			; FIXED-NEXT: [[TMP13:%.*]] = xor <16 x i1> [[TMP7]], <i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true>
	; FIXED: pred.sdiv.if1:			; FIXED-NEXT: [[PREDPHI:%.*]] = select <16 x i1> [[TMP6]], <16 x i8> [[TMP10]], <16 x i8> [[WIDE_LOAD]]
	; FIXED-NEXT: [[TMP10:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 1			; FIXED-NEXT: [[PREDPHI2:%.*]] = select <16 x i1> [[TMP7]], <16 x i8> [[TMP11]], <16 x i8> [[WIDE_LOAD1]]
	; FIXED-NEXT: [[TMP11:%.*]] = sdiv i8 [[TMP10]], -1			; FIXED-NEXT: store <16 x i8> [[PREDPHI]], ptr [[TMP4]], align 1
	; FIXED-NEXT: [[TMP12:%.*]] = insertelement <16 x i8> [[TMP8]], i8 [[TMP11]], i32 1			; FIXED-NEXT: store <16 x i8> [[PREDPHI2]], ptr [[TMP5]], align 1
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE2]]			; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 32
	; FIXED: pred.sdiv.continue2:			; FIXED-NEXT: [[TMP14:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024
	; FIXED-NEXT: [[TMP13:%.*]] = phi <16 x i8> [ [[TMP8]], [[PRED_SDIV_CONTINUE]] ], [ [[TMP12]], [[PRED_SDIV_IF1]] ]			; FIXED-NEXT: br i1 [[TMP14]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP18:![0-9]+]]
	; FIXED-NEXT: [[TMP14:%.*]] = extractelement <16 x i1> [[TMP3]], i32 2
	; FIXED-NEXT: br i1 [[TMP14]], label [[PRED_SDIV_IF3:%.]], label [[PRED_SDIV_CONTINUE4:%.]]
	; FIXED: pred.sdiv.if3:
	; FIXED-NEXT: [[TMP15:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 2
	; FIXED-NEXT: [[TMP16:%.*]] = sdiv i8 [[TMP15]], -1
	; FIXED-NEXT: [[TMP17:%.*]] = insertelement <16 x i8> [[TMP13]], i8 [[TMP16]], i32 2
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE4]]
	; FIXED: pred.sdiv.continue4:
	; FIXED-NEXT: [[TMP18:%.*]] = phi <16 x i8> [ [[TMP13]], [[PRED_SDIV_CONTINUE2]] ], [ [[TMP17]], [[PRED_SDIV_IF3]] ]
	; FIXED-NEXT: [[TMP19:%.*]] = extractelement <16 x i1> [[TMP3]], i32 3
	; FIXED-NEXT: br i1 [[TMP19]], label [[PRED_SDIV_IF5:%.]], label [[PRED_SDIV_CONTINUE6:%.]]
	; FIXED: pred.sdiv.if5:
	; FIXED-NEXT: [[TMP20:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 3
	; FIXED-NEXT: [[TMP21:%.*]] = sdiv i8 [[TMP20]], -1
	; FIXED-NEXT: [[TMP22:%.*]] = insertelement <16 x i8> [[TMP18]], i8 [[TMP21]], i32 3
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE6]]
	; FIXED: pred.sdiv.continue6:
	; FIXED-NEXT: [[TMP23:%.*]] = phi <16 x i8> [ [[TMP18]], [[PRED_SDIV_CONTINUE4]] ], [ [[TMP22]], [[PRED_SDIV_IF5]] ]
	; FIXED-NEXT: [[TMP24:%.*]] = extractelement <16 x i1> [[TMP3]], i32 4
	; FIXED-NEXT: br i1 [[TMP24]], label [[PRED_SDIV_IF7:%.]], label [[PRED_SDIV_CONTINUE8:%.]]
	; FIXED: pred.sdiv.if7:
	; FIXED-NEXT: [[TMP25:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 4
	; FIXED-NEXT: [[TMP26:%.*]] = sdiv i8 [[TMP25]], -1
	; FIXED-NEXT: [[TMP27:%.*]] = insertelement <16 x i8> [[TMP23]], i8 [[TMP26]], i32 4
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE8]]
	; FIXED: pred.sdiv.continue8:
	; FIXED-NEXT: [[TMP28:%.*]] = phi <16 x i8> [ [[TMP23]], [[PRED_SDIV_CONTINUE6]] ], [ [[TMP27]], [[PRED_SDIV_IF7]] ]
	; FIXED-NEXT: [[TMP29:%.*]] = extractelement <16 x i1> [[TMP3]], i32 5
	; FIXED-NEXT: br i1 [[TMP29]], label [[PRED_SDIV_IF9:%.]], label [[PRED_SDIV_CONTINUE10:%.]]
	; FIXED: pred.sdiv.if9:
	; FIXED-NEXT: [[TMP30:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 5
	; FIXED-NEXT: [[TMP31:%.*]] = sdiv i8 [[TMP30]], -1
	; FIXED-NEXT: [[TMP32:%.*]] = insertelement <16 x i8> [[TMP28]], i8 [[TMP31]], i32 5
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE10]]
	; FIXED: pred.sdiv.continue10:
	; FIXED-NEXT: [[TMP33:%.*]] = phi <16 x i8> [ [[TMP28]], [[PRED_SDIV_CONTINUE8]] ], [ [[TMP32]], [[PRED_SDIV_IF9]] ]
	; FIXED-NEXT: [[TMP34:%.*]] = extractelement <16 x i1> [[TMP3]], i32 6
	; FIXED-NEXT: br i1 [[TMP34]], label [[PRED_SDIV_IF11:%.]], label [[PRED_SDIV_CONTINUE12:%.]]
	; FIXED: pred.sdiv.if11:
	; FIXED-NEXT: [[TMP35:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 6
	; FIXED-NEXT: [[TMP36:%.*]] = sdiv i8 [[TMP35]], -1
	; FIXED-NEXT: [[TMP37:%.*]] = insertelement <16 x i8> [[TMP33]], i8 [[TMP36]], i32 6
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE12]]
	; FIXED: pred.sdiv.continue12:
	; FIXED-NEXT: [[TMP38:%.*]] = phi <16 x i8> [ [[TMP33]], [[PRED_SDIV_CONTINUE10]] ], [ [[TMP37]], [[PRED_SDIV_IF11]] ]
	; FIXED-NEXT: [[TMP39:%.*]] = extractelement <16 x i1> [[TMP3]], i32 7
	; FIXED-NEXT: br i1 [[TMP39]], label [[PRED_SDIV_IF13:%.]], label [[PRED_SDIV_CONTINUE14:%.]]
	; FIXED: pred.sdiv.if13:
	; FIXED-NEXT: [[TMP40:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 7
	; FIXED-NEXT: [[TMP41:%.*]] = sdiv i8 [[TMP40]], -1
	; FIXED-NEXT: [[TMP42:%.*]] = insertelement <16 x i8> [[TMP38]], i8 [[TMP41]], i32 7
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE14]]
	; FIXED: pred.sdiv.continue14:
	; FIXED-NEXT: [[TMP43:%.*]] = phi <16 x i8> [ [[TMP38]], [[PRED_SDIV_CONTINUE12]] ], [ [[TMP42]], [[PRED_SDIV_IF13]] ]
	; FIXED-NEXT: [[TMP44:%.*]] = extractelement <16 x i1> [[TMP3]], i32 8
	; FIXED-NEXT: br i1 [[TMP44]], label [[PRED_SDIV_IF15:%.]], label [[PRED_SDIV_CONTINUE16:%.]]
	; FIXED: pred.sdiv.if15:
	; FIXED-NEXT: [[TMP45:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 8
	; FIXED-NEXT: [[TMP46:%.*]] = sdiv i8 [[TMP45]], -1
	; FIXED-NEXT: [[TMP47:%.*]] = insertelement <16 x i8> [[TMP43]], i8 [[TMP46]], i32 8
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE16]]
	; FIXED: pred.sdiv.continue16:
	; FIXED-NEXT: [[TMP48:%.*]] = phi <16 x i8> [ [[TMP43]], [[PRED_SDIV_CONTINUE14]] ], [ [[TMP47]], [[PRED_SDIV_IF15]] ]
	; FIXED-NEXT: [[TMP49:%.*]] = extractelement <16 x i1> [[TMP3]], i32 9
	; FIXED-NEXT: br i1 [[TMP49]], label [[PRED_SDIV_IF17:%.]], label [[PRED_SDIV_CONTINUE18:%.]]
	; FIXED: pred.sdiv.if17:
	; FIXED-NEXT: [[TMP50:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 9
	; FIXED-NEXT: [[TMP51:%.*]] = sdiv i8 [[TMP50]], -1
	; FIXED-NEXT: [[TMP52:%.*]] = insertelement <16 x i8> [[TMP48]], i8 [[TMP51]], i32 9
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE18]]
	; FIXED: pred.sdiv.continue18:
	; FIXED-NEXT: [[TMP53:%.*]] = phi <16 x i8> [ [[TMP48]], [[PRED_SDIV_CONTINUE16]] ], [ [[TMP52]], [[PRED_SDIV_IF17]] ]
	; FIXED-NEXT: [[TMP54:%.*]] = extractelement <16 x i1> [[TMP3]], i32 10
	; FIXED-NEXT: br i1 [[TMP54]], label [[PRED_SDIV_IF19:%.]], label [[PRED_SDIV_CONTINUE20:%.]]
	; FIXED: pred.sdiv.if19:
	; FIXED-NEXT: [[TMP55:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 10
	; FIXED-NEXT: [[TMP56:%.*]] = sdiv i8 [[TMP55]], -1
	; FIXED-NEXT: [[TMP57:%.*]] = insertelement <16 x i8> [[TMP53]], i8 [[TMP56]], i32 10
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE20]]
	; FIXED: pred.sdiv.continue20:
	; FIXED-NEXT: [[TMP58:%.*]] = phi <16 x i8> [ [[TMP53]], [[PRED_SDIV_CONTINUE18]] ], [ [[TMP57]], [[PRED_SDIV_IF19]] ]
	; FIXED-NEXT: [[TMP59:%.*]] = extractelement <16 x i1> [[TMP3]], i32 11
	; FIXED-NEXT: br i1 [[TMP59]], label [[PRED_SDIV_IF21:%.]], label [[PRED_SDIV_CONTINUE22:%.]]
	; FIXED: pred.sdiv.if21:
	; FIXED-NEXT: [[TMP60:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 11
	; FIXED-NEXT: [[TMP61:%.*]] = sdiv i8 [[TMP60]], -1
	; FIXED-NEXT: [[TMP62:%.*]] = insertelement <16 x i8> [[TMP58]], i8 [[TMP61]], i32 11
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE22]]
	; FIXED: pred.sdiv.continue22:
	; FIXED-NEXT: [[TMP63:%.*]] = phi <16 x i8> [ [[TMP58]], [[PRED_SDIV_CONTINUE20]] ], [ [[TMP62]], [[PRED_SDIV_IF21]] ]
	; FIXED-NEXT: [[TMP64:%.*]] = extractelement <16 x i1> [[TMP3]], i32 12
	; FIXED-NEXT: br i1 [[TMP64]], label [[PRED_SDIV_IF23:%.]], label [[PRED_SDIV_CONTINUE24:%.]]
	; FIXED: pred.sdiv.if23:
	; FIXED-NEXT: [[TMP65:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 12
	; FIXED-NEXT: [[TMP66:%.*]] = sdiv i8 [[TMP65]], -1
	; FIXED-NEXT: [[TMP67:%.*]] = insertelement <16 x i8> [[TMP63]], i8 [[TMP66]], i32 12
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE24]]
	; FIXED: pred.sdiv.continue24:
	; FIXED-NEXT: [[TMP68:%.*]] = phi <16 x i8> [ [[TMP63]], [[PRED_SDIV_CONTINUE22]] ], [ [[TMP67]], [[PRED_SDIV_IF23]] ]
	; FIXED-NEXT: [[TMP69:%.*]] = extractelement <16 x i1> [[TMP3]], i32 13
	; FIXED-NEXT: br i1 [[TMP69]], label [[PRED_SDIV_IF25:%.]], label [[PRED_SDIV_CONTINUE26:%.]]
	; FIXED: pred.sdiv.if25:
	; FIXED-NEXT: [[TMP70:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 13
	; FIXED-NEXT: [[TMP71:%.*]] = sdiv i8 [[TMP70]], -1
	; FIXED-NEXT: [[TMP72:%.*]] = insertelement <16 x i8> [[TMP68]], i8 [[TMP71]], i32 13
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE26]]
	; FIXED: pred.sdiv.continue26:
	; FIXED-NEXT: [[TMP73:%.*]] = phi <16 x i8> [ [[TMP68]], [[PRED_SDIV_CONTINUE24]] ], [ [[TMP72]], [[PRED_SDIV_IF25]] ]
	; FIXED-NEXT: [[TMP74:%.*]] = extractelement <16 x i1> [[TMP3]], i32 14
	; FIXED-NEXT: br i1 [[TMP74]], label [[PRED_SDIV_IF27:%.]], label [[PRED_SDIV_CONTINUE28:%.]]
	; FIXED: pred.sdiv.if27:
	; FIXED-NEXT: [[TMP75:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 14
	; FIXED-NEXT: [[TMP76:%.*]] = sdiv i8 [[TMP75]], -1
	; FIXED-NEXT: [[TMP77:%.*]] = insertelement <16 x i8> [[TMP73]], i8 [[TMP76]], i32 14
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE28]]
	; FIXED: pred.sdiv.continue28:
	; FIXED-NEXT: [[TMP78:%.*]] = phi <16 x i8> [ [[TMP73]], [[PRED_SDIV_CONTINUE26]] ], [ [[TMP77]], [[PRED_SDIV_IF27]] ]
	; FIXED-NEXT: [[TMP79:%.*]] = extractelement <16 x i1> [[TMP3]], i32 15
	; FIXED-NEXT: br i1 [[TMP79]], label [[PRED_SDIV_IF29:%.*]], label [[PRED_SDIV_CONTINUE30]]
	; FIXED: pred.sdiv.if29:
	; FIXED-NEXT: [[TMP80:%.*]] = extractelement <16 x i8> [[WIDE_LOAD]], i32 15
	; FIXED-NEXT: [[TMP81:%.*]] = sdiv i8 [[TMP80]], -1
	; FIXED-NEXT: [[TMP82:%.*]] = insertelement <16 x i8> [[TMP78]], i8 [[TMP81]], i32 15
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE30]]
	; FIXED: pred.sdiv.continue30:
	; FIXED-NEXT: [[TMP83:%.*]] = phi <16 x i8> [ [[TMP78]], [[PRED_SDIV_CONTINUE28]] ], [ [[TMP82]], [[PRED_SDIV_IF29]] ]
	; FIXED-NEXT: [[TMP84:%.*]] = xor <16 x i1> [[TMP3]], <i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true>
	; FIXED-NEXT: [[PREDPHI:%.*]] = select <16 x i1> [[TMP3]], <16 x i8> [[TMP83]], <16 x i8> [[WIDE_LOAD]]
	; FIXED-NEXT: [[TMP85:%.*]] = getelementptr inbounds i8, ptr [[TMP1]], i32 0
	; FIXED-NEXT: store <16 x i8> [[PREDPHI]], ptr [[TMP85]], align 1
	; FIXED-NEXT: [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 16
	; FIXED-NEXT: [[TMP86:%.*]] = icmp eq i64 [[INDEX_NEXT]], 1024
	; FIXED-NEXT: br i1 [[TMP86]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP18:![0-9]+]]
	; FIXED: middle.block:			; FIXED: middle.block:
	; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024			; FIXED-NEXT: [[CMP_N:%.*]] = icmp eq i64 1024, 1024
	; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.]], label [[VEC_EPILOG_ITER_CHECK:%.]]			; FIXED-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.]], label [[VEC_EPILOG_ITER_CHECK:%.]]
	; FIXED: vec.epilog.iter.check:			; FIXED: vec.epilog.iter.check:
	; FIXED-NEXT: br i1 true, label [[VEC_EPILOG_SCALAR_PH]], label [[VEC_EPILOG_PH]]			; FIXED-NEXT: br i1 true, label [[VEC_EPILOG_SCALAR_PH]], label [[VEC_EPILOG_PH]]
	; FIXED: vec.epilog.ph:			; FIXED: vec.epilog.ph:
	; FIXED-NEXT: [[VEC_EPILOG_RESUME_VAL:%.*]] = phi i64 [ 1024, [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[VECTOR_MAIN_LOOP_ITER_CHECK]] ]			; FIXED-NEXT: [[VEC_EPILOG_RESUME_VAL:%.*]] = phi i64 [ 1024, [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[VECTOR_MAIN_LOOP_ITER_CHECK]] ]
	; FIXED-NEXT: br label [[VEC_EPILOG_VECTOR_BODY:%.*]]			; FIXED-NEXT: br label [[VEC_EPILOG_VECTOR_BODY:%.*]]
	; FIXED: vec.epilog.vector.body:			; FIXED: vec.epilog.vector.body:
	; FIXED-NEXT: [[OFFSET_IDX:%.]] = phi i64 [ [[VEC_EPILOG_RESUME_VAL]], [[VEC_EPILOG_PH]] ], [ [[INDEX_NEXT51:%.]], [[PRED_SDIV_CONTINUE49:%.*]] ]			; FIXED-NEXT: [[OFFSET_IDX:%.]] = phi i64 [ [[VEC_EPILOG_RESUME_VAL]], [[VEC_EPILOG_PH]] ], [ [[INDEX_NEXT7:%.]], [[VEC_EPILOG_VECTOR_BODY]] ]
	; FIXED-NEXT: [[TMP87:%.*]] = add i64 [[OFFSET_IDX]], 0			; FIXED-NEXT: [[TMP15:%.*]] = add i64 [[OFFSET_IDX]], 0
	; FIXED-NEXT: [[TMP88:%.*]] = getelementptr inbounds i8, ptr [[A]], i64 [[TMP87]]			; FIXED-NEXT: [[TMP16:%.*]] = getelementptr inbounds i8, ptr [[A]], i64 [[TMP15]]
	; FIXED-NEXT: [[TMP89:%.*]] = getelementptr inbounds i8, ptr [[TMP88]], i32 0			; FIXED-NEXT: [[TMP17:%.*]] = getelementptr inbounds i8, ptr [[TMP16]], i32 0
	; FIXED-NEXT: [[WIDE_LOAD33:%.*]] = load <8 x i8>, ptr [[TMP89]], align 1			; FIXED-NEXT: [[WIDE_LOAD5:%.*]] = load <8 x i8>, ptr [[TMP17]], align 1
	; FIXED-NEXT: [[TMP90:%.*]] = icmp ne <8 x i8> [[WIDE_LOAD33]], <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>			; FIXED-NEXT: [[TMP18:%.*]] = icmp ne <8 x i8> [[WIDE_LOAD5]], <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
	; FIXED-NEXT: [[TMP91:%.*]] = extractelement <8 x i1> [[TMP90]], i32 0			; FIXED-NEXT: [[TMP19:%.*]] = select <8 x i1> [[TMP18]], <8 x i8> <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>, <8 x i8> <i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1, i8 1>
	; FIXED-NEXT: br i1 [[TMP91]], label [[PRED_SDIV_IF34:%.]], label [[PRED_SDIV_CONTINUE35:%.]]			; FIXED-NEXT: [[TMP20:%.*]] = sdiv <8 x i8> [[WIDE_LOAD5]], [[TMP19]]
	; FIXED: pred.sdiv.if34:			; FIXED-NEXT: [[TMP21:%.*]] = xor <8 x i1> [[TMP18]], <i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true>
	; FIXED-NEXT: [[TMP92:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 0			; FIXED-NEXT: [[PREDPHI6:%.*]] = select <8 x i1> [[TMP18]], <8 x i8> [[TMP20]], <8 x i8> [[WIDE_LOAD5]]
	; FIXED-NEXT: [[TMP93:%.*]] = sdiv i8 [[TMP92]], -1			; FIXED-NEXT: store <8 x i8> [[PREDPHI6]], ptr [[TMP17]], align 1
	; FIXED-NEXT: [[TMP94:%.*]] = insertelement <8 x i8> poison, i8 [[TMP93]], i32 0			; FIXED-NEXT: [[INDEX_NEXT7]] = add nuw i64 [[OFFSET_IDX]], 8
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE35]]			; FIXED-NEXT: [[TMP22:%.*]] = icmp eq i64 [[INDEX_NEXT7]], 1024
	; FIXED: pred.sdiv.continue35:			; FIXED-NEXT: br i1 [[TMP22]], label [[VEC_EPILOG_MIDDLE_BLOCK:%.*]], label [[VEC_EPILOG_VECTOR_BODY]], !llvm.loop [[LOOP19:![0-9]+]]
	; FIXED-NEXT: [[TMP95:%.*]] = phi <8 x i8> [ poison, [[VEC_EPILOG_VECTOR_BODY]] ], [ [[TMP94]], [[PRED_SDIV_IF34]] ]
	; FIXED-NEXT: [[TMP96:%.*]] = extractelement <8 x i1> [[TMP90]], i32 1
	; FIXED-NEXT: br i1 [[TMP96]], label [[PRED_SDIV_IF36:%.]], label [[PRED_SDIV_CONTINUE37:%.]]
	; FIXED: pred.sdiv.if36:
	; FIXED-NEXT: [[TMP97:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 1
	; FIXED-NEXT: [[TMP98:%.*]] = sdiv i8 [[TMP97]], -1
	; FIXED-NEXT: [[TMP99:%.*]] = insertelement <8 x i8> [[TMP95]], i8 [[TMP98]], i32 1
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE37]]
	; FIXED: pred.sdiv.continue37:
	; FIXED-NEXT: [[TMP100:%.*]] = phi <8 x i8> [ [[TMP95]], [[PRED_SDIV_CONTINUE35]] ], [ [[TMP99]], [[PRED_SDIV_IF36]] ]
	; FIXED-NEXT: [[TMP101:%.*]] = extractelement <8 x i1> [[TMP90]], i32 2
	; FIXED-NEXT: br i1 [[TMP101]], label [[PRED_SDIV_IF38:%.]], label [[PRED_SDIV_CONTINUE39:%.]]
	; FIXED: pred.sdiv.if38:
	; FIXED-NEXT: [[TMP102:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 2
	; FIXED-NEXT: [[TMP103:%.*]] = sdiv i8 [[TMP102]], -1
	; FIXED-NEXT: [[TMP104:%.*]] = insertelement <8 x i8> [[TMP100]], i8 [[TMP103]], i32 2
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE39]]
	; FIXED: pred.sdiv.continue39:
	; FIXED-NEXT: [[TMP105:%.*]] = phi <8 x i8> [ [[TMP100]], [[PRED_SDIV_CONTINUE37]] ], [ [[TMP104]], [[PRED_SDIV_IF38]] ]
	; FIXED-NEXT: [[TMP106:%.*]] = extractelement <8 x i1> [[TMP90]], i32 3
	; FIXED-NEXT: br i1 [[TMP106]], label [[PRED_SDIV_IF40:%.]], label [[PRED_SDIV_CONTINUE41:%.]]
	; FIXED: pred.sdiv.if40:
	; FIXED-NEXT: [[TMP107:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 3
	; FIXED-NEXT: [[TMP108:%.*]] = sdiv i8 [[TMP107]], -1
	; FIXED-NEXT: [[TMP109:%.*]] = insertelement <8 x i8> [[TMP105]], i8 [[TMP108]], i32 3
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE41]]
	; FIXED: pred.sdiv.continue41:
	; FIXED-NEXT: [[TMP110:%.*]] = phi <8 x i8> [ [[TMP105]], [[PRED_SDIV_CONTINUE39]] ], [ [[TMP109]], [[PRED_SDIV_IF40]] ]
	; FIXED-NEXT: [[TMP111:%.*]] = extractelement <8 x i1> [[TMP90]], i32 4
	; FIXED-NEXT: br i1 [[TMP111]], label [[PRED_SDIV_IF42:%.]], label [[PRED_SDIV_CONTINUE43:%.]]
	; FIXED: pred.sdiv.if42:
	; FIXED-NEXT: [[TMP112:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 4
	; FIXED-NEXT: [[TMP113:%.*]] = sdiv i8 [[TMP112]], -1
	; FIXED-NEXT: [[TMP114:%.*]] = insertelement <8 x i8> [[TMP110]], i8 [[TMP113]], i32 4
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE43]]
	; FIXED: pred.sdiv.continue43:
	; FIXED-NEXT: [[TMP115:%.*]] = phi <8 x i8> [ [[TMP110]], [[PRED_SDIV_CONTINUE41]] ], [ [[TMP114]], [[PRED_SDIV_IF42]] ]
	; FIXED-NEXT: [[TMP116:%.*]] = extractelement <8 x i1> [[TMP90]], i32 5
	; FIXED-NEXT: br i1 [[TMP116]], label [[PRED_SDIV_IF44:%.]], label [[PRED_SDIV_CONTINUE45:%.]]
	; FIXED: pred.sdiv.if44:
	; FIXED-NEXT: [[TMP117:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 5
	; FIXED-NEXT: [[TMP118:%.*]] = sdiv i8 [[TMP117]], -1
	; FIXED-NEXT: [[TMP119:%.*]] = insertelement <8 x i8> [[TMP115]], i8 [[TMP118]], i32 5
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE45]]
	; FIXED: pred.sdiv.continue45:
	; FIXED-NEXT: [[TMP120:%.*]] = phi <8 x i8> [ [[TMP115]], [[PRED_SDIV_CONTINUE43]] ], [ [[TMP119]], [[PRED_SDIV_IF44]] ]
	; FIXED-NEXT: [[TMP121:%.*]] = extractelement <8 x i1> [[TMP90]], i32 6
	; FIXED-NEXT: br i1 [[TMP121]], label [[PRED_SDIV_IF46:%.]], label [[PRED_SDIV_CONTINUE47:%.]]
	; FIXED: pred.sdiv.if46:
	; FIXED-NEXT: [[TMP122:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 6
	; FIXED-NEXT: [[TMP123:%.*]] = sdiv i8 [[TMP122]], -1
	; FIXED-NEXT: [[TMP124:%.*]] = insertelement <8 x i8> [[TMP120]], i8 [[TMP123]], i32 6
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE47]]
	; FIXED: pred.sdiv.continue47:
	; FIXED-NEXT: [[TMP125:%.*]] = phi <8 x i8> [ [[TMP120]], [[PRED_SDIV_CONTINUE45]] ], [ [[TMP124]], [[PRED_SDIV_IF46]] ]
	; FIXED-NEXT: [[TMP126:%.*]] = extractelement <8 x i1> [[TMP90]], i32 7
	; FIXED-NEXT: br i1 [[TMP126]], label [[PRED_SDIV_IF48:%.*]], label [[PRED_SDIV_CONTINUE49]]
	; FIXED: pred.sdiv.if48:
	; FIXED-NEXT: [[TMP127:%.*]] = extractelement <8 x i8> [[WIDE_LOAD33]], i32 7
	; FIXED-NEXT: [[TMP128:%.*]] = sdiv i8 [[TMP127]], -1
	; FIXED-NEXT: [[TMP129:%.*]] = insertelement <8 x i8> [[TMP125]], i8 [[TMP128]], i32 7
	; FIXED-NEXT: br label [[PRED_SDIV_CONTINUE49]]
	; FIXED: pred.sdiv.continue49:
	; FIXED-NEXT: [[TMP130:%.*]] = phi <8 x i8> [ [[TMP125]], [[PRED_SDIV_CONTINUE47]] ], [ [[TMP129]], [[PRED_SDIV_IF48]] ]
	; FIXED-NEXT: [[TMP131:%.*]] = xor <8 x i1> [[TMP90]], <i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true, i1 true>
	; FIXED-NEXT: [[PREDPHI50:%.*]] = select <8 x i1> [[TMP90]], <8 x i8> [[TMP130]], <8 x i8> [[WIDE_LOAD33]]
	; FIXED-NEXT: [[TMP132:%.*]] = getelementptr inbounds i8, ptr [[TMP88]], i32 0
	; FIXED-NEXT: store <8 x i8> [[PREDPHI50]], ptr [[TMP132]], align 1
	; FIXED-NEXT: [[INDEX_NEXT51]] = add nuw i64 [[OFFSET_IDX]], 8
	; FIXED-NEXT: [[TMP133:%.*]] = icmp eq i64 [[INDEX_NEXT51]], 1024
	; FIXED-NEXT: br i1 [[TMP133]], label [[VEC_EPILOG_MIDDLE_BLOCK:%.*]], label [[VEC_EPILOG_VECTOR_BODY]], !llvm.loop [[LOOP19:![0-9]+]]
	; FIXED: vec.epilog.middle.block:			; FIXED: vec.epilog.middle.block:
	; FIXED-NEXT: [[CMP_N31:%.*]] = icmp eq i64 1024, 1024			; FIXED-NEXT: [[CMP_N3:%.*]] = icmp eq i64 1024, 1024
	; FIXED-NEXT: br i1 [[CMP_N31]], label [[FOR_END]], label [[VEC_EPILOG_SCALAR_PH]]			; FIXED-NEXT: br i1 [[CMP_N3]], label [[FOR_END]], label [[VEC_EPILOG_SCALAR_PH]]
	; FIXED: vec.epilog.scalar.ph:			; FIXED: vec.epilog.scalar.ph:
	; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[VEC_EPILOG_MIDDLE_BLOCK]] ], [ 1024, [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[ITER_CHECK:%.]] ]			; FIXED-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ 1024, [[VEC_EPILOG_MIDDLE_BLOCK]] ], [ 1024, [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[ITER_CHECK:%.]] ]
	; FIXED-NEXT: br label [[FOR_BODY:%.*]]			; FIXED-NEXT: br label [[FOR_BODY:%.*]]
	; FIXED: for.body:			; FIXED: for.body:
	; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[VEC_EPILOG_SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]			; FIXED-NEXT: [[IV:%.]] = phi i64 [ [[BC_RESUME_VAL]], [[VEC_EPILOG_SCALAR_PH]] ], [ [[IV_NEXT:%.]], [[LATCH:%.*]] ]
	; FIXED-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i8, ptr [[A]], i64 [[IV]]			; FIXED-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i8, ptr [[A]], i64 [[IV]]
	; FIXED-NEXT: [[ELEM:%.*]] = load i8, ptr [[ARRAYIDX]], align 1			; FIXED-NEXT: [[ELEM:%.*]] = load i8, ptr [[ARRAYIDX]], align 1
	; FIXED-NEXT: [[C:%.*]] = icmp ne i8 [[ELEM]], -128			; FIXED-NEXT: [[C:%.*]] = icmp ne i8 [[ELEM]], -128
	Show All 35 Lines

llvm/test/Transforms/LoopVectorize/first-order-recurrence-sink-replicate-region.ll

	; REQUIRES: asserts			; REQUIRES: asserts
	; RUN: opt < %s -passes=loop-vectorize -force-vector-width=2 -force-vector-interleave=1 -disable-output -debug-only=loop-vectorize 2>&1 \| FileCheck %s			; RUN: opt < %s -passes=loop-vectorize -force-vector-width=2 -force-vector-interleave=1 -force-widen-divrem-via-safe-divisor=0 -disable-output -debug-only=loop-vectorize 2>&1 \| FileCheck %s

	target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"

	; Test cases for PR50009, which require sinking a replicate-region due to a			; Test cases for PR50009, which require sinking a replicate-region due to a
	; first-order recurrence.			; first-order recurrence.

	define void @sink_replicate_region_1(i32 %x, i8* %ptr, i32* noalias %dst) optsize {			define void @sink_replicate_region_1(i32 %x, i8* %ptr, i32* noalias %dst) optsize {
	; CHECK-LABEL: sink_replicate_region_1			; CHECK-LABEL: sink_replicate_region_1
	▲ Show 20 Lines • Show All 517 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/first-order-recurrence.ll

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=2 -S \| FileCheck %s --check-prefix=UNROLL-NO-IC			; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=2 -force-widen-divrem-via-safe-divisor=0 -S \| FileCheck %s --check-prefix=UNROLL-NO-IC
	; RUN: opt < %s -loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -S \| FileCheck %s --check-prefix=UNROLL-NO-VF			; RUN: opt < %s -loop-vectorize -force-vector-width=1 -force-vector-interleave=2 -force-widen-divrem-via-safe-divisor=0 -S \| FileCheck %s --check-prefix=UNROLL-NO-VF
	; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=1 -S \| FileCheck %s --check-prefix=SINK-AFTER			; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=1 -force-widen-divrem-via-safe-divisor=0 -S \| FileCheck %s --check-prefix=SINK-AFTER

	target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"

	; void recurrence_1(int a, int b, int n) {			; void recurrence_1(int a, int b, int n) {
	; for(int i = 0; i < n; i++)			; for(int i = 0; i < n; i++)
	; b[i] = a[i] + a[i - 1]			; b[i] = a[i] + a[i - 1]
	; }			; }
	;			;
	▲ Show 20 Lines • Show All 3,711 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/if-pred-non-void.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -force-vector-width=2 -force-vector-interleave=1 -loop-vectorize -verify-loop-info -simplifycfg -simplifycfg-require-and-preserve-domtree=1 < %s \| FileCheck %s			; RUN: opt -S -force-vector-width=2 -force-vector-interleave=1 -loop-vectorize -verify-loop-info -simplifycfg -simplifycfg-require-and-preserve-domtree=1 -force-widen-divrem-via-safe-divisor=0 < %s \| FileCheck %s
	; RUN: opt -S -force-vector-width=1 -force-vector-interleave=2 -loop-vectorize -verify-loop-info < %s \| FileCheck %s --check-prefix=UNROLL-NO-VF			; RUN: opt -S -force-vector-width=1 -force-vector-interleave=2 -loop-vectorize -verify-loop-info -force-widen-divrem-via-safe-divisor=0 < %s \| FileCheck %s --check-prefix=UNROLL-NO-VF

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	; Test predication of non-void instructions, specifically (i) that these			; Test predication of non-void instructions, specifically (i) that these
	; instructions permit vectorization and (ii) the creation of an insertelement			; instructions permit vectorization and (ii) the creation of an insertelement
	; and a Phi node. We check the full 2-element sequence for all predicate instructions.			; and a Phi node. We check the full 2-element sequence for all predicate instructions.
	define void @test(i32* nocapture %asd, i32* nocapture %aud,			define void @test(i32* nocapture %asd, i32* nocapture %aud,
	; CHECK-LABEL: @test(			; CHECK-LABEL: @test(
	▲ Show 20 Lines • Show All 998 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/induction.ll

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 2,025 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[SMAX]], 2			; CHECK-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[SMAX]], 2
	; CHECK-NEXT: [[N_VEC:%.*]] = sub i32 [[SMAX]], [[N_MOD_VF]]			; CHECK-NEXT: [[N_VEC:%.*]] = sub i32 [[SMAX]], [[N_MOD_VF]]
	; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i32 0			; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i32 0
	; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer			; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE2:%.*]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP16:%.]], [[PRED_UDIV_CONTINUE2]] ]			; CHECK-NEXT: [[VEC_IND:%.]] = phi <2 x i32> [ <i32 0, i32 1>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
				; CHECK-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP7:%.]], [[VECTOR_BODY]] ]
				fhahnUnsubmitted Not Done Reply Inline Actions I think the division here is used intentionally to force uses of an IV in predicated blocks and with this change we are now not checking for that scenario any longer. This should probably use `-force-widen-divrem-via-safe-divisor=0` to preserve the original behavior. fhahn: I think the division here is used intentionally to force uses of an IV in predicated blocks and…
	; CHECK-NEXT: [[TMP0:%.*]] = add i32 [[INDEX]], 0			; CHECK-NEXT: [[TMP0:%.*]] = add i32 [[INDEX]], 0
	; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i32 [[TMP0]]			; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i32 [[TMP0]]
	; CHECK-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[TMP1]], i32 0			; CHECK-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[TMP1]], i32 0
	; CHECK-NEXT: [[TMP3:%.]] = bitcast i32 [[TMP2]] to <2 x i32>*			; CHECK-NEXT: [[TMP3:%.]] = bitcast i32 [[TMP2]] to <2 x i32>*
	; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP3]], align 4			; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP3]], align 4
	; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT]], i32 0			; CHECK-NEXT: [[TMP4:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[VEC_IND]], <2 x i32> <i32 1, i32 1>
	; CHECK-NEXT: br i1 [[TMP4]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; CHECK-NEXT: [[TMP5:%.*]] = udiv <2 x i32> [[WIDE_LOAD]], [[TMP4]]
	; CHECK: pred.udiv.if:			; CHECK-NEXT: [[TMP6:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT]], <i1 true, i1 true>
	; CHECK-NEXT: [[TMP5:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i32 0			; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP6]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP5]]
	; CHECK-NEXT: [[TMP6:%.*]] = udiv i32 [[TMP5]], [[TMP0]]			; CHECK-NEXT: [[TMP7]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; CHECK-NEXT: [[TMP7:%.*]] = insertelement <2 x i32> poison, i32 [[TMP6]], i32 0
	; CHECK-NEXT: br label [[PRED_UDIV_CONTINUE]]
	; CHECK: pred.udiv.continue:
	; CHECK-NEXT: [[TMP8:%.*]] = phi <2 x i32> [ poison, [[VECTOR_BODY]] ], [ [[TMP7]], [[PRED_UDIV_IF]] ]
	; CHECK-NEXT: [[TMP9:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT]], i32 1
	; CHECK-NEXT: br i1 [[TMP9]], label [[PRED_UDIV_IF1:%.*]], label [[PRED_UDIV_CONTINUE2]]
	; CHECK: pred.udiv.if1:
	; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[INDEX]], 1
	; CHECK-NEXT: [[TMP11:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i32 1
	; CHECK-NEXT: [[TMP12:%.*]] = udiv i32 [[TMP11]], [[TMP10]]
	; CHECK-NEXT: [[TMP13:%.*]] = insertelement <2 x i32> [[TMP8]], i32 [[TMP12]], i32 1
	; CHECK-NEXT: br label [[PRED_UDIV_CONTINUE2]]
	; CHECK: pred.udiv.continue2:
	; CHECK-NEXT: [[TMP14:%.*]] = phi <2 x i32> [ [[TMP8]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP13]], [[PRED_UDIV_IF1]] ]
	; CHECK-NEXT: [[TMP15:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT]], <i1 true, i1 true>
	; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP15]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP14]]
	; CHECK-NEXT: [[TMP16]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2			; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2
	; CHECK-NEXT: [[TMP17:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; CHECK-NEXT: [[VEC_IND_NEXT]] = add <2 x i32> [[VEC_IND]], <i32 2, i32 2>
	; CHECK-NEXT: br i1 [[TMP17]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]			; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
				; CHECK-NEXT: br i1 [[TMP8]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: [[TMP18:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[TMP16]])			; CHECK-NEXT: [[TMP9:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[TMP7]])
	; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]			; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]
	; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; CHECK: scalar.ph:			; CHECK: scalar.ph:
	; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP18]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP9]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; CHECK-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]			; CHECK-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]
	; CHECK-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[I]]			; CHECK-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[I]]
	; CHECK-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4			; CHECK-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4
	; CHECK-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; CHECK-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; CHECK: if.then:			; CHECK: if.then:
	; CHECK-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]			; CHECK-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]
	; CHECK-NEXT: br label [[IF_END]]			; CHECK-NEXT: br label [[IF_END]]
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]			; CHECK-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]
	; CHECK-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]			; CHECK-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]
	; CHECK-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1			; CHECK-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1
	; CHECK-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]			; CHECK-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]
	; CHECK-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]			; CHECK-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]
	; CHECK: for.end:			; CHECK: for.end:
	; CHECK-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP18]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP9]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: ret i32 [[VAR5]]			; CHECK-NEXT: ret i32 [[VAR5]]
	;			;
	; IND-LABEL: @scalarize_induction_variable_05(			; IND-LABEL: @scalarize_induction_variable_05(
	; IND-NEXT: entry:			; IND-NEXT: entry:
	; IND-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)			; IND-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)
	; IND-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 2			; IND-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 2
	; IND-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; IND-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; IND: vector.ph:			; IND: vector.ph:
	; IND-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483646			; IND-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483646
	; IND-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i64 0			; IND-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i64 0
				; IND-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer
	; IND-NEXT: br label [[VECTOR_BODY:%.*]]			; IND-NEXT: br label [[VECTOR_BODY:%.*]]
	; IND: vector.body:			; IND: vector.body:
	; IND-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE2:%.*]] ]			; IND-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; IND-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP14:%.]], [[PRED_UDIV_CONTINUE2]] ]			; IND-NEXT: [[VEC_IND:%.]] = phi <2 x i32> [ <i32 0, i32 1>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
				; IND-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP5:%.]], [[VECTOR_BODY]] ]
	; IND-NEXT: [[TMP0:%.*]] = sext i32 [[INDEX]] to i64			; IND-NEXT: [[TMP0:%.*]] = sext i32 [[INDEX]] to i64
	; IND-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP0]]			; IND-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP0]]
	; IND-NEXT: [[TMP2:%.]] = bitcast i32 [[TMP1]] to <2 x i32>*			; IND-NEXT: [[TMP2:%.]] = bitcast i32 [[TMP1]] to <2 x i32>*
	; IND-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP2]], align 4			; IND-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP2]], align 4
	; IND-NEXT: br i1 [[C]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; IND-NEXT: [[TMP3:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[VEC_IND]], <2 x i32> <i32 1, i32 1>
	; IND: pred.udiv.if:			; IND-NEXT: [[TMP4:%.*]] = udiv <2 x i32> [[WIDE_LOAD]], [[TMP3]]
	; IND-NEXT: [[TMP3:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i64 0			; IND-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[TMP4]], <2 x i32> [[WIDE_LOAD]]
	; IND-NEXT: [[TMP4:%.*]] = udiv i32 [[TMP3]], [[INDEX]]			; IND-NEXT: [[TMP5]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; IND-NEXT: [[TMP5:%.*]] = insertelement <2 x i32> poison, i32 [[TMP4]], i64 0
	; IND-NEXT: br label [[PRED_UDIV_CONTINUE]]
	; IND: pred.udiv.continue:
	; IND-NEXT: [[TMP6:%.*]] = phi <2 x i32> [ poison, [[VECTOR_BODY]] ], [ [[TMP5]], [[PRED_UDIV_IF]] ]
	; IND-NEXT: br i1 [[C]], label [[PRED_UDIV_IF1:%.*]], label [[PRED_UDIV_CONTINUE2]]
	; IND: pred.udiv.if1:
	; IND-NEXT: [[TMP7:%.*]] = or i32 [[INDEX]], 1
	; IND-NEXT: [[TMP8:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i64 1
	; IND-NEXT: [[TMP9:%.*]] = udiv i32 [[TMP8]], [[TMP7]]
	; IND-NEXT: [[TMP10:%.*]] = insertelement <2 x i32> [[TMP6]], i32 [[TMP9]], i64 1
	; IND-NEXT: br label [[PRED_UDIV_CONTINUE2]]
	; IND: pred.udiv.continue2:
	; IND-NEXT: [[TMP11:%.*]] = phi <2 x i32> [ [[TMP6]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP10]], [[PRED_UDIV_IF1]] ]
	; IND-NEXT: [[TMP12:%.*]] = xor <2 x i1> [[BROADCAST_SPLATINSERT]], <i1 true, i1 poison>
	; IND-NEXT: [[TMP13:%.*]] = shufflevector <2 x i1> [[TMP12]], <2 x i1> poison, <2 x i32> zeroinitializer
	; IND-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP13]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP11]]
	; IND-NEXT: [[TMP14]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; IND-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2			; IND-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2
	; IND-NEXT: [[TMP15:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; IND-NEXT: [[VEC_IND_NEXT]] = add <2 x i32> [[VEC_IND]], <i32 2, i32 2>
	; IND-NEXT: br i1 [[TMP15]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]			; IND-NEXT: [[TMP6:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
				; IND-NEXT: br i1 [[TMP6]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]
	; IND: middle.block:			; IND: middle.block:
	; IND-NEXT: [[TMP16:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[TMP14]])			; IND-NEXT: [[TMP7:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[TMP5]])
	; IND-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]			; IND-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]
	; IND-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; IND-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; IND: scalar.ph:			; IND: scalar.ph:
	; IND-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; IND-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; IND-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP16]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]			; IND-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP7]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]
	; IND-NEXT: br label [[FOR_BODY:%.*]]			; IND-NEXT: br label [[FOR_BODY:%.*]]
	; IND: for.body:			; IND: for.body:
	; IND-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; IND-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; IND-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]			; IND-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]
	; IND-NEXT: [[TMP17:%.*]] = zext i32 [[I]] to i64			; IND-NEXT: [[TMP8:%.*]] = zext i32 [[I]] to i64
	; IND-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP17]]			; IND-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP8]]
	; IND-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4			; IND-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4
	; IND-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; IND-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; IND: if.then:			; IND: if.then:
	; IND-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]			; IND-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]
	; IND-NEXT: br label [[IF_END]]			; IND-NEXT: br label [[IF_END]]
	; IND: if.end:			; IND: if.end:
	; IND-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]			; IND-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]
	; IND-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]			; IND-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]
	; IND-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1			; IND-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1
	; IND-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]			; IND-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]
	; IND-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]			; IND-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]
	; IND: for.end:			; IND: for.end:
	; IND-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP16]], [[MIDDLE_BLOCK]] ]			; IND-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP7]], [[MIDDLE_BLOCK]] ]
	; IND-NEXT: ret i32 [[VAR5]]			; IND-NEXT: ret i32 [[VAR5]]
	;			;
	; UNROLL-LABEL: @scalarize_induction_variable_05(			; UNROLL-LABEL: @scalarize_induction_variable_05(
	; UNROLL-NEXT: entry:			; UNROLL-NEXT: entry:
	; UNROLL-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)			; UNROLL-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)
	; UNROLL-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 4			; UNROLL-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 4
	; UNROLL-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; UNROLL-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; UNROLL: vector.ph:			; UNROLL: vector.ph:
	; UNROLL-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483644			; UNROLL-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483644
	; UNROLL-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i64 0			; UNROLL-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i64 0
	; UNROLL-NEXT: [[BROADCAST_SPLATINSERT5:%.*]] = insertelement <2 x i1> poison, i1 [[C]], i64 0			; UNROLL-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer
				; UNROLL-NEXT: [[BROADCAST_SPLATINSERT4:%.*]] = insertelement <2 x i1> poison, i1 [[C]], i64 0
				; UNROLL-NEXT: [[BROADCAST_SPLAT5:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT4]], <2 x i1> poison, <2 x i32> zeroinitializer
	; UNROLL-NEXT: br label [[VECTOR_BODY:%.*]]			; UNROLL-NEXT: br label [[VECTOR_BODY:%.*]]
	; UNROLL: vector.body:			; UNROLL: vector.body:
	; UNROLL-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE10:%.*]] ]			; UNROLL-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP28:%.]], [[PRED_UDIV_CONTINUE10]] ]			; UNROLL-NEXT: [[VEC_IND:%.]] = phi <2 x i32> [ <i32 0, i32 1>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP29:%.]], [[PRED_UDIV_CONTINUE10]] ]			; UNROLL-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP9:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[TMP0:%.*]] = or i32 [[INDEX]], 2			; UNROLL-NEXT: [[VEC_PHI2:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP10:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[TMP1:%.*]] = sext i32 [[INDEX]] to i64			; UNROLL-NEXT: [[STEP_ADD:%.*]] = add <2 x i32> [[VEC_IND]], <i32 2, i32 2>
	; UNROLL-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP1]]			; UNROLL-NEXT: [[TMP0:%.*]] = sext i32 [[INDEX]] to i64
	; UNROLL-NEXT: [[TMP3:%.]] = bitcast i32 [[TMP2]] to <2 x i32>*			; UNROLL-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP0]]
	; UNROLL-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP3]], align 4			; UNROLL-NEXT: [[TMP2:%.]] = bitcast i32 [[TMP1]] to <2 x i32>*
	; UNROLL-NEXT: [[TMP4:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i64 2			; UNROLL-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP2]], align 4
	; UNROLL-NEXT: [[TMP5:%.]] = bitcast i32 [[TMP4]] to <2 x i32>*			; UNROLL-NEXT: [[TMP3:%.]] = getelementptr inbounds i32, i32 [[TMP1]], i64 2
	; UNROLL-NEXT: [[WIDE_LOAD2:%.]] = load <2 x i32>, <2 x i32> [[TMP5]], align 4			; UNROLL-NEXT: [[TMP4:%.]] = bitcast i32 [[TMP3]] to <2 x i32>*
	; UNROLL-NEXT: br i1 [[C]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; UNROLL-NEXT: [[WIDE_LOAD3:%.]] = load <2 x i32>, <2 x i32> [[TMP4]], align 4
	; UNROLL: pred.udiv.if:			; UNROLL-NEXT: [[TMP5:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[VEC_IND]], <2 x i32> <i32 1, i32 1>
	; UNROLL-NEXT: [[TMP6:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i64 0			; UNROLL-NEXT: [[TMP6:%.*]] = select <2 x i1> [[BROADCAST_SPLAT5]], <2 x i32> [[STEP_ADD]], <2 x i32> <i32 1, i32 1>
	; UNROLL-NEXT: [[TMP7:%.*]] = udiv i32 [[TMP6]], [[INDEX]]			; UNROLL-NEXT: [[TMP7:%.*]] = udiv <2 x i32> [[WIDE_LOAD]], [[TMP5]]
	; UNROLL-NEXT: [[TMP8:%.*]] = insertelement <2 x i32> poison, i32 [[TMP7]], i64 0			; UNROLL-NEXT: [[TMP8:%.*]] = udiv <2 x i32> [[WIDE_LOAD3]], [[TMP6]]
	; UNROLL-NEXT: br label [[PRED_UDIV_CONTINUE]]			; UNROLL-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[TMP7]], <2 x i32> [[WIDE_LOAD]]
	; UNROLL: pred.udiv.continue:			; UNROLL-NEXT: [[PREDPHI6:%.*]] = select <2 x i1> [[BROADCAST_SPLAT5]], <2 x i32> [[TMP8]], <2 x i32> [[WIDE_LOAD3]]
	; UNROLL-NEXT: [[TMP9:%.*]] = phi <2 x i32> [ poison, [[VECTOR_BODY]] ], [ [[TMP8]], [[PRED_UDIV_IF]] ]			; UNROLL-NEXT: [[TMP9]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; UNROLL-NEXT: br i1 [[C]], label [[PRED_UDIV_IF3:%.]], label [[PRED_UDIV_CONTINUE4:%.]]			; UNROLL-NEXT: [[TMP10]] = add <2 x i32> [[PREDPHI6]], [[VEC_PHI2]]
	; UNROLL: pred.udiv.if3:
	; UNROLL-NEXT: [[TMP10:%.*]] = or i32 [[INDEX]], 1
	; UNROLL-NEXT: [[TMP11:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i64 1
	; UNROLL-NEXT: [[TMP12:%.*]] = udiv i32 [[TMP11]], [[TMP10]]
	; UNROLL-NEXT: [[TMP13:%.*]] = insertelement <2 x i32> [[TMP9]], i32 [[TMP12]], i64 1
	; UNROLL-NEXT: br label [[PRED_UDIV_CONTINUE4]]
	; UNROLL: pred.udiv.continue4:
	; UNROLL-NEXT: [[TMP14:%.*]] = phi <2 x i32> [ [[TMP9]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP13]], [[PRED_UDIV_IF3]] ]
	; UNROLL-NEXT: br i1 [[C]], label [[PRED_UDIV_IF7:%.]], label [[PRED_UDIV_CONTINUE8:%.]]
	; UNROLL: pred.udiv.if7:
	; UNROLL-NEXT: [[TMP15:%.*]] = extractelement <2 x i32> [[WIDE_LOAD2]], i64 0
	; UNROLL-NEXT: [[TMP16:%.*]] = udiv i32 [[TMP15]], [[TMP0]]
	; UNROLL-NEXT: [[TMP17:%.*]] = insertelement <2 x i32> poison, i32 [[TMP16]], i64 0
	; UNROLL-NEXT: br label [[PRED_UDIV_CONTINUE8]]
	; UNROLL: pred.udiv.continue8:
	; UNROLL-NEXT: [[TMP18:%.*]] = phi <2 x i32> [ poison, [[PRED_UDIV_CONTINUE4]] ], [ [[TMP17]], [[PRED_UDIV_IF7]] ]
	; UNROLL-NEXT: br i1 [[C]], label [[PRED_UDIV_IF9:%.*]], label [[PRED_UDIV_CONTINUE10]]
	; UNROLL: pred.udiv.if9:
	; UNROLL-NEXT: [[TMP19:%.*]] = or i32 [[INDEX]], 3
	; UNROLL-NEXT: [[TMP20:%.*]] = extractelement <2 x i32> [[WIDE_LOAD2]], i64 1
	; UNROLL-NEXT: [[TMP21:%.*]] = udiv i32 [[TMP20]], [[TMP19]]
	; UNROLL-NEXT: [[TMP22:%.*]] = insertelement <2 x i32> [[TMP18]], i32 [[TMP21]], i64 1
	; UNROLL-NEXT: br label [[PRED_UDIV_CONTINUE10]]
	; UNROLL: pred.udiv.continue10:
	; UNROLL-NEXT: [[TMP23:%.*]] = phi <2 x i32> [ [[TMP18]], [[PRED_UDIV_CONTINUE8]] ], [ [[TMP22]], [[PRED_UDIV_IF9]] ]
	; UNROLL-NEXT: [[TMP24:%.*]] = xor <2 x i1> [[BROADCAST_SPLATINSERT]], <i1 true, i1 poison>
	; UNROLL-NEXT: [[TMP25:%.*]] = shufflevector <2 x i1> [[TMP24]], <2 x i1> poison, <2 x i32> zeroinitializer
	; UNROLL-NEXT: [[TMP26:%.*]] = xor <2 x i1> [[BROADCAST_SPLATINSERT5]], <i1 true, i1 poison>
	; UNROLL-NEXT: [[TMP27:%.*]] = shufflevector <2 x i1> [[TMP26]], <2 x i1> poison, <2 x i32> zeroinitializer
	; UNROLL-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP25]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP14]]
	; UNROLL-NEXT: [[PREDPHI11:%.*]] = select <2 x i1> [[TMP27]], <2 x i32> [[WIDE_LOAD2]], <2 x i32> [[TMP23]]
	; UNROLL-NEXT: [[TMP28]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; UNROLL-NEXT: [[TMP29]] = add <2 x i32> [[PREDPHI11]], [[VEC_PHI1]]
	; UNROLL-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4			; UNROLL-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4
	; UNROLL-NEXT: [[TMP30:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; UNROLL-NEXT: [[VEC_IND_NEXT]] = add <2 x i32> [[VEC_IND]], <i32 4, i32 4>
	; UNROLL-NEXT: br i1 [[TMP30]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]			; UNROLL-NEXT: [[TMP11:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
				; UNROLL-NEXT: br i1 [[TMP11]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]
	; UNROLL: middle.block:			; UNROLL: middle.block:
	; UNROLL-NEXT: [[BIN_RDX:%.*]] = add <2 x i32> [[TMP29]], [[TMP28]]			; UNROLL-NEXT: [[BIN_RDX:%.*]] = add <2 x i32> [[TMP10]], [[TMP9]]
	; UNROLL-NEXT: [[TMP31:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[BIN_RDX]])			; UNROLL-NEXT: [[TMP12:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[BIN_RDX]])
	; UNROLL-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]			; UNROLL-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]
	; UNROLL-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; UNROLL-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; UNROLL: scalar.ph:			; UNROLL: scalar.ph:
	; UNROLL-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; UNROLL-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; UNROLL-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP31]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]			; UNROLL-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP12]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]
	; UNROLL-NEXT: br label [[FOR_BODY:%.*]]			; UNROLL-NEXT: br label [[FOR_BODY:%.*]]
	; UNROLL: for.body:			; UNROLL: for.body:
	; UNROLL-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; UNROLL-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; UNROLL-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]			; UNROLL-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]
	; UNROLL-NEXT: [[TMP32:%.*]] = zext i32 [[I]] to i64			; UNROLL-NEXT: [[TMP13:%.*]] = zext i32 [[I]] to i64
	; UNROLL-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP32]]			; UNROLL-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP13]]
	; UNROLL-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4			; UNROLL-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4
	; UNROLL-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; UNROLL-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; UNROLL: if.then:			; UNROLL: if.then:
	; UNROLL-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]			; UNROLL-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]
	; UNROLL-NEXT: br label [[IF_END]]			; UNROLL-NEXT: br label [[IF_END]]
	; UNROLL: if.end:			; UNROLL: if.end:
	; UNROLL-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]			; UNROLL-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]
	; UNROLL-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]			; UNROLL-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]
	; UNROLL-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1			; UNROLL-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1
	; UNROLL-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]			; UNROLL-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]
	; UNROLL-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]			; UNROLL-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]
	; UNROLL: for.end:			; UNROLL: for.end:
	; UNROLL-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP31]], [[MIDDLE_BLOCK]] ]			; UNROLL-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP12]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NEXT: ret i32 [[VAR5]]			; UNROLL-NEXT: ret i32 [[VAR5]]
	;			;
	; UNROLL-NO-IC-LABEL: @scalarize_induction_variable_05(			; UNROLL-NO-IC-LABEL: @scalarize_induction_variable_05(
	; UNROLL-NO-IC-NEXT: entry:			; UNROLL-NO-IC-NEXT: entry:
	; UNROLL-NO-IC-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)			; UNROLL-NO-IC-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)
	; UNROLL-NO-IC-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 4			; UNROLL-NO-IC-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 4
	; UNROLL-NO-IC-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; UNROLL-NO-IC-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; UNROLL-NO-IC: vector.ph:			; UNROLL-NO-IC: vector.ph:
	; UNROLL-NO-IC-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[SMAX]], 4			; UNROLL-NO-IC-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[SMAX]], 4
	; UNROLL-NO-IC-NEXT: [[N_VEC:%.*]] = sub i32 [[SMAX]], [[N_MOD_VF]]			; UNROLL-NO-IC-NEXT: [[N_VEC:%.*]] = sub i32 [[SMAX]], [[N_MOD_VF]]
	; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i32 0			; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i1> poison, i1 [[C:%.]], i32 0
	; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer			; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT]], <2 x i1> poison, <2 x i32> zeroinitializer
	; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT5:%.*]] = insertelement <2 x i1> poison, i1 [[C]], i32 0			; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT4:%.*]] = insertelement <2 x i1> poison, i1 [[C]], i32 0
	; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT6:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT5]], <2 x i1> poison, <2 x i32> zeroinitializer			; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT5:%.*]] = shufflevector <2 x i1> [[BROADCAST_SPLATINSERT4]], <2 x i1> poison, <2 x i32> zeroinitializer
	; UNROLL-NO-IC-NEXT: br label [[VECTOR_BODY:%.*]]			; UNROLL-NO-IC-NEXT: br label [[VECTOR_BODY:%.*]]
	; UNROLL-NO-IC: vector.body:			; UNROLL-NO-IC: vector.body:
	; UNROLL-NO-IC-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE10:%.*]] ]			; UNROLL-NO-IC-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP32:%.]], [[PRED_UDIV_CONTINUE10]] ]			; UNROLL-NO-IC-NEXT: [[VEC_IND:%.]] = phi <2 x i32> [ <i32 0, i32 1>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP33:%.]], [[PRED_UDIV_CONTINUE10]] ]			; UNROLL-NO-IC-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP14:%.]], [[VECTOR_BODY]] ]
				; UNROLL-NO-IC-NEXT: [[VEC_PHI2:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP15:%.]], [[VECTOR_BODY]] ]
				; UNROLL-NO-IC-NEXT: [[STEP_ADD:%.*]] = add <2 x i32> [[VEC_IND]], <i32 2, i32 2>
	; UNROLL-NO-IC-NEXT: [[TMP0:%.*]] = add i32 [[INDEX]], 0			; UNROLL-NO-IC-NEXT: [[TMP0:%.*]] = add i32 [[INDEX]], 0
	; UNROLL-NO-IC-NEXT: [[TMP1:%.*]] = add i32 [[INDEX]], 2			; UNROLL-NO-IC-NEXT: [[TMP1:%.*]] = add i32 [[INDEX]], 2
	; UNROLL-NO-IC-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i32 [[TMP0]]			; UNROLL-NO-IC-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i32 [[TMP0]]
	; UNROLL-NO-IC-NEXT: [[TMP3:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[TMP1]]			; UNROLL-NO-IC-NEXT: [[TMP3:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[TMP1]]
	; UNROLL-NO-IC-NEXT: [[TMP4:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP4:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i32 0
	; UNROLL-NO-IC-NEXT: [[TMP5:%.]] = bitcast i32 [[TMP4]] to <2 x i32>*			; UNROLL-NO-IC-NEXT: [[TMP5:%.]] = bitcast i32 [[TMP4]] to <2 x i32>*
	; UNROLL-NO-IC-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP5]], align 4			; UNROLL-NO-IC-NEXT: [[WIDE_LOAD:%.]] = load <2 x i32>, <2 x i32> [[TMP5]], align 4
	; UNROLL-NO-IC-NEXT: [[TMP6:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i32 2			; UNROLL-NO-IC-NEXT: [[TMP6:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i32 2
	; UNROLL-NO-IC-NEXT: [[TMP7:%.]] = bitcast i32 [[TMP6]] to <2 x i32>*			; UNROLL-NO-IC-NEXT: [[TMP7:%.]] = bitcast i32 [[TMP6]] to <2 x i32>*
	; UNROLL-NO-IC-NEXT: [[WIDE_LOAD2:%.]] = load <2 x i32>, <2 x i32> [[TMP7]], align 4			; UNROLL-NO-IC-NEXT: [[WIDE_LOAD3:%.]] = load <2 x i32>, <2 x i32> [[TMP7]], align 4
	; UNROLL-NO-IC-NEXT: [[TMP8:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP8:%.*]] = select <2 x i1> [[BROADCAST_SPLAT]], <2 x i32> [[VEC_IND]], <2 x i32> <i32 1, i32 1>
	; UNROLL-NO-IC-NEXT: br i1 [[TMP8]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; UNROLL-NO-IC-NEXT: [[TMP9:%.*]] = select <2 x i1> [[BROADCAST_SPLAT5]], <2 x i32> [[STEP_ADD]], <2 x i32> <i32 1, i32 1>
	; UNROLL-NO-IC: pred.udiv.if:			; UNROLL-NO-IC-NEXT: [[TMP10:%.*]] = udiv <2 x i32> [[WIDE_LOAD]], [[TMP8]]
	; UNROLL-NO-IC-NEXT: [[TMP9:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP11:%.*]] = udiv <2 x i32> [[WIDE_LOAD3]], [[TMP9]]
	; UNROLL-NO-IC-NEXT: [[TMP10:%.*]] = udiv i32 [[TMP9]], [[TMP0]]			; UNROLL-NO-IC-NEXT: [[TMP12:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: [[TMP11:%.*]] = insertelement <2 x i32> poison, i32 [[TMP10]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP13:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT5]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: br label [[PRED_UDIV_CONTINUE]]			; UNROLL-NO-IC-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP12]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP10]]
	; UNROLL-NO-IC: pred.udiv.continue:			; UNROLL-NO-IC-NEXT: [[PREDPHI6:%.*]] = select <2 x i1> [[TMP13]], <2 x i32> [[WIDE_LOAD3]], <2 x i32> [[TMP11]]
	; UNROLL-NO-IC-NEXT: [[TMP12:%.*]] = phi <2 x i32> [ poison, [[VECTOR_BODY]] ], [ [[TMP11]], [[PRED_UDIV_IF]] ]			; UNROLL-NO-IC-NEXT: [[TMP14]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; UNROLL-NO-IC-NEXT: [[TMP13:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT]], i32 1			; UNROLL-NO-IC-NEXT: [[TMP15]] = add <2 x i32> [[PREDPHI6]], [[VEC_PHI2]]
	; UNROLL-NO-IC-NEXT: br i1 [[TMP13]], label [[PRED_UDIV_IF3:%.]], label [[PRED_UDIV_CONTINUE4:%.]]
	; UNROLL-NO-IC: pred.udiv.if3:
	; UNROLL-NO-IC-NEXT: [[TMP14:%.*]] = add i32 [[INDEX]], 1
	; UNROLL-NO-IC-NEXT: [[TMP15:%.*]] = extractelement <2 x i32> [[WIDE_LOAD]], i32 1
	; UNROLL-NO-IC-NEXT: [[TMP16:%.*]] = udiv i32 [[TMP15]], [[TMP14]]
	; UNROLL-NO-IC-NEXT: [[TMP17:%.*]] = insertelement <2 x i32> [[TMP12]], i32 [[TMP16]], i32 1
	; UNROLL-NO-IC-NEXT: br label [[PRED_UDIV_CONTINUE4]]
	; UNROLL-NO-IC: pred.udiv.continue4:
	; UNROLL-NO-IC-NEXT: [[TMP18:%.*]] = phi <2 x i32> [ [[TMP12]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP17]], [[PRED_UDIV_IF3]] ]
	; UNROLL-NO-IC-NEXT: [[TMP19:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT6]], i32 0
	; UNROLL-NO-IC-NEXT: br i1 [[TMP19]], label [[PRED_UDIV_IF7:%.]], label [[PRED_UDIV_CONTINUE8:%.]]
	; UNROLL-NO-IC: pred.udiv.if7:
	; UNROLL-NO-IC-NEXT: [[TMP20:%.*]] = extractelement <2 x i32> [[WIDE_LOAD2]], i32 0
	; UNROLL-NO-IC-NEXT: [[TMP21:%.*]] = udiv i32 [[TMP20]], [[TMP1]]
	; UNROLL-NO-IC-NEXT: [[TMP22:%.*]] = insertelement <2 x i32> poison, i32 [[TMP21]], i32 0
	; UNROLL-NO-IC-NEXT: br label [[PRED_UDIV_CONTINUE8]]
	; UNROLL-NO-IC: pred.udiv.continue8:
	; UNROLL-NO-IC-NEXT: [[TMP23:%.*]] = phi <2 x i32> [ poison, [[PRED_UDIV_CONTINUE4]] ], [ [[TMP22]], [[PRED_UDIV_IF7]] ]
	; UNROLL-NO-IC-NEXT: [[TMP24:%.*]] = extractelement <2 x i1> [[BROADCAST_SPLAT6]], i32 1
	; UNROLL-NO-IC-NEXT: br i1 [[TMP24]], label [[PRED_UDIV_IF9:%.*]], label [[PRED_UDIV_CONTINUE10]]
	; UNROLL-NO-IC: pred.udiv.if9:
	; UNROLL-NO-IC-NEXT: [[TMP25:%.*]] = add i32 [[INDEX]], 3
	; UNROLL-NO-IC-NEXT: [[TMP26:%.*]] = extractelement <2 x i32> [[WIDE_LOAD2]], i32 1
	; UNROLL-NO-IC-NEXT: [[TMP27:%.*]] = udiv i32 [[TMP26]], [[TMP25]]
	; UNROLL-NO-IC-NEXT: [[TMP28:%.*]] = insertelement <2 x i32> [[TMP23]], i32 [[TMP27]], i32 1
	; UNROLL-NO-IC-NEXT: br label [[PRED_UDIV_CONTINUE10]]
	; UNROLL-NO-IC: pred.udiv.continue10:
	; UNROLL-NO-IC-NEXT: [[TMP29:%.*]] = phi <2 x i32> [ [[TMP23]], [[PRED_UDIV_CONTINUE8]] ], [ [[TMP28]], [[PRED_UDIV_IF9]] ]
	; UNROLL-NO-IC-NEXT: [[TMP30:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: [[TMP31:%.*]] = xor <2 x i1> [[BROADCAST_SPLAT6]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP30]], <2 x i32> [[WIDE_LOAD]], <2 x i32> [[TMP18]]
	; UNROLL-NO-IC-NEXT: [[PREDPHI11:%.*]] = select <2 x i1> [[TMP31]], <2 x i32> [[WIDE_LOAD2]], <2 x i32> [[TMP29]]
	; UNROLL-NO-IC-NEXT: [[TMP32]] = add <2 x i32> [[PREDPHI]], [[VEC_PHI]]
	; UNROLL-NO-IC-NEXT: [[TMP33]] = add <2 x i32> [[PREDPHI11]], [[VEC_PHI1]]
	; UNROLL-NO-IC-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4			; UNROLL-NO-IC-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4
	; UNROLL-NO-IC-NEXT: [[TMP34:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; UNROLL-NO-IC-NEXT: [[VEC_IND_NEXT]] = add <2 x i32> [[STEP_ADD]], <i32 2, i32 2>
	; UNROLL-NO-IC-NEXT: br i1 [[TMP34]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]			; UNROLL-NO-IC-NEXT: [[TMP16:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
				; UNROLL-NO-IC-NEXT: br i1 [[TMP16]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]
	; UNROLL-NO-IC: middle.block:			; UNROLL-NO-IC: middle.block:
	; UNROLL-NO-IC-NEXT: [[BIN_RDX:%.*]] = add <2 x i32> [[TMP33]], [[TMP32]]			; UNROLL-NO-IC-NEXT: [[BIN_RDX:%.*]] = add <2 x i32> [[TMP15]], [[TMP14]]
	; UNROLL-NO-IC-NEXT: [[TMP35:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[BIN_RDX]])			; UNROLL-NO-IC-NEXT: [[TMP17:%.*]] = call i32 @llvm.vector.reduce.add.v2i32(<2 x i32> [[BIN_RDX]])
	; UNROLL-NO-IC-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]			; UNROLL-NO-IC-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]
	; UNROLL-NO-IC-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; UNROLL-NO-IC-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; UNROLL-NO-IC: scalar.ph:			; UNROLL-NO-IC: scalar.ph:
	; UNROLL-NO-IC-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; UNROLL-NO-IC-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; UNROLL-NO-IC-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP35]], [[MIDDLE_BLOCK]] ]			; UNROLL-NO-IC-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP17]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NO-IC-NEXT: br label [[FOR_BODY:%.*]]			; UNROLL-NO-IC-NEXT: br label [[FOR_BODY:%.*]]
	; UNROLL-NO-IC: for.body:			; UNROLL-NO-IC: for.body:
	; UNROLL-NO-IC-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; UNROLL-NO-IC-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; UNROLL-NO-IC-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]			; UNROLL-NO-IC-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]
	; UNROLL-NO-IC-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[I]]			; UNROLL-NO-IC-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i32 [[I]]
	; UNROLL-NO-IC-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4			; UNROLL-NO-IC-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4
	; UNROLL-NO-IC-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; UNROLL-NO-IC-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; UNROLL-NO-IC: if.then:			; UNROLL-NO-IC: if.then:
	; UNROLL-NO-IC-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]			; UNROLL-NO-IC-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]
	; UNROLL-NO-IC-NEXT: br label [[IF_END]]			; UNROLL-NO-IC-NEXT: br label [[IF_END]]
	; UNROLL-NO-IC: if.end:			; UNROLL-NO-IC: if.end:
	; UNROLL-NO-IC-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]			; UNROLL-NO-IC-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]			; UNROLL-NO-IC-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]
	; UNROLL-NO-IC-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1			; UNROLL-NO-IC-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1
	; UNROLL-NO-IC-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]			; UNROLL-NO-IC-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]
	; UNROLL-NO-IC-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]			; UNROLL-NO-IC-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]
	; UNROLL-NO-IC: for.end:			; UNROLL-NO-IC: for.end:
	; UNROLL-NO-IC-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP35]], [[MIDDLE_BLOCK]] ]			; UNROLL-NO-IC-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP17]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NO-IC-NEXT: ret i32 [[VAR5]]			; UNROLL-NO-IC-NEXT: ret i32 [[VAR5]]
	;			;
	; INTERLEAVE-LABEL: @scalarize_induction_variable_05(			; INTERLEAVE-LABEL: @scalarize_induction_variable_05(
	; INTERLEAVE-NEXT: entry:			; INTERLEAVE-NEXT: entry:
	; INTERLEAVE-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)			; INTERLEAVE-NEXT: [[SMAX:%.]] = call i32 @llvm.smax.i32(i32 [[N:%.]], i32 1)
	; INTERLEAVE-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 8			; INTERLEAVE-NEXT: [[MIN_ITERS_CHECK:%.*]] = icmp ult i32 [[SMAX]], 8
	; INTERLEAVE-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; INTERLEAVE-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; INTERLEAVE: vector.ph:			; INTERLEAVE: vector.ph:
	; INTERLEAVE-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483640			; INTERLEAVE-NEXT: [[N_VEC:%.*]] = and i32 [[SMAX]], 2147483640
	; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <4 x i1> poison, i1 [[C:%.]], i64 0			; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <4 x i1> poison, i1 [[C:%.]], i64 0
	; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT9:%.*]] = insertelement <4 x i1> poison, i1 [[C]], i64 0			; INTERLEAVE-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <4 x i1> [[BROADCAST_SPLATINSERT]], <4 x i1> poison, <4 x i32> zeroinitializer
				; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT4:%.*]] = insertelement <4 x i1> poison, i1 [[C]], i64 0
				; INTERLEAVE-NEXT: [[BROADCAST_SPLAT5:%.*]] = shufflevector <4 x i1> [[BROADCAST_SPLATINSERT4]], <4 x i1> poison, <4 x i32> zeroinitializer
	; INTERLEAVE-NEXT: br label [[VECTOR_BODY:%.*]]			; INTERLEAVE-NEXT: br label [[VECTOR_BODY:%.*]]
	; INTERLEAVE: vector.body:			; INTERLEAVE: vector.body:
	; INTERLEAVE-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UDIV_CONTINUE18:%.*]] ]			; INTERLEAVE-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP48:%.]], [[PRED_UDIV_CONTINUE18]] ]			; INTERLEAVE-NEXT: [[VEC_IND:%.]] = phi <4 x i32> [ <i32 0, i32 1, i32 2, i32 3>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VEC_PHI1:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP49:%.]], [[PRED_UDIV_CONTINUE18]] ]			; INTERLEAVE-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP9:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[TMP0:%.*]] = or i32 [[INDEX]], 4			; INTERLEAVE-NEXT: [[VEC_PHI2:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP10:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[TMP1:%.*]] = sext i32 [[INDEX]] to i64			; INTERLEAVE-NEXT: [[STEP_ADD:%.*]] = add <4 x i32> [[VEC_IND]], <i32 4, i32 4, i32 4, i32 4>
	; INTERLEAVE-NEXT: [[TMP2:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP1]]			; INTERLEAVE-NEXT: [[TMP0:%.*]] = sext i32 [[INDEX]] to i64
	; INTERLEAVE-NEXT: [[TMP3:%.]] = bitcast i32 [[TMP2]] to <4 x i32>*			; INTERLEAVE-NEXT: [[TMP1:%.]] = getelementptr inbounds i32, i32 [[A:%.*]], i64 [[TMP0]]
	; INTERLEAVE-NEXT: [[WIDE_LOAD:%.]] = load <4 x i32>, <4 x i32> [[TMP3]], align 4			; INTERLEAVE-NEXT: [[TMP2:%.]] = bitcast i32 [[TMP1]] to <4 x i32>*
	; INTERLEAVE-NEXT: [[TMP4:%.]] = getelementptr inbounds i32, i32 [[TMP2]], i64 4			; INTERLEAVE-NEXT: [[WIDE_LOAD:%.]] = load <4 x i32>, <4 x i32> [[TMP2]], align 4
	; INTERLEAVE-NEXT: [[TMP5:%.]] = bitcast i32 [[TMP4]] to <4 x i32>*			; INTERLEAVE-NEXT: [[TMP3:%.]] = getelementptr inbounds i32, i32 [[TMP1]], i64 4
	; INTERLEAVE-NEXT: [[WIDE_LOAD2:%.]] = load <4 x i32>, <4 x i32> [[TMP5]], align 4			; INTERLEAVE-NEXT: [[TMP4:%.]] = bitcast i32 [[TMP3]] to <4 x i32>*
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF:%.]], label [[PRED_UDIV_CONTINUE:%.]]			; INTERLEAVE-NEXT: [[WIDE_LOAD3:%.]] = load <4 x i32>, <4 x i32> [[TMP4]], align 4
	; INTERLEAVE: pred.udiv.if:			; INTERLEAVE-NEXT: [[TMP5:%.*]] = select <4 x i1> [[BROADCAST_SPLAT]], <4 x i32> [[VEC_IND]], <4 x i32> <i32 1, i32 1, i32 1, i32 1>
	; INTERLEAVE-NEXT: [[TMP6:%.*]] = extractelement <4 x i32> [[WIDE_LOAD]], i64 0			; INTERLEAVE-NEXT: [[TMP6:%.*]] = select <4 x i1> [[BROADCAST_SPLAT5]], <4 x i32> [[STEP_ADD]], <4 x i32> <i32 1, i32 1, i32 1, i32 1>
	; INTERLEAVE-NEXT: [[TMP7:%.*]] = udiv i32 [[TMP6]], [[INDEX]]			; INTERLEAVE-NEXT: [[TMP7:%.*]] = udiv <4 x i32> [[WIDE_LOAD]], [[TMP5]]
	; INTERLEAVE-NEXT: [[TMP8:%.*]] = insertelement <4 x i32> poison, i32 [[TMP7]], i64 0			; INTERLEAVE-NEXT: [[TMP8:%.*]] = udiv <4 x i32> [[WIDE_LOAD3]], [[TMP6]]
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE]]			; INTERLEAVE-NEXT: [[PREDPHI:%.*]] = select <4 x i1> [[BROADCAST_SPLAT]], <4 x i32> [[TMP7]], <4 x i32> [[WIDE_LOAD]]
	; INTERLEAVE: pred.udiv.continue:			; INTERLEAVE-NEXT: [[PREDPHI6:%.*]] = select <4 x i1> [[BROADCAST_SPLAT5]], <4 x i32> [[TMP8]], <4 x i32> [[WIDE_LOAD3]]
	; INTERLEAVE-NEXT: [[TMP9:%.*]] = phi <4 x i32> [ poison, [[VECTOR_BODY]] ], [ [[TMP8]], [[PRED_UDIV_IF]] ]			; INTERLEAVE-NEXT: [[TMP9]] = add <4 x i32> [[PREDPHI]], [[VEC_PHI]]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF3:%.]], label [[PRED_UDIV_CONTINUE4:%.]]			; INTERLEAVE-NEXT: [[TMP10]] = add <4 x i32> [[PREDPHI6]], [[VEC_PHI2]]
	; INTERLEAVE: pred.udiv.if3:
	; INTERLEAVE-NEXT: [[TMP10:%.*]] = or i32 [[INDEX]], 1
	; INTERLEAVE-NEXT: [[TMP11:%.*]] = extractelement <4 x i32> [[WIDE_LOAD]], i64 1
	; INTERLEAVE-NEXT: [[TMP12:%.*]] = udiv i32 [[TMP11]], [[TMP10]]
	; INTERLEAVE-NEXT: [[TMP13:%.*]] = insertelement <4 x i32> [[TMP9]], i32 [[TMP12]], i64 1
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE4]]
	; INTERLEAVE: pred.udiv.continue4:
	; INTERLEAVE-NEXT: [[TMP14:%.*]] = phi <4 x i32> [ [[TMP9]], [[PRED_UDIV_CONTINUE]] ], [ [[TMP13]], [[PRED_UDIV_IF3]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF5:%.]], label [[PRED_UDIV_CONTINUE6:%.]]
	; INTERLEAVE: pred.udiv.if5:
	; INTERLEAVE-NEXT: [[TMP15:%.*]] = or i32 [[INDEX]], 2
	; INTERLEAVE-NEXT: [[TMP16:%.*]] = extractelement <4 x i32> [[WIDE_LOAD]], i64 2
	; INTERLEAVE-NEXT: [[TMP17:%.*]] = udiv i32 [[TMP16]], [[TMP15]]
	; INTERLEAVE-NEXT: [[TMP18:%.*]] = insertelement <4 x i32> [[TMP14]], i32 [[TMP17]], i64 2
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE6]]
	; INTERLEAVE: pred.udiv.continue6:
	; INTERLEAVE-NEXT: [[TMP19:%.*]] = phi <4 x i32> [ [[TMP14]], [[PRED_UDIV_CONTINUE4]] ], [ [[TMP18]], [[PRED_UDIV_IF5]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF7:%.]], label [[PRED_UDIV_CONTINUE8:%.]]
	; INTERLEAVE: pred.udiv.if7:
	; INTERLEAVE-NEXT: [[TMP20:%.*]] = or i32 [[INDEX]], 3
	; INTERLEAVE-NEXT: [[TMP21:%.*]] = extractelement <4 x i32> [[WIDE_LOAD]], i64 3
	; INTERLEAVE-NEXT: [[TMP22:%.*]] = udiv i32 [[TMP21]], [[TMP20]]
	; INTERLEAVE-NEXT: [[TMP23:%.*]] = insertelement <4 x i32> [[TMP19]], i32 [[TMP22]], i64 3
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE8]]
	; INTERLEAVE: pred.udiv.continue8:
	; INTERLEAVE-NEXT: [[TMP24:%.*]] = phi <4 x i32> [ [[TMP19]], [[PRED_UDIV_CONTINUE6]] ], [ [[TMP23]], [[PRED_UDIV_IF7]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF11:%.]], label [[PRED_UDIV_CONTINUE12:%.]]
	; INTERLEAVE: pred.udiv.if11:
	; INTERLEAVE-NEXT: [[TMP25:%.*]] = extractelement <4 x i32> [[WIDE_LOAD2]], i64 0
	; INTERLEAVE-NEXT: [[TMP26:%.*]] = udiv i32 [[TMP25]], [[TMP0]]
	; INTERLEAVE-NEXT: [[TMP27:%.*]] = insertelement <4 x i32> poison, i32 [[TMP26]], i64 0
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE12]]
	; INTERLEAVE: pred.udiv.continue12:
	; INTERLEAVE-NEXT: [[TMP28:%.*]] = phi <4 x i32> [ poison, [[PRED_UDIV_CONTINUE8]] ], [ [[TMP27]], [[PRED_UDIV_IF11]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF13:%.]], label [[PRED_UDIV_CONTINUE14:%.]]
	; INTERLEAVE: pred.udiv.if13:
	; INTERLEAVE-NEXT: [[TMP29:%.*]] = or i32 [[INDEX]], 5
	; INTERLEAVE-NEXT: [[TMP30:%.*]] = extractelement <4 x i32> [[WIDE_LOAD2]], i64 1
	; INTERLEAVE-NEXT: [[TMP31:%.*]] = udiv i32 [[TMP30]], [[TMP29]]
	; INTERLEAVE-NEXT: [[TMP32:%.*]] = insertelement <4 x i32> [[TMP28]], i32 [[TMP31]], i64 1
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE14]]
	; INTERLEAVE: pred.udiv.continue14:
	; INTERLEAVE-NEXT: [[TMP33:%.*]] = phi <4 x i32> [ [[TMP28]], [[PRED_UDIV_CONTINUE12]] ], [ [[TMP32]], [[PRED_UDIV_IF13]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF15:%.]], label [[PRED_UDIV_CONTINUE16:%.]]
	; INTERLEAVE: pred.udiv.if15:
	; INTERLEAVE-NEXT: [[TMP34:%.*]] = or i32 [[INDEX]], 6
	; INTERLEAVE-NEXT: [[TMP35:%.*]] = extractelement <4 x i32> [[WIDE_LOAD2]], i64 2
	; INTERLEAVE-NEXT: [[TMP36:%.*]] = udiv i32 [[TMP35]], [[TMP34]]
	; INTERLEAVE-NEXT: [[TMP37:%.*]] = insertelement <4 x i32> [[TMP33]], i32 [[TMP36]], i64 2
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE16]]
	; INTERLEAVE: pred.udiv.continue16:
	; INTERLEAVE-NEXT: [[TMP38:%.*]] = phi <4 x i32> [ [[TMP33]], [[PRED_UDIV_CONTINUE14]] ], [ [[TMP37]], [[PRED_UDIV_IF15]] ]
	; INTERLEAVE-NEXT: br i1 [[C]], label [[PRED_UDIV_IF17:%.*]], label [[PRED_UDIV_CONTINUE18]]
	; INTERLEAVE: pred.udiv.if17:
	; INTERLEAVE-NEXT: [[TMP39:%.*]] = or i32 [[INDEX]], 7
	; INTERLEAVE-NEXT: [[TMP40:%.*]] = extractelement <4 x i32> [[WIDE_LOAD2]], i64 3
	; INTERLEAVE-NEXT: [[TMP41:%.*]] = udiv i32 [[TMP40]], [[TMP39]]
	; INTERLEAVE-NEXT: [[TMP42:%.*]] = insertelement <4 x i32> [[TMP38]], i32 [[TMP41]], i64 3
	; INTERLEAVE-NEXT: br label [[PRED_UDIV_CONTINUE18]]
	; INTERLEAVE: pred.udiv.continue18:
	; INTERLEAVE-NEXT: [[TMP43:%.*]] = phi <4 x i32> [ [[TMP38]], [[PRED_UDIV_CONTINUE16]] ], [ [[TMP42]], [[PRED_UDIV_IF17]] ]
	; INTERLEAVE-NEXT: [[TMP44:%.*]] = xor <4 x i1> [[BROADCAST_SPLATINSERT]], <i1 true, i1 poison, i1 poison, i1 poison>
	; INTERLEAVE-NEXT: [[TMP45:%.*]] = shufflevector <4 x i1> [[TMP44]], <4 x i1> poison, <4 x i32> zeroinitializer
	; INTERLEAVE-NEXT: [[TMP46:%.*]] = xor <4 x i1> [[BROADCAST_SPLATINSERT9]], <i1 true, i1 poison, i1 poison, i1 poison>
	; INTERLEAVE-NEXT: [[TMP47:%.*]] = shufflevector <4 x i1> [[TMP46]], <4 x i1> poison, <4 x i32> zeroinitializer
	; INTERLEAVE-NEXT: [[PREDPHI:%.*]] = select <4 x i1> [[TMP45]], <4 x i32> [[WIDE_LOAD]], <4 x i32> [[TMP24]]
	; INTERLEAVE-NEXT: [[PREDPHI19:%.*]] = select <4 x i1> [[TMP47]], <4 x i32> [[WIDE_LOAD2]], <4 x i32> [[TMP43]]
	; INTERLEAVE-NEXT: [[TMP48]] = add <4 x i32> [[PREDPHI]], [[VEC_PHI]]
	; INTERLEAVE-NEXT: [[TMP49]] = add <4 x i32> [[PREDPHI19]], [[VEC_PHI1]]
	; INTERLEAVE-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 8			; INTERLEAVE-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 8
	; INTERLEAVE-NEXT: [[TMP50:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; INTERLEAVE-NEXT: [[VEC_IND_NEXT]] = add <4 x i32> [[VEC_IND]], <i32 8, i32 8, i32 8, i32 8>
	; INTERLEAVE-NEXT: br i1 [[TMP50]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]			; INTERLEAVE-NEXT: [[TMP11:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
				; INTERLEAVE-NEXT: br i1 [[TMP11]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP24:![0-9]+]]
	; INTERLEAVE: middle.block:			; INTERLEAVE: middle.block:
	; INTERLEAVE-NEXT: [[BIN_RDX:%.*]] = add <4 x i32> [[TMP49]], [[TMP48]]			; INTERLEAVE-NEXT: [[BIN_RDX:%.*]] = add <4 x i32> [[TMP10]], [[TMP9]]
	; INTERLEAVE-NEXT: [[TMP51:%.*]] = call i32 @llvm.vector.reduce.add.v4i32(<4 x i32> [[BIN_RDX]])			; INTERLEAVE-NEXT: [[TMP12:%.*]] = call i32 @llvm.vector.reduce.add.v4i32(<4 x i32> [[BIN_RDX]])
	; INTERLEAVE-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]			; INTERLEAVE-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[SMAX]], [[N_VEC]]
	; INTERLEAVE-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; INTERLEAVE-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; INTERLEAVE: scalar.ph:			; INTERLEAVE: scalar.ph:
	; INTERLEAVE-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; INTERLEAVE-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; INTERLEAVE-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP51]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]			; INTERLEAVE-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP12]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY]] ]
	; INTERLEAVE-NEXT: br label [[FOR_BODY:%.*]]			; INTERLEAVE-NEXT: br label [[FOR_BODY:%.*]]
	; INTERLEAVE: for.body:			; INTERLEAVE: for.body:
	; INTERLEAVE-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; INTERLEAVE-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; INTERLEAVE-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]			; INTERLEAVE-NEXT: [[SUM:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR4:%.]], [[IF_END]] ]
	; INTERLEAVE-NEXT: [[TMP52:%.*]] = zext i32 [[I]] to i64			; INTERLEAVE-NEXT: [[TMP13:%.*]] = zext i32 [[I]] to i64
	; INTERLEAVE-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP52]]			; INTERLEAVE-NEXT: [[VAR0:%.]] = getelementptr inbounds i32, i32 [[A]], i64 [[TMP13]]
	; INTERLEAVE-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4			; INTERLEAVE-NEXT: [[VAR1:%.]] = load i32, i32 [[VAR0]], align 4
	; INTERLEAVE-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; INTERLEAVE-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; INTERLEAVE: if.then:			; INTERLEAVE: if.then:
	; INTERLEAVE-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]			; INTERLEAVE-NEXT: [[VAR2:%.*]] = udiv i32 [[VAR1]], [[I]]
	; INTERLEAVE-NEXT: br label [[IF_END]]			; INTERLEAVE-NEXT: br label [[IF_END]]
	; INTERLEAVE: if.end:			; INTERLEAVE: if.end:
	; INTERLEAVE-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]			; INTERLEAVE-NEXT: [[VAR3:%.*]] = phi i32 [ [[VAR2]], [[IF_THEN]] ], [ [[VAR1]], [[FOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]			; INTERLEAVE-NEXT: [[VAR4]] = add i32 [[VAR3]], [[SUM]]
	; INTERLEAVE-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1			; INTERLEAVE-NEXT: [[I_NEXT]] = add nuw nsw i32 [[I]], 1
	; INTERLEAVE-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]			; INTERLEAVE-NEXT: [[COND:%.*]] = icmp slt i32 [[I_NEXT]], [[N]]
	; INTERLEAVE-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]			; INTERLEAVE-NEXT: br i1 [[COND]], label [[FOR_BODY]], label [[FOR_END]], !llvm.loop [[LOOP25:![0-9]+]]
	; INTERLEAVE: for.end:			; INTERLEAVE: for.end:
	; INTERLEAVE-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP51]], [[MIDDLE_BLOCK]] ]			; INTERLEAVE-NEXT: [[VAR5:%.*]] = phi i32 [ [[VAR4]], [[IF_END]] ], [ [[TMP12]], [[MIDDLE_BLOCK]] ]
	; INTERLEAVE-NEXT: ret i32 [[VAR5]]			; INTERLEAVE-NEXT: ret i32 [[VAR5]]
	;			;
	entry:			entry:
	br label %for.body			br label %for.body

	for.body:			for.body:
	%i = phi i32 [ 0, %entry ], [ %i.next, %if.end ]			%i = phi i32 [ 0, %entry ], [ %i.next, %if.end ]
	%sum = phi i32 [ 0, %entry ], [ %var4, %if.end ]			%sum = phi i32 [ 0, %entry ], [ %var4, %if.end ]
	▲ Show 20 Lines • Show All 2,755 Lines • ▼ Show 20 Lines
	;			;
	;			;
	define i32 @PR32419(i32 %a, i16 %b) {			define i32 @PR32419(i32 %a, i16 %b) {
	; CHECK-LABEL: @PR32419(			; CHECK-LABEL: @PR32419(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: [[TMP0:%.]] = insertelement <2 x i32> zeroinitializer, i32 [[A:%.]], i32 0			; CHECK-NEXT: [[TMP0:%.]] = insertelement <2 x i32> zeroinitializer, i32 [[A:%.]], i32 0
				; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[B:%.]], i32 0
				; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UREM_CONTINUE2:%.*]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP15:%.]], [[PRED_UREM_CONTINUE2]] ]			; CHECK-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP6:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[PRED_UREM_CONTINUE2]] ]			; CHECK-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[OFFSET_IDX:%.*]] = add i32 -20, [[INDEX]]			; CHECK-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer
	; CHECK-NEXT: [[TMP1:%.*]] = trunc i32 [[OFFSET_IDX]] to i16			; CHECK-NEXT: [[TMP2:%.*]] = xor <2 x i1> [[TMP1]], <i1 true, i1 true>
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer			; CHECK-NEXT: [[TMP3:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> [[VEC_IND]], <2 x i16> <i16 1, i16 1>
	; CHECK-NEXT: [[TMP3:%.*]] = xor <2 x i1> [[TMP2]], <i1 true, i1 true>			; CHECK-NEXT: [[TMP4:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT]], [[TMP3]]
	; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x i1> [[TMP3]], i32 0			; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> zeroinitializer, <2 x i16> [[TMP4]]
	; CHECK-NEXT: br i1 [[TMP4]], label [[PRED_UREM_IF:%.]], label [[PRED_UREM_CONTINUE:%.]]			; CHECK-NEXT: [[TMP5:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; CHECK: pred.urem.if:			; CHECK-NEXT: [[TMP6]] = or <2 x i32> [[VEC_PHI]], [[TMP5]]
	; CHECK-NEXT: [[TMP5:%.*]] = add i16 [[TMP1]], 0
	; CHECK-NEXT: [[TMP6:%.]] = urem i16 [[B:%.]], [[TMP5]]
	; CHECK-NEXT: [[TMP7:%.*]] = insertelement <2 x i16> poison, i16 [[TMP6]], i32 0
	; CHECK-NEXT: br label [[PRED_UREM_CONTINUE]]
	; CHECK: pred.urem.continue:
	; CHECK-NEXT: [[TMP8:%.*]] = phi <2 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP7]], [[PRED_UREM_IF]] ]
	; CHECK-NEXT: [[TMP9:%.*]] = extractelement <2 x i1> [[TMP3]], i32 1
	; CHECK-NEXT: br i1 [[TMP9]], label [[PRED_UREM_IF1:%.*]], label [[PRED_UREM_CONTINUE2]]
	; CHECK: pred.urem.if1:
	; CHECK-NEXT: [[TMP10:%.*]] = add i16 [[TMP1]], 1
	; CHECK-NEXT: [[TMP11:%.*]] = urem i16 [[B]], [[TMP10]]
	; CHECK-NEXT: [[TMP12:%.*]] = insertelement <2 x i16> [[TMP8]], i16 [[TMP11]], i32 1
	; CHECK-NEXT: br label [[PRED_UREM_CONTINUE2]]
	; CHECK: pred.urem.continue2:
	; CHECK-NEXT: [[TMP13:%.*]] = phi <2 x i16> [ [[TMP8]], [[PRED_UREM_CONTINUE]] ], [ [[TMP12]], [[PRED_UREM_IF1]] ]
	; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP13]]
	; CHECK-NEXT: [[TMP14:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; CHECK-NEXT: [[TMP15]] = or <2 x i32> [[VEC_PHI]], [[TMP14]]
	; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2			; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2
	; CHECK-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>			; CHECK-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>
	; CHECK-NEXT: [[TMP16:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20
	; CHECK-NEXT: br i1 [[TMP16]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]			; CHECK-NEXT: br i1 [[TMP7]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: [[TMP17:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[TMP15]])			; CHECK-NEXT: [[TMP8:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[TMP6]])
	; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 20, 20			; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 20, 20
	; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; CHECK: scalar.ph:			; CHECK: scalar.ph:
	; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ 0, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]			; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ 0, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]
	; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[A]], [[ENTRY]] ], [ [[TMP17]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[A]], [[ENTRY]] ], [ [[TMP8]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]			; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]
	; CHECK-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]			; CHECK-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]
	; CHECK-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16			; CHECK-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16
	; CHECK-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0			; CHECK-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0
	; CHECK-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]			; CHECK-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]
	; CHECK: for.cond:			; CHECK: for.cond:
	; CHECK-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]			; CHECK-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]
	; CHECK-NEXT: br label [[FOR_INC]]			; CHECK-NEXT: br label [[FOR_INC]]
	; CHECK: for.inc:			; CHECK: for.inc:
	; CHECK-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]			; CHECK-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]
	; CHECK-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32			; CHECK-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32
	; CHECK-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]			; CHECK-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]
	; CHECK-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1			; CHECK-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1
	; CHECK-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0			; CHECK-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0
	; CHECK-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]			; CHECK-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]
	; CHECK: for.end:			; CHECK: for.end:
	; CHECK-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP17]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP8]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: ret i32 [[VAR7]]			; CHECK-NEXT: ret i32 [[VAR7]]
	;			;
	; IND-LABEL: @PR32419(			; IND-LABEL: @PR32419(
	; IND-NEXT: entry:			; IND-NEXT: entry:
	; IND-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; IND-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; IND: vector.ph:			; IND: vector.ph:
	; IND-NEXT: [[TMP0:%.]] = insertelement <2 x i32> <i32 poison, i32 0>, i32 [[A:%.]], i64 0			; IND-NEXT: [[TMP0:%.]] = insertelement <2 x i32> <i32 poison, i32 0>, i32 [[A:%.]], i64 0
				; IND-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[B:%.]], i64 0
				; IND-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer
	; IND-NEXT: br label [[VECTOR_BODY:%.*]]			; IND-NEXT: br label [[VECTOR_BODY:%.*]]
	; IND: vector.body:			; IND: vector.body:
	; IND-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UREM_CONTINUE2:%.*]] ]			; IND-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; IND-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP15:%.]], [[PRED_UREM_CONTINUE2]] ]			; IND-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP5:%.]], [[VECTOR_BODY]] ]
	; IND-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[PRED_UREM_CONTINUE2]] ]			; IND-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; IND-NEXT: [[TMP1:%.*]] = trunc i32 [[INDEX]] to i16			; IND-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer
	; IND-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer			; IND-NEXT: [[TMP2:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> <i16 1, i16 1>, <2 x i16> [[VEC_IND]]
	; IND-NEXT: [[TMP3:%.*]] = xor <2 x i1> [[TMP2]], <i1 true, i1 true>			; IND-NEXT: [[TMP3:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT]], [[TMP2]]
	; IND-NEXT: [[TMP4:%.*]] = extractelement <2 x i1> [[TMP3]], i64 0			; IND-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> zeroinitializer, <2 x i16> [[TMP3]]
	; IND-NEXT: br i1 [[TMP4]], label [[PRED_UREM_IF:%.]], label [[PRED_UREM_CONTINUE:%.]]			; IND-NEXT: [[TMP4:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; IND: pred.urem.if:			; IND-NEXT: [[TMP5]] = or <2 x i32> [[VEC_PHI]], [[TMP4]]
	; IND-NEXT: [[TMP5:%.*]] = add i16 [[TMP1]], -20
	; IND-NEXT: [[TMP6:%.]] = urem i16 [[B:%.]], [[TMP5]]
	; IND-NEXT: [[TMP7:%.*]] = insertelement <2 x i16> poison, i16 [[TMP6]], i64 0
	; IND-NEXT: br label [[PRED_UREM_CONTINUE]]
	; IND: pred.urem.continue:
	; IND-NEXT: [[TMP8:%.*]] = phi <2 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP7]], [[PRED_UREM_IF]] ]
	; IND-NEXT: [[TMP9:%.*]] = extractelement <2 x i1> [[TMP3]], i64 1
	; IND-NEXT: br i1 [[TMP9]], label [[PRED_UREM_IF1:%.*]], label [[PRED_UREM_CONTINUE2]]
	; IND: pred.urem.if1:
	; IND-NEXT: [[TMP10:%.*]] = add i16 [[TMP1]], -19
	; IND-NEXT: [[TMP11:%.*]] = urem i16 [[B]], [[TMP10]]
	; IND-NEXT: [[TMP12:%.*]] = insertelement <2 x i16> [[TMP8]], i16 [[TMP11]], i64 1
	; IND-NEXT: br label [[PRED_UREM_CONTINUE2]]
	; IND: pred.urem.continue2:
	; IND-NEXT: [[TMP13:%.*]] = phi <2 x i16> [ [[TMP8]], [[PRED_UREM_CONTINUE]] ], [ [[TMP12]], [[PRED_UREM_IF1]] ]
	; IND-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP13]]
	; IND-NEXT: [[TMP14:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; IND-NEXT: [[TMP15]] = or <2 x i32> [[VEC_PHI]], [[TMP14]]
	; IND-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2			; IND-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2
	; IND-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>			; IND-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>
	; IND-NEXT: [[TMP16:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20			; IND-NEXT: [[TMP6:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20
	; IND-NEXT: br i1 [[TMP16]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]			; IND-NEXT: br i1 [[TMP6]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]
	; IND: middle.block:			; IND: middle.block:
	; IND-NEXT: [[TMP17:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[TMP15]])			; IND-NEXT: [[TMP7:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[TMP5]])
	; IND-NEXT: br i1 true, label [[FOR_END:%.*]], label [[SCALAR_PH]]			; IND-NEXT: br i1 true, label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; IND: scalar.ph:			; IND: scalar.ph:
	; IND-NEXT: br label [[FOR_BODY:%.*]]			; IND-NEXT: br label [[FOR_BODY:%.*]]
	; IND: for.body:			; IND: for.body:
	; IND-NEXT: br i1 poison, label [[FOR_INC:%.]], label [[FOR_COND:%.]]			; IND-NEXT: br i1 poison, label [[FOR_INC:%.]], label [[FOR_COND:%.]]
	; IND: for.cond:			; IND: for.cond:
	; IND-NEXT: br label [[FOR_INC]]			; IND-NEXT: br label [[FOR_INC]]
	; IND: for.inc:			; IND: for.inc:
	; IND-NEXT: br i1 poison, label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]			; IND-NEXT: br i1 poison, label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]
	; IND: for.end:			; IND: for.end:
	; IND-NEXT: [[VAR7:%.*]] = phi i32 [ poison, [[FOR_INC]] ], [ [[TMP17]], [[MIDDLE_BLOCK]] ]			; IND-NEXT: [[VAR7:%.*]] = phi i32 [ poison, [[FOR_INC]] ], [ [[TMP7]], [[MIDDLE_BLOCK]] ]
	; IND-NEXT: ret i32 [[VAR7]]			; IND-NEXT: ret i32 [[VAR7]]
	;			;
	; UNROLL-LABEL: @PR32419(			; UNROLL-LABEL: @PR32419(
	; UNROLL-NEXT: entry:			; UNROLL-NEXT: entry:
	; UNROLL-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; UNROLL-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; UNROLL: vector.ph:			; UNROLL: vector.ph:
	; UNROLL-NEXT: [[TMP0:%.]] = insertelement <2 x i32> <i32 poison, i32 0>, i32 [[A:%.]], i64 0			; UNROLL-NEXT: [[TMP0:%.]] = insertelement <2 x i32> <i32 poison, i32 0>, i32 [[A:%.]], i64 0
				; UNROLL-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[B:%.]], i64 0
				; UNROLL-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer
				; UNROLL-NEXT: [[BROADCAST_SPLATINSERT3:%.*]] = insertelement <2 x i16> poison, i16 [[B]], i64 0
				; UNROLL-NEXT: [[BROADCAST_SPLAT4:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT3]], <2 x i16> poison, <2 x i32> zeroinitializer
	; UNROLL-NEXT: br label [[VECTOR_BODY:%.*]]			; UNROLL-NEXT: br label [[VECTOR_BODY:%.*]]
	; UNROLL: vector.body:			; UNROLL: vector.body:
	; UNROLL-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UREM_CONTINUE8:%.*]] ]			; UNROLL-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP28:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP9:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP29:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP10:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NEXT: [[TMP1:%.*]] = trunc i32 [[INDEX]] to i16			; UNROLL-NEXT: [[STEP_ADD:%.*]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>
	; UNROLL-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer			; UNROLL-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer
	; UNROLL-NEXT: [[TMP3:%.*]] = icmp eq <2 x i16> [[VEC_IND]], <i16 -2, i16 -2>			; UNROLL-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[STEP_ADD]], zeroinitializer
	; UNROLL-NEXT: [[TMP4:%.*]] = xor <2 x i1> [[TMP2]], <i1 true, i1 true>			; UNROLL-NEXT: [[TMP3:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> <i16 1, i16 1>, <2 x i16> [[VEC_IND]]
	; UNROLL-NEXT: [[TMP5:%.*]] = xor <2 x i1> [[TMP3]], <i1 true, i1 true>			; UNROLL-NEXT: [[TMP4:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> <i16 1, i16 1>, <2 x i16> [[STEP_ADD]]
	; UNROLL-NEXT: [[TMP6:%.*]] = extractelement <2 x i1> [[TMP4]], i64 0			; UNROLL-NEXT: [[TMP5:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT]], [[TMP3]]
	; UNROLL-NEXT: br i1 [[TMP6]], label [[PRED_UREM_IF:%.]], label [[PRED_UREM_CONTINUE:%.]]			; UNROLL-NEXT: [[TMP6:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT4]], [[TMP4]]
	; UNROLL: pred.urem.if:			; UNROLL-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> zeroinitializer, <2 x i16> [[TMP5]]
	; UNROLL-NEXT: [[TMP7:%.*]] = add i16 [[TMP1]], -20			; UNROLL-NEXT: [[PREDPHI5:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP6]]
	; UNROLL-NEXT: [[TMP8:%.]] = urem i16 [[B:%.]], [[TMP7]]			; UNROLL-NEXT: [[TMP7:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; UNROLL-NEXT: [[TMP9:%.*]] = insertelement <2 x i16> poison, i16 [[TMP8]], i64 0			; UNROLL-NEXT: [[TMP8:%.*]] = sext <2 x i16> [[PREDPHI5]] to <2 x i32>
	; UNROLL-NEXT: br label [[PRED_UREM_CONTINUE]]			; UNROLL-NEXT: [[TMP9]] = or <2 x i32> [[VEC_PHI]], [[TMP7]]
	; UNROLL: pred.urem.continue:			; UNROLL-NEXT: [[TMP10]] = or <2 x i32> [[VEC_PHI1]], [[TMP8]]
	; UNROLL-NEXT: [[TMP10:%.*]] = phi <2 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP9]], [[PRED_UREM_IF]] ]
	; UNROLL-NEXT: [[TMP11:%.*]] = extractelement <2 x i1> [[TMP4]], i64 1
	; UNROLL-NEXT: br i1 [[TMP11]], label [[PRED_UREM_IF3:%.]], label [[PRED_UREM_CONTINUE4:%.]]
	; UNROLL: pred.urem.if3:
	; UNROLL-NEXT: [[TMP12:%.*]] = add i16 [[TMP1]], -19
	; UNROLL-NEXT: [[TMP13:%.*]] = urem i16 [[B]], [[TMP12]]
	; UNROLL-NEXT: [[TMP14:%.*]] = insertelement <2 x i16> [[TMP10]], i16 [[TMP13]], i64 1
	; UNROLL-NEXT: br label [[PRED_UREM_CONTINUE4]]
	; UNROLL: pred.urem.continue4:
	; UNROLL-NEXT: [[TMP15:%.*]] = phi <2 x i16> [ [[TMP10]], [[PRED_UREM_CONTINUE]] ], [ [[TMP14]], [[PRED_UREM_IF3]] ]
	; UNROLL-NEXT: [[TMP16:%.*]] = extractelement <2 x i1> [[TMP5]], i64 0
	; UNROLL-NEXT: br i1 [[TMP16]], label [[PRED_UREM_IF5:%.]], label [[PRED_UREM_CONTINUE6:%.]]
	; UNROLL: pred.urem.if5:
	; UNROLL-NEXT: [[TMP17:%.*]] = add i16 [[TMP1]], -18
	; UNROLL-NEXT: [[TMP18:%.*]] = urem i16 [[B]], [[TMP17]]
	; UNROLL-NEXT: [[TMP19:%.*]] = insertelement <2 x i16> poison, i16 [[TMP18]], i64 0
	; UNROLL-NEXT: br label [[PRED_UREM_CONTINUE6]]
	; UNROLL: pred.urem.continue6:
	; UNROLL-NEXT: [[TMP20:%.*]] = phi <2 x i16> [ poison, [[PRED_UREM_CONTINUE4]] ], [ [[TMP19]], [[PRED_UREM_IF5]] ]
	; UNROLL-NEXT: [[TMP21:%.*]] = extractelement <2 x i1> [[TMP5]], i64 1
	; UNROLL-NEXT: br i1 [[TMP21]], label [[PRED_UREM_IF7:%.*]], label [[PRED_UREM_CONTINUE8]]
	; UNROLL: pred.urem.if7:
	; UNROLL-NEXT: [[TMP22:%.*]] = add i16 [[TMP1]], -17
	; UNROLL-NEXT: [[TMP23:%.*]] = urem i16 [[B]], [[TMP22]]
	; UNROLL-NEXT: [[TMP24:%.*]] = insertelement <2 x i16> [[TMP20]], i16 [[TMP23]], i64 1
	; UNROLL-NEXT: br label [[PRED_UREM_CONTINUE8]]
	; UNROLL: pred.urem.continue8:
	; UNROLL-NEXT: [[TMP25:%.*]] = phi <2 x i16> [ [[TMP20]], [[PRED_UREM_CONTINUE6]] ], [ [[TMP24]], [[PRED_UREM_IF7]] ]
	; UNROLL-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP15]]
	; UNROLL-NEXT: [[PREDPHI9:%.*]] = select <2 x i1> [[TMP3]], <2 x i16> zeroinitializer, <2 x i16> [[TMP25]]
	; UNROLL-NEXT: [[TMP26:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; UNROLL-NEXT: [[TMP27:%.*]] = sext <2 x i16> [[PREDPHI9]] to <2 x i32>
	; UNROLL-NEXT: [[TMP28]] = or <2 x i32> [[VEC_PHI]], [[TMP26]]
	; UNROLL-NEXT: [[TMP29]] = or <2 x i32> [[VEC_PHI1]], [[TMP27]]
	; UNROLL-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4			; UNROLL-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4
	; UNROLL-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 4, i16 4>			; UNROLL-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[VEC_IND]], <i16 4, i16 4>
	; UNROLL-NEXT: [[TMP30:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20			; UNROLL-NEXT: [[TMP11:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20
	; UNROLL-NEXT: br i1 [[TMP30]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]			; UNROLL-NEXT: br i1 [[TMP11]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]
	; UNROLL: middle.block:			; UNROLL: middle.block:
	; UNROLL-NEXT: [[BIN_RDX:%.*]] = or <2 x i32> [[TMP29]], [[TMP28]]			; UNROLL-NEXT: [[BIN_RDX:%.*]] = or <2 x i32> [[TMP10]], [[TMP9]]
	; UNROLL-NEXT: [[TMP31:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[BIN_RDX]])			; UNROLL-NEXT: [[TMP12:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[BIN_RDX]])
	; UNROLL-NEXT: br i1 true, label [[FOR_END:%.*]], label [[SCALAR_PH]]			; UNROLL-NEXT: br i1 true, label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; UNROLL: scalar.ph:			; UNROLL: scalar.ph:
	; UNROLL-NEXT: br label [[FOR_BODY:%.*]]			; UNROLL-NEXT: br label [[FOR_BODY:%.*]]
	; UNROLL: for.body:			; UNROLL: for.body:
	; UNROLL-NEXT: br i1 poison, label [[FOR_INC:%.]], label [[FOR_COND:%.]]			; UNROLL-NEXT: br i1 poison, label [[FOR_INC:%.]], label [[FOR_COND:%.]]
	; UNROLL: for.cond:			; UNROLL: for.cond:
	; UNROLL-NEXT: br label [[FOR_INC]]			; UNROLL-NEXT: br label [[FOR_INC]]
	; UNROLL: for.inc:			; UNROLL: for.inc:
	; UNROLL-NEXT: br i1 poison, label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]			; UNROLL-NEXT: br i1 poison, label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]
	; UNROLL: for.end:			; UNROLL: for.end:
	; UNROLL-NEXT: [[VAR7:%.*]] = phi i32 [ poison, [[FOR_INC]] ], [ [[TMP31]], [[MIDDLE_BLOCK]] ]			; UNROLL-NEXT: [[VAR7:%.*]] = phi i32 [ poison, [[FOR_INC]] ], [ [[TMP12]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NEXT: ret i32 [[VAR7]]			; UNROLL-NEXT: ret i32 [[VAR7]]
	;			;
	; UNROLL-NO-IC-LABEL: @PR32419(			; UNROLL-NO-IC-LABEL: @PR32419(
	; UNROLL-NO-IC-NEXT: entry:			; UNROLL-NO-IC-NEXT: entry:
	; UNROLL-NO-IC-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; UNROLL-NO-IC-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; UNROLL-NO-IC: vector.ph:			; UNROLL-NO-IC: vector.ph:
	; UNROLL-NO-IC-NEXT: [[TMP0:%.]] = insertelement <2 x i32> zeroinitializer, i32 [[A:%.]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP0:%.]] = insertelement <2 x i32> zeroinitializer, i32 [[A:%.]], i32 0
				; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[B:%.]], i32 0
				; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer
				; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLATINSERT3:%.*]] = insertelement <2 x i16> poison, i16 [[B]], i32 0
				; UNROLL-NO-IC-NEXT: [[BROADCAST_SPLAT4:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT3]], <2 x i16> poison, <2 x i32> zeroinitializer
	; UNROLL-NO-IC-NEXT: br label [[VECTOR_BODY:%.*]]			; UNROLL-NO-IC-NEXT: br label [[VECTOR_BODY:%.*]]
	; UNROLL-NO-IC: vector.body:			; UNROLL-NO-IC: vector.body:
	; UNROLL-NO-IC-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UREM_CONTINUE8:%.*]] ]			; UNROLL-NO-IC-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP28:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NO-IC-NEXT: [[VEC_PHI:%.]] = phi <2 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP11:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP29:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NO-IC-NEXT: [[VEC_PHI1:%.]] = phi <2 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP12:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[PRED_UREM_CONTINUE8]] ]			; UNROLL-NO-IC-NEXT: [[VEC_IND:%.]] = phi <2 x i16> [ <i16 -20, i16 -19>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[STEP_ADD:%.*]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>			; UNROLL-NO-IC-NEXT: [[STEP_ADD:%.*]] = add <2 x i16> [[VEC_IND]], <i16 2, i16 2>
	; UNROLL-NO-IC-NEXT: [[OFFSET_IDX:%.*]] = add i32 -20, [[INDEX]]			; UNROLL-NO-IC-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer
	; UNROLL-NO-IC-NEXT: [[TMP1:%.*]] = trunc i32 [[OFFSET_IDX]] to i16			; UNROLL-NO-IC-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[STEP_ADD]], zeroinitializer
	; UNROLL-NO-IC-NEXT: [[TMP2:%.*]] = icmp eq <2 x i16> [[VEC_IND]], zeroinitializer			; UNROLL-NO-IC-NEXT: [[TMP3:%.*]] = xor <2 x i1> [[TMP1]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: [[TMP3:%.*]] = icmp eq <2 x i16> [[STEP_ADD]], zeroinitializer
	; UNROLL-NO-IC-NEXT: [[TMP4:%.*]] = xor <2 x i1> [[TMP2]], <i1 true, i1 true>			; UNROLL-NO-IC-NEXT: [[TMP4:%.*]] = xor <2 x i1> [[TMP2]], <i1 true, i1 true>
	; UNROLL-NO-IC-NEXT: [[TMP5:%.*]] = xor <2 x i1> [[TMP3]], <i1 true, i1 true>			; UNROLL-NO-IC-NEXT: [[TMP5:%.*]] = select <2 x i1> [[TMP3]], <2 x i16> [[VEC_IND]], <2 x i16> <i16 1, i16 1>
	; UNROLL-NO-IC-NEXT: [[TMP6:%.*]] = extractelement <2 x i1> [[TMP4]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP6:%.*]] = select <2 x i1> [[TMP4]], <2 x i16> [[STEP_ADD]], <2 x i16> <i16 1, i16 1>
	; UNROLL-NO-IC-NEXT: br i1 [[TMP6]], label [[PRED_UREM_IF:%.]], label [[PRED_UREM_CONTINUE:%.]]			; UNROLL-NO-IC-NEXT: [[TMP7:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT]], [[TMP5]]
	; UNROLL-NO-IC: pred.urem.if:			; UNROLL-NO-IC-NEXT: [[TMP8:%.*]] = urem <2 x i16> [[BROADCAST_SPLAT4]], [[TMP6]]
	; UNROLL-NO-IC-NEXT: [[TMP7:%.*]] = add i16 [[TMP1]], 0			; UNROLL-NO-IC-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> zeroinitializer, <2 x i16> [[TMP7]]
	; UNROLL-NO-IC-NEXT: [[TMP8:%.]] = urem i16 [[B:%.]], [[TMP7]]			; UNROLL-NO-IC-NEXT: [[PREDPHI5:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP8]]
	; UNROLL-NO-IC-NEXT: [[TMP9:%.*]] = insertelement <2 x i16> poison, i16 [[TMP8]], i32 0			; UNROLL-NO-IC-NEXT: [[TMP9:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; UNROLL-NO-IC-NEXT: br label [[PRED_UREM_CONTINUE]]			; UNROLL-NO-IC-NEXT: [[TMP10:%.*]] = sext <2 x i16> [[PREDPHI5]] to <2 x i32>
	; UNROLL-NO-IC: pred.urem.continue:			; UNROLL-NO-IC-NEXT: [[TMP11]] = or <2 x i32> [[VEC_PHI]], [[TMP9]]
	; UNROLL-NO-IC-NEXT: [[TMP10:%.*]] = phi <2 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP9]], [[PRED_UREM_IF]] ]			; UNROLL-NO-IC-NEXT: [[TMP12]] = or <2 x i32> [[VEC_PHI1]], [[TMP10]]
	; UNROLL-NO-IC-NEXT: [[TMP11:%.*]] = extractelement <2 x i1> [[TMP4]], i32 1
	; UNROLL-NO-IC-NEXT: br i1 [[TMP11]], label [[PRED_UREM_IF3:%.]], label [[PRED_UREM_CONTINUE4:%.]]
	; UNROLL-NO-IC: pred.urem.if3:
	; UNROLL-NO-IC-NEXT: [[TMP12:%.*]] = add i16 [[TMP1]], 1
	; UNROLL-NO-IC-NEXT: [[TMP13:%.*]] = urem i16 [[B]], [[TMP12]]
	; UNROLL-NO-IC-NEXT: [[TMP14:%.*]] = insertelement <2 x i16> [[TMP10]], i16 [[TMP13]], i32 1
	; UNROLL-NO-IC-NEXT: br label [[PRED_UREM_CONTINUE4]]
	; UNROLL-NO-IC: pred.urem.continue4:
	; UNROLL-NO-IC-NEXT: [[TMP15:%.*]] = phi <2 x i16> [ [[TMP10]], [[PRED_UREM_CONTINUE]] ], [ [[TMP14]], [[PRED_UREM_IF3]] ]
	; UNROLL-NO-IC-NEXT: [[TMP16:%.*]] = extractelement <2 x i1> [[TMP5]], i32 0
	; UNROLL-NO-IC-NEXT: br i1 [[TMP16]], label [[PRED_UREM_IF5:%.]], label [[PRED_UREM_CONTINUE6:%.]]
	; UNROLL-NO-IC: pred.urem.if5:
	; UNROLL-NO-IC-NEXT: [[TMP17:%.*]] = add i16 [[TMP1]], 2
	; UNROLL-NO-IC-NEXT: [[TMP18:%.*]] = urem i16 [[B]], [[TMP17]]
	; UNROLL-NO-IC-NEXT: [[TMP19:%.*]] = insertelement <2 x i16> poison, i16 [[TMP18]], i32 0
	; UNROLL-NO-IC-NEXT: br label [[PRED_UREM_CONTINUE6]]
	; UNROLL-NO-IC: pred.urem.continue6:
	; UNROLL-NO-IC-NEXT: [[TMP20:%.*]] = phi <2 x i16> [ poison, [[PRED_UREM_CONTINUE4]] ], [ [[TMP19]], [[PRED_UREM_IF5]] ]
	; UNROLL-NO-IC-NEXT: [[TMP21:%.*]] = extractelement <2 x i1> [[TMP5]], i32 1
	; UNROLL-NO-IC-NEXT: br i1 [[TMP21]], label [[PRED_UREM_IF7:%.*]], label [[PRED_UREM_CONTINUE8]]
	; UNROLL-NO-IC: pred.urem.if7:
	; UNROLL-NO-IC-NEXT: [[TMP22:%.*]] = add i16 [[TMP1]], 3
	; UNROLL-NO-IC-NEXT: [[TMP23:%.*]] = urem i16 [[B]], [[TMP22]]
	; UNROLL-NO-IC-NEXT: [[TMP24:%.*]] = insertelement <2 x i16> [[TMP20]], i16 [[TMP23]], i32 1
	; UNROLL-NO-IC-NEXT: br label [[PRED_UREM_CONTINUE8]]
	; UNROLL-NO-IC: pred.urem.continue8:
	; UNROLL-NO-IC-NEXT: [[TMP25:%.*]] = phi <2 x i16> [ [[TMP20]], [[PRED_UREM_CONTINUE6]] ], [ [[TMP24]], [[PRED_UREM_IF7]] ]
	; UNROLL-NO-IC-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> zeroinitializer, <2 x i16> [[TMP15]]
	; UNROLL-NO-IC-NEXT: [[PREDPHI9:%.*]] = select <2 x i1> [[TMP3]], <2 x i16> zeroinitializer, <2 x i16> [[TMP25]]
	; UNROLL-NO-IC-NEXT: [[TMP26:%.*]] = sext <2 x i16> [[PREDPHI]] to <2 x i32>
	; UNROLL-NO-IC-NEXT: [[TMP27:%.*]] = sext <2 x i16> [[PREDPHI9]] to <2 x i32>
	; UNROLL-NO-IC-NEXT: [[TMP28]] = or <2 x i32> [[VEC_PHI]], [[TMP26]]
	; UNROLL-NO-IC-NEXT: [[TMP29]] = or <2 x i32> [[VEC_PHI1]], [[TMP27]]
	; UNROLL-NO-IC-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4			; UNROLL-NO-IC-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4
	; UNROLL-NO-IC-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[STEP_ADD]], <i16 2, i16 2>			; UNROLL-NO-IC-NEXT: [[VEC_IND_NEXT]] = add <2 x i16> [[STEP_ADD]], <i16 2, i16 2>
	; UNROLL-NO-IC-NEXT: [[TMP30:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20			; UNROLL-NO-IC-NEXT: [[TMP13:%.*]] = icmp eq i32 [[INDEX_NEXT]], 20
	; UNROLL-NO-IC-NEXT: br i1 [[TMP30]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]			; UNROLL-NO-IC-NEXT: br i1 [[TMP13]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]
	; UNROLL-NO-IC: middle.block:			; UNROLL-NO-IC: middle.block:
	; UNROLL-NO-IC-NEXT: [[BIN_RDX:%.*]] = or <2 x i32> [[TMP29]], [[TMP28]]			; UNROLL-NO-IC-NEXT: [[BIN_RDX:%.*]] = or <2 x i32> [[TMP12]], [[TMP11]]
	; UNROLL-NO-IC-NEXT: [[TMP31:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[BIN_RDX]])			; UNROLL-NO-IC-NEXT: [[TMP14:%.*]] = call i32 @llvm.vector.reduce.or.v2i32(<2 x i32> [[BIN_RDX]])
	; UNROLL-NO-IC-NEXT: [[CMP_N:%.*]] = icmp eq i32 20, 20			; UNROLL-NO-IC-NEXT: [[CMP_N:%.*]] = icmp eq i32 20, 20
	; UNROLL-NO-IC-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; UNROLL-NO-IC-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; UNROLL-NO-IC: scalar.ph:			; UNROLL-NO-IC: scalar.ph:
	; UNROLL-NO-IC-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ 0, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]			; UNROLL-NO-IC-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ 0, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]
	; UNROLL-NO-IC-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[A]], [[ENTRY]] ], [ [[TMP31]], [[MIDDLE_BLOCK]] ]			; UNROLL-NO-IC-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[A]], [[ENTRY]] ], [ [[TMP14]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NO-IC-NEXT: br label [[FOR_BODY:%.*]]			; UNROLL-NO-IC-NEXT: br label [[FOR_BODY:%.*]]
	; UNROLL-NO-IC: for.body:			; UNROLL-NO-IC: for.body:
	; UNROLL-NO-IC-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]			; UNROLL-NO-IC-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]
	; UNROLL-NO-IC-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]			; UNROLL-NO-IC-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]
	; UNROLL-NO-IC-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16			; UNROLL-NO-IC-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16
	; UNROLL-NO-IC-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0			; UNROLL-NO-IC-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0
	; UNROLL-NO-IC-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]			; UNROLL-NO-IC-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]
	; UNROLL-NO-IC: for.cond:			; UNROLL-NO-IC: for.cond:
	; UNROLL-NO-IC-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]			; UNROLL-NO-IC-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]
	; UNROLL-NO-IC-NEXT: br label [[FOR_INC]]			; UNROLL-NO-IC-NEXT: br label [[FOR_INC]]
	; UNROLL-NO-IC: for.inc:			; UNROLL-NO-IC: for.inc:
	; UNROLL-NO-IC-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]			; UNROLL-NO-IC-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]
	; UNROLL-NO-IC-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32			; UNROLL-NO-IC-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32
	; UNROLL-NO-IC-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]			; UNROLL-NO-IC-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]
	; UNROLL-NO-IC-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1			; UNROLL-NO-IC-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1
	; UNROLL-NO-IC-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0			; UNROLL-NO-IC-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0
	; UNROLL-NO-IC-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]			; UNROLL-NO-IC-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]
	; UNROLL-NO-IC: for.end:			; UNROLL-NO-IC: for.end:
	; UNROLL-NO-IC-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP31]], [[MIDDLE_BLOCK]] ]			; UNROLL-NO-IC-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP14]], [[MIDDLE_BLOCK]] ]
	; UNROLL-NO-IC-NEXT: ret i32 [[VAR7]]			; UNROLL-NO-IC-NEXT: ret i32 [[VAR7]]
	;			;
	; INTERLEAVE-LABEL: @PR32419(			; INTERLEAVE-LABEL: @PR32419(
	; INTERLEAVE-NEXT: entry:			; INTERLEAVE-NEXT: entry:
	; INTERLEAVE-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; INTERLEAVE-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; INTERLEAVE: vector.ph:			; INTERLEAVE: vector.ph:
	; INTERLEAVE-NEXT: [[TMP0:%.]] = insertelement <4 x i32> <i32 poison, i32 0, i32 0, i32 0>, i32 [[A:%.]], i64 0			; INTERLEAVE-NEXT: [[TMP0:%.]] = insertelement <4 x i32> <i32 poison, i32 0, i32 0, i32 0>, i32 [[A:%.]], i64 0
				; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <4 x i16> poison, i16 [[B:%.]], i64 0
				; INTERLEAVE-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <4 x i16> [[BROADCAST_SPLATINSERT]], <4 x i16> poison, <4 x i32> zeroinitializer
				; INTERLEAVE-NEXT: [[BROADCAST_SPLATINSERT3:%.*]] = insertelement <4 x i16> poison, i16 [[B]], i64 0
				; INTERLEAVE-NEXT: [[BROADCAST_SPLAT4:%.*]] = shufflevector <4 x i16> [[BROADCAST_SPLATINSERT3]], <4 x i16> poison, <4 x i32> zeroinitializer
	; INTERLEAVE-NEXT: br label [[VECTOR_BODY:%.*]]			; INTERLEAVE-NEXT: br label [[VECTOR_BODY:%.*]]
	; INTERLEAVE: vector.body:			; INTERLEAVE: vector.body:
	; INTERLEAVE-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_UREM_CONTINUE16:%.*]] ]			; INTERLEAVE-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP48:%.]], [[PRED_UREM_CONTINUE16]] ]			; INTERLEAVE-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ [[TMP0]], [[VECTOR_PH]] ], [ [[TMP9:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VEC_PHI1:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP49:%.]], [[PRED_UREM_CONTINUE16]] ]			; INTERLEAVE-NEXT: [[VEC_PHI1:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP10:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VEC_IND:%.]] = phi <4 x i16> [ <i16 -20, i16 -19, i16 -18, i16 -17>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[PRED_UREM_CONTINUE16]] ]			; INTERLEAVE-NEXT: [[VEC_IND:%.]] = phi <4 x i16> [ <i16 -20, i16 -19, i16 -18, i16 -17>, [[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.]], [[VECTOR_BODY]] ]
	; INTERLEAVE-NEXT: [[TMP1:%.*]] = trunc i32 [[INDEX]] to i16			; INTERLEAVE-NEXT: [[STEP_ADD:%.*]] = add <4 x i16> [[VEC_IND]], <i16 4, i16 4, i16 4, i16 4>
	; INTERLEAVE-NEXT: [[TMP2:%.*]] = icmp eq <4 x i16> [[VEC_IND]], zeroinitializer			; INTERLEAVE-NEXT: [[TMP1:%.*]] = icmp eq <4 x i16> [[VEC_IND]], zeroinitializer
	; INTERLEAVE-NEXT: [[TMP3:%.*]] = icmp eq <4 x i16> [[VEC_IND]], <i16 -4, i16 -4, i16 -4, i16 -4>			; INTERLEAVE-NEXT: [[TMP2:%.*]] = icmp eq <4 x i16> [[STEP_ADD]], zeroinitializer
	; INTERLEAVE-NEXT: [[TMP4:%.*]] = xor <4 x i1> [[TMP2]], <i1 true, i1 true, i1 true, i1 true>			; INTERLEAVE-NEXT: [[TMP3:%.*]] = select <4 x i1> [[TMP1]], <4 x i16> <i16 1, i16 1, i16 1, i16 1>, <4 x i16> [[VEC_IND]]
	; INTERLEAVE-NEXT: [[TMP5:%.*]] = xor <4 x i1> [[TMP3]], <i1 true, i1 true, i1 true, i1 true>			; INTERLEAVE-NEXT: [[TMP4:%.*]] = select <4 x i1> [[TMP2]], <4 x i16> <i16 1, i16 1, i16 1, i16 1>, <4 x i16> [[STEP_ADD]]
	; INTERLEAVE-NEXT: [[TMP6:%.*]] = extractelement <4 x i1> [[TMP4]], i64 0			; INTERLEAVE-NEXT: [[TMP5:%.*]] = urem <4 x i16> [[BROADCAST_SPLAT]], [[TMP3]]
	; INTERLEAVE-NEXT: br i1 [[TMP6]], label [[PRED_UREM_IF:%.]], label [[PRED_UREM_CONTINUE:%.]]			; INTERLEAVE-NEXT: [[TMP6:%.*]] = urem <4 x i16> [[BROADCAST_SPLAT4]], [[TMP4]]
	; INTERLEAVE: pred.urem.if:			; INTERLEAVE-NEXT: [[PREDPHI:%.*]] = select <4 x i1> [[TMP1]], <4 x i16> zeroinitializer, <4 x i16> [[TMP5]]
	; INTERLEAVE-NEXT: [[TMP7:%.*]] = add i16 [[TMP1]], -20			; INTERLEAVE-NEXT: [[PREDPHI5:%.*]] = select <4 x i1> [[TMP2]], <4 x i16> zeroinitializer, <4 x i16> [[TMP6]]
	; INTERLEAVE-NEXT: [[TMP8:%.]] = urem i16 [[B:%.]], [[TMP7]]			; INTERLEAVE-NEXT: [[TMP7:%.*]] = sext <4 x i16> [[PREDPHI]] to <4 x i32>
	; INTERLEAVE-NEXT: [[TMP9:%.*]] = insertelement <4 x i16> poison, i16 [[TMP8]], i64 0			; INTERLEAVE-NEXT: [[TMP8:%.*]] = sext <4 x i16> [[PREDPHI5]] to <4 x i32>
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE]]			; INTERLEAVE-NEXT: [[TMP9]] = or <4 x i32> [[VEC_PHI]], [[TMP7]]
	; INTERLEAVE: pred.urem.continue:			; INTERLEAVE-NEXT: [[TMP10]] = or <4 x i32> [[VEC_PHI1]], [[TMP8]]
	; INTERLEAVE-NEXT: [[TMP10:%.*]] = phi <4 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP9]], [[PRED_UREM_IF]] ]
	; INTERLEAVE-NEXT: [[TMP11:%.*]] = extractelement <4 x i1> [[TMP4]], i64 1
	; INTERLEAVE-NEXT: br i1 [[TMP11]], label [[PRED_UREM_IF3:%.]], label [[PRED_UREM_CONTINUE4:%.]]
	; INTERLEAVE: pred.urem.if3:
	; INTERLEAVE-NEXT: [[TMP12:%.*]] = add i16 [[TMP1]], -19
	; INTERLEAVE-NEXT: [[TMP13:%.*]] = urem i16 [[B]], [[TMP12]]
	; INTERLEAVE-NEXT: [[TMP14:%.*]] = insertelement <4 x i16> [[TMP10]], i16 [[TMP13]], i64 1
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE4]]
	; INTERLEAVE: pred.urem.continue4:
	; INTERLEAVE-NEXT: [[TMP15:%.*]] = phi <4 x i16> [ [[TMP10]], [[PRED_UREM_CONTINUE]] ], [ [[TMP14]], [[PRED_UREM_IF3]] ]
	; INTERLEAVE-NEXT: [[TMP16:%.*]] = extractelement <4 x i1> [[TMP4]], i64 2
	; INTERLEAVE-NEXT: br i1 [[TMP16]], label [[PRED_UREM_IF5:%.]], label [[PRED_UREM_CONTINUE6:%.]]
	; INTERLEAVE: pred.urem.if5:
	; INTERLEAVE-NEXT: [[TMP17:%.*]] = add i16 [[TMP1]], -18
	; INTERLEAVE-NEXT: [[TMP18:%.*]] = urem i16 [[B]], [[TMP17]]
	; INTERLEAVE-NEXT: [[TMP19:%.*]] = insertelement <4 x i16> [[TMP15]], i16 [[TMP18]], i64 2
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE6]]
	; INTERLEAVE: pred.urem.continue6:
	; INTERLEAVE-NEXT: [[TMP20:%.*]] = phi <4 x i16> [ [[TMP15]], [[PRED_UREM_CONTINUE4]] ], [ [[TMP19]], [[PRED_UREM_IF5]] ]
	; INTERLEAVE-NEXT: [[TMP21:%.*]] = extractelement <4 x i1> [[TMP4]], i64 3
	; INTERLEAVE-NEXT: br i1 [[TMP21]], label [[PRED_UREM_IF7:%.]], label [[PRED_UREM_CONTINUE8:%.]]
	; INTERLEAVE: pred.urem.if7:
	; INTERLEAVE-NEXT: [[TMP22:%.*]] = add i16 [[TMP1]], -17
	; INTERLEAVE-NEXT: [[TMP23:%.*]] = urem i16 [[B]], [[TMP22]]
	; INTERLEAVE-NEXT: [[TMP24:%.*]] = insertelement <4 x i16> [[TMP20]], i16 [[TMP23]], i64 3
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE8]]
	; INTERLEAVE: pred.urem.continue8:
	; INTERLEAVE-NEXT: [[TMP25:%.*]] = phi <4 x i16> [ [[TMP20]], [[PRED_UREM_CONTINUE6]] ], [ [[TMP24]], [[PRED_UREM_IF7]] ]
	; INTERLEAVE-NEXT: [[TMP26:%.*]] = extractelement <4 x i1> [[TMP5]], i64 0
	; INTERLEAVE-NEXT: br i1 [[TMP26]], label [[PRED_UREM_IF9:%.]], label [[PRED_UREM_CONTINUE10:%.]]
	; INTERLEAVE: pred.urem.if9:
	; INTERLEAVE-NEXT: [[TMP27:%.*]] = add i16 [[TMP1]], -16
	; INTERLEAVE-NEXT: [[TMP28:%.*]] = urem i16 [[B]], [[TMP27]]
	; INTERLEAVE-NEXT: [[TMP29:%.*]] = insertelement <4 x i16> poison, i16 [[TMP28]], i64 0
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE10]]
	; INTERLEAVE: pred.urem.continue10:
	; INTERLEAVE-NEXT: [[TMP30:%.*]] = phi <4 x i16> [ poison, [[PRED_UREM_CONTINUE8]] ], [ [[TMP29]], [[PRED_UREM_IF9]] ]
	; INTERLEAVE-NEXT: [[TMP31:%.*]] = extractelement <4 x i1> [[TMP5]], i64 1
	; INTERLEAVE-NEXT: br i1 [[TMP31]], label [[PRED_UREM_IF11:%.]], label [[PRED_UREM_CONTINUE12:%.]]
	; INTERLEAVE: pred.urem.if11:
	; INTERLEAVE-NEXT: [[TMP32:%.*]] = add i16 [[TMP1]], -15
	; INTERLEAVE-NEXT: [[TMP33:%.*]] = urem i16 [[B]], [[TMP32]]
	; INTERLEAVE-NEXT: [[TMP34:%.*]] = insertelement <4 x i16> [[TMP30]], i16 [[TMP33]], i64 1
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE12]]
	; INTERLEAVE: pred.urem.continue12:
	; INTERLEAVE-NEXT: [[TMP35:%.*]] = phi <4 x i16> [ [[TMP30]], [[PRED_UREM_CONTINUE10]] ], [ [[TMP34]], [[PRED_UREM_IF11]] ]
	; INTERLEAVE-NEXT: [[TMP36:%.*]] = extractelement <4 x i1> [[TMP5]], i64 2
	; INTERLEAVE-NEXT: br i1 [[TMP36]], label [[PRED_UREM_IF13:%.]], label [[PRED_UREM_CONTINUE14:%.]]
	; INTERLEAVE: pred.urem.if13:
	; INTERLEAVE-NEXT: [[TMP37:%.*]] = add i16 [[TMP1]], -14
	; INTERLEAVE-NEXT: [[TMP38:%.*]] = urem i16 [[B]], [[TMP37]]
	; INTERLEAVE-NEXT: [[TMP39:%.*]] = insertelement <4 x i16> [[TMP35]], i16 [[TMP38]], i64 2
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE14]]
	; INTERLEAVE: pred.urem.continue14:
	; INTERLEAVE-NEXT: [[TMP40:%.*]] = phi <4 x i16> [ [[TMP35]], [[PRED_UREM_CONTINUE12]] ], [ [[TMP39]], [[PRED_UREM_IF13]] ]
	; INTERLEAVE-NEXT: [[TMP41:%.*]] = extractelement <4 x i1> [[TMP5]], i64 3
	; INTERLEAVE-NEXT: br i1 [[TMP41]], label [[PRED_UREM_IF15:%.*]], label [[PRED_UREM_CONTINUE16]]
	; INTERLEAVE: pred.urem.if15:
	; INTERLEAVE-NEXT: [[TMP42:%.*]] = add i16 [[TMP1]], -13
	; INTERLEAVE-NEXT: [[TMP43:%.*]] = urem i16 [[B]], [[TMP42]]
	; INTERLEAVE-NEXT: [[TMP44:%.*]] = insertelement <4 x i16> [[TMP40]], i16 [[TMP43]], i64 3
	; INTERLEAVE-NEXT: br label [[PRED_UREM_CONTINUE16]]
	; INTERLEAVE: pred.urem.continue16:
	; INTERLEAVE-NEXT: [[TMP45:%.*]] = phi <4 x i16> [ [[TMP40]], [[PRED_UREM_CONTINUE14]] ], [ [[TMP44]], [[PRED_UREM_IF15]] ]
	; INTERLEAVE-NEXT: [[PREDPHI:%.*]] = select <4 x i1> [[TMP2]], <4 x i16> zeroinitializer, <4 x i16> [[TMP25]]
	; INTERLEAVE-NEXT: [[PREDPHI17:%.*]] = select <4 x i1> [[TMP3]], <4 x i16> zeroinitializer, <4 x i16> [[TMP45]]
	; INTERLEAVE-NEXT: [[TMP46:%.*]] = sext <4 x i16> [[PREDPHI]] to <4 x i32>
	; INTERLEAVE-NEXT: [[TMP47:%.*]] = sext <4 x i16> [[PREDPHI17]] to <4 x i32>
	; INTERLEAVE-NEXT: [[TMP48]] = or <4 x i32> [[VEC_PHI]], [[TMP46]]
	; INTERLEAVE-NEXT: [[TMP49]] = or <4 x i32> [[VEC_PHI1]], [[TMP47]]
	; INTERLEAVE-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 8			; INTERLEAVE-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 8
	; INTERLEAVE-NEXT: [[VEC_IND_NEXT]] = add <4 x i16> [[VEC_IND]], <i16 8, i16 8, i16 8, i16 8>			; INTERLEAVE-NEXT: [[VEC_IND_NEXT]] = add <4 x i16> [[VEC_IND]], <i16 8, i16 8, i16 8, i16 8>
	; INTERLEAVE-NEXT: [[TMP50:%.*]] = icmp eq i32 [[INDEX_NEXT]], 16			; INTERLEAVE-NEXT: [[TMP11:%.*]] = icmp eq i32 [[INDEX_NEXT]], 16
	; INTERLEAVE-NEXT: br i1 [[TMP50]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]			; INTERLEAVE-NEXT: br i1 [[TMP11]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP48:![0-9]+]]
	; INTERLEAVE: middle.block:			; INTERLEAVE: middle.block:
	; INTERLEAVE-NEXT: [[BIN_RDX:%.*]] = or <4 x i32> [[TMP49]], [[TMP48]]			; INTERLEAVE-NEXT: [[BIN_RDX:%.*]] = or <4 x i32> [[TMP10]], [[TMP9]]
	; INTERLEAVE-NEXT: [[TMP51:%.*]] = call i32 @llvm.vector.reduce.or.v4i32(<4 x i32> [[BIN_RDX]])			; INTERLEAVE-NEXT: [[TMP12:%.*]] = call i32 @llvm.vector.reduce.or.v4i32(<4 x i32> [[BIN_RDX]])
	; INTERLEAVE-NEXT: br i1 false, label [[FOR_END:%.*]], label [[SCALAR_PH]]			; INTERLEAVE-NEXT: br i1 false, label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; INTERLEAVE: scalar.ph:			; INTERLEAVE: scalar.ph:
	; INTERLEAVE-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ -4, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]			; INTERLEAVE-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ -4, [[MIDDLE_BLOCK]] ], [ -20, [[ENTRY:%.]] ]
	; INTERLEAVE-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP51]], [[MIDDLE_BLOCK]] ], [ [[A]], [[ENTRY]] ]			; INTERLEAVE-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ [[TMP12]], [[MIDDLE_BLOCK]] ], [ [[A]], [[ENTRY]] ]
	; INTERLEAVE-NEXT: br label [[FOR_BODY:%.*]]			; INTERLEAVE-NEXT: br label [[FOR_BODY:%.*]]
	; INTERLEAVE: for.body:			; INTERLEAVE: for.body:
	; INTERLEAVE-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]			; INTERLEAVE-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[FOR_INC:%.*]] ]
	; INTERLEAVE-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]			; INTERLEAVE-NEXT: [[VAR0:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[VAR6:%.]], [[FOR_INC]] ]
	; INTERLEAVE-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16			; INTERLEAVE-NEXT: [[VAR1:%.*]] = trunc i32 [[I]] to i16
	; INTERLEAVE-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0			; INTERLEAVE-NEXT: [[VAR2:%.*]] = icmp eq i16 [[VAR1]], 0
	; INTERLEAVE-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]			; INTERLEAVE-NEXT: br i1 [[VAR2]], label [[FOR_INC]], label [[FOR_COND:%.*]]
	; INTERLEAVE: for.cond:			; INTERLEAVE: for.cond:
	; INTERLEAVE-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]			; INTERLEAVE-NEXT: [[VAR3:%.*]] = urem i16 [[B]], [[VAR1]]
	; INTERLEAVE-NEXT: br label [[FOR_INC]]			; INTERLEAVE-NEXT: br label [[FOR_INC]]
	; INTERLEAVE: for.inc:			; INTERLEAVE: for.inc:
	; INTERLEAVE-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]			; INTERLEAVE-NEXT: [[VAR4:%.*]] = phi i16 [ [[VAR3]], [[FOR_COND]] ], [ 0, [[FOR_BODY]] ]
	; INTERLEAVE-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32			; INTERLEAVE-NEXT: [[VAR5:%.*]] = sext i16 [[VAR4]] to i32
	; INTERLEAVE-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]			; INTERLEAVE-NEXT: [[VAR6]] = or i32 [[VAR0]], [[VAR5]]
	; INTERLEAVE-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1			; INTERLEAVE-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1
	; INTERLEAVE-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0			; INTERLEAVE-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], 0
	; INTERLEAVE-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]			; INTERLEAVE-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP49:![0-9]+]]
	; INTERLEAVE: for.end:			; INTERLEAVE: for.end:
	; INTERLEAVE-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP51]], [[MIDDLE_BLOCK]] ]			; INTERLEAVE-NEXT: [[VAR7:%.*]] = phi i32 [ [[VAR6]], [[FOR_INC]] ], [ [[TMP12]], [[MIDDLE_BLOCK]] ]
	; INTERLEAVE-NEXT: ret i32 [[VAR7]]			; INTERLEAVE-NEXT: ret i32 [[VAR7]]
	;			;
	entry:			entry:
	br label %for.body			br label %for.body

	for.body:			for.body:
	%i = phi i32 [ -20, %entry ], [ %i.next, %for.inc ]			%i = phi i32 [ -20, %entry ], [ %i.next, %for.inc ]
	%var0 = phi i32 [ %a, %entry ], [ %var6, %for.inc ]			%var0 = phi i32 [ %a, %entry ], [ %var6, %for.inc ]
	▲ Show 20 Lines • Show All 1,121 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/pr44488-predication.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -force-vector-width=2 -force-vector-interleave=1 -passes=loop-vectorize -S \| FileCheck %s			; RUN: opt < %s -force-vector-width=2 -force-vector-interleave=1 -passes=loop-vectorize -S \| FileCheck %s

	; Test case for PR44488. Checks that the correct predicates are created for			; Test case for PR44488. Checks that the correct predicates are created for
	; branches where true and false successors are equal. See the checks involving			; branches where true and false successors are equal. See the checks involving
	; CMP1 and CMP2.			; CMP1 and CMP2.

	@v_38 = global i16 12061, align 1			@v_38 = global i16 12061, align 1
	@v_39 = global i16 11333, align 1			@v_39 = global i16 11333, align 1

	define i16 @test_true_and_false_branch_equal() {			define i16 @test_true_and_false_branch_equal() {
				fhahnUnsubmitted Not Done Reply Inline Actions It looks like this test specially tests for a special case with branches and it should probably use `-force-widen-divrem-via-safe-divisor=0` . fhahn: It looks like this test specially tests for a special case with branches and it should probably…
	; CHECK-LABEL: @test_true_and_false_branch_equal(			; CHECK-LABEL: @test_true_and_false_branch_equal(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_SREM_CONTINUE2:%.*]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[TMP0:%.]] = load i16, i16 @v_38, align 1			; CHECK-NEXT: [[TMP0:%.]] = load i16, i16 @v_38, align 1
	; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.*]] = insertelement <2 x i16> poison, i16 [[TMP0]], i32 0			; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.*]] = insertelement <2 x i16> poison, i16 [[TMP0]], i32 0
	; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer			; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <2 x i16> [[BROADCAST_SPLATINSERT]], <2 x i16> poison, <2 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[BROADCAST_SPLAT]], zeroinitializer			; CHECK-NEXT: [[TMP1:%.*]] = icmp eq <2 x i16> [[BROADCAST_SPLAT]], zeroinitializer
	; CHECK-NEXT: [[TMP2:%.*]] = xor <2 x i1> [[TMP1]], <i1 true, i1 true>			; CHECK-NEXT: [[TMP2:%.*]] = xor <2 x i1> [[TMP1]], <i1 true, i1 true>
	; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x i1> [[TMP2]], i32 0			; CHECK-NEXT: [[TMP3:%.*]] = select <2 x i1> [[TMP2]], <2 x i16> [[BROADCAST_SPLAT]], <2 x i16> <i16 1, i16 1>
	; CHECK-NEXT: br i1 [[TMP3]], label [[PRED_SREM_IF:%.]], label [[PRED_SREM_CONTINUE:%.]]			; CHECK-NEXT: [[TMP4:%.*]] = srem <2 x i16> <i16 5786, i16 5786>, [[TMP3]]
	; CHECK: pred.srem.if:			; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> <i16 5786, i16 5786>, <2 x i16> [[TMP4]]
	; CHECK-NEXT: [[TMP4:%.*]] = srem i16 5786, [[TMP0]]			; CHECK-NEXT: [[TMP5:%.*]] = extractelement <2 x i16> [[PREDPHI]], i32 0
	; CHECK-NEXT: [[TMP5:%.*]] = insertelement <2 x i16> poison, i16 [[TMP4]], i32 0			; CHECK-NEXT: store i16 [[TMP5]], i16* @v_39, align 1
	; CHECK-NEXT: br label [[PRED_SREM_CONTINUE]]			; CHECK-NEXT: [[TMP6:%.*]] = extractelement <2 x i16> [[PREDPHI]], i32 1
	; CHECK: pred.srem.continue:			; CHECK-NEXT: store i16 [[TMP6]], i16* @v_39, align 1
	; CHECK-NEXT: [[TMP6:%.*]] = phi <2 x i16> [ poison, [[VECTOR_BODY]] ], [ [[TMP5]], [[PRED_SREM_IF]] ]
	; CHECK-NEXT: [[TMP7:%.*]] = extractelement <2 x i1> [[TMP2]], i32 1
	; CHECK-NEXT: br i1 [[TMP7]], label [[PRED_SREM_IF1:%.*]], label [[PRED_SREM_CONTINUE2]]
	; CHECK: pred.srem.if1:
	; CHECK-NEXT: [[TMP8:%.*]] = srem i16 5786, [[TMP0]]
	; CHECK-NEXT: [[TMP9:%.*]] = insertelement <2 x i16> [[TMP6]], i16 [[TMP8]], i32 1
	; CHECK-NEXT: br label [[PRED_SREM_CONTINUE2]]
	; CHECK: pred.srem.continue2:
	; CHECK-NEXT: [[TMP10:%.*]] = phi <2 x i16> [ [[TMP6]], [[PRED_SREM_CONTINUE]] ], [ [[TMP9]], [[PRED_SREM_IF1]] ]
	; CHECK-NEXT: [[PREDPHI:%.*]] = select <2 x i1> [[TMP1]], <2 x i16> <i16 5786, i16 5786>, <2 x i16> [[TMP10]]
	; CHECK-NEXT: [[TMP11:%.*]] = extractelement <2 x i16> [[PREDPHI]], i32 0
	; CHECK-NEXT: store i16 [[TMP11]], i16* @v_39, align 1
	; CHECK-NEXT: [[TMP12:%.*]] = extractelement <2 x i16> [[PREDPHI]], i32 1
	; CHECK-NEXT: store i16 [[TMP12]], i16* @v_39, align 1
	; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2			; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 2
	; CHECK-NEXT: [[TMP13:%.*]] = icmp eq i32 [[INDEX_NEXT]], 12			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[INDEX_NEXT]], 12
	; CHECK-NEXT: br i1 [[TMP13]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]			; CHECK-NEXT: br i1 [[TMP7]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 12, 12			; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 12, 12
	; CHECK-NEXT: br i1 [[CMP_N]], label [[EXIT:%.*]], label [[SCALAR_PH]]			; CHECK-NEXT: br i1 [[CMP_N]], label [[EXIT:%.*]], label [[SCALAR_PH]]
	; CHECK: scalar.ph:			; CHECK: scalar.ph:
	; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i16 [ 111, [[MIDDLE_BLOCK]] ], [ 99, [[ENTRY:%.]] ]			; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i16 [ 111, [[MIDDLE_BLOCK]] ], [ 99, [[ENTRY:%.]] ]
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[I_07:%.]] = phi i16 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[INC7:%.]], [[FOR_LATCH:%.*]] ]			; CHECK-NEXT: [[I_07:%.]] = phi i16 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[INC7:%.]], [[FOR_LATCH:%.*]] ]
	▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/reduction-small-size.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -force-vector-width=4 -force-vector-interleave=1 -passes=loop-vectorize -S \| FileCheck %s			; RUN: opt < %s -force-vector-width=4 -force-vector-interleave=1 -passes=loop-vectorize -S \| FileCheck %s

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	define i8 @PR34687(i1 %c, i32 %x, i32 %n) {			define i8 @PR34687(i1 %c, i32 %x, i32 %n) {
	; CHECK-LABEL: @PR34687(			; CHECK-LABEL: @PR34687(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[MIN_ITERS_CHECK:%.]] = icmp ult i32 [[N:%.]], 4			; CHECK-NEXT: [[MIN_ITERS_CHECK:%.]] = icmp ult i32 [[N:%.]], 4
	; CHECK-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 [[MIN_ITERS_CHECK]], label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[N]], 4			; CHECK-NEXT: [[N_MOD_VF:%.*]] = urem i32 [[N]], 4
	; CHECK-NEXT: [[N_VEC:%.*]] = sub i32 [[N]], [[N_MOD_VF]]			; CHECK-NEXT: [[N_VEC:%.*]] = sub i32 [[N]], [[N_MOD_VF]]
	; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <4 x i1> poison, i1 [[C:%.]], i32 0			; CHECK-NEXT: [[BROADCAST_SPLATINSERT:%.]] = insertelement <4 x i1> poison, i1 [[C:%.]], i32 0
	; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <4 x i1> [[BROADCAST_SPLATINSERT]], <4 x i1> poison, <4 x i32> zeroinitializer			; CHECK-NEXT: [[BROADCAST_SPLAT:%.*]] = shufflevector <4 x i1> [[BROADCAST_SPLATINSERT]], <4 x i1> poison, <4 x i32> zeroinitializer
	; CHECK-NEXT: [[BROADCAST_SPLATINSERT7:%.]] = insertelement <4 x i32> poison, i32 [[X:%.]], i32 0			; CHECK-NEXT: [[BROADCAST_SPLATINSERT1:%.]] = insertelement <4 x i32> poison, i32 [[X:%.]], i32 0
	; CHECK-NEXT: [[BROADCAST_SPLAT8:%.*]] = shufflevector <4 x i32> [[BROADCAST_SPLATINSERT7]], <4 x i32> poison, <4 x i32> zeroinitializer			; CHECK-NEXT: [[BROADCAST_SPLAT2:%.*]] = shufflevector <4 x i32> [[BROADCAST_SPLATINSERT1]], <4 x i32> poison, <4 x i32> zeroinitializer
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[PRED_SDIV_CONTINUE6:%.*]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i32 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP8:%.]], [[PRED_SDIV_CONTINUE6]] ]			; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x i32> [ zeroinitializer, [[VECTOR_PH]] ], [ [[TMP5:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[TMP0:%.*]] = extractelement <4 x i1> [[BROADCAST_SPLAT]], i32 0			; CHECK-NEXT: [[TMP0:%.*]] = select <4 x i1> [[BROADCAST_SPLAT]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
	; CHECK-NEXT: br i1 [[TMP0]], label [[PRED_SDIV_IF:%.]], label [[PRED_SDIV_CONTINUE:%.]]			; CHECK-NEXT: [[TMP1:%.*]] = and <4 x i32> [[VEC_PHI]], <i32 255, i32 255, i32 255, i32 255>
	; CHECK: pred.sdiv.if:			; CHECK-NEXT: [[TMP2:%.*]] = add <4 x i32> [[TMP1]], [[BROADCAST_SPLAT2]]
	; CHECK-NEXT: br label [[PRED_SDIV_CONTINUE]]
	; CHECK: pred.sdiv.continue:
	; CHECK-NEXT: [[TMP1:%.*]] = extractelement <4 x i1> [[BROADCAST_SPLAT]], i32 1
	; CHECK-NEXT: br i1 [[TMP1]], label [[PRED_SDIV_IF1:%.]], label [[PRED_SDIV_CONTINUE2:%.]]
	; CHECK: pred.sdiv.if1:
	; CHECK-NEXT: br label [[PRED_SDIV_CONTINUE2]]
	; CHECK: pred.sdiv.continue2:
	; CHECK-NEXT: [[TMP2:%.*]] = extractelement <4 x i1> [[BROADCAST_SPLAT]], i32 2
	; CHECK-NEXT: br i1 [[TMP2]], label [[PRED_SDIV_IF3:%.]], label [[PRED_SDIV_CONTINUE4:%.]]
	; CHECK: pred.sdiv.if3:
	; CHECK-NEXT: br label [[PRED_SDIV_CONTINUE4]]
	; CHECK: pred.sdiv.continue4:
	; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x i1> [[BROADCAST_SPLAT]], i32 3
	; CHECK-NEXT: br i1 [[TMP3]], label [[PRED_SDIV_IF5:%.*]], label [[PRED_SDIV_CONTINUE6]]
	; CHECK: pred.sdiv.if5:
	; CHECK-NEXT: br label [[PRED_SDIV_CONTINUE6]]
	; CHECK: pred.sdiv.continue6:
	; CHECK-NEXT: [[TMP4:%.*]] = and <4 x i32> [[VEC_PHI]], <i32 255, i32 255, i32 255, i32 255>
	; CHECK-NEXT: [[TMP5:%.*]] = add <4 x i32> [[TMP4]], [[BROADCAST_SPLAT8]]
	; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4			; CHECK-NEXT: [[INDEX_NEXT]] = add nuw i32 [[INDEX]], 4
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]			; CHECK-NEXT: [[TMP3:%.*]] = icmp eq i32 [[INDEX_NEXT]], [[N_VEC]]
	; CHECK-NEXT: [[TMP7:%.*]] = trunc <4 x i32> [[TMP5]] to <4 x i8>			; CHECK-NEXT: [[TMP4:%.*]] = trunc <4 x i32> [[TMP2]] to <4 x i8>
	; CHECK-NEXT: [[TMP8]] = zext <4 x i8> [[TMP7]] to <4 x i32>			; CHECK-NEXT: [[TMP5]] = zext <4 x i8> [[TMP4]] to <4 x i32>
	; CHECK-NEXT: br i1 [[TMP6]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]			; CHECK-NEXT: br i1 [[TMP3]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: [[TMP9:%.*]] = trunc <4 x i32> [[TMP8]] to <4 x i8>			; CHECK-NEXT: [[TMP6:%.*]] = trunc <4 x i32> [[TMP5]] to <4 x i8>
	; CHECK-NEXT: [[TMP10:%.*]] = call i8 @llvm.vector.reduce.add.v4i8(<4 x i8> [[TMP9]])			; CHECK-NEXT: [[TMP7:%.*]] = call i8 @llvm.vector.reduce.add.v4i8(<4 x i8> [[TMP6]])
	; CHECK-NEXT: [[TMP11:%.*]] = zext i8 [[TMP10]] to i32			; CHECK-NEXT: [[TMP8:%.*]] = zext i8 [[TMP7]] to i32
	; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[N]], [[N_VEC]]			; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i32 [[N]], [[N_VEC]]
	; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]			; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.*]], label [[SCALAR_PH]]
	; CHECK: scalar.ph:			; CHECK: scalar.ph:
	; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]			; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i32 [ [[N_VEC]], [[MIDDLE_BLOCK]] ], [ 0, [[ENTRY:%.]] ]
	; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP11]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[BC_MERGE_RDX:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[TMP8]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]			; CHECK-NEXT: [[I:%.]] = phi i32 [ [[BC_RESUME_VAL]], [[SCALAR_PH]] ], [ [[I_NEXT:%.]], [[IF_END:%.*]] ]
	; CHECK-NEXT: [[R:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[R_NEXT:%.]], [[IF_END]] ]			; CHECK-NEXT: [[R:%.]] = phi i32 [ [[BC_MERGE_RDX]], [[SCALAR_PH]] ], [ [[R_NEXT:%.]], [[IF_END]] ]
	; CHECK-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]			; CHECK-NEXT: br i1 [[C]], label [[IF_THEN:%.*]], label [[IF_END]]
	; CHECK: if.then:			; CHECK: if.then:
	; CHECK-NEXT: [[T0:%.*]] = sdiv i32 undef, undef			; CHECK-NEXT: [[T0:%.*]] = sdiv i32 undef, undef
	; CHECK-NEXT: br label [[IF_END]]			; CHECK-NEXT: br label [[IF_END]]
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: [[T1:%.*]] = and i32 [[R]], 255			; CHECK-NEXT: [[T1:%.*]] = and i32 [[R]], 255
	; CHECK-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1			; CHECK-NEXT: [[I_NEXT]] = add nsw i32 [[I]], 1
	; CHECK-NEXT: [[R_NEXT]] = add nuw nsw i32 [[T1]], [[X]]			; CHECK-NEXT: [[R_NEXT]] = add nuw nsw i32 [[T1]], [[X]]
	; CHECK-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], [[N]]			; CHECK-NEXT: [[COND:%.*]] = icmp eq i32 [[I_NEXT]], [[N]]
	; CHECK-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP2:![0-9]+]]			; CHECK-NEXT: br i1 [[COND]], label [[FOR_END]], label [[FOR_BODY]], !llvm.loop [[LOOP2:![0-9]+]]
	; CHECK: for.end:			; CHECK: for.end:
	; CHECK-NEXT: [[T2:%.*]] = phi i32 [ [[R_NEXT]], [[IF_END]] ], [ [[TMP11]], [[MIDDLE_BLOCK]] ]			; CHECK-NEXT: [[T2:%.*]] = phi i32 [ [[R_NEXT]], [[IF_END]] ], [ [[TMP8]], [[MIDDLE_BLOCK]] ]
	; CHECK-NEXT: [[T3:%.*]] = trunc i32 [[T2]] to i8			; CHECK-NEXT: [[T3:%.*]] = trunc i32 [[T2]] to i8
	; CHECK-NEXT: ret i8 [[T3]]			; CHECK-NEXT: ret i8 [[T3]]
	;			;
	entry:			entry:
	br label %for.body			br label %for.body

	for.body:			for.body:
	%i = phi i32 [ 0, %entry ], [ %i.next, %if.end ]			%i = phi i32 [ 0, %entry ], [ %i.next, %if.end ]
	▲ Show 20 Lines • Show All 148 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

	; REQUIRES: asserts			; REQUIRES: asserts

	; RUN: opt -loop-vectorize -debug-only=loop-vectorize -force-vector-interleave=1 -force-vector-width=4 -prefer-inloop-reductions -enable-interleaved-mem-accesses=true -enable-masked-interleaved-mem-accesses -disable-output %s 2>&1 \| FileCheck %s			; RUN: opt -loop-vectorize -debug-only=loop-vectorize -force-vector-interleave=1 -force-vector-width=4 -prefer-inloop-reductions -enable-interleaved-mem-accesses=true -enable-masked-interleaved-mem-accesses -force-widen-divrem-via-safe-divisor=0 -disable-output %s 2>&1 \| FileCheck %s

	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"

	; Tests for printing VPlans.			; Tests for printing VPlans.

	define void @print_call_and_memory(i64 %n, float* noalias %y, float* noalias %x) nounwind uwtable {			define void @print_call_and_memory(i64 %n, float* noalias %y, float* noalias %x) nounwind uwtable {
	; CHECK-LABEL: Checking a loop in 'print_call_and_memory'			; CHECK-LABEL: Checking a loop in 'print_call_and_memory'
	; CHECK: VPlan 'Initial VPlan for VF={4},UF>=1' {			; CHECK: VPlan 'Initial VPlan for VF={4},UF>=1' {
	▲ Show 20 Lines • Show All 578 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[LV] Use safe-divisor lowering for fixed vectors if profitableClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 455327

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/test/Transforms/LoopVectorize/RISCV/divrem.ll

llvm/test/Transforms/LoopVectorize/first-order-recurrence-sink-replicate-region.ll

llvm/test/Transforms/LoopVectorize/first-order-recurrence.ll

llvm/test/Transforms/LoopVectorize/if-pred-non-void.ll

llvm/test/Transforms/LoopVectorize/induction.ll

llvm/test/Transforms/LoopVectorize/pr44488-predication.ll

llvm/test/Transforms/LoopVectorize/reduction-small-size.ll

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

[LV] Use safe-divisor lowering for fixed vectors if profitable
ClosedPublic