This is an archive of the discontinued LLVM Phabricator instance.

[SLP]Fix PR57322: vectorize constant float stores.
ClosedPublic

Authored by ABataev on Aug 26 2022, 10:45 AM.

Details

Reviewers

RKSimon
vdmitrie
vzakhari

Commits

rGbeacf9bd9e1a: [SLP]Fix PR57322: vectorize constant float stores.

Summary

Stores of constant floats must be vectorized; improve the analysis in the SLP
vectorizer for stores.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ABataev created this revision. Aug 26 2022, 10:45 AM

Herald added a project: Restricted Project. Aug 26 2022, 10:45 AM

Herald added subscribers: vporpo, hiraditya.

ABataev requested review of this revision. Aug 26 2022, 10:45 AM

Herald added a project: Restricted Project. Aug 26 2022, 10:45 AM

Herald added a subscriber: pcwang-thead.

Harbormaster completed remote builds in B183641: Diff 455956. Aug 26 2022, 12:06 PM

vdmitrie added inline comments. Aug 26 2022, 6:02 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp

5819

I'm not sure I understand why we need this extra argument.
Is it for restraining purposes?
We can tell whether we are dealing with FP or integer from the type of the operand we are evaluating.
Besides, the operand type across VL has to be homogeneous, so it makes sense to multi-version the method for better clarity.
And you don't need to enumerate VL if you process the first element early.

Here is how the code could look:

const auto *I0 = cast<Instruction>(VL[0]);
Type *OpTy = I0->getOperand(OpIdx)->getType();
if (OpTy->isFloatingPointTy()) {
  for (Value *V : VL) {
    const auto *Inst = cast<Instruction>(V);
    assert(Inst->getOpcode() == I0->getOpcode() && "Expected same opcode");
    if (!isa<ConstantFP>(Inst->getOperand(OpIdx)))
      return {TTI::OK_AnyValue, TTI::OP_None};
  }
  return {TTI::OK_NonUniformConstantValue, TTI::OP_None};
}

// If all operands are exactly the same ConstantInt then set the
// operand kind to OK_UniformConstantValue.
// If instead not all operands are constants, then set the operand kind
// to OK_AnyValue. If all operands are constants but not the same,
// then set the operand kind to OK_NonUniformConstantValue.
ConstantInt *CInt0 = dyn_cast<ConstantInt>(I0->getOperand(OpIdx));
if (!CInt0)
  return {TTI::OK_AnyValue, TTI::OP_None};

TTI::OperandValueKind VK = TTI::OK_UniformConstantValue;
TTI::OperandValueProperties VP = TTI::OP_PowerOf2;
for (Value *V : VL.drop_front()) {
  const auto *Inst = cast<Instruction>(V);
  assert(Inst->getOpcode() == I0->getOpcode() && "Expected same opcode");
  auto *CInt = dyn_cast<ConstantInt>(Inst->getOperand(OpIdx));
  if (!CInt)
    return {TTI::OK_AnyValue, TTI::OP_None};
  if (VP == TTI::OP_PowerOf2 && !CInt->getValue().isPowerOf2())
    VP = TTI::OP_None;
  if (CInt0 != CInt)
    VK = TTI::OK_NonUniformConstantValue;
}
return {VK, VP};

vdmitrie added inline comments. Aug 26 2022, 6:10 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5819	Minor correction for the above (VP Initialization): TTI::OperandValueProperties VP = CInt0->getValue().isPowerOf2() ? TTI::OP_PowerOf2 : TTI::OP_None;
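To make the effect of that correction concrete, here is a minimal standalone sketch of the integer flow. It does not use LLVM headers: `OpKind`, `OpProps`, `OpInfo`, `isPowerOf2` and `classifyIntOperands` are hypothetical stand-ins for the TTI enums and `getOperandInfo`, and each lane is modeled as an optional 64-bit constant (empty meaning "not a constant"). Values are compared directly here, where the suggested code compares `ConstantInt` pointers.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical stand-ins for TTI::OperandValueKind and
// TTI::OperandValueProperties; not the real LLVM types.
enum class OpKind { AnyValue, UniformConstant, NonUniformConstant };
enum class OpProps { None, PowerOf2 };

struct OpInfo {
  OpKind Kind;
  OpProps Props;
};

// True when exactly one bit is set.
inline bool isPowerOf2(std::uint64_t X) { return X != 0 && (X & (X - 1)) == 0; }

// Each lane is an integer constant, or std::nullopt when it is not constant.
inline OpInfo
classifyIntOperands(const std::vector<std::optional<std::uint64_t>> &Lanes) {
  assert(!Lanes.empty() && "Expected at least one lane");
  const std::optional<std::uint64_t> &First = Lanes.front();
  if (!First)
    return {OpKind::AnyValue, OpProps::None};

  OpKind VK = OpKind::UniformConstant;
  // Corrected initialization: the first lane is skipped by the loop below,
  // so its power-of-two property has to be checked here.
  OpProps VP = isPowerOf2(*First) ? OpProps::PowerOf2 : OpProps::None;
  for (std::size_t I = 1; I < Lanes.size(); ++I) {
    const std::optional<std::uint64_t> &Lane = Lanes[I];
    if (!Lane)
      return {OpKind::AnyValue, OpProps::None};
    if (VP == OpProps::PowerOf2 && !isPowerOf2(*Lane))
      VP = OpProps::None;
    if (*Lane != *First)
      VK = OpKind::NonUniformConstant;
  }
  return {VK, VP};
}
```

With the uncorrected initialization, lanes such as {3, 4} would be reported as power-of-two because only the second lane is inspected by the loop; with the correction they are not.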

ABataev added inline comments. Aug 26 2022, 6:27 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5819	We only need to perform this analysis for stores, not for fadd, fneg, etc. I could add a check for the kind of instruction instead, but I thought it would be better to provide an explicit flag and pass it explicitly for the required instructions.

vdmitrie added inline comments. Aug 26 2022, 6:39 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5819	> only for stores but not for fadd, fneg
	Is that because you are targeting stores only in this patch? How does it affect other operations? It is not yet clear to me why you would like to avoid that analysis for them.

ABataev added inline comments. Aug 26 2022, 7:06 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5819	Yes, I just don't have enough time/resources for other ops analysis.

vdmitrie added inline comments. Aug 26 2022, 7:24 PM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5819	That's okay for an incremental step. Then please don't declare a default value for the argument, and please add a TODO comment at the call site where it is 'false' saying that the impact of enabling the analysis there is yet to be determined. Also, renaming the argument to enableFP or similar would make its restraining purpose more explicit.
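The request can be illustrated with a small standalone sketch, assuming a toy model rather than the real SLPVectorizer code. `OpKind`, `FPLanes`, `classifyFPOperands`, `operandKindForStore` and `operandKindForArithmetic` are hypothetical names; lanes are optional doubles, with an empty optional meaning "not a constant".

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical stand-in for TTI::OperandValueKind; not the real LLVM type.
enum class OpKind { AnyValue, UniformConstant, NonUniformConstant };

// Each lane is a floating-point constant, or std::nullopt when not constant.
using FPLanes = std::vector<std::optional<double>>;

// EnableFP intentionally has no default value, so every caller has to state
// whether the floating-point analysis applies to it.
inline OpKind classifyFPOperands(const FPLanes &Lanes, bool EnableFP) {
  assert(!Lanes.empty() && "Expected at least one lane");
  if (!EnableFP)
    return OpKind::AnyValue;
  for (const std::optional<double> &Lane : Lanes)
    if (!Lane)
      return OpKind::AnyValue;
  // Conservative answer, as in the patch under review: all-constant FP lanes
  // are reported as non-uniform even when they happen to be equal.
  return OpKind::NonUniformConstant;
}

// Hypothetical call site for stores: the analysis is enabled.
inline OpKind operandKindForStore(const FPLanes &StoredValues) {
  return classifyFPOperands(StoredValues, /*EnableFP=*/true);
}

// Hypothetical call site for arithmetic: the analysis stays disabled.
inline OpKind operandKindForArithmetic(const FPLanes &Operands) {
  // TODO: impact of enabling the analysis here is yet to be determined.
  return classifyFPOperands(Operands, /*EnableFP=*/false);
}
```

Because the parameter has no default, adding a new call site without deciding on the flag fails to compile, which is the point of the request.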

ABataev mentioned this in D126885: [SLP]Cost for a constant buildvector. Aug 29 2022, 5:12 AM

Address comments

Harbormaster completed remote builds in B183912: Diff 456322. Aug 29 2022, 7:26 AM

Looks acceptable.

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5854	As I said earlier, we should never have a situation of mixed types, so I do not quite understand why you are still packing the two flows into a single loop. This does not save anything but makes the code error-prone. Just look: the condition at 5840 should probably be if (EnableFP && CFlt). You also do not use CFlt to access the object, only for a null pointer check. Technically, yes, this code will work. But I would appreciate it if you addressed the maintenance concern as well.

This revision is now accepted and ready to land. Aug 29 2022, 9:21 AM

ABataev added inline comments. Aug 29 2022, 9:42 AM

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
5854	Sure, just missed it.

Closed by commit rGbeacf9bd9e1a: [SLP]Fix PR57322: vectorize constant float stores. (authored by ABataev). Aug 29 2022, 11:04 AM

This revision was automatically updated to reflect the committed changes.

ABataev added a commit: rGbeacf9bd9e1a: [SLP]Fix PR57322: vectorize constant float stores.

Adding this only for the store case seems undesirable both long term as it's clearly not the right long term goal and short term as it adds complexity for no good reason. I'm not actively requesting you don't land this, but I do strongly want to encourage you to either a) do the analysis for the non-stores and refresh this patch, or b) commit to doing a follow up in the near future.

In D132750#3756195, @reames wrote:

> Adding this only for the store case seems undesirable both long term as it's clearly not the right long term goal and short term as it adds complexity for no good reason. I'm not actively requesting you don't land this, but I do strongly want to encourage you to either a) do the analysis for the non-stores and refresh this patch, or b) commit to doing a follow up in the near future.

I can try to add OK_FPConstantValue value instead and change analysis using this value for now.

In D132750#3756281, @ABataev wrote:

> In D132750#3756195, @reames wrote:
>
> > Adding this only for the store case seems undesirable both long term as it's clearly not the right long term goal and short term as it adds complexity for no good reason. I'm not actively requesting you don't land this, but I do strongly want to encourage you to either a) do the analysis for the non-stores and refresh this patch, or b) commit to doing a follow up in the near future.
>
> I can try to add OK_FPConstantValue value instead and change analysis using this value for now.

Huh? Why do this?

I meant that we clearly want uniform constant handling to include float vectors in all cases long term. That's what I was asking you to follow up with.

In D132750#3756310, @reames wrote:

> In D132750#3756281, @ABataev wrote:
>
> > In D132750#3756195, @reames wrote:
> >
> > > Adding this only for the store case seems undesirable both long term as it's clearly not the right long term goal and short term as it adds complexity for no good reason. I'm not actively requesting you don't land this, but I do strongly want to encourage you to either a) do the analysis for the non-stores and refresh this patch, or b) commit to doing a follow up in the near future.
> >
> > I can try to add OK_FPConstantValue value instead and change analysis using this value for now.
>
> Huh? Why do this?
>
> I meant that we clearly want uniform constant handling to include float vectors in all cases long term. That's what I was asking you to follow up with.

I see. I can send a followup patch with uniform floats analysis but I cannot test it thoroughly.
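For readers wondering what a uniform floats analysis could look like, here is a rough standalone sketch. It is not the follow-up patch, which is not part of this page. `OpKind`, `bitsOf` and `classifyFPOperandsUniform` are hypothetical names, lanes are modeled as optional doubles, and comparing bit patterns (so that 0.0 and -0.0 count as different constants) is a design assumption of the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <optional>
#include <vector>

// Hypothetical stand-in for TTI::OperandValueKind; not the real LLVM type.
enum class OpKind { AnyValue, UniformConstant, NonUniformConstant };

// Returns the raw bit pattern of a double so that constants are compared
// bitwise rather than with operator==.
inline std::uint64_t bitsOf(double D) {
  static_assert(sizeof(double) == sizeof(std::uint64_t),
                "Sketch assumes a 64-bit double");
  std::uint64_t Bits;
  std::memcpy(&Bits, &D, sizeof(Bits));
  return Bits;
}

// Each lane is a floating-point constant, or std::nullopt when not constant.
inline OpKind
classifyFPOperandsUniform(const std::vector<std::optional<double>> &Lanes) {
  assert(!Lanes.empty() && "Expected at least one lane");
  if (!Lanes.front())
    return OpKind::AnyValue;
  const std::uint64_t FirstBits = bitsOf(*Lanes.front());
  OpKind VK = OpKind::UniformConstant;
  for (std::size_t I = 1; I < Lanes.size(); ++I) {
    if (!Lanes[I])
      return OpKind::AnyValue;
    if (bitsOf(*Lanes[I]) != FirstBits)
      VK = OpKind::NonUniformConstant;
  }
  return VK;
}
```

Under these assumptions, lanes {1.0, 1.0} classify as uniform, while {0.0, 1.0} and {0.0, -0.0} classify as non-uniform.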

Revision Contents

Path	Size
llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp	49 lines
llvm/test/Transforms/SLPVectorizer/X86/stores_constant_float.ll	4 lines

Diff 456410

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp


Show First 20 Lines • Show All 2,119 Lines • ▼ Show 20 Lines	private:
}		}

/// Checks if all users of \p I are the part of the vectorization tree.		/// Checks if all users of \p I are the part of the vectorization tree.
bool areAllUsersVectorized(Instruction *I,		bool areAllUsersVectorized(Instruction *I,
ArrayRef<Value *> VectorizedVals) const;		ArrayRef<Value *> VectorizedVals) const;

/// Return information about the vector formed for the specified index		/// Return information about the vector formed for the specified index
/// of a vector of (the same) instruction.		/// of a vector of (the same) instruction.
		/// \param EnableFP - If true, check for float constants.
TargetTransformInfo::OperandValueInfo		TargetTransformInfo::OperandValueInfo
getOperandInfo(ArrayRef<Value *> VL, unsigned OpIdx);		getOperandInfo(ArrayRef<Value *> VL, unsigned OpIdx, bool EnableFP);

/// \returns the cost of the vectorizable entry.		/// \returns the cost of the vectorizable entry.
InstructionCost getEntryCost(const TreeEntry *E,		InstructionCost getEntryCost(const TreeEntry *E,
ArrayRef<Value *> VectorizedVals);		ArrayRef<Value *> VectorizedVals);

/// This is the recursive part of buildTree.		/// This is the recursive part of buildTree.
void buildTree_rec(ArrayRef<Value *> Roots, unsigned Depth,		void buildTree_rec(ArrayRef<Value *> Roots, unsigned Depth,
const EdgeInfo &EI);		const EdgeInfo &EI);
▲ Show 20 Lines • Show All 3,671 Lines • ▼ Show 20 Lines	if (P0 == AltP0Swapped)
(I != MainOp &&		(I != MainOp &&
!areCompatibleCmpOps(CI0->getOperand(0), CI0->getOperand(1),		!areCompatibleCmpOps(CI0->getOperand(0), CI0->getOperand(1),
CI->getOperand(0), CI->getOperand(1)));		CI->getOperand(0), CI->getOperand(1)));
return AltP0 == CurrentPred || AltP0Swapped == CurrentPred;		return AltP0 == CurrentPred || AltP0Swapped == CurrentPred;
}		}
return I->getOpcode() == AltOp->getOpcode();		return I->getOpcode() == AltOp->getOpcode();
}		}

TTI::OperandValueInfo BoUpSLP::getOperandInfo(ArrayRef<Value *> VL, unsigned OpIdx) {		TTI::OperandValueInfo BoUpSLP::getOperandInfo(ArrayRef<Value *> VL,
		unsigned OpIdx, bool EnableFP) {
TTI::OperandValueKind VK = TTI::OK_UniformConstantValue;		TTI::OperandValueKind VK = TTI::OK_UniformConstantValue;
TTI::OperandValueProperties VP = TTI::OP_PowerOf2;		TTI::OperandValueProperties VP = TTI::OP_PowerOf2;

		// If all float operands are constants then set the operand kind to
		// OK_NonUniformConstantValue. Otherwise, return OK_AnyValue.
		const auto *I0 = cast<Instruction>(VL.front());
		if (I0->getOperand(OpIdx)->getType()->isFloatingPointTy()) {
		if (!EnableFP || any_of(VL, [OpIdx, I0](Value *V) {
		const auto *Inst = cast<Instruction>(V);
		assert(Inst->getOpcode() == I0->getOpcode() &&
		"Expected same opcode");
		return !isConstant(Inst->getOperand(OpIdx));
		}))
		return {TTI::OK_AnyValue, TTI::OP_None};
		return {TTI::OK_NonUniformConstantValue, TTI::OP_None};
		}

// If all operands are exactly the same ConstantInt then set the		// If all operands are exactly the same ConstantInt then set the
// operand kind to OK_UniformConstantValue.		// operand kind to OK_UniformConstantValue.
// If instead not all operands are constants, then set the operand kind		// If instead not all operands are constants, then set the operand kind
// to OK_AnyValue. If all operands are constants but not the same,		// to OK_AnyValue. If all operands are constants but not the same,
// then set the operand kind to OK_NonUniformConstantValue.		// then set the operand kind to OK_NonUniformConstantValue.
ConstantInt *CInt0 = nullptr;		ConstantInt *CInt0 = nullptr;
for (unsigned i = 0, e = VL.size(); i < e; ++i) {		for (Value *V : VL) {
const Instruction *I = cast<Instruction>(VL[i]);		const auto *Inst = cast<Instruction>(V);
assert(I->getOpcode() == cast<Instruction>(VL[0])->getOpcode());		assert(Inst->getOpcode() == cast<Instruction>(VL[0])->getOpcode() &&
ConstantInt *CInt = dyn_cast<ConstantInt>(I->getOperand(OpIdx));		"Expected same opcode");
		auto *CInt = dyn_cast<ConstantInt>(Inst->getOperand(OpIdx));
if (!CInt) {		if (!CInt) {
VK = TTI::OK_AnyValue;		VK = TTI::OK_AnyValue;
VP = TTI::OP_None;		VP = TTI::OP_None;
break;		break;
}		}
if (VP == TTI::OP_PowerOf2 &&		if (VP == TTI::OP_PowerOf2 && !CInt->getValue().isPowerOf2())
!CInt->getValue().isPowerOf2())
VP = TTI::OP_None;		VP = TTI::OP_None;
if (i == 0) {		if (!CInt0) {
CInt0 = CInt;		CInt0 = CInt;
continue;		continue;
}		}
if (CInt0 != CInt)		if (CInt0 != CInt)
VK = TTI::OK_NonUniformConstantValue;		VK = TTI::OK_NonUniformConstantValue;
}		}
return {VK, VP};		return {VK, VP};
}		}
▲ Show 20 Lines • Show All 561 Lines • ▼ Show 20 Lines	switch (ShuffleOrOp) {
case Instruction::And:		case Instruction::And:
case Instruction::Or:		case Instruction::Or:
case Instruction::Xor: {		case Instruction::Xor: {
TTI::OperandValueInfo Op1Info = {TTI::OK_AnyValue, TTI::OP_None};		TTI::OperandValueInfo Op1Info = {TTI::OK_AnyValue, TTI::OP_None};

// Certain instructions can be cheaper to vectorize if they have a		// Certain instructions can be cheaper to vectorize if they have a
// constant second vector operand.		// constant second vector operand.
const unsigned OpIdx = isa<BinaryOperator>(VL0) ? 1 : 0;		const unsigned OpIdx = isa<BinaryOperator>(VL0) ? 1 : 0;
auto Op2Info = getOperandInfo(VL, OpIdx);		// TODO: impact of enabling the analysis there is yet to be determined
		auto Op2Info = getOperandInfo(VL, OpIdx, /*EnableFP=*/false);

SmallVector<const Value *, 4> Operands(VL0->operand_values());		SmallVector<const Value *, 4> Operands(VL0->operand_values());
InstructionCost ScalarEltCost =		InstructionCost ScalarEltCost =
TTI->getArithmeticInstrCost(E->getOpcode(), ScalarTy, CostKind,		TTI->getArithmeticInstrCost(E->getOpcode(), ScalarTy, CostKind,
Op1Info, Op2Info,		Op1Info, Op2Info,
Operands, VL0);		Operands, VL0);
if (NeedToShuffleReuses) {		if (NeedToShuffleReuses) {
CommonCost -= (EntryVF - VL.size()) * ScalarEltCost;		CommonCost -= (EntryVF - VL.size()) * ScalarEltCost;
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	case Instruction::Load: {
return CommonCost + VecLdCost - ScalarLdCost;		return CommonCost + VecLdCost - ScalarLdCost;
}		}
case Instruction::Store: {		case Instruction::Store: {
// We know that we can merge the stores. Calculate the cost.		// We know that we can merge the stores. Calculate the cost.
bool IsReorder = !E->ReorderIndices.empty();		bool IsReorder = !E->ReorderIndices.empty();
auto *SI =		auto *SI =
cast<StoreInst>(IsReorder ? VL[E->ReorderIndices.front()] : VL0);		cast<StoreInst>(IsReorder ? VL[E->ReorderIndices.front()] : VL0);
Align Alignment = SI->getAlign();		Align Alignment = SI->getAlign();
TTI::OperandValueInfo OpInfo = TTI::getOperandInfo(SI->getOperand(0));		TTI::OperandValueInfo OpInfo = getOperandInfo(VL, 0, /*EnableFP=*/true);
InstructionCost ScalarEltCost = TTI->getMemoryOpCost(		InstructionCost ScalarEltCost = TTI->getMemoryOpCost(
Instruction::Store, ScalarTy, Alignment, 0, CostKind, OpInfo, VL0);		Instruction::Store, ScalarTy, Alignment, 0, CostKind, OpInfo, VL0);
InstructionCost ScalarStCost = VecTy->getNumElements() * ScalarEltCost;		InstructionCost ScalarStCost = VecTy->getNumElements() * ScalarEltCost;
TTI::OperandValueKind OpVK = TTI::OK_AnyValue;		TTI::OperandValueKind OpVK = TTI::OK_AnyValue;
if (all_of(E->Scalars,		if (OpInfo.isConstant())
[](Value *V) {
return isConstant(cast<Instruction>(V)->getOperand(0));
}) &&
any_of(E->Scalars, [](Value *V) {
Value *Op = cast<Instruction>(V)->getOperand(0);
return !isa<UndefValue>(Op) && !cast<Constant>(Op)->isZeroValue();
}))
OpVK = TTI::OK_NonUniformConstantValue;		OpVK = TTI::OK_NonUniformConstantValue;
InstructionCost VecStCost = TTI->getMemoryOpCost(		InstructionCost VecStCost = TTI->getMemoryOpCost(
Instruction::Store, VecTy, Alignment, 0, CostKind,		Instruction::Store, VecTy, Alignment, 0, CostKind,
{OpVK, TTI::OP_None}, VL0);		{OpVK, TTI::OP_None}, VL0);
LLVM_DEBUG(dumpTreeCosts(E, CommonCost, VecStCost, ScalarStCost));		LLVM_DEBUG(dumpTreeCosts(E, CommonCost, VecStCost, ScalarStCost));
return CommonCost + VecStCost - ScalarStCost;		return CommonCost + VecStCost - ScalarStCost;
}		}
case Instruction::Call: {		case Instruction::Call: {
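The Store case above returns CommonCost + VecStCost - ScalarStCost, with ScalarStCost computed as the element count times the per-element scalar cost. The following standalone sketch restates that arithmetic only; `Cost` and `storeEntryCostDelta` are hypothetical names, the numbers in the note below are made up and are not real target costs, and reading a negative result as "the vector form is estimated to be cheaper" is the interpretation assumed here.

```cpp
#include <cassert>

// Signed so that a negative delta can be represented.
using Cost = long long;

// Restates the arithmetic of the Store case: the scalar cost is the
// per-element store cost multiplied by the number of elements, and the
// result is the vector cost relative to that scalar cost.
inline Cost storeEntryCostDelta(Cost CommonCost, Cost ScalarEltCost,
                                unsigned NumElements, Cost VecStCost) {
  assert(NumElements > 0 && "Expected at least one element");
  const Cost ScalarStCost = static_cast<Cost>(NumElements) * ScalarEltCost;
  return CommonCost + VecStCost - ScalarStCost;
}
```

For instance, with a made-up scalar store cost of 1 per element, two elements, and a made-up vector store cost of 1, the delta is -1; raising the vector store cost to 3 gives +1. This is why the operand kind passed to getMemoryOpCost matters: it can change VecStCost and ScalarEltCost and therefore the sign of the result.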

llvm/test/Transforms/SLPVectorizer/X86/stores_constant_float.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -mtriple=x86_64-unknown-linux-gnu -slp-vectorizer -mcpu=skylake -S -o - %s | FileCheck %s			; RUN: opt -mtriple=x86_64-unknown-linux-gnu -slp-vectorizer -mcpu=skylake -S -o - %s | FileCheck %s

	define void @foo() {			define void @foo() {
	; CHECK-LABEL: @foo(			; CHECK-LABEL: @foo(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[C:%.*]] = alloca { double, double }, align 8			; CHECK-NEXT: [[C:%.*]] = alloca { double, double }, align 8
	; CHECK-NEXT: [[C_IMAGP:%.*]] = getelementptr inbounds { double, double }, ptr [[C]], i64 0, i32 1			; CHECK-NEXT: store <2 x double> <double 0.000000e+00, double 1.000000e+00>, ptr [[C]], align 8
	; CHECK-NEXT: store double 0.000000e+00, ptr [[C]], align 8
	; CHECK-NEXT: store double 1.000000e+00, ptr [[C_IMAGP]], align 8
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%c = alloca { double, double }, align 8			%c = alloca { double, double }, align 8
	%c.imagp = getelementptr inbounds { double, double }, ptr %c, i64 0, i32 1			%c.imagp = getelementptr inbounds { double, double }, ptr %c, i64 0, i32 1
	store double 0.000000e+00, ptr %c, align 8			store double 0.000000e+00, ptr %c, align 8
	store double 1.000000e+00, ptr %c.imagp, align 8			store double 1.000000e+00, ptr %c.imagp, align 8
	ret void			ret void
	}			}
