This is an archive of the discontinued LLVM Phabricator instance.

[IR] Make {extract,insert}element accept an index of any integer type.
ClosedPublic

Authored by Bigcheese on Apr 25 2014, 7:17 PM.


Details

Reviewers

Commits

rG1f10c5ea943b: [IR] Make {extract,insert}element accept an index of any integer type.
rL207801: [IR] Make {extract,insert}element accept an index of any integer type.

Summary

Given the following C code, LLVM currently generates suboptimal code for
x86-64:

__m128 bss4( const __m128 *ptr, size_t i, size_t j )
{
    float f = ptr[i][j];
    return (__m128) { f, f, f, f };
}

define <4 x float> @_Z4bss4PKDv4_fmm(<4 x float>* nocapture readonly %ptr, i64 %i, i64 %j) #0 {

%a1 = getelementptr inbounds <4 x float>* %ptr, i64 %i
%a2 = load <4 x float>* %a1, align 16, !tbaa !1
%a3 = trunc i64 %j to i32
%a4 = extractelement <4 x float> %a2, i32 %a3
%a5 = insertelement <4 x float> undef, float %a4, i32 0
%a6 = insertelement <4 x float> %a5, float %a4, i32 1
%a7 = insertelement <4 x float> %a6, float %a4, i32 2
%a8 = insertelement <4 x float> %a7, float %a4, i32 3
ret <4 x float> %a8

}

shlq    $4, %rsi
addq    %rdi, %rsi
movslq  %edx, %rax
vbroadcastss    (%rsi,%rax,4), %xmm0
retq

The movslq is unneeded, but is present because of the trunc to i32 and then
sext back to i64 that the backend adds for vbroadcastss.

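We can't remove it because doing so changes the meaning: the trunc discards
the upper 32 bits of %j, so the truncated and untruncated indices differ
whenever %j does not fit in an i32. The IR that clang generates is already
suboptimal. What clang really should emit is: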

%a4 = extractelement <4 x float> %a2, i64 %j

This patch makes that legal. A separate patch will teach clang to do it.
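
To see why the trunc/sext pair cannot simply be dropped, the round trip can be modeled in plain C++. This is an illustrative sketch, not code from the patch; `truncThenSext` and `roundTripIsIdentity` are hypothetical names, and the narrowing cast assumes two's-complement wraparound (guaranteed since C++20, and the behavior of mainstream compilers before that).

```cpp
#include <cassert>
#include <cstdint>

// Models `trunc i64 %j to i32` followed by a sign extension back to 64 bits
// (the movslq in the assembly above).
std::int64_t truncThenSext(std::uint64_t j) {
  // Keep the low 32 bits, then reinterpret them as a signed 32-bit value.
  std::int32_t low = static_cast<std::int32_t>(static_cast<std::uint32_t>(j));
  // Widening a signed value sign-extends it.
  return static_cast<std::int64_t>(low);
}

// True when the round trip returns the original index, i.e. when the
// trunc/sext pair could be dropped without changing the result.
bool roundTripIsIdentity(std::uint64_t j) {
  return static_cast<std::uint64_t>(truncThenSext(j)) == j;
}
```

An index of 3 survives the round trip, while an index of 2^32 + 1 comes back as 1, so the two forms only agree for values that fit in an i32.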

Diff Detail

Event Timeline

Bigcheese updated this revision to Diff 8860. Apr 25 2014, 7:17 PM

Bigcheese retitled this revision to [IR] Make {extract,insert}element accept an index of any integer type.

Bigcheese updated this object.

Bigcheese edited the test plan for this revision.

Bigcheese added a subscriber: Unknown Object (MLST).

Updated serialization.

I audited all uses of {Insert,Extract}ElementInst and nothing makes assumptions about the index type other than the bitcode reader/writer. There are lots of cases that generate a constant index, and these always use i32, but this is fine.
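
The compatibility rule the bitcode reader needs for constant expressions can be sketched in isolation: a legacy three-field record has an implicit i32 index, while a four-field record carries the index type explicitly. The snippet below is a standalone C++ model, not the LLVM code; `IndexOperand`, `decodeIndex`, and `kI32TypeId` are hypothetical names, and type and value ids are plain integers.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical type id standing in for i32 in this sketch.
constexpr std::uint64_t kI32TypeId = 0;

struct IndexOperand {
  std::uint64_t typeId;
  std::uint64_t valueId;
};

// Decodes the index operand of a CE_EXTRACTELT-style record.
//   legacy layout: [opty, opval, idxval]        -> index is implicitly i32
//   new layout:    [opty, opval, idxty, idxval] -> index type is explicit
// Returns true on success and false for a malformed record.
bool decodeIndex(const std::vector<std::uint64_t> &record, IndexOperand &out) {
  if (record.size() < 3)
    return false;
  if (record.size() == 4) {
    out = IndexOperand{record[2], record[3]};
    return true;
  }
  // Any other size falls back to the legacy interpretation.
  out = IndexOperand{kI32TypeId, record[2]};
  return true;
}
```

Keying the choice on the record length lets old bitcode keep loading without a new record code, at the cost of carrying the legacy branch until old files no longer need to be read.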

No changes are needed in SelectionDAG because the index is lowered with:

SDValue InIdx = DAG.getSExtOrTrunc(getValue(I.getOperand(1)),
                                   getCurSDLoc(), TLI.getVectorIdxTy());

So the rest of the backend already sees the type it is expecting.
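
The effect of that call can be modeled without LLVM. The sketch below is a simplified stand-in, assuming integer widths of 1 to 64 bits; `TypedInt`, `lowMask`, and `sextOrTrunc` are hypothetical names, and the real `getSExtOrTrunc` operates on SelectionDAG nodes rather than on plain integers.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a typed integer value: `bits` is the width of the
// type (1..64) and `value` holds the payload in its low `bits` bits.
struct TypedInt {
  unsigned bits;
  std::uint64_t value;
};

// Mask with the low `bits` bits set.
std::uint64_t lowMask(unsigned bits) {
  return bits >= 64 ? ~0ull : ((1ull << bits) - 1);
}

// Sign-extend when the destination is wider, truncate when it is narrower,
// and keep the value unchanged when the widths already match.
TypedInt sextOrTrunc(TypedInt in, unsigned dstBits) {
  assert(in.bits >= 1 && in.bits <= 64 && dstBits >= 1 && dstBits <= 64);
  std::uint64_t v = in.value & lowMask(in.bits);
  if (dstBits > in.bits) {
    bool negative = ((v >> (in.bits - 1)) & 1) != 0;
    if (negative)
      v |= ~lowMask(in.bits);  // fill the upper bits with the sign bit
  }
  return TypedInt{dstBits, v & lowMask(dstBits)};
}
```

Whatever width the IR index has, the value handed on has the one width the target expects, which is why an i64 index needs no further backend work.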

Bigcheese accepted this revision. May 1 2014, 3:22 PM

Bigcheese added a reviewer: Bigcheese.

This revision is now accepted and ready to land. May 1 2014, 3:22 PM

Bigcheese closed this revision. May 1 2014, 3:23 PM

Revision Contents

Path

Size

docs/

LangRef.rst

8 lines

lib/

Bitcode/

Reader/

BitcodeReader.cpp

34 lines

Writer/

BitcodeWriter.cpp

6 lines

IR/

Constants.cpp

6 lines

Instructions.cpp

4 lines

test/

CodeGen/

X86/

vec_splat.ll

15 lines

Feature/

instructions.ll

2 lines

Diff 8938

docs/LangRef.rst

	'``extractelement``' Instruction			'``extractelement``' Instruction
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	Syntax:			Syntax:
	"""""""			"""""""

	::			::

	<result> = extractelement <n x <ty>> <val>, i32 <idx> ; yields <ty>			<result> = extractelement <n x <ty>> <val>, <ty2> <idx> ; yields <ty>

	Overview:			Overview:
	"""""""""			"""""""""

	The '``extractelement``' instruction extracts a single scalar element			The '``extractelement``' instruction extracts a single scalar element
	from a vector at a specified index.			from a vector at a specified index.

	Arguments:			Arguments:
	""""""""""			""""""""""

	The first operand of an '``extractelement``' instruction is a value of			The first operand of an '``extractelement``' instruction is a value of
	:ref:`vector <t_vector>` type. The second operand is an index indicating			:ref:`vector <t_vector>` type. The second operand is an index indicating
	the position from which to extract the element. The index may be a			the position from which to extract the element. The index may be a
	variable.			variable of any integer type.

	Semantics:			Semantics:
	""""""""""			""""""""""

	The result is a scalar of the same type as the element type of ``val``.			The result is a scalar of the same type as the element type of ``val``.
	Its value is the value at position ``idx`` of ``val``. If ``idx``			Its value is the value at position ``idx`` of ``val``. If ``idx``
	exceeds the length of ``val``, the results are undefined.			exceeds the length of ``val``, the results are undefined.

	Show All 9 Lines
	'``insertelement``' Instruction			'``insertelement``' Instruction
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	Syntax:			Syntax:
	"""""""			"""""""

	::			::

	<result> = insertelement <n x <ty>> <val>, <ty> <elt>, i32 <idx> ; yields <n x <ty>>			<result> = insertelement <n x <ty>> <val>, <ty> <elt>, <ty2> <idx> ; yields <n x <ty>>

	Overview:			Overview:
	"""""""""			"""""""""

	The '``insertelement``' instruction inserts a scalar element into a			The '``insertelement``' instruction inserts a scalar element into a
	vector at a specified index.			vector at a specified index.

	Arguments:			Arguments:
	""""""""""			""""""""""

	The first operand of an '``insertelement``' instruction is a value of			The first operand of an '``insertelement``' instruction is a value of
	:ref:`vector <t_vector>` type. The second operand is a scalar value whose			:ref:`vector <t_vector>` type. The second operand is a scalar value whose
	type must equal the element type of the first operand. The third operand			type must equal the element type of the first operand. The third operand
	is an index indicating the position at which to insert the value. The			is an index indicating the position at which to insert the value. The
	index may be a variable.			index may be a variable of any integer type.

	Semantics:			Semantics:
	""""""""""			""""""""""

	The result is a vector of the same type as ``val``. Its element values			The result is a vector of the same type as ``val``. Its element values
	are those of ``val`` except at position ``idx``, where it gets the value			are those of ``val`` except at position ``idx``, where it gets the value
	``elt``. If ``idx`` exceeds the length of ``val``, the results are			``elt``. If ``idx`` exceeds the length of ``val``, the results are
	undefined.			undefined.

lib/Bitcode/Reader/BitcodeReader.cpp

case bitc::CST_CODE_CE_SELECT: { // CE_SELECT: [opval#, opval#, opval#]
VTy->getNumElements());		VTy->getNumElements());

V = ConstantExpr::getSelect(ValueList.getConstantFwdRef(Record[0],		V = ConstantExpr::getSelect(ValueList.getConstantFwdRef(Record[0],
SelectorTy),		SelectorTy),
ValueList.getConstantFwdRef(Record[1],CurTy),		ValueList.getConstantFwdRef(Record[1],CurTy),
ValueList.getConstantFwdRef(Record[2],CurTy));		ValueList.getConstantFwdRef(Record[2],CurTy));
break;		break;
}		}
case bitc::CST_CODE_CE_EXTRACTELT: { // CE_EXTRACTELT: [opty, opval, opval]		case bitc::CST_CODE_CE_EXTRACTELT
		: { // CE_EXTRACTELT: [opty, opval, opty, opval]
if (Record.size() < 3)		if (Record.size() < 3)
return Error(InvalidRecord);		return Error(InvalidRecord);
VectorType *OpTy =		VectorType *OpTy =
dyn_cast_or_null<VectorType>(getTypeByID(Record[0]));		dyn_cast_or_null<VectorType>(getTypeByID(Record[0]));
if (!OpTy)		if (!OpTy)
return Error(InvalidRecord);		return Error(InvalidRecord);
Constant *Op0 = ValueList.getConstantFwdRef(Record[1], OpTy);		Constant *Op0 = ValueList.getConstantFwdRef(Record[1], OpTy);
Constant *Op1 = ValueList.getConstantFwdRef(Record[2],		Constant *Op1 = nullptr;
Type::getInt32Ty(Context));		if (Record.size() == 4) {
		Type *IdxTy = getTypeByID(Record[2]);
		if (!IdxTy)
		return Error(InvalidRecord);
		Op1 = ValueList.getConstantFwdRef(Record[3], IdxTy);
		} else // TODO: Remove with llvm 4.0
		Op1 = ValueList.getConstantFwdRef(Record[2], Type::getInt32Ty(Context));
		if (!Op1)
		return Error(InvalidRecord);
V = ConstantExpr::getExtractElement(Op0, Op1);		V = ConstantExpr::getExtractElement(Op0, Op1);
break;		break;
}		}
case bitc::CST_CODE_CE_INSERTELT: { // CE_INSERTELT: [opval, opval, opval]		case bitc::CST_CODE_CE_INSERTELT
		: { // CE_INSERTELT: [opval, opval, opty, opval]
VectorType *OpTy = dyn_cast<VectorType>(CurTy);		VectorType *OpTy = dyn_cast<VectorType>(CurTy);
if (Record.size() < 3 || !OpTy)		if (Record.size() < 3 || !OpTy)
return Error(InvalidRecord);		return Error(InvalidRecord);
Constant *Op0 = ValueList.getConstantFwdRef(Record[0], OpTy);		Constant *Op0 = ValueList.getConstantFwdRef(Record[0], OpTy);
Constant *Op1 = ValueList.getConstantFwdRef(Record[1],		Constant *Op1 = ValueList.getConstantFwdRef(Record[1],
OpTy->getElementType());		OpTy->getElementType());
Constant *Op2 = ValueList.getConstantFwdRef(Record[2],		Constant *Op2 = nullptr;
Type::getInt32Ty(Context));		if (Record.size() == 4) {
		Type *IdxTy = getTypeByID(Record[2]);
		if (!IdxTy)
		return Error(InvalidRecord);
		Op2 = ValueList.getConstantFwdRef(Record[3], IdxTy);
		} else // TODO: Remove with llvm 4.0
		Op2 = ValueList.getConstantFwdRef(Record[2], Type::getInt32Ty(Context));
		if (!Op2)
		return Error(InvalidRecord);
V = ConstantExpr::getInsertElement(Op0, Op1, Op2);		V = ConstantExpr::getInsertElement(Op0, Op1, Op2);
break;		break;
}		}
case bitc::CST_CODE_CE_SHUFFLEVEC: { // CE_SHUFFLEVEC: [opval, opval, opval]		case bitc::CST_CODE_CE_SHUFFLEVEC: { // CE_SHUFFLEVEC: [opval, opval, opval]
VectorType *OpTy = dyn_cast<VectorType>(CurTy);		VectorType *OpTy = dyn_cast<VectorType>(CurTy);
if (Record.size() < 3 || !OpTy)		if (Record.size() < 3 || !OpTy)
return Error(InvalidRecord);		return Error(InvalidRecord);
Constant *Op0 = ValueList.getConstantFwdRef(Record[0], OpTy);		Constant *Op0 = ValueList.getConstantFwdRef(Record[0], OpTy);
▲ Show 20 Lines • Show All 1,004 Lines • ▼ Show 20 Lines	case bitc::FUNC_CODE_INST_VSELECT: {// VSELECT: [ty,opval,opval,predty,pred]
InstructionList.push_back(I);		InstructionList.push_back(I);
break;		break;
}		}

case bitc::FUNC_CODE_INST_EXTRACTELT: { // EXTRACTELT: [opty, opval, opval]		case bitc::FUNC_CODE_INST_EXTRACTELT: { // EXTRACTELT: [opty, opval, opval]
unsigned OpNum = 0;		unsigned OpNum = 0;
Value *Vec, *Idx;		Value *Vec, *Idx;
if (getValueTypePair(Record, OpNum, NextValueNo, Vec) ||		if (getValueTypePair(Record, OpNum, NextValueNo, Vec) ||
popValue(Record, OpNum, NextValueNo, Type::getInt32Ty(Context), Idx))		getValueTypePair(Record, OpNum, NextValueNo, Idx))
return Error(InvalidRecord);		return Error(InvalidRecord);
I = ExtractElementInst::Create(Vec, Idx);		I = ExtractElementInst::Create(Vec, Idx);
InstructionList.push_back(I);		InstructionList.push_back(I);
break;		break;
}		}

case bitc::FUNC_CODE_INST_INSERTELT: { // INSERTELT: [ty, opval,opval,opval]		case bitc::FUNC_CODE_INST_INSERTELT: { // INSERTELT: [ty, opval,opval,opval]
unsigned OpNum = 0;		unsigned OpNum = 0;
Value *Vec, *Elt, *Idx;		Value *Vec, *Elt, *Idx;
if (getValueTypePair(Record, OpNum, NextValueNo, Vec) ||		if (getValueTypePair(Record, OpNum, NextValueNo, Vec) ||
popValue(Record, OpNum, NextValueNo,		popValue(Record, OpNum, NextValueNo,
cast<VectorType>(Vec->getType())->getElementType(), Elt) ||		cast<VectorType>(Vec->getType())->getElementType(), Elt) ||
popValue(Record, OpNum, NextValueNo, Type::getInt32Ty(Context), Idx))		getValueTypePair(Record, OpNum, NextValueNo, Idx))
return Error(InvalidRecord);		return Error(InvalidRecord);
I = InsertElementInst::Create(Vec, Elt, Idx);		I = InsertElementInst::Create(Vec, Elt, Idx);
InstructionList.push_back(I);		InstructionList.push_back(I);
break;		break;
}		}

case bitc::FUNC_CODE_INST_SHUFFLEVEC: {// SHUFFLEVEC: [opval,ty,opval,opval]		case bitc::FUNC_CODE_INST_SHUFFLEVEC: {// SHUFFLEVEC: [opval,ty,opval,opval]
unsigned OpNum = 0;		unsigned OpNum = 0;

lib/Bitcode/Writer/BitcodeWriter.cpp

if (C->isNullValue()) {
Record.push_back(VE.getValueID(C->getOperand(0)));		Record.push_back(VE.getValueID(C->getOperand(0)));
Record.push_back(VE.getValueID(C->getOperand(1)));		Record.push_back(VE.getValueID(C->getOperand(1)));
Record.push_back(VE.getValueID(C->getOperand(2)));		Record.push_back(VE.getValueID(C->getOperand(2)));
break;		break;
case Instruction::ExtractElement:		case Instruction::ExtractElement:
Code = bitc::CST_CODE_CE_EXTRACTELT;		Code = bitc::CST_CODE_CE_EXTRACTELT;
Record.push_back(VE.getTypeID(C->getOperand(0)->getType()));		Record.push_back(VE.getTypeID(C->getOperand(0)->getType()));
Record.push_back(VE.getValueID(C->getOperand(0)));		Record.push_back(VE.getValueID(C->getOperand(0)));
		Record.push_back(VE.getTypeID(C->getOperand(1)->getType()));
Record.push_back(VE.getValueID(C->getOperand(1)));		Record.push_back(VE.getValueID(C->getOperand(1)));
break;		break;
case Instruction::InsertElement:		case Instruction::InsertElement:
Code = bitc::CST_CODE_CE_INSERTELT;		Code = bitc::CST_CODE_CE_INSERTELT;
Record.push_back(VE.getValueID(C->getOperand(0)));		Record.push_back(VE.getValueID(C->getOperand(0)));
Record.push_back(VE.getValueID(C->getOperand(1)));		Record.push_back(VE.getValueID(C->getOperand(1)));
		Record.push_back(VE.getTypeID(C->getOperand(2)->getType()));
Record.push_back(VE.getValueID(C->getOperand(2)));		Record.push_back(VE.getValueID(C->getOperand(2)));
break;		break;
case Instruction::ShuffleVector:		case Instruction::ShuffleVector:
// If the return type and argument types are the same, this is a		// If the return type and argument types are the same, this is a
// standard shufflevector instruction. If the types are different,		// standard shufflevector instruction. If the types are different,
// then the shuffle is widening or truncating the input vectors, and		// then the shuffle is widening or truncating the input vectors, and
// the argument type must also be encoded.		// the argument type must also be encoded.
if (C->getType() == C->getOperand(0)->getType()) {		if (C->getType() == C->getOperand(0)->getType()) {
▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines	case Instruction::Select:
Code = bitc::FUNC_CODE_INST_VSELECT;		Code = bitc::FUNC_CODE_INST_VSELECT;
PushValueAndType(I.getOperand(1), InstID, Vals, VE);		PushValueAndType(I.getOperand(1), InstID, Vals, VE);
pushValue(I.getOperand(2), InstID, Vals, VE);		pushValue(I.getOperand(2), InstID, Vals, VE);
PushValueAndType(I.getOperand(0), InstID, Vals, VE);		PushValueAndType(I.getOperand(0), InstID, Vals, VE);
break;		break;
case Instruction::ExtractElement:		case Instruction::ExtractElement:
Code = bitc::FUNC_CODE_INST_EXTRACTELT;		Code = bitc::FUNC_CODE_INST_EXTRACTELT;
PushValueAndType(I.getOperand(0), InstID, Vals, VE);		PushValueAndType(I.getOperand(0), InstID, Vals, VE);
pushValue(I.getOperand(1), InstID, Vals, VE);		PushValueAndType(I.getOperand(1), InstID, Vals, VE);
break;		break;
case Instruction::InsertElement:		case Instruction::InsertElement:
Code = bitc::FUNC_CODE_INST_INSERTELT;		Code = bitc::FUNC_CODE_INST_INSERTELT;
PushValueAndType(I.getOperand(0), InstID, Vals, VE);		PushValueAndType(I.getOperand(0), InstID, Vals, VE);
pushValue(I.getOperand(1), InstID, Vals, VE);		pushValue(I.getOperand(1), InstID, Vals, VE);
pushValue(I.getOperand(2), InstID, Vals, VE);		PushValueAndType(I.getOperand(2), InstID, Vals, VE);
break;		break;
case Instruction::ShuffleVector:		case Instruction::ShuffleVector:
Code = bitc::FUNC_CODE_INST_SHUFFLEVEC;		Code = bitc::FUNC_CODE_INST_SHUFFLEVEC;
PushValueAndType(I.getOperand(0), InstID, Vals, VE);		PushValueAndType(I.getOperand(0), InstID, Vals, VE);
pushValue(I.getOperand(1), InstID, Vals, VE);		pushValue(I.getOperand(1), InstID, Vals, VE);
pushValue(I.getOperand(2), InstID, Vals, VE);		pushValue(I.getOperand(2), InstID, Vals, VE);
break;		break;
case Instruction::ICmp:		case Instruction::ICmp:

lib/IR/Constants.cpp

ConstantExpr::getFCmp(unsigned short pred, Constant *LHS, Constant *RHS) {

LLVMContextImpl *pImpl = LHS->getType()->getContext().pImpl;		LLVMContextImpl *pImpl = LHS->getType()->getContext().pImpl;
return pImpl->ExprConstants.getOrCreate(ResultTy, Key);		return pImpl->ExprConstants.getOrCreate(ResultTy, Key);
}		}

Constant *ConstantExpr::getExtractElement(Constant *Val, Constant *Idx) {		Constant *ConstantExpr::getExtractElement(Constant *Val, Constant *Idx) {
assert(Val->getType()->isVectorTy() &&		assert(Val->getType()->isVectorTy() &&
"Tried to create extractelement operation on non-vector type!");		"Tried to create extractelement operation on non-vector type!");
assert(Idx->getType()->isIntegerTy(32) &&		assert(Idx->getType()->isIntegerTy() &&
"Extractelement index must be i32 type!");		"Extractelement index must be an integer type!");

if (Constant *FC = ConstantFoldExtractElementInstruction(Val, Idx))		if (Constant *FC = ConstantFoldExtractElementInstruction(Val, Idx))
return FC; // Fold a few common cases.		return FC; // Fold a few common cases.

// Look up the constant in the table first to ensure uniqueness		// Look up the constant in the table first to ensure uniqueness
Constant *ArgVec[] = { Val, Idx };		Constant *ArgVec[] = { Val, Idx };
const ExprMapKeyType Key(Instruction::ExtractElement, ArgVec);		const ExprMapKeyType Key(Instruction::ExtractElement, ArgVec);

LLVMContextImpl *pImpl = Val->getContext().pImpl;		LLVMContextImpl *pImpl = Val->getContext().pImpl;
Type *ReqTy = Val->getType()->getVectorElementType();		Type *ReqTy = Val->getType()->getVectorElementType();
return pImpl->ExprConstants.getOrCreate(ReqTy, Key);		return pImpl->ExprConstants.getOrCreate(ReqTy, Key);
}		}

Constant *ConstantExpr::getInsertElement(Constant *Val, Constant *Elt,		Constant *ConstantExpr::getInsertElement(Constant *Val, Constant *Elt,
Constant *Idx) {		Constant *Idx) {
assert(Val->getType()->isVectorTy() &&		assert(Val->getType()->isVectorTy() &&
"Tried to create insertelement operation on non-vector type!");		"Tried to create insertelement operation on non-vector type!");
assert(Elt->getType() == Val->getType()->getVectorElementType() &&		assert(Elt->getType() == Val->getType()->getVectorElementType() &&
"Insertelement types must match!");		"Insertelement types must match!");
assert(Idx->getType()->isIntegerTy(32) &&		assert(Idx->getType()->isIntegerTy() &&
"Insertelement index must be i32 type!");		"Insertelement index must be i32 type!");

if (Constant *FC = ConstantFoldInsertElementInstruction(Val, Elt, Idx))		if (Constant *FC = ConstantFoldInsertElementInstruction(Val, Elt, Idx))
return FC; // Fold a few common cases.		return FC; // Fold a few common cases.
// Look up the constant in the table first to ensure uniqueness		// Look up the constant in the table first to ensure uniqueness
Constant *ArgVec[] = { Val, Elt, Idx };		Constant *ArgVec[] = { Val, Elt, Idx };
const ExprMapKeyType Key(Instruction::InsertElement, ArgVec);		const ExprMapKeyType Key(Instruction::InsertElement, ArgVec);


lib/IR/Instructions.cpp

ExtractElementInst::ExtractElementInst(Value *Val, Value *Index,

Op<0>() = Val;		Op<0>() = Val;
Op<1>() = Index;		Op<1>() = Index;
setName(Name);		setName(Name);
}		}


bool ExtractElementInst::isValidOperands(const Value *Val, const Value *Index) {		bool ExtractElementInst::isValidOperands(const Value *Val, const Value *Index) {
if (!Val->getType()->isVectorTy() || !Index->getType()->isIntegerTy(32))		if (!Val->getType()->isVectorTy() || !Index->getType()->isIntegerTy())
return false;		return false;
return true;		return true;
}		}


//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// InsertElementInst Implementation		// InsertElementInst Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
Show All 30 Lines
bool InsertElementInst::isValidOperands(const Value *Vec, const Value *Elt,		bool InsertElementInst::isValidOperands(const Value *Vec, const Value *Elt,
const Value *Index) {		const Value *Index) {
if (!Vec->getType()->isVectorTy())		if (!Vec->getType()->isVectorTy())
return false; // First operand of insertelement must be vector type.		return false; // First operand of insertelement must be vector type.

if (Elt->getType() != cast<VectorType>(Vec->getType())->getElementType())		if (Elt->getType() != cast<VectorType>(Vec->getType())->getElementType())
return false;// Second operand of insertelement must be vector element type.		return false;// Second operand of insertelement must be vector element type.

if (!Index->getType()->isIntegerTy(32))		if (!Index->getType()->isIntegerTy())
return false; // Third operand of insertelement must be i32.		return false; // Third operand of insertelement must be i32.
return true;		return true;
}		}


//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ShuffleVectorInst Implementation		// ShuffleVectorInst Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

test/CodeGen/X86/vec_splat.ll

	; SSE3-LABEL: test_v2sd:			; SSE3-LABEL: test_v2sd:
	; SSE3: movddup			; SSE3: movddup
	}			}

	; Fold extract of a load into the load's address computation. This avoids spilling to the stack.			; Fold extract of a load into the load's address computation. This avoids spilling to the stack.
	define <4 x float> @load_extract_splat(<4 x float>* nocapture readonly %ptr, i64 %i, i64 %j) nounwind {			define <4 x float> @load_extract_splat(<4 x float>* nocapture readonly %ptr, i64 %i, i64 %j) nounwind {
	%1 = getelementptr inbounds <4 x float>* %ptr, i64 %i			%1 = getelementptr inbounds <4 x float>* %ptr, i64 %i
	%2 = load <4 x float>* %1, align 16			%2 = load <4 x float>* %1, align 16
	%3 = trunc i64 %j to i32			%3 = extractelement <4 x float> %2, i64 %j
	%4 = extractelement <4 x float> %2, i32 %3			%4 = insertelement <4 x float> undef, float %3, i32 0
	%5 = insertelement <4 x float> undef, float %4, i32 0			%5 = insertelement <4 x float> %4, float %3, i32 1
	%6 = insertelement <4 x float> %5, float %4, i32 1			%6 = insertelement <4 x float> %5, float %3, i32 2
	%7 = insertelement <4 x float> %6, float %4, i32 2			%7 = insertelement <4 x float> %6, float %3, i32 3
	%8 = insertelement <4 x float> %7, float %4, i32 3			ret <4 x float> %7
	ret <4 x float> %8

	; AVX-LABEL: load_extract_splat			; AVX-LABEL: load_extract_splat
	; AVX-NOT: rsp			; AVX-NOT: mov
	; AVX: vbroadcastss			; AVX: vbroadcastss
	}			}

test/Feature/instructions.ll

	; RUN: llvm-as < %s | llvm-dis > %t1.ll			; RUN: llvm-as < %s | llvm-dis > %t1.ll
	; RUN: llvm-as %t1.ll -o - | llvm-dis > %t2.ll			; RUN: llvm-as %t1.ll -o - | llvm-dis > %t2.ll
	; RUN: diff %t1.ll %t2.ll			; RUN: diff %t1.ll %t2.ll

	define i32 @test_extractelement(<4 x i32> %V) {			define i32 @test_extractelement(<4 x i32> %V) {
	%R = extractelement <4 x i32> %V, i32 1 ; <i32> [#uses=1]			%R = extractelement <4 x i32> %V, i32 1 ; <i32> [#uses=1]
				%S = extractelement <4 x i32> %V, i64 1 ; <i32> [#uses=0]
	ret i32 %R			ret i32 %R
	}			}

	define <4 x i32> @test_insertelement(<4 x i32> %V) {			define <4 x i32> @test_insertelement(<4 x i32> %V) {
	%R = insertelement <4 x i32> %V, i32 0, i32 0 ; <<4 x i32>> [#uses=1]			%R = insertelement <4 x i32> %V, i32 0, i32 0 ; <<4 x i32>> [#uses=1]
				%S = insertelement <4 x i32> %V, i32 0, i64 0 ; <<4 x i32>> [#uses=0]
	ret <4 x i32> %R			ret <4 x i32> %R
	}			}

	define <4 x i32> @test_shufflevector_u(<4 x i32> %V) {			define <4 x i32> @test_shufflevector_u(<4 x i32> %V) {
	%R = shufflevector <4 x i32> %V, <4 x i32> %V, <4 x i32> < i32 1, i32 undef, i32 7, i32 2 > ; <<4 x i32>> [#uses=1]			%R = shufflevector <4 x i32> %V, <4 x i32> %V, <4 x i32> < i32 1, i32 undef, i32 7, i32 2 > ; <<4 x i32>> [#uses=1]
	ret <4 x i32> %R			ret <4 x i32> %R
	}			}

	define <4 x float> @test_shufflevector_f(<4 x float> %V) {			define <4 x float> @test_shufflevector_f(<4 x float> %V) {
	%R = shufflevector <4 x float> %V, <4 x float> undef, <4 x i32> < i32 1, i32 undef, i32 7, i32 2 > ; <<4 x float>> [#uses=1]			%R = shufflevector <4 x float> %V, <4 x float> undef, <4 x i32> < i32 1, i32 undef, i32 7, i32 2 > ; <<4 x float>> [#uses=1]
	ret <4 x float> %R			ret <4 x float> %R
	}			}