This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contents

llvm/include/llvm/IR/
    Constants.h
    GetElementPtrTypeIterator.h
    Instructions.h
    PatternMatch.h
llvm/lib/IR/
    AsmWriter.cpp (1/1)
    AutoUpgrade.cpp
    ConstantFold.cpp (3/4)
    Constants.cpp (2/2)
    DataLayout.cpp
    Function.cpp (1/1)
    IRBuilder.cpp
    Instructions.cpp (2/3)
    Verifier.cpp (1/2)

Differential D78130

[SVE] Fixup calls to VectorType::getNumElements() in IR
Abandoned · Public

Authored by ctetreau on Apr 14 2020, 10:11 AM.

Details

Reviewers

efriedma

Summary

getNumElements() is going to be moved into FixedVectorType. In
preparation for this, fix up all calls to VectorType::getNumElements().
If getElementCount() can be used with no change in functionality, use
it. Otherwise, cast to FixedVectorType and assume that this is correct.
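
As a rough illustration of the distinction the summary draws, the sketch below models an element count as a (minimum, scalable) pair. It is a toy model, not LLVM's code: `ElementCount` only mirrors the `Min` and `Scalable` fields that appear in this revision's AsmWriter.cpp hunk, and `ToyVectorType` and `fixedNumElements` are hypothetical names.

```cpp
#include <cassert>

// Toy stand-in for llvm::ElementCount; only the two fields that appear in
// this revision's AsmWriter.cpp hunk (Min, Scalable) are modelled.
struct ElementCount {
  unsigned Min;  // known minimum number of elements
  bool Scalable; // true for <vscale x Min x Ty>
};

// Hypothetical miniature vector type, not the LLVM class.
struct ToyVectorType {
  ElementCount EC;
  ElementCount getElementCount() const { return EC; }
};

// Second strategy from the summary: the caller needs a concrete element
// count, so it asserts that the vector is fixed-width before reading it.
inline unsigned fixedNumElements(const ToyVectorType &VT) {
  assert(!VT.EC.Scalable && "element count is only a lower bound");
  return VT.EC.Min;
}
```

Code that only forwards or compares the count can use `getElementCount()` for both kinds of vector. Code that needs a concrete number goes through the asserting helper, which plays the role the summary assigns to casting to FixedVectorType.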

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ctetreau created this revision. Apr 14 2020, 10:11 AM

Herald added a reviewer: efriedma. Apr 14 2020, 10:12 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, psnobl, rkruppe and 2 others.

ctetreau added a child revision: D78127: [SVE] Mark VectorType::getNumElements() deprecated. Apr 14 2020, 10:19 AM

Harbormaster failed remote builds in B53182: Diff 257397! Apr 14 2020, 10:44 AM

My concern with switching from getNumElements() to getElementCount() is that we're essentially adding new functionality. That new functionality should have regression test coverage. If we don't do this as we go, we're never going to get reasonable regression test coverage for scalable vectors.

I get where you're coming from, but I think a lot of these cases are actually equivalent. As an exercise, I can speak to each one in this patch individually, and maybe we can work out which need tests and which are NFC.

ctetreau marked 7 inline comments as done. Apr 15 2020, 11:52 AM

ctetreau added inline comments.

llvm/lib/IR/AsmWriter.cpp
660	This code is generic for all vector types, so getNumElements() was being used correctly here. We can just use the ElementCount directly. Maybe it would be good to get the ElementCount prior to the `if (isa<ScalableVectorType>(PTy))`, and just do `if (EC.Scalable)`?
llvm/lib/IR/ConstantFold.cpp
576–577	Suppose we have some vectors:

    %a = <4 x i1> undef
    %b = <4 x i8> undef
    %c = <vscale x 4 x i16> undef
    %d = <vscale x 4 x i32> undef

This check means to ask "do these vectors have the same number of elements?" We should have this truth table (since sameSize(l, r) = sameSize(r, l), I'll omit duplicates):

    l | r | sameSize(l, r)
    ----------------------
    a | a | true
    a | b | true
    a | c | false
    a | d | false
    b | b | true
    b | c | false
    b | d | false
    c | c | true
    c | d | true
    d | d | true

With the old implementation of `sameSize(l, r) = l->getNumElements() == r->getNumElements()`, we would get true for all combinations. This is a bug. The getElementCount() version is correct.

The version on the right will reject vectors that were previously accepted, which will likely change the behavior of LLVM. While this represents a gap in test coverage in LLVM, I don't know how we can reasonably add a few tests to plug this hole. It really seems to me that scalable vectors are largely ignored by the test suite. The fix is "all tests that concern vectors should have corresponding cases that test scalable vectors".

Given plugging this hole in test coverage is hard, my questions are:

1. Is there a reasonable test I can add to make this situation less bad?
2. Given that the policy is that "new features need a corresponding test case", is it worth not fixing this bug?

I don't know the answer to 1. For 2, I think the bug should be fixed. I think the ship already sailed, because vscale went in without test coverage.
2293	Suppose we have some vectors:

    %a = <4 x i1> undef
    %d = <vscale x 4 x i32> undef

Here, we get a new vector with the same number of elements as some other vector. Using the implementation on the left, the type returned by `sameShapeWithType(i8, a)` is `<4 x i8>`, and the type returned by `sameShapeWithType(i8, d)` is also `<4 x i8>`. Again, this is a bug. The version on the right (which is just a helper for `VectorType::get(OrigGEPTy, VT->getElementCount())`) will do the correct thing for any vector on the right-hand side. The only case it doesn't handle is the case where you specifically do want to change the vector type. For this case, the old ElementCount overload still exists.

This has the same problems as the `sameSize(l, r)` case above: the gap in test coverage is already huge, so I have the same questions:

1. Is there a reasonable test I can add to make this situation less bad?
2. Given that the policy is that "new features need a corresponding test case", is it worth not fixing this bug?
llvm/lib/IR/Constants.cpp
1903–1904	I should have made this one call getElementCount().
1917–1918	I should have made this one call getElementCount().
llvm/lib/IR/Function.cpp
1331	This is basically the same as the `==` case. I think `<`, `<=`, `>=`, and `>` should probably still call `getNumElements()`, as `{2, true} > {3, false}`, while possible to define in C++, is kind of nonsensical. Maybe those operators can assert that the scalability is the same? (I think this is probably a bad idea.)
llvm/lib/IR/Instructions.cpp
1977–1979	By the way, I'm doing this to preserve the original behavior. The code used to enter this branch for scalable vectors. I don't know enough about this code to judge what it should actually do for scalable vectors so I'm making it loudly fail rather than silently be buggy.
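
The truth table in the ConstantFold.cpp 576–577 comment above can be reproduced with a small stand-alone model. This is a sketch under assumptions: `ElementCount` is a toy struct rather than the LLVM class, and `sameSizeOld` and `sameSizeNew` are hypothetical names for the two versions of the check.

```cpp
#include <cassert>

// Toy element count: a minimum plus a "multiplied by vscale" flag.
struct ElementCount {
  unsigned Min;
  bool Scalable;
};

inline bool operator==(const ElementCount &L, const ElementCount &R) {
  return L.Min == R.Min && L.Scalable == R.Scalable;
}

// Old check: compares only the minimum element count, so <4 x i1> and
// <vscale x 4 x i16> are treated as the same size.
inline bool sameSizeOld(ElementCount L, ElementCount R) {
  return L.Min == R.Min;
}

// New check: the scalable flags must agree as well.
inline bool sameSizeNew(ElementCount L, ElementCount R) { return L == R; }

// The four vectors from the comment, reduced to their element counts.
constexpr ElementCount A{4, false}; // <4 x i1>
constexpr ElementCount B{4, false}; // <4 x i8>
constexpr ElementCount C{4, true};  // <vscale x 4 x i16>
constexpr ElementCount D{4, true};  // <vscale x 4 x i32>
```

With these definitions, `sameSizeOld` returns true for every pair, while `sameSizeNew` returns false exactly for the fixed/scalable pairs (a, c), (a, d), (b, c), and (b, d).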

We could say certain classes of change are "trivial", I guess, in the sense that they obviously have no effect on fixed-width types. Like changing X->getNumElements() == Y->getNumElements() to use getElementCount(), or VectorType::get(X, Y->getNumElements()); -> VectorType::get(X, Y);. And then you can make the argument that you're committing these changes as a cleanup, and any effect on scalable vectors is a side-effect.
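
The second kind of change mentioned above, passing a vector's whole shape instead of its `getNumElements()`, might look like this in miniature. `ToyVectorType`, `withNumElements`, `withSameShape`, and `toString` are hypothetical stand-ins, not LLVM APIs; the printed form follows the `<vscale x N x Ty>` syntax used elsewhere in this review.

```cpp
#include <cassert>
#include <string>

struct ElementCount {
  unsigned Min;
  bool Scalable;
};

// Hypothetical miniature vector type: element type name plus element count.
struct ToyVectorType {
  std::string ElemTy;
  ElementCount EC;
};

// Old pattern: only the minimum count is forwarded, so the result is always
// a fixed-width vector, even when Shape is scalable.
inline ToyVectorType withNumElements(const std::string &ElemTy,
                                     const ToyVectorType &Shape) {
  return {ElemTy, {Shape.EC.Min, false}};
}

// New pattern: the whole element count is forwarded, so scalability survives.
inline ToyVectorType withSameShape(const std::string &ElemTy,
                                   const ToyVectorType &Shape) {
  return {ElemTy, Shape.EC};
}

// Render the type in IR-like syntax.
inline std::string toString(const ToyVectorType &VT) {
  std::string S = "<";
  if (VT.EC.Scalable)
    S += "vscale x ";
  S += std::to_string(VT.EC.Min) + " x " + VT.ElemTy + ">";
  return S;
}
```

For fixed-width inputs the two helpers agree, which is the sense in which such a change has no effect on fixed-width types; they differ only when the shape vector is scalable.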

Or you could try to make the argument that we already have test coverage for everything for non-scalable types, and that should be enough to allow making these sweeping changes, since scalable types are still not production-ready anyway.

I can sympathize with both of those arguments, and I understand constructing testcases will slow you down substantially... but ultimately, if we're going to say scalable vectors are supported, we need to hit reasonable regression test coverage at some point. Having regression tests is an important part of making sure that developers aren't breaking scalable vector codepaths in the future. Most people working on LLVM are not going to be routinely using scalable vectors for a while; the only way they'll know if something breaks is regression tests. And the best time to add that coverage is when you're modifying the code anyway.

Maybe we can discuss this more in the meeting tomorrow.

(If you're having trouble figuring out how to exercise certain codepaths, feel free to ask.)

llvm/lib/IR/ConstantFold.cpp
576–577	This is unreachable for scalable vectors because ConstantVector and ConstantDataVector always have FixedVectorType. So it's hard to argue this specific case; you might as well just cast to FixedVectorType. (On a side-note, if you haven't already, those values have a getType() method which returns VectorType, which should be fixed.)
llvm/lib/IR/Instructions.cpp
1977–1979	ConstantDataSequential is never scalable. (The only valid scalable constants are zero, undef, and ConstantExprs.)
llvm/lib/IR/Verifier.cpp
4908	`cast<FixedVectorType>(OperandT)->getElementCount()`?
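
The side note about `getType()` can be illustrated with a toy hierarchy, assuming (as the comment states) that such constants only ever have a fixed-width vector type. All class names below are hypothetical; the pattern is a derived class shadowing `getType()` with a narrower return type so that callers need no cast of their own.

```cpp
#include <cassert>

// Hypothetical base vector type: may be fixed-width or scalable.
struct ToyVectorType {
  bool Scalable;
  unsigned MinElts;
};

// Fixed-width refinement: only here is an exact element count meaningful.
struct ToyFixedVectorType : ToyVectorType {
  explicit ToyFixedVectorType(unsigned N) : ToyVectorType{false, N} {}
  unsigned getNumElements() const { return MinElts; }
};

// Generic value: only knows that it has some vector type.
struct ToyValue {
  const ToyVectorType *Ty;
  const ToyVectorType *getType() const { return Ty; }
};

// A constant kind that can only be constructed from a fixed-width type.
class ToyConstantDataVector : public ToyValue {
public:
  explicit ToyConstantDataVector(const ToyFixedVectorType *Ty) : ToyValue{Ty} {}

  // Shadows ToyValue::getType() with the narrower type. The downcast is safe
  // because the constructor only accepts ToyFixedVectorType.
  const ToyFixedVectorType *getType() const {
    return static_cast<const ToyFixedVectorType *>(ToyValue::getType());
  }
};
```

A caller holding a `ToyConstantDataVector` can then write `CDV.getType()->getNumElements()` directly, which is the kind of simplification the Constants.h hunk in this revision aims for.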

ctetreau marked 2 inline comments as done. Apr 15 2020, 12:57 PM

ctetreau added inline comments.

llvm/lib/IR/ConstantFold.cpp
576–577	If ConstantVector and ConstantDataVector always have FixedVectorType, then they should probably return that directly. I can make that change. If you are aware of any other such cases, please let me know. As a general point, I am making a huge number of changes to many disparate parts of the codebase, most of which I am not familiar with. As such, I'm going to miss these sorts of "obvious" simplifications. Please point them out to me if you see them. As for the overall argument, it's a defense of this class of change. For specific instances, if it's never valid for the vectors to be scalable vectors, then it makes sense to not make the change.
llvm/lib/IR/Verifier.cpp
4908	oops

ctetreau marked an inline comment as done. Apr 15 2020, 1:39 PM

ctetreau added inline comments.

llvm/lib/IR/Instructions.cpp
1977–1979	ugh. Seems I misread the code. What I meant was, for cases like:

    if (auto *VTy = dyn_cast<VectorType>(Ty)) {
      auto *FVTy = cast<FixedVectorType>(VTy); // immediately cast my VectorType unconditionally
      ... // stuff
    }

... that I'm doing it on purpose because trying to dyn_cast to FixedVectorType would be a behavior change.
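
A minimal sketch of the difference being described, using toy helpers in the spirit of LLVM's `cast<>` (asserts on mismatch) and `dyn_cast<>` (returns null on mismatch). The helper and function names are hypothetical, and the toy `ToyVectorType` is not the real class.

```cpp
#include <cassert>

// Hypothetical miniature vector type.
struct ToyVectorType {
  bool Scalable;
  unsigned MinElts;
};

// Checked downcast in the spirit of cast<>: a scalable vector here is a
// programming error and trips the assertion.
inline const ToyVectorType *castToFixed(const ToyVectorType *VTy) {
  assert(VTy && !VTy->Scalable && "expected a fixed-width vector");
  return VTy;
}

// Conditional downcast in the spirit of dyn_cast<>: a scalable vector simply
// yields nullptr.
inline const ToyVectorType *dynCastToFixed(const ToyVectorType *VTy) {
  return (VTy && !VTy->Scalable) ? VTy : nullptr;
}

// dyn_cast-style guard: scalable vectors silently skip the branch, which
// would change which path such inputs take.
inline unsigned countOrZeroSilently(const ToyVectorType *VTy) {
  if (const ToyVectorType *FVTy = dynCastToFixed(VTy))
    return FVTy->MinElts;
  return 0;
}

// Pattern from the comment: the branch is still entered for every vector,
// and the unconditional cast fails loudly for scalable ones.
inline unsigned countOrFailLoudly(const ToyVectorType *VTy) {
  if (VTy) {
    const ToyVectorType *FVTy = castToFixed(VTy);
    return FVTy->MinElts;
  }
  return 0;
}
```

Both functions agree on fixed-width input. On scalable input the first quietly returns 0, while the second aborts in builds with assertions enabled, which matches the "loudly fail rather than silently be buggy" intent stated earlier in the thread.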

rebase, address code review issues, catch stragglers

Harbormaster failed remote builds in B53426: Diff 257835! Apr 15 2020, 1:48 PM

catch straggler

Harbormaster failed remote builds in B53607: Diff 258120! Apr 16 2020, 12:15 PM

Hi @ctetreau, @david-arm started looking through our downstream code-base to pull out some patches where we've fixed cases like the ones in this patch and for which we may already have tests.
Since there's likely to be overlap with this patch, are there any cases you've already got tests for, or are actively working on? If you have any other suggestions on how to share the effort, just let us know!

In D78130#1993179, @sdesmalen wrote:

Hi @ctetreau, @david-arm started looking through our downstream code-base to pull out some patches where we've fixed cases like the ones in this patch and for which we may already have tests.
Since there's likely to be overlap with this patch, are there any cases you've already got tests for, or are actively working on? If you have any other suggestions on how to share the effort, just let us know!

I've not yet started writing tests. I've completed the refactor on my machine. I'm doing a cleanup pass now, and plan to start pushing the patches up to phabricator soon. After all the patches are up and reviewers added, I planned to evaluate what changes need tests, and what changes aren't actually worth doing.

Once that's done and the list compiled, I was going to work on getting the tests written.

I'd also like to mention that the changes in this patch are pretty representative of the changes I'm making in other modules.

Hi @efriedma, I've been looking at some of our downstream tests that we could upstream in this area, and quite a few of them depend on using shufflevector instructions to splat vectors, so we can't really upstream those yet. I think you're working on a splat vector implementation, right? I just wondered how it was going? Thanks!

Splats specifically should be fine. It's the other shuffles that are missing; I'll try to get back to that soon.

rebase

Harbormaster failed remote builds in B54164: Diff 259120! Apr 21 2020, 4:49 PM

rebase

Harbormaster failed remote builds in B54261: Diff 259301! Apr 22 2020, 9:13 AM

rebase

Harbormaster failed remote builds in B54435: Diff 259624! Apr 23 2020, 11:22 AM

Based on the discussion in this thread, I'm going to submit a less ambitious version of this patch.

ctetreau removed a child revision: D78127: [SVE] Mark VectorType::getNumElements() deprecated. Jun 17 2020, 3:23 PM

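Revision Contents

Path                                               Size
llvm/include/llvm/IR/Constants.h                   4 lines
llvm/include/llvm/IR/GetElementPtrTypeIterator.h   2 lines
llvm/include/llvm/IR/Instructions.h                7 lines
llvm/include/llvm/IR/PatternMatch.h                6 lines
llvm/lib/IR/AsmWriter.cpp                          2 lines
llvm/lib/IR/AutoUpgrade.cpp                        87 lines
llvm/lib/IR/ConstantFold.cpp                       46 lines
llvm/lib/IR/Constants.cpp                          31 lines
llvm/lib/IR/DataLayout.cpp                         10 lines
llvm/lib/IR/Function.cpp                           9 lines
llvm/lib/IR/IRBuilder.cpp                          14 lines
llvm/lib/IR/Instructions.cpp                       37 lines
llvm/lib/IR/Verifier.cpp                           66 lines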

Diff 259624

llvm/include/llvm/IR/Constants.h

@@ 797 lines hidden @@ public:
   bool isSplat() const;

   /// If this is a splat constant, meaning that all of the elements have the
   /// same value, return that value. Otherwise return NULL.
   Constant *getSplatValue() const;

   /// Specialize the getType() method to always return a VectorType,
   /// which reduces the amount of casting needed in parts of the compiler.
-  inline VectorType *getType() const {
-    return cast<VectorType>(Value::getType());
+  inline FixedVectorType *getType() const {
+    return cast<FixedVectorType>(Value::getType());
   }

   /// Methods for support type inquiry through isa, cast, and dyn_cast:
   static bool classof(const Value *V) {
     return V->getValueID() == ConstantDataVectorVal;
   }
 };

@@ 522 lines hidden @@

llvm/include/llvm/IR/GetElementPtrTypeIterator.h

@@ 77 lines hidden @@ generic_gep_type_iterator& operator++() { // Preincrement
       if (auto *ATy = dyn_cast<ArrayType>(Ty)) {
         CurTy = ATy->getElementType();
         NumElements = ATy->getNumElements();
       } else if (auto *VTy = dyn_cast<VectorType>(Ty)) {
         CurTy = VTy->getElementType();
         if (isa<ScalableVectorType>(VTy))
           NumElements = Unbounded;
         else
-          NumElements = VTy->getNumElements();
+          NumElements = cast<FixedVectorType>(VTy)->getNumElements();
       } else
         CurTy = dyn_cast<StructType>(Ty);
       ++OpIt;
       return *this;
     }

     generic_gep_type_iterator operator++(int) { // Postincrement
       generic_gep_type_iterator tmp = *this; ++*this; return tmp;
@@ 75 lines hidden @@

llvm/include/llvm/IR/Instructions.h

@@ 1,986 lines hidden @@ public:
   ArrayRef<int> getShuffleMask() const { return ShuffleMask; }

   /// Return true if this shuffle returns a vector with a different number of
   /// elements than its source vectors.
   /// Examples: shufflevector <4 x n> A, <4 x n> B, <1,2,3>
   ///           shufflevector <4 x n> A, <4 x n> B, <1,2,3,4,5>
   bool changesLength() const {
     unsigned NumSourceElts =
-        cast<VectorType>(Op<0>()->getType())->getNumElements();
+        cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
     unsigned NumMaskElts = ShuffleMask.size();
     return NumSourceElts != NumMaskElts;
   }

   /// Return true if this shuffle returns a vector with a greater number of
   /// elements than its source vectors.
   /// Example: shufflevector <2 x n> A, <2 x n> B, <1,2,3>
   bool increasesLength() const {
     unsigned NumSourceElts =
-        cast<VectorType>(Op<0>()->getType())->getNumElements();
+        cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
     unsigned NumMaskElts = ShuffleMask.size();
     return NumSourceElts < NumMaskElts;
   }

   /// Return true if this shuffle mask chooses elements from exactly one source
   /// vector.
   /// Example: <7,5,undef,7>
   /// This assumes that vector operands are the same length as the mask.
@@ 176 lines hidden @@ static bool isExtractSubvectorMask(const Constant *Mask, int NumSrcElts,
     assert(Mask->getType()->isVectorTy() && "Shuffle needs vector constant.");
     SmallVector<int, 16> MaskAsInts;
     getShuffleMask(Mask, MaskAsInts);
     return isExtractSubvectorMask(MaskAsInts, NumSrcElts, Index);
   }

   /// Return true if this shuffle mask is an extract subvector mask.
   bool isExtractSubvectorMask(int &Index) const {
-    int NumSrcElts = cast<VectorType>(Op<0>()->getType())->getNumElements();
+    int NumSrcElts =
+        cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
     return isExtractSubvectorMask(ShuffleMask, NumSrcElts, Index);
   }

   /// Change values in a shuffle permute mask assuming the two vector operands
   /// of length InVecNumElts have swapped position.
   static void commuteShuffleMask(MutableArrayRef<int> Mask,
                                  unsigned InVecNumElts) {
     for (int &Idx : Mask) {
@@ 3,021 lines hidden @@

llvm/include/llvm/IR/PatternMatch.h

@@ 269 lines hidden @@ template <typename ITy> bool match(ITy *V) {
     if (const auto *CI = dyn_cast<ConstantInt>(V))
       return this->isValue(CI->getValue());
     if (V->getType()->isVectorTy()) {
       if (const auto *C = dyn_cast<Constant>(V)) {
         if (const auto *CI = dyn_cast_or_null<ConstantInt>(C->getSplatValue()))
           return this->isValue(CI->getValue());

         // Non-splat vector constant: check each element for a match.
-        unsigned NumElts = cast<VectorType>(V->getType())->getNumElements();
+        unsigned NumElts =
+            cast<FixedVectorType>(V->getType())->getNumElements();
         assert(NumElts != 0 && "Constant vector with no elements?");
         bool HasNonUndefElements = false;
         for (unsigned i = 0; i != NumElts; ++i) {
           Constant *Elt = C->getAggregateElement(i);
           if (!Elt)
             return false;
           if (isa<UndefValue>(Elt))
             continue;
@@ 42 lines hidden @@ template <typename ITy> bool match(ITy *V) {
     if (const auto *CF = dyn_cast<ConstantFP>(V))
       return this->isValue(CF->getValueAPF());
     if (V->getType()->isVectorTy()) {
       if (const auto *C = dyn_cast<Constant>(V)) {
         if (const auto *CF = dyn_cast_or_null<ConstantFP>(C->getSplatValue()))
           return this->isValue(CF->getValueAPF());

         // Non-splat vector constant: check each element for a match.
-        unsigned NumElts = cast<VectorType>(V->getType())->getNumElements();
+        unsigned NumElts =
+            cast<FixedVectorType>(V->getType())->getNumElements();
         assert(NumElts != 0 && "Constant vector with no elements?");
         bool HasNonUndefElements = false;
         for (unsigned i = 0; i != NumElts; ++i) {
           Constant *Elt = C->getAggregateElement(i);
           if (!Elt)
             return false;
           if (isa<UndefValue>(Elt))
             continue;
@@ 1,848 lines hidden @@

llvm/lib/IR/AsmWriter.cpp

@@ 651 lines hidden @@ void TypePrinting::print(Type *Ty, raw_ostream &OS) {
   }
   case Type::FixedVectorTyID:
   case Type::ScalableVectorTyID: {
     VectorType *PTy = cast<VectorType>(Ty);
     ElementCount EC = PTy->getElementCount();
     OS << "<";
     if (EC.Scalable)
       OS << "vscale x ";
     OS << EC.Min << " x ";
     print(PTy->getElementType(), OS);
     OS << '>';
     return;
   }
   }
   llvm_unreachable("Invalid TypeID");
 }

@@ 832 lines hidden @@ if (const ConstantStruct *CS = dyn_cast<ConstantStruct>(CV)) {

     Out << '}';
     if (CS->getType()->isPacked())
       Out << '>';
     return;
   }

   if (isa<ConstantVector>(CV) || isa<ConstantDataVector>(CV)) {
-    auto *CVVTy = cast<VectorType>(CV->getType());
+    auto *CVVTy = cast<FixedVectorType>(CV->getType());
     Type *ETy = CVVTy->getElementType();
     Out << '<';
     TypePrinter.print(ETy, Out);
     Out << ' ';
     WriteAsOperandInternal(Out, CV->getAggregateElement(0U), &TypePrinter,
                            Machine, Context);
     for (unsigned i = 1, e = CVVTy->getNumElements(); i != e; ++i) {
       Out << ", ";
@@ 3,037 lines hidden @@

llvm/lib/IR/AutoUpgrade.cpp

@@ 893 lines hidden @@ GlobalVariable *llvm::UpgradeGlobalVariable(GlobalVariable *GV) {
   return new GlobalVariable(NewInit->getType(), false, GV->getLinkage(),
                             NewInit, GV->getName());
 }

 // Handles upgrading SSE2/AVX2/AVX512BW PSLLDQ intrinsics by converting them
 // to byte shuffles.
 static Value *UpgradeX86PSLLDQIntrinsics(IRBuilder<> &Builder,
                                          Value *Op, unsigned Shift) {
-  auto *ResultTy = cast<VectorType>(Op->getType());
+  auto *ResultTy = cast<FixedVectorType>(Op->getType());
   unsigned NumElts = ResultTy->getNumElements() * 8;

   // Bitcast from a 64-bit element type to a byte element type.
   Type *VecTy = VectorType::get(Builder.getInt8Ty(), NumElts);
   Op = Builder.CreateBitCast(Op, VecTy, "cast");

   // We'll be shuffling in zeroes.
   Value *Res = Constant::getNullValue(VecTy);
@@ 17 lines hidden @@ static Value *UpgradeX86PSLLDQIntrinsics(IRBuilder<> &Builder,
   // Bitcast back to a 64-bit element type.
   return Builder.CreateBitCast(Res, ResultTy, "cast");
 }

 // Handles upgrading SSE2/AVX2/AVX512BW PSRLDQ intrinsics by converting them
 // to byte shuffles.
 static Value *UpgradeX86PSRLDQIntrinsics(IRBuilder<> &Builder, Value *Op,
                                          unsigned Shift) {
-  auto *ResultTy = cast<VectorType>(Op->getType());
+  auto *ResultTy = cast<FixedVectorType>(Op->getType());
   unsigned NumElts = ResultTy->getNumElements() * 8;

   // Bitcast from a 64-bit element type to a byte element type.
   Type *VecTy = VectorType::get(Builder.getInt8Ty(), NumElts);
   Op = Builder.CreateBitCast(Op, VecTy, "cast");

   // We'll be shuffling in zeroes.
   Value *Res = Constant::getNullValue(VecTy);
@@ 41 lines hidden @@
 static Value *EmitX86Select(IRBuilder<> &Builder, Value *Mask,
                             Value *Op0, Value *Op1) {
   // If the mask is all ones just emit the first operation.
   if (const auto *C = dyn_cast<Constant>(Mask))
     if (C->isAllOnesValue())
       return Op0;

   Mask = getX86MaskVec(Builder, Mask,
-                       cast<VectorType>(Op0->getType())->getNumElements());
+                       cast<FixedVectorType>(Op0->getType())->getNumElements());
   return Builder.CreateSelect(Mask, Op0, Op1);
 }

 static Value *EmitX86ScalarSelect(IRBuilder<> &Builder, Value *Mask,
                                   Value *Op0, Value *Op1) {
   // If the mask is all ones just emit the first operation.
   if (const auto *C = dyn_cast<Constant>(Mask))
     if (C->isAllOnesValue())
@@ 11 lines hidden @@
 // PALIGNR handles large immediates by shifting while VALIGN masks the immediate
 // so we need to handle both cases. VALIGN also doesn't have 128-bit lanes.
 static Value *UpgradeX86ALIGNIntrinsics(IRBuilder<> &Builder, Value *Op0,
                                         Value *Op1, Value *Shift,
                                         Value *Passthru, Value *Mask,
                                         bool IsVALIGN) {
   unsigned ShiftVal = cast<llvm::ConstantInt>(Shift)->getZExtValue();

-  unsigned NumElts = cast<VectorType>(Op0->getType())->getNumElements();
+  unsigned NumElts = cast<FixedVectorType>(Op0->getType())->getNumElements();
   assert((IsVALIGN || NumElts % 16 == 0) && "Illegal NumElts for PALIGNR!");
   assert((!IsVALIGN || NumElts <= 16) && "NumElts too large for VALIGN!");
   assert(isPowerOf2_32(NumElts) && "NumElts not a power of 2!");

   // Mask the immediate for VALIGN.
   if (IsVALIGN)
     ShiftVal &= (NumElts - 1);

@@ 114 lines hidden @@ static Value *upgradeX86Rotate(IRBuilder<> &Builder, CallInst &CI,
   Type *Ty = CI.getType();
   Value *Src = CI.getArgOperand(0);
   Value *Amt = CI.getArgOperand(1);

   // Amount may be scalar immediate, in which case create a splat vector.
   // Funnel shifts amounts are treated as modulo and types are all power-of-2 so
   // we only care about the lowest log2 bits anyway.
   if (Amt->getType() != Ty) {
-    unsigned NumElts = cast<VectorType>(Ty)->getNumElements();
+    unsigned NumElts = cast<FixedVectorType>(Ty)->getNumElements();
     Amt = Builder.CreateIntCast(Amt, Ty->getScalarType(), false);
     Amt = Builder.CreateVectorSplat(NumElts, Amt);
   }

   Intrinsic::ID IID = IsRotateRight ? Intrinsic::fshr : Intrinsic::fshl;
   Function *Intrin = Intrinsic::getDeclaration(CI.getModule(), IID, Ty);
   Value *Res = Builder.CreateCall(Intrin, {Src, Src, Amt});

@@ 53 lines hidden @@ static Value *upgradeX86ConcatShift(IRBuilder<> &Builder, CallInst &CI,

   if (IsShiftRight)
     std::swap(Op0, Op1);

   // Amount may be scalar immediate, in which case create a splat vector.
   // Funnel shifts amounts are treated as modulo and types are all power-of-2 so
   // we only care about the lowest log2 bits anyway.
   if (Amt->getType() != Ty) {
-    unsigned NumElts = cast<VectorType>(Ty)->getNumElements();
+    unsigned NumElts = cast<FixedVectorType>(Ty)->getNumElements();
     Amt = Builder.CreateIntCast(Amt, Ty->getScalarType(), false);
     Amt = Builder.CreateVectorSplat(NumElts, Amt);
   }

   Intrinsic::ID IID = IsShiftRight ? Intrinsic::fshr : Intrinsic::fshl;
   Function *Intrin = Intrinsic::getDeclaration(CI.getModule(), IID, Ty);
   Value *Res = Builder.CreateCall(Intrin, {Op0, Op1, Amt});

[20 lines not shown]	const Align Alignment =
: Align(1);		: Align(1);

// If the mask is all ones just emit a regular store.		// If the mask is all ones just emit a regular store.
if (const auto *C = dyn_cast<Constant>(Mask))		if (const auto *C = dyn_cast<Constant>(Mask))
if (C->isAllOnesValue())		if (C->isAllOnesValue())
return Builder.CreateAlignedStore(Data, Ptr, Alignment);		return Builder.CreateAlignedStore(Data, Ptr, Alignment);

// Convert the mask from an integer type to a vector of i1.		// Convert the mask from an integer type to a vector of i1.
unsigned NumElts = cast<VectorType>(Data->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(Data->getType())->getNumElements();
Mask = getX86MaskVec(Builder, Mask, NumElts);		Mask = getX86MaskVec(Builder, Mask, NumElts);
return Builder.CreateMaskedStore(Data, Ptr, Alignment, Mask);		return Builder.CreateMaskedStore(Data, Ptr, Alignment, Mask);
}		}

static Value *UpgradeMaskedLoad(IRBuilder<> &Builder,		static Value *UpgradeMaskedLoad(IRBuilder<> &Builder,
Value *Ptr, Value *Passthru, Value *Mask,		Value *Ptr, Value *Passthru, Value *Mask,
bool Aligned) {		bool Aligned) {
Type *ValTy = Passthru->getType();		Type *ValTy = Passthru->getType();
// Cast the pointer to the right type.		// Cast the pointer to the right type.
Ptr = Builder.CreateBitCast(Ptr, llvm::PointerType::getUnqual(ValTy));		Ptr = Builder.CreateBitCast(Ptr, llvm::PointerType::getUnqual(ValTy));
const Align Alignment =		const Align Alignment =
Aligned		Aligned
? Align(Passthru->getType()->getPrimitiveSizeInBits().getFixedSize() /		? Align(Passthru->getType()->getPrimitiveSizeInBits().getFixedSize() /
8)		8)
: Align(1);		: Align(1);

// If the mask is all ones just emit a regular store.		// If the mask is all ones just emit a regular store.
if (const auto *C = dyn_cast<Constant>(Mask))		if (const auto *C = dyn_cast<Constant>(Mask))
if (C->isAllOnesValue())		if (C->isAllOnesValue())
return Builder.CreateAlignedLoad(ValTy, Ptr, Alignment);		return Builder.CreateAlignedLoad(ValTy, Ptr, Alignment);

// Convert the mask from an integer type to a vector of i1.		// Convert the mask from an integer type to a vector of i1.
unsigned NumElts = cast<VectorType>(Passthru->getType())->getNumElements();		unsigned NumElts =
		cast<FixedVectorType>(Passthru->getType())->getNumElements();
Mask = getX86MaskVec(Builder, Mask, NumElts);		Mask = getX86MaskVec(Builder, Mask, NumElts);
return Builder.CreateMaskedLoad(Ptr, Alignment, Mask, Passthru);		return Builder.CreateMaskedLoad(Ptr, Alignment, Mask, Passthru);
}		}

static Value *upgradeAbs(IRBuilder<> &Builder, CallInst &CI) {		static Value *upgradeAbs(IRBuilder<> &Builder, CallInst &CI) {
Value *Op0 = CI.getArgOperand(0);		Value *Op0 = CI.getArgOperand(0);
llvm::Type *Ty = Op0->getType();		llvm::Type *Ty = Op0->getType();
Value *Zero = llvm::Constant::getNullValue(Ty);		Value *Zero = llvm::Constant::getNullValue(Ty);
[47 lines not shown]	if (CI.getNumArgOperands() == 4)
Res = EmitX86Select(Builder, CI.getArgOperand(3), Res, CI.getArgOperand(2));		Res = EmitX86Select(Builder, CI.getArgOperand(3), Res, CI.getArgOperand(2));

return Res;		return Res;
}		}

// Applying mask on vector of i1's and make sure result is at least 8 bits wide.		// Applying mask on vector of i1's and make sure result is at least 8 bits wide.
static Value *ApplyX86MaskOn1BitsVec(IRBuilder<> &Builder, Value *Vec,		static Value *ApplyX86MaskOn1BitsVec(IRBuilder<> &Builder, Value *Vec,
Value *Mask) {		Value *Mask) {
unsigned NumElts = cast<VectorType>(Vec->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(Vec->getType())->getNumElements();
if (Mask) {		if (Mask) {
const auto *C = dyn_cast<Constant>(Mask);		const auto *C = dyn_cast<Constant>(Mask);
if (!C || !C->isAllOnesValue())		if (!C || !C->isAllOnesValue())
Vec = Builder.CreateAnd(Vec, getX86MaskVec(Builder, Mask, NumElts));		Vec = Builder.CreateAnd(Vec, getX86MaskVec(Builder, Mask, NumElts));
}		}

if (NumElts < 8) {		if (NumElts < 8) {
int Indices[8];		int Indices[8];
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
Indices[i] = i;		Indices[i] = i;
for (unsigned i = NumElts; i != 8; ++i)		for (unsigned i = NumElts; i != 8; ++i)
Indices[i] = NumElts + i % NumElts;		Indices[i] = NumElts + i % NumElts;
Vec = Builder.CreateShuffleVector(Vec,		Vec = Builder.CreateShuffleVector(Vec,
Constant::getNullValue(Vec->getType()),		Constant::getNullValue(Vec->getType()),
Indices);		Indices);
}		}
return Builder.CreateBitCast(Vec, Builder.getIntNTy(std::max(NumElts, 8U)));		return Builder.CreateBitCast(Vec, Builder.getIntNTy(std::max(NumElts, 8U)));
}		}

static Value *upgradeMaskedCompare(IRBuilder<> &Builder, CallInst &CI,		static Value *upgradeMaskedCompare(IRBuilder<> &Builder, CallInst &CI,
unsigned CC, bool Signed) {		unsigned CC, bool Signed) {
Value *Op0 = CI.getArgOperand(0);		Value *Op0 = CI.getArgOperand(0);
unsigned NumElts = cast<VectorType>(Op0->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(Op0->getType())->getNumElements();

Value *Cmp;		Value *Cmp;
if (CC == 3) {		if (CC == 3) {
Cmp = Constant::getNullValue(llvm::VectorType::get(Builder.getInt1Ty(), NumElts));		Cmp = Constant::getNullValue(llvm::VectorType::get(Builder.getInt1Ty(), NumElts));
} else if (CC == 7) {		} else if (CC == 7) {
Cmp = Constant::getAllOnesValue(llvm::VectorType::get(Builder.getInt1Ty(), NumElts));		Cmp = Constant::getAllOnesValue(llvm::VectorType::get(Builder.getInt1Ty(), NumElts));
} else {		} else {
ICmpInst::Predicate Pred;		ICmpInst::Predicate Pred;
[36 lines not shown]	static Value* upgradeMaskedMove(IRBuilder<> &Builder, CallInst &CI) {
Value* Select = Builder.CreateSelect(Cmp, Extract1, Extract2);		Value* Select = Builder.CreateSelect(Cmp, Extract1, Extract2);
return Builder.CreateInsertElement(A, Select, (uint64_t)0);		return Builder.CreateInsertElement(A, Select, (uint64_t)0);
}		}


static Value* UpgradeMaskToInt(IRBuilder<> &Builder, CallInst &CI) {		static Value* UpgradeMaskToInt(IRBuilder<> &Builder, CallInst &CI) {
Value* Op = CI.getArgOperand(0);		Value* Op = CI.getArgOperand(0);
Type* ReturnOp = CI.getType();		Type* ReturnOp = CI.getType();
unsigned NumElts = cast<VectorType>(CI.getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI.getType())->getNumElements();
Value *Mask = getX86MaskVec(Builder, Op, NumElts);		Value *Mask = getX86MaskVec(Builder, Op, NumElts);
return Builder.CreateSExt(Mask, ReturnOp, "vpmovm2");		return Builder.CreateSExt(Mask, ReturnOp, "vpmovm2");
}		}

// Replace intrinsic with unmasked version and a select.		// Replace intrinsic with unmasked version and a select.
static bool upgradeAVX512MaskToSelect(StringRef Name, IRBuilder<> &Builder,		static bool upgradeAVX512MaskToSelect(StringRef Name, IRBuilder<> &Builder,
CallInst &CI, Value *&Rep) {		CallInst &CI, Value *&Rep) {
Name = Name.substr(12); // Remove avx512.mask.		Name = Name.substr(12); // Remove avx512.mask.
[432 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
Rep = Builder.CreateAnd(Op0, Op1);		Rep = Builder.CreateAnd(Op0, Op1);
llvm::Type *Ty = Op0->getType();		llvm::Type *Ty = Op0->getType();
Value *Zero = llvm::Constant::getNullValue(Ty);		Value *Zero = llvm::Constant::getNullValue(Ty);
ICmpInst::Predicate Pred =		ICmpInst::Predicate Pred =
Name.startswith("avx512.ptestm") ? ICmpInst::ICMP_NE : ICmpInst::ICMP_EQ;		Name.startswith("avx512.ptestm") ? ICmpInst::ICMP_NE : ICmpInst::ICMP_EQ;
Rep = Builder.CreateICmp(Pred, Rep, Zero);		Rep = Builder.CreateICmp(Pred, Rep, Zero);
Rep = ApplyX86MaskOn1BitsVec(Builder, Rep, Mask);		Rep = ApplyX86MaskOn1BitsVec(Builder, Rep, Mask);
} else if (IsX86 && (Name.startswith("avx512.mask.pbroadcast"))){		} else if (IsX86 && (Name.startswith("avx512.mask.pbroadcast"))){
unsigned NumElts =		unsigned NumElts = cast<FixedVectorType>(CI->getArgOperand(1)->getType())
cast<VectorType>(CI->getArgOperand(1)->getType())->getNumElements();		->getNumElements();
Rep = Builder.CreateVectorSplat(NumElts, CI->getArgOperand(0));		Rep = Builder.CreateVectorSplat(NumElts, CI->getArgOperand(0));
Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,
CI->getArgOperand(1));		CI->getArgOperand(1));
} else if (IsX86 && (Name.startswith("avx512.kunpck"))) {		} else if (IsX86 && (Name.startswith("avx512.kunpck"))) {
unsigned NumElts = CI->getType()->getScalarSizeInBits();		unsigned NumElts = CI->getType()->getScalarSizeInBits();
Value *LHS = getX86MaskVec(Builder, CI->getArgOperand(0), NumElts);		Value *LHS = getX86MaskVec(Builder, CI->getArgOperand(0), NumElts);
Value *RHS = getX86MaskVec(Builder, CI->getArgOperand(1), NumElts);		Value *RHS = getX86MaskVec(Builder, CI->getArgOperand(1), NumElts);
int Indices[64];		int Indices[64];
[233 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
Name == "avx512.mask.cvtqq2ps.256" ||		Name == "avx512.mask.cvtqq2ps.256" ||
Name == "avx512.mask.cvtqq2ps.512" ||		Name == "avx512.mask.cvtqq2ps.512" ||
Name == "avx512.mask.cvtuqq2ps.256" ||		Name == "avx512.mask.cvtuqq2ps.256" ||
Name == "avx512.mask.cvtuqq2ps.512" ||		Name == "avx512.mask.cvtuqq2ps.512" ||
Name == "sse2.cvtps2pd" ||		Name == "sse2.cvtps2pd" ||
Name == "avx.cvt.ps2.pd.256" ||		Name == "avx.cvt.ps2.pd.256" ||
Name == "avx512.mask.cvtps2pd.128" ||		Name == "avx512.mask.cvtps2pd.128" ||
Name == "avx512.mask.cvtps2pd.256")) {		Name == "avx512.mask.cvtps2pd.256")) {
auto *DstTy = cast<VectorType>(CI->getType());		auto *DstTy = cast<FixedVectorType>(CI->getType());
Rep = CI->getArgOperand(0);		Rep = CI->getArgOperand(0);
auto *SrcTy = cast<VectorType>(Rep->getType());		auto *SrcTy = cast<FixedVectorType>(Rep->getType());

unsigned NumDstElts = DstTy->getNumElements();		unsigned NumDstElts = DstTy->getNumElements();
if (NumDstElts < SrcTy->getNumElements()) {		if (NumDstElts < SrcTy->getNumElements()) {
assert(NumDstElts == 2 && "Unexpected vector size");		assert(NumDstElts == 2 && "Unexpected vector size");
Rep = Builder.CreateShuffleVector(Rep, Rep, ArrayRef<int>{0, 1});		Rep = Builder.CreateShuffleVector(Rep, Rep, ArrayRef<int>{0, 1});
}		}

bool IsPS2PD = SrcTy->getElementType()->isFloatTy();		bool IsPS2PD = SrcTy->getElementType()->isFloatTy();
[13 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
: Builder.CreateSIToFP(Rep, DstTy, "cvt");		: Builder.CreateSIToFP(Rep, DstTy, "cvt");
}		}

if (CI->getNumArgOperands() >= 3)		if (CI->getNumArgOperands() >= 3)
Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,
CI->getArgOperand(1));		CI->getArgOperand(1));
} else if (IsX86 && (Name.startswith("avx512.mask.vcvtph2ps.") ||		} else if (IsX86 && (Name.startswith("avx512.mask.vcvtph2ps.") ||
Name.startswith("vcvtph2ps."))) {		Name.startswith("vcvtph2ps."))) {
auto *DstTy = cast<VectorType>(CI->getType());		auto *DstTy = cast<FixedVectorType>(CI->getType());
Rep = CI->getArgOperand(0);		Rep = CI->getArgOperand(0);
auto *SrcTy = cast<VectorType>(Rep->getType());		auto *SrcTy = cast<FixedVectorType>(Rep->getType());
unsigned NumDstElts = DstTy->getNumElements();		unsigned NumDstElts = DstTy->getNumElements();
if (NumDstElts != SrcTy->getNumElements()) {		if (NumDstElts != SrcTy->getNumElements()) {
assert(NumDstElts == 4 && "Unexpected vector size");		assert(NumDstElts == 4 && "Unexpected vector size");
Rep = Builder.CreateShuffleVector(Rep, Rep, ArrayRef<int>{0, 1, 2, 3});		Rep = Builder.CreateShuffleVector(Rep, Rep, ArrayRef<int>{0, 1, 2, 3});
}		}
Rep = Builder.CreateBitCast(		Rep = Builder.CreateBitCast(
Rep, VectorType::get(Type::getHalfTy(C), NumDstElts));		Rep, VectorType::get(Type::getHalfTy(C), NumDstElts));
Rep = Builder.CreateFPExt(Rep, DstTy, "cvtph2ps");		Rep = Builder.CreateFPExt(Rep, DstTy, "cvtph2ps");
if (CI->getNumArgOperands() >= 3)		if (CI->getNumArgOperands() >= 3)
Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,
CI->getArgOperand(1));		CI->getArgOperand(1));
} else if (IsX86 && (Name.startswith("avx512.mask.loadu."))) {		} else if (IsX86 && (Name.startswith("avx512.mask.loadu."))) {
Rep = UpgradeMaskedLoad(Builder, CI->getArgOperand(0),		Rep = UpgradeMaskedLoad(Builder, CI->getArgOperand(0),
CI->getArgOperand(1), CI->getArgOperand(2),		CI->getArgOperand(1), CI->getArgOperand(2),
/*Aligned*/false);		/*Aligned*/false);
} else if (IsX86 && (Name.startswith("avx512.mask.load."))) {		} else if (IsX86 && (Name.startswith("avx512.mask.load."))) {
Rep = UpgradeMaskedLoad(Builder, CI->getArgOperand(0),		Rep = UpgradeMaskedLoad(Builder, CI->getArgOperand(0),
CI->getArgOperand(1),CI->getArgOperand(2),		CI->getArgOperand(1),CI->getArgOperand(2),
/*Aligned*/true);		/*Aligned*/true);
} else if (IsX86 && Name.startswith("avx512.mask.expand.load.")) {		} else if (IsX86 && Name.startswith("avx512.mask.expand.load.")) {
auto *ResultTy = cast<VectorType>(CI->getType());		auto *ResultTy = cast<FixedVectorType>(CI->getType());
Type *PtrTy = ResultTy->getElementType();		Type *PtrTy = ResultTy->getElementType();

// Cast the pointer to element type.		// Cast the pointer to element type.
Value *Ptr = Builder.CreateBitCast(CI->getOperand(0),		Value *Ptr = Builder.CreateBitCast(CI->getOperand(0),
llvm::PointerType::getUnqual(PtrTy));		llvm::PointerType::getUnqual(PtrTy));

Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),		Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),
ResultTy->getNumElements());		ResultTy->getNumElements());

Function *ELd = Intrinsic::getDeclaration(F->getParent(),		Function *ELd = Intrinsic::getDeclaration(F->getParent(),
Intrinsic::masked_expandload,		Intrinsic::masked_expandload,
ResultTy);		ResultTy);
Rep = Builder.CreateCall(ELd, { Ptr, MaskVec, CI->getOperand(1) });		Rep = Builder.CreateCall(ELd, { Ptr, MaskVec, CI->getOperand(1) });
} else if (IsX86 && Name.startswith("avx512.mask.compress.store.")) {		} else if (IsX86 && Name.startswith("avx512.mask.compress.store.")) {
auto *ResultTy = cast<VectorType>(CI->getArgOperand(1)->getType());		auto *ResultTy = cast<FixedVectorType>(CI->getArgOperand(1)->getType());
Type *PtrTy = ResultTy->getElementType();		Type *PtrTy = ResultTy->getElementType();

// Cast the pointer to element type.		// Cast the pointer to element type.
Value *Ptr = Builder.CreateBitCast(CI->getOperand(0),		Value *Ptr = Builder.CreateBitCast(CI->getOperand(0),
llvm::PointerType::getUnqual(PtrTy));		llvm::PointerType::getUnqual(PtrTy));

Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),		Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),
ResultTy->getNumElements());		ResultTy->getNumElements());

Function *CSt = Intrinsic::getDeclaration(F->getParent(),		Function *CSt = Intrinsic::getDeclaration(F->getParent(),
Intrinsic::masked_compressstore,		Intrinsic::masked_compressstore,
ResultTy);		ResultTy);
Rep = Builder.CreateCall(CSt, { CI->getArgOperand(1), Ptr, MaskVec });		Rep = Builder.CreateCall(CSt, { CI->getArgOperand(1), Ptr, MaskVec });
} else if (IsX86 && (Name.startswith("avx512.mask.compress.") ||		} else if (IsX86 && (Name.startswith("avx512.mask.compress.") ||
Name.startswith("avx512.mask.expand."))) {		Name.startswith("avx512.mask.expand."))) {
auto *ResultTy = cast<VectorType>(CI->getType());		auto *ResultTy = cast<FixedVectorType>(CI->getType());

Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),		Value *MaskVec = getX86MaskVec(Builder, CI->getArgOperand(2),
ResultTy->getNumElements());		ResultTy->getNumElements());

bool IsCompress = Name[12] == 'c';		bool IsCompress = Name[12] == 'c';
Intrinsic::ID IID = IsCompress ? Intrinsic::x86_avx512_mask_compress		Intrinsic::ID IID = IsCompress ? Intrinsic::x86_avx512_mask_compress
: Intrinsic::x86_avx512_mask_expand;		: Intrinsic::x86_avx512_mask_expand;
Function *Intr = Intrinsic::getDeclaration(F->getParent(), IID, ResultTy);		Function *Intr = Intrinsic::getDeclaration(F->getParent(), IID, ResultTy);
[63 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
Function *CRC32 = Intrinsic::getDeclaration(F->getParent(),		Function *CRC32 = Intrinsic::getDeclaration(F->getParent(),
Intrinsic::x86_sse42_crc32_32_8);		Intrinsic::x86_sse42_crc32_32_8);
Value *Trunc0 = Builder.CreateTrunc(CI->getArgOperand(0), Type::getInt32Ty(C));		Value *Trunc0 = Builder.CreateTrunc(CI->getArgOperand(0), Type::getInt32Ty(C));
Rep = Builder.CreateCall(CRC32, {Trunc0, CI->getArgOperand(1)});		Rep = Builder.CreateCall(CRC32, {Trunc0, CI->getArgOperand(1)});
Rep = Builder.CreateZExt(Rep, CI->getType(), "");		Rep = Builder.CreateZExt(Rep, CI->getType(), "");
} else if (IsX86 && (Name.startswith("avx.vbroadcast.s") ||		} else if (IsX86 && (Name.startswith("avx.vbroadcast.s") ||
Name.startswith("avx512.vbroadcast.s"))) {		Name.startswith("avx512.vbroadcast.s"))) {
// Replace broadcasts with a series of insertelements.		// Replace broadcasts with a series of insertelements.
auto *VecTy = cast<VectorType>(CI->getType());		auto *VecTy = cast<FixedVectorType>(CI->getType());
Type *EltTy = VecTy->getElementType();		Type *EltTy = VecTy->getElementType();
unsigned EltNum = VecTy->getNumElements();		unsigned EltNum = VecTy->getNumElements();
Value *Cast = Builder.CreateBitCast(CI->getArgOperand(0),		Value *Cast = Builder.CreateBitCast(CI->getArgOperand(0),
EltTy->getPointerTo());		EltTy->getPointerTo());
Value *Load = Builder.CreateLoad(EltTy, Cast);		Value *Load = Builder.CreateLoad(EltTy, Cast);
Type *I32Ty = Type::getInt32Ty(C);		Type *I32Ty = Type::getInt32Ty(C);
Rep = UndefValue::get(VecTy);		Rep = UndefValue::get(VecTy);
for (unsigned I = 0; I < EltNum; ++I)		for (unsigned I = 0; I < EltNum; ++I)
Rep = Builder.CreateInsertElement(Rep, Load,		Rep = Builder.CreateInsertElement(Rep, Load,
ConstantInt::get(I32Ty, I));		ConstantInt::get(I32Ty, I));
} else if (IsX86 && (Name.startswith("sse41.pmovsx") ||		} else if (IsX86 && (Name.startswith("sse41.pmovsx") ||
Name.startswith("sse41.pmovzx") ||		Name.startswith("sse41.pmovzx") ||
Name.startswith("avx2.pmovsx") ||		Name.startswith("avx2.pmovsx") ||
Name.startswith("avx2.pmovzx") ||		Name.startswith("avx2.pmovzx") ||
Name.startswith("avx512.mask.pmovsx") ||		Name.startswith("avx512.mask.pmovsx") ||
Name.startswith("avx512.mask.pmovzx"))) {		Name.startswith("avx512.mask.pmovzx"))) {
VectorType *SrcTy = cast<VectorType>(CI->getArgOperand(0)->getType());		auto *SrcTy = cast<VectorType>(CI->getArgOperand(0)->getType());
VectorType *DstTy = cast<VectorType>(CI->getType());		auto *DstTy = cast<FixedVectorType>(CI->getType());
unsigned NumDstElts = DstTy->getNumElements();		unsigned NumDstElts = DstTy->getNumElements();

// Extract a subvector of the first NumDstElts lanes and sign/zero extend.		// Extract a subvector of the first NumDstElts lanes and sign/zero extend.
SmallVector<int, 8> ShuffleMask(NumDstElts);		SmallVector<int, 8> ShuffleMask(NumDstElts);
for (unsigned i = 0; i != NumDstElts; ++i)		for (unsigned i = 0; i != NumDstElts; ++i)
ShuffleMask[i] = i;		ShuffleMask[i] = i;

Value *SV = Builder.CreateShuffleVector(		Value *SV = Builder.CreateShuffleVector(
[50 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
}		}
Rep = Builder.CreateShuffleVector(CI->getArgOperand(0),		Rep = Builder.CreateShuffleVector(CI->getArgOperand(0),
CI->getArgOperand(1), ShuffleMask);		CI->getArgOperand(1), ShuffleMask);
Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,
CI->getArgOperand(3));		CI->getArgOperand(3));
}else if (IsX86 && (Name.startswith("avx512.mask.broadcastf") ||		}else if (IsX86 && (Name.startswith("avx512.mask.broadcastf") ||
Name.startswith("avx512.mask.broadcasti"))) {		Name.startswith("avx512.mask.broadcasti"))) {
unsigned NumSrcElts =		unsigned NumSrcElts =
cast<VectorType>(CI->getArgOperand(0)->getType())->getNumElements();		cast<FixedVectorType>(CI->getArgOperand(0)->getType())
unsigned NumDstElts = cast<VectorType>(CI->getType())->getNumElements();		->getNumElements();
		unsigned NumDstElts =
		cast<FixedVectorType>(CI->getType())->getNumElements();

SmallVector<int, 8> ShuffleMask(NumDstElts);		SmallVector<int, 8> ShuffleMask(NumDstElts);
for (unsigned i = 0; i != NumDstElts; ++i)		for (unsigned i = 0; i != NumDstElts; ++i)
ShuffleMask[i] = i % NumSrcElts;		ShuffleMask[i] = i % NumSrcElts;

Rep = Builder.CreateShuffleVector(CI->getArgOperand(0),		Rep = Builder.CreateShuffleVector(CI->getArgOperand(0),
CI->getArgOperand(0),		CI->getArgOperand(0),
ShuffleMask);		ShuffleMask);
[72 lines not shown]	if (!NewFn) {
} else if (IsX86 && (Name == "sse41.pblendw" ||		} else if (IsX86 && (Name == "sse41.pblendw" ||
Name.startswith("sse41.blendp") ||		Name.startswith("sse41.blendp") ||
Name.startswith("avx.blend.p") ||		Name.startswith("avx.blend.p") ||
Name == "avx2.pblendw" ||		Name == "avx2.pblendw" ||
Name.startswith("avx2.pblendd."))) {		Name.startswith("avx2.pblendd."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
Value *Op1 = CI->getArgOperand(1);		Value *Op1 = CI->getArgOperand(1);
unsigned Imm = cast <ConstantInt>(CI->getArgOperand(2))->getZExtValue();		unsigned Imm = cast <ConstantInt>(CI->getArgOperand(2))->getZExtValue();
VectorType *VecTy = cast<VectorType>(CI->getType());		auto *VecTy = cast<FixedVectorType>(CI->getType());
unsigned NumElts = VecTy->getNumElements();		unsigned NumElts = VecTy->getNumElements();

SmallVector<int, 16> Idxs(NumElts);		SmallVector<int, 16> Idxs(NumElts);
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
Idxs[i] = ((Imm >> (i%8)) & 1) ? i + NumElts : i;		Idxs[i] = ((Imm >> (i%8)) & 1) ? i + NumElts : i;

Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);
} else if (IsX86 && (Name.startswith("avx.vinsertf128.") ||		} else if (IsX86 && (Name.startswith("avx.vinsertf128.") ||
Name == "avx2.vinserti128" ||		Name == "avx2.vinserti128" ||
Name.startswith("avx512.mask.insert"))) {		Name.startswith("avx512.mask.insert"))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
Value *Op1 = CI->getArgOperand(1);		Value *Op1 = CI->getArgOperand(1);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();
unsigned DstNumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned DstNumElts =
unsigned SrcNumElts = cast<VectorType>(Op1->getType())->getNumElements();		cast<FixedVectorType>(CI->getType())->getNumElements();
		unsigned SrcNumElts =
		cast<FixedVectorType>(Op1->getType())->getNumElements();
unsigned Scale = DstNumElts / SrcNumElts;		unsigned Scale = DstNumElts / SrcNumElts;

// Mask off the high bits of the immediate value; hardware ignores those.		// Mask off the high bits of the immediate value; hardware ignores those.
Imm = Imm % Scale;		Imm = Imm % Scale;

// Extend the second operand into a vector the size of the destination.		// Extend the second operand into a vector the size of the destination.
Value *UndefV = UndefValue::get(Op1->getType());		Value *UndefV = UndefValue::get(Op1->getType());
SmallVector<int, 8> Idxs(DstNumElts);		SmallVector<int, 8> Idxs(DstNumElts);
[26 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
if (CI->getNumArgOperands() == 5)		if (CI->getNumArgOperands() == 5)
Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,
CI->getArgOperand(3));		CI->getArgOperand(3));
} else if (IsX86 && (Name.startswith("avx.vextractf128.") ||		} else if (IsX86 && (Name.startswith("avx.vextractf128.") ||
Name == "avx2.vextracti128" ||		Name == "avx2.vextracti128" ||
Name.startswith("avx512.mask.vextract"))) {		Name.startswith("avx512.mask.vextract"))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();
unsigned DstNumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned DstNumElts =
unsigned SrcNumElts = cast<VectorType>(Op0->getType())->getNumElements();		cast<FixedVectorType>(CI->getType())->getNumElements();
		unsigned SrcNumElts =
		cast<FixedVectorType>(Op0->getType())->getNumElements();
unsigned Scale = SrcNumElts / DstNumElts;		unsigned Scale = SrcNumElts / DstNumElts;

// Mask off the high bits of the immediate value; hardware ignores those.		// Mask off the high bits of the immediate value; hardware ignores those.
Imm = Imm % Scale;		Imm = Imm % Scale;

// Get indexes for the subvector of the input vector.		// Get indexes for the subvector of the input vector.
SmallVector<int, 8> Idxs(DstNumElts);		SmallVector<int, 8> Idxs(DstNumElts);
for (unsigned i = 0; i != DstNumElts; ++i) {		for (unsigned i = 0; i != DstNumElts; ++i) {
Idxs[i] = i + (Imm * DstNumElts);		Idxs[i] = i + (Imm * DstNumElts);
}		}
Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

// If the intrinsic has a mask operand, handle that.		// If the intrinsic has a mask operand, handle that.
if (CI->getNumArgOperands() == 4)		if (CI->getNumArgOperands() == 4)
Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (!IsX86 && Name == "stackprotectorcheck") {		} else if (!IsX86 && Name == "stackprotectorcheck") {
Rep = nullptr;		Rep = nullptr;
} else if (IsX86 && (Name.startswith("avx512.mask.perm.df.") ||		} else if (IsX86 && (Name.startswith("avx512.mask.perm.df.") ||
Name.startswith("avx512.mask.perm.di."))) {		Name.startswith("avx512.mask.perm.di."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();
VectorType *VecTy = cast<VectorType>(CI->getType());		auto *VecTy = cast<FixedVectorType>(CI->getType());
unsigned NumElts = VecTy->getNumElements();		unsigned NumElts = VecTy->getNumElements();

SmallVector<int, 8> Idxs(NumElts);		SmallVector<int, 8> Idxs(NumElts);
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
Idxs[i] = (i & ~0x3) + ((Imm >> (2 * (i & 0x3))) & 3);		Idxs[i] = (i & ~0x3) + ((Imm >> (2 * (i & 0x3))) & 3);

Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

if (CI->getNumArgOperands() == 4)		if (CI->getNumArgOperands() == 4)
Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (IsX86 && (Name.startswith("avx.vperm2f128.") ||		} else if (IsX86 && (Name.startswith("avx.vperm2f128.") ||
Name == "avx2.vperm2i128")) {		Name == "avx2.vperm2i128")) {
// The immediate permute control byte looks like this:		// The immediate permute control byte looks like this:
// [1:0] - select 128 bits from sources for low half of destination		// [1:0] - select 128 bits from sources for low half of destination
// [2] - ignore		// [2] - ignore
// [3] - zero low half of destination		// [3] - zero low half of destination
// [5:4] - select 128 bits from sources for high half of destination		// [5:4] - select 128 bits from sources for high half of destination
// [6] - ignore		// [6] - ignore
// [7] - zero high half of destination		// [7] - zero high half of destination

uint8_t Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();		uint8_t Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();

unsigned NumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();
unsigned HalfSize = NumElts / 2;		unsigned HalfSize = NumElts / 2;
SmallVector<int, 8> ShuffleMask(NumElts);		SmallVector<int, 8> ShuffleMask(NumElts);

// Determine which operand(s) are actually in use for this instruction.		// Determine which operand(s) are actually in use for this instruction.
Value *V0 = (Imm & 0x02) ? CI->getArgOperand(1) : CI->getArgOperand(0);		Value *V0 = (Imm & 0x02) ? CI->getArgOperand(1) : CI->getArgOperand(0);
Value *V1 = (Imm & 0x20) ? CI->getArgOperand(1) : CI->getArgOperand(0);		Value *V1 = (Imm & 0x20) ? CI->getArgOperand(1) : CI->getArgOperand(0);

// If needed, replace operands based on zero mask.		// If needed, replace operands based on zero mask.
[13 lines not shown]	if (IsX86 && (Name.startswith("sse2.pcmp") ||
Rep = Builder.CreateShuffleVector(V0, V1, ShuffleMask);		Rep = Builder.CreateShuffleVector(V0, V1, ShuffleMask);

} else if (IsX86 && (Name.startswith("avx.vpermil.") ||		} else if (IsX86 && (Name.startswith("avx.vpermil.") ||
Name == "sse2.pshuf.d" ||		Name == "sse2.pshuf.d" ||
Name.startswith("avx512.mask.vpermil.p") ||		Name.startswith("avx512.mask.vpermil.p") ||
Name.startswith("avx512.mask.pshuf.d."))) {		Name.startswith("avx512.mask.pshuf.d."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();
VectorType *VecTy = cast<VectorType>(CI->getType());		auto *VecTy = cast<FixedVectorType>(CI->getType());
unsigned NumElts = VecTy->getNumElements();		unsigned NumElts = VecTy->getNumElements();
// Calculate the size of each index in the immediate.		// Calculate the size of each index in the immediate.
unsigned IdxSize = 64 / VecTy->getScalarSizeInBits();		unsigned IdxSize = 64 / VecTy->getScalarSizeInBits();
unsigned IdxMask = ((1 << IdxSize) - 1);		unsigned IdxMask = ((1 << IdxSize) - 1);

SmallVector<int, 8> Idxs(NumElts);		SmallVector<int, 8> Idxs(NumElts);
// Lookup the bits for this element, wrapping around the immediate every		// Lookup the bits for this element, wrapping around the immediate every
// 8-bits. Elements are grouped into sets of 2 or 4 elements so we need		// 8-bits. Elements are grouped into sets of 2 or 4 elements so we need
// to offset by the first index of each group.		// to offset by the first index of each group.
for (unsigned i = 0; i != NumElts; ++i)		for (unsigned i = 0; i != NumElts; ++i)
Idxs[i] = ((Imm >> ((i * IdxSize) % 8)) & IdxMask) | (i & ~IdxMask);		Idxs[i] = ((Imm >> ((i * IdxSize) % 8)) & IdxMask) | (i & ~IdxMask);

Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

if (CI->getNumArgOperands() == 4)		if (CI->getNumArgOperands() == 4)
Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (IsX86 && (Name == "sse2.pshufl.w" ||		} else if (IsX86 && (Name == "sse2.pshufl.w" ||
Name.startswith("avx512.mask.pshufl.w."))) {		Name.startswith("avx512.mask.pshufl.w."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();
unsigned NumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();

SmallVector<int, 16> Idxs(NumElts);		SmallVector<int, 16> Idxs(NumElts);
for (unsigned l = 0; l != NumElts; l += 8) {		for (unsigned l = 0; l != NumElts; l += 8) {
for (unsigned i = 0; i != 4; ++i)		for (unsigned i = 0; i != 4; ++i)
Idxs[i + l] = ((Imm >> (2 * i)) & 0x3) + l;		Idxs[i + l] = ((Imm >> (2 * i)) & 0x3) + l;
for (unsigned i = 4; i != 8; ++i)		for (unsigned i = 4; i != 8; ++i)
Idxs[i + l] = i + l;		Idxs[i + l] = i + l;
}		}

Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

if (CI->getNumArgOperands() == 4)		if (CI->getNumArgOperands() == 4)
Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (IsX86 && (Name == "sse2.pshufh.w" ||		} else if (IsX86 && (Name == "sse2.pshufh.w" ||
Name.startswith("avx512.mask.pshufh.w."))) {		Name.startswith("avx512.mask.pshufh.w."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(1))->getZExtValue();
unsigned NumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();

SmallVector<int, 16> Idxs(NumElts);		SmallVector<int, 16> Idxs(NumElts);
for (unsigned l = 0; l != NumElts; l += 8) {		for (unsigned l = 0; l != NumElts; l += 8) {
for (unsigned i = 0; i != 4; ++i)		for (unsigned i = 0; i != 4; ++i)
Idxs[i + l] = i + l;		Idxs[i + l] = i + l;
for (unsigned i = 0; i != 4; ++i)		for (unsigned i = 0; i != 4; ++i)
Idxs[i + l + 4] = ((Imm >> (2 * i)) & 0x3) + 4 + l;		Idxs[i + l + 4] = ((Imm >> (2 * i)) & 0x3) + 4 + l;
}		}

Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

if (CI->getNumArgOperands() == 4)		if (CI->getNumArgOperands() == 4)
Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (IsX86 && Name.startswith("avx512.mask.shuf.p")) {		} else if (IsX86 && Name.startswith("avx512.mask.shuf.p")) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
Value *Op1 = CI->getArgOperand(1);		Value *Op1 = CI->getArgOperand(1);
unsigned Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();		unsigned Imm = cast<ConstantInt>(CI->getArgOperand(2))->getZExtValue();
unsigned NumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();

unsigned NumLaneElts = 128/CI->getType()->getScalarSizeInBits();		unsigned NumLaneElts = 128/CI->getType()->getScalarSizeInBits();
unsigned HalfLaneElts = NumLaneElts / 2;		unsigned HalfLaneElts = NumLaneElts / 2;

SmallVector<int, 16> Idxs(NumElts);		SmallVector<int, 16> Idxs(NumElts);
for (unsigned i = 0; i != NumElts; ++i) {		for (unsigned i = 0; i != NumElts; ++i) {
// Base index is the starting element of the lane.		// Base index is the starting element of the lane.
Idxs[i] = i - (i % NumLaneElts);		Idxs[i] = i - (i % NumLaneElts);
// If we are half way through the lane switch to the other source.		// If we are half way through the lane switch to the other source.
if ((i % NumLaneElts) >= HalfLaneElts)		if ((i % NumLaneElts) >= HalfLaneElts)
Idxs[i] += NumElts;		Idxs[i] += NumElts;
// Now select the specific element. By adding HalfLaneElts bits from		// Now select the specific element. By adding HalfLaneElts bits from
// the immediate. Wrapping around the immediate every 8-bits.		// the immediate. Wrapping around the immediate every 8-bits.
Idxs[i] += (Imm >> ((i * HalfLaneElts) % 8)) & ((1 << HalfLaneElts) - 1);		Idxs[i] += (Imm >> ((i * HalfLaneElts) % 8)) & ((1 << HalfLaneElts) - 1);
}		}

Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);

Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(4), Rep,
CI->getArgOperand(3));		CI->getArgOperand(3));
} else if (IsX86 && (Name.startswith("avx512.mask.movddup") ||		} else if (IsX86 && (Name.startswith("avx512.mask.movddup") ||
Name.startswith("avx512.mask.movshdup") ||		Name.startswith("avx512.mask.movshdup") ||
Name.startswith("avx512.mask.movsldup"))) {		Name.startswith("avx512.mask.movsldup"))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
unsigned NumElts = cast<VectorType>(CI->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();
unsigned NumLaneElts = 128/CI->getType()->getScalarSizeInBits();		unsigned NumLaneElts = 128/CI->getType()->getScalarSizeInBits();

unsigned Offset = 0;		unsigned Offset = 0;
if (Name.startswith("avx512.mask.movshdup."))		if (Name.startswith("avx512.mask.movshdup."))
Offset = 1;		Offset = 1;

SmallVector<int, 16> Idxs(NumElts);		SmallVector<int, 16> Idxs(NumElts);
for (unsigned l = 0; l != NumElts; l += NumLaneElts)		for (unsigned l = 0; l != NumElts; l += NumLaneElts)
for (unsigned i = 0; i != NumLaneElts; i += 2) {		for (unsigned i = 0; i != NumLaneElts; i += 2) {
Idxs[i + l + 0] = i + l + Offset;		Idxs[i + l + 0] = i + l + Offset;
Idxs[i + l + 1] = i + l + Offset;		Idxs[i + l + 1] = i + l + Offset;
}		}

Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op0, Idxs);

Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(2), Rep,
CI->getArgOperand(1));		CI->getArgOperand(1));
} else if (IsX86 && (Name.startswith("avx512.mask.punpckl") ||		} else if (IsX86 && (Name.startswith("avx512.mask.punpckl") ||
Name.startswith("avx512.mask.unpckl."))) {		Name.startswith("avx512.mask.unpckl."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
Value *Op1 = CI->getArgOperand(1);		Value *Op1 = CI->getArgOperand(1);
int NumElts = cast<VectorType>(CI->getType())->getNumElements();		int NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();
int NumLaneElts = 128/CI->getType()->getScalarSizeInBits();		int NumLaneElts = 128/CI->getType()->getScalarSizeInBits();

SmallVector<int, 64> Idxs(NumElts);		SmallVector<int, 64> Idxs(NumElts);
for (int l = 0; l != NumElts; l += NumLaneElts)		for (int l = 0; l != NumElts; l += NumLaneElts)
for (int i = 0; i != NumLaneElts; ++i)		for (int i = 0; i != NumLaneElts; ++i)
Idxs[i + l] = l + (i / 2) + NumElts * (i % 2);		Idxs[i + l] = l + (i / 2) + NumElts * (i % 2);

Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);

Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,		Rep = EmitX86Select(Builder, CI->getArgOperand(3), Rep,
CI->getArgOperand(2));		CI->getArgOperand(2));
} else if (IsX86 && (Name.startswith("avx512.mask.punpckh") ||		} else if (IsX86 && (Name.startswith("avx512.mask.punpckh") ||
Name.startswith("avx512.mask.unpckh."))) {		Name.startswith("avx512.mask.unpckh."))) {
Value *Op0 = CI->getArgOperand(0);		Value *Op0 = CI->getArgOperand(0);
Value *Op1 = CI->getArgOperand(1);		Value *Op1 = CI->getArgOperand(1);
int NumElts = cast<VectorType>(CI->getType())->getNumElements();		int NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();
int NumLaneElts = 128/CI->getType()->getScalarSizeInBits();		int NumLaneElts = 128/CI->getType()->getScalarSizeInBits();

SmallVector<int, 64> Idxs(NumElts);		SmallVector<int, 64> Idxs(NumElts);
for (int l = 0; l != NumElts; l += NumLaneElts)		for (int l = 0; l != NumElts; l += NumLaneElts)
for (int i = 0; i != NumLaneElts; ++i)		for (int i = 0; i != NumLaneElts; ++i)
Idxs[i + l] = (NumLaneElts / 2) + l + (i / 2) + NumElts * (i % 2);		Idxs[i + l] = (NumLaneElts / 2) + l + (i / 2) + NumElts * (i % 2);

Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);		Rep = Builder.CreateShuffleVector(Op0, Op1, Idxs);
▲ Show 20 Lines • Show All 551 Lines • ▼ Show 20 Lines
Value *Ops[] = { CI->getArgOperand(0), CI->getArgOperand(1),		Value *Ops[] = { CI->getArgOperand(0), CI->getArgOperand(1),
CI->getArgOperand(2), CI->getArgOperand(4) };		CI->getArgOperand(2), CI->getArgOperand(4) };
if (IsSubAdd)		if (IsSubAdd)
Ops[2] = Builder.CreateFNeg(Ops[2]);		Ops[2] = Builder.CreateFNeg(Ops[2]);

Rep = Builder.CreateCall(Intrinsic::getDeclaration(F->getParent(), IID),		Rep = Builder.CreateCall(Intrinsic::getDeclaration(F->getParent(), IID),
Ops);		Ops);
} else {		} else {
int NumElts = cast<VectorType>(CI->getType())->getNumElements();		int NumElts = cast<FixedVectorType>(CI->getType())->getNumElements();

Value *Ops[] = { CI->getArgOperand(0), CI->getArgOperand(1),		Value *Ops[] = { CI->getArgOperand(0), CI->getArgOperand(1),
CI->getArgOperand(2) };		CI->getArgOperand(2) };

Function *FMA = Intrinsic::getDeclaration(CI->getModule(), Intrinsic::fma,		Function *FMA = Intrinsic::getDeclaration(CI->getModule(), Intrinsic::fma,
Ops[0]->getType());		Ops[0]->getType());
Value *Odd = Builder.CreateCall(FMA, Ops);		Value *Odd = Builder.CreateCall(FMA, Ops);
Ops[2] = Builder.CreateFNeg(Ops[2]);		Ops[2] = Builder.CreateFNeg(Ops[2]);
▲ Show 20 Lines • Show All 967 Lines • Show Last 20 Lines

llvm/lib/IR/ConstantFold.cpp

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	static Constant *BitCastConstantVector(Constant *CV, VectorType *DstTy) {
// Do not iterate on scalable vector. The num of elements is unknown at		// Do not iterate on scalable vector. The num of elements is unknown at
// compile-time.		// compile-time.
if (isa<ScalableVectorType>(DstTy))		if (isa<ScalableVectorType>(DstTy))
return nullptr;		return nullptr;

// If this cast changes element count then we can't handle it here:		// If this cast changes element count then we can't handle it here:
// doing so requires endianness information. This should be handled by		// doing so requires endianness information. This should be handled by
// Analysis/ConstantFolding.cpp		// Analysis/ConstantFolding.cpp
unsigned NumElts = DstTy->getNumElements();		unsigned NumElts = cast<FixedVectorType>(DstTy)->getNumElements();
if (NumElts != cast<VectorType>(CV->getType())->getNumElements())		if (NumElts != cast<FixedVectorType>(CV->getType())->getNumElements())
return nullptr;		return nullptr;

Type *DstEltTy = DstTy->getElementType();		Type *DstEltTy = DstTy->getElementType();
// Fast path for splatted constants.		// Fast path for splatted constants.
if (Constant *Splat = CV->getSplatValue()) {		if (Constant *Splat = CV->getSplatValue()) {
return ConstantVector::getSplat(DstTy->getElementCount(),		return ConstantVector::getSplat(DstTy->getElementCount(),
ConstantExpr::getBitCast(Splat, DstEltTy));		ConstantExpr::getBitCast(Splat, DstEltTy));
}		}
▲ Show 20 Lines • Show All 500 Lines • ▼ Show 20 Lines	if (ConstantExpr *CE = dyn_cast<ConstantExpr>(V)) {
}		}
}		}

// If the cast operand is a constant vector, perform the cast by		// If the cast operand is a constant vector, perform the cast by
// operating on each element. In the case of bitcasts, the element		// operating on each element. In the case of bitcasts, the element
// count may be mismatched; don't attempt to handle that here.		// count may be mismatched; don't attempt to handle that here.
if ((isa<ConstantVector>(V) || isa<ConstantDataVector>(V)) &&		if ((isa<ConstantVector>(V) || isa<ConstantDataVector>(V)) &&
DestTy->isVectorTy() &&		DestTy->isVectorTy() &&
cast<VectorType>(DestTy)->getNumElements() ==		cast<FixedVectorType>(DestTy)->getNumElements() ==
cast<VectorType>(V->getType())->getNumElements()) {		cast<FixedVectorType>(V->getType())->getNumElements()) {
ctetreau (Author, Done):

Suppose we have some vectors:

```
%a = <4 x i1> undef
%b = <4 x i8> undef
%c = <vscale x 4 x i16> undef
%d = <vscale x 4 x i32> undef
```

This check means to ask "do these vectors have the same number of elements?" We should have this truth table (since sameSize(l, r) = sameSize(r, l), I'll omit duplicates):

| l | r | sameSize(l, r) |
|---|---|----------------|
| a | a | true |
| a | b | true |
| a | c | false |
| a | d | false |
| b | b | true |
| b | c | false |
| b | d | false |
| c | c | true |
| c | d | true |
| d | d | true |

With the old implementation of `sameSize(l, r) = l->getNumElements() == r->getNumElements()`, we would get true for all combinations. This is a bug. The getElementCount() version is correct. The version on the right will reject vectors that were previously accepted, which will likely change the behavior of LLVM.

While this represents a gap in test coverage in LLVM, I don't know how we can reasonably add a few tests to plug this hole. It really seems to me that scalable vectors are largely ignored by the test suite. The fix is "all tests that concern vectors should have corresponding cases that test scalable vectors."

Given that plugging this hole in test coverage is hard, my questions are:

1. Is there a reasonable test I can add to make this situation less bad?
2. Given that the policy is that "new features need a corresponding test case", is it worth not fixing this bug?

I don't know the answer to 1. For 2, I think the bug should be fixed. I think the ship already sailed, because vscale went in without test coverage.
efriedma (Not Done):

This is unreachable for scalable vectors because ConstantVector and ConstantDataVector always have FixedVectorType. So it's hard to argue this specific case; you might as well just cast to FixedVectorType. (On a side note, if you haven't already: those values have a getType() method which returns VectorType, which should be fixed.)
ctetreau (Author, Done):

If ConstantVector and ConstantDataVector always have FixedVectorType, then they should probably return that directly. I can make that change. If you are aware of any other such cases, please let me know.

As a general point, I am making a large number of changes to many disparate parts of the codebase, most of which I am not familiar with. As such, I'm going to miss these sorts of "obvious" simplifications. Please point them out to me if you see them.

As for the overall argument, it's a defense of this class of change. For specific instances, if it's never valid for the vectors to be scalable vectors, then it makes sense to not make the change.
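To make the `sameSize` truth table from this thread concrete, here is a minimal, self-contained sketch. The names `ToyElementCount`, `sameSizeOld`, `sameSize`, and `checkTruthTable` are hypothetical and exist only for illustration; the struct is a toy stand-in for an element count (a minimum count plus a scalable flag), not the LLVM class, and the sketch assumes the old check only ever compared the minimum count.

```cpp
#include <cassert>

// Hypothetical toy model (not the LLVM class): a minimum element count plus
// a flag saying whether that count is multiplied by the runtime vscale.
struct ToyElementCount {
  unsigned Min;
  bool Scalable;
};

inline bool operator==(const ToyElementCount &L, const ToyElementCount &R) {
  return L.Min == R.Min && L.Scalable == R.Scalable;
}

// Old-style check: compares only the minimum element count.
inline bool sameSizeOld(ToyElementCount L, ToyElementCount R) {
  return L.Min == R.Min;
}

// Element-count check: the scalable flag has to match as well.
inline bool sameSize(ToyElementCount L, ToyElementCount R) { return L == R; }

// Walks the rows of the truth table from the review comment.
inline void checkTruthTable() {
  const ToyElementCount A{4, false}; // models <4 x i1>
  const ToyElementCount B{4, false}; // models <4 x i8>
  const ToyElementCount C{4, true};  // models <vscale x 4 x i16>
  const ToyElementCount D{4, true};  // models <vscale x 4 x i32>

  assert(sameSize(A, A) && sameSize(A, B) && sameSize(B, B));
  assert(!sameSize(A, C) && !sameSize(A, D));
  assert(!sameSize(B, C) && !sameSize(B, D));
  assert(sameSize(C, C) && sameSize(C, D) && sameSize(D, D));

  // The old check cannot tell fixed from scalable, so mixed pairs match.
  assert(sameSizeOld(A, C) && sameSizeOld(B, D));
}
```

Calling `checkTruthTable()` in a build with assertions enabled exercises each row. As efriedma points out, this particular call site never sees scalable vectors, so the sketch illustrates the general class of change rather than this hunk.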
VectorType *DestVecTy = cast<VectorType>(DestTy);		VectorType *DestVecTy = cast<VectorType>(DestTy);
Type *DstEltTy = DestVecTy->getElementType();		Type *DstEltTy = DestVecTy->getElementType();
// Fast path for splatted constants.		// Fast path for splatted constants.
if (Constant *Splat = V->getSplatValue()) {		if (Constant *Splat = V->getSplatValue()) {
return ConstantVector::getSplat(		return ConstantVector::getSplat(
cast<VectorType>(DestTy)->getElementCount(),		cast<VectorType>(DestTy)->getElementCount(),
ConstantExpr::getCast(opc, Splat, DstEltTy));		ConstantExpr::getCast(opc, Splat, DstEltTy));
}		}
SmallVector<Constant *, 16> res;		SmallVector<Constant *, 16> res;
Type *Ty = IntegerType::get(V->getContext(), 32);		Type *Ty = IntegerType::get(V->getContext(), 32);
for (unsigned i = 0, e = cast<VectorType>(V->getType())->getNumElements();		for (unsigned i = 0,
		e = cast<FixedVectorType>(V->getType())->getNumElements();
i != e; ++i) {		i != e; ++i) {
Constant *C =		Constant *C =
ConstantExpr::getExtractElement(V, ConstantInt::get(Ty, i));		ConstantExpr::getExtractElement(V, ConstantInt::get(Ty, i));
res.push_back(ConstantExpr::getCast(opc, C, DstEltTy));		res.push_back(ConstantExpr::getCast(opc, C, DstEltTy));
}		}
return ConstantVector::get(res);		return ConstantVector::get(res);
}		}

▲ Show 20 Lines • Show All 146 Lines • ▼ Show 20 Lines
Constant *llvm::ConstantFoldSelectInstruction(Constant *Cond,		Constant *llvm::ConstantFoldSelectInstruction(Constant *Cond,
Constant *V1, Constant *V2) {		Constant *V1, Constant *V2) {
// Check for i1 and vector true/false conditions.		// Check for i1 and vector true/false conditions.
if (Cond->isNullValue()) return V2;		if (Cond->isNullValue()) return V2;
if (Cond->isAllOnesValue()) return V1;		if (Cond->isAllOnesValue()) return V1;

// If the condition is a vector constant, fold the result elementwise.		// If the condition is a vector constant, fold the result elementwise.
if (ConstantVector *CondV = dyn_cast<ConstantVector>(Cond)) {		if (ConstantVector *CondV = dyn_cast<ConstantVector>(Cond)) {
auto *V1VTy = CondV->getType();		auto *V1VTy = cast<FixedVectorType>(CondV->getType());
SmallVector<Constant*, 16> Result;		SmallVector<Constant*, 16> Result;
Type *Ty = IntegerType::get(CondV->getContext(), 32);		Type *Ty = IntegerType::get(CondV->getContext(), 32);
for (unsigned i = 0, e = V1VTy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = V1VTy->getNumElements(); i != e; ++i) {
Constant *V;		Constant *V;
Constant *V1Element = ConstantExpr::getExtractElement(V1,		Constant *V1Element = ConstantExpr::getExtractElement(V1,
ConstantInt::get(Ty, i));		ConstantInt::get(Ty, i));
Constant *V2Element = ConstantExpr::getExtractElement(V2,		Constant *V2Element = ConstantExpr::getExtractElement(V2,
ConstantInt::get(Ty, i));		ConstantInt::get(Ty, i));
Show All 33 Lines	if (FalseVal->getOpcode() == Instruction::Select)
return ConstantExpr::getSelect(Cond, V1, FalseVal->getOperand(2));		return ConstantExpr::getSelect(Cond, V1, FalseVal->getOperand(2));
}		}

return nullptr;		return nullptr;
}		}

Constant *llvm::ConstantFoldExtractElementInstruction(Constant *Val,		Constant *llvm::ConstantFoldExtractElementInstruction(Constant *Val,
Constant *Idx) {		Constant *Idx) {
auto *ValVTy = cast<VectorType>(Val->getType());		auto *ValVTy = cast<FixedVectorType>(Val->getType());

// extractelt undef, C -> undef		// extractelt undef, C -> undef
// extractelt C, undef -> undef		// extractelt C, undef -> undef
if (isa<UndefValue>(Val) || isa<UndefValue>(Idx))		if (isa<UndefValue>(Val) || isa<UndefValue>(Idx))
return UndefValue::get(ValVTy->getElementType());		return UndefValue::get(ValVTy->getElementType());

auto *CIdx = dyn_cast<ConstantInt>(Idx);		auto *CIdx = dyn_cast<ConstantInt>(Idx);
if (!CIdx)		if (!CIdx)
Show All 36 Lines	Constant *llvm::ConstantFoldInsertElementInstruction(Constant *Val,
if (!CIdx) return nullptr;		if (!CIdx) return nullptr;

// Do not iterate on scalable vector. The num of elements is unknown at		// Do not iterate on scalable vector. The num of elements is unknown at
// compile-time.		// compile-time.
VectorType *ValTy = cast<VectorType>(Val->getType());		VectorType *ValTy = cast<VectorType>(Val->getType());
if (isa<ScalableVectorType>(ValTy))		if (isa<ScalableVectorType>(ValTy))
return nullptr;		return nullptr;

unsigned NumElts = cast<VectorType>(Val->getType())->getNumElements();		unsigned NumElts = cast<FixedVectorType>(Val->getType())->getNumElements();
if (CIdx->uge(NumElts))		if (CIdx->uge(NumElts))
return UndefValue::get(Val->getType());		return UndefValue::get(Val->getType());

SmallVector<Constant*, 16> Result;		SmallVector<Constant*, 16> Result;
Result.reserve(NumElts);		Result.reserve(NumElts);
auto *Ty = Type::getInt32Ty(Val->getContext());		auto *Ty = Type::getInt32Ty(Val->getContext());
uint64_t IdxVal = CIdx->getZExtValue();		uint64_t IdxVal = CIdx->getZExtValue();
for (unsigned i = 0; i != NumElts; ++i) {		for (unsigned i = 0; i != NumElts; ++i) {
Show All 30 Lines	Constant *Elt =
ConstantExpr::getExtractElement(V1, ConstantInt::get(Ty, 0));		ConstantExpr::getExtractElement(V1, ConstantInt::get(Ty, 0));
return ConstantVector::getSplat(MaskEltCount, Elt);		return ConstantVector::getSplat(MaskEltCount, Elt);
}		}
// Do not iterate on scalable vector. The num of elements is unknown at		// Do not iterate on scalable vector. The num of elements is unknown at
// compile-time.		// compile-time.
if (isa<ScalableVectorType>(V1VTy))		if (isa<ScalableVectorType>(V1VTy))
return nullptr;		return nullptr;

unsigned SrcNumElts = V1VTy->getNumElements();		unsigned SrcNumElts = cast<FixedVectorType>(V1VTy)->getNumElements();

// Loop over the shuffle mask, evaluating each element.		// Loop over the shuffle mask, evaluating each element.
SmallVector<Constant*, 32> Result;		SmallVector<Constant*, 32> Result;
for (unsigned i = 0; i != MaskNumElts; ++i) {		for (unsigned i = 0; i != MaskNumElts; ++i) {
int Elt = Mask[i];		int Elt = Mask[i];
if (Elt == -1) {		if (Elt == -1) {
Result.push_back(UndefValue::get(EltTy));		Result.push_back(UndefValue::get(EltTy));
continue;		continue;
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	if (ConstantFP *CFP = dyn_cast<ConstantFP>(C)) {
// Fast path for splatted constants.		// Fast path for splatted constants.
if (Constant *Splat = C->getSplatValue()) {		if (Constant *Splat = C->getSplatValue()) {
Constant *Elt = ConstantExpr::get(Opcode, Splat);		Constant *Elt = ConstantExpr::get(Opcode, Splat);
return ConstantVector::getSplat(VTy->getElementCount(), Elt);		return ConstantVector::getSplat(VTy->getElementCount(), Elt);
}		}

// Fold each element and create a vector constant from those constants.		// Fold each element and create a vector constant from those constants.
SmallVector<Constant*, 16> Result;		SmallVector<Constant*, 16> Result;
for (unsigned i = 0, e = VTy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = cast<FixedVectorType>(VTy)->getNumElements();
		i != e; ++i) {
Constant *ExtractIdx = ConstantInt::get(Ty, i);		Constant *ExtractIdx = ConstantInt::get(Ty, i);
Constant *Elt = ConstantExpr::getExtractElement(C, ExtractIdx);		Constant *Elt = ConstantExpr::getExtractElement(C, ExtractIdx);

Result.push_back(ConstantExpr::get(Opcode, Elt));		Result.push_back(ConstantExpr::get(Opcode, Elt));
}		}

return ConstantVector::get(Result);		return ConstantVector::get(Result);
}		}
▲ Show 20 Lines • Show All 356 Lines • ▼ Show 20 Lines	if (Constant *C2Splat = C2->getSplatValue()) {
VTy->getElementCount(),		VTy->getElementCount(),
ConstantExpr::get(Opcode, C1Splat, C2Splat));		ConstantExpr::get(Opcode, C1Splat, C2Splat));
}		}
}		}

// Fold each element and create a vector constant from those constants.		// Fold each element and create a vector constant from those constants.
SmallVector<Constant*, 16> Result;		SmallVector<Constant*, 16> Result;
Type *Ty = IntegerType::get(VTy->getContext(), 32);		Type *Ty = IntegerType::get(VTy->getContext(), 32);
for (unsigned i = 0, e = VTy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = cast<FixedVectorType>(VTy)->getNumElements();
		i != e; ++i) {
Constant *ExtractIdx = ConstantInt::get(Ty, i);		Constant *ExtractIdx = ConstantInt::get(Ty, i);
Constant *LHS = ConstantExpr::getExtractElement(C1, ExtractIdx);		Constant *LHS = ConstantExpr::getExtractElement(C1, ExtractIdx);
Constant *RHS = ConstantExpr::getExtractElement(C2, ExtractIdx);		Constant *RHS = ConstantExpr::getExtractElement(C2, ExtractIdx);

// If any element of a divisor vector is zero, the whole op is undef.		// If any element of a divisor vector is zero, the whole op is undef.
if (Instruction::isIntDivRem(Opcode) && RHS->isNullValue())		if (Instruction::isIntDivRem(Opcode) && RHS->isNullValue())
return UndefValue::get(VTy);		return UndefValue::get(VTy);

▲ Show 20 Lines • Show All 611 Lines • ▼ Show 20 Lines	if (Constant *C1Splat = C1->getSplatValue())
C1VTy->getElementCount(),		C1VTy->getElementCount(),
ConstantExpr::getCompare(pred, C1Splat, C2Splat));		ConstantExpr::getCompare(pred, C1Splat, C2Splat));

// If we can constant fold the comparison of each element, constant fold		// If we can constant fold the comparison of each element, constant fold
// the whole vector comparison.		// the whole vector comparison.
SmallVector<Constant*, 4> ResElts;		SmallVector<Constant*, 4> ResElts;
Type *Ty = IntegerType::get(C1->getContext(), 32);		Type *Ty = IntegerType::get(C1->getContext(), 32);
// Compare the elements, producing an i1 result or constant expr.		// Compare the elements, producing an i1 result or constant expr.
for (unsigned i = 0, e = C1VTy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = cast<FixedVectorType>(C1VTy)->getNumElements();
		i != e; ++i) {
Constant *C1E =		Constant *C1E =
ConstantExpr::getExtractElement(C1, ConstantInt::get(Ty, i));		ConstantExpr::getExtractElement(C1, ConstantInt::get(Ty, i));
Constant *C2E =		Constant *C2E =
ConstantExpr::getExtractElement(C2, ConstantInt::get(Ty, i));		ConstantExpr::getExtractElement(C2, ConstantInt::get(Ty, i));

ResElts.push_back(ConstantExpr::getCompare(pred, C1E, C2E));		ResElts.push_back(ConstantExpr::getCompare(pred, C1E, C2E));
}		}

▲ Show 20 Lines • Show All 256 Lines • ▼ Show 20 Lines	if (C->isNullValue()) {
if (isNull) {		if (isNull) {
PointerType *PtrTy = cast<PointerType>(C->getType()->getScalarType());		PointerType *PtrTy = cast<PointerType>(C->getType()->getScalarType());
Type *Ty = GetElementPtrInst::getIndexedType(PointeeTy, Idxs);		Type *Ty = GetElementPtrInst::getIndexedType(PointeeTy, Idxs);

assert(Ty && "Invalid indices for GEP!");		assert(Ty && "Invalid indices for GEP!");
Type *OrigGEPTy = PointerType::get(Ty, PtrTy->getAddressSpace());		Type *OrigGEPTy = PointerType::get(Ty, PtrTy->getAddressSpace());
Type *GEPTy = PointerType::get(Ty, PtrTy->getAddressSpace());		Type *GEPTy = PointerType::get(Ty, PtrTy->getAddressSpace());
if (VectorType *VT = dyn_cast<VectorType>(C->getType()))		if (VectorType *VT = dyn_cast<VectorType>(C->getType()))
GEPTy = VectorType::get(OrigGEPTy, VT->getNumElements());		GEPTy = VectorType::get(OrigGEPTy, VT);
ctetreau (Author, Done):

Suppose we have some vectors:

```
%a = <4 x i1> undef
%d = <vscale x 4 x i32> undef
```

Here, we get a new vector with the same number of elements as some other vector. Using the implementation on the left, the type returned by `sameShapeWithType(i8, a)` is `<4 x i8>`, and the type returned by `sameShapeWithType(i8, d)` is also `<4 x i8>`. Again, this is a bug. The version on the right (which is just a helper for `VectorType::get(OrigGEPTy, VT->getElementCount())`) will do the correct thing for any vector on the right-hand side. The only case it doesn't handle is the case where you specifically do want to change the vector type. For this case, the old ElementCount overload still exists.

This has the same problems as the `sameSize(l, r)` case above: the gap in test coverage is already huge, so I have the same questions:

1. Is there a reasonable test I can add to make this situation less bad?
2. Given that the policy is that "new features need a corresponding test case", is it worth not fixing this bug?
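The `sameShapeWithType` point can be sketched the same way. Everything below is a hypothetical toy model invented for illustration (`ToyVectorType`, `toString`, `sameShapeOld`, `sameShape`); it does not use LLVM's types and assumes the old code path carried over only the minimum element count and always produced a fixed-width vector.

```cpp
#include <cassert>
#include <string>

// Hypothetical toy model of a vector type (not LLVM's VectorType).
struct ToyVectorType {
  std::string Elt;  // element type name, e.g. "i8"
  unsigned MinElts; // minimum number of elements
  bool Scalable;    // true for <vscale x N x T>
};

// Renders the toy type in LLVM IR-like syntax.
inline std::string toString(const ToyVectorType &T) {
  return std::string("<") + (T.Scalable ? "vscale x " : "") +
         std::to_string(T.MinElts) + " x " + T.Elt + ">";
}

// Old style: only the element count is carried over, so the result is
// always a fixed-width vector, even when Other is scalable.
inline ToyVectorType sameShapeOld(const std::string &Elt,
                                  const ToyVectorType &Other) {
  return ToyVectorType{Elt, Other.MinElts, false};
}

// New style: the full shape (count and scalable flag) is carried over.
inline ToyVectorType sameShape(const std::string &Elt,
                               const ToyVectorType &Other) {
  return ToyVectorType{Elt, Other.MinElts, Other.Scalable};
}
```

In this model, `sameShapeOld("i8", D)` for a scalable `D` yields a fixed-width type, which is the behavior the comment calls a bug, while `sameShape("i8", D)` keeps the `vscale` prefix.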

// The GEP returns a vector of pointers when one or more of		// The GEP returns a vector of pointers when one or more of
// its arguments is a vector.		// its arguments is a vector.
for (unsigned i = 0, e = Idxs.size(); i != e; ++i) {		for (unsigned i = 0, e = Idxs.size(); i != e; ++i) {
if (auto *VT = dyn_cast<VectorType>(Idxs[i]->getType())) {		if (auto *VT = dyn_cast<VectorType>(Idxs[i]->getType())) {
GEPTy = VectorType::get(OrigGEPTy, VT->getNumElements());		GEPTy = VectorType::get(OrigGEPTy, VT);
break;		break;
}		}
}		}

return Constant::getNullValue(GEPTy);		return Constant::getNullValue(GEPTy);
}		}
}		}

▲ Show 20 Lines • Show All 190 Lines • ▼ Show 20 Lines	for (unsigned i = 1, e = Idxs.size(); i != e;
auto *PrevIdx =		auto *PrevIdx =
NewIdxs[i - 1] ? NewIdxs[i - 1] : cast<Constant>(Idxs[i - 1]);		NewIdxs[i - 1] ? NewIdxs[i - 1] : cast<Constant>(Idxs[i - 1]);
bool IsCurrIdxVector = CurrIdx->getType()->isVectorTy();		bool IsCurrIdxVector = CurrIdx->getType()->isVectorTy();
bool IsPrevIdxVector = PrevIdx->getType()->isVectorTy();		bool IsPrevIdxVector = PrevIdx->getType()->isVectorTy();
bool UseVector = IsCurrIdxVector || IsPrevIdxVector;		bool UseVector = IsCurrIdxVector || IsPrevIdxVector;

if (!IsCurrIdxVector && IsPrevIdxVector)		if (!IsCurrIdxVector && IsPrevIdxVector)
CurrIdx = ConstantDataVector::getSplat(		CurrIdx = ConstantDataVector::getSplat(
cast<VectorType>(PrevIdx->getType())->getNumElements(), CurrIdx);		cast<FixedVectorType>(PrevIdx->getType())->getNumElements(), CurrIdx);

if (!IsPrevIdxVector && IsCurrIdxVector)		if (!IsPrevIdxVector && IsCurrIdxVector)
PrevIdx = ConstantDataVector::getSplat(		PrevIdx = ConstantDataVector::getSplat(
cast<VectorType>(CurrIdx->getType())->getNumElements(), PrevIdx);		cast<FixedVectorType>(CurrIdx->getType())->getNumElements(), PrevIdx);

Constant *Factor =		Constant *Factor =
ConstantInt::get(CurrIdx->getType()->getScalarType(), NumElements);		ConstantInt::get(CurrIdx->getType()->getScalarType(), NumElements);
if (UseVector)		if (UseVector)
Factor = ConstantDataVector::getSplat(		Factor = ConstantDataVector::getSplat(
IsPrevIdxVector		IsPrevIdxVector
? cast<VectorType>(PrevIdx->getType())->getNumElements()		? cast<FixedVectorType>(PrevIdx->getType())->getNumElements()
: cast<VectorType>(CurrIdx->getType())->getNumElements(),		: cast<FixedVectorType>(CurrIdx->getType())->getNumElements(),
Factor);		Factor);

NewIdxs[i] = ConstantExpr::getSRem(CurrIdx, Factor);		NewIdxs[i] = ConstantExpr::getSRem(CurrIdx, Factor);

Constant *Div = ConstantExpr::getSDiv(CurrIdx, Factor);		Constant *Div = ConstantExpr::getSDiv(CurrIdx, Factor);

unsigned CommonExtendedWidth =		unsigned CommonExtendedWidth =
std::max(PrevIdx->getType()->getScalarSizeInBits(),		std::max(PrevIdx->getType()->getScalarSizeInBits(),
Div->getType()->getScalarSizeInBits());		Div->getType()->getScalarSizeInBits());
CommonExtendedWidth = std::max(CommonExtendedWidth, 64U);		CommonExtendedWidth = std::max(CommonExtendedWidth, 64U);

// Before adding, extend both operands to i64 to avoid		// Before adding, extend both operands to i64 to avoid
// overflow trouble.		// overflow trouble.
Type *ExtendedTy = Type::getIntNTy(Div->getContext(), CommonExtendedWidth);		Type *ExtendedTy = Type::getIntNTy(Div->getContext(), CommonExtendedWidth);
if (UseVector)		if (UseVector)
ExtendedTy = VectorType::get(		ExtendedTy = VectorType::get(
ExtendedTy,		ExtendedTy, IsPrevIdxVector ? cast<VectorType>(PrevIdx->getType())
IsPrevIdxVector		: cast<VectorType>(CurrIdx->getType()));
? cast<VectorType>(PrevIdx->getType())->getNumElements()
: cast<VectorType>(CurrIdx->getType())->getNumElements());

if (!PrevIdx->getType()->isIntOrIntVectorTy(CommonExtendedWidth))		if (!PrevIdx->getType()->isIntOrIntVectorTy(CommonExtendedWidth))
PrevIdx = ConstantExpr::getSExt(PrevIdx, ExtendedTy);		PrevIdx = ConstantExpr::getSExt(PrevIdx, ExtendedTy);

if (!Div->getType()->isIntOrIntVectorTy(CommonExtendedWidth))		if (!Div->getType()->isIntOrIntVectorTy(CommonExtendedWidth))
Div = ConstantExpr::getSExt(Div, ExtendedTy);		Div = ConstantExpr::getSExt(Div, ExtendedTy);

NewIdxs[i - 1] = ConstantExpr::getAdd(PrevIdx, Div);		NewIdxs[i - 1] = ConstantExpr::getAdd(PrevIdx, Div);
Show All 20 Lines

llvm/lib/IR/Constants.cpp

Show First 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	if (const ConstantInt *CI = dyn_cast<ConstantInt>(this))
return !CI->isOneValue();		return !CI->isOneValue();

// Check for FP which are bitcasted from 1 integers		// Check for FP which are bitcasted from 1 integers
if (const ConstantFP *CFP = dyn_cast<ConstantFP>(this))		if (const ConstantFP *CFP = dyn_cast<ConstantFP>(this))
return !CFP->getValueAPF().bitcastToAPInt().isOneValue();		return !CFP->getValueAPF().bitcastToAPInt().isOneValue();

// Check that vectors don't contain 1		// Check that vectors don't contain 1
if (auto *VTy = dyn_cast<VectorType>(this->getType())) {		if (auto *VTy = dyn_cast<VectorType>(this->getType())) {
unsigned NumElts = VTy->getNumElements();		unsigned NumElts = cast<FixedVectorType>(VTy)->getNumElements();
for (unsigned i = 0; i != NumElts; ++i) {		for (unsigned i = 0; i != NumElts; ++i) {
Constant *Elt = this->getAggregateElement(i);		Constant *Elt = this->getAggregateElement(i);
if (!Elt || !Elt->isNotOneValue())		if (!Elt || !Elt->isNotOneValue())
return false;		return false;
}		}
return true;		return true;
}		}

Show All 33 Lines	if (const ConstantInt *CI = dyn_cast<ConstantInt>(this))
return !CI->isMinValue(/*isSigned=*/true);		return !CI->isMinValue(/*isSigned=*/true);

// Check for FP which are bitcasted from INT_MIN integers		// Check for FP which are bitcasted from INT_MIN integers
if (const ConstantFP *CFP = dyn_cast<ConstantFP>(this))		if (const ConstantFP *CFP = dyn_cast<ConstantFP>(this))
return !CFP->getValueAPF().bitcastToAPInt().isMinSignedValue();		return !CFP->getValueAPF().bitcastToAPInt().isMinSignedValue();

// Check that vectors don't contain INT_MIN		// Check that vectors don't contain INT_MIN
if (auto *VTy = dyn_cast<VectorType>(this->getType())) {		if (auto *VTy = dyn_cast<VectorType>(this->getType())) {
unsigned NumElts = VTy->getNumElements();		unsigned NumElts = cast<FixedVectorType>(VTy)->getNumElements();
for (unsigned i = 0; i != NumElts; ++i) {		for (unsigned i = 0; i != NumElts; ++i) {
Constant *Elt = this->getAggregateElement(i);		Constant *Elt = this->getAggregateElement(i);
if (!Elt || !Elt->isNotMinSignedValue())		if (!Elt || !Elt->isNotMinSignedValue())
return false;		return false;
}		}
return true;		return true;
}		}

// It may contain INT_MIN, we can't tell.		// It may contain INT_MIN, we can't tell.
return false;		return false;
}		}

bool Constant::isFiniteNonZeroFP() const {		bool Constant::isFiniteNonZeroFP() const {
if (auto *CFP = dyn_cast<ConstantFP>(this))		if (auto *CFP = dyn_cast<ConstantFP>(this))
return CFP->getValueAPF().isFiniteNonZero();		return CFP->getValueAPF().isFiniteNonZero();
auto *VTy = dyn_cast<VectorType>(getType());		auto *VTy = dyn_cast<VectorType>(getType());
if (!VTy)		if (!VTy)
return false;		return false;
for (unsigned i = 0, e = VTy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = cast<FixedVectorType>(VTy)->getNumElements(); i != e;
		++i) {
auto *CFP = dyn_cast_or_null<ConstantFP>(this->getAggregateElement(i));		auto *CFP = dyn_cast_or_null<ConstantFP>(this->getAggregateElement(i));
if (!CFP || !CFP->getValueAPF().isFiniteNonZero())		if (!CFP || !CFP->getValueAPF().isFiniteNonZero())
return false;		return false;
}		}
return true;		return true;
}		}

bool Constant::isNormalFP() const {		bool Constant::isNormalFP() const {
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	bool Constant::isElementWiseEqual(Value *Y) const {
Constant *C0 = ConstantExpr::getBitCast(const_cast<Constant *>(this), IntTy);		Constant *C0 = ConstantExpr::getBitCast(const_cast<Constant *>(this), IntTy);
Constant *C1 = ConstantExpr::getBitCast(cast<Constant>(Y), IntTy);		Constant *C1 = ConstantExpr::getBitCast(cast<Constant>(Y), IntTy);
Constant *CmpEq = ConstantExpr::getICmp(ICmpInst::ICMP_EQ, C0, C1);		Constant *CmpEq = ConstantExpr::getICmp(ICmpInst::ICMP_EQ, C0, C1);
return isa<UndefValue>(CmpEq) || match(CmpEq, m_One());		return isa<UndefValue>(CmpEq) || match(CmpEq, m_One());
}		}

bool Constant::containsUndefElement() const {		bool Constant::containsUndefElement() const {
if (auto *VTy = dyn_cast<VectorType>(getType())) {		if (auto *VTy = dyn_cast<VectorType>(getType())) {
for (unsigned i = 0, e = VTy->getNumElements(); i != e; ++i)		for (unsigned i = 0, e = cast<FixedVectorType>(VTy)->getNumElements();
		i != e; ++i)
if (isa<UndefValue>(getAggregateElement(i)))		if (isa<UndefValue>(getAggregateElement(i)))
return true;		return true;
}		}

return false;		return false;
}		}

bool Constant::containsConstantExpression() const {		bool Constant::containsConstantExpression() const {
if (auto *VTy = dyn_cast<VectorType>(getType())) {		if (auto *VTy = dyn_cast<VectorType>(getType())) {
for (unsigned i = 0, e = VTy->getNumElements(); i != e; ++i)		for (unsigned i = 0, e = cast<FixedVectorType>(VTy)->getNumElements();
		i != e; ++i)
if (isa<ConstantExpr>(getAggregateElement(i)))		if (isa<ConstantExpr>(getAggregateElement(i)))
return true;		return true;
}		}

return false;		return false;
}		}

/// Constructor to create a '0' constant of arbitrary type.		/// Constructor to create a '0' constant of arbitrary type.
▲ Show 20 Lines • Show All 621 Lines • ▼ Show 20 Lines	if (isa<ArrayType>(getType()) || isa<VectorType>(getType()))
return getSequentialElement();		return getSequentialElement();
return getStructElement(Idx);		return getStructElement(Idx);
}		}

unsigned ConstantAggregateZero::getNumElements() const {		unsigned ConstantAggregateZero::getNumElements() const {
Type *Ty = getType();		Type *Ty = getType();
if (auto *AT = dyn_cast<ArrayType>(Ty))		if (auto *AT = dyn_cast<ArrayType>(Ty))
return AT->getNumElements();		return AT->getNumElements();
if (auto *VT = dyn_cast<VectorType>(Ty))		if (auto *VT = dyn_cast<FixedVectorType>(Ty))
return VT->getNumElements();		return VT->getNumElements();
return Ty->getStructNumElements();		return Ty->getStructNumElements();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// UndefValue Implementation		// UndefValue Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

Show All 18 Lines	if (isa<ArrayType>(getType()) || isa<VectorType>(getType()))
return getSequentialElement();		return getSequentialElement();
return getStructElement(Idx);		return getStructElement(Idx);
}		}

unsigned UndefValue::getNumElements() const {		unsigned UndefValue::getNumElements() const {
Type *Ty = getType();		Type *Ty = getType();
if (auto *AT = dyn_cast<ArrayType>(Ty))		if (auto *AT = dyn_cast<ArrayType>(Ty))
return AT->getNumElements();		return AT->getNumElements();
if (auto *VT = dyn_cast<VectorType>(Ty))		if (auto *VT = dyn_cast<FixedVectorType>(Ty))
return VT->getNumElements();		return VT->getNumElements();
return Ty->getStructNumElements();		return Ty->getStructNumElements();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ConstantXXX Classes		// ConstantXXX Classes
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	Constant *ConstantStruct::get(StructType *ST, ArrayRef<Constant*> V) {
if (isUndef)		if (isUndef)
return UndefValue::get(ST);		return UndefValue::get(ST);

return ST->getContext().pImpl->StructConstants.getOrCreate(ST, V);		return ST->getContext().pImpl->StructConstants.getOrCreate(ST, V);
}		}

ConstantVector::ConstantVector(VectorType *T, ArrayRef<Constant *> V)		ConstantVector::ConstantVector(VectorType *T, ArrayRef<Constant *> V)
: ConstantAggregate(T, ConstantVectorVal, V) {		: ConstantAggregate(T, ConstantVectorVal, V) {
assert(V.size() == T->getNumElements() &&		assert(V.size() == cast<FixedVectorType>(T)->getNumElements() &&
"Invalid initializer for constant vector");		"Invalid initializer for constant vector");
}		}

// ConstantVector accessors.		// ConstantVector accessors.
Constant *ConstantVector::get(ArrayRef<Constant*> V) {		Constant *ConstantVector::get(ArrayRef<Constant*> V) {
if (Constant *C = getImpl(V))		if (Constant *C = getImpl(V))
return C;		return C;
VectorType *Ty = VectorType::get(V.front()->getType(), V.size());		VectorType *Ty = VectorType::get(V.front()->getType(), V.size());
▲ Show 20 Lines • Show All 708 Lines • ▼ Show 20 Lines
Constant *ConstantExpr::getPtrToInt(Constant *C, Type *DstTy,		Constant *ConstantExpr::getPtrToInt(Constant *C, Type *DstTy,
bool OnlyIfReduced) {		bool OnlyIfReduced) {
assert(C->getType()->isPtrOrPtrVectorTy() &&		assert(C->getType()->isPtrOrPtrVectorTy() &&
"PtrToInt source must be pointer or pointer vector");		"PtrToInt source must be pointer or pointer vector");
assert(DstTy->isIntOrIntVectorTy() &&		assert(DstTy->isIntOrIntVectorTy() &&
"PtrToInt destination must be integer or integer vector");		"PtrToInt destination must be integer or integer vector");
assert(isa<VectorType>(C->getType()) == isa<VectorType>(DstTy));		assert(isa<VectorType>(C->getType()) == isa<VectorType>(DstTy));
if (isa<VectorType>(C->getType()))		if (isa<VectorType>(C->getType()))
assert(cast<VectorType>(C->getType())->getNumElements() ==		assert(cast<VectorType>(C->getType())->getElementCount() ==
cast<VectorType>(DstTy)->getNumElements() &&		cast<VectorType>(DstTy)->getElementCount() &&
		ctetreau (Author): I should have made this one call getElementCount.
"Invalid cast between a different number of vector elements");		"Invalid cast between a different number of vector elements");
return getFoldedCast(Instruction::PtrToInt, C, DstTy, OnlyIfReduced);		return getFoldedCast(Instruction::PtrToInt, C, DstTy, OnlyIfReduced);
}		}

Constant *ConstantExpr::getIntToPtr(Constant *C, Type *DstTy,		Constant *ConstantExpr::getIntToPtr(Constant *C, Type *DstTy,
bool OnlyIfReduced) {		bool OnlyIfReduced) {
assert(C->getType()->isIntOrIntVectorTy() &&		assert(C->getType()->isIntOrIntVectorTy() &&
"IntToPtr source must be integer or integer vector");		"IntToPtr source must be integer or integer vector");
assert(DstTy->isPtrOrPtrVectorTy() &&		assert(DstTy->isPtrOrPtrVectorTy() &&
"IntToPtr destination must be a pointer or pointer vector");		"IntToPtr destination must be a pointer or pointer vector");
assert(isa<VectorType>(C->getType()) == isa<VectorType>(DstTy));		assert(isa<VectorType>(C->getType()) == isa<VectorType>(DstTy));
if (isa<VectorType>(C->getType()))		if (isa<VectorType>(C->getType()))
assert(cast<VectorType>(C->getType())->getNumElements() ==		assert(cast<VectorType>(C->getType())->getElementCount() ==
cast<VectorType>(DstTy)->getNumElements() &&		cast<VectorType>(DstTy)->getElementCount() &&
		ctetreau (Author): I should have made this one call getElementCount.
"Invalid cast between a different number of vector elements");		"Invalid cast between a different number of vector elements");
return getFoldedCast(Instruction::IntToPtr, C, DstTy, OnlyIfReduced);		return getFoldedCast(Instruction::IntToPtr, C, DstTy, OnlyIfReduced);
}		}

Constant *ConstantExpr::getBitCast(Constant *C, Type *DstTy,		Constant *ConstantExpr::getBitCast(Constant *C, Type *DstTy,
bool OnlyIfReduced) {		bool OnlyIfReduced) {
assert(CastInst::castIsValid(Instruction::BitCast, C, DstTy) &&		assert(CastInst::castIsValid(Instruction::BitCast, C, DstTy) &&
"Invalid constantexpr bitcast!");		"Invalid constantexpr bitcast!");
Show All 14 Lines	Constant *ConstantExpr::getAddrSpaceCast(Constant *C, Type *DstTy,
// bitcasting the pointer type and then converting the address space.		// bitcasting the pointer type and then converting the address space.
PointerType *SrcScalarTy = cast<PointerType>(C->getType()->getScalarType());		PointerType *SrcScalarTy = cast<PointerType>(C->getType()->getScalarType());
PointerType *DstScalarTy = cast<PointerType>(DstTy->getScalarType());		PointerType *DstScalarTy = cast<PointerType>(DstTy->getScalarType());
Type *DstElemTy = DstScalarTy->getElementType();		Type *DstElemTy = DstScalarTy->getElementType();
if (SrcScalarTy->getElementType() != DstElemTy) {		if (SrcScalarTy->getElementType() != DstElemTy) {
Type *MidTy = PointerType::get(DstElemTy, SrcScalarTy->getAddressSpace());		Type *MidTy = PointerType::get(DstElemTy, SrcScalarTy->getAddressSpace());
if (VectorType *VT = dyn_cast<VectorType>(DstTy)) {		if (VectorType *VT = dyn_cast<VectorType>(DstTy)) {
// Handle vectors of pointers.		// Handle vectors of pointers.
MidTy = VectorType::get(MidTy, VT->getNumElements());		MidTy = VectorType::get(MidTy, VT);
}		}
C = getBitCast(C, MidTy);		C = getBitCast(C, MidTy);
}		}
return getFoldedCast(Instruction::AddrSpaceCast, C, DstTy, OnlyIfReduced);		return getFoldedCast(Instruction::AddrSpaceCast, C, DstTy, OnlyIfReduced);
}		}

Constant *ConstantExpr::get(unsigned Opcode, Constant *C, unsigned Flags,		Constant *ConstantExpr::get(unsigned Opcode, Constant *C, unsigned Flags,
Type *OnlyIfReducedTy) {		Type *OnlyIfReducedTy) {
▲ Show 20 Lines • Show All 619 Lines • ▼ Show 20 Lines	if (auto *IT = dyn_cast<IntegerType>(Ty)) {
}		}
}		}
return false;		return false;
}		}

unsigned ConstantDataSequential::getNumElements() const {		unsigned ConstantDataSequential::getNumElements() const {
if (ArrayType *AT = dyn_cast<ArrayType>(getType()))		if (ArrayType *AT = dyn_cast<ArrayType>(getType()))
return AT->getNumElements();		return AT->getNumElements();
return cast<VectorType>(getType())->getNumElements();		return cast<FixedVectorType>(getType())->getNumElements();
}		}


uint64_t ConstantDataSequential::getElementByteSize() const {		uint64_t ConstantDataSequential::getElementByteSize() const {
return getElementType()->getPrimitiveSizeInBits()/8;		return getElementType()->getPrimitiveSizeInBits()/8;
}		}

/// Return the start of the specified element.		/// Return the start of the specified element.
▲ Show 20 Lines • Show All 593 Lines • Show Last 20 Lines

llvm/lib/IR/DataLayout.cpp

Show First 20 Lines • Show All 551 Lines • ▼ Show 20 Lines	if (AlignType == INTEGER_ALIGN) {
if (I != Alignments.begin()) {		if (I != Alignments.begin()) {
--I; // Go to the previous entry and see if its an integer.		--I; // Go to the previous entry and see if its an integer.
if (I->AlignType == INTEGER_ALIGN)		if (I->AlignType == INTEGER_ALIGN)
return ABIInfo ? I->ABIAlign : I->PrefAlign;		return ABIInfo ? I->ABIAlign : I->PrefAlign;
}		}
} else if (AlignType == VECTOR_ALIGN) {		} else if (AlignType == VECTOR_ALIGN) {
// By default, use natural alignment for vector types. This is consistent		// By default, use natural alignment for vector types. This is consistent
// with what clang and llvm-gcc do.		// with what clang and llvm-gcc do.
unsigned Alignment =		auto *FVTy = cast<FixedVectorType>(Ty);
getTypeAllocSize(cast<VectorType>(Ty)->getElementType());		unsigned Alignment = getTypeAllocSize(FVTy->getElementType());
Alignment *= cast<VectorType>(Ty)->getNumElements();		Alignment *= FVTy->getNumElements();
Alignment = PowerOf2Ceil(Alignment);		Alignment = PowerOf2Ceil(Alignment);
return Align(Alignment);		return Align(Alignment);
}		}

// If we still couldn't find a reasonable default alignment, fall back		// If we still couldn't find a reasonable default alignment, fall back
// to a simple heuristic that the alignment is the first power of two		// to a simple heuristic that the alignment is the first power of two
// greater-or-equal to the store size of the type. This is a reasonable		// greater-or-equal to the store size of the type. This is a reasonable
// approximation of reality, and if the user wanted something less		// approximation of reality, and if the user wanted something less
▲ Show 20 Lines • Show All 212 Lines • ▼ Show 20 Lines
}		}

Type *DataLayout::getIntPtrType(Type *Ty) const {		Type *DataLayout::getIntPtrType(Type *Ty) const {
assert(Ty->isPtrOrPtrVectorTy() &&		assert(Ty->isPtrOrPtrVectorTy() &&
"Expected a pointer or pointer vector type.");		"Expected a pointer or pointer vector type.");
unsigned NumBits = getPointerTypeSizeInBits(Ty);		unsigned NumBits = getPointerTypeSizeInBits(Ty);
IntegerType *IntTy = IntegerType::get(Ty->getContext(), NumBits);		IntegerType *IntTy = IntegerType::get(Ty->getContext(), NumBits);
if (VectorType *VecTy = dyn_cast<VectorType>(Ty))		if (VectorType *VecTy = dyn_cast<VectorType>(Ty))
return VectorType::get(IntTy, VecTy->getNumElements());		return VectorType::get(IntTy, VecTy);
return IntTy;		return IntTy;
}		}

Type *DataLayout::getSmallestLegalIntType(LLVMContext &C, unsigned Width) const {		Type *DataLayout::getSmallestLegalIntType(LLVMContext &C, unsigned Width) const {
for (unsigned LegalIntWidth : LegalIntWidths)		for (unsigned LegalIntWidth : LegalIntWidths)
if (Width <= LegalIntWidth)		if (Width <= LegalIntWidth)
return Type::getIntNTy(C, LegalIntWidth);		return Type::getIntNTy(C, LegalIntWidth);
return nullptr;		return nullptr;
}		}

unsigned DataLayout::getLargestLegalIntTypeSizeInBits() const {		unsigned DataLayout::getLargestLegalIntTypeSizeInBits() const {
auto Max = std::max_element(LegalIntWidths.begin(), LegalIntWidths.end());		auto Max = std::max_element(LegalIntWidths.begin(), LegalIntWidths.end());
return Max != LegalIntWidths.end() ? *Max : 0;		return Max != LegalIntWidths.end() ? *Max : 0;
}		}

Type *DataLayout::getIndexType(Type *Ty) const {		Type *DataLayout::getIndexType(Type *Ty) const {
assert(Ty->isPtrOrPtrVectorTy() &&		assert(Ty->isPtrOrPtrVectorTy() &&
"Expected a pointer or pointer vector type.");		"Expected a pointer or pointer vector type.");
unsigned NumBits = getIndexTypeSizeInBits(Ty);		unsigned NumBits = getIndexTypeSizeInBits(Ty);
IntegerType *IntTy = IntegerType::get(Ty->getContext(), NumBits);		IntegerType *IntTy = IntegerType::get(Ty->getContext(), NumBits);
if (VectorType *VecTy = dyn_cast<VectorType>(Ty))		if (VectorType *VecTy = dyn_cast<VectorType>(Ty))
return VectorType::get(IntTy, VecTy->getNumElements());		return VectorType::get(IntTy, VecTy);
return IntTy;		return IntTy;
}		}

int64_t DataLayout::getIndexedOffsetInType(Type *ElemTy,		int64_t DataLayout::getIndexedOffsetInType(Type *ElemTy,
ArrayRef<Value *> Indices) const {		ArrayRef<Value *> Indices) const {
int64_t Result = 0;		int64_t Result = 0;

generic_gep_type_iterator<Value* const*>		generic_gep_type_iterator<Value* const*>
▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

llvm/lib/IR/Function.cpp

Show First 20 Lines • Show All 1,069 Lines • ▼ Show 20 Lines	case IITDescriptor::VecOfBitcastsToInt: {
assert(VTy && "Expected an argument of Vector Type");		assert(VTy && "Expected an argument of Vector Type");
return VectorType::getInteger(VTy);		return VectorType::getInteger(VTy);
}		}
case IITDescriptor::VecOfAnyPtrsToElt:		case IITDescriptor::VecOfAnyPtrsToElt:
// Return the overloaded type (which determines the pointers address space)		// Return the overloaded type (which determines the pointers address space)
return Tys[D.getOverloadArgNumber()];		return Tys[D.getOverloadArgNumber()];
case IITDescriptor::ScalableVecArgument: {		case IITDescriptor::ScalableVecArgument: {
auto *Ty = cast<VectorType>(DecodeFixedType(Infos, Tys, Context));		auto *Ty = cast<VectorType>(DecodeFixedType(Infos, Tys, Context));
return VectorType::get(Ty->getElementType(), {Ty->getNumElements(), true});		// FIXME: will Ty ever not have getElementCount().Scalable == false?
		return VectorType::get(Ty->getElementType(),
		{Ty->getElementCount().Min, true});
}		}
}		}
llvm_unreachable("unhandled");		llvm_unreachable("unhandled");
}		}

FunctionType *Intrinsic::getType(LLVMContext &Context,		FunctionType *Intrinsic::getType(LLVMContext &Context,
ID id, ArrayRef<Type*> Tys) {		ID id, ArrayRef<Type*> Tys) {
SmallVector<IITDescriptor, 8> Table;		SmallVector<IITDescriptor, 8> Table;
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	switch (D.Kind) {
case IITDescriptor::Metadata: return !Ty->isMetadataTy();		case IITDescriptor::Metadata: return !Ty->isMetadataTy();
case IITDescriptor::Half: return !Ty->isHalfTy();		case IITDescriptor::Half: return !Ty->isHalfTy();
case IITDescriptor::Float: return !Ty->isFloatTy();		case IITDescriptor::Float: return !Ty->isFloatTy();
case IITDescriptor::Double: return !Ty->isDoubleTy();		case IITDescriptor::Double: return !Ty->isDoubleTy();
case IITDescriptor::Quad: return !Ty->isFP128Ty();		case IITDescriptor::Quad: return !Ty->isFP128Ty();
case IITDescriptor::Integer: return !Ty->isIntegerTy(D.Integer_Width);		case IITDescriptor::Integer: return !Ty->isIntegerTy(D.Integer_Width);
case IITDescriptor::Vector: {		case IITDescriptor::Vector: {
VectorType *VT = dyn_cast<VectorType>(Ty);		VectorType *VT = dyn_cast<VectorType>(Ty);
return !VT || VT->getNumElements() != D.Vector_Width ||		return !VT ||
		cast<FixedVectorType>(VT)->getNumElements() != D.Vector_Width ||
matchIntrinsicType(VT->getElementType(), Infos, ArgTys,		matchIntrinsicType(VT->getElementType(), Infos, ArgTys,
DeferredChecks, IsDeferredCheck);		DeferredChecks, IsDeferredCheck);
}		}
case IITDescriptor::Pointer: {		case IITDescriptor::Pointer: {
PointerType *PT = dyn_cast<PointerType>(Ty);		PointerType *PT = dyn_cast<PointerType>(Ty);
return !PT || PT->getAddressSpace() != D.Pointer_AddressSpace ||		return !PT || PT->getAddressSpace() != D.Pointer_AddressSpace ||
matchIntrinsicType(PT->getElementType(), Infos, ArgTys,		matchIntrinsicType(PT->getElementType(), Infos, ArgTys,
DeferredChecks, IsDeferredCheck);		DeferredChecks, IsDeferredCheck);
▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	case IITDescriptor::VecOfAnyPtrsToElt: {
}		}

// Verify the overloaded type "matches" the Ref type.		// Verify the overloaded type "matches" the Ref type.
// i.e. Ty is a vector with the same width as Ref.		// i.e. Ty is a vector with the same width as Ref.
// Composed of pointers to the same element type as Ref.		// Composed of pointers to the same element type as Ref.
VectorType *ReferenceType = dyn_cast<VectorType>(ArgTys[RefArgNumber]);		VectorType *ReferenceType = dyn_cast<VectorType>(ArgTys[RefArgNumber]);
VectorType *ThisArgVecTy = dyn_cast<VectorType>(Ty);		VectorType *ThisArgVecTy = dyn_cast<VectorType>(Ty);
if (!ThisArgVecTy || !ReferenceType ||		if (!ThisArgVecTy || !ReferenceType ||
(ReferenceType->getNumElements() != ThisArgVecTy->getNumElements()))		(ReferenceType->getElementCount() != ThisArgVecTy->getElementCount()))
		ctetreau (Author): This is basically the same as the `==` case. I think `<`, `<=`, `>=`, and `>` should probably still call `getNumElements()`, since `{2, true} > {3, false}`, while possible to define in C++, is kind of nonsensical. Maybe those operators can assert that the scalability is the same? (I think this is probably a bad idea.)
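To make the comparison question in the comment above concrete, here is a minimal, self-contained sketch. `ElementCountSketch` and `lessThanSameKind` are hypothetical names invented for illustration; they only mirror the `{Min, Scalable}` pair used in this diff and are not the LLVM API. Equality is well defined across fixed and scalable counts, whereas the ordering helper shows the "assert that the scalability is the same" idea the author floats (and doubts).

```cpp
#include <cassert>

// Hypothetical stand-in for the {Min, Scalable} element count seen in this
// diff. This is NOT llvm::ElementCount, only an illustration.
struct ElementCountSketch {
  unsigned Min;  // minimum number of elements
  bool Scalable; // true if the real count is Min multiplied by vscale
};

// Equality is unambiguous: both fields must agree.
inline bool operator==(const ElementCountSketch &A,
                       const ElementCountSketch &B) {
  return A.Min == B.Min && A.Scalable == B.Scalable;
}

inline bool operator!=(const ElementCountSketch &A,
                       const ElementCountSketch &B) {
  return !(A == B);
}

// Assumed design, following the idea in the comment: only order two counts
// of the same kind, and assert otherwise.
inline bool lessThanSameKind(const ElementCountSketch &A,
                             const ElementCountSketch &B) {
  assert(A.Scalable == B.Scalable &&
         "ordering a fixed count against a scalable count is not meaningful");
  return A.Min < B.Min;
}
```

With this shape, `{4, true} != {4, false}` holds even though the minimum counts match, which is why the `!=` check in this hunk can use `getElementCount()` without changing behavior for fixed vectors.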
return true;		return true;
PointerType *ThisArgEltTy =		PointerType *ThisArgEltTy =
dyn_cast<PointerType>(ThisArgVecTy->getElementType());		dyn_cast<PointerType>(ThisArgVecTy->getElementType());
if (!ThisArgEltTy)		if (!ThisArgEltTy)
return true;		return true;
return ThisArgEltTy->getElementType() != ReferenceType->getElementType();		return ThisArgEltTy->getElementType() != ReferenceType->getElementType();
}		}
case IITDescriptor::VecElementArgument: {		case IITDescriptor::VecElementArgument: {
▲ Show 20 Lines • Show All 313 Lines • Show Last 20 Lines

llvm/lib/IR/IRBuilder.cpp

	Show First 20 Lines • Show All 518 Lines • ▼ Show 20 Lines
	/// \p PassThru - pass-through value that is used to fill the masked-off lanes			/// \p PassThru - pass-through value that is used to fill the masked-off lanes
	/// of the result			/// of the result
	/// \p Name - name of the result variable			/// \p Name - name of the result variable
	CallInst *IRBuilderBase::CreateMaskedGather(Value *Ptrs, Align Alignment,			CallInst *IRBuilderBase::CreateMaskedGather(Value *Ptrs, Align Alignment,
	Value *Mask, Value *PassThru,			Value *Mask, Value *PassThru,
	const Twine &Name) {			const Twine &Name) {
	auto PtrsTy = cast<VectorType>(Ptrs->getType());			auto PtrsTy = cast<VectorType>(Ptrs->getType());
	auto PtrTy = cast<PointerType>(PtrsTy->getElementType());			auto PtrTy = cast<PointerType>(PtrsTy->getElementType());
	unsigned NumElts = PtrsTy->getNumElements();			Type *DataTy = VectorType::get(PtrTy->getElementType(), PtrsTy);
	Type *DataTy = VectorType::get(PtrTy->getElementType(), NumElts);

	if (!Mask)			if (!Mask)
	Mask = Constant::getAllOnesValue(VectorType::get(Type::getInt1Ty(Context),			Mask = Constant::getAllOnesValue(
	NumElts));			VectorType::get(Type::getInt1Ty(Context), PtrsTy));

	if (!PassThru)			if (!PassThru)
	PassThru = UndefValue::get(DataTy);			PassThru = UndefValue::get(DataTy);

	Type *OverloadedTypes[] = {DataTy, PtrsTy};			Type *OverloadedTypes[] = {DataTy, PtrsTy};
	Value *Ops[] = {Ptrs, getInt32(Alignment.value()), Mask, PassThru};			Value *Ops[] = {Ptrs, getInt32(Alignment.value()), Mask, PassThru};

	// We specify only one type when we create this intrinsic. Types of other			// We specify only one type when we create this intrinsic. Types of other
	// arguments are derived from this type.			// arguments are derived from this type.
	return CreateMaskedIntrinsic(Intrinsic::masked_gather, Ops, OverloadedTypes,			return CreateMaskedIntrinsic(Intrinsic::masked_gather, Ops, OverloadedTypes,
	Name);			Name);
	}			}

	/// Create a call to a Masked Scatter intrinsic.			/// Create a call to a Masked Scatter intrinsic.
	/// \p Data - data to be stored,			/// \p Data - data to be stored,
	/// \p Ptrs - the vector of pointers, where the \p Data elements should be			/// \p Ptrs - the vector of pointers, where the \p Data elements should be
	/// stored			/// stored
	/// \p Align - alignment for one element			/// \p Align - alignment for one element
	/// \p Mask - vector of booleans which indicates what vector lanes should			/// \p Mask - vector of booleans which indicates what vector lanes should
	/// be accessed in memory			/// be accessed in memory
	CallInst *IRBuilderBase::CreateMaskedScatter(Value *Data, Value *Ptrs,			CallInst *IRBuilderBase::CreateMaskedScatter(Value *Data, Value *Ptrs,
	Align Alignment, Value *Mask) {			Align Alignment, Value *Mask) {
	auto PtrsTy = cast<VectorType>(Ptrs->getType());			auto PtrsTy = cast<VectorType>(Ptrs->getType());
	auto DataTy = cast<VectorType>(Data->getType());			auto DataTy = cast<VectorType>(Data->getType());
	unsigned NumElts = PtrsTy->getNumElements();

	#ifndef NDEBUG			#ifndef NDEBUG
	auto PtrTy = cast<PointerType>(PtrsTy->getElementType());			auto PtrTy = cast<PointerType>(PtrsTy->getElementType());
	assert(NumElts == DataTy->getNumElements() &&			assert(PtrsTy->getElementCount() == DataTy->getElementCount() &&
	PtrTy->getElementType() == DataTy->getElementType() &&			PtrTy->getElementType() == DataTy->getElementType() &&
	"Incompatible pointer and data types");			"Incompatible pointer and data types");
	#endif			#endif

	if (!Mask)			if (!Mask)
	Mask = Constant::getAllOnesValue(VectorType::get(Type::getInt1Ty(Context),			Mask = Constant::getAllOnesValue(
	NumElts));			VectorType::get(Type::getInt1Ty(Context), PtrsTy));

	Type *OverloadedTypes[] = {DataTy, PtrsTy};			Type *OverloadedTypes[] = {DataTy, PtrsTy};
	Value *Ops[] = {Data, Ptrs, getInt32(Alignment.value()), Mask};			Value *Ops[] = {Data, Ptrs, getInt32(Alignment.value()), Mask};

	// We specify only one type when we create this intrinsic. Types of other			// We specify only one type when we create this intrinsic. Types of other
	// arguments are derived from this type.			// arguments are derived from this type.
	return CreateMaskedIntrinsic(Intrinsic::masked_scatter, Ops, OverloadedTypes);			return CreateMaskedIntrinsic(Intrinsic::masked_scatter, Ops, OverloadedTypes);
	}			}
	▲ Show 20 Lines • Show All 564 Lines • Show Last 20 Lines

llvm/lib/IR/Instructions.cpp

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	const char *SelectInst::areInvalidOperands(Value *Op0, Value *Op1, Value *Op2) {

if (VectorType *VT = dyn_cast<VectorType>(Op0->getType())) {		if (VectorType *VT = dyn_cast<VectorType>(Op0->getType())) {
// Vector select.		// Vector select.
if (VT->getElementType() != Type::getInt1Ty(Op0->getContext()))		if (VT->getElementType() != Type::getInt1Ty(Op0->getContext()))
return "vector select condition element type must be i1";		return "vector select condition element type must be i1";
VectorType *ET = dyn_cast<VectorType>(Op1->getType());		VectorType *ET = dyn_cast<VectorType>(Op1->getType());
if (!ET)		if (!ET)
return "selected values for vector select must be vectors";		return "selected values for vector select must be vectors";
if (ET->getNumElements() != VT->getNumElements())		if (ET->getElementCount() != VT->getElementCount())
return "vector select requires selected vectors to have "		return "vector select requires selected vectors to have "
"the same vector length as select condition";		"the same vector length as select condition";
} else if (Op0->getType() != Type::getInt1Ty(Op0->getContext())) {		} else if (Op0->getType() != Type::getInt1Ty(Op0->getContext())) {
return "select condition must be i1 or <n x i1>";		return "select condition must be i1 or <n x i1>";
}		}
return nullptr;		return nullptr;
}		}

▲ Show 20 Lines • Show All 1,800 Lines • ▼ Show 20 Lines	ShuffleVectorInst::ShuffleVectorInst(Value *V1, Value *V2, ArrayRef<int> Mask,

Op<0>() = V1;		Op<0>() = V1;
Op<1>() = V2;		Op<1>() = V2;
setShuffleMask(Mask);		setShuffleMask(Mask);
setName(Name);		setName(Name);
}		}

void ShuffleVectorInst::commute() {		void ShuffleVectorInst::commute() {
int NumOpElts = cast<VectorType>(Op<0>()->getType())->getNumElements();		int NumOpElts = cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
int NumMaskElts = ShuffleMask.size();		int NumMaskElts = ShuffleMask.size();
SmallVector<int, 16> NewMask(NumMaskElts);		SmallVector<int, 16> NewMask(NumMaskElts);
for (int i = 0; i != NumMaskElts; ++i) {		for (int i = 0; i != NumMaskElts; ++i) {
int MaskElt = getMaskValue(i);		int MaskElt = getMaskValue(i);
if (MaskElt == UndefMaskElem) {		if (MaskElt == UndefMaskElem) {
NewMask[i] = UndefMaskElem;		NewMask[i] = UndefMaskElem;
continue;		continue;
}		}
assert(MaskElt >= 0 && MaskElt < 2 * NumOpElts && "Out-of-range mask");		assert(MaskElt >= 0 && MaskElt < 2 * NumOpElts && "Out-of-range mask");
MaskElt = (MaskElt < NumOpElts) ? MaskElt + NumOpElts : MaskElt - NumOpElts;		MaskElt = (MaskElt < NumOpElts) ? MaskElt + NumOpElts : MaskElt - NumOpElts;
NewMask[i] = MaskElt;		NewMask[i] = MaskElt;
}		}
setShuffleMask(NewMask);		setShuffleMask(NewMask);
Op<0>().swap(Op<1>());		Op<0>().swap(Op<1>());
}		}

bool ShuffleVectorInst::isValidOperands(const Value *V1, const Value *V2,		bool ShuffleVectorInst::isValidOperands(const Value *V1, const Value *V2,
ArrayRef<int> Mask) {		ArrayRef<int> Mask) {
// V1 and V2 must be vectors of the same type.		// V1 and V2 must be vectors of the same type.
if (!V1->getType()->isVectorTy() || V1->getType() != V2->getType())		if (!V1->getType()->isVectorTy() || V1->getType() != V2->getType())
return false;		return false;

// Make sure the mask elements make sense.		// Make sure the mask elements make sense.
int V1Size = cast<VectorType>(V1->getType())->getNumElements();		int V1Size = cast<FixedVectorType>(V1->getType())->getNumElements();
for (int Elem : Mask)		for (int Elem : Mask)
if (Elem != UndefMaskElem && Elem >= V1Size * 2)		if (Elem != UndefMaskElem && Elem >= V1Size * 2)
return false;		return false;

if (isa<ScalableVectorType>(V1->getType()))		if (isa<ScalableVectorType>(V1->getType()))
if ((Mask[0] != 0 && Mask[0] != UndefMaskElem) || !is_splat(Mask))		if ((Mask[0] != 0 && Mask[0] != UndefMaskElem) || !is_splat(Mask))
return false;		return false;

Show All 13 Lines	if (!MaskTy || !MaskTy->getElementType()->isIntegerTy(32) ||
isa<ScalableVectorType>(MaskTy) != isa<ScalableVectorType>(V1->getType()))		isa<ScalableVectorType>(MaskTy) != isa<ScalableVectorType>(V1->getType()))
return false;		return false;

// Check to see if Mask is valid.		// Check to see if Mask is valid.
if (isa<UndefValue>(Mask) || isa<ConstantAggregateZero>(Mask))		if (isa<UndefValue>(Mask) || isa<ConstantAggregateZero>(Mask))
return true;		return true;

if (const auto *MV = dyn_cast<ConstantVector>(Mask)) {		if (const auto *MV = dyn_cast<ConstantVector>(Mask)) {
unsigned V1Size = cast<VectorType>(V1->getType())->getNumElements();		unsigned V1Size = cast<FixedVectorType>(V1->getType())->getNumElements();
for (Value *Op : MV->operands()) {		for (Value *Op : MV->operands()) {
if (auto *CI = dyn_cast<ConstantInt>(Op)) {		if (auto *CI = dyn_cast<ConstantInt>(Op)) {
if (CI->uge(V1Size*2))		if (CI->uge(V1Size*2))
return false;		return false;
} else if (!isa<UndefValue>(Op)) {		} else if (!isa<UndefValue>(Op)) {
return false;		return false;
}		}
}		}
return true;		return true;
}		}

if (const auto *CDS = dyn_cast<ConstantDataSequential>(Mask)) {		if (const auto *CDS = dyn_cast<ConstantDataSequential>(Mask)) {
unsigned V1Size = cast<VectorType>(V1->getType())->getNumElements();		unsigned V1Size = cast<FixedVectorType>(V1->getType())->getNumElements();
for (unsigned i = 0, e = MaskTy->getNumElements(); i != e; ++i)		for (unsigned i = 0, e = cast<FixedVectorType>(MaskTy)->getNumElements();
		i != e; ++i)
		ctetreau (author): By the way, I'm doing this to preserve the original behavior. The code used to enter this branch for scalable vectors. I don't know enough about this code to judge what it should actually do for scalable vectors, so I'm making it fail loudly rather than be silently buggy.
		efriedma: ConstantDataSequential is never scalable. (The only valid scalable constants are zero, undef, and ConstantExprs.)
		ctetreau (author): Ugh. Seems I misread the code. What I meant was, for cases like:
		    if (auto *VTy = dyn_cast<VectorType>(Ty)) {
		      auto *FVTy = cast<FixedVectorType>(VTy); // immediately cast my VectorType unconditionally
		      ... // stuff
		    }
		... that I'm doing it on purpose, because trying to dyn_cast to FixedVectorType would be a behavior change.
if (CDS->getElementAsInteger(i) >= V1Size*2)		if (CDS->getElementAsInteger(i) >= V1Size*2)
return false;		return false;
return true;		return true;
}		}

return false;		return false;
}		}
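
Note on the inline discussion above: the unconditional casts to FixedVectorType are meant to fail loudly on scalable vectors instead of silently skipping a branch. The sketch below models that difference in standard C++. The types VectorTy, FixedVectorTy and ScalableVectorTy and the helpers dynCastFixed and castFixed are hypothetical stand-ins, not LLVM's classes or its cast<>/dyn_cast<> templates. It assumes only that dyn_cast<> yields null on a type mismatch while cast<> asserts in assertion-enabled builds.

```cpp
#include <cassert>

// Hypothetical stand-ins for llvm::VectorType and its subclasses.
// These are NOT the real LLVM classes; they only model the cast behavior.
struct VectorTy {
  unsigned MinElts;
  explicit VectorTy(unsigned N) : MinElts(N) {}
  virtual ~VectorTy() = default;
};

struct FixedVectorTy : VectorTy {
  using VectorTy::VectorTy;
  unsigned getNumElements() const { return MinElts; }
};

struct ScalableVectorTy : VectorTy {
  using VectorTy::VectorTy;
};

// Analogue of dyn_cast<FixedVectorType>: returns nullptr for a scalable
// vector, so a branch guarded by the result is skipped without any error.
const FixedVectorTy *dynCastFixed(const VectorTy *Ty) {
  return dynamic_cast<const FixedVectorTy *>(Ty);
}

// Analogue of cast<FixedVectorType>: a type mismatch is treated as a
// programming error and trips an assertion instead of changing control flow.
const FixedVectorTy *castFixed(const VectorTy *Ty) {
  const auto *Fixed = dynamic_cast<const FixedVectorTy *>(Ty);
  assert(Fixed && "castFixed() argument of incompatible type!");
  return Fixed;
}
```

With a fixed vector both helpers return the same pointer. With a scalable vector, dynCastFixed returns null and the caller quietly takes a different path, whereas castFixed aborts in a build with assertions enabled. That is the "loudly fail" behavior the author describes.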

[179 unchanged lines hidden]	bool ShuffleVectorInst::isExtractSubvectorMask(ArrayRef<int> Mask,
if (0 <= SubIndex && SubIndex + (int)Mask.size() <= NumSrcElts) {		if (0 <= SubIndex && SubIndex + (int)Mask.size() <= NumSrcElts) {
Index = SubIndex;		Index = SubIndex;
return true;		return true;
}		}
return false;		return false;
}		}

bool ShuffleVectorInst::isIdentityWithPadding() const {		bool ShuffleVectorInst::isIdentityWithPadding() const {
int NumOpElts = cast<VectorType>(Op<0>()->getType())->getNumElements();		int NumOpElts = cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
int NumMaskElts = cast<VectorType>(getType())->getNumElements();		int NumMaskElts = cast<FixedVectorType>(getType())->getNumElements();
if (NumMaskElts <= NumOpElts)		if (NumMaskElts <= NumOpElts)
return false;		return false;

// The first part of the mask must choose elements from exactly 1 source op.		// The first part of the mask must choose elements from exactly 1 source op.
ArrayRef<int> Mask = getShuffleMask();		ArrayRef<int> Mask = getShuffleMask();
if (!isIdentityMaskImpl(Mask, NumOpElts))		if (!isIdentityMaskImpl(Mask, NumOpElts))
return false;		return false;

// All extending must be with undef elements.		// All extending must be with undef elements.
for (int i = NumOpElts; i < NumMaskElts; ++i)		for (int i = NumOpElts; i < NumMaskElts; ++i)
if (Mask[i] != -1)		if (Mask[i] != -1)
return false;		return false;

return true;		return true;
}		}

bool ShuffleVectorInst::isIdentityWithExtract() const {		bool ShuffleVectorInst::isIdentityWithExtract() const {
int NumOpElts = cast<VectorType>(Op<0>()->getType())->getNumElements();		int NumOpElts = cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
int NumMaskElts = getType()->getNumElements();		int NumMaskElts = cast<FixedVectorType>(getType())->getNumElements();
if (NumMaskElts >= NumOpElts)		if (NumMaskElts >= NumOpElts)
return false;		return false;

return isIdentityMaskImpl(getShuffleMask(), NumOpElts);		return isIdentityMaskImpl(getShuffleMask(), NumOpElts);
}		}

bool ShuffleVectorInst::isConcat() const {		bool ShuffleVectorInst::isConcat() const {
// Vector concatenation is differentiated from identity with padding.		// Vector concatenation is differentiated from identity with padding.
if (isa<UndefValue>(Op<0>()) || isa<UndefValue>(Op<1>()))		if (isa<UndefValue>(Op<0>()) || isa<UndefValue>(Op<1>()))
return false;		return false;

int NumOpElts = cast<VectorType>(Op<0>()->getType())->getNumElements();		int NumOpElts = cast<FixedVectorType>(Op<0>()->getType())->getNumElements();
int NumMaskElts = getType()->getNumElements();		int NumMaskElts = cast<FixedVectorType>(getType())->getNumElements();
if (NumMaskElts != NumOpElts * 2)		if (NumMaskElts != NumOpElts * 2)
return false;		return false;

// Use the mask length rather than the operands' vector lengths here. We		// Use the mask length rather than the operands' vector lengths here. We
// already know that the shuffle returns a vector twice as long as the inputs,		// already know that the shuffle returns a vector twice as long as the inputs,
// and neither of the inputs are undef vectors. If the mask picks consecutive		// and neither of the inputs are undef vectors. If the mask picks consecutive
// elements from both inputs, then this is a concatenation of the inputs.		// elements from both inputs, then this is a concatenation of the inputs.
return isIdentityMaskImpl(getShuffleMask(), NumMaskElts);		return isIdentityMaskImpl(getShuffleMask(), NumMaskElts);
[724 unchanged lines hidden]
CastInst *CastInst::CreatePointerCast(Value *S, Type *Ty,		CastInst *CastInst::CreatePointerCast(Value *S, Type *Ty,
const Twine &Name,		const Twine &Name,
BasicBlock *InsertAtEnd) {		BasicBlock *InsertAtEnd) {
assert(S->getType()->isPtrOrPtrVectorTy() && "Invalid cast");		assert(S->getType()->isPtrOrPtrVectorTy() && "Invalid cast");
assert((Ty->isIntOrIntVectorTy() || Ty->isPtrOrPtrVectorTy()) &&		assert((Ty->isIntOrIntVectorTy() || Ty->isPtrOrPtrVectorTy()) &&
"Invalid cast");		"Invalid cast");
assert(Ty->isVectorTy() == S->getType()->isVectorTy() && "Invalid cast");		assert(Ty->isVectorTy() == S->getType()->isVectorTy() && "Invalid cast");
assert((!Ty->isVectorTy() ||		assert((!Ty->isVectorTy() ||
cast<VectorType>(Ty)->getNumElements() ==		cast<VectorType>(Ty)->getElementCount() ==
cast<VectorType>(S->getType())->getNumElements()) &&		cast<VectorType>(S->getType())->getElementCount()) &&
"Invalid cast");		"Invalid cast");

if (Ty->isIntOrIntVectorTy())		if (Ty->isIntOrIntVectorTy())
return Create(Instruction::PtrToInt, S, Ty, Name, InsertAtEnd);		return Create(Instruction::PtrToInt, S, Ty, Name, InsertAtEnd);

return CreatePointerBitCastOrAddrSpaceCast(S, Ty, Name, InsertAtEnd);		return CreatePointerBitCastOrAddrSpaceCast(S, Ty, Name, InsertAtEnd);
}		}

/// Create a BitCast or a PtrToInt cast instruction		/// Create a BitCast or a PtrToInt cast instruction
CastInst *CastInst::CreatePointerCast(Value *S, Type *Ty,		CastInst *CastInst::CreatePointerCast(Value *S, Type *Ty,
const Twine &Name,		const Twine &Name,
Instruction *InsertBefore) {		Instruction *InsertBefore) {
assert(S->getType()->isPtrOrPtrVectorTy() && "Invalid cast");		assert(S->getType()->isPtrOrPtrVectorTy() && "Invalid cast");
assert((Ty->isIntOrIntVectorTy() || Ty->isPtrOrPtrVectorTy()) &&		assert((Ty->isIntOrIntVectorTy() || Ty->isPtrOrPtrVectorTy()) &&
"Invalid cast");		"Invalid cast");
assert(Ty->isVectorTy() == S->getType()->isVectorTy() && "Invalid cast");		assert(Ty->isVectorTy() == S->getType()->isVectorTy() && "Invalid cast");
assert((!Ty->isVectorTy() ||		assert((!Ty->isVectorTy() ||
cast<VectorType>(Ty)->getNumElements() ==		cast<VectorType>(Ty)->getElementCount() ==
cast<VectorType>(S->getType())->getNumElements()) &&		cast<VectorType>(S->getType())->getElementCount()) &&
"Invalid cast");		"Invalid cast");

if (Ty->isIntOrIntVectorTy())		if (Ty->isIntOrIntVectorTy())
return Create(Instruction::PtrToInt, S, Ty, Name, InsertBefore);		return Create(Instruction::PtrToInt, S, Ty, Name, InsertBefore);

return CreatePointerBitCastOrAddrSpaceCast(S, Ty, Name, InsertBefore);		return CreatePointerBitCastOrAddrSpaceCast(S, Ty, Name, InsertBefore);
}		}
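
Note: the asserts above now compare getElementCount() instead of getNumElements(). A minimal sketch of why the two comparisons can disagree follows. ElemCount and VecShape are simplified, hypothetical models (not LLVM's ElementCount or VectorType), and the sketch assumes that getNumElements() on a scalable vector reports only the minimum element count.

```cpp
// Simplified, hypothetical model of an element count: a minimum number of
// elements plus a flag saying whether it is multiplied by a runtime vscale.
struct ElemCount {
  unsigned Min;
  bool Scalable;
  bool operator==(const ElemCount &RHS) const {
    return Min == RHS.Min && Scalable == RHS.Scalable;
  }
};

// Hypothetical stand-in for a vector type; only the shape is modeled.
struct VecShape {
  unsigned MinElts;
  bool Scalable;
  // Assumption: for scalable vectors this reports only the minimum count.
  unsigned getNumElements() const { return MinElts; }
  ElemCount getElementCount() const { return ElemCount{MinElts, Scalable}; }
};

// Old-style check: compares only the (minimum) number of elements.
bool sameLengthByNumElements(const VecShape &A, const VecShape &B) {
  return A.getNumElements() == B.getNumElements();
}

// New-style check: also requires both vectors to agree on scalability.
bool sameLengthByElementCount(const VecShape &A, const VecShape &B) {
  return A.getElementCount() == B.getElementCount();
}
```

Under this model, a fixed vector of 4 elements and a scalable vector with a minimum of 4 elements pass the old check but fail the new one, so the new comparison is stricter for mixed fixed/scalable operands.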

[94 unchanged lines hidden]	bool CastInst::isCastable(Type *SrcTy, Type *DestTy) {
if (!SrcTy->isFirstClassType() || !DestTy->isFirstClassType())		if (!SrcTy->isFirstClassType() || !DestTy->isFirstClassType())
return false;		return false;

if (SrcTy == DestTy)		if (SrcTy == DestTy)
return true;		return true;

if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy))		if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy))
if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))		if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))
if (SrcVecTy->getNumElements() == DestVecTy->getNumElements()) {		if (SrcVecTy->getElementCount() == DestVecTy->getElementCount()) {
// An element by element cast. Valid if casting the elements is valid.		// An element by element cast. Valid if casting the elements is valid.
SrcTy = SrcVecTy->getElementType();		SrcTy = SrcVecTy->getElementType();
DestTy = DestVecTy->getElementType();		DestTy = DestVecTy->getElementType();
}		}

// Get the bit sizes, we'll need these		// Get the bit sizes, we'll need these
TypeSize SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr		TypeSize SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr
TypeSize DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr		TypeSize DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr
[105 unchanged lines hidden]	assert(SrcTy->isFirstClassType() && DestTy->isFirstClassType() &&
"Only first class types are castable!");		"Only first class types are castable!");

if (SrcTy == DestTy)		if (SrcTy == DestTy)
return BitCast;		return BitCast;

// FIXME: Check address space sizes here		// FIXME: Check address space sizes here
if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy))		if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy))
if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))		if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))
if (SrcVecTy->getNumElements() == DestVecTy->getNumElements()) {		if (SrcVecTy->getElementCount() == DestVecTy->getElementCount()) {
// An element by element cast. Find the appropriate opcode based on the		// An element by element cast. Find the appropriate opcode based on the
// element types.		// element types.
SrcTy = SrcVecTy->getElementType();		SrcTy = SrcVecTy->getElementType();
DestTy = DestVecTy->getElementType();		DestTy = DestVecTy->getElementType();
}		}

// Get the bit sizes, we'll need these		// Get the bit sizes, we'll need these
unsigned SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr		unsigned SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr
[1,211 unchanged lines hidden]

llvm/lib/IR/Verifier.cpp

[2,677 unchanged lines hidden]	void Verifier::visitUIToFPInst(UIToFPInst &I) {
Assert(SrcVec == DstVec,		Assert(SrcVec == DstVec,
"UIToFP source and dest must both be vector or scalar", &I);		"UIToFP source and dest must both be vector or scalar", &I);
Assert(SrcTy->isIntOrIntVectorTy(),		Assert(SrcTy->isIntOrIntVectorTy(),
"UIToFP source must be integer or integer vector", &I);		"UIToFP source must be integer or integer vector", &I);
Assert(DestTy->isFPOrFPVectorTy(), "UIToFP result must be FP or FP vector",		Assert(DestTy->isFPOrFPVectorTy(), "UIToFP result must be FP or FP vector",
&I);		&I);

if (SrcVec && DstVec)		if (SrcVec && DstVec)
Assert(cast<VectorType>(SrcTy)->getNumElements() ==		Assert(cast<VectorType>(SrcTy)->getElementCount() ==
cast<VectorType>(DestTy)->getNumElements(),		cast<VectorType>(DestTy)->getElementCount(),
"UIToFP source and dest vector length mismatch", &I);		"UIToFP source and dest vector length mismatch", &I);

visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitSIToFPInst(SIToFPInst &I) {		void Verifier::visitSIToFPInst(SIToFPInst &I) {
// Get the source and destination types		// Get the source and destination types
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

bool SrcVec = SrcTy->isVectorTy();		bool SrcVec = SrcTy->isVectorTy();
bool DstVec = DestTy->isVectorTy();		bool DstVec = DestTy->isVectorTy();

Assert(SrcVec == DstVec,		Assert(SrcVec == DstVec,
"SIToFP source and dest must both be vector or scalar", &I);		"SIToFP source and dest must both be vector or scalar", &I);
Assert(SrcTy->isIntOrIntVectorTy(),		Assert(SrcTy->isIntOrIntVectorTy(),
"SIToFP source must be integer or integer vector", &I);		"SIToFP source must be integer or integer vector", &I);
Assert(DestTy->isFPOrFPVectorTy(), "SIToFP result must be FP or FP vector",		Assert(DestTy->isFPOrFPVectorTy(), "SIToFP result must be FP or FP vector",
&I);		&I);

if (SrcVec && DstVec)		if (SrcVec && DstVec)
Assert(cast<VectorType>(SrcTy)->getNumElements() ==		Assert(cast<VectorType>(SrcTy)->getElementCount() ==
cast<VectorType>(DestTy)->getNumElements(),		cast<VectorType>(DestTy)->getElementCount(),
"SIToFP source and dest vector length mismatch", &I);		"SIToFP source and dest vector length mismatch", &I);

visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitFPToUIInst(FPToUIInst &I) {		void Verifier::visitFPToUIInst(FPToUIInst &I) {
// Get the source and destination types		// Get the source and destination types
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

bool SrcVec = SrcTy->isVectorTy();		bool SrcVec = SrcTy->isVectorTy();
bool DstVec = DestTy->isVectorTy();		bool DstVec = DestTy->isVectorTy();

Assert(SrcVec == DstVec,		Assert(SrcVec == DstVec,
"FPToUI source and dest must both be vector or scalar", &I);		"FPToUI source and dest must both be vector or scalar", &I);
Assert(SrcTy->isFPOrFPVectorTy(), "FPToUI source must be FP or FP vector",		Assert(SrcTy->isFPOrFPVectorTy(), "FPToUI source must be FP or FP vector",
&I);		&I);
Assert(DestTy->isIntOrIntVectorTy(),		Assert(DestTy->isIntOrIntVectorTy(),
"FPToUI result must be integer or integer vector", &I);		"FPToUI result must be integer or integer vector", &I);

if (SrcVec && DstVec)		if (SrcVec && DstVec)
Assert(cast<VectorType>(SrcTy)->getNumElements() ==		Assert(cast<VectorType>(SrcTy)->getElementCount() ==
cast<VectorType>(DestTy)->getNumElements(),		cast<VectorType>(DestTy)->getElementCount(),
"FPToUI source and dest vector length mismatch", &I);		"FPToUI source and dest vector length mismatch", &I);

visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitFPToSIInst(FPToSIInst &I) {		void Verifier::visitFPToSIInst(FPToSIInst &I) {
// Get the source and destination types		// Get the source and destination types
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

bool SrcVec = SrcTy->isVectorTy();		bool SrcVec = SrcTy->isVectorTy();
bool DstVec = DestTy->isVectorTy();		bool DstVec = DestTy->isVectorTy();

Assert(SrcVec == DstVec,		Assert(SrcVec == DstVec,
"FPToSI source and dest must both be vector or scalar", &I);		"FPToSI source and dest must both be vector or scalar", &I);
Assert(SrcTy->isFPOrFPVectorTy(), "FPToSI source must be FP or FP vector",		Assert(SrcTy->isFPOrFPVectorTy(), "FPToSI source must be FP or FP vector",
&I);		&I);
Assert(DestTy->isIntOrIntVectorTy(),		Assert(DestTy->isIntOrIntVectorTy(),
"FPToSI result must be integer or integer vector", &I);		"FPToSI result must be integer or integer vector", &I);

if (SrcVec && DstVec)		if (SrcVec && DstVec)
Assert(cast<VectorType>(SrcTy)->getNumElements() ==		Assert(cast<VectorType>(SrcTy)->getElementCount() ==
cast<VectorType>(DestTy)->getNumElements(),		cast<VectorType>(DestTy)->getElementCount(),
"FPToSI source and dest vector length mismatch", &I);		"FPToSI source and dest vector length mismatch", &I);

visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitPtrToIntInst(PtrToIntInst &I) {		void Verifier::visitPtrToIntInst(PtrToIntInst &I) {
// Get the source and destination types		// Get the source and destination types
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

Assert(SrcTy->isPtrOrPtrVectorTy(), "PtrToInt source must be pointer", &I);		Assert(SrcTy->isPtrOrPtrVectorTy(), "PtrToInt source must be pointer", &I);

if (auto *PTy = dyn_cast<PointerType>(SrcTy->getScalarType()))		if (auto *PTy = dyn_cast<PointerType>(SrcTy->getScalarType()))
Assert(!DL.isNonIntegralPointerType(PTy),		Assert(!DL.isNonIntegralPointerType(PTy),
"ptrtoint not supported for non-integral pointers");		"ptrtoint not supported for non-integral pointers");

Assert(DestTy->isIntOrIntVectorTy(), "PtrToInt result must be integral", &I);		Assert(DestTy->isIntOrIntVectorTy(), "PtrToInt result must be integral", &I);
Assert(SrcTy->isVectorTy() == DestTy->isVectorTy(), "PtrToInt type mismatch",		Assert(SrcTy->isVectorTy() == DestTy->isVectorTy(), "PtrToInt type mismatch",
&I);		&I);

if (SrcTy->isVectorTy()) {		if (SrcTy->isVectorTy()) {
VectorType *VSrc = cast<VectorType>(SrcTy);		VectorType *VSrc = cast<VectorType>(SrcTy);
VectorType *VDest = cast<VectorType>(DestTy);		VectorType *VDest = cast<VectorType>(DestTy);
Assert(VSrc->getNumElements() == VDest->getNumElements(),		Assert(VSrc->getElementCount() == VDest->getElementCount(),
"PtrToInt Vector width mismatch", &I);		"PtrToInt Vector width mismatch", &I);
}		}

visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitIntToPtrInst(IntToPtrInst &I) {		void Verifier::visitIntToPtrInst(IntToPtrInst &I) {
// Get the source and destination types		// Get the source and destination types
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

Assert(SrcTy->isIntOrIntVectorTy(),		Assert(SrcTy->isIntOrIntVectorTy(),
"IntToPtr source must be an integral", &I);		"IntToPtr source must be an integral", &I);
Assert(DestTy->isPtrOrPtrVectorTy(), "IntToPtr result must be a pointer", &I);		Assert(DestTy->isPtrOrPtrVectorTy(), "IntToPtr result must be a pointer", &I);

if (auto *PTy = dyn_cast<PointerType>(DestTy->getScalarType()))		if (auto *PTy = dyn_cast<PointerType>(DestTy->getScalarType()))
Assert(!DL.isNonIntegralPointerType(PTy),		Assert(!DL.isNonIntegralPointerType(PTy),
"inttoptr not supported for non-integral pointers");		"inttoptr not supported for non-integral pointers");

Assert(SrcTy->isVectorTy() == DestTy->isVectorTy(), "IntToPtr type mismatch",		Assert(SrcTy->isVectorTy() == DestTy->isVectorTy(), "IntToPtr type mismatch",
&I);		&I);
if (SrcTy->isVectorTy()) {		if (SrcTy->isVectorTy()) {
VectorType *VSrc = cast<VectorType>(SrcTy);		VectorType *VSrc = cast<VectorType>(SrcTy);
VectorType *VDest = cast<VectorType>(DestTy);		VectorType *VDest = cast<VectorType>(DestTy);
Assert(VSrc->getNumElements() == VDest->getNumElements(),		Assert(VSrc->getElementCount() == VDest->getElementCount(),
"IntToPtr Vector width mismatch", &I);		"IntToPtr Vector width mismatch", &I);
}		}
visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitBitCastInst(BitCastInst &I) {		void Verifier::visitBitCastInst(BitCastInst &I) {
Assert(		Assert(
CastInst::castIsValid(Instruction::BitCast, I.getOperand(0), I.getType()),		CastInst::castIsValid(Instruction::BitCast, I.getOperand(0), I.getType()),
"Invalid bitcast", &I);		"Invalid bitcast", &I);
visitInstruction(I);		visitInstruction(I);
}		}

void Verifier::visitAddrSpaceCastInst(AddrSpaceCastInst &I) {		void Verifier::visitAddrSpaceCastInst(AddrSpaceCastInst &I) {
Type *SrcTy = I.getOperand(0)->getType();		Type *SrcTy = I.getOperand(0)->getType();
Type *DestTy = I.getType();		Type *DestTy = I.getType();

Assert(SrcTy->isPtrOrPtrVectorTy(), "AddrSpaceCast source must be a pointer",		Assert(SrcTy->isPtrOrPtrVectorTy(), "AddrSpaceCast source must be a pointer",
&I);		&I);
Assert(DestTy->isPtrOrPtrVectorTy(), "AddrSpaceCast result must be a pointer",		Assert(DestTy->isPtrOrPtrVectorTy(), "AddrSpaceCast result must be a pointer",
&I);		&I);
Assert(SrcTy->getPointerAddressSpace() != DestTy->getPointerAddressSpace(),		Assert(SrcTy->getPointerAddressSpace() != DestTy->getPointerAddressSpace(),
"AddrSpaceCast must be between different address spaces", &I);		"AddrSpaceCast must be between different address spaces", &I);
if (auto *SrcVTy = dyn_cast<VectorType>(SrcTy))		if (auto *SrcVTy = dyn_cast<VectorType>(SrcTy))
Assert(SrcVTy->getNumElements() ==		Assert(SrcVTy->getElementCount() ==
cast<VectorType>(DestTy)->getNumElements(),		cast<VectorType>(DestTy)->getElementCount(),
"AddrSpaceCast vector pointer number of elements mismatch", &I);		"AddrSpaceCast vector pointer number of elements mismatch", &I);
visitInstruction(I);		visitInstruction(I);
}		}

/// visitPHINode - Ensure that a PHI node is well formed.		/// visitPHINode - Ensure that a PHI node is well formed.
///		///
void Verifier::visitPHINode(PHINode &PN) {		void Verifier::visitPHINode(PHINode &PN) {
// Ensure that the PHI nodes are all grouped together at the top of the block.		// Ensure that the PHI nodes are all grouped together at the top of the block.
[492 unchanged lines hidden]	void Verifier::visitGetElementPtrInst(GetElementPtrInst &GEP) {
Assert(ElTy, "Invalid indices for GEP pointer type!", &GEP);		Assert(ElTy, "Invalid indices for GEP pointer type!", &GEP);

Assert(GEP.getType()->isPtrOrPtrVectorTy() &&		Assert(GEP.getType()->isPtrOrPtrVectorTy() &&
GEP.getResultElementType() == ElTy,		GEP.getResultElementType() == ElTy,
"GEP is not of right type for indices!", &GEP, ElTy);		"GEP is not of right type for indices!", &GEP, ElTy);

if (auto *GEPVTy = dyn_cast<VectorType>(GEP.getType())) {		if (auto *GEPVTy = dyn_cast<VectorType>(GEP.getType())) {
// Additional checks for vector GEPs.		// Additional checks for vector GEPs.
unsigned GEPWidth = GEPVTy->getNumElements();		ElementCount GEPWidth = GEPVTy->getElementCount();
if (GEP.getPointerOperandType()->isVectorTy())		if (GEP.getPointerOperandType()->isVectorTy())
Assert(		Assert(
GEPWidth ==		GEPWidth ==
cast<VectorType>(GEP.getPointerOperandType())->getNumElements(),		cast<VectorType>(GEP.getPointerOperandType())->getElementCount(),
"Vector GEP result width doesn't match operand's", &GEP);		"Vector GEP result width doesn't match operand's", &GEP);
for (Value *Idx : Idxs) {		for (Value *Idx : Idxs) {
Type *IndexTy = Idx->getType();		Type *IndexTy = Idx->getType();
if (auto *IndexVTy = dyn_cast<VectorType>(IndexTy)) {		if (auto *IndexVTy = dyn_cast<VectorType>(IndexTy)) {
unsigned IndexWidth = IndexVTy->getNumElements();		ElementCount IndexWidth = IndexVTy->getElementCount();
Assert(IndexWidth == GEPWidth, "Invalid GEP index vector width", &GEP);		Assert(IndexWidth == GEPWidth, "Invalid GEP index vector width", &GEP);
}		}
Assert(IndexTy->isIntOrIntVectorTy(),		Assert(IndexTy->isIntOrIntVectorTy(),
"All GEP indices should be of integer type");		"All GEP indices should be of integer type");
}		}
}		}

if (auto *PTy = dyn_cast<PointerType>(GEP.getType())) {		if (auto *PTy = dyn_cast<PointerType>(GEP.getType())) {
[1,297 unchanged lines hidden]	Assert(Alignment->getValue().isPowerOf2(),
"masked_load: alignment must be a power of 2", Call);		"masked_load: alignment must be a power of 2", Call);

// DataTy is the overloaded type		// DataTy is the overloaded type
Type *DataTy = cast<PointerType>(Ptr->getType())->getElementType();		Type *DataTy = cast<PointerType>(Ptr->getType())->getElementType();
Assert(DataTy == Call.getType(),		Assert(DataTy == Call.getType(),
"masked_load: return must match pointer type", Call);		"masked_load: return must match pointer type", Call);
Assert(PassThru->getType() == DataTy,		Assert(PassThru->getType() == DataTy,
"masked_load: pass through and data type must match", Call);		"masked_load: pass through and data type must match", Call);
Assert(cast<VectorType>(Mask->getType())->getNumElements() ==		Assert(cast<VectorType>(Mask->getType())->getElementCount() ==
cast<VectorType>(DataTy)->getNumElements(),		cast<VectorType>(DataTy)->getElementCount(),
"masked_load: vector mask must be same length as data", Call);		"masked_load: vector mask must be same length as data", Call);
break;		break;
}		}
case Intrinsic::masked_store: {		case Intrinsic::masked_store: {
Value *Val = Call.getArgOperand(0);		Value *Val = Call.getArgOperand(0);
Value *Ptr = Call.getArgOperand(1);		Value *Ptr = Call.getArgOperand(1);
ConstantInt *Alignment = cast<ConstantInt>(Call.getArgOperand(2));		ConstantInt *Alignment = cast<ConstantInt>(Call.getArgOperand(2));
Value *Mask = Call.getArgOperand(3);		Value *Mask = Call.getArgOperand(3);
Assert(Mask->getType()->isVectorTy(), "masked_store: mask must be vector",		Assert(Mask->getType()->isVectorTy(), "masked_store: mask must be vector",
Call);		Call);
Assert(Alignment->getValue().isPowerOf2(),		Assert(Alignment->getValue().isPowerOf2(),
"masked_store: alignment must be a power of 2", Call);		"masked_store: alignment must be a power of 2", Call);

// DataTy is the overloaded type		// DataTy is the overloaded type
Type *DataTy = cast<PointerType>(Ptr->getType())->getElementType();		Type *DataTy = cast<PointerType>(Ptr->getType())->getElementType();
Assert(DataTy == Val->getType(),		Assert(DataTy == Val->getType(),
"masked_store: storee must match pointer type", Call);		"masked_store: storee must match pointer type", Call);
Assert(cast<VectorType>(Mask->getType())->getNumElements() ==		Assert(cast<VectorType>(Mask->getType())->getElementCount() ==
cast<VectorType>(DataTy)->getNumElements(),		cast<VectorType>(DataTy)->getElementCount(),
"masked_store: vector mask must be same length as data", Call);		"masked_store: vector mask must be same length as data", Call);
break;		break;
}		}

case Intrinsic::masked_gather: {		case Intrinsic::masked_gather: {
const APInt &Alignment =		const APInt &Alignment =
cast<ConstantInt>(Call.getArgOperand(1))->getValue();		cast<ConstantInt>(Call.getArgOperand(1))->getValue();
Assert(Alignment.isNullValue() || Alignment.isPowerOf2(),		Assert(Alignment.isNullValue() || Alignment.isPowerOf2(),
[103 unchanged lines hidden]	case Intrinsic::bswap: {
break;		break;
}		}
case Intrinsic::matrix_multiply:		case Intrinsic::matrix_multiply:
case Intrinsic::matrix_transpose:		case Intrinsic::matrix_transpose:
case Intrinsic::matrix_columnwise_load:		case Intrinsic::matrix_columnwise_load:
case Intrinsic::matrix_columnwise_store: {		case Intrinsic::matrix_columnwise_store: {
ConstantInt *NumRows;		ConstantInt *NumRows;
ConstantInt *NumColumns;		ConstantInt *NumColumns;
VectorType *TypeToCheck;		FixedVectorType *TypeToCheck;
switch (ID) {		switch (ID) {
case Intrinsic::matrix_multiply:		case Intrinsic::matrix_multiply:
NumRows = cast<ConstantInt>(Call.getArgOperand(2));		NumRows = cast<ConstantInt>(Call.getArgOperand(2));
NumColumns = cast<ConstantInt>(Call.getArgOperand(4));		NumColumns = cast<ConstantInt>(Call.getArgOperand(4));
TypeToCheck = cast<VectorType>(Call.getType());		TypeToCheck = cast<FixedVectorType>(Call.getType());
break;		break;
case Intrinsic::matrix_transpose:		case Intrinsic::matrix_transpose:
NumRows = cast<ConstantInt>(Call.getArgOperand(1));		NumRows = cast<ConstantInt>(Call.getArgOperand(1));
NumColumns = cast<ConstantInt>(Call.getArgOperand(2));		NumColumns = cast<ConstantInt>(Call.getArgOperand(2));
TypeToCheck = cast<VectorType>(Call.getType());		TypeToCheck = cast<FixedVectorType>(Call.getType());
break;		break;
case Intrinsic::matrix_columnwise_load:		case Intrinsic::matrix_columnwise_load:
NumRows = cast<ConstantInt>(Call.getArgOperand(2));		NumRows = cast<ConstantInt>(Call.getArgOperand(2));
NumColumns = cast<ConstantInt>(Call.getArgOperand(3));		NumColumns = cast<ConstantInt>(Call.getArgOperand(3));
TypeToCheck = cast<VectorType>(Call.getType());		TypeToCheck = cast<FixedVectorType>(Call.getType());
break;		break;
case Intrinsic::matrix_columnwise_store:		case Intrinsic::matrix_columnwise_store:
NumRows = cast<ConstantInt>(Call.getArgOperand(3));		NumRows = cast<ConstantInt>(Call.getArgOperand(3));
NumColumns = cast<ConstantInt>(Call.getArgOperand(4));		NumColumns = cast<ConstantInt>(Call.getArgOperand(4));
TypeToCheck = cast<VectorType>(Call.getArgOperand(0)->getType());		TypeToCheck = cast<FixedVectorType>(Call.getArgOperand(0)->getType());
break;		break;
default:		default:
llvm_unreachable("unexpected intrinsic");		llvm_unreachable("unexpected intrinsic");
}		}
Assert(TypeToCheck->getNumElements() ==		Assert(TypeToCheck->getNumElements() ==
NumRows->getZExtValue() * NumColumns->getZExtValue(),		NumRows->getZExtValue() * NumColumns->getZExtValue(),
"result of a matrix operation does not fit in the returned vector");		"result of a matrix operation does not fit in the returned vector");
break;		break;
[65 unchanged lines hidden]	case Intrinsic::experimental_constrained_fcmps: {
Assert(CmpInst::isFPPredicate(Pred),		Assert(CmpInst::isFPPredicate(Pred),
"invalid predicate for constrained FP comparison intrinsic", &FPI);		"invalid predicate for constrained FP comparison intrinsic", &FPI);
break;		break;
}		}

case Intrinsic::experimental_constrained_fptosi:		case Intrinsic::experimental_constrained_fptosi:
case Intrinsic::experimental_constrained_fptoui: {		case Intrinsic::experimental_constrained_fptoui: {
Value *Operand = FPI.getArgOperand(0);		Value *Operand = FPI.getArgOperand(0);
uint64_t NumSrcElem = 0;		Optional<ElementCount> NumSrcElem;
Assert(Operand->getType()->isFPOrFPVectorTy(),		Assert(Operand->getType()->isFPOrFPVectorTy(),
"Intrinsic first argument must be floating point", &FPI);		"Intrinsic first argument must be floating point", &FPI);
if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {		if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {
NumSrcElem = OperandT->getNumElements();		NumSrcElem = OperandT->getElementCount();
		efriedma: `cast<FixedVectorType>(OperandT)->getElementCount()`?
		ctetreau (author): oops
}		}

Operand = &FPI;		Operand = &FPI;
Assert((NumSrcElem > 0) == Operand->getType()->isVectorTy(),		Assert(NumSrcElem.hasValue() == Operand->getType()->isVectorTy(),
"Intrinsic first argument and result disagree on vector use", &FPI);		"Intrinsic first argument and result disagree on vector use", &FPI);
Assert(Operand->getType()->isIntOrIntVectorTy(),		Assert(Operand->getType()->isIntOrIntVectorTy(),
"Intrinsic result must be an integer", &FPI);		"Intrinsic result must be an integer", &FPI);
if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {		if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {
Assert(NumSrcElem == OperandT->getNumElements(),		Assert(*NumSrcElem == OperandT->getElementCount(),
"Intrinsic first argument and result vector lengths must be equal",		"Intrinsic first argument and result vector lengths must be equal",
&FPI);		&FPI);
}		}
}		}
break;		break;

case Intrinsic::experimental_constrained_sitofp:		case Intrinsic::experimental_constrained_sitofp:
case Intrinsic::experimental_constrained_uitofp: {		case Intrinsic::experimental_constrained_uitofp: {
Value *Operand = FPI.getArgOperand(0);		Value *Operand = FPI.getArgOperand(0);
uint64_t NumSrcElem = 0;		Optional<ElementCount> NumSrcElem;
Assert(Operand->getType()->isIntOrIntVectorTy(),		Assert(Operand->getType()->isIntOrIntVectorTy(),
"Intrinsic first argument must be integer", &FPI);		"Intrinsic first argument must be integer", &FPI);
if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {		if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {
NumSrcElem = OperandT->getNumElements();		NumSrcElem = OperandT->getElementCount();
}		}

Operand = &FPI;		Operand = &FPI;
Assert((NumSrcElem > 0) == Operand->getType()->isVectorTy(),		Assert(NumSrcElem.hasValue() == Operand->getType()->isVectorTy(),
"Intrinsic first argument and result disagree on vector use", &FPI);		"Intrinsic first argument and result disagree on vector use", &FPI);
Assert(Operand->getType()->isFPOrFPVectorTy(),		Assert(Operand->getType()->isFPOrFPVectorTy(),
"Intrinsic result must be a floating point", &FPI);		"Intrinsic result must be a floating point", &FPI);
if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {		if (auto *OperandT = dyn_cast<VectorType>(Operand->getType())) {
Assert(NumSrcElem == OperandT->getNumElements(),		Assert(*NumSrcElem == OperandT->getElementCount(),
"Intrinsic first argument and result vector lengths must be equal",		"Intrinsic first argument and result vector lengths must be equal",
&FPI);		&FPI);
}		}
} break;		} break;

case Intrinsic::experimental_constrained_fptrunc:		case Intrinsic::experimental_constrained_fptrunc:
case Intrinsic::experimental_constrained_fpext: {		case Intrinsic::experimental_constrained_fpext: {
Value *Operand = FPI.getArgOperand(0);		Value *Operand = FPI.getArgOperand(0);
Type *OperandTy = Operand->getType();		Type *OperandTy = Operand->getType();
Value *Result = &FPI;		Value *Result = &FPI;
Type *ResultTy = Result->getType();		Type *ResultTy = Result->getType();
Assert(OperandTy->isFPOrFPVectorTy(),		Assert(OperandTy->isFPOrFPVectorTy(),
"Intrinsic first argument must be FP or FP vector", &FPI);		"Intrinsic first argument must be FP or FP vector", &FPI);
Assert(ResultTy->isFPOrFPVectorTy(),		Assert(ResultTy->isFPOrFPVectorTy(),
"Intrinsic result must be FP or FP vector", &FPI);		"Intrinsic result must be FP or FP vector", &FPI);
Assert(OperandTy->isVectorTy() == ResultTy->isVectorTy(),		Assert(OperandTy->isVectorTy() == ResultTy->isVectorTy(),
"Intrinsic first argument and result disagree on vector use", &FPI);		"Intrinsic first argument and result disagree on vector use", &FPI);
if (OperandTy->isVectorTy()) {		if (OperandTy->isVectorTy()) {
auto *OperandVecTy = cast<VectorType>(OperandTy);		auto *OperandVecTy = cast<VectorType>(OperandTy);
auto *ResultVecTy = cast<VectorType>(ResultTy);		auto *ResultVecTy = cast<VectorType>(ResultTy);
Assert(OperandVecTy->getNumElements() == ResultVecTy->getNumElements(),		Assert(OperandVecTy->getElementCount() == ResultVecTy->getElementCount(),
"Intrinsic first argument and result vector lengths must be equal",		"Intrinsic first argument and result vector lengths must be equal",
&FPI);		&FPI);
}		}
if (FPI.getIntrinsicID() == Intrinsic::experimental_constrained_fptrunc) {		if (FPI.getIntrinsicID() == Intrinsic::experimental_constrained_fptrunc) {
Assert(OperandTy->getScalarSizeInBits() > ResultTy->getScalarSizeInBits(),		Assert(OperandTy->getScalarSizeInBits() > ResultTy->getScalarSizeInBits(),
"Intrinsic first argument's type must be larger than result type",		"Intrinsic first argument's type must be larger than result type",
&FPI);		&FPI);
} else {		} else {
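The Verifier hunks above replace a `uint64_t NumSrcElem = 0` sentinel (where `NumSrcElem > 0` meant "the operand is a vector") with an `Optional<ElementCount>` that is only set for vector operands. The following is a minimal standalone sketch of that pattern, not the actual LLVM code: the `ElementCount` struct here is a simplified stand-in assumed for illustration, and `TypeShape` and `conversionShapesAgree` are hypothetical names that do not appear in the patch. `std::optional` is used in place of `llvm::Optional`.

```cpp
#include <cassert>
#include <optional>

// Simplified stand-in for an element count (assumption, not the LLVM class):
// a minimum lane count plus a flag saying whether the real count is a
// runtime multiple of that minimum (scalable vector).
struct ElementCount {
  unsigned Min;
  bool Scalable;
  bool operator==(const ElementCount &RHS) const {
    return Min == RHS.Min && Scalable == RHS.Scalable;
  }
};

// Hypothetical model of a type's shape: an empty optional means "scalar".
using TypeShape = std::optional<ElementCount>;

// Mirrors the two checks in the diff: operand and result must agree on
// vector use, and if both are vectors their element counts must be equal.
bool conversionShapesAgree(const TypeShape &Operand, const TypeShape &Result) {
  std::optional<ElementCount> NumSrcElem;
  if (Operand)
    NumSrcElem = *Operand;

  // Replaces the old `(NumSrcElem > 0) == isVectorTy()` sentinel test.
  if (NumSrcElem.has_value() != Result.has_value())
    return false;

  // Dereferencing is safe here: the check above guarantees NumSrcElem is
  // set whenever Result is a vector.
  if (Result)
    return *NumSrcElem == *Result;
  return true;
}
```

In this sketch a fixed 4-element vector and a scalable vector with a minimum of 4 elements compare unequal, which a plain integer count could not express. That difference in behavior is the kind of new functionality the review comments ask to be covered by regression tests.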

Revision Contents

Diff 259624

llvm/include/llvm/IR/Constants.h

llvm/include/llvm/IR/GetElementPtrTypeIterator.h

llvm/include/llvm/IR/Instructions.h

llvm/include/llvm/IR/PatternMatch.h

llvm/lib/IR/AsmWriter.cpp

llvm/lib/IR/AutoUpgrade.cpp

llvm/lib/IR/ConstantFold.cpp

llvm/lib/IR/Constants.cpp

llvm/lib/IR/DataLayout.cpp

llvm/lib/IR/Function.cpp

llvm/lib/IR/IRBuilder.cpp

llvm/lib/IR/Instructions.cpp

llvm/lib/IR/Verifier.cpp
