This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
Analysis/
-
TargetTransformInfo.h
-
TargetTransformInfoImpl.h
-
CodeGen/
2
BasicTTIImpl.h
-
lib/
-
Analysis/
-
CostModel.cpp
-
TargetTransformInfo.cpp
-
Target/
-
AArch64/
-
AArch64TargetTransformInfo.h
-
AArch64TargetTransformInfo.cpp
-
ARM/
-
ARMTargetTransformInfo.h
-
ARMTargetTransformInfo.cpp
-
PowerPC/
-
PPCTargetTransformInfo.h
-
PPCTargetTransformInfo.cpp
-
SystemZ/
-
SystemZISelLowering.cpp
3
SystemZTargetTransformInfo.h
3/23
SystemZTargetTransformInfo.cpp
-
X86/
-
X86TargetTransformInfo.h
-
X86TargetTransformInfo.cpp
-
Transforms/Vectorize/
-
Vectorize/
1/3
LoopVectorize.cpp
-
test/Analysis/CostModel/SystemZ/
-
Analysis/
-
CostModel/
-
SystemZ/
-
cmpsel.ll
-
fp-arith.ll
-
fp-cast.ll
-
int-arith.ll
-
int-cast.ll
-
load_store.ll
-
logical.ll

Differential D29631

SystemZTargetTransformInfo cost functions and some common code changes
ClosedPublic

Authored by jonpa on Feb 7 2017, 4:49 AM.

Download Raw Diff

Details

Reviewers

rengolin
delena
uweigand
mkuper
javed.absar
mssimpso
hfinkel

Summary

SystemZTargetTransformInfo methods implemented:

int getArithmeticInstrCost()
int getCastInstrCost()
int getCmpSelInstrCost()

Common code changes:

getCmpSelInstrCost() has gotten an extra parameter to make it possible to pass the actual instruction if it exists. The motivation for this is that the actual cost on SystemZ for compare and select instructions depend on the scalar widths of the vector elements. The vector element compare instruction produces a bitmask for the elements, and the vector select operates with that bitmask. However, if the widths of the elements of the compare / select operands differ, the bitmask must be adjusted with pack or unpack instructions for instance. Therefore it is useful to look at the "other" instruction when evaluating cost for the compare or select instruction.

LoopVectorizer.cpp: Don't consider a vectorized cost for the compare if it is for the conditional back branch in the loop latch.

New ovlerloaded method getScalarizationOverhead() which was factored out of getArithmeticInstrCost(). This method is useful also in the SystemZ backend.

The fix in InstCombineVectorOps.cpp is "in progress", which is obviously needed in some cases. See https://llvm.org/bugs/show_bug.cgi?id=30630

Diff Detail

Event Timeline

jonpa created this revision.Feb 7 2017, 4:49 AM

Herald added a reviewer: javed.absar. · View Herald TranscriptFeb 7 2017, 4:49 AM

Herald added subscribers: nemanjai, mzolotukhin. · View Herald Transcript

At second thought, I removed the InstCombiner changes from this review, since it is not yet quite ready, and the other changes do not depend on it.

getShuffleCost() implemented. This method is currently in a state of mild confusion: http://lists.llvm.org/pipermail/llvm-dev/2017-February/109978.html.

Minor fixes here and there.

Costs are currently based on number of vectors, which is intuitive to me. Perhaps the more correct way to do this is to use TLI->getTypeLegalizationCost()?

Values from these cost functions now seem quite precise overall. Exception to this is for z10 (generic), values for fptui (i8/i16) and uitofp (i64) might need some tweaking if found needed.

Arbitrary constant cost values have been used for: fp-select: 4 (conditional jump), libcall: 30.

The values for compare / select are now reflecting an upcoming patch for VSELECT, so before this can be commited, it must be approved: https://reviews.llvm.org/D29489.

Use getNumberOfParts() instead of dividing with 128, to get number of vector registers needed.
Add default value of nullptr for Instruction* argument of getCmpSelInstrCost() in targets.

getMemoryOpCost() implemented.
SK_Broadcast cost adjusted to reflect vlrep type instructions, which loads and replicates in a single instruction.

Allow any VF > 16 in SystemZTargetTransformInfo cost functions, since it may actually be queried (at least VF==32).

I've looked only at the SystemZ parts ... look basically good to me, but see a number of inline comments.

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
267	This seems to partially duplicate the logic in getMemoryOpCost ... is there a way to unify those?
337	Shouldn't we have And and Xor here as well?
391	Likewise ...
480	Can this not be handled generically via getElSizeLog2Diff like below for the unpack case?
525	These tables look odd ... but I guess if they accurately reflect the cost of the code that is currently being generated, this is fine with me until we improve codegen.
615	When do we get libcalls? At least with z196 and above this should never happen, so we might want to take the ISA level into account here.
731	This needs more explanation why this adjustment is needed in this case (and only in this case).

Patch updated according to review.

CostModel tests added. The CostModel tests reflect the current costsl while using undef operands extensively, which makes the returned values lower than what is typical, for the scalarized instructions. Would it be better to rewrite these tests to use a separate function with arguments for each case, to include the operands extraction cost?

The tests depend on a patch for getOperandsScalarizationOverhead(), for it to handle vector type arguments, when it's called by CostModel:

http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20170220/432062.html:
diff --git a/include/llvm/CodeGen/BasicTTIImpl.h b/include/llvm/CodeGen/BasicTTIImpl.h
index d9131da..12f67f9 100644

a/include/llvm/CodeGen/BasicTTIImpl.h

+++ b/include/llvm/CodeGen/BasicTTIImpl.h
@@ -312,9 +312,16 @@ public:

unsigned Cost = 0;
SmallPtrSet<const Value*, 4> UniqueOperands;
for (const Value *A : Args) {

if (UniqueOperands.insert(A).second)
Cost += getScalarizationOverhead(VectorType::get(A->getType(), VF),
false, true);

+ if (!isa<UndefValue>(A) && UniqueOperands.insert(A).second) {
+ Type *VecTy = nullptr;
+ if (A->getType()->isVectorTy()) {
+ assert (VF == A->getType()->getVectorNumElements());
+ VecTy = A->getType();
+ }
+ else
+ VecTy = VectorType::get(A->getType(), VF);
+ Cost += getScalarizationOverhead(VecTy, false, true);
+ }

  }
  return Cost;
}

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
267	Yes - replaced it with a call to getMemoryOpCost() which returns the number of instructions just the same.
337	No - And and Xor are 'Legal', while the ones handled here are 'Custom'. Only for 'Custom' is this needed, since the default implementation then assumes it is twice as expensive, while it is actually not on z13.
391	(same reason as above)
480	Computed by means of a loop instead. Special case also handled.
525	Yes - that was the intent.
615	This was for i128, which shouldn't be there I suppose, so I removed it. You get a libcall if you generate a function like that, but I have not seen it in benchmarks.
731	I refactored the cost computation for a vector truncation into a new function, used both for vector truncate and vector select, where the selected element type is smaller than the compared elements.

See inline comments.

The new test cases look good to me, thanks!

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
337	I see. However, for vector types, Or is also marked as "Legal" -- only for scalar i64 is it Custom.
770	The original code in getUnrollingPreferences also used 2 for 128-bit integer types. Should this be done now here as well?

Updated per review. See inline comments.

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
337	Ahh, thanks. Removed Or.
770	That's never seen by the LoopVectorizer, but I think you are right that it should be handled here since it's actually handled by the backend. Thanks. Changed to check for scalars of 128 bits.

The SystemZ parts LGTM now.

Great!

The rest of the review shouldn't be to much to go through:

getCmpSelInstrCost() has gotten a new default nullptr parameter for the Instruction. This is so that when the instruction is available, it can be passed. This is needed for SystemZ to estimate how many instructions are needed in addition to the vector compare and vector select instructions (it depends on element widths). LoopVectorizer and CostModel passes the instruction, and elswhere it is just ignored. Target implementation has also gotten the (unused) argument in declaration and definition of the method.

BasicTTIImpl has gotten a new method getScalarizationOverhead(), that contains factored-out code from getArithmeticInstrCost(), so that the SystemZ (and potentially others) implementation can use it.

LoopVectorizer: Don't consider the compare in the loop latch (used by the conditional back-branch), to be vectorized. This doesn't make sense, it will always be scalar, right? Renato?

jonpa added reviewers: rengolin, delena.Feb 23 2017, 7:00 AM

In D29631#684674, @jonpa wrote:

getCmpSelInstrCost() has gotten a new default nullptr parameter for the Instruction. This is so that when the instruction is available, it can be passed. This is needed for SystemZ to estimate how many instructions are needed in addition to the vector compare and vector select instructions (it depends on element widths). LoopVectorizer and CostModel passes the instruction, and elswhere it is just ignored. Target implementation has also gotten the (unused) argument in declaration and definition of the method.

This doesn't look bad, but maybe Matthew/Michael have a different plan for such problem. I'm personally ok with this.

BasicTTIImpl has gotten a new method getScalarizationOverhead(), that contains factored-out code from getArithmeticInstrCost(), so that the SystemZ (and potentially others) implementation can use it.

I think you could simplify that by implementing the "empty args" logic inside getOperandsScalarizationOverhead.

LoopVectorizer: Don't consider the compare in the loop latch (used by the conditional back-branch), to be vectorized. This doesn't make sense, it will always be scalar, right? Renato?

I'm not sure it has to be always scalar. If you have masked vector instructions, the latch could be a vector comparison. Or maybe I didn't get what the problem is. Can you give an example?

--renato

include/llvm/CodeGen/BasicTTIImpl.h
334	This sounds like it should be implemented inside `getOperandsScalarizationOverhead`
lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
711	Code style, try clang-format.
lib/Target/SystemZ/SystemZTargetTransformInfo.h
30	Shouldn't this just be the cost of a call, rather than being a magic constant?
58	Where is this used?

mssimpso added inline comments.Feb 24 2017, 9:25 AM

lib/Transforms/Vectorize/LoopVectorize.cpp
7231–7239	Hi, I've only looked at the vectorizer change here, but this code is not needed. Before computing costs, we collect the uniform values in collectLoopUniforms(). ICmp instructions of the kind here are marked uniform. Then in getInstructionCost(), we check if an instruction is uniform (isUniformAfterVectorization()), and if so, always return "1" for the cost, regardless of VF. Also, floating-point induction variables aren't allowed to be "primary" induction variables. So it shouldn't be the case that you would have a FCmp feeding the back edge branch.

mssimpso added inline comments.Feb 24 2017, 9:30 AM

lib/Transforms/Vectorize/LoopVectorize.cpp
7231–7239	Correction: we always return the cost of the scalar compare if it is uniform, regardless of VF.

Removed experimental heuristic that checked the context of compare instruction in LoopVectorize.cpp.

Just as with getCmpSelInstrCost() and for the same reasons, I added a a new Instruction* argument to getCastInstrCost(). Instruction pointers now passed from everywhere possible, I hope. Moved the assert of the right opcode for passed instruction up in class hiearchy into TargetTransformInfo.cpp.

SystemZTargetTransformInfo.cpp:

Handling of i1 extensions (temporary tables removed)
factored out a new function getVectorBitMaskConversionCost() from getCmpSelInstrCost(), and reused it for the i1 vector zext/sext instructions.
Scalar XOr cost fixed.
Allow a noop-truncation query.
The cost for a compare has been adjusted to reflect the improvement in the dependent vselect patch: There will no longer always be a vector compare for each vector select.
Removed experimental heuristic that checked the context of compare instruction.
New tests cmp-ext.ll and scalar-cmp-cmp-log-sel.ll

@Renato:

I think you could simplify that by implementing the "empty args" logic inside getOperandsScalarizationOverhead()

I tried this by added a third argument to getOperandsScalarizationOverhead(). LoopVectorizer calls this when it knows it has the arguments, so it doesn't need to pass it, so therefore it is default nullptr.

Is this looking better than before?

I'm not sure it has to be always scalar. If you have masked vector instructions, the latch could be a vector comparison. Or maybe I didn't get what the problem is. Can you give an example?

Removed

The suggested refactoring of getScalarizationOverhead() was changed back, because it was tricky enough to keep track of the different contexts getOperandsScalarizationOverhead() was used in, without a third argument. This seemed right after r297705.

SystemZ part:

sdiv/udiv scalar costs adjusted.
getVectorInstrCost() implemented for SystemZ (vlvgp)

Does anybody have time to look over if the common code changes are ok? Even though these changes are not very lengthy at all, I wonder if it would help if I split it up into separate reviews?

SystemZ back-end changes and test cases still LGTM. Thanks!

jonpa added inline comments.Mar 17 2017, 12:40 AM

include/llvm/CodeGen/BasicTTIImpl.h
317	This part has been approved by Hal Finkel on llvm-commits already (just the first part here that handles scalar/vector argument types).
lib/Target/SystemZ/SystemZTargetTransformInfo.h
30	Maybe, but that is also a magic constant of 10 :-) Is there any point? This is used just for FRem.
lib/Transforms/Vectorize/LoopVectorize.cpp
7231–7239	Thanks for explaining - I removed this from patch.

Patch improved:

SystemZ memory accesses inteleaving enabled, and getInterleavedMemoryOpCost() implemented. Tested with: test/Transforms/LoopVectorize/SystemZ/mem-interleaving-costs.ll
getMemoryOpCost() improved by passing the Instruction pointer, so that operations that fold a load can be considered. Tested with: test/Analysis/CostModel/SystemZ/memop-folding-int-arith.ll
BasicTTIImpl.h getCastInstrCost() improved to check for legal extending loads, in which case the cost of the z/sext instruction becomes 0. Tested with: test/Analysis/CostModel/SystemZ/ext-load.ll

Sofar, the passing of the Instruction to the cost methods have been added for:
getCastInstrCost(): For z/sext: check if the operand is a load, and in case target supports a legal matching Z/SEXTLOAD, return 0.
getCmpSelInstrCost(): SystemZ needs to check the def of the vector select mask, in order to compute the extra cost in case the vector compare had a different operand type.
getMemoryOpCost(): SystemZ checks if a load has a single user which is an (arithmetic) instruction that will fold the load into one of its operands

getOperandsScalarizationOverhead(): the assert has been changed so that the passed VF can either be 1 or match the VecTy number of elements. This is needed so that things work from different call contexts.

Added a slight penalty for moving registers out of vector pipeline into FXU units.
'bool isFPVectorizationPotentiallyUnsafe()' removed from SystemZ, since it has the same default implementation.
New CostModel tests for shuffles and extract/insert element.

Other:
Experimental option removed (CheckFoldedReloads)
New comments about fp32 -- they are expanded and Shuffles and Extracts should actually be free.

PING!

Could anyone please take a minute to review the common code changes?

See one inline comment. Otherwise the SystemZ changes still LGTM, and we now also have positive benchmark results for this change ...

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
758	Why just index == 0 ? Shouldn't the penalty apply to any element?

jonpa added inline comments.Apr 10 2017, 11:39 PM

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
758	While the extraction of any element still has a modeled cost of 1 per element, my idea with this is to penalize further the delay of the act of moving out of the vector pipeline. This would then not be per element, but rather for the whole vector register (thus added only once, at index 0). My assumption is that this happens at the point of scalarization, when all vector elements are extracted.

uweigand added inline comments.Apr 11 2017, 3:27 PM

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
758	Why would this not be per element? Moving from the vector to the integer pipeline always has the higher latency, for every element. In the end any such extracton gets implemented using a VLVG* instruction, which simply is more expensive than other vector instructions ... (Note that in the scheduler, the higher latency for VLVG* is already modeled correctly.)

jonpa added inline comments.Apr 11 2017, 10:27 PM

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp
758	I agree that this should be experimented further with, but the practical argument right now is that I added the smallest possible extra penalty to get rid of a particular regression while affecting code as little as possible. I consider this to be just a first step with obvious improvements in sight for the vectorizers decisions. I thought the cost here was mainly that the vector register has to pass through the whole vector pipeline. I also imagine the VLGV* instructions would be pipelined, so that the cost of them is not a linear sum in the number of elements? Or is it really twice as expensive to scalarize a <4 x i32> rather than a <2 x i64>?

ping!

The common code parts needs review -- the SystemZ parts are done and proven on benchmarks.

The instruction pointer has been added as a default nullptr argument to: getCastInstrCost(), getCmpSelInstrCost(), getMemoryOpCost(), and the instruction pointer is passed whenever possible by the caller.
@Hal: If I understood you correctly, you are ok with this?

BasicTTIImpl.h:

getCastInstrCost() improved to check for legal extending loads, in which case the cost of the z/sext instruction becomes 0. Tested with: test/Analysis/CostModel/SystemZ/ext-load.ll This is good for SystemZ, and hopefully for any target?

new overloaded method getScalarizationOverhead() contains factored-out code from getArithmeticInstrCost(), so that the SystemZ (and potentially others) implementation can use it.

@Renato: I tried your suggestion, but then went back because it became messy. Are you ok with this at least for now?

in getOperandsScalarizationOverhead(): the assert has been changed so that the passed VF can either be 1 or match the VecTy number of elements. This is needed so that things work from different call contexts.

Hi Jonas,

Really sorry for not getting back at this earlier. The generic changes are not intrusive enough to warrant a full refactoring, and they're mostly mechanical than existential.

However, I do spot a few problems with this approach, which can be fixed in following patches to the generic parts, since this is now holding a big change in SystemZ.

Right now, some cost functions receive the opcode only, thus not having the same conflict resolution power. Most don't need it, but the choice is currently based on one target's specific needs. Other targets may need to inspect operands and uses of other instructions, and having a different interface for the same kind of functions will confuse the hell out of anyone trying to add costs to their targets. :)

We need to make this more generic and actually pass the instruction instead of the opcode for all, which would clean up the whole mess. But that's for another day.

As is, Hal seems happy with the generic parts, so am I. Ulrich is happy with the System Z parts, so there's no reason not to approve this patch.

Thanks for the large contribution and the patience, and sorry it took so long. LGTM.

cheers,
--renato

This revision is now accepted and ready to land.Apr 12 2017, 2:31 AM

Thanks for review.
r300052

Hum, seems some of the costs are wrong in your tests:

http://lab.llvm.org:8011/builders/clang-cmake-aarch64-39vma/builds/5564

Weird that you didn't see those in your machine...

cheers,
--renato

In D29631#724772, @rengolin wrote:

Hum, seems some of the costs are wrong in your tests:

http://lab.llvm.org:8011/builders/clang-cmake-aarch64-39vma/builds/5564

Weird that you didn't see those in your machine...

cheers,
--renato

Sorry - one test update unfortunately didn't go in at the first attempt, but it's fixed now.

In D29631#724779, @jonpa wrote:

Sorry - one test update unfortunately didn't go in at the first attempt, but it's fixed now.

Still isn't:

http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/6066

http://lab.llvm.org:8011/builders/clang-cmake-thumbv7-a15/builds/6059

http://lab.llvm.org:8011/builders/clang-cmake-aarch64-quick/builds/5372

The errors are like:

; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = and <4 x i64> undef, undef

<stdin>:13:1: note: scanning from here
Cost Model: Found an estimated cost of 1 for instruction: %res11 = and <4 x i64> undef, undef

All of them. This should have been caught in your testing and I'm not sure why it hasn't. This breaks both 32 and 64 bits ARM.

Let's revert the patch and I can help you understand the problem once the bots are green.

cheers,
--renato

All these tests fail on Hexagon as well. From what I found out, it seems that "opt -analyze" ignores the mtriple option---you could set it to xyz and the test would still run and produce some output. The Hexagon bot only builds the Hexagon backend, and sets the default triple to hexagon-unknown-elf. I'm guessing that this causes Hexagon's costs to be printed regardless of the mtriple setting, which could lead to these failures.

In D29631#724916, @kparzysz wrote:

All these tests fail on Hexagon as well. From what I found out, it seems that "opt -analyze" ignores the mtriple option---you could set it to xyz and the test would still run and produce some output. The Hexagon bot only builds the Hexagon backend, and sets the default triple to hexagon-unknown-elf. I'm guessing that this causes Hexagon's costs to be printed regardless of the mtriple setting, which could lead to these failures.

Ah! Makes sense. Then maybe just adding the lit.config file to SystemZ directory would do?

In D29631#724995, @rengolin wrote:

Ah! Makes sense. Then maybe just adding the lit.config file to SystemZ directory would do?

If you can force a specific triple to be used, then yes. If you can only require a specific target to be present, then it may happen to avoid the failures, but there is no guarantee that it will always work.

In D29631#725026, @kparzysz wrote:

In D29631#724995, @rengolin wrote:

Ah! Makes sense. Then maybe just adding the lit.config file to SystemZ directory would do?

If you can force a specific triple to be used, then yes. If you can only require a specific target to be present, then it may happen to avoid the failures, but there is no guarantee that it will always work.

Ok, I created the lit.config file like the others in r300078

In D29631#725041, @rengolin wrote:

Ok, I created the lit.config file like the others in r300078

And r300081 takes care of the rest... :(

In D29631#725109, @rengolin wrote:

And r300081 takes care of the rest... :(

Thanks Renato!

Sorry for this and thank you!

jonpa closed this revision.Apr 12 2017, 10:18 PM

Revision Contents

Path

Size

include/

llvm/

Analysis/

TargetTransformInfo.h

13 lines

TargetTransformInfoImpl.h

3 lines

CodeGen/

BasicTTIImpl.h

32 lines

lib/

Analysis/

CostModel.cpp

4 lines

TargetTransformInfo.cpp

4 lines

Target/

AArch64/

AArch64TargetTransformInfo.h

3 lines

AArch64TargetTransformInfo.cpp

4 lines

ARM/

ARMTargetTransformInfo.h

3 lines

ARMTargetTransformInfo.cpp

5 lines

PowerPC/

PPCTargetTransformInfo.h

3 lines

PPCTargetTransformInfo.cpp

5 lines

SystemZ/

SystemZISelLowering.cpp

4 lines

SystemZTargetTransformInfo.h

18 lines

SystemZTargetTransformInfo.cpp

466 lines

X86/

X86TargetTransformInfo.h

3 lines

X86TargetTransformInfo.cpp

5 lines

Transforms/

Vectorize/

LoopVectorize.cpp

17 lines

test/

Analysis/

CostModel/

SystemZ/

1806 lines

119 lines

541 lines

326 lines

199 lines

137 lines

277 lines

Diff 89362

include/llvm/Analysis/TargetTransformInfo.h

Show First 20 Lines • Show All 569 Lines • ▼ Show 20 Lines	public:
/// -1 to indicate that there is no information about the index value.		/// -1 to indicate that there is no information about the index value.
int getExtractWithExtendCost(unsigned Opcode, Type Dst, VectorType VecTy,		int getExtractWithExtendCost(unsigned Opcode, Type Dst, VectorType VecTy,
unsigned Index = -1) const;		unsigned Index = -1) const;

/// \return The expected cost of control-flow related instructions such as		/// \return The expected cost of control-flow related instructions such as
/// Phi, Ret, Br.		/// Phi, Ret, Br.
int getCFInstrCost(unsigned Opcode) const;		int getCFInstrCost(unsigned Opcode) const;

/// \returns The expected cost of compare and select instructions.		/// \returns The expected cost of compare and select instructions. If there
		/// is an existing instruction that holds Opcode, it may be passed in the
		/// 'I' parameter.
int getCmpSelInstrCost(unsigned Opcode, Type *ValTy,		int getCmpSelInstrCost(unsigned Opcode, Type *ValTy,
Type *CondTy = nullptr) const;		Type CondTy = nullptr, const Instruction I = nullptr) const;

/// \return The expected cost of vector Insert and Extract.		/// \return The expected cost of vector Insert and Extract.
/// Use -1 to indicate that there is no information on the index value.		/// Use -1 to indicate that there is no information on the index value.
int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index = -1) const;		int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index = -1) const;

/// \return The cost of Load and Store instructions.		/// \return The cost of Load and Store instructions.
int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace) const;		unsigned AddressSpace) const;
▲ Show 20 Lines • Show All 215 Lines • ▼ Show 20 Lines	getArithmeticInstrCost(unsigned Opcode, Type *Ty, OperandValueKind Opd1Info,
ArrayRef<const Value *> Args) = 0;		ArrayRef<const Value *> Args) = 0;
virtual int getShuffleCost(ShuffleKind Kind, Type *Tp, int Index,		virtual int getShuffleCost(ShuffleKind Kind, Type *Tp, int Index,
Type *SubTp) = 0;		Type *SubTp) = 0;
virtual int getCastInstrCost(unsigned Opcode, Type Dst, Type Src) = 0;		virtual int getCastInstrCost(unsigned Opcode, Type Dst, Type Src) = 0;
virtual int getExtractWithExtendCost(unsigned Opcode, Type *Dst,		virtual int getExtractWithExtendCost(unsigned Opcode, Type *Dst,
VectorType *VecTy, unsigned Index) = 0;		VectorType *VecTy, unsigned Index) = 0;
virtual int getCFInstrCost(unsigned Opcode) = 0;		virtual int getCFInstrCost(unsigned Opcode) = 0;
virtual int getCmpSelInstrCost(unsigned Opcode, Type *ValTy,		virtual int getCmpSelInstrCost(unsigned Opcode, Type *ValTy,
Type *CondTy) = 0;		Type CondTy, const Instruction I) = 0;
virtual int getVectorInstrCost(unsigned Opcode, Type *Val,		virtual int getVectorInstrCost(unsigned Opcode, Type *Val,
unsigned Index) = 0;		unsigned Index) = 0;
virtual int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		virtual int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace) = 0;		unsigned AddressSpace) = 0;
virtual int getMaskedMemoryOpCost(unsigned Opcode, Type *Src,		virtual int getMaskedMemoryOpCost(unsigned Opcode, Type *Src,
unsigned Alignment,		unsigned Alignment,
unsigned AddressSpace) = 0;		unsigned AddressSpace) = 0;
virtual int getGatherScatterOpCost(unsigned Opcode, Type *DataTy,		virtual int getGatherScatterOpCost(unsigned Opcode, Type *DataTy,
▲ Show 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	public:
}		}
int getExtractWithExtendCost(unsigned Opcode, Type Dst, VectorType VecTy,		int getExtractWithExtendCost(unsigned Opcode, Type Dst, VectorType VecTy,
unsigned Index) override {		unsigned Index) override {
return Impl.getExtractWithExtendCost(Opcode, Dst, VecTy, Index);		return Impl.getExtractWithExtendCost(Opcode, Dst, VecTy, Index);
}		}
int getCFInstrCost(unsigned Opcode) override {		int getCFInstrCost(unsigned Opcode) override {
return Impl.getCFInstrCost(Opcode);		return Impl.getCFInstrCost(Opcode);
}		}
int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) override {		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
return Impl.getCmpSelInstrCost(Opcode, ValTy, CondTy);		const Instruction *I) override {
		return Impl.getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
}		}
int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) override {		int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) override {
return Impl.getVectorInstrCost(Opcode, Val, Index);		return Impl.getVectorInstrCost(Opcode, Val, Index);
}		}
int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace) override {		unsigned AddressSpace) override {
return Impl.getMemoryOpCost(Opcode, Src, Alignment, AddressSpace);		return Impl.getMemoryOpCost(Opcode, Src, Alignment, AddressSpace);
}		}
▲ Show 20 Lines • Show All 188 Lines • Show Last 20 Lines

include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 330 Lines • ▼ Show 20 Lines	public:

unsigned getExtractWithExtendCost(unsigned Opcode, Type *Dst,		unsigned getExtractWithExtendCost(unsigned Opcode, Type *Dst,
VectorType *VecTy, unsigned Index) {		VectorType *VecTy, unsigned Index) {
return 1;		return 1;
}		}

unsigned getCFInstrCost(unsigned Opcode) { return 1; }		unsigned getCFInstrCost(unsigned Opcode) { return 1; }

unsigned getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) {		unsigned getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I) {
return 1;		return 1;
}		}

unsigned getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) {		unsigned getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) {
return 1;		return 1;
}		}

unsigned getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		unsigned getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
▲ Show 20 Lines • Show All 326 Lines • Show Last 20 Lines

include/llvm/CodeGen/BasicTTIImpl.h

Show First 20 Lines • Show All 308 Lines • ▼ Show 20 Lines	public:
/// Estimate the overhead of scalarizing an instructions unique operands.		/// Estimate the overhead of scalarizing an instructions unique operands.
unsigned getOperandsScalarizationOverhead(ArrayRef<const Value *> Args,		unsigned getOperandsScalarizationOverhead(ArrayRef<const Value *> Args,
unsigned VF) {		unsigned VF) {
unsigned Cost = 0;		unsigned Cost = 0;
SmallPtrSet<const Value*, 4> UniqueOperands;		SmallPtrSet<const Value*, 4> UniqueOperands;
for (const Value *A : Args) {		for (const Value *A : Args) {
if (UniqueOperands.insert(A).second)		if (UniqueOperands.insert(A).second)
Cost += getScalarizationOverhead(VectorType::get(A->getType(), VF),		Cost += getScalarizationOverhead(VectorType::get(A->getType(), VF),
false, true);		false, true);
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions This part has been approved by Hal Finkel on llvm-commits already (just the first part here that handles scalar/vector argument types). jonpa: This part has been approved by Hal Finkel on llvm-commits already (just the first part here…
}		}
return Cost;		return Cost;
}		}

		unsigned getScalarizationOverhead(Type VecTy, ArrayRef<const Value > Args) {
		assert (VecTy->isVectorTy());

		unsigned Cost = 0;

		Cost += getScalarizationOverhead(VecTy, true, false);
		if (!Args.empty())
		Cost += getOperandsScalarizationOverhead(Args,
		VecTy->getVectorNumElements());
		else
		// When no information on arguments is provided, we add the cost
		// associated with one argument as a heuristic.
		Cost += getScalarizationOverhead(VecTy, false, true);
		rengolinUnsubmitted Not Done Reply Inline Actions This sounds like it should be implemented inside `getOperandsScalarizationOverhead` rengolin: This sounds like it should be implemented inside `getOperandsScalarizationOverhead`

		return Cost;
		}

unsigned getMaxInterleaveFactor(unsigned VF) { return 1; }		unsigned getMaxInterleaveFactor(unsigned VF) { return 1; }

unsigned getArithmeticInstrCost(		unsigned getArithmeticInstrCost(
unsigned Opcode, Type *Ty,		unsigned Opcode, Type *Ty,
TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,
TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,
TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,
TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,
Show All 26 Lines	unsigned getArithmeticInstrCost(
// TODO: If one of the types get legalized by splitting, handle this		// TODO: If one of the types get legalized by splitting, handle this
// similarly to what getCastInstrCost() does.		// similarly to what getCastInstrCost() does.
if (Ty->isVectorTy()) {		if (Ty->isVectorTy()) {
unsigned Num = Ty->getVectorNumElements();		unsigned Num = Ty->getVectorNumElements();
unsigned Cost = static_cast<T *>(this)		unsigned Cost = static_cast<T *>(this)
->getArithmeticInstrCost(Opcode, Ty->getScalarType());		->getArithmeticInstrCost(Opcode, Ty->getScalarType());
// Return the cost of multiple scalar invocation plus the cost of		// Return the cost of multiple scalar invocation plus the cost of
// inserting and extracting the values.		// inserting and extracting the values.
unsigned TotCost = getScalarizationOverhead(Ty, true, false) + Num * Cost;		return getScalarizationOverhead(Ty, Args) + Num * Cost;
if (!Args.empty())
TotCost += getOperandsScalarizationOverhead(Args, Num);
else
// When no information on arguments is provided, we add the cost
// associated with one argument as a heuristic.
TotCost += getScalarizationOverhead(Ty, false, true);

return TotCost;
}		}

// We don't know anything about this scalar instruction.		// We don't know anything about this scalar instruction.
return OpCost;		return OpCost;
}		}

unsigned getShuffleCost(TTI::ShuffleKind Kind, Type *Tp, int Index,		unsigned getShuffleCost(TTI::ShuffleKind Kind, Type *Tp, int Index,
Type *SubTp) {		Type *SubTp) {
▲ Show 20 Lines • Show All 126 Lines • ▼ Show 20 Lines	return static_cast<T *>(this)->getVectorInstrCost(
VecTy->getElementType());		VecTy->getElementType());
}		}

unsigned getCFInstrCost(unsigned Opcode) {		unsigned getCFInstrCost(unsigned Opcode) {
// Branches are assumed to be predicted.		// Branches are assumed to be predicted.
return 0;		return 0;
}		}

unsigned getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) {		unsigned getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I) {
const TargetLoweringBase *TLI = getTLI();		const TargetLoweringBase *TLI = getTLI();
int ISD = TLI->InstructionOpcodeToISD(Opcode);		int ISD = TLI->InstructionOpcodeToISD(Opcode);
assert(ISD && "Invalid opcode");		assert(ISD && "Invalid opcode");

// Selects on vectors are actually vector selects.		// Selects on vectors are actually vector selects.
if (ISD == ISD::SELECT) {		if (ISD == ISD::SELECT) {
assert(CondTy && "CondTy must exist");		assert(CondTy && "CondTy must exist");
if (CondTy->isVectorTy())		if (CondTy->isVectorTy())
Show All 11 Lines	unsigned getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
// Otherwise, assume that the cast is scalarized.		// Otherwise, assume that the cast is scalarized.
// TODO: If one of the types get legalized by splitting, handle this		// TODO: If one of the types get legalized by splitting, handle this
// similarly to what getCastInstrCost() does.		// similarly to what getCastInstrCost() does.
if (ValTy->isVectorTy()) {		if (ValTy->isVectorTy()) {
unsigned Num = ValTy->getVectorNumElements();		unsigned Num = ValTy->getVectorNumElements();
if (CondTy)		if (CondTy)
CondTy = CondTy->getScalarType();		CondTy = CondTy->getScalarType();
unsigned Cost = static_cast<T *>(this)->getCmpSelInstrCost(		unsigned Cost = static_cast<T *>(this)->getCmpSelInstrCost(
Opcode, ValTy->getScalarType(), CondTy);		Opcode, ValTy->getScalarType(), CondTy, I);

// Return the cost of multiple scalar invocation plus the cost of		// Return the cost of multiple scalar invocation plus the cost of
// inserting and extracting the values.		// inserting and extracting the values.
return getScalarizationOverhead(ValTy, true, false) + Num * Cost;		return getScalarizationOverhead(ValTy, true, false) + Num * Cost;
}		}

// Unknown scalar opcode.		// Unknown scalar opcode.
return 1;		return 1;
▲ Show 20 Lines • Show All 501 Lines • Show Last 20 Lines

lib/Analysis/CostModel.cpp

Show First 20 Lines • Show All 441 Lines • ▼ Show 20 Lines	case Instruction::Xor: {
return TTI->getArithmeticInstrCost(I->getOpcode(), I->getType(), Op1VK,		return TTI->getArithmeticInstrCost(I->getOpcode(), I->getType(), Op1VK,
Op2VK, TargetTransformInfo::OP_None,		Op2VK, TargetTransformInfo::OP_None,
TargetTransformInfo::OP_None,		TargetTransformInfo::OP_None,
Operands);		Operands);
}		}
case Instruction::Select: {		case Instruction::Select: {
const SelectInst *SI = cast<SelectInst>(I);		const SelectInst *SI = cast<SelectInst>(I);
Type *CondTy = SI->getCondition()->getType();		Type *CondTy = SI->getCondition()->getType();
return TTI->getCmpSelInstrCost(I->getOpcode(), I->getType(), CondTy);		return TTI->getCmpSelInstrCost(I->getOpcode(), I->getType(), CondTy, I);
}		}
case Instruction::ICmp:		case Instruction::ICmp:
case Instruction::FCmp: {		case Instruction::FCmp: {
Type *ValTy = I->getOperand(0)->getType();		Type *ValTy = I->getOperand(0)->getType();
return TTI->getCmpSelInstrCost(I->getOpcode(), ValTy);		return TTI->getCmpSelInstrCost(I->getOpcode(), ValTy, I->getType(), I);
}		}
case Instruction::Store: {		case Instruction::Store: {
const StoreInst *SI = cast<StoreInst>(I);		const StoreInst *SI = cast<StoreInst>(I);
Type *ValTy = SI->getValueOperand()->getType();		Type *ValTy = SI->getValueOperand()->getType();
return TTI->getMemoryOpCost(I->getOpcode(), ValTy,		return TTI->getMemoryOpCost(I->getOpcode(), ValTy,
SI->getAlignment(),		SI->getAlignment(),
SI->getPointerAddressSpace());		SI->getPointerAddressSpace());
}		}
▲ Show 20 Lines • Show All 115 Lines • Show Last 20 Lines

lib/Analysis/TargetTransformInfo.cpp

	Show First 20 Lines • Show All 323 Lines • ▼ Show 20 Lines

	int TargetTransformInfo::getCFInstrCost(unsigned Opcode) const {			int TargetTransformInfo::getCFInstrCost(unsigned Opcode) const {
	int Cost = TTIImpl->getCFInstrCost(Opcode);			int Cost = TTIImpl->getCFInstrCost(Opcode);
	assert(Cost >= 0 && "TTI should not produce negative costs!");			assert(Cost >= 0 && "TTI should not produce negative costs!");
	return Cost;			return Cost;
	}			}

	int TargetTransformInfo::getCmpSelInstrCost(unsigned Opcode, Type *ValTy,			int TargetTransformInfo::getCmpSelInstrCost(unsigned Opcode, Type *ValTy,
	Type *CondTy) const {			Type CondTy, const Instruction I) const {
	int Cost = TTIImpl->getCmpSelInstrCost(Opcode, ValTy, CondTy);			int Cost = TTIImpl->getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
	assert(Cost >= 0 && "TTI should not produce negative costs!");			assert(Cost >= 0 && "TTI should not produce negative costs!");
	return Cost;			return Cost;
	}			}

	int TargetTransformInfo::getVectorInstrCost(unsigned Opcode, Type *Val,			int TargetTransformInfo::getVectorInstrCost(unsigned Opcode, Type *Val,
	unsigned Index) const {			unsigned Index) const {
	int Cost = TTIImpl->getVectorInstrCost(Opcode, Val, Index);			int Cost = TTIImpl->getVectorInstrCost(Opcode, Val, Index);
	assert(Cost >= 0 && "TTI should not produce negative costs!");			assert(Cost >= 0 && "TTI should not produce negative costs!");
	▲ Show 20 Lines • Show All 187 Lines • Show Last 20 Lines

lib/Target/AArch64/AArch64TargetTransformInfo.h

Show First 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	int getArithmeticInstrCost(
TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,
TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,
TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,
TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,
ArrayRef<const Value > Args = ArrayRef<const Value >());		ArrayRef<const Value > Args = ArrayRef<const Value >());

int getAddressComputationCost(Type Ty, ScalarEvolution SE, const SCEV *Ptr);		int getAddressComputationCost(Type Ty, ScalarEvolution SE, const SCEV *Ptr);

int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy);		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I = nullptr);

int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace);		unsigned AddressSpace);

int getCostOfKeepingLiveOverCall(ArrayRef<Type *> Tys);		int getCostOfKeepingLiveOverCall(ArrayRef<Type *> Tys);

void getUnrollingPreferences(Loop *L, TTI::UnrollingPreferences &UP);		void getUnrollingPreferences(Loop *L, TTI::UnrollingPreferences &UP);

Show All 22 Lines

lib/Target/AArch64/AArch64TargetTransformInfo.cpp

Show First 20 Lines • Show All 430 Lines • ▼ Show 20 Lines	if (Ty->isVectorTy() && SE &&
return NumVectorInstToHideOverhead;		return NumVectorInstToHideOverhead;

// In many cases the address computation is not merged into the instruction		// In many cases the address computation is not merged into the instruction
// addressing mode.		// addressing mode.
return 1;		return 1;
}		}

int AArch64TTIImpl::getCmpSelInstrCost(unsigned Opcode, Type *ValTy,		int AArch64TTIImpl::getCmpSelInstrCost(unsigned Opcode, Type *ValTy,
Type *CondTy) {		Type CondTy, const Instruction I) {

int ISD = TLI->InstructionOpcodeToISD(Opcode);		int ISD = TLI->InstructionOpcodeToISD(Opcode);
// We don't lower some vector selects well that are wider than the register		// We don't lower some vector selects well that are wider than the register
// width.		// width.
if (ValTy->isVectorTy() && ISD == ISD::SELECT) {		if (ValTy->isVectorTy() && ISD == ISD::SELECT) {
// We would need this many instructions to hide the scalarization happening.		// We would need this many instructions to hide the scalarization happening.
const int AmortizationCost = 20;		const int AmortizationCost = 20;
static const TypeConversionCostTblEntry		static const TypeConversionCostTblEntry
Show All 10 Lines	if (ValTy->isVectorTy() && ISD == ISD::SELECT) {
EVT SelValTy = TLI->getValueType(DL, ValTy);		EVT SelValTy = TLI->getValueType(DL, ValTy);
if (SelCondTy.isSimple() && SelValTy.isSimple()) {		if (SelCondTy.isSimple() && SelValTy.isSimple()) {
if (const auto *Entry = ConvertCostTableLookup(VectorSelectTbl, ISD,		if (const auto *Entry = ConvertCostTableLookup(VectorSelectTbl, ISD,
SelCondTy.getSimpleVT(),		SelCondTy.getSimpleVT(),
SelValTy.getSimpleVT()))		SelValTy.getSimpleVT()))
return Entry->Cost;		return Entry->Cost;
}		}
}		}
return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy);		return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
}		}

int AArch64TTIImpl::getMemoryOpCost(unsigned Opcode, Type *Ty,		int AArch64TTIImpl::getMemoryOpCost(unsigned Opcode, Type *Ty,
unsigned Alignment, unsigned AddressSpace) {		unsigned Alignment, unsigned AddressSpace) {
auto LT = TLI->getTypeLegalizationCost(DL, Ty);		auto LT = TLI->getTypeLegalizationCost(DL, Ty);

if (ST->isMisaligned128StoreSlow() && Opcode == Instruction::Store &&		if (ST->isMisaligned128StoreSlow() && Opcode == Instruction::Store &&
LT.second.is128BitVector() && Alignment < 16) {		LT.second.is128BitVector() && Alignment < 16) {
▲ Show 20 Lines • Show All 171 Lines • Show Last 20 Lines

lib/Target/ARM/ARMTargetTransformInfo.h

Show First 20 Lines • Show All 90 Lines • ▼ Show 20 Lines	public:
unsigned getMaxInterleaveFactor(unsigned VF) {		unsigned getMaxInterleaveFactor(unsigned VF) {
return ST->getMaxInterleaveFactor();		return ST->getMaxInterleaveFactor();
}		}

int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);		int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);

int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);		int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);

int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy);		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I = nullptr);

int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);		int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);

int getAddressComputationCost(Type Val, ScalarEvolution SE,		int getAddressComputationCost(Type Val, ScalarEvolution SE,
const SCEV *Ptr);		const SCEV *Ptr);

int getFPOpCost(Type *Ty);		int getFPOpCost(Type *Ty);

Show All 30 Lines

lib/Target/ARM/ARMTargetTransformInfo.cpp

Show First 20 Lines • Show All 304 Lines • ▼ Show 20 Lines	if ((Opcode == Instruction::InsertElement \|\|
if (ValTy->isVectorTy() &&		if (ValTy->isVectorTy() &&
ValTy->getScalarSizeInBits() <= 32)		ValTy->getScalarSizeInBits() <= 32)
return std::max(BaseT::getVectorInstrCost(Opcode, ValTy, Index), 2U);		return std::max(BaseT::getVectorInstrCost(Opcode, ValTy, Index), 2U);
}		}

return BaseT::getVectorInstrCost(Opcode, ValTy, Index);		return BaseT::getVectorInstrCost(Opcode, ValTy, Index);
}		}

int ARMTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) {		int ARMTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I) {

int ISD = TLI->InstructionOpcodeToISD(Opcode);		int ISD = TLI->InstructionOpcodeToISD(Opcode);
// On NEON a a vector select gets lowered to vbsl.		// On NEON a a vector select gets lowered to vbsl.
if (ST->hasNEON() && ValTy->isVectorTy() && ISD == ISD::SELECT) {		if (ST->hasNEON() && ValTy->isVectorTy() && ISD == ISD::SELECT) {
// Lowering of some vector selects is currently far from perfect.		// Lowering of some vector selects is currently far from perfect.
static const TypeConversionCostTblEntry NEONVectorSelectTbl[] = {		static const TypeConversionCostTblEntry NEONVectorSelectTbl[] = {
{ ISD::SELECT, MVT::v4i1, MVT::v4i64, 44 + 12 + 1 },		{ ISD::SELECT, MVT::v4i1, MVT::v4i64, 44 + 12 + 1 },
{ ISD::SELECT, MVT::v8i1, MVT::v8i64, 50 },		{ ISD::SELECT, MVT::v8i1, MVT::v8i64, 50 },
{ ISD::SELECT, MVT::v16i1, MVT::v16i64, 100 }		{ ISD::SELECT, MVT::v16i1, MVT::v16i64, 100 }
};		};

EVT SelCondTy = TLI->getValueType(DL, CondTy);		EVT SelCondTy = TLI->getValueType(DL, CondTy);
EVT SelValTy = TLI->getValueType(DL, ValTy);		EVT SelValTy = TLI->getValueType(DL, ValTy);
if (SelCondTy.isSimple() && SelValTy.isSimple()) {		if (SelCondTy.isSimple() && SelValTy.isSimple()) {
if (const auto *Entry = ConvertCostTableLookup(NEONVectorSelectTbl, ISD,		if (const auto *Entry = ConvertCostTableLookup(NEONVectorSelectTbl, ISD,
SelCondTy.getSimpleVT(),		SelCondTy.getSimpleVT(),
SelValTy.getSimpleVT()))		SelValTy.getSimpleVT()))
return Entry->Cost;		return Entry->Cost;
}		}

std::pair<int, MVT> LT = TLI->getTypeLegalizationCost(DL, ValTy);		std::pair<int, MVT> LT = TLI->getTypeLegalizationCost(DL, ValTy);
return LT.first;		return LT.first;
}		}

return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy);		return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
}		}

int ARMTTIImpl::getAddressComputationCost(Type Ty, ScalarEvolution SE,		int ARMTTIImpl::getAddressComputationCost(Type Ty, ScalarEvolution SE,
const SCEV *Ptr) {		const SCEV *Ptr) {
// Address computations in vectorized code with non-consecutive addresses will		// Address computations in vectorized code with non-consecutive addresses will
// likely result in more instructions compared to scalar code where the		// likely result in more instructions compared to scalar code where the
// computation can more often be merged into the index mode. The resulting		// computation can more often be merged into the index mode. The resulting
// extra micro-ops can significantly decrease throughput.		// extra micro-ops can significantly decrease throughput.
▲ Show 20 Lines • Show All 197 Lines • Show Last 20 Lines

lib/Target/PowerPC/PPCTargetTransformInfo.h

Show First 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	int getArithmeticInstrCost(
unsigned Opcode, Type *Ty,		unsigned Opcode, Type *Ty,
TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,
TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,
TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,
TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,
ArrayRef<const Value > Args = ArrayRef<const Value >());		ArrayRef<const Value > Args = ArrayRef<const Value >());
int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);		int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);
int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);		int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);
int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy);		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I = nullptr);
int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);		int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);
int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace);		unsigned AddressSpace);
int getInterleavedMemoryOpCost(unsigned Opcode, Type *VecTy,		int getInterleavedMemoryOpCost(unsigned Opcode, Type *VecTy,
unsigned Factor,		unsigned Factor,
ArrayRef<unsigned> Indices,		ArrayRef<unsigned> Indices,
unsigned Alignment,		unsigned Alignment,
unsigned AddressSpace);		unsigned AddressSpace);

/// @}		/// @}
};		};

} // end namespace llvm		} // end namespace llvm

#endif		#endif

lib/Target/PowerPC/PPCTargetTransformInfo.cpp

	Show First 20 Lines • Show All 302 Lines • ▼ Show 20 Lines
	}			}

	int PPCTTIImpl::getCastInstrCost(unsigned Opcode, Type Dst, Type Src) {			int PPCTTIImpl::getCastInstrCost(unsigned Opcode, Type Dst, Type Src) {
	assert(TLI->InstructionOpcodeToISD(Opcode) && "Invalid opcode");			assert(TLI->InstructionOpcodeToISD(Opcode) && "Invalid opcode");

	return BaseT::getCastInstrCost(Opcode, Dst, Src);			return BaseT::getCastInstrCost(Opcode, Dst, Src);
	}			}

	int PPCTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) {			int PPCTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
	return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy);			const Instruction *I) {
				return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
	}			}

	int PPCTTIImpl::getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) {			int PPCTTIImpl::getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index) {
	assert(Val->isVectorTy() && "This must be a vector type");			assert(Val->isVectorTy() && "This must be a vector type");

	int ISD = TLI->InstructionOpcodeToISD(Opcode);			int ISD = TLI->InstructionOpcodeToISD(Opcode);
	assert(ISD && "Invalid opcode");			assert(ISD && "Invalid opcode");

	▲ Show 20 Lines • Show All 127 Lines • Show Last 20 Lines

lib/Target/SystemZ/SystemZISelLowering.cpp

Show First 20 Lines • Show All 341 Lines • ▼ Show 20 Lines	if (isTypeLegal(VT)) {
setOperationAction(ISD::SETCC, VT, Custom);		setOperationAction(ISD::SETCC, VT, Custom);
}		}
}		}

if (Subtarget.hasVector()) {		if (Subtarget.hasVector()) {
// There should be no need to check for float types other than v2f64		// There should be no need to check for float types other than v2f64
// since <2 x f32> isn't a legal type.		// since <2 x f32> isn't a legal type.
setOperationAction(ISD::FP_TO_SINT, MVT::v2i64, Legal);		setOperationAction(ISD::FP_TO_SINT, MVT::v2i64, Legal);
		setOperationAction(ISD::FP_TO_SINT, MVT::v2f64, Legal);
setOperationAction(ISD::FP_TO_UINT, MVT::v2i64, Legal);		setOperationAction(ISD::FP_TO_UINT, MVT::v2i64, Legal);
		setOperationAction(ISD::FP_TO_UINT, MVT::v2f64, Legal);
setOperationAction(ISD::SINT_TO_FP, MVT::v2i64, Legal);		setOperationAction(ISD::SINT_TO_FP, MVT::v2i64, Legal);
		setOperationAction(ISD::SINT_TO_FP, MVT::v2f64, Legal);
setOperationAction(ISD::UINT_TO_FP, MVT::v2i64, Legal);		setOperationAction(ISD::UINT_TO_FP, MVT::v2i64, Legal);
		setOperationAction(ISD::UINT_TO_FP, MVT::v2f64, Legal);
}		}

// Handle floating-point types.		// Handle floating-point types.
for (unsigned I = MVT::FIRST_FP_VALUETYPE;		for (unsigned I = MVT::FIRST_FP_VALUETYPE;
I <= MVT::LAST_FP_VALUETYPE;		I <= MVT::LAST_FP_VALUETYPE;
++I) {		++I) {
MVT VT = MVT::SimpleValueType(I);		MVT VT = MVT::SimpleValueType(I);
if (isTypeLegal(VT)) {		if (isTypeLegal(VT)) {
▲ Show 20 Lines • Show All 5,983 Lines • Show Last 20 Lines

lib/Target/SystemZ/SystemZTargetTransformInfo.h

Show All 21 Lines	class SystemZTTIImpl : public BasicTTIImplBase<SystemZTTIImpl> {
friend BaseT;		friend BaseT;

const SystemZSubtarget *ST;		const SystemZSubtarget *ST;
const SystemZTargetLowering *TLI;		const SystemZTargetLowering *TLI;

const SystemZSubtarget *getST() const { return ST; }		const SystemZSubtarget *getST() const { return ST; }
const SystemZTargetLowering *getTLI() const { return TLI; }		const SystemZTargetLowering *getTLI() const { return TLI; }

		unsigned const LIBCALL_COST = 30;
		rengolinUnsubmitted Not Done Reply Inline Actions Shouldn't this just be the cost of a call, rather than being a magic constant? rengolin: Shouldn't this just be the cost of a call, rather than being a magic constant?
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Maybe, but that is also a magic constant of 10 :-) Is there any point? This is used just for FRem. jonpa: Maybe, but that is also a magic constant of 10 :-) Is there any point? This is used just for…

public:		public:
explicit SystemZTTIImpl(const SystemZTargetMachine *TM, const Function &F)		explicit SystemZTTIImpl(const SystemZTargetMachine *TM, const Function &F)
: BaseT(TM, F.getParent()->getDataLayout()), ST(TM->getSubtargetImpl(F)),		: BaseT(TM, F.getParent()->getDataLayout()), ST(TM->getSubtargetImpl(F)),
TLI(ST->getTargetLowering()) {}		TLI(ST->getTargetLowering()) {}

/// \name Scalar TTI Implementations		/// \name Scalar TTI Implementations
/// @{		/// @{

Show All 10 Lines	public:
/// @}		/// @}

/// \name Vector TTI Implementations		/// \name Vector TTI Implementations
/// @{		/// @{

unsigned getNumberOfRegisters(bool Vector);		unsigned getNumberOfRegisters(bool Vector);
unsigned getRegisterBitWidth(bool Vector);		unsigned getRegisterBitWidth(bool Vector);

		bool isFPVectorizationPotentiallyUnsafe() { return false; }
		rengolinUnsubmitted Not Done Reply Inline Actions Where is this used? rengolin: Where is this used?

		int getArithmeticInstrCost(
		unsigned Opcode, Type *Ty,
		TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,
		TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,
		TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,
		TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,
		ArrayRef<const Value > Args = ArrayRef<const Value >());
		int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);
		unsigned getVectorTruncCost(Type SrcTy, Type DstTy);
		int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);
		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I = nullptr);
		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
		unsigned AddressSpace);
/// @}		/// @}
};		};

} // end namespace llvm		} // end namespace llvm

#endif		#endif

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp

Show First 20 Lines • Show All 253 Lines • ▼ Show 20 Lines	for (auto &I : *BB) {
if (F->getIntrinsicID() == Intrinsic::memcpy \|\|		if (F->getIntrinsicID() == Intrinsic::memcpy \|\|
F->getIntrinsicID() == Intrinsic::memset)		F->getIntrinsicID() == Intrinsic::memset)
NumStores++;		NumStores++;
} else { // indirect call.		} else { // indirect call.
HasCall = true;		HasCall = true;
}		}
}		}
if (isa<StoreInst>(&I)) {		if (isa<StoreInst>(&I)) {
NumStores++;
Type *MemAccessTy = I.getOperand(0)->getType();		Type *MemAccessTy = I.getOperand(0)->getType();
if((MemAccessTy->isIntegerTy() \|\| MemAccessTy->isFloatingPointTy()) &&		NumStores += getMemoryOpCost(Instruction::Store, MemAccessTy, 0, 0);
(getDataLayout().getTypeSizeInBits(MemAccessTy) == 128))
NumStores++; // 128 bit fp/int stores get split.
}		}
}		}

// The z13 processor will run out of store tags if too many stores		// The z13 processor will run out of store tags if too many stores
		uweigandUnsubmitted Done Reply Inline Actions This seems to partially duplicate the logic in getMemoryOpCost ... is there a way to unify those? uweigand: This seems to partially duplicate the logic in getMemoryOpCost ... is there a way to unify…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Yes - replaced it with a call to getMemoryOpCost() which returns the number of instructions just the same. jonpa: Yes - replaced it with a call to getMemoryOpCost() which returns the number of instructions…
// are fed into it too quickly. Therefore make sure there are not		// are fed into it too quickly. Therefore make sure there are not
// too many stores in the resulting unrolled loop.		// too many stores in the resulting unrolled loop.
unsigned const Max = (NumStores ? (12 / NumStores) : UINT_MAX);		unsigned const Max = (NumStores ? (12 / NumStores) : UINT_MAX);

if (HasCall) {		if (HasCall) {
// Only allow full unrolling if loop has any calls.		// Only allow full unrolling if loop has any calls.
UP.FullUnrollMaxCount = Max;		UP.FullUnrollMaxCount = Max;
UP.MaxCount = 1;		UP.MaxCount = 1;
Show All 29 Lines
unsigned SystemZTTIImpl::getRegisterBitWidth(bool Vector) {		unsigned SystemZTTIImpl::getRegisterBitWidth(bool Vector) {
if (!Vector)		if (!Vector)
return 64;		return 64;
if (ST->hasVector())		if (ST->hasVector())
return 128;		return 128;
return 0;		return 0;
}		}

		int SystemZTTIImpl::getArithmeticInstrCost(
		unsigned Opcode, Type *Ty,
		TTI::OperandValueKind Op1Info, TTI::OperandValueKind Op2Info,
		TTI::OperandValueProperties Opd1PropInfo,
		TTI::OperandValueProperties Opd2PropInfo,
		ArrayRef<const Value *> Args) {

		// TODO: return a good value for BB-VECTORIZER that includes the
		// immediate loads, which we do not want to count for the loop
		// vectorizer, since they are hopefully hoisted out of the loop. This
		// would require a new parameter 'InLoop', but not sure if constant
		// args are common enough to motivate this.

		unsigned ScalarBits = Ty->getScalarSizeInBits();

		if (Ty->isVectorTy()) {
		assert (ST->hasVector() && "getArithmeticInstrCost() called with vector type.");
		unsigned VF = Ty->getVectorNumElements();
		unsigned NumVectors = getNumberOfParts(Ty);

		// These vector operations are custom handled, but are still supported
		// with one instruction per vector, regardless of element size.
		if (Opcode == Instruction::Shl \|\| Opcode == Instruction::LShr \|\|
		Opcode == Instruction::AShr \|\| Opcode == Instruction::Or) {
		return NumVectors;
		uweigandUnsubmitted Not Done Reply Inline Actions Shouldn't we have And and Xor here as well? uweigand: Shouldn't we have And and Xor here as well?
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions No - And and Xor are 'Legal', while the ones handled here are 'Custom'. Only for 'Custom' is this needed, since the default implementation then assumes it is twice as expensive, while it is actually not on z13. jonpa: No - And and Xor are 'Legal', while the ones handled here are 'Custom'. Only for 'Custom' is…
		uweigandUnsubmitted Done Reply Inline Actions I see. However, for vector types, Or is also marked as "Legal" -- only for scalar i64 is it Custom. uweigand: I see. However, for vector types, Or is also marked as "Legal" -- only for scalar i64 is it…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Ahh, thanks. Removed Or. jonpa: Ahh, thanks. Removed Or.
		}

		// These FP operations are supported with a single vector instruction for
		// double (base implementation assumes float generally costs 2). For
		// FP128, the scalar cost is 1, and there is no overhead since the values
		// are already in scalar registers.
		if (Opcode == Instruction::FAdd \|\| Opcode == Instruction::FSub \|\|
		Opcode == Instruction::FMul \|\| Opcode == Instruction::FDiv) {
		switch (ScalarBits) {
		case 32: {
		// Return the cost of multiple scalar invocation plus the cost of
		// inserting and extracting the values.
		unsigned ScalarCost = getArithmeticInstrCost(Opcode, Ty->getScalarType());
		unsigned Cost = (VF * ScalarCost) + getScalarizationOverhead(Ty, Args);
		// FIXME: VF 2 for these FP operations are currently just as
		// expensive as for VF 4.
		if (VF == 2)
		Cost *= 2;
		return Cost;
		}
		case 64:
		case 128:
		return NumVectors;
		default:
		break;
		}
		}

		// There is no native support for FRem.
		if (Opcode == Instruction::FRem) {
		unsigned Cost = (VF * LIBCALL_COST) + getScalarizationOverhead(Ty, Args);
		// FIXME: VF 2 for float is currently just as expensive as for VF 4.
		if (VF == 2 && ScalarBits == 32)
		Cost *= 2;
		return Cost;
		}
		}
		else { // Scalar:
		// These FP operations are supported with a dedicated instruction for
		// float, double and fp128 (base implementation assumes float generally
		// costs 2).
		if (Opcode == Instruction::FAdd \|\| Opcode == Instruction::FSub \|\|
		Opcode == Instruction::FMul \|\| Opcode == Instruction::FDiv)
		return 1;

		// There is no native support for FRem.
		if (Opcode == Instruction::FRem)
		return LIBCALL_COST;

		if (Opcode == Instruction::LShr \|\| Opcode == Instruction::AShr)
		return (ScalarBits >= 32 ? 1 : 2 /ext/);

		// Or requires one instruction, although it has custom handling for i64.
		if (Opcode == Instruction::Or)
		uweigandUnsubmitted Not Done Reply Inline Actions Likewise ... uweigand: Likewise ...
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions (same reason as above) jonpa: (same reason as above)
		return 1;

		// An extra extension for narrow types is needed.
		if ((Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem))
		return (ScalarBits < 32 ? 4 /sext of ops/ : 2);

		if (Opcode == Instruction::UDiv \|\| Opcode == Instruction::URem)
		return (ScalarBits < 32 ? 4 /zext of both ops/ : 3);
		}

		// Fallback to the default implementation.
		return BaseT::getArithmeticInstrCost(Opcode, Ty, Op1Info, Op2Info,
		Opd1PropInfo, Opd2PropInfo, Args);
		}


		int SystemZTTIImpl::getShuffleCost(TTI::ShuffleKind Kind, Type *Tp, int Index,
		Type *SubTp) {
		assert (Tp->isVectorTy());
		assert (ST->hasVector() && "getShuffleCost() called.");
		unsigned NumVectors = getNumberOfParts(Tp);

		// FP128 values are always in scalar registers, so there is no work
		// involved with a shuffle, except for broadcast. In that case a register
		// moves are done with a single instruction per element.
		if (Tp->getScalarType()->isFP128Ty())
		return (Kind == TargetTransformInfo::SK_Broadcast ? NumVectors - 1 : 0);

		switch (Kind) {
		case TargetTransformInfo::SK_ExtractSubvector:
		// ExtractSubvector Index indicates start offset.

		// Extracting a subvector from first index is a noop.
		return (Index == 0 ? 0 : NumVectors);

		case TargetTransformInfo::SK_Broadcast:
		// Loop vectorizer calls here to figure out the extra cost of
		// broadcasting a loaded value to all elements of a vector. Since vlrep
		// loads and replicates with a single instruction, adjust the returned
		// value.
		return NumVectors - 1;

		default:

		// SystemZ supports single instruction permutation / replication.
		return NumVectors;
		}

		return BaseT::getShuffleCost(Kind, Tp, Index, SubTp);
		}

		// Return the log2 difference of the element sizes of the two vector types.
		static unsigned getElSizeLog2Diff(Type Ty0, Type Ty1) {
		unsigned Bits0 = Ty0->getScalarSizeInBits();
		unsigned Bits1 = Ty1->getScalarSizeInBits();

		if (Bits1 > Bits0)
		return (Log2_32(Bits1) - Log2_32(Bits0));

		return (Log2_32(Bits0) - Log2_32(Bits1));
		}

		// Return the number of instructions needed to truncate SrcTy to DstTy.
		unsigned SystemZTTIImpl::
		getVectorTruncCost(Type SrcTy, Type DstTy) {
		assert (SrcTy->isVectorTy() && DstTy->isVectorTy());
		assert (SrcTy->getPrimitiveSizeInBits() > DstTy->getPrimitiveSizeInBits() &&
		"Packing must reduce size of vector type.");
		assert (SrcTy->getVectorNumElements() == DstTy->getVectorNumElements() &&
		"Packing should not change number of elements.");

		unsigned NumParts = getNumberOfParts(SrcTy);
		if (NumParts <= 2)
		// Up to 2 vector registers can be truncated efficiently with pack or
		// permute. The latter requires an immediate mask to be loaded, which
		// typically gets hoisted out of a loop. TODO: return a good value for
		// BB-VECTORIZER that includes the immediate loads, which we do not want
		// to count for the loop vectorizer.
		return 1;

		unsigned Cost = 0;
		unsigned Log2Diff = getElSizeLog2Diff(SrcTy, DstTy);
		unsigned VF = SrcTy->getVectorNumElements();
		for (unsigned P = 0; P < Log2Diff; ++P) {
		if (NumParts > 1)
		NumParts /= 2;
		Cost += NumParts;
		}

		uweigandUnsubmitted Not Done Reply Inline Actions Can this not be handled generically via getElSizeLog2Diff like below for the unpack case? uweigand: Can this not be handled generically via getElSizeLog2Diff like below for the unpack case?
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Computed by means of a loop instead. Special case also handled. jonpa: Computed by means of a loop instead. Special case also handled.
		// Currently, a general mix of permutes and pack instructions is output by
		// isel, which follow the cost computation above except for this case which
		// is one instruction less:
		if (VF == 8 && SrcTy->getScalarSizeInBits() == 64 &&
		DstTy->getScalarSizeInBits() == 8)
		Cost--;

		return Cost;
		}

		int SystemZTTIImpl::getCastInstrCost(unsigned Opcode, Type Dst, Type Src) {

		unsigned DstScalarBits = Dst->getScalarSizeInBits();
		unsigned SrcScalarBits = Src->getScalarSizeInBits();

		if (Src->isVectorTy()) {
		assert (ST->hasVector() && "getCastInstrCost() called with vector type.");
		assert (Dst->isVectorTy());
		unsigned VF = Src->getVectorNumElements();
		unsigned NumDstVectors = getNumberOfParts(Dst);
		unsigned NumSrcVectors = getNumberOfParts(Src);

		if (Opcode == Instruction::Trunc)
		return getVectorTruncCost(Src, Dst);

		if (Opcode == Instruction::ZExt \|\| Opcode == Instruction::SExt) {
		if (SrcScalarBits >= 8) {
		// ZExt/SExt will be handled with one unpack per doubling of width.
		unsigned NumUnpacks = getElSizeLog2Diff(Src, Dst);

		// For types that spans multiple vector registers, some additional
		// instructions are used to setup the unpacking.
		unsigned NumSrcVectorOps =
		(NumUnpacks > 1 ? (NumDstVectors - NumSrcVectors)
		: (NumDstVectors / 2));

		return (NumUnpacks * NumDstVectors) + NumSrcVectorOps;
		}
		else if (SrcScalarBits == 1) {
		// FIXME: i1 isn't optimally treated.
		// These values reflect the current handling of i1 for sext/zext.
		if (Opcode == Instruction::SExt) {
		static const CostTblEntry SextCostTable[] = {
		{ ISD::SIGN_EXTEND, MVT::v2i8, 3},
		{ ISD::SIGN_EXTEND, MVT::v2i16, 3},
		uweigandUnsubmitted Not Done Reply Inline Actions These tables look odd ... but I guess if they accurately reflect the cost of the code that is currently being generated, this is fine with me until we improve codegen. uweigand: These tables look odd ... but I guess if they accurately reflect the cost of the code that is…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Yes - that was the intent. jonpa: Yes - that was the intent.
		{ ISD::SIGN_EXTEND, MVT::v2i32, 3},
		{ ISD::SIGN_EXTEND, MVT::v2i64, 2},
		{ ISD::SIGN_EXTEND, MVT::v4i8, 3},
		{ ISD::SIGN_EXTEND, MVT::v4i16, 3},
		{ ISD::SIGN_EXTEND, MVT::v4i32, 2},
		{ ISD::SIGN_EXTEND, MVT::v4i64, 6},
		{ ISD::SIGN_EXTEND, MVT::v8i8, 3},
		{ ISD::SIGN_EXTEND, MVT::v8i16, 2},
		{ ISD::SIGN_EXTEND, MVT::v8i32, 6},
		{ ISD::SIGN_EXTEND, MVT::v8i64, 13},
		{ ISD::SIGN_EXTEND, MVT::v16i8, 2},
		{ ISD::SIGN_EXTEND, MVT::v16i16, 6},
		{ ISD::SIGN_EXTEND, MVT::v16i32, 12},
		{ ISD::SIGN_EXTEND, MVT::v16i64, 23},
		};
		MVT MTy = TLI->getValueType(DL, Dst).getSimpleVT();
		if (const auto *Entry =
		CostTableLookup(SextCostTable, ISD::SIGN_EXTEND, MTy))
		return Entry->Cost;
		}
		else { // ZExt
		static const CostTblEntry ZextCostTable[] = {
		{ ISD::ZERO_EXTEND, MVT::v2i8, 2},
		{ ISD::ZERO_EXTEND, MVT::v2i16, 2},
		{ ISD::ZERO_EXTEND, MVT::v2i32, 2},
		{ ISD::ZERO_EXTEND, MVT::v2i64, 1},
		{ ISD::ZERO_EXTEND, MVT::v4i8, 2},
		{ ISD::ZERO_EXTEND, MVT::v4i16, 2},
		{ ISD::ZERO_EXTEND, MVT::v4i32, 1},
		{ ISD::ZERO_EXTEND, MVT::v4i64, 4},
		{ ISD::ZERO_EXTEND, MVT::v8i8, 2},
		{ ISD::ZERO_EXTEND, MVT::v8i16, 1},
		{ ISD::ZERO_EXTEND, MVT::v8i32, 4},
		{ ISD::ZERO_EXTEND, MVT::v8i64, 12},
		{ ISD::ZERO_EXTEND, MVT::v16i8, 1},
		{ ISD::ZERO_EXTEND, MVT::v16i16, 4},
		{ ISD::ZERO_EXTEND, MVT::v16i32, 12},
		{ ISD::ZERO_EXTEND, MVT::v16i64, 32},
		};
		MVT MTy = TLI->getValueType(DL, Dst).getSimpleVT();
		if (const auto *Entry =
		CostTableLookup(ZextCostTable, ISD::ZERO_EXTEND, MTy))
		return Entry->Cost;
		}
		}
		}

		if (Opcode == Instruction::SIToFP \|\| Opcode == Instruction::UIToFP \|\|
		Opcode == Instruction::FPToSI \|\| Opcode == Instruction::FPToUI) {
		// TODO: Fix base implementation which could simplify things a bit here
		// (seems to miss on differentiating on scalar/vector types).

		// Only 64 bit vector conversions are natively supported.
		if (SrcScalarBits == 64 && DstScalarBits == 64)
		return NumDstVectors;

		// Return the cost of multiple scalar invocation plus the cost of
		// inserting and extracting the values. Base implementation does not
		// realize float->int gets scalarized.
		unsigned ScalarCost = getCastInstrCost(Opcode, Dst->getScalarType(),
		Src->getScalarType());
		unsigned TotCost = VF * ScalarCost;
		bool NeedsInserts = true, NeedsExtracts = true;
		// FP128 registers do not get inserted or extracted.
		if (DstScalarBits == 128 &&
		(Opcode == Instruction::SIToFP \|\| Opcode == Instruction::UIToFP))
		NeedsInserts = false;
		if (SrcScalarBits == 128 &&
		(Opcode == Instruction::FPToSI \|\| Opcode == Instruction::FPToUI))
		NeedsExtracts = false;

		TotCost += getScalarizationOverhead(Dst, NeedsInserts, NeedsExtracts);

		// FIXME: VF 2 for float<->i32 is currently just as expensive as for VF 4.
		if (VF == 2 && SrcScalarBits == 32 && DstScalarBits == 32)
		TotCost *= 2;

		return TotCost;
		}

		if (Opcode == Instruction::FPTrunc) {
		if (SrcScalarBits == 128) // fp128 -> double/float + inserts of elements.
		return VF /ldxbr/lexbr/ + getScalarizationOverhead(Dst, true, false);
		else // double -> float
		return VF / 2 /vledb/ + std::max(1U, VF / 4 /vperm/);
		}

		if (Opcode == Instruction::FPExt) {
		if (SrcScalarBits == 32 && DstScalarBits == 64) {
		// float -> double is very rare and currently unoptimized. Instead of
		uweigandUnsubmitted Not Done Reply Inline Actions When do we get libcalls? At least with z196 and above this should never happen, so we might want to take the ISA level into account here. uweigand: When do we get libcalls? At least with z196 and above this should never happen, so we might…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions This was for i128, which shouldn't be there I suppose, so I removed it. You get a libcall if you generate a function like that, but I have not seen it in benchmarks. jonpa: This was for i128, which shouldn't be there I suppose, so I removed it. You get a libcall if…
		// using vldeb, which can do two at a time, all conversions are
		// scalarized.
		return VF * 2;
		}
		// -> fp128. VF * lxdb/lxeb + extraction of elements.
		return VF + getScalarizationOverhead(Src, false, true);
		}
		}
		else { // Scalar
		assert (!Dst->isVectorTy());

		if (Opcode == Instruction::SIToFP \|\| Opcode == Instruction::UIToFP)
		return (SrcScalarBits >= 32 ? 1 : 2 /i8/i16 extend/);

		if (Opcode == Instruction::SExt && Src->isIntegerTy(1))
		// nilf/risbgn + lcr/lcgr
		return 2;
		}

		return BaseT::getCastInstrCost(Opcode, Dst, Src);
		}

		static Type ToVectorTy(Type T, unsigned VF) {
		if (!T->isVectorTy() && VF > 1)
		return VectorType::get(T, VF);
		return T;
		}

		int SystemZTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I) {

		// Hand over to common code if it's a compare for branch.
		if (I != nullptr && I->hasOneUse() &&
		isa<BranchInst>(I->use_begin()->getUser()))
		return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, nullptr);

		if (ValTy->isVectorTy()) {
		assert (ST->hasVector() && "getCmpSelInstrCost() called with vector type.");
		unsigned VF = ValTy->getVectorNumElements();

		// Called with a compare instruction.
		if (Opcode == Instruction::ICmp \|\| Opcode == Instruction::FCmp) {
		Type *SelectedTy = nullptr;
		unsigned PredicateExtraCost = 0;
		if (I != nullptr) {
		assert (isa<CmpInst>(I));
		if (I->hasOneUse()) { // FIXME: Need to handle several users?
		if (SelectInst *SI = dyn_cast<SelectInst>(I->use_begin()->getUser()))
		SelectedTy = ToVectorTy(SI->getType(), VF);
		}

		// Some predicates cost one or two extra instructions.
		switch (dyn_cast<CmpInst>(I)->getPredicate()) {
		case CmpInst::Predicate::ICMP_NE:
		case CmpInst::Predicate::ICMP_UGE:
		case CmpInst::Predicate::ICMP_ULE:
		case CmpInst::Predicate::ICMP_SGE:
		case CmpInst::Predicate::ICMP_SLE:
		PredicateExtraCost = 1;
		break;
		case CmpInst::Predicate::FCMP_ONE:
		case CmpInst::Predicate::FCMP_ORD:
		case CmpInst::Predicate::FCMP_UEQ:
		case CmpInst::Predicate::FCMP_UNO:
		PredicateExtraCost = 2;
		break;
		default:
		break;
		}
		}

		// Float is handled with 2vmr[lh]f + 2vldeb + vfchdb for each pair of
		// floats. FIXME: <2 x float> generates same code as <4 x float>.
		unsigned CmpCostPerVector = (ValTy->getScalarType()->isFloatTy() ? 10 : 1);
		unsigned NumVecs_cmp = getNumberOfParts(ValTy);
		unsigned NumVecs_sel = (SelectedTy != nullptr ?
		getNumberOfParts(SelectedTy) : 1);

		// If the vector select is split, one compare will be done for each part.
		unsigned Cost = (std::max(NumVecs_cmp, NumVecs_sel) *
		(CmpCostPerVector + PredicateExtraCost));

		// In case the select gets split, and the compared element type is
		// smaller than the selected one, extra instructions are needed to move
		// the values into the operands for the compares.
		if (SelectedTy != nullptr && NumVecs_sel > 1 && NumVecs_cmp < NumVecs_sel) {
		Cost += NumVecs_sel;
		unsigned Log2Diff = getElSizeLog2Diff(ValTy, SelectedTy);
		if (NumVecs_sel >= 4 && Log2Diff > 1)
		Cost += NumVecs_sel / 2;
		if (NumVecs_sel >= 8 && Log2Diff > 2)
		Cost += NumVecs_sel / 4;
		}

		return Cost;
		}
		rengolinUnsubmitted Not Done Reply Inline Actions Code style, try clang-format. rengolin: Code style, try clang-format.
		else { // Called with a select instruction.
		assert (Opcode == Instruction::Select);

		unsigned NumVecs_sel = getNumberOfParts(ValTy);

		// We can figure out the extra cost of packing / unpacking if the
		// instruction was passed and the compare instruction is found.
		unsigned PackCost = 0;
		if (I != nullptr) {
		assert (isa<SelectInst>(I));

		Type *ComparedTy = nullptr;
		if (CmpInst *CI = dyn_cast<CmpInst>(I->getOperand(0)))
		ComparedTy = ToVectorTy(CI->getOperand(0)->getType(), VF);

		if (ComparedTy != nullptr) {
		unsigned SelScalarBits = ValTy->getScalarSizeInBits();
		unsigned CmpScalarBits = ComparedTy->getScalarSizeInBits();
		unsigned Log2Diff = getElSizeLog2Diff(ValTy, ComparedTy);
		if (CmpScalarBits > SelScalarBits)
		uweigandUnsubmitted Done Reply Inline Actions This needs more explanation why this adjustment is needed in this case (and only in this case). uweigand: This needs more explanation why this adjustment is needed in this case (and only in this case).
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions I refactored the cost computation for a vector truncation into a new function, used both for vector truncate and vector select, where the selected element type is smaller than the compared elements. jonpa: I refactored the cost computation for a vector truncation into a new function, used both for…
		// The bitmask will be truncated.
		PackCost = getVectorTruncCost(ComparedTy, ValTy);
		else if (SelScalarBits > CmpScalarBits)
		// Each vector select needs its part of the bitmask unpacked.
		PackCost = Log2Diff * NumVecs_sel;
		}
		}

		return NumVecs_sel /vsel/ + PackCost;
		}
		}
		else { // Scalar
		switch (Opcode) {
		case Instruction::ICmp: {
		unsigned Cost = 1;
		if (ValTy->getScalarSizeInBits() <= 16)
		Cost += 2; // extend both operands
		return Cost;
		}
		case Instruction::Select:
		if (ValTy->isFloatingPointTy())
		return 4; // No load on condition for FP, so this costs a conditional jump.
		return 1; // Load On Condition.
		}
		}

		return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, nullptr);
		uweigandUnsubmitted Not Done Reply Inline Actions Why just index == 0 ? Shouldn't the penalty apply to any element? uweigand: Why just index == 0 ? Shouldn't the penalty apply to any element?
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions While the extraction of any element still has a modeled cost of 1 per element, my idea with this is to penalize further the delay of the act of moving out of the vector pipeline. This would then not be per element, but rather for the whole vector register (thus added only once, at index 0). My assumption is that this happens at the point of scalarization, when all vector elements are extracted. jonpa: While the extraction of any element still has a modeled cost of 1 per element, my idea with…
		uweigandUnsubmitted Not Done Reply Inline Actions Why would this not be per element? Moving from the vector to the integer pipeline always has the higher latency, for every element. In the end any such extracton gets implemented using a VLVG* instruction, which simply is more expensive than other vector instructions ... (Note that in the scheduler, the higher latency for VLVG* is already modeled correctly.) uweigand: Why would this not be per element? Moving from the vector to the integer pipeline always has…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions I agree that this should be experimented further with, but the practical argument right now is that I added the smallest possible extra penalty to get rid of a particular regression while affecting code as little as possible. I consider this to be just a first step with obvious improvements in sight for the vectorizers decisions. I thought the cost here was mainly that the vector register has to pass through the whole vector pipeline. I also imagine the VLGV* instructions would be pipelined, so that the cost of them is not a linear sum in the number of elements? Or is it really twice as expensive to scalarize a <4 x i32> rather than a <2 x i64>? jonpa: I agree that this should be experimented further with, but the practical argument right now is…
		}

		int SystemZTTIImpl::getMemoryOpCost(unsigned Opcode, Type *Src,
		unsigned Alignment, unsigned AddressSpace) {
		assert(!Src->isVoidTy() && "Invalid type");

		unsigned NumOps = getNumberOfParts(Src);

		if (Src->getScalarType()->isFP128Ty())
		// FP128 is held in a pair of two 64 bit fp registers.
		NumOps *= 2;

		uweigandUnsubmitted Not Done Reply Inline Actions The original code in getUnrollingPreferences also used 2 for 128-bit integer types. Should this be done now here as well? uweigand: The original code in getUnrollingPreferences also used 2 for 128-bit integer types. Should…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions That's never seen by the LoopVectorizer, but I think you are right that it should be handled here since it's actually handled by the backend. Thanks. Changed to check for scalars of 128 bits. jonpa: That's never seen by the LoopVectorizer, but I think you are right that it should be handled…
		return NumOps;
		}

lib/Target/X86/X86TargetTransformInfo.h

Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	int getArithmeticInstrCost(
unsigned Opcode, Type *Ty,		unsigned Opcode, Type *Ty,
TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd1Info = TTI::OK_AnyValue,
TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,		TTI::OperandValueKind Opd2Info = TTI::OK_AnyValue,
TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd1PropInfo = TTI::OP_None,
TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,		TTI::OperandValueProperties Opd2PropInfo = TTI::OP_None,
ArrayRef<const Value > Args = ArrayRef<const Value >());		ArrayRef<const Value > Args = ArrayRef<const Value >());
int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);		int getShuffleCost(TTI::ShuffleKind Kind, Type Tp, int Index, Type SubTp);
int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);		int getCastInstrCost(unsigned Opcode, Type Dst, Type Src);
int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy);		int getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I = nullptr);
int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);		int getVectorInstrCost(unsigned Opcode, Type *Val, unsigned Index);
int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace);		unsigned AddressSpace);
int getMaskedMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,		int getMaskedMemoryOpCost(unsigned Opcode, Type *Src, unsigned Alignment,
unsigned AddressSpace);		unsigned AddressSpace);
int getGatherScatterOpCost(unsigned Opcode, Type DataTy, Value Ptr,		int getGatherScatterOpCost(unsigned Opcode, Type DataTy, Value Ptr,
bool VariableMask, unsigned Alignment);		bool VariableMask, unsigned Alignment);
int getAddressComputationCost(Type PtrTy, ScalarEvolution SE,		int getAddressComputationCost(Type PtrTy, ScalarEvolution SE,
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

lib/Target/X86/X86TargetTransformInfo.cpp

Show First 20 Lines • Show All 1,294 Lines • ▼ Show 20 Lines	if (const auto *Entry = ConvertCostTableLookup(SSE2ConversionTbl, ISD,
DstTy.getSimpleVT(),		DstTy.getSimpleVT(),
SrcTy.getSimpleVT()))		SrcTy.getSimpleVT()))
return Entry->Cost;		return Entry->Cost;
}		}

return BaseT::getCastInstrCost(Opcode, Dst, Src);		return BaseT::getCastInstrCost(Opcode, Dst, Src);
}		}

int X86TTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy) {		int X86TTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
		const Instruction *I) {
// Legalize the type.		// Legalize the type.
std::pair<int, MVT> LT = TLI->getTypeLegalizationCost(DL, ValTy);		std::pair<int, MVT> LT = TLI->getTypeLegalizationCost(DL, ValTy);

MVT MTy = LT.second;		MVT MTy = LT.second;

int ISD = TLI->InstructionOpcodeToISD(Opcode);		int ISD = TLI->InstructionOpcodeToISD(Opcode);
assert(ISD && "Invalid opcode");		assert(ISD && "Invalid opcode");

▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	int X86TTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
if (ST->hasSSE42())		if (ST->hasSSE42())
if (const auto *Entry = CostTableLookup(SSE42CostTbl, ISD, MTy))		if (const auto *Entry = CostTableLookup(SSE42CostTbl, ISD, MTy))
return LT.first * Entry->Cost;		return LT.first * Entry->Cost;

if (ST->hasSSE2())		if (ST->hasSSE2())
if (const auto *Entry = CostTableLookup(SSE2CostTbl, ISD, MTy))		if (const auto *Entry = CostTableLookup(SSE2CostTbl, ISD, MTy))
return LT.first * Entry->Cost;		return LT.first * Entry->Cost;

return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy);		return BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, I);
}		}

int X86TTIImpl::getIntrinsicInstrCost(Intrinsic::ID IID, Type *RetTy,		int X86TTIImpl::getIntrinsicInstrCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<Type *> Tys, FastMathFlags FMF) {		ArrayRef<Type *> Tys, FastMathFlags FMF) {
// Costs should match the codegen from:		// Costs should match the codegen from:
// BITREVERSE: llvm\test\CodeGen\X86\vector-bitreverse.ll		// BITREVERSE: llvm\test\CodeGen\X86\vector-bitreverse.ll
// BSWAP: llvm\test\CodeGen\X86\bswap-vector.ll		// BSWAP: llvm\test\CodeGen\X86\bswap-vector.ll
// CTLZ: llvm\test\CodeGen\X86\vector-lzcnt-*.ll		// CTLZ: llvm\test\CodeGen\X86\vector-lzcnt-*.ll
▲ Show 20 Lines • Show All 885 Lines • Show Last 20 Lines

lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,218 Lines • ▼ Show 20 Lines	unsigned LoopVectorizationCostModel::getInstructionCost(Instruction *I,
case Instruction::Select: {		case Instruction::Select: {
SelectInst *SI = cast<SelectInst>(I);		SelectInst *SI = cast<SelectInst>(I);
const SCEV *CondSCEV = SE->getSCEV(SI->getCondition());		const SCEV *CondSCEV = SE->getSCEV(SI->getCondition());
bool ScalarCond = (SE->isLoopInvariant(CondSCEV, TheLoop));		bool ScalarCond = (SE->isLoopInvariant(CondSCEV, TheLoop));
Type *CondTy = SI->getCondition()->getType();		Type *CondTy = SI->getCondition()->getType();
if (!ScalarCond)		if (!ScalarCond)
CondTy = VectorType::get(CondTy, VF);		CondTy = VectorType::get(CondTy, VF);

return TTI.getCmpSelInstrCost(I->getOpcode(), VectorTy, CondTy);		return TTI.getCmpSelInstrCost(I->getOpcode(), VectorTy, CondTy, I);
}		}
case Instruction::ICmp:		case Instruction::ICmp:
case Instruction::FCmp: {		case Instruction::FCmp: {
		// If this is the loop-latch compare for the back branch, just add the
		// scalar value. Should this check be done in caller instead?
		bool LikelyVectorized = true;
		if (I->hasOneUse()) {
		if (BranchInst *BI = dyn_cast<BranchInst>(I->use_begin()->getUser())) {
		if (BI->getParent() == TheLoop->getLoopLatch())
		LikelyVectorized = false;
		}
		}
		mssimpsoUnsubmitted Not Done Reply Inline Actions Hi, I've only looked at the vectorizer change here, but this code is not needed. Before computing costs, we collect the uniform values in collectLoopUniforms(). ICmp instructions of the kind here are marked uniform. Then in getInstructionCost(), we check if an instruction is uniform (isUniformAfterVectorization()), and if so, always return "1" for the cost, regardless of VF. Also, floating-point induction variables aren't allowed to be "primary" induction variables. So it shouldn't be the case that you would have a FCmp feeding the back edge branch. mssimpso: Hi, I've only looked at the vectorizer change here, but this code is not needed. Before…
		mssimpsoUnsubmitted Done Reply Inline Actions Correction: we always return the cost of the scalar compare if it is uniform, regardless of VF. mssimpso: Correction: we always return the cost of the scalar compare if it is uniform, regardless of…
		jonpaAuthorUnsubmitted Not Done Reply Inline Actions Thanks for explaining - I removed this from patch. jonpa: Thanks for explaining - I removed this from patch.
Type *ValTy = I->getOperand(0)->getType();		Type *ValTy = I->getOperand(0)->getType();
Instruction *Op0AsInstruction = dyn_cast<Instruction>(I->getOperand(0));		Instruction *Op0AsInstruction = dyn_cast<Instruction>(I->getOperand(0));
if (canTruncateToMinimalBitwidth(Op0AsInstruction, VF))		if (canTruncateToMinimalBitwidth(Op0AsInstruction, VF))
ValTy = IntegerType::get(ValTy->getContext(), MinBWs[Op0AsInstruction]);		ValTy = IntegerType::get(ValTy->getContext(), MinBWs[Op0AsInstruction]);

		if (LikelyVectorized)
VectorTy = ToVectorTy(ValTy, VF);		VectorTy = ToVectorTy(ValTy, VF);
return TTI.getCmpSelInstrCost(I->getOpcode(), VectorTy);		return TTI.getCmpSelInstrCost(I->getOpcode(), VectorTy, nullptr, I);
}		}
case Instruction::Store:		case Instruction::Store:
case Instruction::Load: {		case Instruction::Load: {
VectorTy = ToVectorTy(getMemInstValueType(I), VF);		VectorTy = ToVectorTy(getMemInstValueType(I), VF);
return getMemoryInstructionCost(I, VF);		return getMemoryInstructionCost(I, VF);
}		}
case Instruction::ZExt:		case Instruction::ZExt:
case Instruction::SExt:		case Instruction::SExt:
▲ Show 20 Lines • Show All 570 Lines • Show Last 20 Lines

test/Analysis/CostModel/SystemZ/cmpsel.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s
				;
				; Note: Cost estimates of select of a fp-type is somewhat arbitrary, since it
				; involves a conditional jump.
				; Note: Vector fp32 is not directly supported, and not quite exact in
				; estimates (but it is big absolute values).

				define i8 @fun0(i8 %val1, i8 %val2,
				i8 %val3, i8 %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun1(i8 %val1, i8 %val2,
				i16 %val3, i16 %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun2(i8 %val1, i8 %val2,
				i32 %val3, i32 %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun3(i8 %val1, i8 %val2,
				i64 %val3, i64 %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun4(i8 %val1, i8 %val2,
				float %val3, float %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun5(i8 %val1, i8 %val2,
				double %val3, double %val4) {
				%cmp = icmp eq i8 %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i8 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define i8 @fun6(i16 %val1, i16 %val2,
				i8 %val3, i8 %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun7(i16 %val1, i16 %val2,
				i16 %val3, i16 %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun8(i16 %val1, i16 %val2,
				i32 %val3, i32 %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun9(i16 %val1, i16 %val2,
				i64 %val3, i64 %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun10(i16 %val1, i16 %val2,
				float %val3, float %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun11(i16 %val1, i16 %val2,
				double %val3, double %val4) {
				%cmp = icmp eq i16 %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %cmp = icmp eq i16 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define i8 @fun12(i32 %val1, i32 %val2,
				i8 %val3, i8 %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun13(i32 %val1, i32 %val2,
				i16 %val3, i16 %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun14(i32 %val1, i32 %val2,
				i32 %val3, i32 %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun15(i32 %val1, i32 %val2,
				i64 %val3, i64 %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun16(i32 %val1, i32 %val2,
				float %val3, float %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun17(i32 %val1, i32 %val2,
				double %val3, double %val4) {
				%cmp = icmp eq i32 %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i32 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define i8 @fun18(i64 %val1, i64 %val2,
				i8 %val3, i8 %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun19(i64 %val1, i64 %val2,
				i16 %val3, i16 %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun20(i64 %val1, i64 %val2,
				i32 %val3, i32 %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun21(i64 %val1, i64 %val2,
				i64 %val3, i64 %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun22(i64 %val1, i64 %val2,
				float %val3, float %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun23(i64 %val1, i64 %val2,
				double %val3, double %val4) {
				%cmp = icmp eq i64 %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq i64 %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define <2 x i8> @fun24(<2 x i8> %val1, <2 x i8> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun25(<2 x i8> %val1, <2 x i8> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun26(<2 x i8> %val1, <2 x i8> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun27(<2 x i8> %val1, <2 x i8> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun28(<2 x i8> %val1, <2 x i8> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun29(<2 x i8> %val1, <2 x i8> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = icmp eq <2 x i8> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <2 x i8> @fun30(<2 x i16> %val1, <2 x i16> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun31(<2 x i16> %val1, <2 x i16> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun32(<2 x i16> %val1, <2 x i16> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun33(<2 x i16> %val1, <2 x i16> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun34(<2 x i16> %val1, <2 x i16> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun35(<2 x i16> %val1, <2 x i16> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = icmp eq <2 x i16> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <2 x i8> @fun36(<2 x i32> %val1, <2 x i32> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun37(<2 x i32> %val1, <2 x i32> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun38(<2 x i32> %val1, <2 x i32> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun39(<2 x i32> %val1, <2 x i32> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun40(<2 x i32> %val1, <2 x i32> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun41(<2 x i32> %val1, <2 x i32> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = icmp eq <2 x i32> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <2 x i8> @fun42(<2 x i64> %val1, <2 x i64> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun43(<2 x i64> %val1, <2 x i64> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun44(<2 x i64> %val1, <2 x i64> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun45(<2 x i64> %val1, <2 x i64> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun46(<2 x i64> %val1, <2 x i64> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun47(<2 x i64> %val1, <2 x i64> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = icmp eq <2 x i64> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <2 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <4 x i8> @fun48(<4 x i8> %val1, <4 x i8> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun49(<4 x i8> %val1, <4 x i8> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun50(<4 x i8> %val1, <4 x i8> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun51(<4 x i8> %val1, <4 x i8> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun52(<4 x i8> %val1, <4 x i8> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun53(<4 x i8> %val1, <4 x i8> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = icmp eq <4 x i8> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <4 x i8> @fun54(<4 x i16> %val1, <4 x i16> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun55(<4 x i16> %val1, <4 x i16> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun56(<4 x i16> %val1, <4 x i16> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun57(<4 x i16> %val1, <4 x i16> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun58(<4 x i16> %val1, <4 x i16> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun59(<4 x i16> %val1, <4 x i16> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = icmp eq <4 x i16> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <4 x i8> @fun60(<4 x i32> %val1, <4 x i32> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun61(<4 x i32> %val1, <4 x i32> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun62(<4 x i32> %val1, <4 x i32> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun63(<4 x i32> %val1, <4 x i32> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun64(<4 x i32> %val1, <4 x i32> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun65(<4 x i32> %val1, <4 x i32> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = icmp eq <4 x i32> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <4 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <4 x i8> @fun66(<4 x i64> %val1, <4 x i64> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun67(<4 x i64> %val1, <4 x i64> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun68(<4 x i64> %val1, <4 x i64> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun69(<4 x i64> %val1, <4 x i64> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun70(<4 x i64> %val1, <4 x i64> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun71(<4 x i64> %val1, <4 x i64> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = icmp eq <4 x i64> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <4 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <8 x i8> @fun72(<8 x i8> %val1, <8 x i8> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun73(<8 x i8> %val1, <8 x i8> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun74(<8 x i8> %val1, <8 x i8> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun75(<8 x i8> %val1, <8 x i8> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun76(<8 x i8> %val1, <8 x i8> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun77(<8 x i8> %val1, <8 x i8> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = icmp eq <8 x i8> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <8 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <8 x i8> @fun78(<8 x i16> %val1, <8 x i16> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun79(<8 x i16> %val1, <8 x i16> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun80(<8 x i16> %val1, <8 x i16> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun81(<8 x i16> %val1, <8 x i16> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun82(<8 x i16> %val1, <8 x i16> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun83(<8 x i16> %val1, <8 x i16> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = icmp eq <8 x i16> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <8 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <8 x i8> @fun84(<8 x i32> %val1, <8 x i32> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun85(<8 x i32> %val1, <8 x i32> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun86(<8 x i32> %val1, <8 x i32> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun87(<8 x i32> %val1, <8 x i32> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun88(<8 x i32> %val1, <8 x i32> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun89(<8 x i32> %val1, <8 x i32> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = icmp eq <8 x i32> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <8 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <8 x i8> @fun90(<8 x i64> %val1, <8 x i64> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun91(<8 x i64> %val1, <8 x i64> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun92(<8 x i64> %val1, <8 x i64> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun93(<8 x i64> %val1, <8 x i64> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun94(<8 x i64> %val1, <8 x i64> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun95(<8 x i64> %val1, <8 x i64> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = icmp eq <8 x i64> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <8 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <16 x i8> @fun96(<16 x i8> %val1, <16 x i8> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun97(<16 x i8> %val1, <16 x i8> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun98(<16 x i8> %val1, <16 x i8> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun99(<16 x i8> %val1, <16 x i8> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun100(<16 x i8> %val1, <16 x i8> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun101(<16 x i8> %val1, <16 x i8> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = icmp eq <16 x i8> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %cmp = icmp eq <16 x i8> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

				define <16 x i8> @fun102(<16 x i16> %val1, <16 x i16> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun103(<16 x i16> %val1, <16 x i16> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun104(<16 x i16> %val1, <16 x i16> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun105(<16 x i16> %val1, <16 x i16> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun106(<16 x i16> %val1, <16 x i16> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun107(<16 x i16> %val1, <16 x i16> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = icmp eq <16 x i16> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = icmp eq <16 x i16> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

				define <16 x i8> @fun108(<16 x i32> %val1, <16 x i32> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun109(<16 x i32> %val1, <16 x i32> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun110(<16 x i32> %val1, <16 x i32> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun111(<16 x i32> %val1, <16 x i32> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun112(<16 x i32> %val1, <16 x i32> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun113(<16 x i32> %val1, <16 x i32> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = icmp eq <16 x i32> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %cmp = icmp eq <16 x i32> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

				define <16 x i8> @fun114(<16 x i64> %val1, <16 x i64> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun115(<16 x i64> %val1, <16 x i64> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun116(<16 x i64> %val1, <16 x i64> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun117(<16 x i64> %val1, <16 x i64> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun118(<16 x i64> %val1, <16 x i64> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun119(<16 x i64> %val1, <16 x i64> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = icmp eq <16 x i64> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = icmp eq <16 x i64> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

				define i8 @fun120(float %val1, float %val2,
				i8 %val3, i8 %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun121(float %val1, float %val2,
				i16 %val3, i16 %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun122(float %val1, float %val2,
				i32 %val3, i32 %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun123(float %val1, float %val2,
				i64 %val3, i64 %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun124(float %val1, float %val2,
				float %val3, float %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun125(float %val1, float %val2,
				double %val3, double %val4) {
				%cmp = fcmp ogt float %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt float %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define i8 @fun126(double %val1, double %val2,
				i8 %val3, i8 %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, i8 %val3, i8 %val4
				ret i8 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i8 %val3, i8 %val4
				}

				define i16 @fun127(double %val1, double %val2,
				i16 %val3, i16 %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, i16 %val3, i16 %val4
				ret i16 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i16 %val3, i16 %val4
				}

				define i32 @fun128(double %val1, double %val2,
				i32 %val3, i32 %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, i32 %val3, i32 %val4
				ret i32 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i32 %val3, i32 %val4
				}

				define i64 @fun129(double %val1, double %val2,
				i64 %val3, i64 %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, i64 %val3, i64 %val4
				ret i64 %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select i1 %cmp, i64 %val3, i64 %val4
				}

				define float @fun130(double %val1, double %val2,
				float %val3, float %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, float %val3, float %val4
				ret float %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, float %val3, float %val4
				}

				define double @fun131(double %val1, double %val2,
				double %val3, double %val4) {
				%cmp = fcmp ogt double %val1, %val2
				%sel = select i1 %cmp, double %val3, double %val4
				ret double %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt double %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select i1 %cmp, double %val3, double %val4
				}

				define <2 x i8> @fun132(<2 x float> %val1, <2 x float> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun133(<2 x float> %val1, <2 x float> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun134(<2 x float> %val1, <2 x float> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun135(<2 x float> %val1, <2 x float> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun136(<2 x float> %val1, <2 x float> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun137(<2 x float> %val1, <2 x float> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = fcmp ogt <2 x float> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <2 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <2 x i8> @fun138(<2 x double> %val1, <2 x double> %val2,
				<2 x i8> %val3, <2 x i8> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				ret <2 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i8> %val3, <2 x i8> %val4
				}

				define <2 x i16> @fun139(<2 x double> %val1, <2 x double> %val2,
				<2 x i16> %val3, <2 x i16> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				ret <2 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i16> %val3, <2 x i16> %val4
				}

				define <2 x i32> @fun140(<2 x double> %val1, <2 x double> %val2,
				<2 x i32> %val3, <2 x i32> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				ret <2 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x i32> %val3, <2 x i32> %val4
				}

				define <2 x i64> @fun141(<2 x double> %val1, <2 x double> %val2,
				<2 x i64> %val3, <2 x i64> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				ret <2 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x i64> %val3, <2 x i64> %val4
				}

				define <2 x float> @fun142(<2 x double> %val1, <2 x double> %val2,
				<2 x float> %val3, <2 x float> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				ret <2 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <2 x i1> %cmp, <2 x float> %val3, <2 x float> %val4
				}

				define <2 x double> @fun143(<2 x double> %val1, <2 x double> %val2,
				<2 x double> %val3, <2 x double> %val4) {
				%cmp = fcmp ogt <2 x double> %val1, %val2
				%sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				ret <2 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %cmp = fcmp ogt <2 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <2 x i1> %cmp, <2 x double> %val3, <2 x double> %val4
				}

				define <4 x i8> @fun144(<4 x float> %val1, <4 x float> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun145(<4 x float> %val1, <4 x float> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun146(<4 x float> %val1, <4 x float> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun147(<4 x float> %val1, <4 x float> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun148(<4 x float> %val1, <4 x float> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun149(<4 x float> %val1, <4 x float> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = fcmp ogt <4 x float> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %cmp = fcmp ogt <4 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <4 x i8> @fun150(<4 x double> %val1, <4 x double> %val2,
				<4 x i8> %val3, <4 x i8> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				ret <4 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i8> %val3, <4 x i8> %val4
				}

				define <4 x i16> @fun151(<4 x double> %val1, <4 x double> %val2,
				<4 x i16> %val3, <4 x i16> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				ret <4 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i16> %val3, <4 x i16> %val4
				}

				define <4 x i32> @fun152(<4 x double> %val1, <4 x double> %val2,
				<4 x i32> %val3, <4 x i32> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				ret <4 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i32> %val3, <4 x i32> %val4
				}

				define <4 x i64> @fun153(<4 x double> %val1, <4 x double> %val2,
				<4 x i64> %val3, <4 x i64> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				ret <4 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x i64> %val3, <4 x i64> %val4
				}

				define <4 x float> @fun154(<4 x double> %val1, <4 x double> %val2,
				<4 x float> %val3, <4 x float> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				ret <4 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x float> %val3, <4 x float> %val4
				}

				define <4 x double> @fun155(<4 x double> %val1, <4 x double> %val2,
				<4 x double> %val3, <4 x double> %val4) {
				%cmp = fcmp ogt <4 x double> %val1, %val2
				%sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				ret <4 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %cmp = fcmp ogt <4 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <4 x i1> %cmp, <4 x double> %val3, <4 x double> %val4
				}

				define <8 x i8> @fun156(<8 x float> %val1, <8 x float> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun157(<8 x float> %val1, <8 x float> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun158(<8 x float> %val1, <8 x float> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun159(<8 x float> %val1, <8 x float> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 44 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun160(<8 x float> %val1, <8 x float> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun161(<8 x float> %val1, <8 x float> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = fcmp ogt <8 x float> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 44 for instruction: %cmp = fcmp ogt <8 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <8 x i8> @fun162(<8 x double> %val1, <8 x double> %val2,
				<8 x i8> %val3, <8 x i8> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				ret <8 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i8> %val3, <8 x i8> %val4
				}

				define <8 x i16> @fun163(<8 x double> %val1, <8 x double> %val2,
				<8 x i16> %val3, <8 x i16> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				ret <8 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i16> %val3, <8 x i16> %val4
				}

				define <8 x i32> @fun164(<8 x double> %val1, <8 x double> %val2,
				<8 x i32> %val3, <8 x i32> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				ret <8 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i32> %val3, <8 x i32> %val4
				}

				define <8 x i64> @fun165(<8 x double> %val1, <8 x double> %val2,
				<8 x i64> %val3, <8 x i64> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				ret <8 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x i64> %val3, <8 x i64> %val4
				}

				define <8 x float> @fun166(<8 x double> %val1, <8 x double> %val2,
				<8 x float> %val3, <8 x float> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				ret <8 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x float> %val3, <8 x float> %val4
				}

				define <8 x double> @fun167(<8 x double> %val1, <8 x double> %val2,
				<8 x double> %val3, <8 x double> %val4) {
				%cmp = fcmp ogt <8 x double> %val1, %val2
				%sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				ret <8 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %cmp = fcmp ogt <8 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <8 x i1> %cmp, <8 x double> %val3, <8 x double> %val4
				}

				define <16 x i8> @fun168(<16 x float> %val1, <16 x float> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun169(<16 x float> %val1, <16 x float> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun170(<16 x float> %val1, <16 x float> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun171(<16 x float> %val1, <16 x float> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 88 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun172(<16 x float> %val1, <16 x float> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun173(<16 x float> %val1, <16 x float> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = fcmp ogt <16 x float> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 88 for instruction: %cmp = fcmp ogt <16 x float> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

				define <16 x i8> @fun174(<16 x double> %val1, <16 x double> %val2,
				<16 x i8> %val3, <16 x i8> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				ret <16 x i8> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i8> %val3, <16 x i8> %val4
				}

				define <16 x i16> @fun175(<16 x double> %val1, <16 x double> %val2,
				<16 x i16> %val3, <16 x i16> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				ret <16 x i16> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i16> %val3, <16 x i16> %val4
				}

				define <16 x i32> @fun176(<16 x double> %val1, <16 x double> %val2,
				<16 x i32> %val3, <16 x i32> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				ret <16 x i32> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i32> %val3, <16 x i32> %val4
				}

				define <16 x i64> @fun177(<16 x double> %val1, <16 x double> %val2,
				<16 x i64> %val3, <16 x i64> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				ret <16 x i64> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x i64> %val3, <16 x i64> %val4
				}

				define <16 x float> @fun178(<16 x double> %val1, <16 x double> %val2,
				<16 x float> %val3, <16 x float> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				ret <16 x float> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x float> %val3, <16 x float> %val4
				}

				define <16 x double> @fun179(<16 x double> %val1, <16 x double> %val2,
				<16 x double> %val3, <16 x double> %val4) {
				%cmp = fcmp ogt <16 x double> %val1, %val2
				%sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				ret <16 x double> %sel

				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %cmp = fcmp ogt <16 x double> %val1, %val2
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %sel = select <16 x i1> %cmp, <16 x double> %val3, <16 x double> %val4
				}

test/Analysis/CostModel/SystemZ/fp-arith.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s
				;
				; Note: The scalarized vector instructions cost is not including any
				; extracts, due to the undef operands
				;
				; Note: FRem is implemented with libcall, so not included here.

				define void @fadd() {
				%res0 = fadd float undef, undef
				%res1 = fadd double undef, undef
				%res2 = fadd fp128 undef, undef
				%res3 = fadd <2 x float> undef, undef
				%res4 = fadd <2 x double> undef, undef
				%res5 = fadd <4 x float> undef, undef
				%res6 = fadd <4 x double> undef, undef
				%res7 = fadd <8 x float> undef, undef
				%res8 = fadd <8 x double> undef, undef
				%res9 = fadd <16 x float> undef, undef
				%res10 = fadd <16 x double> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = fadd float undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = fadd double undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = fadd fp128 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res3 = fadd <2 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = fadd <2 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res5 = fadd <4 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res6 = fadd <4 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res7 = fadd <8 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res8 = fadd <8 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res9 = fadd <16 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res10 = fadd <16 x double> undef, undef

				ret void;
				}

				define void @fsub() {
				%res0 = fsub float undef, undef
				%res1 = fsub double undef, undef
				%res2 = fsub fp128 undef, undef
				%res3 = fsub <2 x float> undef, undef
				%res4 = fsub <2 x double> undef, undef
				%res5 = fsub <4 x float> undef, undef
				%res6 = fsub <4 x double> undef, undef
				%res7 = fsub <8 x float> undef, undef
				%res8 = fsub <8 x double> undef, undef
				%res9 = fsub <16 x float> undef, undef
				%res10 = fsub <16 x double> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = fsub float undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = fsub double undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = fsub fp128 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res3 = fsub <2 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = fsub <2 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res5 = fsub <4 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res6 = fsub <4 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res7 = fsub <8 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res8 = fsub <8 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res9 = fsub <16 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res10 = fsub <16 x double> undef, undef

				ret void;
				}

				define void @fmul() {
				%res0 = fmul float undef, undef
				%res1 = fmul double undef, undef
				%res2 = fmul fp128 undef, undef
				%res3 = fmul <2 x float> undef, undef
				%res4 = fmul <2 x double> undef, undef
				%res5 = fmul <4 x float> undef, undef
				%res6 = fmul <4 x double> undef, undef
				%res7 = fmul <8 x float> undef, undef
				%res8 = fmul <8 x double> undef, undef
				%res9 = fmul <16 x float> undef, undef
				%res10 = fmul <16 x double> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = fmul float undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = fmul double undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = fmul fp128 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res3 = fmul <2 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = fmul <2 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res5 = fmul <4 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res6 = fmul <4 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res7 = fmul <8 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res8 = fmul <8 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res9 = fmul <16 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res10 = fmul <16 x double> undef, undef

				ret void;
				}

				define void @fdiv() {
				%res0 = fdiv float undef, undef
				%res1 = fdiv double undef, undef
				%res2 = fdiv fp128 undef, undef
				%res3 = fdiv <2 x float> undef, undef
				%res4 = fdiv <2 x double> undef, undef
				%res5 = fdiv <4 x float> undef, undef
				%res6 = fdiv <4 x double> undef, undef
				%res7 = fdiv <8 x float> undef, undef
				%res8 = fdiv <8 x double> undef, undef
				%res9 = fdiv <16 x float> undef, undef
				%res10 = fdiv <16 x double> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = fdiv float undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = fdiv double undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = fdiv fp128 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res3 = fdiv <2 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = fdiv <2 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res5 = fdiv <4 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res6 = fdiv <4 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res7 = fdiv <8 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res8 = fdiv <8 x double> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res9 = fdiv <16 x float> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res10 = fdiv <16 x double> undef, undef

				ret void;
				}

test/Analysis/CostModel/SystemZ/fp-cast.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s
				;
				; Note: The scalarized vector instructions costs are not including any
				; extracts, due to the undef operands.

				define void @fpext() {
				%v0 = fpext double undef to fp128
				%v1 = fpext float undef to fp128
				%v2 = fpext float undef to double
				%v3 = fpext <2 x double> undef to <2 x fp128>
				%v4 = fpext <2 x float> undef to <2 x fp128>
				%v5 = fpext <2 x float> undef to <2 x double>
				%v6 = fpext <4 x double> undef to <4 x fp128>
				%v7 = fpext <4 x float> undef to <4 x fp128>
				%v8 = fpext <4 x float> undef to <4 x double>
				%v9 = fpext <8 x double> undef to <8 x fp128>
				%v10 = fpext <8 x float> undef to <8 x fp128>
				%v11 = fpext <8 x float> undef to <8 x double>
				%v12 = fpext <16 x float> undef to <16 x double>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = fpext double undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = fpext float undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = fpext float undef to double
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v3 = fpext <2 x double> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v4 = fpext <2 x float> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v5 = fpext <2 x float> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v6 = fpext <4 x double> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v7 = fpext <4 x float> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v8 = fpext <4 x float> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v9 = fpext <8 x double> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v10 = fpext <8 x float> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v11 = fpext <8 x float> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v12 = fpext <16 x float> undef to <16 x double>

				ret void;
				}

				define void @fptosi() {
				%v0 = fptosi fp128 undef to i64
				%v1 = fptosi fp128 undef to i32
				%v2 = fptosi fp128 undef to i16
				%v3 = fptosi fp128 undef to i8
				%v4 = fptosi double undef to i64
				%v5 = fptosi double undef to i32
				%v6 = fptosi double undef to i16
				%v7 = fptosi double undef to i8
				%v8 = fptosi float undef to i64
				%v9 = fptosi float undef to i32
				%v10 = fptosi float undef to i16
				%v11 = fptosi float undef to i8
				%v12 = fptosi <2 x fp128> undef to <2 x i64>
				%v13 = fptosi <2 x fp128> undef to <2 x i32>
				%v14 = fptosi <2 x fp128> undef to <2 x i16>
				%v15 = fptosi <2 x fp128> undef to <2 x i8>
				%v16 = fptosi <2 x double> undef to <2 x i64>
				%v17 = fptosi <2 x double> undef to <2 x i32>
				%v18 = fptosi <2 x double> undef to <2 x i16>
				%v19 = fptosi <2 x double> undef to <2 x i8>
				%v20 = fptosi <2 x float> undef to <2 x i64>
				%v21 = fptosi <2 x float> undef to <2 x i32>
				%v22 = fptosi <2 x float> undef to <2 x i16>
				%v23 = fptosi <2 x float> undef to <2 x i8>
				%v24 = fptosi <4 x fp128> undef to <4 x i64>
				%v25 = fptosi <4 x fp128> undef to <4 x i32>
				%v26 = fptosi <4 x fp128> undef to <4 x i16>
				%v27 = fptosi <4 x fp128> undef to <4 x i8>
				%v28 = fptosi <4 x double> undef to <4 x i64>
				%v29 = fptosi <4 x double> undef to <4 x i32>
				%v30 = fptosi <4 x double> undef to <4 x i16>
				%v31 = fptosi <4 x double> undef to <4 x i8>
				%v32 = fptosi <4 x float> undef to <4 x i64>
				%v33 = fptosi <4 x float> undef to <4 x i32>
				%v34 = fptosi <4 x float> undef to <4 x i16>
				%v35 = fptosi <4 x float> undef to <4 x i8>
				%v36 = fptosi <8 x fp128> undef to <8 x i64>
				%v37 = fptosi <8 x fp128> undef to <8 x i32>
				%v38 = fptosi <8 x fp128> undef to <8 x i16>
				%v39 = fptosi <8 x fp128> undef to <8 x i8>
				%v40 = fptosi <8 x double> undef to <8 x i64>
				%v41 = fptosi <8 x double> undef to <8 x i32>
				%v42 = fptosi <8 x double> undef to <8 x i16>
				%v43 = fptosi <8 x double> undef to <8 x i8>
				%v44 = fptosi <8 x float> undef to <8 x i64>
				%v45 = fptosi <8 x float> undef to <8 x i32>
				%v46 = fptosi <8 x float> undef to <8 x i16>
				%v47 = fptosi <8 x float> undef to <8 x i8>
				%v48 = fptosi <16 x double> undef to <16 x i64>
				%v49 = fptosi <16 x double> undef to <16 x i32>
				%v50 = fptosi <16 x double> undef to <16 x i16>
				%v51 = fptosi <16 x double> undef to <16 x i8>
				%v52 = fptosi <16 x float> undef to <16 x i64>
				%v53 = fptosi <16 x float> undef to <16 x i32>
				%v54 = fptosi <16 x float> undef to <16 x i16>
				%v55 = fptosi <16 x float> undef to <16 x i8>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = fptosi fp128 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = fptosi fp128 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = fptosi fp128 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = fptosi fp128 undef to i8
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = fptosi double undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = fptosi double undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v6 = fptosi double undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v7 = fptosi double undef to i8
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v8 = fptosi float undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v9 = fptosi float undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v10 = fptosi float undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v11 = fptosi float undef to i8
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v12 = fptosi <2 x fp128> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v13 = fptosi <2 x fp128> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v14 = fptosi <2 x fp128> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v15 = fptosi <2 x fp128> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v16 = fptosi <2 x double> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v17 = fptosi <2 x double> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v18 = fptosi <2 x double> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v19 = fptosi <2 x double> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v20 = fptosi <2 x float> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v21 = fptosi <2 x float> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v22 = fptosi <2 x float> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v23 = fptosi <2 x float> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v24 = fptosi <4 x fp128> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v25 = fptosi <4 x fp128> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v26 = fptosi <4 x fp128> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v27 = fptosi <4 x fp128> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v28 = fptosi <4 x double> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = fptosi <4 x double> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v30 = fptosi <4 x double> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v31 = fptosi <4 x double> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v32 = fptosi <4 x float> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v33 = fptosi <4 x float> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v34 = fptosi <4 x float> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v35 = fptosi <4 x float> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v36 = fptosi <8 x fp128> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v37 = fptosi <8 x fp128> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v38 = fptosi <8 x fp128> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v39 = fptosi <8 x fp128> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v40 = fptosi <8 x double> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v41 = fptosi <8 x double> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v42 = fptosi <8 x double> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v43 = fptosi <8 x double> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v44 = fptosi <8 x float> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v45 = fptosi <8 x float> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v46 = fptosi <8 x float> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v47 = fptosi <8 x float> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v48 = fptosi <16 x double> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v49 = fptosi <16 x double> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v50 = fptosi <16 x double> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v51 = fptosi <16 x double> undef to <16 x i8>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v52 = fptosi <16 x float> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v53 = fptosi <16 x float> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v54 = fptosi <16 x float> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v55 = fptosi <16 x float> undef to <16 x i8>

				ret void;
				}


				define void @fptoui() {
				%v0 = fptoui fp128 undef to i64
				%v1 = fptoui fp128 undef to i32
				%v2 = fptoui fp128 undef to i16
				%v3 = fptoui fp128 undef to i8
				%v4 = fptoui double undef to i64
				%v5 = fptoui double undef to i32
				%v6 = fptoui double undef to i16
				%v7 = fptoui double undef to i8
				%v8 = fptoui float undef to i64
				%v9 = fptoui float undef to i32
				%v10 = fptoui float undef to i16
				%v11 = fptoui float undef to i8
				%v12 = fptoui <2 x fp128> undef to <2 x i64>
				%v13 = fptoui <2 x fp128> undef to <2 x i32>
				%v14 = fptoui <2 x fp128> undef to <2 x i16>
				%v15 = fptoui <2 x fp128> undef to <2 x i8>
				%v16 = fptoui <2 x double> undef to <2 x i64>
				%v17 = fptoui <2 x double> undef to <2 x i32>
				%v18 = fptoui <2 x double> undef to <2 x i16>
				%v19 = fptoui <2 x double> undef to <2 x i8>
				%v20 = fptoui <2 x float> undef to <2 x i64>
				%v21 = fptoui <2 x float> undef to <2 x i32>
				%v22 = fptoui <2 x float> undef to <2 x i16>
				%v23 = fptoui <2 x float> undef to <2 x i8>
				%v24 = fptoui <4 x fp128> undef to <4 x i64>
				%v25 = fptoui <4 x fp128> undef to <4 x i32>
				%v26 = fptoui <4 x fp128> undef to <4 x i16>
				%v27 = fptoui <4 x fp128> undef to <4 x i8>
				%v28 = fptoui <4 x double> undef to <4 x i64>
				%v29 = fptoui <4 x double> undef to <4 x i32>
				%v30 = fptoui <4 x double> undef to <4 x i16>
				%v31 = fptoui <4 x double> undef to <4 x i8>
				%v32 = fptoui <4 x float> undef to <4 x i64>
				%v33 = fptoui <4 x float> undef to <4 x i32>
				%v34 = fptoui <4 x float> undef to <4 x i16>
				%v35 = fptoui <4 x float> undef to <4 x i8>
				%v36 = fptoui <8 x fp128> undef to <8 x i64>
				%v37 = fptoui <8 x fp128> undef to <8 x i32>
				%v38 = fptoui <8 x fp128> undef to <8 x i16>
				%v39 = fptoui <8 x fp128> undef to <8 x i8>
				%v40 = fptoui <8 x double> undef to <8 x i64>
				%v41 = fptoui <8 x double> undef to <8 x i32>
				%v42 = fptoui <8 x double> undef to <8 x i16>
				%v43 = fptoui <8 x double> undef to <8 x i8>
				%v44 = fptoui <8 x float> undef to <8 x i64>
				%v45 = fptoui <8 x float> undef to <8 x i32>
				%v46 = fptoui <8 x float> undef to <8 x i16>
				%v47 = fptoui <8 x float> undef to <8 x i8>
				%v48 = fptoui <16 x double> undef to <16 x i64>
				%v49 = fptoui <16 x double> undef to <16 x i32>
				%v50 = fptoui <16 x double> undef to <16 x i16>
				%v51 = fptoui <16 x double> undef to <16 x i8>
				%v52 = fptoui <16 x float> undef to <16 x i64>
				%v53 = fptoui <16 x float> undef to <16 x i32>
				%v54 = fptoui <16 x float> undef to <16 x i16>
				%v55 = fptoui <16 x float> undef to <16 x i8>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = fptoui fp128 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = fptoui fp128 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = fptoui fp128 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = fptoui fp128 undef to i8
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = fptoui double undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = fptoui double undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v6 = fptoui double undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v7 = fptoui double undef to i8
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v8 = fptoui float undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v9 = fptoui float undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v10 = fptoui float undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v11 = fptoui float undef to i8
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v12 = fptoui <2 x fp128> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v13 = fptoui <2 x fp128> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v14 = fptoui <2 x fp128> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v15 = fptoui <2 x fp128> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v16 = fptoui <2 x double> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v17 = fptoui <2 x double> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v18 = fptoui <2 x double> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v19 = fptoui <2 x double> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v20 = fptoui <2 x float> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v21 = fptoui <2 x float> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v22 = fptoui <2 x float> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v23 = fptoui <2 x float> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v24 = fptoui <4 x fp128> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v25 = fptoui <4 x fp128> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v26 = fptoui <4 x fp128> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v27 = fptoui <4 x fp128> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v28 = fptoui <4 x double> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = fptoui <4 x double> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v30 = fptoui <4 x double> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v31 = fptoui <4 x double> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v32 = fptoui <4 x float> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v33 = fptoui <4 x float> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v34 = fptoui <4 x float> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v35 = fptoui <4 x float> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v36 = fptoui <8 x fp128> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v37 = fptoui <8 x fp128> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v38 = fptoui <8 x fp128> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v39 = fptoui <8 x fp128> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v40 = fptoui <8 x double> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v41 = fptoui <8 x double> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v42 = fptoui <8 x double> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v43 = fptoui <8 x double> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v44 = fptoui <8 x float> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v45 = fptoui <8 x float> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v46 = fptoui <8 x float> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v47 = fptoui <8 x float> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v48 = fptoui <16 x double> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v49 = fptoui <16 x double> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v50 = fptoui <16 x double> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v51 = fptoui <16 x double> undef to <16 x i8>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v52 = fptoui <16 x float> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v53 = fptoui <16 x float> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v54 = fptoui <16 x float> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v55 = fptoui <16 x float> undef to <16 x i8>

				ret void;
				}

				define void @fptrunc() {
				%v0 = fptrunc fp128 undef to double
				%v1 = fptrunc fp128 undef to float
				%v2 = fptrunc double undef to float
				%v3 = fptrunc <2 x fp128> undef to <2 x double>
				%v4 = fptrunc <2 x fp128> undef to <2 x float>
				%v5 = fptrunc <2 x double> undef to <2 x float>
				%v6 = fptrunc <4 x fp128> undef to <4 x double>
				%v7 = fptrunc <4 x fp128> undef to <4 x float>
				%v8 = fptrunc <4 x double> undef to <4 x float>
				%v9 = fptrunc <8 x fp128> undef to <8 x double>
				%v10 = fptrunc <8 x fp128> undef to <8 x float>
				%v11 = fptrunc <8 x double> undef to <8 x float>
				%v12 = fptrunc <16 x double> undef to <16 x float>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = fptrunc fp128 undef to double
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = fptrunc fp128 undef to float
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = fptrunc double undef to float
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v3 = fptrunc <2 x fp128> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v4 = fptrunc <2 x fp128> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v5 = fptrunc <2 x double> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v6 = fptrunc <4 x fp128> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v7 = fptrunc <4 x fp128> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v8 = fptrunc <4 x double> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v9 = fptrunc <8 x fp128> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v10 = fptrunc <8 x fp128> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v11 = fptrunc <8 x double> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v12 = fptrunc <16 x double> undef to <16 x float>

				ret void;
				}

				define void @sitofp() {
				%v0 = sitofp i64 undef to fp128
				%v1 = sitofp i64 undef to double
				%v2 = sitofp i64 undef to float
				%v3 = sitofp i32 undef to fp128
				%v4 = sitofp i32 undef to double
				%v5 = sitofp i32 undef to float
				%v6 = sitofp i16 undef to fp128
				%v7 = sitofp i16 undef to double
				%v8 = sitofp i16 undef to float
				%v9 = sitofp i8 undef to fp128
				%v10 = sitofp i8 undef to double
				%v11 = sitofp i8 undef to float
				%v12 = sitofp <2 x i64> undef to <2 x fp128>
				%v13 = sitofp <2 x i64> undef to <2 x double>
				%v14 = sitofp <2 x i64> undef to <2 x float>
				%v15 = sitofp <2 x i32> undef to <2 x fp128>
				%v16 = sitofp <2 x i32> undef to <2 x double>
				%v17 = sitofp <2 x i32> undef to <2 x float>
				%v18 = sitofp <2 x i16> undef to <2 x fp128>
				%v19 = sitofp <2 x i16> undef to <2 x double>
				%v20 = sitofp <2 x i16> undef to <2 x float>
				%v21 = sitofp <2 x i8> undef to <2 x fp128>
				%v22 = sitofp <2 x i8> undef to <2 x double>
				%v23 = sitofp <2 x i8> undef to <2 x float>
				%v24 = sitofp <4 x i64> undef to <4 x fp128>
				%v25 = sitofp <4 x i64> undef to <4 x double>
				%v26 = sitofp <4 x i64> undef to <4 x float>
				%v27 = sitofp <4 x i32> undef to <4 x fp128>
				%v28 = sitofp <4 x i32> undef to <4 x double>
				%v29 = sitofp <4 x i32> undef to <4 x float>
				%v30 = sitofp <4 x i16> undef to <4 x fp128>
				%v31 = sitofp <4 x i16> undef to <4 x double>
				%v32 = sitofp <4 x i16> undef to <4 x float>
				%v33 = sitofp <4 x i8> undef to <4 x fp128>
				%v34 = sitofp <4 x i8> undef to <4 x double>
				%v35 = sitofp <4 x i8> undef to <4 x float>
				%v36 = sitofp <8 x i64> undef to <8 x fp128>
				%v37 = sitofp <8 x i64> undef to <8 x double>
				%v38 = sitofp <8 x i64> undef to <8 x float>
				%v39 = sitofp <8 x i32> undef to <8 x fp128>
				%v40 = sitofp <8 x i32> undef to <8 x double>
				%v41 = sitofp <8 x i32> undef to <8 x float>
				%v42 = sitofp <8 x i16> undef to <8 x fp128>
				%v43 = sitofp <8 x i16> undef to <8 x double>
				%v44 = sitofp <8 x i16> undef to <8 x float>
				%v45 = sitofp <8 x i8> undef to <8 x fp128>
				%v46 = sitofp <8 x i8> undef to <8 x double>
				%v47 = sitofp <8 x i8> undef to <8 x float>
				%v48 = sitofp <16 x i64> undef to <16 x double>
				%v49 = sitofp <16 x i64> undef to <16 x float>
				%v50 = sitofp <16 x i32> undef to <16 x double>
				%v51 = sitofp <16 x i32> undef to <16 x float>
				%v52 = sitofp <16 x i16> undef to <16 x double>
				%v53 = sitofp <16 x i16> undef to <16 x float>
				%v54 = sitofp <16 x i8> undef to <16 x double>
				%v55 = sitofp <16 x i8> undef to <16 x float>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = sitofp i64 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = sitofp i64 undef to double
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = sitofp i64 undef to float
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = sitofp i32 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = sitofp i32 undef to double
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = sitofp i32 undef to float
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v6 = sitofp i16 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v7 = sitofp i16 undef to double
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v8 = sitofp i16 undef to float
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v9 = sitofp i8 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v10 = sitofp i8 undef to double
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v11 = sitofp i8 undef to float
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v12 = sitofp <2 x i64> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v13 = sitofp <2 x i64> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v14 = sitofp <2 x i64> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v15 = sitofp <2 x i32> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v16 = sitofp <2 x i32> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v17 = sitofp <2 x i32> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v18 = sitofp <2 x i16> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v19 = sitofp <2 x i16> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v20 = sitofp <2 x i16> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v21 = sitofp <2 x i8> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v22 = sitofp <2 x i8> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v23 = sitofp <2 x i8> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v24 = sitofp <4 x i64> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v25 = sitofp <4 x i64> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v26 = sitofp <4 x i64> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v27 = sitofp <4 x i32> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v28 = sitofp <4 x i32> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = sitofp <4 x i32> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v30 = sitofp <4 x i16> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v31 = sitofp <4 x i16> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v32 = sitofp <4 x i16> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v33 = sitofp <4 x i8> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v34 = sitofp <4 x i8> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v35 = sitofp <4 x i8> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v36 = sitofp <8 x i64> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v37 = sitofp <8 x i64> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v38 = sitofp <8 x i64> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v39 = sitofp <8 x i32> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v40 = sitofp <8 x i32> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v41 = sitofp <8 x i32> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v42 = sitofp <8 x i16> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v43 = sitofp <8 x i16> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v44 = sitofp <8 x i16> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v45 = sitofp <8 x i8> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v46 = sitofp <8 x i8> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v47 = sitofp <8 x i8> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v48 = sitofp <16 x i64> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v49 = sitofp <16 x i64> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v50 = sitofp <16 x i32> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v51 = sitofp <16 x i32> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v52 = sitofp <16 x i16> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v53 = sitofp <16 x i16> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v54 = sitofp <16 x i8> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v55 = sitofp <16 x i8> undef to <16 x float>

				ret void;
				}

				define void @uitofp() {
				%v0 = uitofp i64 undef to fp128
				%v1 = uitofp i64 undef to double
				%v2 = uitofp i64 undef to float
				%v3 = uitofp i32 undef to fp128
				%v4 = uitofp i32 undef to double
				%v5 = uitofp i32 undef to float
				%v6 = uitofp i16 undef to fp128
				%v7 = uitofp i16 undef to double
				%v8 = uitofp i16 undef to float
				%v9 = uitofp i8 undef to fp128
				%v10 = uitofp i8 undef to double
				%v11 = uitofp i8 undef to float
				%v12 = uitofp <2 x i64> undef to <2 x fp128>
				%v13 = uitofp <2 x i64> undef to <2 x double>
				%v14 = uitofp <2 x i64> undef to <2 x float>
				%v15 = uitofp <2 x i32> undef to <2 x fp128>
				%v16 = uitofp <2 x i32> undef to <2 x double>
				%v17 = uitofp <2 x i32> undef to <2 x float>
				%v18 = uitofp <2 x i16> undef to <2 x fp128>
				%v19 = uitofp <2 x i16> undef to <2 x double>
				%v20 = uitofp <2 x i16> undef to <2 x float>
				%v21 = uitofp <2 x i8> undef to <2 x fp128>
				%v22 = uitofp <2 x i8> undef to <2 x double>
				%v23 = uitofp <2 x i8> undef to <2 x float>
				%v24 = uitofp <4 x i64> undef to <4 x fp128>
				%v25 = uitofp <4 x i64> undef to <4 x double>
				%v26 = uitofp <4 x i64> undef to <4 x float>
				%v27 = uitofp <4 x i32> undef to <4 x fp128>
				%v28 = uitofp <4 x i32> undef to <4 x double>
				%v29 = uitofp <4 x i32> undef to <4 x float>
				%v30 = uitofp <4 x i16> undef to <4 x fp128>
				%v31 = uitofp <4 x i16> undef to <4 x double>
				%v32 = uitofp <4 x i16> undef to <4 x float>
				%v33 = uitofp <4 x i8> undef to <4 x fp128>
				%v34 = uitofp <4 x i8> undef to <4 x double>
				%v35 = uitofp <4 x i8> undef to <4 x float>
				%v36 = uitofp <8 x i64> undef to <8 x fp128>
				%v37 = uitofp <8 x i64> undef to <8 x double>
				%v38 = uitofp <8 x i64> undef to <8 x float>
				%v39 = uitofp <8 x i32> undef to <8 x fp128>
				%v40 = uitofp <8 x i32> undef to <8 x double>
				%v41 = uitofp <8 x i32> undef to <8 x float>
				%v42 = uitofp <8 x i16> undef to <8 x fp128>
				%v43 = uitofp <8 x i16> undef to <8 x double>
				%v44 = uitofp <8 x i16> undef to <8 x float>
				%v45 = uitofp <8 x i8> undef to <8 x fp128>
				%v46 = uitofp <8 x i8> undef to <8 x double>
				%v47 = uitofp <8 x i8> undef to <8 x float>
				%v48 = uitofp <16 x i64> undef to <16 x double>
				%v49 = uitofp <16 x i64> undef to <16 x float>
				%v50 = uitofp <16 x i32> undef to <16 x double>
				%v51 = uitofp <16 x i32> undef to <16 x float>
				%v52 = uitofp <16 x i16> undef to <16 x double>
				%v53 = uitofp <16 x i16> undef to <16 x float>
				%v54 = uitofp <16 x i8> undef to <16 x double>
				%v55 = uitofp <16 x i8> undef to <16 x float>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = uitofp i64 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = uitofp i64 undef to double
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = uitofp i64 undef to float
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = uitofp i32 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = uitofp i32 undef to double
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = uitofp i32 undef to float
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v6 = uitofp i16 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v7 = uitofp i16 undef to double
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v8 = uitofp i16 undef to float
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v9 = uitofp i8 undef to fp128
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v10 = uitofp i8 undef to double
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v11 = uitofp i8 undef to float
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v12 = uitofp <2 x i64> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v13 = uitofp <2 x i64> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v14 = uitofp <2 x i64> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v15 = uitofp <2 x i32> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v16 = uitofp <2 x i32> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v17 = uitofp <2 x i32> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v18 = uitofp <2 x i16> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v19 = uitofp <2 x i16> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v20 = uitofp <2 x i16> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v21 = uitofp <2 x i8> undef to <2 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v22 = uitofp <2 x i8> undef to <2 x double>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v23 = uitofp <2 x i8> undef to <2 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v24 = uitofp <4 x i64> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v25 = uitofp <4 x i64> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v26 = uitofp <4 x i64> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v27 = uitofp <4 x i32> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v28 = uitofp <4 x i32> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = uitofp <4 x i32> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v30 = uitofp <4 x i16> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v31 = uitofp <4 x i16> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v32 = uitofp <4 x i16> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v33 = uitofp <4 x i8> undef to <4 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v34 = uitofp <4 x i8> undef to <4 x double>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v35 = uitofp <4 x i8> undef to <4 x float>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v36 = uitofp <8 x i64> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v37 = uitofp <8 x i64> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v38 = uitofp <8 x i64> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %v39 = uitofp <8 x i32> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v40 = uitofp <8 x i32> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v41 = uitofp <8 x i32> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v42 = uitofp <8 x i16> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v43 = uitofp <8 x i16> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v44 = uitofp <8 x i16> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %v45 = uitofp <8 x i8> undef to <8 x fp128>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v46 = uitofp <8 x i8> undef to <8 x double>
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %v47 = uitofp <8 x i8> undef to <8 x float>
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %v48 = uitofp <16 x i64> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v49 = uitofp <16 x i64> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v50 = uitofp <16 x i32> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %v51 = uitofp <16 x i32> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v52 = uitofp <16 x i16> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v53 = uitofp <16 x i16> undef to <16 x float>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v54 = uitofp <16 x i8> undef to <16 x double>
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %v55 = uitofp <16 x i8> undef to <16 x float>

				ret void;
				}

test/Analysis/CostModel/SystemZ/int-arith.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s
				;
				; Note: The scalarized vector instructions costs are not including any
				; extracts, due to the undef operands.

				define void @add() {
				%res0 = add i8 undef, undef
				%res1 = add i16 undef, undef
				%res2 = add i32 undef, undef
				%res3 = add i64 undef, undef
				%res4 = add <2 x i8> undef, undef
				%res5 = add <2 x i16> undef, undef
				%res6 = add <2 x i32> undef, undef
				%res7 = add <2 x i64> undef, undef
				%res8 = add <4 x i8> undef, undef
				%res9 = add <4 x i16> undef, undef
				%res10 = add <4 x i32> undef, undef
				%res11 = add <4 x i64> undef, undef
				%res12 = add <8 x i8> undef, undef
				%res13 = add <8 x i16> undef, undef
				%res14 = add <8 x i32> undef, undef
				%res15 = add <8 x i64> undef, undef
				%res16 = add <16 x i8> undef, undef
				%res17 = add <16 x i16> undef, undef
				%res18 = add <16 x i32> undef, undef
				%res19 = add <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = add i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = add i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = add i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = add i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = add <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = add <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = add <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = add <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = add <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = add <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = add <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = add <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = add <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = add <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = add <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = add <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = add <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = add <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = add <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = add <16 x i64> undef, undef

				ret void;
				}

				define void @sub() {
				%res0 = sub i8 undef, undef
				%res1 = sub i16 undef, undef
				%res2 = sub i32 undef, undef
				%res3 = sub i64 undef, undef
				%res4 = sub <2 x i8> undef, undef
				%res5 = sub <2 x i16> undef, undef
				%res6 = sub <2 x i32> undef, undef
				%res7 = sub <2 x i64> undef, undef
				%res8 = sub <4 x i8> undef, undef
				%res9 = sub <4 x i16> undef, undef
				%res10 = sub <4 x i32> undef, undef
				%res11 = sub <4 x i64> undef, undef
				%res12 = sub <8 x i8> undef, undef
				%res13 = sub <8 x i16> undef, undef
				%res14 = sub <8 x i32> undef, undef
				%res15 = sub <8 x i64> undef, undef
				%res16 = sub <16 x i8> undef, undef
				%res17 = sub <16 x i16> undef, undef
				%res18 = sub <16 x i32> undef, undef
				%res19 = sub <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = sub i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = sub i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = sub i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = sub i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = sub <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = sub <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = sub <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = sub <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = sub <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = sub <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = sub <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = sub <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = sub <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = sub <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = sub <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = sub <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = sub <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = sub <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = sub <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = sub <16 x i64> undef, undef

				ret void;
				}

				define void @mul() {
				%res0 = mul i8 undef, undef
				%res1 = mul i16 undef, undef
				%res2 = mul i32 undef, undef
				%res3 = mul i64 undef, undef
				%res4 = mul <2 x i8> undef, undef
				%res5 = mul <2 x i16> undef, undef
				%res6 = mul <2 x i32> undef, undef
				%res7 = mul <2 x i64> undef, undef
				%res8 = mul <4 x i8> undef, undef
				%res9 = mul <4 x i16> undef, undef
				%res10 = mul <4 x i32> undef, undef
				%res11 = mul <4 x i64> undef, undef
				%res12 = mul <8 x i8> undef, undef
				%res13 = mul <8 x i16> undef, undef
				%res14 = mul <8 x i32> undef, undef
				%res15 = mul <8 x i64> undef, undef
				%res16 = mul <16 x i8> undef, undef
				%res17 = mul <16 x i16> undef, undef
				%res18 = mul <16 x i32> undef, undef
				%res19 = mul <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = mul i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = mul i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = mul i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = mul i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = mul <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = mul <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = mul <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res7 = mul <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = mul <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = mul <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = mul <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res11 = mul <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = mul <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = mul <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = mul <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res15 = mul <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = mul <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = mul <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = mul <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res19 = mul <16 x i64> undef, undef

				ret void;
				}

				define void @sdiv() {
				%res0 = sdiv i8 undef, undef
				%res1 = sdiv i16 undef, undef
				%res2 = sdiv i32 undef, undef
				%res3 = sdiv i64 undef, undef
				%res4 = sdiv <2 x i8> undef, undef
				%res5 = sdiv <2 x i16> undef, undef
				%res6 = sdiv <2 x i32> undef, undef
				%res7 = sdiv <2 x i64> undef, undef
				%res8 = sdiv <4 x i8> undef, undef
				%res9 = sdiv <4 x i16> undef, undef
				%res10 = sdiv <4 x i32> undef, undef
				%res11 = sdiv <4 x i64> undef, undef
				%res12 = sdiv <8 x i8> undef, undef
				%res13 = sdiv <8 x i16> undef, undef
				%res14 = sdiv <8 x i32> undef, undef
				%res15 = sdiv <8 x i64> undef, undef
				%res16 = sdiv <16 x i8> undef, undef
				%res17 = sdiv <16 x i16> undef, undef
				%res18 = sdiv <16 x i32> undef, undef
				%res19 = sdiv <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = sdiv i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = sdiv i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = sdiv i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res3 = sdiv i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = sdiv <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = sdiv <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = sdiv <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res7 = sdiv <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = sdiv <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = sdiv <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = sdiv <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res11 = sdiv <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res12 = sdiv <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res13 = sdiv <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res14 = sdiv <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res15 = sdiv <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res16 = sdiv <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res17 = sdiv <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %res18 = sdiv <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %res19 = sdiv <16 x i64> undef, undef

				ret void;
				}

				define void @srem() {
				%res0 = srem i8 undef, undef
				%res1 = srem i16 undef, undef
				%res2 = srem i32 undef, undef
				%res3 = srem i64 undef, undef
				%res4 = srem <2 x i8> undef, undef
				%res5 = srem <2 x i16> undef, undef
				%res6 = srem <2 x i32> undef, undef
				%res7 = srem <2 x i64> undef, undef
				%res8 = srem <4 x i8> undef, undef
				%res9 = srem <4 x i16> undef, undef
				%res10 = srem <4 x i32> undef, undef
				%res11 = srem <4 x i64> undef, undef
				%res12 = srem <8 x i8> undef, undef
				%res13 = srem <8 x i16> undef, undef
				%res14 = srem <8 x i32> undef, undef
				%res15 = srem <8 x i64> undef, undef
				%res16 = srem <16 x i8> undef, undef
				%res17 = srem <16 x i16> undef, undef
				%res18 = srem <16 x i32> undef, undef
				%res19 = srem <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = srem i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = srem i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = srem i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res3 = srem i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = srem <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = srem <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = srem <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res7 = srem <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = srem <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = srem <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = srem <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res11 = srem <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res12 = srem <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res13 = srem <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res14 = srem <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res15 = srem <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res16 = srem <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res17 = srem <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %res18 = srem <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 48 for instruction: %res19 = srem <16 x i64> undef, undef

				ret void;
				}

				define void @udiv() {
				%res0 = udiv i8 undef, undef
				%res1 = udiv i16 undef, undef
				%res2 = udiv i32 undef, undef
				%res3 = udiv i64 undef, undef
				%res4 = udiv <2 x i8> undef, undef
				%res5 = udiv <2 x i16> undef, undef
				%res6 = udiv <2 x i32> undef, undef
				%res7 = udiv <2 x i64> undef, undef
				%res8 = udiv <4 x i8> undef, undef
				%res9 = udiv <4 x i16> undef, undef
				%res10 = udiv <4 x i32> undef, undef
				%res11 = udiv <4 x i64> undef, undef
				%res12 = udiv <8 x i8> undef, undef
				%res13 = udiv <8 x i16> undef, undef
				%res14 = udiv <8 x i32> undef, undef
				%res15 = udiv <8 x i64> undef, undef
				%res16 = udiv <16 x i8> undef, undef
				%res17 = udiv <16 x i16> undef, undef
				%res18 = udiv <16 x i32> undef, undef
				%res19 = udiv <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = udiv i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = udiv i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res2 = udiv i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res3 = udiv i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = udiv <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = udiv <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res6 = udiv <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res7 = udiv <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = udiv <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = udiv <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res10 = udiv <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res11 = udiv <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res12 = udiv <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res13 = udiv <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res14 = udiv <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res15 = udiv <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res16 = udiv <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res17 = udiv <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %res18 = udiv <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %res19 = udiv <16 x i64> undef, undef

				ret void;
				}

				define void @urem() {
				%res0 = urem i8 undef, undef
				%res1 = urem i16 undef, undef
				%res2 = urem i32 undef, undef
				%res3 = urem i64 undef, undef
				%res4 = urem <2 x i8> undef, undef
				%res5 = urem <2 x i16> undef, undef
				%res6 = urem <2 x i32> undef, undef
				%res7 = urem <2 x i64> undef, undef
				%res8 = urem <4 x i8> undef, undef
				%res9 = urem <4 x i16> undef, undef
				%res10 = urem <4 x i32> undef, undef
				%res11 = urem <4 x i64> undef, undef
				%res12 = urem <8 x i8> undef, undef
				%res13 = urem <8 x i16> undef, undef
				%res14 = urem <8 x i32> undef, undef
				%res15 = urem <8 x i64> undef, undef
				%res16 = urem <16 x i8> undef, undef
				%res17 = urem <16 x i16> undef, undef
				%res18 = urem <16 x i32> undef, undef
				%res19 = urem <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = urem i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = urem i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res2 = urem i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res3 = urem i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = urem <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = urem <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res6 = urem <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res7 = urem <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = urem <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = urem <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res10 = urem <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 16 for instruction: %res11 = urem <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res12 = urem <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 40 for instruction: %res13 = urem <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res14 = urem <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 32 for instruction: %res15 = urem <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res16 = urem <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 80 for instruction: %res17 = urem <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %res18 = urem <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 64 for instruction: %res19 = urem <16 x i64> undef, undef

				ret void;
				}

test/Analysis/CostModel/SystemZ/int-cast.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s

				define void @sext() {
				%v0 = sext i8 undef to i16
				%v1 = sext i8 undef to i32
				%v2 = sext i8 undef to i64
				%v3 = sext i16 undef to i32
				%v4 = sext i16 undef to i64
				%v5 = sext i32 undef to i64
				%v6 = sext <2 x i8> undef to <2 x i16>
				%v7 = sext <2 x i8> undef to <2 x i32>
				%v8 = sext <2 x i8> undef to <2 x i64>
				%v9 = sext <2 x i16> undef to <2 x i32>
				%v10 = sext <2 x i16> undef to <2 x i64>
				%v11 = sext <2 x i32> undef to <2 x i64>
				%v12 = sext <4 x i8> undef to <4 x i16>
				%v13 = sext <4 x i8> undef to <4 x i32>
				%v14 = sext <4 x i8> undef to <4 x i64>
				%v15 = sext <4 x i16> undef to <4 x i32>
				%v16 = sext <4 x i16> undef to <4 x i64>
				%v17 = sext <4 x i32> undef to <4 x i64>
				%v18 = sext <8 x i8> undef to <8 x i16>
				%v19 = sext <8 x i8> undef to <8 x i32>
				%v20 = sext <8 x i8> undef to <8 x i64>
				%v21 = sext <8 x i16> undef to <8 x i32>
				%v22 = sext <8 x i16> undef to <8 x i64>
				%v23 = sext <8 x i32> undef to <8 x i64>
				%v24 = sext <16 x i8> undef to <16 x i16>
				%v25 = sext <16 x i8> undef to <16 x i32>
				%v26 = sext <16 x i8> undef to <16 x i64>
				%v27 = sext <16 x i16> undef to <16 x i32>
				%v28 = sext <16 x i16> undef to <16 x i64>
				%v29 = sext <16 x i32> undef to <16 x i64>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = sext i8 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = sext i8 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = sext i8 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = sext i16 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = sext i16 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = sext i32 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v6 = sext <2 x i8> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v7 = sext <2 x i8> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v8 = sext <2 x i8> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v9 = sext <2 x i16> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v10 = sext <2 x i16> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v11 = sext <2 x i32> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v12 = sext <4 x i8> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v13 = sext <4 x i8> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 7 for instruction: %v14 = sext <4 x i8> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v15 = sext <4 x i16> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %v16 = sext <4 x i16> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v17 = sext <4 x i32> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v18 = sext <8 x i8> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %v19 = sext <8 x i8> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 15 for instruction: %v20 = sext <8 x i8> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v21 = sext <8 x i16> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 11 for instruction: %v22 = sext <8 x i16> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v23 = sext <8 x i32> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v24 = sext <16 x i8> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 11 for instruction: %v25 = sext <16 x i8> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 31 for instruction: %v26 = sext <16 x i8> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v27 = sext <16 x i16> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %v28 = sext <16 x i16> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = sext <16 x i32> undef to <16 x i64>

				ret void
				}

				define void @zext() {
				%v0 = zext i8 undef to i16
				%v1 = zext i8 undef to i32
				%v2 = zext i8 undef to i64
				%v3 = zext i16 undef to i32
				%v4 = zext i16 undef to i64
				%v5 = zext i32 undef to i64
				%v6 = zext <2 x i8> undef to <2 x i16>
				%v7 = zext <2 x i8> undef to <2 x i32>
				%v8 = zext <2 x i8> undef to <2 x i64>
				%v9 = zext <2 x i16> undef to <2 x i32>
				%v10 = zext <2 x i16> undef to <2 x i64>
				%v11 = zext <2 x i32> undef to <2 x i64>
				%v12 = zext <4 x i8> undef to <4 x i16>
				%v13 = zext <4 x i8> undef to <4 x i32>
				%v14 = zext <4 x i8> undef to <4 x i64>
				%v15 = zext <4 x i16> undef to <4 x i32>
				%v16 = zext <4 x i16> undef to <4 x i64>
				%v17 = zext <4 x i32> undef to <4 x i64>
				%v18 = zext <8 x i8> undef to <8 x i16>
				%v19 = zext <8 x i8> undef to <8 x i32>
				%v20 = zext <8 x i8> undef to <8 x i64>
				%v21 = zext <8 x i16> undef to <8 x i32>
				%v22 = zext <8 x i16> undef to <8 x i64>
				%v23 = zext <8 x i32> undef to <8 x i64>
				%v24 = zext <16 x i8> undef to <16 x i16>
				%v25 = zext <16 x i8> undef to <16 x i32>
				%v26 = zext <16 x i8> undef to <16 x i64>
				%v27 = zext <16 x i16> undef to <16 x i32>
				%v28 = zext <16 x i16> undef to <16 x i64>
				%v29 = zext <16 x i32> undef to <16 x i64>

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v0 = zext i8 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v1 = zext i8 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v2 = zext i8 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v3 = zext i16 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v4 = zext i16 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v5 = zext i32 undef to i64
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v6 = zext <2 x i8> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v7 = zext <2 x i8> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v8 = zext <2 x i8> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v9 = zext <2 x i16> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v10 = zext <2 x i16> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v11 = zext <2 x i32> undef to <2 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v12 = zext <4 x i8> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v13 = zext <4 x i8> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 7 for instruction: %v14 = zext <4 x i8> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v15 = zext <4 x i16> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %v16 = zext <4 x i16> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v17 = zext <4 x i32> undef to <4 x i64>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v18 = zext <8 x i8> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %v19 = zext <8 x i8> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 15 for instruction: %v20 = zext <8 x i8> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v21 = zext <8 x i16> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 11 for instruction: %v22 = zext <8 x i16> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v23 = zext <8 x i32> undef to <8 x i64>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v24 = zext <16 x i8> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 11 for instruction: %v25 = zext <16 x i8> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 31 for instruction: %v26 = zext <16 x i8> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v27 = zext <16 x i16> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 22 for instruction: %v28 = zext <16 x i16> undef to <16 x i64>
				; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %v29 = zext <16 x i32> undef to <16 x i64>

				ret void
				}

				define void @trunc() {
				%v0 = trunc i16 undef to i8
				%v1 = trunc i32 undef to i16
				%v2 = trunc i32 undef to i8
				%v3 = trunc i64 undef to i32
				%v4 = trunc i64 undef to i16
				%v5 = trunc i64 undef to i8
				%v6 = trunc <2 x i16> undef to <2 x i8>
				%v7 = trunc <2 x i32> undef to <2 x i16>
				%v8 = trunc <2 x i32> undef to <2 x i8>
				%v9 = trunc <2 x i64> undef to <2 x i32>
				%v10 = trunc <2 x i64> undef to <2 x i16>
				%v11 = trunc <2 x i64> undef to <2 x i8>
				%v12 = trunc <4 x i16> undef to <4 x i8>
				%v13 = trunc <4 x i32> undef to <4 x i16>
				%v14 = trunc <4 x i32> undef to <4 x i8>
				%v15 = trunc <4 x i64> undef to <4 x i32>
				%v16 = trunc <4 x i64> undef to <4 x i16>
				%v17 = trunc <4 x i64> undef to <4 x i8>
				%v18 = trunc <8 x i16> undef to <8 x i8>
				%v19 = trunc <8 x i32> undef to <8 x i16>
				%v20 = trunc <8 x i32> undef to <8 x i8>
				%v21 = trunc <8 x i64> undef to <8 x i32>
				%v22 = trunc <8 x i64> undef to <8 x i16>
				%v23 = trunc <8 x i64> undef to <8 x i8>
				%v24 = trunc <16 x i16> undef to <16 x i8>
				%v25 = trunc <16 x i32> undef to <16 x i16>
				%v26 = trunc <16 x i32> undef to <16 x i8>
				%v27 = trunc <16 x i64> undef to <16 x i32>
				%v28 = trunc <16 x i64> undef to <16 x i16>
				%v29 = trunc <16 x i64> undef to <16 x i8>

				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v0 = trunc i16 undef to i8
				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v1 = trunc i32 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v2 = trunc i32 undef to i8
				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v3 = trunc i64 undef to i32
				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v4 = trunc i64 undef to i16
				; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %v5 = trunc i64 undef to i8
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v6 = trunc <2 x i16> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v7 = trunc <2 x i32> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v8 = trunc <2 x i32> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v9 = trunc <2 x i64> undef to <2 x i32>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v10 = trunc <2 x i64> undef to <2 x i16>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v11 = trunc <2 x i64> undef to <2 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v12 = trunc <4 x i16> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v13 = trunc <4 x i32> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v14 = trunc <4 x i32> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v15 = trunc <4 x i64> undef to <4 x i32>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v16 = trunc <4 x i64> undef to <4 x i16>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v17 = trunc <4 x i64> undef to <4 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v18 = trunc <8 x i16> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v19 = trunc <8 x i32> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v20 = trunc <8 x i32> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v21 = trunc <8 x i64> undef to <8 x i32>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v22 = trunc <8 x i64> undef to <8 x i16>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v23 = trunc <8 x i64> undef to <8 x i8>
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %v24 = trunc <16 x i16> undef to <16 x i8>
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %v25 = trunc <16 x i32> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %v26 = trunc <16 x i32> undef to <16 x i8>
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %v27 = trunc <16 x i64> undef to <16 x i32>
				; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %v28 = trunc <16 x i64> undef to <16 x i16>
				; CHECK: Cost Model: Found an estimated cost of 7 for instruction: %v29 = trunc <16 x i64> undef to <16 x i8>

				ret void
				}

test/Analysis/CostModel/SystemZ/load_store.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s

				define void @store() {
				store i8 undef, i8* undef
				store i16 undef, i16* undef
				store i32 undef, i32* undef
				store i64 undef, i64* undef
				store float undef, float* undef
				store double undef, double* undef
				store fp128 undef, fp128* undef
				store <2 x i8> undef, <2 x i8>* undef
				store <2 x i16> undef, <2 x i16>* undef
				store <2 x i32> undef, <2 x i32>* undef
				store <2 x i64> undef, <2 x i64>* undef
				store <2 x float> undef, <2 x float>* undef
				store <2 x double> undef, <2 x double>* undef
				store <4 x i8> undef, <4 x i8>* undef
				store <4 x i16> undef, <4 x i16>* undef
				store <4 x i32> undef, <4 x i32>* undef
				store <4 x i64> undef, <4 x i64>* undef
				store <4 x float> undef, <4 x float>* undef
				store <4 x double> undef, <4 x double>* undef
				store <8 x i8> undef, <8 x i8>* undef
				store <8 x i16> undef, <8 x i16>* undef
				store <8 x i32> undef, <8 x i32>* undef
				store <8 x i64> undef, <8 x i64>* undef
				store <8 x float> undef, <8 x float>* undef
				store <8 x double> undef, <8 x double>* undef
				store <16 x i8> undef, <16 x i8>* undef
				store <16 x i16> undef, <16 x i16>* undef
				store <16 x i32> undef, <16 x i32>* undef
				store <16 x i64> undef, <16 x i64>* undef
				store <16 x float> undef, <16 x float>* undef
				store <16 x double> undef, <16 x double>* undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store i8 undef, i8* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store i16 undef, i16* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store i32 undef, i32* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store i64 undef, i64* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store float undef, float* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store double undef, double* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store fp128 undef, fp128* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x i8> undef, <2 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x i16> undef, <2 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x i32> undef, <2 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x i64> undef, <2 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x float> undef, <2 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <2 x double> undef, <2 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <4 x i8> undef, <4 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <4 x i16> undef, <4 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <4 x i32> undef, <4 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store <4 x i64> undef, <4 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <4 x float> undef, <4 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store <4 x double> undef, <4 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <8 x i8> undef, <8 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <8 x i16> undef, <8 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store <8 x i32> undef, <8 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: store <8 x i64> undef, <8 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store <8 x float> undef, <8 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: store <8 x double> undef, <8 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: store <16 x i8> undef, <16 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: store <16 x i16> undef, <16 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: store <16 x i32> undef, <16 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: store <16 x i64> undef, <16 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: store <16 x float> undef, <16 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: store <16 x double> undef, <16 x double>* undef

				ret void;
				}

				define void @load() {
				load i8, i8* undef
				load i16, i16* undef
				load i32, i32* undef
				load i64, i64* undef
				load float, float* undef
				load double, double* undef
				load fp128, fp128* undef
				load <2 x i8>, <2 x i8>* undef
				load <2 x i16>, <2 x i16>* undef
				load <2 x i32>, <2 x i32>* undef
				load <2 x i64>, <2 x i64>* undef
				load <2 x float>, <2 x float>* undef
				load <2 x double>, <2 x double>* undef
				load <4 x i8>, <4 x i8>* undef
				load <4 x i16>, <4 x i16>* undef
				load <4 x i32>, <4 x i32>* undef
				load <4 x i64>, <4 x i64>* undef
				load <4 x float>, <4 x float>* undef
				load <4 x double>, <4 x double>* undef
				load <8 x i8>, <8 x i8>* undef
				load <8 x i16>, <8 x i16>* undef
				load <8 x i32>, <8 x i32>* undef
				load <8 x i64>, <8 x i64>* undef
				load <8 x float>, <8 x float>* undef
				load <8 x double>, <8 x double>* undef
				load <16 x i8>, <16 x i8>* undef
				load <16 x i16>, <16 x i16>* undef
				load <16 x i32>, <16 x i32>* undef
				load <16 x i64>, <16 x i64>* undef
				load <16 x float>, <16 x float>* undef
				load <16 x double>, <16 x double>* undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %1 = load i8, i8* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %2 = load i16, i16* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %3 = load i32, i32* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %4 = load i64, i64* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %5 = load float, float* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %6 = load double, double* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %7 = load fp128, fp128* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %8 = load <2 x i8>, <2 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %9 = load <2 x i16>, <2 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %10 = load <2 x i32>, <2 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %11 = load <2 x i64>, <2 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %12 = load <2 x float>, <2 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %13 = load <2 x double>, <2 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %14 = load <4 x i8>, <4 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %15 = load <4 x i16>, <4 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %16 = load <4 x i32>, <4 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %17 = load <4 x i64>, <4 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %18 = load <4 x float>, <4 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %19 = load <4 x double>, <4 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %20 = load <8 x i8>, <8 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %21 = load <8 x i16>, <8 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %22 = load <8 x i32>, <8 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %23 = load <8 x i64>, <8 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %24 = load <8 x float>, <8 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %25 = load <8 x double>, <8 x double>* undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %26 = load <16 x i8>, <16 x i8>* undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %27 = load <16 x i16>, <16 x i16>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %28 = load <16 x i32>, <16 x i32>* undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %29 = load <16 x i64>, <16 x i64>* undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %30 = load <16 x float>, <16 x float>* undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %31 = load <16 x double>, <16 x double>* undef

				ret void;
				}

test/Analysis/CostModel/SystemZ/logical.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s

				define void @and() {
				%res0 = and i8 undef, undef
				%res1 = and i16 undef, undef
				%res2 = and i32 undef, undef
				%res3 = and i64 undef, undef
				%res4 = and <2 x i8> undef, undef
				%res5 = and <2 x i16> undef, undef
				%res6 = and <2 x i32> undef, undef
				%res7 = and <2 x i64> undef, undef
				%res8 = and <4 x i8> undef, undef
				%res9 = and <4 x i16> undef, undef
				%res10 = and <4 x i32> undef, undef
				%res11 = and <4 x i64> undef, undef
				%res12 = and <8 x i8> undef, undef
				%res13 = and <8 x i16> undef, undef
				%res14 = and <8 x i32> undef, undef
				%res15 = and <8 x i64> undef, undef
				%res16 = and <16 x i8> undef, undef
				%res17 = and <16 x i16> undef, undef
				%res18 = and <16 x i32> undef, undef
				%res19 = and <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = and i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = and i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = and i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = and i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = and <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = and <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = and <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = and <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = and <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = and <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = and <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = and <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = and <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = and <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = and <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = and <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = and <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = and <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = and <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = and <16 x i64> undef, undef

				ret void;
				}

				define void @ashr() {
				%res0 = ashr i8 undef, undef
				%res1 = ashr i16 undef, undef
				%res2 = ashr i32 undef, undef
				%res3 = ashr i64 undef, undef
				%res4 = ashr <2 x i8> undef, undef
				%res5 = ashr <2 x i16> undef, undef
				%res6 = ashr <2 x i32> undef, undef
				%res7 = ashr <2 x i64> undef, undef
				%res8 = ashr <4 x i8> undef, undef
				%res9 = ashr <4 x i16> undef, undef
				%res10 = ashr <4 x i32> undef, undef
				%res11 = ashr <4 x i64> undef, undef
				%res12 = ashr <8 x i8> undef, undef
				%res13 = ashr <8 x i16> undef, undef
				%res14 = ashr <8 x i32> undef, undef
				%res15 = ashr <8 x i64> undef, undef
				%res16 = ashr <16 x i8> undef, undef
				%res17 = ashr <16 x i16> undef, undef
				%res18 = ashr <16 x i32> undef, undef
				%res19 = ashr <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res0 = ashr i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res1 = ashr i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = ashr i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = ashr i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = ashr <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = ashr <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = ashr <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = ashr <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = ashr <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = ashr <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = ashr <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = ashr <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = ashr <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = ashr <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = ashr <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = ashr <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = ashr <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = ashr <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = ashr <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = ashr <16 x i64> undef, undef

				ret void;
				}

				define void @lshr() {
				%res0 = lshr i8 undef, undef
				%res1 = lshr i16 undef, undef
				%res2 = lshr i32 undef, undef
				%res3 = lshr i64 undef, undef
				%res4 = lshr <2 x i8> undef, undef
				%res5 = lshr <2 x i16> undef, undef
				%res6 = lshr <2 x i32> undef, undef
				%res7 = lshr <2 x i64> undef, undef
				%res8 = lshr <4 x i8> undef, undef
				%res9 = lshr <4 x i16> undef, undef
				%res10 = lshr <4 x i32> undef, undef
				%res11 = lshr <4 x i64> undef, undef
				%res12 = lshr <8 x i8> undef, undef
				%res13 = lshr <8 x i16> undef, undef
				%res14 = lshr <8 x i32> undef, undef
				%res15 = lshr <8 x i64> undef, undef
				%res16 = lshr <16 x i8> undef, undef
				%res17 = lshr <16 x i16> undef, undef
				%res18 = lshr <16 x i32> undef, undef
				%res19 = lshr <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res0 = lshr i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res1 = lshr i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = lshr i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = lshr i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = lshr <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = lshr <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = lshr <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = lshr <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = lshr <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = lshr <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = lshr <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = lshr <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = lshr <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = lshr <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = lshr <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = lshr <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = lshr <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = lshr <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = lshr <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = lshr <16 x i64> undef, undef

				ret void;
				}

				define void @or() {
				%res0 = or i8 undef, undef
				%res1 = or i16 undef, undef
				%res2 = or i32 undef, undef
				%res3 = or i64 undef, undef
				%res4 = or <2 x i8> undef, undef
				%res5 = or <2 x i16> undef, undef
				%res6 = or <2 x i32> undef, undef
				%res7 = or <2 x i64> undef, undef
				%res8 = or <4 x i8> undef, undef
				%res9 = or <4 x i16> undef, undef
				%res10 = or <4 x i32> undef, undef
				%res11 = or <4 x i64> undef, undef
				%res12 = or <8 x i8> undef, undef
				%res13 = or <8 x i16> undef, undef
				%res14 = or <8 x i32> undef, undef
				%res15 = or <8 x i64> undef, undef
				%res16 = or <16 x i8> undef, undef
				%res17 = or <16 x i16> undef, undef
				%res18 = or <16 x i32> undef, undef
				%res19 = or <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = or i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = or i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = or i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = or i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = or <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = or <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = or <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = or <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = or <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = or <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = or <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = or <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = or <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = or <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = or <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = or <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = or <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = or <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = or <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = or <16 x i64> undef, undef

				ret void;
				}

				define void @shl() {
				%res0 = shl i8 undef, undef
				%res1 = shl i16 undef, undef
				%res2 = shl i32 undef, undef
				%res3 = shl i64 undef, undef
				%res4 = shl <2 x i8> undef, undef
				%res5 = shl <2 x i16> undef, undef
				%res6 = shl <2 x i32> undef, undef
				%res7 = shl <2 x i64> undef, undef
				%res8 = shl <4 x i8> undef, undef
				%res9 = shl <4 x i16> undef, undef
				%res10 = shl <4 x i32> undef, undef
				%res11 = shl <4 x i64> undef, undef
				%res12 = shl <8 x i8> undef, undef
				%res13 = shl <8 x i16> undef, undef
				%res14 = shl <8 x i32> undef, undef
				%res15 = shl <8 x i64> undef, undef
				%res16 = shl <16 x i8> undef, undef
				%res17 = shl <16 x i16> undef, undef
				%res18 = shl <16 x i32> undef, undef
				%res19 = shl <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = shl i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = shl i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = shl i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = shl i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = shl <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = shl <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = shl <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = shl <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = shl <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = shl <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = shl <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = shl <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = shl <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = shl <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = shl <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = shl <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = shl <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = shl <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = shl <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = shl <16 x i64> undef, undef

				ret void;
				}

				define void @xor() {
				%res0 = xor i8 undef, undef
				%res1 = xor i16 undef, undef
				%res2 = xor i32 undef, undef
				%res3 = xor i64 undef, undef
				%res4 = xor <2 x i8> undef, undef
				%res5 = xor <2 x i16> undef, undef
				%res6 = xor <2 x i32> undef, undef
				%res7 = xor <2 x i64> undef, undef
				%res8 = xor <4 x i8> undef, undef
				%res9 = xor <4 x i16> undef, undef
				%res10 = xor <4 x i32> undef, undef
				%res11 = xor <4 x i64> undef, undef
				%res12 = xor <8 x i8> undef, undef
				%res13 = xor <8 x i16> undef, undef
				%res14 = xor <8 x i32> undef, undef
				%res15 = xor <8 x i64> undef, undef
				%res16 = xor <16 x i8> undef, undef
				%res17 = xor <16 x i16> undef, undef
				%res18 = xor <16 x i32> undef, undef
				%res19 = xor <16 x i64> undef, undef

				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res0 = xor i8 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res1 = xor i16 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res2 = xor i32 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = xor i64 undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res4 = xor <2 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res5 = xor <2 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res6 = xor <2 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res7 = xor <2 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res8 = xor <4 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res9 = xor <4 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res10 = xor <4 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res11 = xor <4 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res12 = xor <8 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res13 = xor <8 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res14 = xor <8 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res15 = xor <8 x i64> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = xor <16 x i8> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = xor <16 x i16> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = xor <16 x i32> undef, undef
				; CHECK: Cost Model: Found an estimated cost of 8 for instruction: %res19 = xor <16 x i64> undef, undef

				ret void;
				}

This is an archive of the discontinued LLVM Phabricator instance.

SystemZTargetTransformInfo cost functions and some common code changesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 89362

include/llvm/Analysis/TargetTransformInfo.h

include/llvm/Analysis/TargetTransformInfoImpl.h

include/llvm/CodeGen/BasicTTIImpl.h

lib/Analysis/CostModel.cpp

lib/Analysis/TargetTransformInfo.cpp

lib/Target/AArch64/AArch64TargetTransformInfo.h

lib/Target/AArch64/AArch64TargetTransformInfo.cpp

lib/Target/ARM/ARMTargetTransformInfo.h

lib/Target/ARM/ARMTargetTransformInfo.cpp

lib/Target/PowerPC/PPCTargetTransformInfo.h

lib/Target/PowerPC/PPCTargetTransformInfo.cpp

lib/Target/SystemZ/SystemZISelLowering.cpp

lib/Target/SystemZ/SystemZTargetTransformInfo.h

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp

lib/Target/X86/X86TargetTransformInfo.h

lib/Target/X86/X86TargetTransformInfo.cpp

lib/Transforms/Vectorize/LoopVectorize.cpp

test/Analysis/CostModel/SystemZ/cmpsel.ll

test/Analysis/CostModel/SystemZ/fp-arith.ll

test/Analysis/CostModel/SystemZ/fp-cast.ll

test/Analysis/CostModel/SystemZ/int-arith.ll

test/Analysis/CostModel/SystemZ/int-cast.ll

test/Analysis/CostModel/SystemZ/load_store.ll

test/Analysis/CostModel/SystemZ/logical.ll

SystemZTargetTransformInfo cost functions and some common code changes
ClosedPublic