This is an archive of the discontinued LLVM Phabricator instance.

Add v16f64 value type
Closed (Public)

Authored by rampitec on May 14 2020, 11:14 AM.

Details

Reviewers

arsenm
craig.topper
dmgreen
samparker
efriedma

Commits

rG184b38345746: Add v16f64 value type

Summary

We need it to handle <16 x double> indirect indexes
in the AMDGPU backend.

The only visible change from adding it is in the ARM cost model,
and it looks reasonable to me. Previously, each doubling of the
vector size roughly quadrupled the cost up to size 8, and the step
to size 16 only doubled it. Now that step also quadruples, which
seems a logical progression.

The actual AMDGPU code is to follow. This patch is the common part,
plus load/store legalization in the AMDGPU backend so that what
works now does not break.
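
The progression described in the summary can be checked with a little arithmetic. The sketch below is not part of the patch: it takes the MVE costs recorded in llvm/test/Analysis/CostModel/ARM/cast.ll for the fptrunc and fpext cases (2, 4, 8 and 16 elements) and computes the growth factor at each doubling of the vector width. All identifiers (`growthFactors`, `FptruncBefore`, and so on) are illustrative names, not LLVM API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Costs reported by the ARM MVE cost model for 2-, 4-, 8- and 16-element
// vectors, copied from the CHECK-MVE lines in cast.ll. "Before" is the left
// column of the diff, "After" the right column. Names are illustrative.
const std::vector<unsigned> FptruncBefore = {2, 10, 40, 80};
const std::vector<unsigned> FptruncAfter = {2, 10, 40, 160};
const std::vector<unsigned> FpextBefore = {20, 82, 328, 656};
const std::vector<unsigned> FpextAfter = {20, 82, 328, 1312};

// Returns Costs[I + 1] / Costs[I] for each adjacent pair of costs.
std::vector<double> growthFactors(const std::vector<unsigned> &Costs) {
  assert(Costs.size() >= 2 && "need at least two costs to form a ratio");
  std::vector<double> Factors;
  for (std::size_t I = 0; I + 1 < Costs.size(); ++I)
    Factors.push_back(static_cast<double>(Costs[I + 1]) / Costs[I]);
  return Factors;
}
```

With these inputs the last factor (8 to 16 elements) is 2 before the patch and 4 after it, for both fptrunc and fpext. The earlier steps are close to 4 but not exactly: the first fptrunc step, from 2 to 10, is a factor of 5.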

Diff Detail

Event Timeline

rampitec created this revision. May 14 2020, 11:14 AM

Herald added a project: Restricted Project. May 14 2020, 11:14 AM

Herald added subscribers: kerbowa, jdoerfert, aheejin and 7 others.

dmgreen added a subscriber: dmgreen. May 14 2020, 11:49 AM

dmgreen added inline comments.

llvm/test/Analysis/CostModel/ARM/cast.ll
Line 375: Yeah. Sounds fine.

The change to the ARM cost models indicates a bug in the ARM code; someone should probably look at that.

Otherwise looks fine.

(Fixing the ARM cost model doesn't need to block this.)

This revision is now accepted and ready to land. May 14 2020, 11:57 AM

rampitec added a child revision: D79960: [AMDGPU] Make v16f64/v16i64 legal. May 14 2020, 1:03 PM

Closed by commit rG184b38345746: Add v16f64 value type (authored by rampitec). May 14 2020, 2:43 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/include/llvm/CodeGen/ValueTypes.td	109 lines
llvm/include/llvm/IR/Intrinsics.td	1 line
llvm/include/llvm/Support/MachineValueType.h	117 lines
llvm/lib/CodeGen/ValueTypes.cpp	1 line
llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp	16 lines
llvm/test/Analysis/CostModel/ARM/cast.ll	44 lines
llvm/utils/TableGen/CodeGenTarget.cpp	1 line

Diff 264039

llvm/include/llvm/CodeGen/ValueTypes.td

	def v256f32 : ValueType<8182, 80>; // 256 x f32 vector value			def v256f32 : ValueType<8182, 80>; // 256 x f32 vector value
	def v512f32 : ValueType<16384, 81>; // 512 x f32 vector value			def v512f32 : ValueType<16384, 81>; // 512 x f32 vector value
	def v1024f32 : ValueType<32768, 82>; // 1024 x f32 vector value			def v1024f32 : ValueType<32768, 82>; // 1024 x f32 vector value
	def v2048f32 : ValueType<65536, 83>; // 2048 x f32 vector value			def v2048f32 : ValueType<65536, 83>; // 2048 x f32 vector value
	def v1f64 : ValueType<64, 84>; // 1 x f64 vector value			def v1f64 : ValueType<64, 84>; // 1 x f64 vector value
	def v2f64 : ValueType<128, 85>; // 2 x f64 vector value			def v2f64 : ValueType<128, 85>; // 2 x f64 vector value
	def v4f64 : ValueType<256, 86>; // 4 x f64 vector value			def v4f64 : ValueType<256, 86>; // 4 x f64 vector value
	def v8f64 : ValueType<512, 87>; // 8 x f64 vector value			def v8f64 : ValueType<512, 87>; // 8 x f64 vector value
				def v16f64 : ValueType<1024, 88>; // 16 x f64 vector value

	def nxv1i1 : ValueType<1, 88>; // n x 1 x i1 vector value			def nxv1i1 : ValueType<1, 89>; // n x 1 x i1 vector value
	def nxv2i1 : ValueType<2, 89>; // n x 2 x i1 vector value			def nxv2i1 : ValueType<2, 90>; // n x 2 x i1 vector value
	def nxv4i1 : ValueType<4, 90>; // n x 4 x i1 vector value			def nxv4i1 : ValueType<4, 91>; // n x 4 x i1 vector value
	def nxv8i1 : ValueType<8, 91>; // n x 8 x i1 vector value			def nxv8i1 : ValueType<8, 92>; // n x 8 x i1 vector value
	def nxv16i1 : ValueType<16, 92>; // n x 16 x i1 vector value			def nxv16i1 : ValueType<16, 93>; // n x 16 x i1 vector value
	def nxv32i1 : ValueType<32, 93>; // n x 32 x i1 vector value			def nxv32i1 : ValueType<32, 94>; // n x 32 x i1 vector value

	def nxv1i8 : ValueType<8, 94>; // n x 1 x i8 vector value			def nxv1i8 : ValueType<8, 95>; // n x 1 x i8 vector value
	def nxv2i8 : ValueType<16, 95>; // n x 2 x i8 vector value			def nxv2i8 : ValueType<16, 96>; // n x 2 x i8 vector value
	def nxv4i8 : ValueType<32, 96>; // n x 4 x i8 vector value			def nxv4i8 : ValueType<32, 97>; // n x 4 x i8 vector value
	def nxv8i8 : ValueType<64, 97>; // n x 8 x i8 vector value			def nxv8i8 : ValueType<64, 98>; // n x 8 x i8 vector value
	def nxv16i8 : ValueType<128, 98>; // n x 16 x i8 vector value			def nxv16i8 : ValueType<128, 99>; // n x 16 x i8 vector value
	def nxv32i8 : ValueType<256, 99>; // n x 32 x i8 vector value			def nxv32i8 : ValueType<256, 100>; // n x 32 x i8 vector value

	def nxv1i16 : ValueType<16, 100>; // n x 1 x i16 vector value			def nxv1i16 : ValueType<16, 101>; // n x 1 x i16 vector value
	def nxv2i16 : ValueType<32, 101>; // n x 2 x i16 vector value			def nxv2i16 : ValueType<32, 102>; // n x 2 x i16 vector value
	def nxv4i16 : ValueType<64, 102>; // n x 4 x i16 vector value			def nxv4i16 : ValueType<64, 103>; // n x 4 x i16 vector value
	def nxv8i16 : ValueType<128, 103>; // n x 8 x i16 vector value			def nxv8i16 : ValueType<128, 104>; // n x 8 x i16 vector value
	def nxv16i16: ValueType<256, 104>; // n x 16 x i16 vector value			def nxv16i16: ValueType<256, 105>; // n x 16 x i16 vector value
	def nxv32i16: ValueType<512, 105>; // n x 32 x i16 vector value			def nxv32i16: ValueType<512, 106>; // n x 32 x i16 vector value

	def nxv1i32 : ValueType<32, 106>; // n x 1 x i32 vector value			def nxv1i32 : ValueType<32, 107>; // n x 1 x i32 vector value
	def nxv2i32 : ValueType<64, 107>; // n x 2 x i32 vector value			def nxv2i32 : ValueType<64, 108>; // n x 2 x i32 vector value
	def nxv4i32 : ValueType<128, 108>; // n x 4 x i32 vector value			def nxv4i32 : ValueType<128, 109>; // n x 4 x i32 vector value
	def nxv8i32 : ValueType<256, 109>; // n x 8 x i32 vector value			def nxv8i32 : ValueType<256, 110>; // n x 8 x i32 vector value
	def nxv16i32: ValueType<512, 110>; // n x 16 x i32 vector value			def nxv16i32: ValueType<512, 111>; // n x 16 x i32 vector value
	def nxv32i32: ValueType<1024,111>; // n x 32 x i32 vector value			def nxv32i32: ValueType<1024,112>; // n x 32 x i32 vector value

	def nxv1i64 : ValueType<64, 112>; // n x 1 x i64 vector value			def nxv1i64 : ValueType<64, 113>; // n x 1 x i64 vector value
	def nxv2i64 : ValueType<128, 113>; // n x 2 x i64 vector value			def nxv2i64 : ValueType<128, 114>; // n x 2 x i64 vector value
	def nxv4i64 : ValueType<256, 114>; // n x 4 x i64 vector value			def nxv4i64 : ValueType<256, 115>; // n x 4 x i64 vector value
	def nxv8i64 : ValueType<512, 115>; // n x 8 x i64 vector value			def nxv8i64 : ValueType<512, 116>; // n x 8 x i64 vector value
	def nxv16i64: ValueType<1024,116>; // n x 16 x i64 vector value			def nxv16i64: ValueType<1024,117>; // n x 16 x i64 vector value
	def nxv32i64: ValueType<2048,117>; // n x 32 x i64 vector value			def nxv32i64: ValueType<2048,118>; // n x 32 x i64 vector value

	def nxv2f16 : ValueType<32 , 118>; // n x 2 x f16 vector value			def nxv2f16 : ValueType<32 , 119>; // n x 2 x f16 vector value
	def nxv4f16 : ValueType<64 , 119>; // n x 4 x f16 vector value			def nxv4f16 : ValueType<64 , 120>; // n x 4 x f16 vector value
	def nxv8f16 : ValueType<128, 120>; // n x 8 x f16 vector value			def nxv8f16 : ValueType<128, 121>; // n x 8 x f16 vector value
	def nxv1f32 : ValueType<32 , 121>; // n x 1 x f32 vector value			def nxv1f32 : ValueType<32 , 122>; // n x 1 x f32 vector value
	def nxv2f32 : ValueType<64 , 122>; // n x 2 x f32 vector value			def nxv2f32 : ValueType<64 , 123>; // n x 2 x f32 vector value
	def nxv4f32 : ValueType<128, 123>; // n x 4 x f32 vector value			def nxv4f32 : ValueType<128, 124>; // n x 4 x f32 vector value
	def nxv8f32 : ValueType<256, 124>; // n x 8 x f32 vector value			def nxv8f32 : ValueType<256, 125>; // n x 8 x f32 vector value
	def nxv16f32 : ValueType<512, 125>; // n x 16 x f32 vector value			def nxv16f32 : ValueType<512, 126>; // n x 16 x f32 vector value
	def nxv1f64 : ValueType<64, 126>; // n x 1 x f64 vector value			def nxv1f64 : ValueType<64, 127>; // n x 1 x f64 vector value
	def nxv2f64 : ValueType<128, 127>; // n x 2 x f64 vector value			def nxv2f64 : ValueType<128, 128>; // n x 2 x f64 vector value
	def nxv4f64 : ValueType<256, 128>; // n x 4 x f64 vector value			def nxv4f64 : ValueType<256, 129>; // n x 4 x f64 vector value
	def nxv8f64 : ValueType<512, 129>; // n x 8 x f64 vector value			def nxv8f64 : ValueType<512, 130>; // n x 8 x f64 vector value

	def x86mmx : ValueType<64 , 130>; // X86 MMX value			def x86mmx : ValueType<64 , 131>; // X86 MMX value
	def FlagVT : ValueType<0 , 131>; // Pre-RA sched glue			def FlagVT : ValueType<0 , 132>; // Pre-RA sched glue
	def isVoid : ValueType<0 , 132>; // Produces no value			def isVoid : ValueType<0 , 133>; // Produces no value
	def untyped: ValueType<8 , 133>; // Produces an untyped value			def untyped: ValueType<8 , 134>; // Produces an untyped value
	def exnref: ValueType<0, 134>; // WebAssembly's exnref type			def exnref : ValueType<0 , 135>; // WebAssembly's exnref type
	def token : ValueType<0 , 248>; // TokenTy			def token : ValueType<0 , 248>; // TokenTy
	def MetadataVT: ValueType<0, 249>; // Metadata			def MetadataVT: ValueType<0, 249>; // Metadata

	// Pseudo valuetype mapped to the current pointer size to any address space.			// Pseudo valuetype mapped to the current pointer size to any address space.
	// Should only be used in TableGen.			// Should only be used in TableGen.
	def iPTRAny : ValueType<0, 250>;			def iPTRAny : ValueType<0, 250>;

	// Pseudo valuetype to represent "vector of any size"			// Pseudo valuetype to represent "vector of any size"

llvm/include/llvm/IR/Intrinsics.td

	def llvm_v4f32_ty : LLVMType<v4f32>; // 4 x float			def llvm_v4f32_ty : LLVMType<v4f32>; // 4 x float
	def llvm_v8f32_ty : LLVMType<v8f32>; // 8 x float			def llvm_v8f32_ty : LLVMType<v8f32>; // 8 x float
	def llvm_v16f32_ty : LLVMType<v16f32>; // 16 x float			def llvm_v16f32_ty : LLVMType<v16f32>; // 16 x float
	def llvm_v32f32_ty : LLVMType<v32f32>; // 32 x float			def llvm_v32f32_ty : LLVMType<v32f32>; // 32 x float
	def llvm_v1f64_ty : LLVMType<v1f64>; // 1 x double			def llvm_v1f64_ty : LLVMType<v1f64>; // 1 x double
	def llvm_v2f64_ty : LLVMType<v2f64>; // 2 x double			def llvm_v2f64_ty : LLVMType<v2f64>; // 2 x double
	def llvm_v4f64_ty : LLVMType<v4f64>; // 4 x double			def llvm_v4f64_ty : LLVMType<v4f64>; // 4 x double
	def llvm_v8f64_ty : LLVMType<v8f64>; // 8 x double			def llvm_v8f64_ty : LLVMType<v8f64>; // 8 x double
				def llvm_v16f64_ty : LLVMType<v16f64>; // 16 x double

	def llvm_vararg_ty : LLVMType<isVoid>; // this means vararg here			def llvm_vararg_ty : LLVMType<isVoid>; // this means vararg here

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Intrinsic Definitions.			// Intrinsic Definitions.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// Intrinsic class - This is used to define one LLVM intrinsic. The name of the			// Intrinsic class - This is used to define one LLVM intrinsic. The name of the

llvm/include/llvm/Support/MachineValueType.h

[...]	enum SimpleValueType : uint8_t {
v256f32 = 80, // 256 x f32		v256f32 = 80, // 256 x f32
v512f32 = 81, // 512 x f32		v512f32 = 81, // 512 x f32
v1024f32 = 82, // 1024 x f32		v1024f32 = 82, // 1024 x f32
v2048f32 = 83, // 2048 x f32		v2048f32 = 83, // 2048 x f32
v1f64 = 84, // 1 x f64		v1f64 = 84, // 1 x f64
v2f64 = 85, // 2 x f64		v2f64 = 85, // 2 x f64
v4f64 = 86, // 4 x f64		v4f64 = 86, // 4 x f64
v8f64 = 87, // 8 x f64		v8f64 = 87, // 8 x f64
		v16f64 = 88, // 16 x f64

FIRST_FP_FIXEDLEN_VECTOR_VALUETYPE = v2f16,		FIRST_FP_FIXEDLEN_VECTOR_VALUETYPE = v2f16,
LAST_FP_FIXEDLEN_VECTOR_VALUETYPE = v8f64,		LAST_FP_FIXEDLEN_VECTOR_VALUETYPE = v16f64,

FIRST_FIXEDLEN_VECTOR_VALUETYPE = v1i1,		FIRST_FIXEDLEN_VECTOR_VALUETYPE = v1i1,
LAST_FIXEDLEN_VECTOR_VALUETYPE = v8f64,		LAST_FIXEDLEN_VECTOR_VALUETYPE = v16f64,

nxv1i1 = 88, // n x 1 x i1		nxv1i1 = 89, // n x 1 x i1
nxv2i1 = 89, // n x 2 x i1		nxv2i1 = 90, // n x 2 x i1
nxv4i1 = 90, // n x 4 x i1		nxv4i1 = 91, // n x 4 x i1
nxv8i1 = 91, // n x 8 x i1		nxv8i1 = 92, // n x 8 x i1
nxv16i1 = 92, // n x 16 x i1		nxv16i1 = 93, // n x 16 x i1
nxv32i1 = 93, // n x 32 x i1		nxv32i1 = 94, // n x 32 x i1

nxv1i8 = 94, // n x 1 x i8		nxv1i8 = 95, // n x 1 x i8
nxv2i8 = 95, // n x 2 x i8		nxv2i8 = 96, // n x 2 x i8
nxv4i8 = 96, // n x 4 x i8		nxv4i8 = 97, // n x 4 x i8
nxv8i8 = 97, // n x 8 x i8		nxv8i8 = 98, // n x 8 x i8
nxv16i8 = 98, // n x 16 x i8		nxv16i8 = 99, // n x 16 x i8
nxv32i8 = 99, // n x 32 x i8		nxv32i8 = 100, // n x 32 x i8

nxv1i16 = 100, // n x 1 x i16		nxv1i16 = 101, // n x 1 x i16
nxv2i16 = 101, // n x 2 x i16		nxv2i16 = 102, // n x 2 x i16
nxv4i16 = 102, // n x 4 x i16		nxv4i16 = 103, // n x 4 x i16
nxv8i16 = 103, // n x 8 x i16		nxv8i16 = 104, // n x 8 x i16
nxv16i16 = 104, // n x 16 x i16		nxv16i16 = 105, // n x 16 x i16
nxv32i16 = 105, // n x 32 x i16		nxv32i16 = 106, // n x 32 x i16

nxv1i32 = 106, // n x 1 x i32		nxv1i32 = 107, // n x 1 x i32
nxv2i32 = 107, // n x 2 x i32		nxv2i32 = 108, // n x 2 x i32
nxv4i32 = 108, // n x 4 x i32		nxv4i32 = 109, // n x 4 x i32
nxv8i32 = 109, // n x 8 x i32		nxv8i32 = 110, // n x 8 x i32
nxv16i32 = 110, // n x 16 x i32		nxv16i32 = 111, // n x 16 x i32
nxv32i32 = 111, // n x 32 x i32		nxv32i32 = 112, // n x 32 x i32

nxv1i64 = 112, // n x 1 x i64		nxv1i64 = 113, // n x 1 x i64
nxv2i64 = 113, // n x 2 x i64		nxv2i64 = 114, // n x 2 x i64
nxv4i64 = 114, // n x 4 x i64		nxv4i64 = 115, // n x 4 x i64
nxv8i64 = 115, // n x 8 x i64		nxv8i64 = 116, // n x 8 x i64
nxv16i64 = 116, // n x 16 x i64		nxv16i64 = 117, // n x 16 x i64
nxv32i64 = 117, // n x 32 x i64		nxv32i64 = 118, // n x 32 x i64

FIRST_INTEGER_SCALABLE_VECTOR_VALUETYPE = nxv1i1,		FIRST_INTEGER_SCALABLE_VECTOR_VALUETYPE = nxv1i1,
LAST_INTEGER_SCALABLE_VECTOR_VALUETYPE = nxv32i64,		LAST_INTEGER_SCALABLE_VECTOR_VALUETYPE = nxv32i64,

nxv2f16 = 118, // n x 2 x f16		nxv2f16 = 119, // n x 2 x f16
nxv4f16 = 119, // n x 4 x f16		nxv4f16 = 120, // n x 4 x f16
nxv8f16 = 120, // n x 8 x f16		nxv8f16 = 121, // n x 8 x f16
nxv1f32 = 121, // n x 1 x f32		nxv1f32 = 122, // n x 1 x f32
nxv2f32 = 122, // n x 2 x f32		nxv2f32 = 123, // n x 2 x f32
nxv4f32 = 123, // n x 4 x f32		nxv4f32 = 124, // n x 4 x f32
nxv8f32 = 124, // n x 8 x f32		nxv8f32 = 125, // n x 8 x f32
nxv16f32 = 125, // n x 16 x f32		nxv16f32 = 126, // n x 16 x f32
nxv1f64 = 126, // n x 1 x f64		nxv1f64 = 127, // n x 1 x f64
nxv2f64 = 127, // n x 2 x f64		nxv2f64 = 128, // n x 2 x f64
nxv4f64 = 128, // n x 4 x f64		nxv4f64 = 129, // n x 4 x f64
nxv8f64 = 129, // n x 8 x f64		nxv8f64 = 130, // n x 8 x f64

FIRST_FP_SCALABLE_VECTOR_VALUETYPE = nxv2f16,		FIRST_FP_SCALABLE_VECTOR_VALUETYPE = nxv2f16,
LAST_FP_SCALABLE_VECTOR_VALUETYPE = nxv8f64,		LAST_FP_SCALABLE_VECTOR_VALUETYPE = nxv8f64,

FIRST_SCALABLE_VECTOR_VALUETYPE = nxv1i1,		FIRST_SCALABLE_VECTOR_VALUETYPE = nxv1i1,
LAST_SCALABLE_VECTOR_VALUETYPE = nxv8f64,		LAST_SCALABLE_VECTOR_VALUETYPE = nxv8f64,

FIRST_VECTOR_VALUETYPE = v1i1,		FIRST_VECTOR_VALUETYPE = v1i1,
LAST_VECTOR_VALUETYPE = nxv8f64,		LAST_VECTOR_VALUETYPE = nxv8f64,

x86mmx = 130, // This is an X86 MMX value		x86mmx = 131, // This is an X86 MMX value

Glue = 131, // This glues nodes together during pre-RA sched		Glue = 132, // This glues nodes together during pre-RA sched

isVoid = 132, // This has no value		isVoid = 133, // This has no value

Untyped = 133, // This value takes a register, but has		Untyped = 134, // This value takes a register, but has
// unspecified type. The register class		// unspecified type. The register class
// will be determined by the opcode.		// will be determined by the opcode.

exnref = 134, // WebAssembly's exnref type		exnref = 135, // WebAssembly's exnref type

FIRST_VALUETYPE = 1, // This is always the beginning of the list.		FIRST_VALUETYPE = 1, // This is always the beginning of the list.
LAST_VALUETYPE = 135, // This always remains at the end of the list.		LAST_VALUETYPE = 136, // This always remains at the end of the list.

// This is the current maximum for LAST_VALUETYPE.		// This is the current maximum for LAST_VALUETYPE.
// MVT::MAX_ALLOWED_VALUETYPE is used for asserts and to size bit vectors		// MVT::MAX_ALLOWED_VALUETYPE is used for asserts and to size bit vectors
// This value must be a multiple of 32.		// This value must be a multiple of 32.
MAX_ALLOWED_VALUETYPE = 160,		MAX_ALLOWED_VALUETYPE = 160,

// A value of type llvm::TokenTy		// A value of type llvm::TokenTy
token = 248,		token = 248,
[...]	bool is512BitVector() const {
SimpleTy == MVT::v64i8 || SimpleTy == MVT::v32i16 ||		SimpleTy == MVT::v64i8 || SimpleTy == MVT::v32i16 ||
SimpleTy == MVT::v16i32 || SimpleTy == MVT::v8i64);		SimpleTy == MVT::v16i32 || SimpleTy == MVT::v8i64);
}		}

/// Return true if this is a 1024-bit vector type.		/// Return true if this is a 1024-bit vector type.
bool is1024BitVector() const {		bool is1024BitVector() const {
return (SimpleTy == MVT::v1024i1 || SimpleTy == MVT::v128i8 ||		return (SimpleTy == MVT::v1024i1 || SimpleTy == MVT::v128i8 ||
SimpleTy == MVT::v64i16 || SimpleTy == MVT::v32i32 ||		SimpleTy == MVT::v64i16 || SimpleTy == MVT::v32i32 ||
SimpleTy == MVT::v16i64);		SimpleTy == MVT::v16i64 || SimpleTy == MVT::v16f64);
}		}

/// Return true if this is a 2048-bit vector type.		/// Return true if this is a 2048-bit vector type.
bool is2048BitVector() const {		bool is2048BitVector() const {
return (SimpleTy == MVT::v256i8 || SimpleTy == MVT::v128i16 ||		return (SimpleTy == MVT::v256i8 || SimpleTy == MVT::v128i16 ||
SimpleTy == MVT::v64i32 || SimpleTy == MVT::v32i64);		SimpleTy == MVT::v64i32 || SimpleTy == MVT::v32i64);
}		}

[...]	MVT getVectorElementType() const {
case nxv2f32:		case nxv2f32:
case nxv4f32:		case nxv4f32:
case nxv8f32:		case nxv8f32:
case nxv16f32: return f32;		case nxv16f32: return f32;
case v1f64:		case v1f64:
case v2f64:		case v2f64:
case v4f64:		case v4f64:
case v8f64:		case v8f64:
		case v16f64:
case nxv1f64:		case nxv1f64:
case nxv2f64:		case nxv2f64:
case nxv4f64:		case nxv4f64:
case nxv8f64: return f64;		case nxv8f64: return f64;
}		}
}		}

unsigned getVectorNumElements() const {		unsigned getVectorNumElements() const {
[...]
case nxv32i64: return 32;		case nxv32i64: return 32;
case v16i1:		case v16i1:
case v16i8:		case v16i8:
case v16i16:		case v16i16:
case v16i32:		case v16i32:
case v16i64:		case v16i64:
case v16f16:		case v16f16:
case v16f32:		case v16f32:
		case v16f64:
case nxv16i1:		case nxv16i1:
case nxv16i8:		case nxv16i8:
case nxv16i16:		case nxv16i16:
case nxv16i32:		case nxv16i32:
case nxv16i64:		case nxv16i64:
case nxv16f32: return 16;		case nxv16f32: return 16;
case v8i1:		case v8i1:
case v8i8:		case v8i8:
[...]	TypeSize getSizeInBits() const {
case nxv8i64:		case nxv8i64:
case nxv16f32:		case nxv16f32:
case nxv8f64: return TypeSize::Scalable(512);		case nxv8f64: return TypeSize::Scalable(512);
case v1024i1:		case v1024i1:
case v128i8:		case v128i8:
case v64i16:		case v64i16:
case v32i32:		case v32i32:
case v16i64:		case v16i64:
		case v16f64:
case v32f32: return TypeSize::Fixed(1024);		case v32f32: return TypeSize::Fixed(1024);
case nxv32i32:		case nxv32i32:
case nxv16i64: return TypeSize::Scalable(1024);		case nxv16i64: return TypeSize::Scalable(1024);
case v256i8:		case v256i8:
case v128i16:		case v128i16:
case v64i32:		case v64i32:
case v32i64:		case v32i64:
case v64f32: return TypeSize::Fixed(2048);		case v64f32: return TypeSize::Fixed(2048);
[...]	static MVT getVectorVT(MVT VT, unsigned NumElements) {
if (NumElements == 1024) return MVT::v1024f32;		if (NumElements == 1024) return MVT::v1024f32;
if (NumElements == 2048) return MVT::v2048f32;		if (NumElements == 2048) return MVT::v2048f32;
break;		break;
case MVT::f64:		case MVT::f64:
if (NumElements == 1) return MVT::v1f64;		if (NumElements == 1) return MVT::v1f64;
if (NumElements == 2) return MVT::v2f64;		if (NumElements == 2) return MVT::v2f64;
if (NumElements == 4) return MVT::v4f64;		if (NumElements == 4) return MVT::v4f64;
if (NumElements == 8) return MVT::v8f64;		if (NumElements == 8) return MVT::v8f64;
		if (NumElements == 16) return MVT::v16f64;
break;		break;
}		}
return (MVT::SimpleValueType)(MVT::INVALID_SIMPLE_VALUE_TYPE);		return (MVT::SimpleValueType)(MVT::INVALID_SIMPLE_VALUE_TYPE);
}		}

static MVT getScalableVectorVT(MVT VT, unsigned NumElements) {		static MVT getScalableVectorVT(MVT VT, unsigned NumElements) {
switch(VT.SimpleTy) {		switch(VT.SimpleTy) {
default:		default:
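
The enum changes above do two things: they insert `v16f64 = 88`, which shifts every later enumerator up by one, and they move the `LAST_FP_FIXEDLEN_VECTOR_VALUETYPE` and `LAST_FIXEDLEN_VECTOR_VALUETYPE` markers from `v8f64` to `v16f64`. The following is a minimal sketch of why the markers have to move, assuming the type-class predicates are range checks against a FIRST_/LAST_ marker pair. It is a standalone model rather than the real header: the `sketch` namespace and `inRange` helper are hypothetical, and `v1f64` stands in for the real lower marker (`v2f16`), whose numeric value is not shown in the diff.

```cpp
#include <cstdint>

namespace sketch {
// Enumerator values copied from the right-hand (post-patch) column above.
constexpr std::uint8_t V1F64 = 84;
constexpr std::uint8_t V8F64 = 87;
constexpr std::uint8_t V16F64 = 88;
constexpr std::uint8_t NXV1I1 = 89; // was 88 before the patch
constexpr std::uint8_t LastValueType = 136;
constexpr std::uint8_t MaxAllowedValueType = 160;

// Hypothetical helper: membership test against a FIRST_/LAST_ marker pair.
constexpr bool inRange(std::uint8_t Ty, std::uint8_t First,
                       std::uint8_t Last) {
  return Ty >= First && Ty <= Last;
}

// With the old upper marker (v8f64) the new type would fall outside the
// fixed-length FP range; moving the marker to v16f64 brings it in.
static_assert(!inRange(V16F64, V1F64, V8F64), "old marker excludes v16f64");
static_assert(inRange(V16F64, V1F64, V16F64), "new marker includes v16f64");
// The first scalable type must stay outside the fixed-length range.
static_assert(!inRange(NXV1I1, V1F64, V16F64), "nxv1i1 is not fixed-length");
// Constraints stated in the header's own comments.
static_assert(LastValueType <= MaxAllowedValueType, "enum still fits");
static_assert(MaxAllowedValueType % 32 == 0, "must be a multiple of 32");
} // namespace sketch
```

Because the checks are `static_assert`s, the sketch verifies itself at compile time. The same reasoning explains why the scalable-vector markers are untouched in the diff: they name enumerators rather than numbers, so they follow the renumbering automatically.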

llvm/lib/CodeGen/ValueTypes.cpp

[...]	Type *EVT::getTypeForEVT(LLVMContext &Context) const {
case MVT::v256f32: return VectorType::get(Type::getFloatTy(Context), 256);		case MVT::v256f32: return VectorType::get(Type::getFloatTy(Context), 256);
case MVT::v512f32: return VectorType::get(Type::getFloatTy(Context), 512);		case MVT::v512f32: return VectorType::get(Type::getFloatTy(Context), 512);
case MVT::v1024f32:return VectorType::get(Type::getFloatTy(Context), 1024);		case MVT::v1024f32:return VectorType::get(Type::getFloatTy(Context), 1024);
case MVT::v2048f32:return VectorType::get(Type::getFloatTy(Context), 2048);		case MVT::v2048f32:return VectorType::get(Type::getFloatTy(Context), 2048);
case MVT::v1f64: return VectorType::get(Type::getDoubleTy(Context), 1);		case MVT::v1f64: return VectorType::get(Type::getDoubleTy(Context), 1);
case MVT::v2f64: return VectorType::get(Type::getDoubleTy(Context), 2);		case MVT::v2f64: return VectorType::get(Type::getDoubleTy(Context), 2);
case MVT::v4f64: return VectorType::get(Type::getDoubleTy(Context), 4);		case MVT::v4f64: return VectorType::get(Type::getDoubleTy(Context), 4);
case MVT::v8f64: return VectorType::get(Type::getDoubleTy(Context), 8);		case MVT::v8f64: return VectorType::get(Type::getDoubleTy(Context), 8);
		case MVT::v16f64: return VectorType::get(Type::getDoubleTy(Context), 16);
case MVT::nxv1i1:		case MVT::nxv1i1:
return VectorType::get(Type::getInt1Ty(Context), 1, /*Scalable=*/ true);		return VectorType::get(Type::getInt1Ty(Context), 1, /*Scalable=*/ true);
case MVT::nxv2i1:		case MVT::nxv2i1:
return VectorType::get(Type::getInt1Ty(Context), 2, /*Scalable=*/ true);		return VectorType::get(Type::getInt1Ty(Context), 2, /*Scalable=*/ true);
case MVT::nxv4i1:		case MVT::nxv4i1:
return VectorType::get(Type::getInt1Ty(Context), 4, /*Scalable=*/ true);		return VectorType::get(Type::getInt1Ty(Context), 4, /*Scalable=*/ true);
case MVT::nxv8i1:		case MVT::nxv8i1:
return VectorType::get(Type::getInt1Ty(Context), 8, /*Scalable=*/ true);		return VectorType::get(Type::getInt1Ty(Context), 8, /*Scalable=*/ true);

llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp

[...]	AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
AddPromotedToType(ISD::LOAD, MVT::v4f64, MVT::v8i32);		AddPromotedToType(ISD::LOAD, MVT::v4f64, MVT::v8i32);

setOperationAction(ISD::LOAD, MVT::v8i64, Promote);		setOperationAction(ISD::LOAD, MVT::v8i64, Promote);
AddPromotedToType(ISD::LOAD, MVT::v8i64, MVT::v16i32);		AddPromotedToType(ISD::LOAD, MVT::v8i64, MVT::v16i32);

setOperationAction(ISD::LOAD, MVT::v8f64, Promote);		setOperationAction(ISD::LOAD, MVT::v8f64, Promote);
AddPromotedToType(ISD::LOAD, MVT::v8f64, MVT::v16i32);		AddPromotedToType(ISD::LOAD, MVT::v8f64, MVT::v16i32);

		setOperationAction(ISD::LOAD, MVT::v16i64, Promote);
		AddPromotedToType(ISD::LOAD, MVT::v16i64, MVT::v32i32);

		setOperationAction(ISD::LOAD, MVT::v16f64, Promote);
		AddPromotedToType(ISD::LOAD, MVT::v16f64, MVT::v32i32);

// There are no 64-bit extloads. These should be done as a 32-bit extload and		// There are no 64-bit extloads. These should be done as a 32-bit extload and
// an extension to 64-bit.		// an extension to 64-bit.
for (MVT VT : MVT::integer_valuetypes()) {		for (MVT VT : MVT::integer_valuetypes()) {
setLoadExtAction(ISD::EXTLOAD, MVT::i64, VT, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::i64, VT, Expand);
setLoadExtAction(ISD::SEXTLOAD, MVT::i64, VT, Expand);		setLoadExtAction(ISD::SEXTLOAD, MVT::i64, VT, Expand);
setLoadExtAction(ISD::ZEXTLOAD, MVT::i64, VT, Expand);		setLoadExtAction(ISD::ZEXTLOAD, MVT::i64, VT, Expand);
}		}

[...]	AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
setLoadExtAction(ISD::EXTLOAD, MVT::v8f32, MVT::v8f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v8f32, MVT::v8f16, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v16f32, MVT::v16f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v16f32, MVT::v16f16, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v32f32, MVT::v32f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v32f32, MVT::v32f16, Expand);

setLoadExtAction(ISD::EXTLOAD, MVT::f64, MVT::f32, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::f64, MVT::f32, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v2f64, MVT::v2f32, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v2f64, MVT::v2f32, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v4f64, MVT::v4f32, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v4f64, MVT::v4f32, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v8f64, MVT::v8f32, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v8f64, MVT::v8f32, Expand);
		setLoadExtAction(ISD::EXTLOAD, MVT::v16f64, MVT::v16f32, Expand);

setLoadExtAction(ISD::EXTLOAD, MVT::f64, MVT::f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::f64, MVT::f16, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v2f64, MVT::v2f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v2f64, MVT::v2f16, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v4f64, MVT::v4f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v4f64, MVT::v4f16, Expand);
setLoadExtAction(ISD::EXTLOAD, MVT::v8f64, MVT::v8f16, Expand);		setLoadExtAction(ISD::EXTLOAD, MVT::v8f64, MVT::v8f16, Expand);
		setLoadExtAction(ISD::EXTLOAD, MVT::v16f64, MVT::v16f16, Expand);

setOperationAction(ISD::STORE, MVT::f32, Promote);		setOperationAction(ISD::STORE, MVT::f32, Promote);
AddPromotedToType(ISD::STORE, MVT::f32, MVT::i32);		AddPromotedToType(ISD::STORE, MVT::f32, MVT::i32);

setOperationAction(ISD::STORE, MVT::v2f32, Promote);		setOperationAction(ISD::STORE, MVT::v2f32, Promote);
AddPromotedToType(ISD::STORE, MVT::v2f32, MVT::v2i32);		AddPromotedToType(ISD::STORE, MVT::v2f32, MVT::v2i32);

setOperationAction(ISD::STORE, MVT::v3f32, Promote);		setOperationAction(ISD::STORE, MVT::v3f32, Promote);
[...]	AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
AddPromotedToType(ISD::STORE, MVT::v4f64, MVT::v8i32);		AddPromotedToType(ISD::STORE, MVT::v4f64, MVT::v8i32);

setOperationAction(ISD::STORE, MVT::v8i64, Promote);		setOperationAction(ISD::STORE, MVT::v8i64, Promote);
AddPromotedToType(ISD::STORE, MVT::v8i64, MVT::v16i32);		AddPromotedToType(ISD::STORE, MVT::v8i64, MVT::v16i32);

setOperationAction(ISD::STORE, MVT::v8f64, Promote);		setOperationAction(ISD::STORE, MVT::v8f64, Promote);
AddPromotedToType(ISD::STORE, MVT::v8f64, MVT::v16i32);		AddPromotedToType(ISD::STORE, MVT::v8f64, MVT::v16i32);

		setOperationAction(ISD::STORE, MVT::v16i64, Promote);
		AddPromotedToType(ISD::STORE, MVT::v16i64, MVT::v32i32);

		setOperationAction(ISD::STORE, MVT::v16f64, Promote);
		AddPromotedToType(ISD::STORE, MVT::v16f64, MVT::v32i32);

setTruncStoreAction(MVT::i64, MVT::i1, Expand);		setTruncStoreAction(MVT::i64, MVT::i1, Expand);
setTruncStoreAction(MVT::i64, MVT::i8, Expand);		setTruncStoreAction(MVT::i64, MVT::i8, Expand);
setTruncStoreAction(MVT::i64, MVT::i16, Expand);		setTruncStoreAction(MVT::i64, MVT::i16, Expand);
setTruncStoreAction(MVT::i64, MVT::i32, Expand);		setTruncStoreAction(MVT::i64, MVT::i32, Expand);

setTruncStoreAction(MVT::v2i64, MVT::v2i1, Expand);		setTruncStoreAction(MVT::v2i64, MVT::v2i1, Expand);
setTruncStoreAction(MVT::v2i64, MVT::v2i8, Expand);		setTruncStoreAction(MVT::v2i64, MVT::v2i8, Expand);
setTruncStoreAction(MVT::v2i64, MVT::v2i16, Expand);		setTruncStoreAction(MVT::v2i64, MVT::v2i16, Expand);
[...]	AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
setTruncStoreAction(MVT::v4i64, MVT::v4i32, Expand);		setTruncStoreAction(MVT::v4i64, MVT::v4i32, Expand);
setTruncStoreAction(MVT::v4i64, MVT::v4i16, Expand);		setTruncStoreAction(MVT::v4i64, MVT::v4i16, Expand);
setTruncStoreAction(MVT::v4f64, MVT::v4f32, Expand);		setTruncStoreAction(MVT::v4f64, MVT::v4f32, Expand);
setTruncStoreAction(MVT::v4f64, MVT::v4f16, Expand);		setTruncStoreAction(MVT::v4f64, MVT::v4f16, Expand);

setTruncStoreAction(MVT::v8f64, MVT::v8f32, Expand);		setTruncStoreAction(MVT::v8f64, MVT::v8f32, Expand);
setTruncStoreAction(MVT::v8f64, MVT::v8f16, Expand);		setTruncStoreAction(MVT::v8f64, MVT::v8f16, Expand);

		setTruncStoreAction(MVT::v16f64, MVT::v16f32, Expand);
		setTruncStoreAction(MVT::v16f64, MVT::v16f16, Expand);

setOperationAction(ISD::Constant, MVT::i32, Legal);		setOperationAction(ISD::Constant, MVT::i32, Legal);
setOperationAction(ISD::Constant, MVT::i64, Legal);		setOperationAction(ISD::Constant, MVT::i64, Legal);
setOperationAction(ISD::ConstantFP, MVT::f32, Legal);		setOperationAction(ISD::ConstantFP, MVT::f32, Legal);
setOperationAction(ISD::ConstantFP, MVT::f64, Legal);		setOperationAction(ISD::ConstantFP, MVT::f64, Legal);

setOperationAction(ISD::BR_JT, MVT::Other, Expand);		setOperationAction(ISD::BR_JT, MVT::Other, Expand);
setOperationAction(ISD::BRIND, MVT::Other, Expand);		setOperationAction(ISD::BRIND, MVT::Other, Expand);
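
The new Promote entries pair each 64-bit-element vector with an i32 vector of the same total width: v16i64 and v16f64 are 16 x 64 = 1024 bits, and so is v32i32. The sketch below illustrates the property this relies on, in plain C++ without LLVM: the 1024 bits of a <16 x double> can be moved as 32 32-bit words and recovered bit for bit. It assumes that promoting a load or store here amounts to a same-size bitcast. The type aliases and the `toWords` / `fromWords` helpers are hypothetical names for illustration only.

```cpp
#include <array>
#include <cstdint>
#include <cstring>

// Plain C++ stand-ins for the two value types (illustrative names).
using V16F64 = std::array<double, 16>;
using V32I32 = std::array<std::uint32_t, 32>;

// Promotion only makes sense if both types have the same total width.
static_assert(16 * 64 == 32 * 32, "both vectors are 1024 bits wide");
static_assert(sizeof(V16F64) == sizeof(V32I32), "same storage size");

// Reinterpret the bits of a <16 x double> as <32 x i32>, as a bitcast
// would. memcpy keeps the reinterpretation well-defined in C++.
V32I32 toWords(const V16F64 &V) {
  V32I32 W;
  std::memcpy(W.data(), V.data(), sizeof(V));
  return W;
}

// The inverse reinterpretation, recovering the original doubles.
V16F64 fromWords(const V32I32 &W) {
  V16F64 V;
  std::memcpy(V.data(), W.data(), sizeof(W));
  return V;
}
```

A round trip through `toWords` and `fromWords` returns the original values unchanged, which is why the backend can reuse its existing v32i32 memory path for the wider 64-bit-element types.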

llvm/test/Analysis/CostModel/ARM/cast.ll

	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 16 for instruction: %rext_a = sext <2 x i32> undef to <2 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 16 for instruction: %rext_a = sext <2 x i32> undef to <2 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 4 for instruction: %rext_b = zext <2 x i32> undef to <2 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 4 for instruction: %rext_b = zext <2 x i32> undef to <2 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r74 = trunc <8 x i32> undef to <8 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r74 = trunc <8 x i32> undef to <8 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 10 for instruction: %r75 = trunc <16 x i32> undef to <16 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 10 for instruction: %r75 = trunc <16 x i32> undef to <16 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r80 = fptrunc double undef to float			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r80 = fptrunc double undef to float
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r81 = fptrunc <2 x double> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r81 = fptrunc <2 x double> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 10 for instruction: %r82 = fptrunc <4 x double> undef to <4 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 10 for instruction: %r82 = fptrunc <4 x double> undef to <4 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r83 = fptrunc <8 x double> undef to <8 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r83 = fptrunc <8 x double> undef to <8 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 80 for instruction: %r84 = fptrunc <16 x double> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 160 for instruction: %r84 = fptrunc <16 x double> undef to <16 x float>
				dmgreen: Yeah. Sounds fine.
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r85 = fpext float undef to double			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r85 = fpext float undef to double
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r86 = fpext <2 x float> undef to <2 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r86 = fpext <2 x float> undef to <2 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 82 for instruction: %r87 = fpext <4 x float> undef to <4 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 82 for instruction: %r87 = fpext <4 x float> undef to <4 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 328 for instruction: %r88 = fpext <8 x float> undef to <8 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 328 for instruction: %r88 = fpext <8 x float> undef to <8 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 656 for instruction: %r89 = fpext <16 x float> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r89 = fpext <16 x float> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r90 = fptoui <2 x float> undef to <2 x i1>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r90 = fptoui <2 x float> undef to <2 x i1>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r91 = fptosi <2 x float> undef to <2 x i1>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r91 = fptosi <2 x float> undef to <2 x i1>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r92 = fptoui <2 x float> undef to <2 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r92 = fptoui <2 x float> undef to <2 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r93 = fptosi <2 x float> undef to <2 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r93 = fptosi <2 x float> undef to <2 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r94 = fptoui <2 x float> undef to <2 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r94 = fptoui <2 x float> undef to <2 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r95 = fptosi <2 x float> undef to <2 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r95 = fptosi <2 x float> undef to <2 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r96 = fptoui <2 x float> undef to <2 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r96 = fptoui <2 x float> undef to <2 x i32>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r97 = fptosi <2 x float> undef to <2 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 20 for instruction: %r97 = fptosi <2 x float> undef to <2 x i32>
	...
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r152 = fptoui <16 x float> undef to <16 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r152 = fptoui <16 x float> undef to <16 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r153 = fptosi <16 x float> undef to <16 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r153 = fptosi <16 x float> undef to <16 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r154 = fptoui <16 x float> undef to <16 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r154 = fptoui <16 x float> undef to <16 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r155 = fptosi <16 x float> undef to <16 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r155 = fptosi <16 x float> undef to <16 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r156 = fptoui <16 x float> undef to <16 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r156 = fptoui <16 x float> undef to <16 x i32>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r157 = fptosi <16 x float> undef to <16 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r157 = fptosi <16 x float> undef to <16 x i32>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r158 = fptoui <16 x float> undef to <16 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r158 = fptoui <16 x float> undef to <16 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r159 = fptosi <16 x float> undef to <16 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r159 = fptosi <16 x float> undef to <16 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 661 for instruction: %r160 = fptoui <16 x double> undef to <16 x i1>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1322 for instruction: %r160 = fptoui <16 x double> undef to <16 x i1>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 661 for instruction: %r161 = fptosi <16 x double> undef to <16 x i1>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1322 for instruction: %r161 = fptosi <16 x double> undef to <16 x i1>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 661 for instruction: %r162 = fptoui <16 x double> undef to <16 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1322 for instruction: %r162 = fptoui <16 x double> undef to <16 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 661 for instruction: %r163 = fptosi <16 x double> undef to <16 x i8>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1322 for instruction: %r163 = fptosi <16 x double> undef to <16 x i8>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 660 for instruction: %r164 = fptoui <16 x double> undef to <16 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1320 for instruction: %r164 = fptoui <16 x double> undef to <16 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 660 for instruction: %r165 = fptosi <16 x double> undef to <16 x i16>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1320 for instruction: %r165 = fptosi <16 x double> undef to <16 x i16>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 656 for instruction: %r166 = fptoui <16 x double> undef to <16 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r166 = fptoui <16 x double> undef to <16 x i32>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 656 for instruction: %r167 = fptosi <16 x double> undef to <16 x i32>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1312 for instruction: %r167 = fptosi <16 x double> undef to <16 x i32>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 640 for instruction: %r168 = fptoui <16 x double> undef to <16 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1280 for instruction: %r168 = fptoui <16 x double> undef to <16 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 640 for instruction: %r169 = fptosi <16 x double> undef to <16 x i64>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1280 for instruction: %r169 = fptosi <16 x double> undef to <16 x i64>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r170 = uitofp <2 x i1> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r170 = uitofp <2 x i1> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r171 = sitofp <2 x i1> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r171 = sitofp <2 x i1> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r172 = uitofp <2 x i8> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r172 = uitofp <2 x i8> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r173 = sitofp <2 x i8> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r173 = sitofp <2 x i8> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r174 = uitofp <2 x i16> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r174 = uitofp <2 x i16> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r175 = sitofp <2 x i16> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r175 = sitofp <2 x i16> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r176 = uitofp <2 x i32> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r176 = uitofp <2 x i32> undef to <2 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r177 = sitofp <2 x i32> undef to <2 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2 for instruction: %r177 = sitofp <2 x i32> undef to <2 x float>
	...
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r232 = uitofp <16 x i8> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r232 = uitofp <16 x i8> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r233 = sitofp <16 x i8> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 42 for instruction: %r233 = sitofp <16 x i8> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r234 = uitofp <16 x i16> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r234 = uitofp <16 x i16> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r235 = sitofp <16 x i16> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 40 for instruction: %r235 = sitofp <16 x i16> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r236 = uitofp <16 x i32> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r236 = uitofp <16 x i32> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r237 = sitofp <16 x i32> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 8 for instruction: %r237 = sitofp <16 x i32> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 160 for instruction: %r238 = uitofp <16 x i64> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 160 for instruction: %r238 = uitofp <16 x i64> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 160 for instruction: %r239 = sitofp <16 x i64> undef to <16 x float>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 160 for instruction: %r239 = sitofp <16 x i64> undef to <16 x float>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1045 for instruction: %r240 = uitofp <16 x i1> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2090 for instruction: %r240 = uitofp <16 x i1> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1045 for instruction: %r241 = sitofp <16 x i1> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2090 for instruction: %r241 = sitofp <16 x i1> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1045 for instruction: %r242 = uitofp <16 x i8> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2090 for instruction: %r242 = uitofp <16 x i8> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1045 for instruction: %r243 = sitofp <16 x i8> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2090 for instruction: %r243 = sitofp <16 x i8> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1044 for instruction: %r244 = uitofp <16 x i16> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2088 for instruction: %r244 = uitofp <16 x i16> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1044 for instruction: %r245 = sitofp <16 x i16> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2088 for instruction: %r245 = sitofp <16 x i16> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1044 for instruction: %r246 = uitofp <16 x i16> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2088 for instruction: %r246 = uitofp <16 x i16> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1044 for instruction: %r247 = sitofp <16 x i16> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2088 for instruction: %r247 = sitofp <16 x i16> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1024 for instruction: %r248 = uitofp <16 x i64> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2048 for instruction: %r248 = uitofp <16 x i64> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 1024 for instruction: %r249 = sitofp <16 x i64> undef to <16 x double>			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 2048 for instruction: %r249 = sitofp <16 x i64> undef to <16 x double>
	; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef			; CHECK-MVE-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
	;			;
	; CHECK-V8M-MAIN-LABEL: 'casts'			; CHECK-V8M-MAIN-LABEL: 'casts'
	; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r0 = sext i1 undef to i8			; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r0 = sext i1 undef to i8
	; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r1 = zext i1 undef to i8			; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r1 = zext i1 undef to i8
	; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r2 = sext i1 undef to i16			; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r2 = sext i1 undef to i16
	; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r3 = zext i1 undef to i16			; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r3 = zext i1 undef to i16
	; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r4 = sext i1 undef to i32			; CHECK-V8M-MAIN-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %r4 = sext i1 undef to i32

llvm/utils/TableGen/CodeGenTarget.cpp

StringRef llvm::getEnumName(MVT::SimpleValueType T) {
case MVT::v256f32: return "MVT::v256f32";		case MVT::v256f32: return "MVT::v256f32";
case MVT::v512f32: return "MVT::v512f32";		case MVT::v512f32: return "MVT::v512f32";
case MVT::v1024f32: return "MVT::v1024f32";		case MVT::v1024f32: return "MVT::v1024f32";
case MVT::v2048f32: return "MVT::v2048f32";		case MVT::v2048f32: return "MVT::v2048f32";
case MVT::v1f64: return "MVT::v1f64";		case MVT::v1f64: return "MVT::v1f64";
case MVT::v2f64: return "MVT::v2f64";		case MVT::v2f64: return "MVT::v2f64";
case MVT::v4f64: return "MVT::v4f64";		case MVT::v4f64: return "MVT::v4f64";
case MVT::v8f64: return "MVT::v8f64";		case MVT::v8f64: return "MVT::v8f64";
		case MVT::v16f64: return "MVT::v16f64";
case MVT::nxv1i1: return "MVT::nxv1i1";		case MVT::nxv1i1: return "MVT::nxv1i1";
case MVT::nxv2i1: return "MVT::nxv2i1";		case MVT::nxv2i1: return "MVT::nxv2i1";
case MVT::nxv4i1: return "MVT::nxv4i1";		case MVT::nxv4i1: return "MVT::nxv4i1";
case MVT::nxv8i1: return "MVT::nxv8i1";		case MVT::nxv8i1: return "MVT::nxv8i1";
case MVT::nxv16i1: return "MVT::nxv16i1";		case MVT::nxv16i1: return "MVT::nxv16i1";
case MVT::nxv32i1: return "MVT::nxv32i1";		case MVT::nxv32i1: return "MVT::nxv32i1";
case MVT::nxv1i8: return "MVT::nxv1i8";		case MVT::nxv1i8: return "MVT::nxv1i8";
case MVT::nxv2i8: return "MVT::nxv2i8";		case MVT::nxv2i8: return "MVT::nxv2i8";


Diff 264039
