This is an archive of the discontinued LLVM Phabricator instance.

[IR] Match intrinsic parameter by scalar/vectorwidth
ClosedPublic

Authored by RKSimon on Jan 23 2019, 3:25 AM.

Details

Reviewers

spatel
andreadb
craig.topper
delena
leonardchan
majnemer

Commits

rGf87226eb7081: [IR] Match intrinsic parameter by scalar/vectorwidth
rL351957: [IR] Match intrinsic parameter by scalar/vectorwidth

Summary

This patch replaces the existing LLVMVectorSameWidth matcher with LLVMScalarOrSameVectorWidth.

The matching arguments must either both be scalars or both be vectors with the same number of elements. In either case the scalar/element type can differ from the reference type and is given by the elty operand of LLVMScalarOrSameVectorWidth.

I've updated the _overflow intrinsics to demonstrate this, allowing them to return an i1 or <N x i1> overflow result that matches the scalar/vector width of the other (add/sub/mul) result type.

The masked load/store/gather/scatter intrinsics have also been updated to use this. Because the reference type there is llvm_anyvector_ty, the mask is still guaranteed to be <N x i1>, so there is no change in behaviour.
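
As a rough illustration of the rule described in this summary (not code from the patch), the following C++ sketch models a type as a bit width plus an element count and derives the overflow-flag type from the arithmetic result type. `ToyType`, `overflowFlagType` and `toString` are names invented for this sketch; they are not LLVM APIs.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for an LLVM integer or vector-of-integer type.
// NumElts == 0 denotes a scalar; otherwise a fixed vector <NumElts x iBits>.
struct ToyType {
  unsigned Bits;
  unsigned NumElts;
};

// Derive the overflow-flag type from the arithmetic result type:
// a scalar iN yields i1, a vector <N x iM> yields <N x i1>.
ToyType overflowFlagType(ToyType Result) {
  return ToyType{1, Result.NumElts};
}

// Render a type in LLVM IR syntax, e.g. "i32" or "<4 x i32>".
std::string toString(ToyType T) {
  std::string Elt = "i" + std::to_string(T.Bits);
  if (T.NumElts == 0)
    return Elt;
  return "<" + std::to_string(T.NumElts) + " x " + Elt + ">";
}
```

Under these assumptions, an `i32` result pairs with an `i1` flag and a `<4 x i32>` result pairs with a `<4 x i1>` flag, which is the shape of the `{<4 x i32>, <4 x i1>}` return types declared in the new test file.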

Diff Detail

Repository: rL LLVM

Event Timeline

RKSimon created this revision. Jan 23 2019, 3:25 AM

RKSimon mentioned this in D56907: [TTI] Add generic UADDSAT/USUBSAT and UADDO/USUBO costs. Jan 23 2019, 3:27 AM

The patch looks good to me. However, I don't claim to be an expert in this area, so getting another pair of eyes on this could be useful.

I was about to suggest adding a small example to the code comment of LLVMScalarOrSameVectorWidth. However, in retrospect, I think your comment is fine.

This revision is now accepted and ready to land. Jan 23 2019, 4:45 AM

This implies that we should change the LangRef for overflowing ops to match? Currently, there's no mention of vector types:
http://llvm.org/docs/LangRef.html#arithmetic-with-overflow-intrinsics

In D57090#1367719, @spatel wrote:

This implies that we should change the LangRef for overflowing ops to match? Currently, there's no mention of vector types:
http://llvm.org/docs/LangRef.html#arithmetic-with-overflow-intrinsics

If it's alright, I'd prefer to wait on updating the docs until I have full testing of vector types in place, which would be in a follow-up patch.

In D57090#1367749, @RKSimon wrote:

If it's alright, I'd prefer to wait on updating the docs until I have full testing of vector types in place, which would be in a follow-up patch.

Yes, that sounds ok to me. LGTM - see inline for a couple of nits.

include/llvm/IR/Intrinsics.td
163	Not sure if I've ever looked at this code before, so it wasn't immediately clear what 'num' was referring to. Would it make sense to name that "index" or "opIndex"? (Similarly in the existing code around this).
lib/IR/Function.cpp
951	use 'auto' for consistency with the other dyn_casts around here.

Closed by commit rL351957: [IR] Match intrinsic parameter by scalar/vectorwidth (authored by RKSimon). Jan 23 2019, 8:00 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
include/llvm/IR/Intrinsics.td (revision 351935)	37 lines
lib/IR/Function.cpp (revision 351935)	25 lines
test/Analysis/CostModel/X86/arith-overflow.ll (nonexistent)	414 lines
utils/TableGen/CodeGenTarget.cpp (revision 351935)	2 lines
utils/TableGen/IntrinsicEmitter.cpp (revision 351935)	2 lines

Diff 183073

include/llvm/IR/Intrinsics.td

[...]
 }

 // Match the type of another intrinsic parameter that is expected to be based on
 // an integral type (i.e. either iN or <N x iM>), but change the scalar size to
 // be twice as wide or half as wide as the other type. This is only useful when
 // the intrinsic is overloaded, so the matched type should be declared as iAny.
 class LLVMExtendedType<int num> : LLVMMatchType<num>;
 class LLVMTruncatedType<int num> : LLVMMatchType<num>;
-class LLVMVectorSameWidth<int num, LLVMType elty>
+// Match the scalar/vector of another intrinsic parameter but with a different
+// element type. Either both are scalars or both are vectors with the same
+// number of elements.
+class LLVMScalarOrSameVectorWidth<int num, LLVMType elty>
[Inline comment on the line above] spatel: Not sure if I've ever looked at this code before, so it wasn't immediately clear what 'num' was referring to. Would it make sense to name that "index" or "opIndex"? (Similarly in the existing code around this).
   : LLVMMatchType<num> {
   ValueType ElTy = elty.VT;
 }

 class LLVMPointerTo<int num> : LLVMMatchType<num>;
 class LLVMPointerToElt<int num> : LLVMMatchType<num>;
 class LLVMVectorOfAnyPointersToElt<int num> : LLVMMatchType<num>;

 // Match the type of another intrinsic parameter that is expected to be a
 // vector type, but change the element count to be half as many
 class LLVMHalfElementsVectorType<int num> : LLVMMatchType<num>;

[...]
 def int_adjust_trampoline : Intrinsic<[llvm_ptr_ty], [llvm_ptr_ty],
     [IntrReadMem, IntrArgMemOnly]>,
     GCCBuiltin<"__builtin_adjust_trampoline">;

 //===------------------------ Overflow Intrinsics -------------------------===//
 //

 // Expose the carry flag from add operations on two integrals.
-def int_sadd_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_sadd_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;
-def int_uadd_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_uadd_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;

-def int_ssub_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_ssub_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;
-def int_usub_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_usub_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;

-def int_smul_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_smul_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;
-def int_umul_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
+def int_umul_with_overflow : Intrinsic<[llvm_anyint_ty,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable]>;

 //===------------------------- Saturation Arithmetic Intrinsics ---------------------===//
 //
 def int_sadd_sat : Intrinsic<[llvm_anyint_ty],
     [LLVMMatchType<0>, LLVMMatchType<0>],
     [IntrNoMem, IntrSpeculatable, Commutative]>;
[...]
 def int_is_constant : Intrinsic<[llvm_i1_ty], [llvm_any_ty], [IntrNoMem], "llvm.is.constant">;


 //===-------------------------- Masked Intrinsics -------------------------===//
 //
 def int_masked_store : Intrinsic<[], [llvm_anyvector_ty,
     LLVMAnyPointerType<LLVMMatchType<0>>,
     llvm_i32_ty,
-    LLVMVectorSameWidth<0, llvm_i1_ty>],
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [IntrArgMemOnly]>;

 def int_masked_load : Intrinsic<[llvm_anyvector_ty],
     [LLVMAnyPointerType<LLVMMatchType<0>>, llvm_i32_ty,
-    LLVMVectorSameWidth<0, llvm_i1_ty>, LLVMMatchType<0>],
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>, LLVMMatchType<0>],
     [IntrReadMem, IntrArgMemOnly]>;

 def int_masked_gather: Intrinsic<[llvm_anyvector_ty],
     [LLVMVectorOfAnyPointersToElt<0>, llvm_i32_ty,
-    LLVMVectorSameWidth<0, llvm_i1_ty>,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
     LLVMMatchType<0>],
     [IntrReadMem]>;

 def int_masked_scatter: Intrinsic<[],
     [llvm_anyvector_ty,
     LLVMVectorOfAnyPointersToElt<0>, llvm_i32_ty,
-    LLVMVectorSameWidth<0, llvm_i1_ty>]>;
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>]>;

 def int_masked_expandload: Intrinsic<[llvm_anyvector_ty],
     [LLVMPointerToElt<0>,
-    LLVMVectorSameWidth<0, llvm_i1_ty>,
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
     LLVMMatchType<0>],
     [IntrReadMem]>;

 def int_masked_compressstore: Intrinsic<[],
     [llvm_anyvector_ty,
     LLVMPointerToElt<0>,
-    LLVMVectorSameWidth<0, llvm_i1_ty>],
+    LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
     [IntrArgMemOnly]>;

 // Test whether a pointer is associated with a type metadata identifier.
 def int_type_test : Intrinsic<[llvm_i1_ty], [llvm_ptr_ty, llvm_metadata_ty],
     [IntrNoMem]>;

 // Safely loads a function pointer from a virtual table pointer using type metadata.
 def int_type_checked_load : Intrinsic<[llvm_ptr_ty, llvm_i1_ty],
[...]

lib/IR/Function.cpp

[...] (enclosing context: case IITDescriptor::TruncArgument: {)
     return IntegerType::get(Context, ITy->getBitWidth() / 2);
   }
   case IITDescriptor::HalfVecArgument:
     return VectorType::getHalfElementsVectorType(cast<VectorType>(
         Tys[D.getArgumentNumber()]));
   case IITDescriptor::SameVecWidthArgument: {
     Type *EltTy = DecodeFixedType(Infos, Tys, Context);
     Type *Ty = Tys[D.getArgumentNumber()];
-    if (VectorType *VTy = dyn_cast<VectorType>(Ty)) {
+    if (VectorType *VTy = dyn_cast<VectorType>(Ty))
[Inline comment on the line above] spatel: use 'auto' for consistency with the other dyn_casts around here.
       return VectorType::get(EltTy, VTy->getNumElements());
-    }
-    llvm_unreachable("unhandled");
+    return EltTy;
   }
   case IITDescriptor::PtrToArgument: {
     Type *Ty = Tys[D.getArgumentNumber()];
     return PointerType::getUnqual(Ty);
   }
   case IITDescriptor::PtrToElt: {
     Type *Ty = Tys[D.getArgumentNumber()];
     VectorType *VTy = dyn_cast<VectorType>(Ty);
[...] (enclosing context: case IITDescriptor::HalfVecArgument:)
     // This may only be used when referring to a previous vector argument.
     return D.getArgumentNumber() >= ArgTys.size() ||
            !isa<VectorType>(ArgTys[D.getArgumentNumber()]) ||
            VectorType::getHalfElementsVectorType(
                cast<VectorType>(ArgTys[D.getArgumentNumber()])) != Ty;
   case IITDescriptor::SameVecWidthArgument: {
     if (D.getArgumentNumber() >= ArgTys.size())
       return true;
-    VectorType * ReferenceType =
-        dyn_cast<VectorType>(ArgTys[D.getArgumentNumber()]);
-    VectorType *ThisArgType = dyn_cast<VectorType>(Ty);
-    if (!ThisArgType || !ReferenceType ||
-        (ReferenceType->getVectorNumElements() !=
-         ThisArgType->getVectorNumElements()))
-      return true;
-    return matchIntrinsicType(ThisArgType->getVectorElementType(),
-                              Infos, ArgTys);
+    auto *ReferenceType = dyn_cast<VectorType>(ArgTys[D.getArgumentNumber()]);
+    auto *ThisArgType = dyn_cast<VectorType>(Ty);
+    // Both must be vectors of the same number of elements or neither.
+    if ((ReferenceType != nullptr) != (ThisArgType != nullptr))
+      return true;
+    Type *EltTy = Ty;
+    if (ThisArgType) {
+      if (ReferenceType->getVectorNumElements() !=
+          ThisArgType->getVectorNumElements())
+        return true;
+      EltTy = ThisArgType->getVectorElementType();
+    }
+    return matchIntrinsicType(EltTy, Infos, ArgTys);
   }
   case IITDescriptor::PtrToArgument: {
     if (D.getArgumentNumber() >= ArgTys.size())
       return true;
     Type * ReferenceType = ArgTys[D.getArgumentNumber()];
     PointerType *ThisArgType = dyn_cast<PointerType>(Ty);
     return (!ThisArgType || ThisArgType->getElementType() != ReferenceType);
   }
[...]
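
The updated SameVecWidthArgument check in the Function.cpp diff can be paraphrased outside LLVM as the following C++ sketch. It is only a model under stated assumptions: `Shape` and `mismatchesScalarOrSameVectorWidth` are hypothetical names, a type is reduced to a bit width and an element count, and the recursive matchIntrinsicType call on the element type is replaced by a plain width comparison. Like the original, it returns true on a mismatch.

```cpp
#include <cassert>

// Hypothetical stand-in for an LLVM type: NumElts == 0 denotes a scalar,
// otherwise a fixed vector with NumElts elements of Bits bits each.
struct Shape {
  unsigned Bits;
  unsigned NumElts;
};

// Returns true on a mismatch, following the convention used by the matcher
// in the diff. EltBits is the element width the matcher expects
// (1 for the llvm_i1_ty used by the overflow and masked intrinsics).
bool mismatchesScalarOrSameVectorWidth(Shape Reference, Shape Arg,
                                       unsigned EltBits) {
  bool RefIsVector = Reference.NumElts != 0;
  bool ArgIsVector = Arg.NumElts != 0;
  // Both must be vectors or neither.
  if (RefIsVector != ArgIsVector)
    return true;
  // Vectors must agree on the number of elements.
  if (ArgIsVector && Reference.NumElts != Arg.NumElts)
    return true;
  // The scalar/element type is checked independently of the reference type.
  return Arg.Bits != EltBits;
}
```

In this model a scalar reference with a scalar `i1` argument is accepted, which the old check rejected because it required both types to be vectors.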

test/Analysis/CostModel/X86/arith-overflow.ll

				; NOTE: Assertions have been autogenerated by utils/update_analyze_test_checks.py
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+ssse3 | FileCheck %s --check-prefixes=CHECK,SSE,SSSE3
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+sse4.2 | FileCheck %s --check-prefixes=CHECK,SSE,SSE42
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+avx | FileCheck %s --check-prefixes=CHECK,AVX,AVX1
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+avx2 | FileCheck %s --check-prefixes=CHECK,AVX,AVX2
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+avx512f | FileCheck %s --check-prefixes=CHECK,AVX512,AVX512F
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+avx512f,+avx512bw | FileCheck %s --check-prefixes=CHECK,AVX512,AVX512BW
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mattr=+avx512f,+avx512dq | FileCheck %s --check-prefixes=CHECK,AVX512,AVX512DQ
				;
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mcpu=slm | FileCheck %s --check-prefixes=CHECK,SLM
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mcpu=goldmont | FileCheck %s --check-prefixes=CHECK,GLM
				; RUN: opt < %s -cost-model -analyze -mtriple=x86_64-apple-macosx10.8.0 -mcpu=btver2 | FileCheck %s --check-prefixes=CHECK,BTVER2

				;
				; sadd.with.overflow
				;

				declare {i64, i1} @llvm.sadd.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.sadd.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.sadd.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.sadd.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.sadd.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.sadd.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.sadd.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.sadd.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.sadd.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.sadd.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.sadd.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.sadd.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.sadd.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.sadd.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.sadd.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.sadd.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @sadd(i32 %arg) {
				; CHECK-LABEL: 'sadd'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.sadd.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.sadd.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.sadd.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.sadd.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.sadd.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.sadd.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.sadd.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.sadd.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.sadd.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.sadd.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.sadd.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.sadd.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.sadd.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.sadd.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.sadd.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.sadd.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.sadd.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.sadd.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.sadd.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.sadd.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.sadd.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.sadd.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.sadd.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.sadd.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.sadd.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.sadd.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.sadd.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.sadd.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.sadd.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.sadd.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.sadd.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.sadd.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}

				;
				; uadd.with.overflow
				;

				declare {i64, i1} @llvm.uadd.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.uadd.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.uadd.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.uadd.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.uadd.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.uadd.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.uadd.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.uadd.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.uadd.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.uadd.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.uadd.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.uadd.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.uadd.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.uadd.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.uadd.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.uadd.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @uadd(i32 %arg) {
				; CHECK-LABEL: 'uadd'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.uadd.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.uadd.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.uadd.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.uadd.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.uadd.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.uadd.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.uadd.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.uadd.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.uadd.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.uadd.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.uadd.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.uadd.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.uadd.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.uadd.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.uadd.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.uadd.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.uadd.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.uadd.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.uadd.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.uadd.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.uadd.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.uadd.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.uadd.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.uadd.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.uadd.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.uadd.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.uadd.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.uadd.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.uadd.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.uadd.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.uadd.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.uadd.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}

				;
				; ssub.with.overflow
				;

				declare {i64, i1} @llvm.ssub.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.ssub.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.ssub.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.ssub.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.ssub.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.ssub.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.ssub.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.ssub.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.ssub.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.ssub.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.ssub.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.ssub.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.ssub.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.ssub.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.ssub.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.ssub.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @ssub(i32 %arg) {
				; CHECK-LABEL: 'ssub'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.ssub.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.ssub.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.ssub.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.ssub.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.ssub.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.ssub.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.ssub.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.ssub.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.ssub.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.ssub.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.ssub.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.ssub.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.ssub.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.ssub.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.ssub.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.ssub.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.ssub.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.ssub.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.ssub.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.ssub.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.ssub.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.ssub.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.ssub.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.ssub.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.ssub.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.ssub.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.ssub.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.ssub.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.ssub.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.ssub.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.ssub.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.ssub.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}

				;
				; usub.with.overflow
				;

				declare {i64, i1} @llvm.usub.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.usub.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.usub.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.usub.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.usub.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.usub.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.usub.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.usub.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.usub.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.usub.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.usub.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.usub.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.usub.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.usub.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.usub.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.usub.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @usub(i32 %arg) {
				; CHECK-LABEL: 'usub'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.usub.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.usub.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.usub.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.usub.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.usub.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.usub.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.usub.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.usub.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.usub.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.usub.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.usub.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.usub.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.usub.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.usub.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.usub.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.usub.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.usub.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.usub.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.usub.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.usub.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.usub.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.usub.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.usub.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.usub.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.usub.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.usub.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.usub.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.usub.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.usub.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.usub.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.usub.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.usub.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}

				;
				; smul.with.overflow
				;

				declare {i64, i1} @llvm.smul.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.smul.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.smul.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.smul.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.smul.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.smul.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.smul.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.smul.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.smul.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.smul.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.smul.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.smul.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.smul.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.smul.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.smul.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.smul.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @smul(i32 %arg) {
				; CHECK-LABEL: 'smul'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.smul.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.smul.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.smul.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.smul.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.smul.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.smul.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.smul.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.smul.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.smul.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.smul.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.smul.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.smul.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.smul.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.smul.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.smul.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.smul.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.smul.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.smul.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.smul.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.smul.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.smul.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.smul.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.smul.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.smul.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.smul.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.smul.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.smul.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.smul.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.smul.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.smul.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.smul.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.smul.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}

				;
				; umul.with.overflow
				;

				declare {i64, i1} @llvm.umul.with.overflow.i64(i64, i64)
				declare {<2 x i64>, <2 x i1>} @llvm.umul.with.overflow.v2i64(<2 x i64>, <2 x i64>)
				declare {<4 x i64>, <4 x i1>} @llvm.umul.with.overflow.v4i64(<4 x i64>, <4 x i64>)
				declare {<8 x i64>, <8 x i1>} @llvm.umul.with.overflow.v8i64(<8 x i64>, <8 x i64>)

				declare {i32, i1} @llvm.umul.with.overflow.i32(i32, i32)
				declare {<4 x i32>, <4 x i1>} @llvm.umul.with.overflow.v4i32(<4 x i32>, <4 x i32>)
				declare {<8 x i32>, <8 x i1>} @llvm.umul.with.overflow.v8i32(<8 x i32>, <8 x i32>)
				declare {<16 x i32>, <16 x i1>} @llvm.umul.with.overflow.v16i32(<16 x i32>, <16 x i32>)

				declare {i16, i1} @llvm.umul.with.overflow.i16(i16, i16)
				declare {<8 x i16>, <8 x i1>} @llvm.umul.with.overflow.v8i16(<8 x i16>, <8 x i16>)
				declare {<16 x i16>, <16 x i1>} @llvm.umul.with.overflow.v16i16(<16 x i16>, <16 x i16>)
				declare {<32 x i16>, <32 x i1>} @llvm.umul.with.overflow.v32i16(<32 x i16>, <32 x i16>)

				declare {i8, i1} @llvm.umul.with.overflow.i8(i8, i8)
				declare {<16 x i8>, <16 x i1>} @llvm.umul.with.overflow.v16i8(<16 x i8>, <16 x i8>)
				declare {<32 x i8>, <32 x i1>} @llvm.umul.with.overflow.v32i8(<32 x i8>, <32 x i8>)
				declare {<64 x i8>, <64 x i1>} @llvm.umul.with.overflow.v64i8(<64 x i8>, <64 x i8>)

				define i32 @umul(i32 %arg) {
				; CHECK-LABEL: 'umul'
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I64 = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 undef, i64 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %V2I64 = call { <2 x i64>, <2 x i1> } @llvm.umul.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I64 = call { <4 x i64>, <4 x i1> } @llvm.umul.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I64 = call { <8 x i64>, <8 x i1> } @llvm.umul.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I32 = call { i32, i1 } @llvm.umul.with.overflow.i32(i32 undef, i32 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 11 for instruction: %V4I32 = call { <4 x i32>, <4 x i1> } @llvm.umul.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I32 = call { <8 x i32>, <8 x i1> } @llvm.umul.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I32 = call { <16 x i32>, <16 x i1> } @llvm.umul.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I16 = call { i16, i1 } @llvm.umul.with.overflow.i16(i16 undef, i16 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 23 for instruction: %V8I16 = call { <8 x i16>, <8 x i1> } @llvm.umul.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I16 = call { <16 x i16>, <16 x i1> } @llvm.umul.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I16 = call { <32 x i16>, <32 x i1> } @llvm.umul.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %I8 = call { i8, i1 } @llvm.umul.with.overflow.i8(i8 undef, i8 undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 47 for instruction: %V16I8 = call { <16 x i8>, <16 x i1> } @llvm.umul.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 95 for instruction: %V32I8 = call { <32 x i8>, <32 x i1> } @llvm.umul.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 191 for instruction: %V64I8 = call { <64 x i8>, <64 x i1> } @llvm.umul.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)
				; CHECK-NEXT: Cost Model: Found an estimated cost of 0 for instruction: ret i32 undef
				;
				%I64 = call {i64, i1} @llvm.umul.with.overflow.i64(i64 undef, i64 undef)
				%V2I64 = call {<2 x i64>, <2 x i1>} @llvm.umul.with.overflow.v2i64(<2 x i64> undef, <2 x i64> undef)
				%V4I64 = call {<4 x i64>, <4 x i1>} @llvm.umul.with.overflow.v4i64(<4 x i64> undef, <4 x i64> undef)
				%V8I64 = call {<8 x i64>, <8 x i1>} @llvm.umul.with.overflow.v8i64(<8 x i64> undef, <8 x i64> undef)

				%I32 = call {i32, i1} @llvm.umul.with.overflow.i32(i32 undef, i32 undef)
				%V4I32 = call {<4 x i32>, <4 x i1>} @llvm.umul.with.overflow.v4i32(<4 x i32> undef, <4 x i32> undef)
				%V8I32 = call {<8 x i32>, <8 x i1>} @llvm.umul.with.overflow.v8i32(<8 x i32> undef, <8 x i32> undef)
				%V16I32 = call {<16 x i32>, <16 x i1>} @llvm.umul.with.overflow.v16i32(<16 x i32> undef, <16 x i32> undef)

				%I16 = call {i16, i1} @llvm.umul.with.overflow.i16(i16 undef, i16 undef)
				%V8I16 = call {<8 x i16>, <8 x i1>} @llvm.umul.with.overflow.v8i16(<8 x i16> undef, <8 x i16> undef)
				%V16I16 = call {<16 x i16>, <16 x i1>} @llvm.umul.with.overflow.v16i16(<16 x i16> undef, <16 x i16> undef)
				%V32I16 = call {<32 x i16>, <32 x i1>} @llvm.umul.with.overflow.v32i16(<32 x i16> undef, <32 x i16> undef)

				%I8 = call {i8, i1} @llvm.umul.with.overflow.i8(i8 undef, i8 undef)
				%V16I8 = call {<16 x i8>, <16 x i1>} @llvm.umul.with.overflow.v16i8(<16 x i8> undef, <16 x i8> undef)
				%V32I8 = call {<32 x i8>, <32 x i1>} @llvm.umul.with.overflow.v32i8(<32 x i8> undef, <32 x i8> undef)
				%V64I8 = call {<64 x i8>, <64 x i1>} @llvm.umul.with.overflow.v64i8(<64 x i8> undef, <64 x i8> undef)

				ret i32 undef
				}
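
The declarations in this test all follow one shape rule: the overflow flag is i1 when the arithmetic result is a scalar, and <N x i1> when the result is an N-element vector. A minimal sketch of that rule follows, using a simplified stand-in for IR types. TypeShape and matchesScalarOrSameVectorWidth are hypothetical names for illustration only and are not part of LLVM.

```cpp
#include <cassert>

// Hypothetical, simplified model of an IR type (not an LLVM class).
// NumElts == 0 stands for a scalar; NumElts > 0 for a fixed-width vector.
struct TypeShape {
  unsigned ElemBits; // bit width of the scalar or of each vector element
  unsigned NumElts;  // 0 for scalars, element count for vectors
};

// Returns true when Candidate is "scalar or same vector width" relative to
// Ref: both scalars, or both vectors with the same element count. The
// element type may differ from Ref's, but must equal RequiredElemBits
// (for example 1 for the i1 overflow flag).
inline bool matchesScalarOrSameVectorWidth(TypeShape Ref, TypeShape Candidate,
                                           unsigned RequiredElemBits) {
  assert(RequiredElemBits > 0 && "element type must have a width");
  return Candidate.NumElts == Ref.NumElts &&
         Candidate.ElemBits == RequiredElemBits;
}
```

Under this model, {i32, i1} and {<4 x i64>, <4 x i1>} are accepted, while pairing <4 x i64> with a scalar i1 or with <8 x i1> is rejected.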

utils/TableGen/CodeGenTarget.cpp

[627 lines hidden; enclosing context: if (TyEl->isSubClassOf("LLVMMatchType")) {]
            PrintFatalError(Twine("ParamTypes is ") + TypeList->getAsString());
          }
          VT = OverloadedVTs[MatchTy];
          // It only makes sense to use the extended and truncated vector element
          // variants with iAny types; otherwise, if the intrinsic is not
          // overloaded, all the types can be specified directly.
          assert(((!TyEl->isSubClassOf("LLVMExtendedType") &&
                   !TyEl->isSubClassOf("LLVMTruncatedType") &&
-                  !TyEl->isSubClassOf("LLVMVectorSameWidth")) ||
+                  !TyEl->isSubClassOf("LLVMScalarOrSameVectorWidth")) ||
                  VT == MVT::iAny || VT == MVT::vAny) &&
                 "Expected iAny or vAny type");
        } else
          VT = getValueType(TyEl->getValueAsDef("VT"));

        if (MVT(VT).isOverloaded()) {
          OverloadedVTs.push_back(VT);
          isOverloaded = true;
[71 lines hidden]

utils/TableGen/IntrinsicEmitter.cpp

[263 lines hidden; enclosing context: if (R->isSubClassOf("LLVMMatchType")) {]
      unsigned Number = R->getValueAsInt("Number");
      assert(Number < ArgCodes.size() && "Invalid matching number!");
      if (R->isSubClassOf("LLVMExtendedType"))
        Sig.push_back(IIT_EXTEND_ARG);
      else if (R->isSubClassOf("LLVMTruncatedType"))
        Sig.push_back(IIT_TRUNC_ARG);
      else if (R->isSubClassOf("LLVMHalfElementsVectorType"))
        Sig.push_back(IIT_HALF_VEC_ARG);
-     else if (R->isSubClassOf("LLVMVectorSameWidth")) {
+     else if (R->isSubClassOf("LLVMScalarOrSameVectorWidth")) {
        Sig.push_back(IIT_SAME_VEC_WIDTH_ARG);
        Sig.push_back((Number << 3) | ArgCodes[Number]);
        MVT::SimpleValueType VT = getValueType(R->getValueAsDef("ElTy"));
        EncodeFixedValueType(VT, Sig);
        return;
      }
      else if (R->isSubClassOf("LLVMPointerTo"))
        Sig.push_back(IIT_PTR_TO_ARG);
[582 lines hidden]
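
The expression (Number << 3) | ArgCodes[Number] in the hunk above packs the index of the matched argument and that argument's kind code into a single signature entry. A minimal sketch of the packing and its inverse follows. It assumes each kind code fits in the low 3 bits and that the packed value fits in one byte; the helper names packArgInfo, unpackArgNumber and unpackArgKind are hypothetical and do not appear in the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper mirroring (Number << 3) | ArgCodes[Number].
// Assumption: kind codes occupy the low 3 bits, the argument index the rest.
inline std::uint8_t packArgInfo(unsigned Number,
                                const std::vector<std::uint8_t> &ArgCodes) {
  assert(Number < ArgCodes.size() && "Invalid matching number!");
  assert(ArgCodes[Number] < 8 && "kind code must fit in 3 bits");
  assert(Number < 32 && "index must fit in the remaining 5 bits");
  return static_cast<std::uint8_t>((Number << 3) | ArgCodes[Number]);
}

// Inverse operations: recover the argument index and the kind code.
inline unsigned unpackArgNumber(std::uint8_t Info) { return Info >> 3; }
inline unsigned unpackArgKind(std::uint8_t Info) { return Info & 7; }
```

For example, with kind codes {1, 3, 0}, packing argument 1 gives (1 << 3) | 3 = 11, which unpacks back to index 1 and kind 3.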