This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/AArch64/
-
Target/
-
AArch64/
-
AArch64TargetTransformInfo.cpp
-
test/Analysis/CostModel/AArch64/
-
Analysis/
-
CostModel/
-
AArch64/
-
vector-select.ll

Differential D118256

[AArch64] Fix costs of float vector compare/selects pairs.
ClosedPublic

Authored by fhahn on Jan 26 2022, 8:41 AM.

Download Raw Diff

Details

Reviewers

dmgreen
ab
t.p.northover
samparker

Commits

rG17ebd68ae694: [AArch64] Fix costs of float vector compare/selects pairs.

Summary

The current cost-model overestimates the cost of vector compares &
selects for ordered floating point compares. This patch fixes that by
extending the existing logic for integer predicates.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	1,920 ms	x64 debian > AddressSanitizer-x86_64-linux.TestCases::strcmp.c
	3,940 ms	x64 debian > libarcher.races::critical-unrelated.c
	4,970 ms	x64 debian > libarcher.races::lock-nested-unrelated.c
	3,990 ms	x64 debian > libarcher.races::lock-unrelated.c
	4,080 ms	x64 debian > libarcher.races::parallel-simple.c
		View Full Test Results (10 Failed)

Event Timeline

fhahn created this revision.Jan 26 2022, 8:41 AM

Herald added subscribers: hiraditya, kristof.beyls. · View Herald TranscriptJan 26 2022, 8:41 AM

fhahn requested review of this revision.Jan 26 2022, 8:41 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 26 2022, 8:41 AM

This sounds OK. I went down a bit of a rabbit hole though.

An FCMP_UNE will be cheap too.
The other fcmp operators are not much more expensive, and it is the compare that takes 2 compares and an Or - you can argue that the select is still cheap.
With fast the others are cheap too.
A default cost of 13 for a vselect seems quite high to me. Given a mask (as vector compares generate on AArch64), the cost of a select will usually be a bif/bic/bsl. (In the not super distant past they would be and's and nots and ors, but things are generally better since then). If the compare isn't operating on elements of the same size as the select, the mask may need to be extended or truncated first though.
Does this account for times when the compare and select type sizes do not match? That becomes harder to get correct.

Most of that would be better left for other patches though (if it turns out to be useful at-all!)
I would guess that FCMP_UNE are worth adding here, at least.

Harbormaster completed remote builds in B145761: Diff 403286.Jan 27 2022, 3:27 AM

fhahn mentioned this in rGcb3df1a29956: [AArch64] Add vector compare/select tests with UNE predicate..Jan 27 2022, 6:22 AM

In D118256#3275273, @dmgreen wrote:

This sounds OK. I went down a bit of a rabbit hole though.

Thanks for that! Unfortuantely there still are many other cases where we assign sub-optimal costs (usually too high).

An FCMP_UNE will be cheap too.

Great point, I included it in the pathc.

The other fcmp operators are not much more expensive, and it is the compare that takes 2 compares and an Or - you can argue that the select is still cheap.

Agreed! But I think that's best improved separately, as all cases included in the patch at the moment follow exactly the same reasoning as the integer cases (select will cost exactly one instruction).

With fast the others are cheap too.

Agreed, cmp + select might only be single min/max instruction. At the moment, I don't think there's a convenient way to check if the compare had the right fast-math flags here. I am also not sure if changing the cost from 2 to 1 would make a huge difference in practice.
j

A default cost of 13 for a vselect seems quite high to me. Given a mask (as vector compares generate on AArch64), the cost of a select will usually be a bif/bic/bsl. (In the not super distant past they would be and's and nots and ors, but things are generally better since then). If the compare isn't operating on elements of the same size as the select, the mask may need to be extended or truncated first though.

Yeah, the default cost seems too high, especially for vectors with more than 2 elements. But that's a separate issue I think.

Does this account for times when the compare and select type sizes do not match? That becomes harder to get correct.

Good point, and no I don't think so. We have the same issue for integer predicates I think. So maybe that could be fixed independently, possibly by looking at the context instruction?

Most of that would be better left for other patches though (if it turns out to be useful at-all!)
I would guess that FCMP_UNE are worth adding here, at least.

Thanks. LGTM

This revision is now accepted and ready to land.Jan 27 2022, 8:00 AM

Harbormaster completed remote builds in B146019: Diff 403638.Jan 27 2022, 11:38 AM

This revision was landed with ongoing or failed builds.Jan 31 2022, 2:18 AM

Closed by commit rG17ebd68ae694: [AArch64] Fix costs of float vector compare/selects pairs. (authored by fhahn). · Explain Why

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rG17ebd68ae694: [AArch64] Fix costs of float vector compare/selects pairs..

Revision Contents

Path

Size

llvm/

lib/

Target/

AArch64/

AArch64TargetTransformInfo.cpp

21 lines

test/

Analysis/

CostModel/

AArch64/

vector-select.ll

60 lines

Diff 403638

llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp

Show First 20 Lines • Show All 1,880 Lines • ▼ Show 20 Lines	if (isa<FixedVectorType>(ValTy) && ISD == ISD::SELECT) {
// If VecPred is not set, check if we can get a predicate from the context		// If VecPred is not set, check if we can get a predicate from the context
// instruction, if its type matches the requested ValTy.		// instruction, if its type matches the requested ValTy.
if (VecPred == CmpInst::BAD_ICMP_PREDICATE && I && I->getType() == ValTy) {		if (VecPred == CmpInst::BAD_ICMP_PREDICATE && I && I->getType() == ValTy) {
CmpInst::Predicate CurrentPred;		CmpInst::Predicate CurrentPred;
if (match(I, m_Select(m_Cmp(CurrentPred, m_Value(), m_Value()), m_Value(),		if (match(I, m_Select(m_Cmp(CurrentPred, m_Value(), m_Value()), m_Value(),
m_Value())))		m_Value())))
VecPred = CurrentPred;		VecPred = CurrentPred;
}		}
// Check if we have a compare/select chain that can be lowered using CMxx &		// Check if we have a compare/select chain that can be lowered using
// BFI pair.		// a (F)CMxx & BFI pair.
if (CmpInst::isIntPredicate(VecPred)) {		if (CmpInst::isIntPredicate(VecPred) \|\| VecPred == CmpInst::FCMP_OLE \|\|
static const auto ValidMinMaxTys = {MVT::v8i8, MVT::v16i8, MVT::v4i16,		VecPred == CmpInst::FCMP_OLT \|\| VecPred == CmpInst::FCMP_OGT \|\|
MVT::v8i16, MVT::v2i32, MVT::v4i32,		VecPred == CmpInst::FCMP_OGE \|\| VecPred == CmpInst::FCMP_OEQ \|\|
MVT::v2i64};		VecPred == CmpInst::FCMP_UNE) {
		static const auto ValidMinMaxTys = {
		MVT::v8i8, MVT::v16i8, MVT::v4i16, MVT::v8i16, MVT::v2i32,
		MVT::v4i32, MVT::v2i64, MVT::v2f32, MVT::v4f32, MVT::v2f64};
		static const auto ValidFP16MinMaxTys = {MVT::v4f16, MVT::v8f16};

auto LT = TLI->getTypeLegalizationCost(DL, ValTy);		auto LT = TLI->getTypeLegalizationCost(DL, ValTy);
if (any_of(ValidMinMaxTys, [&LT](MVT M) { return M == LT.second; }))		if (any_of(ValidMinMaxTys, [&LT](MVT M) { return M == LT.second; }) \|\|
		(ST->hasFullFP16() &&
		any_of(ValidFP16MinMaxTys, [&LT](MVT M) { return M == LT.second; })))
return LT.first;		return LT.first;
}		}

static const TypeConversionCostTblEntry		static const TypeConversionCostTblEntry
VectorSelectTbl[] = {		VectorSelectTbl[] = {
{ ISD::SELECT, MVT::v16i1, MVT::v16i16, 16 },		{ ISD::SELECT, MVT::v16i1, MVT::v16i16, 16 },
{ ISD::SELECT, MVT::v8i1, MVT::v8i32, 8 },		{ ISD::SELECT, MVT::v8i1, MVT::v8i32, 8 },
{ ISD::SELECT, MVT::v16i1, MVT::v16i32, 16 },		{ ISD::SELECT, MVT::v16i1, MVT::v16i32, 16 },
▲ Show 20 Lines • Show All 754 Lines • Show Last 20 Lines

llvm/test/Analysis/CostModel/AArch64/vector-select.ll

Show First 20 Lines • Show All 152 Lines • ▼ Show 20 Lines	define <2 x i64> @v2i64_select_no_cmp(<2 x i64> %a, <2 x i64> %b, <2 x i1> %cond) {
ret <2 x i64> %s.1		ret <2 x i64> %s.1
}		}

define <4 x half> @v4f16_select_ogt(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_ogt(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_ogt		; COST-LABEL: v4f16_select_ogt
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp ogt <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp ogt <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_ogt		; CODE-LABEL: v4f16_select_ogt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmgt v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ogt <4 x half> %a, %b		%cmp.1 = fcmp ogt <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_ogt(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_ogt(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_ogt		; COST-LABEL: v8f16_select_ogt
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp ogt <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp ogt <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_ogt		; CODE-LABEL: v8f16_select_ogt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmgt v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ogt <8 x half> %a, %b		%cmp.1 = fcmp ogt <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_ogt(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_ogt(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_ogt		; COST-LABEL: v2f32_select_ogt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_ogt		; CODE-LABEL: v2f32_select_ogt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmgt v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ogt <2 x float> %a, %b		%cmp.1 = fcmp ogt <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_ogt(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_ogt(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_ogt		; COST-LABEL: v4f32_select_ogt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_ogt		; CODE-LABEL: v4f32_select_ogt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmgt v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ogt <4 x float> %a, %b		%cmp.1 = fcmp ogt <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_ogt(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_ogt(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_ogt		; COST-LABEL: v2f64_select_ogt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ogt <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_ogt		; CODE-LABEL: v2f64_select_ogt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmgt v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ogt <2 x double> %a, %b		%cmp.1 = fcmp ogt <2 x double> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
ret <2 x double> %s.1		ret <2 x double> %s.1
}		}

define <4 x half> @v4f16_select_oge(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_oge(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_oge		; COST-LABEL: v4f16_select_oge
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp oge <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp oge <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_oge		; CODE-LABEL: v4f16_select_oge
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmge v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oge <4 x half> %a, %b		%cmp.1 = fcmp oge <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_oge(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_oge(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_oge		; COST-LABEL: v8f16_select_oge
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp oge <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp oge <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_oge		; CODE-LABEL: v8f16_select_oge
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmge v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oge <8 x half> %a, %b		%cmp.1 = fcmp oge <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_oge(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_oge(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_oge		; COST-LABEL: v2f32_select_oge
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_oge		; CODE-LABEL: v2f32_select_oge
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmge v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oge <2 x float> %a, %b		%cmp.1 = fcmp oge <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_oge(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_oge(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_oge		; COST-LABEL: v4f32_select_oge
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_oge		; CODE-LABEL: v4f32_select_oge
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmge v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oge <4 x float> %a, %b		%cmp.1 = fcmp oge <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_oge(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_oge(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_oge		; COST-LABEL: v2f64_select_oge
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oge <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_oge		; CODE-LABEL: v2f64_select_oge
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmge v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oge <2 x double> %a, %b		%cmp.1 = fcmp oge <2 x double> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
ret <2 x double> %s.1		ret <2 x double> %s.1
}		}

define <4 x half> @v4f16_select_olt(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_olt(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_olt		; COST-LABEL: v4f16_select_olt
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp olt <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp olt <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_olt		; CODE-LABEL: v4f16_select_olt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmgt v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp olt <4 x half> %a, %b		%cmp.1 = fcmp olt <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_olt(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_olt(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_olt		; COST-LABEL: v8f16_select_olt
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp olt <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp olt <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_olt		; CODE-LABEL: v8f16_select_olt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmgt v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp olt <8 x half> %a, %b		%cmp.1 = fcmp olt <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_olt(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_olt(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_olt		; COST-LABEL: v2f32_select_olt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_olt		; CODE-LABEL: v2f32_select_olt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmgt v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp olt <2 x float> %a, %b		%cmp.1 = fcmp olt <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_olt(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_olt(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_olt		; COST-LABEL: v4f32_select_olt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_olt		; CODE-LABEL: v4f32_select_olt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmgt v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp olt <4 x float> %a, %b		%cmp.1 = fcmp olt <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_olt(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_olt(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_olt		; COST-LABEL: v2f64_select_olt
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp olt <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_olt		; CODE-LABEL: v2f64_select_olt
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmgt v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmgt v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp olt <2 x double> %a, %b		%cmp.1 = fcmp olt <2 x double> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
ret <2 x double> %s.1		ret <2 x double> %s.1
}		}

define <4 x half> @v4f16_select_ole(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_ole(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_ole		; COST-LABEL: v4f16_select_ole
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp ole <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp ole <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_ole		; CODE-LABEL: v4f16_select_ole
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmge v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ole <4 x half> %a, %b		%cmp.1 = fcmp ole <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_ole(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_ole(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_ole		; COST-LABEL: v8f16_select_ole
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp ole <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp ole <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_ole		; CODE-LABEL: v8f16_select_ole
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmge v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ole <8 x half> %a, %b		%cmp.1 = fcmp ole <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_ole(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_ole(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_ole		; COST-LABEL: v2f32_select_ole
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_ole		; CODE-LABEL: v2f32_select_ole
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmge v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ole <2 x float> %a, %b		%cmp.1 = fcmp ole <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_ole(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_ole(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_ole		; COST-LABEL: v4f32_select_ole
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_ole		; CODE-LABEL: v4f32_select_ole
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmge v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ole <4 x float> %a, %b		%cmp.1 = fcmp ole <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_ole(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_ole(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_ole		; COST-LABEL: v2f64_select_ole
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp ole <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_ole		; CODE-LABEL: v2f64_select_ole
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmge v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmge v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp ole <2 x double> %a, %b		%cmp.1 = fcmp ole <2 x double> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		%s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
ret <2 x double> %s.1		ret <2 x double> %s.1
}		}

define <4 x half> @v4f16_select_oeq(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_oeq(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_oeq		; COST-LABEL: v4f16_select_oeq
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp oeq <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp oeq <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_oeq		; CODE-LABEL: v4f16_select_oeq
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmeq v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oeq <4 x half> %a, %b		%cmp.1 = fcmp oeq <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_oeq(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_oeq(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_oeq		; COST-LABEL: v8f16_select_oeq
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp oeq <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp oeq <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_oeq		; CODE-LABEL: v8f16_select_oeq
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmeq v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oeq <8 x half> %a, %b		%cmp.1 = fcmp oeq <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_oeq(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_oeq(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_oeq		; COST-LABEL: v2f32_select_oeq
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_oeq		; CODE-LABEL: v2f32_select_oeq
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmeq v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bif v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oeq <2 x float> %a, %b		%cmp.1 = fcmp oeq <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_oeq(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_oeq(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_oeq		; COST-LABEL: v4f32_select_oeq
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_oeq		; CODE-LABEL: v4f32_select_oeq
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmeq v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oeq <4 x float> %a, %b		%cmp.1 = fcmp oeq <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_oeq(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_oeq(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_oeq		; COST-LABEL: v2f64_select_oeq
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp oeq <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_oeq		; CODE-LABEL: v2f64_select_oeq
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmeq v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bif v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp oeq <2 x double> %a, %b		%cmp.1 = fcmp oeq <2 x double> %a, %b
▲ Show 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	;
ret <2 x double> %s.1		ret <2 x double> %s.1
}		}

define <4 x half> @v4f16_select_une(<4 x half> %a, <4 x half> %b, <4 x half> %c) {		define <4 x half> @v4f16_select_une(<4 x half> %a, <4 x half> %b, <4 x half> %c) {
; COST-LABEL: v4f16_select_une		; COST-LABEL: v4f16_select_une
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp une <4 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %cmp.1 = fcmp une <4 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <4 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <4 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
;		;
; CODE-LABEL: v4f16_select_une		; CODE-LABEL: v4f16_select_une
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h		; CODE-NEXT: fcmeq v{{.+}}.4h, v{{.+}}.4h, v{{.+}}.4h
; CODE-NEXT: bit v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bit v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp une <4 x half> %a, %b		%cmp.1 = fcmp une <4 x half> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c		%s.1 = select <4 x i1> %cmp.1, <4 x half> %a, <4 x half> %c
ret <4 x half> %s.1		ret <4 x half> %s.1
}		}

define <8 x half> @v8f16_select_une(<8 x half> %a, <8 x half> %b, <8 x half> %c) {		define <8 x half> @v8f16_select_une(<8 x half> %a, <8 x half> %b, <8 x half> %c) {
; COST-LABEL: v8f16_select_une		; COST-LABEL: v8f16_select_une
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp une <8 x half> %a, %b		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %cmp.1 = fcmp une <8 x half> %a, %b
; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-NOFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <8 x half> %a, %b		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <8 x half> %a, %b
; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 29 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		; COST-FULLFP16-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
;		;
; CODE-LABEL: v8f16_select_une		; CODE-LABEL: v8f16_select_une
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h		; CODE-NEXT: fcmeq v{{.+}}.8h, v{{.+}}.8h, v{{.+}}.8h
; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp une <8 x half> %a, %b		%cmp.1 = fcmp une <8 x half> %a, %b
%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c		%s.1 = select <8 x i1> %cmp.1, <8 x half> %a, <8 x half> %c
ret <8 x half> %s.1		ret <8 x half> %s.1
}		}

define <2 x float> @v2f32_select_une(<2 x float> %a, <2 x float> %b, <2 x float> %c) {		define <2 x float> @v2f32_select_une(<2 x float> %a, <2 x float> %b, <2 x float> %c) {
; COST-LABEL: v2f32_select_une		; COST-LABEL: v2f32_select_une
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <2 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <2 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
;		;
; CODE-LABEL: v2f32_select_une		; CODE-LABEL: v2f32_select_une
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s		; CODE-NEXT: fcmeq v{{.+}}.2s, v{{.+}}.2s, v{{.+}}.2s
; CODE-NEXT: bit v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b		; CODE-NEXT: bit v{{.+}}.8b, v{{.+}}.8b, v{{.+}}.8b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp une <2 x float> %a, %b		%cmp.1 = fcmp une <2 x float> %a, %b
%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c		%s.1 = select <2 x i1> %cmp.1, <2 x float> %a, <2 x float> %c
ret <2 x float> %s.1		ret <2 x float> %s.1
}		}

define <4 x float> @v4f32_select_une(<4 x float> %a, <4 x float> %b, <4 x float> %c) {		define <4 x float> @v4f32_select_une(<4 x float> %a, <4 x float> %b, <4 x float> %c) {
; COST-LABEL: v4f32_select_une		; COST-LABEL: v4f32_select_une
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <4 x float> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <4 x float> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 13 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
;		;
; CODE-LABEL: v4f32_select_une		; CODE-LABEL: v4f32_select_une
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s		; CODE-NEXT: fcmeq v{{.+}}.4s, v{{.+}}.4s, v{{.+}}.4s
; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp une <4 x float> %a, %b		%cmp.1 = fcmp une <4 x float> %a, %b
%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c		%s.1 = select <4 x i1> %cmp.1, <4 x float> %a, <4 x float> %c
ret <4 x float> %s.1		ret <4 x float> %s.1
}		}

define <2 x double> @v2f64_select_une(<2 x double> %a, <2 x double> %b, <2 x double> %c) {		define <2 x double> @v2f64_select_une(<2 x double> %a, <2 x double> %b, <2 x double> %c) {
; COST-LABEL: v2f64_select_une		; COST-LABEL: v2f64_select_une
; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <2 x double> %a, %b		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %cmp.1 = fcmp une <2 x double> %a, %b
; COST-NEXT: Cost Model: Found an estimated cost of 5 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c		; COST-NEXT: Cost Model: Found an estimated cost of 1 for instruction: %s.1 = select <2 x i1> %cmp.1, <2 x double> %a, <2 x double> %c
;		;
; CODE-LABEL: v2f64_select_une		; CODE-LABEL: v2f64_select_une
; CODE: bb.0		; CODE: bb.0
; CODE-NEXT: fcmeq v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d		; CODE-NEXT: fcmeq v{{.+}}.2d, v{{.+}}.2d, v{{.+}}.2d
; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b		; CODE-NEXT: bit v{{.+}}.16b, v{{.+}}.16b, v{{.+}}.16b
; CODE-NEXT: ret		; CODE-NEXT: ret
;		;
%cmp.1 = fcmp une <2 x double> %a, %b		%cmp.1 = fcmp une <2 x double> %a, %b
▲ Show 20 Lines • Show All 57 Lines • Show Last 20 Lines