This is an archive of the discontinued LLVM Phabricator instance.

lib/Target/PowerPC/PPCInstrAltivec.td
931 ↗	(On Diff #148430)	Hi Nemanja, I have one concern on whether these two hardware instructions for vector float point can be perfectly mapped to these two ISDNode. As the description of fmaxnum/fminnum "in the case where a single input is NaN, the non-NaN input is returned.", while the description for the vmaxfp/vminfp in ISA like "The maximum of +0 and -0 is +0. The maximum of any value and a NaN is a QNaN." It looks more suitable for the fmaxnan/fminnan?
lib/Target/PowerPC/PPCInstrVSX.td
1507 ↗	(On Diff #148430)	I'm not sure why not use similar patterns like VMAXSW in PPCInstrAltivec.td but being located in HasP8Vector scope? Is there some special reasons with COPY_TO_REGCLASS?
test/CodeGen/PowerPC/vec-min-max.ll
53 ↗	(On Diff #148430)	Some more cases cover sge/sle seems trivial but the coverage is better?

jedilyn added a reviewer: jedilyn.Jul 18 2018, 6:13 AM

@jedilyn Hi Ke Wen, thanks for your comments. This needs some cleanup with regard to which instructions match the semantics of F{MIN|MAX}NUM vs. F{MIN|MAX}NAN. I'll clean that up and re-post this for your review. Thanks.

lib/Target/PowerPC/PPCInstrAltivec.td
931 ↗	(On Diff #148430)	I am really sorry about such a long delay in responding to this. You are absolutely right. The correct nodes are `FMINNAN` and `FMAXNAN`. I think what I meant to use here are `XVMAXDP, XVMINDP, XVMAXSP, XVMINSP`. Those have the mentioned semantics (i.e. comparing a value and a NaN returns the value). And I don't think we need to worry about signalling NaNs at this time.
lib/Target/PowerPC/PPCInstrVSX.td
1507 ↗	(On Diff #148430)	These instructions are new in ISA 2.07 so they have to be in the P8Vector block. Also, the `COPY_TO_REGCLASS` is needed because `VRRC` registers cannot contain `v2i64` values.

jedilyn mentioned this in D54783: [PowerPC] suboptimal vec_abs for some cases on P9.Dec 14 2018, 12:35 AM

RKSimon added a subscriber: RKSimon.Dec 14 2018, 12:36 AM

Herald added a subscriber: jsji. · View Herald TranscriptDec 14 2018, 12:36 AM

This covers PR39130 right?

Maybe worth adding the new vec-min-max.ll test file to trunk with current codegen so this patch shows the improved codegen diff?

test/CodeGen/PowerPC/vec-min-max.ll
5 ↗	(On Diff #148430)	You might be better using common prefixes to share (and reduce) checks. --check-prefixes=CHECK,P8VEC --check-prefixes=CHECK,NOP8VEC

In D47332#1330931, @RKSimon wrote:

This covers PR39130 right?

Maybe worth adding the new vec-min-max.ll test file to trunk with current codegen so this patch shows the improved codegen diff?

Yes, this will fix the PR for vector types. Scalar types will come later.

test/CodeGen/PowerPC/vec-min-max.ll
5 ↗	(On Diff #148430)	Oh cool. I was not aware of this functionality. Thank you.

Updated to remove the patterns for the Altivec versions of vector min/max as they have IEEE semantics wrt. handling NaN. A subsequent patch will legalize the _IEEE versions of the nodes for single precision and provide patterns to match them to vmaxfp/vminfp.

Is anything happening with this? We've hit PPC issues with ISD::ABS on https://reviews.llvm.org/D49837 and I noticed that PPCTargetLowering::LowerABS has a reference to this patch.

Herald added a project: Restricted Project. · View Herald TranscriptMar 11 2019, 4:22 AM

Herald added a subscriber: jdoerfert. · View Herald Transcript

In D47332#1424366, @RKSimon wrote:

Is anything happening with this? We've hit PPC issues with ISD::ABS on https://reviews.llvm.org/D49837 and I noticed that PPCTargetLowering::LowerABS has a reference to this patch.

Thanks for pointing this out - I forgot about this patch. It should be ready to commit, perhaps @hfinkel, @echristo or @jsji can go through this and give their opinion on whether we can go ahead with this.

LGTM. Thanks for your time!

This revision is now accepted and ready to land.Mar 12 2019, 8:44 PM

Closed by commit rL362759: [PowerPC] Exploit the vector min/max instructions (authored by nemanjai). · Explain WhyJun 6 2019, 4:47 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

PowerPC/

PPCISelLowering.cpp

18 lines

PPCInstrAltivec.td

26 lines

PPCInstrVSX.td

21 lines

test/

CodeGen/

PowerPC/

ctr-minmaxnum.ll

21 lines

sat-add.ll

24 lines

vec-min-max.ll

239 lines

vector-constrained-fp-intrinsics.ll

524 lines

Diff 203468

llvm/trunk/lib/Target/PowerPC/PPCISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 549 Lines • ▼ Show 20 Lines	PPCTargetLowering::PPCTargetLowering(const PPCTargetMachine &TM,
if (Subtarget.hasAltivec()) {		if (Subtarget.hasAltivec()) {
// First set operation action for all vector types to expand. Then we		// First set operation action for all vector types to expand. Then we
// will selectively turn on ones that can be effectively codegen'd.		// will selectively turn on ones that can be effectively codegen'd.
for (MVT VT : MVT::vector_valuetypes()) {		for (MVT VT : MVT::vector_valuetypes()) {
// add/sub are legal for all supported vector VT's.		// add/sub are legal for all supported vector VT's.
setOperationAction(ISD::ADD, VT, Legal);		setOperationAction(ISD::ADD, VT, Legal);
setOperationAction(ISD::SUB, VT, Legal);		setOperationAction(ISD::SUB, VT, Legal);

		// For v2i64, these are only valid with P8Vector. This is corrected after
		// the loop.
		setOperationAction(ISD::SMAX, VT, Legal);
		setOperationAction(ISD::SMIN, VT, Legal);
		setOperationAction(ISD::UMAX, VT, Legal);
		setOperationAction(ISD::UMIN, VT, Legal);

		if (Subtarget.hasVSX()) {
		setOperationAction(ISD::FMAXNUM, VT, Legal);
		setOperationAction(ISD::FMINNUM, VT, Legal);
		}

// Vector instructions introduced in P8		// Vector instructions introduced in P8
if (Subtarget.hasP8Altivec() && (VT.SimpleTy != MVT::v1i128)) {		if (Subtarget.hasP8Altivec() && (VT.SimpleTy != MVT::v1i128)) {
setOperationAction(ISD::CTPOP, VT, Legal);		setOperationAction(ISD::CTPOP, VT, Legal);
setOperationAction(ISD::CTLZ, VT, Legal);		setOperationAction(ISD::CTLZ, VT, Legal);
}		}
else {		else {
setOperationAction(ISD::CTPOP, VT, Expand);		setOperationAction(ISD::CTPOP, VT, Expand);
setOperationAction(ISD::CTLZ, VT, Expand);		setOperationAction(ISD::CTLZ, VT, Expand);
▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	for (MVT VT : MVT::vector_valuetypes()) {

for (MVT InnerVT : MVT::vector_valuetypes()) {		for (MVT InnerVT : MVT::vector_valuetypes()) {
setTruncStoreAction(VT, InnerVT, Expand);		setTruncStoreAction(VT, InnerVT, Expand);
setLoadExtAction(ISD::SEXTLOAD, VT, InnerVT, Expand);		setLoadExtAction(ISD::SEXTLOAD, VT, InnerVT, Expand);
setLoadExtAction(ISD::ZEXTLOAD, VT, InnerVT, Expand);		setLoadExtAction(ISD::ZEXTLOAD, VT, InnerVT, Expand);
setLoadExtAction(ISD::EXTLOAD, VT, InnerVT, Expand);		setLoadExtAction(ISD::EXTLOAD, VT, InnerVT, Expand);
}		}
}		}
		if (!Subtarget.hasP8Vector()) {
		setOperationAction(ISD::SMAX, MVT::v2i64, Expand);
		setOperationAction(ISD::SMIN, MVT::v2i64, Expand);
		setOperationAction(ISD::UMAX, MVT::v2i64, Expand);
		setOperationAction(ISD::UMIN, MVT::v2i64, Expand);
		}

for (auto VT : {MVT::v2i64, MVT::v4i32, MVT::v8i16, MVT::v16i8})		for (auto VT : {MVT::v2i64, MVT::v4i32, MVT::v8i16, MVT::v16i8})
setOperationAction(ISD::ABS, VT, Custom);		setOperationAction(ISD::ABS, VT, Custom);

// We can custom expand all VECTOR_SHUFFLEs to VPERM, others we can handle		// We can custom expand all VECTOR_SHUFFLEs to VPERM, others we can handle
// with merges, splats, etc.		// with merges, splats, etc.
setOperationAction(ISD::VECTOR_SHUFFLE, MVT::v16i8, Custom);		setOperationAction(ISD::VECTOR_SHUFFLE, MVT::v16i8, Custom);

▲ Show 20 Lines • Show All 14,559 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/PowerPC/PPCInstrAltivec.td

	Show First 20 Lines • Show All 894 Lines • ▼ Show 20 Lines
	def : Pat<(v2i64 (bitconvert (v1i128 VRRC:$src))), (v2i64 VRRC:$src)>;			def : Pat<(v2i64 (bitconvert (v1i128 VRRC:$src))), (v2i64 VRRC:$src)>;

	def : Pat<(v1i128 (bitconvert (v16i8 VRRC:$src))), (v1i128 VRRC:$src)>;			def : Pat<(v1i128 (bitconvert (v16i8 VRRC:$src))), (v1i128 VRRC:$src)>;
	def : Pat<(v1i128 (bitconvert (v8i16 VRRC:$src))), (v1i128 VRRC:$src)>;			def : Pat<(v1i128 (bitconvert (v8i16 VRRC:$src))), (v1i128 VRRC:$src)>;
	def : Pat<(v1i128 (bitconvert (v4i32 VRRC:$src))), (v1i128 VRRC:$src)>;			def : Pat<(v1i128 (bitconvert (v4i32 VRRC:$src))), (v1i128 VRRC:$src)>;
	def : Pat<(v1i128 (bitconvert (v4f32 VRRC:$src))), (v1i128 VRRC:$src)>;			def : Pat<(v1i128 (bitconvert (v4f32 VRRC:$src))), (v1i128 VRRC:$src)>;
	def : Pat<(v1i128 (bitconvert (v2i64 VRRC:$src))), (v1i128 VRRC:$src)>;			def : Pat<(v1i128 (bitconvert (v2i64 VRRC:$src))), (v1i128 VRRC:$src)>;

				// Max/Min
				def : Pat<(v16i8 (umax v16i8:$src1, v16i8:$src2)),
				(v16i8 (VMAXUB $src1, $src2))>;
				def : Pat<(v16i8 (smax v16i8:$src1, v16i8:$src2)),
				(v16i8 (VMAXSB $src1, $src2))>;
				def : Pat<(v8i16 (umax v8i16:$src1, v8i16:$src2)),
				(v8i16 (VMAXUH $src1, $src2))>;
				def : Pat<(v8i16 (smax v8i16:$src1, v8i16:$src2)),
				(v8i16 (VMAXSH $src1, $src2))>;
				def : Pat<(v4i32 (umax v4i32:$src1, v4i32:$src2)),
				(v4i32 (VMAXUW $src1, $src2))>;
				def : Pat<(v4i32 (smax v4i32:$src1, v4i32:$src2)),
				(v4i32 (VMAXSW $src1, $src2))>;
				def : Pat<(v16i8 (umin v16i8:$src1, v16i8:$src2)),
				(v16i8 (VMINUB $src1, $src2))>;
				def : Pat<(v16i8 (smin v16i8:$src1, v16i8:$src2)),
				(v16i8 (VMINSB $src1, $src2))>;
				def : Pat<(v8i16 (umin v8i16:$src1, v8i16:$src2)),
				(v8i16 (VMINUH $src1, $src2))>;
				def : Pat<(v8i16 (smin v8i16:$src1, v8i16:$src2)),
				(v8i16 (VMINSH $src1, $src2))>;
				def : Pat<(v4i32 (umin v4i32:$src1, v4i32:$src2)),
				(v4i32 (VMINUW $src1, $src2))>;
				def : Pat<(v4i32 (smin v4i32:$src1, v4i32:$src2)),
				(v4i32 (VMINSW $src1, $src2))>;

	// Shuffles.			// Shuffles.

	// Match vsldoi(x,x), vpkuwum(x,x), vpkuhum(x,x)			// Match vsldoi(x,x), vpkuwum(x,x), vpkuhum(x,x)
	def:Pat<(vsldoi_unary_shuffle:$in v16i8:$vA, undef),			def:Pat<(vsldoi_unary_shuffle:$in v16i8:$vA, undef),
	(VSLDOI $vA, $vA, (VSLDOI_unary_get_imm $in))>;			(VSLDOI $vA, $vA, (VSLDOI_unary_get_imm $in))>;
	def:Pat<(vpkuwum_unary_shuffle v16i8:$vA, undef),			def:Pat<(vpkuwum_unary_shuffle v16i8:$vA, undef),
	(VPKUWUM $vA, $vA)>;			(VPKUWUM $vA, $vA)>;
	def:Pat<(vpkuhum_unary_shuffle v16i8:$vA, undef),			def:Pat<(vpkuhum_unary_shuffle v16i8:$vA, undef),
	▲ Show 20 Lines • Show All 612 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/PowerPC/PPCInstrVSX.td

Show First 20 Lines • Show All 1,182 Lines • ▼ Show 20 Lines	def : Pat<(vselect v4i32:$vA, v4i32:$vB, v4i32:$vC),
(XXSEL $vC, $vB, $vA)>;		(XXSEL $vC, $vB, $vA)>;
def : Pat<(vselect v2i64:$vA, v2i64:$vB, v2i64:$vC),		def : Pat<(vselect v2i64:$vA, v2i64:$vB, v2i64:$vC),
(XXSEL $vC, $vB, $vA)>;		(XXSEL $vC, $vB, $vA)>;
def : Pat<(vselect v4i32:$vA, v4f32:$vB, v4f32:$vC),		def : Pat<(vselect v4i32:$vA, v4f32:$vB, v4f32:$vC),
(XXSEL $vC, $vB, $vA)>;		(XXSEL $vC, $vB, $vA)>;
def : Pat<(vselect v2i64:$vA, v2f64:$vB, v2f64:$vC),		def : Pat<(vselect v2i64:$vA, v2f64:$vB, v2f64:$vC),
(XXSEL $vC, $vB, $vA)>;		(XXSEL $vC, $vB, $vA)>;

		def : Pat<(v4f32 (fmaxnum v4f32:$src1, v4f32:$src2)),
		(v4f32 (XVMAXSP $src1, $src2))>;
		def : Pat<(v4f32 (fminnum v4f32:$src1, v4f32:$src2)),
		(v4f32 (XVMINSP $src1, $src2))>;
		def : Pat<(v2f64 (fmaxnum v2f64:$src1, v2f64:$src2)),
		(v2f64 (XVMAXDP $src1, $src2))>;
		def : Pat<(v2f64 (fminnum v2f64:$src1, v2f64:$src2)),
		(v2f64 (XVMINDP $src1, $src2))>;

let Predicates = [IsLittleEndian] in {		let Predicates = [IsLittleEndian] in {
def : Pat<(f64 (PPCfcfid (PPCmtvsra (i64 (vector_extract v2i64:$S, 0))))),		def : Pat<(f64 (PPCfcfid (PPCmtvsra (i64 (vector_extract v2i64:$S, 0))))),
(f64 (XSCVSXDDP (COPY_TO_REGCLASS (XXPERMDI $S, $S, 2), VSFRC)))>;		(f64 (XSCVSXDDP (COPY_TO_REGCLASS (XXPERMDI $S, $S, 2), VSFRC)))>;
def : Pat<(f64 (PPCfcfid (PPCmtvsra (i64 (vector_extract v2i64:$S, 1))))),		def : Pat<(f64 (PPCfcfid (PPCmtvsra (i64 (vector_extract v2i64:$S, 1))))),
(f64 (XSCVSXDDP (COPY_TO_REGCLASS (f64 (COPY_TO_REGCLASS $S, VSRC)), VSFRC)))>;		(f64 (XSCVSXDDP (COPY_TO_REGCLASS (f64 (COPY_TO_REGCLASS $S, VSRC)), VSFRC)))>;
def : Pat<(f64 (PPCfcfidu (PPCmtvsra (i64 (vector_extract v2i64:$S, 0))))),		def : Pat<(f64 (PPCfcfidu (PPCmtvsra (i64 (vector_extract v2i64:$S, 0))))),
(f64 (XSCVUXDDP (COPY_TO_REGCLASS (XXPERMDI $S, $S, 2), VSFRC)))>;		(f64 (XSCVUXDDP (COPY_TO_REGCLASS (XXPERMDI $S, $S, 2), VSFRC)))>;
def : Pat<(f64 (PPCfcfidu (PPCmtvsra (i64 (vector_extract v2i64:$S, 1))))),		def : Pat<(f64 (PPCfcfidu (PPCmtvsra (i64 (vector_extract v2i64:$S, 1))))),
▲ Show 20 Lines • Show All 318 Lines • ▼ Show 20 Lines	let AddedComplexity = 400 in { // Prefer VSX patterns over non-VSX patterns.
// Instructions for converting float to i32 feeding a store.		// Instructions for converting float to i32 feeding a store.
def : Pat<(PPCstore_scal_int_from_vsr		def : Pat<(PPCstore_scal_int_from_vsr
(f64 (PPCcv_fp_to_sint_in_vsr f64:$src)), xoaddr:$dst, 4),		(f64 (PPCcv_fp_to_sint_in_vsr f64:$src)), xoaddr:$dst, 4),
(STIWX (XSCVDPSXWS f64:$src), xoaddr:$dst)>;		(STIWX (XSCVDPSXWS f64:$src), xoaddr:$dst)>;
def : Pat<(PPCstore_scal_int_from_vsr		def : Pat<(PPCstore_scal_int_from_vsr
(f64 (PPCcv_fp_to_uint_in_vsr f64:$src)), xoaddr:$dst, 4),		(f64 (PPCcv_fp_to_uint_in_vsr f64:$src)), xoaddr:$dst, 4),
(STIWX (XSCVDPUXWS f64:$src), xoaddr:$dst)>;		(STIWX (XSCVDPUXWS f64:$src), xoaddr:$dst)>;

		def : Pat<(v2i64 (smax v2i64:$src1, v2i64:$src2)),
		(v2i64 (VMAXSD (COPY_TO_REGCLASS $src1, VRRC),
		(COPY_TO_REGCLASS $src2, VRRC)))>;
		def : Pat<(v2i64 (umax v2i64:$src1, v2i64:$src2)),
		(v2i64 (VMAXUD (COPY_TO_REGCLASS $src1, VRRC),
		(COPY_TO_REGCLASS $src2, VRRC)))>;
		def : Pat<(v2i64 (smin v2i64:$src1, v2i64:$src2)),
		(v2i64 (VMINSD (COPY_TO_REGCLASS $src1, VRRC),
		(COPY_TO_REGCLASS $src2, VRRC)))>;
		def : Pat<(v2i64 (umin v2i64:$src1, v2i64:$src2)),
		(v2i64 (VMINUD (COPY_TO_REGCLASS $src1, VRRC),
		(COPY_TO_REGCLASS $src2, VRRC)))>;
} // AddedComplexity = 400		} // AddedComplexity = 400
} // HasP8Vector		} // HasP8Vector

let AddedComplexity = 400 in {		let AddedComplexity = 400 in {
let Predicates = [HasDirectMove] in {		let Predicates = [HasDirectMove] in {
// VSX direct move instructions		// VSX direct move instructions
def MFVSRD : XX1_RS6_RD5_XO<31, 51, (outs g8rc:$rA), (ins vsfrc:$XT),		def MFVSRD : XX1_RS6_RD5_XO<31, 51, (outs g8rc:$rA), (ins vsfrc:$XT),
"mfvsrd $rA, $XT", IIC_VecGeneral,		"mfvsrd $rA, $XT", IIC_VecGeneral,
▲ Show 20 Lines • Show All 2,637 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/PowerPC/ctr-minmaxnum.ll

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	loop_body:
%2 = icmp eq i64 %1, 4		%2 = icmp eq i64 %1, 4
br i1 %2, label %loop_exit, label %loop_body		br i1 %2, label %loop_exit, label %loop_body

loop_exit:		loop_exit:
ret void		ret void
}		}

; CHECK-LABEL: test1v:		; CHECK-LABEL: test1v:
; CHECK: bl fminf		; CHECK: xvminsp
; CHECK-NOT: mtctr		; CHECK-NOT: bl fminf
; CHECK: bl fminf		; CHECK: mtctr
; CHECK-NOT: mtctr
; CHECK: bl fminf
; CHECK-NOT: mtctr
; CHECK: bl fminf
; CHECK-NOT: bl fminf		; CHECK-NOT: bl fminf
; CHECK: blr		; CHECK: blr

; QPX-LABEL: test1v:		; QPX-LABEL: test1v:
; QPX: mtctr		; QPX: mtctr
; QPX-NOT: bl fminf		; QPX-NOT: bl fminf
; QPX: blr		; QPX: blr

▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	loop_body:
%2 = icmp eq i64 %1, 4		%2 = icmp eq i64 %1, 4
br i1 %2, label %loop_exit, label %loop_body		br i1 %2, label %loop_exit, label %loop_body

loop_exit:		loop_exit:
ret void		ret void
}		}

; CHECK-LABEL: test2v:		; CHECK-LABEL: test2v:
; CHECK: bl fmax		; CHECK: xvmaxdp
; CHECK-NOT: mtctr		; CHECK: xvmaxdp
; CHECK: bl fmax		; CHECK-NOT: bl fmax
; CHECK-NOT: mtctr		; CHECK: mtctr
; CHECK: bl fmax
; CHECK-NOT: mtctr
; CHECK: bl fmax
; CHECK-NOT: bl fmax		; CHECK-NOT: bl fmax
; CHECK: blr		; CHECK: blr

; QPX-LABEL: test2v:		; QPX-LABEL: test2v:
; QPX: mtctr		; QPX: mtctr
; QPX-NOT: bl fmax		; QPX-NOT: bl fmax
; QPX: blr		; QPX: blr

▲ Show 20 Lines • Show All 115 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/PowerPC/sat-add.ll

Show First 20 Lines • Show All 376 Lines • ▼ Show 20 Lines
define <16 x i8> @unsigned_sat_constant_v16i8_using_min(<16 x i8> %x) {		define <16 x i8> @unsigned_sat_constant_v16i8_using_min(<16 x i8> %x) {
; CHECK-LABEL: unsigned_sat_constant_v16i8_using_min:		; CHECK-LABEL: unsigned_sat_constant_v16i8_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: addis 3, 2, .LCPI24_0@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI24_0@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI24_0@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI24_0@toc@l
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: addis 3, 2, .LCPI24_1@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI24_1@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI24_1@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI24_1@toc@l
; CHECK-NEXT: vcmpgtub 4, 3, 2		; CHECK-NEXT: vminub 2, 2, 3
; CHECK-NEXT: xxsel 34, 35, 34, 36
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: vaddubm 2, 2, 3		; CHECK-NEXT: vaddubm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%c = icmp ult <16 x i8> %x, <i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43>		%c = icmp ult <16 x i8> %x, <i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43>
%s = select <16 x i1> %c, <16 x i8> %x, <16 x i8> <i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43>		%s = select <16 x i1> %c, <16 x i8> %x, <16 x i8> <i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43, i8 -43>
%r = add <16 x i8> %s, <i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42>		%r = add <16 x i8> %s, <i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42, i8 42>
ret <16 x i8> %r		ret <16 x i8> %r
}		}
Show All 38 Lines
define <8 x i16> @unsigned_sat_constant_v8i16_using_min(<8 x i16> %x) {		define <8 x i16> @unsigned_sat_constant_v8i16_using_min(<8 x i16> %x) {
; CHECK-LABEL: unsigned_sat_constant_v8i16_using_min:		; CHECK-LABEL: unsigned_sat_constant_v8i16_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: addis 3, 2, .LCPI27_0@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI27_0@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI27_0@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI27_0@toc@l
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: addis 3, 2, .LCPI27_1@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI27_1@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI27_1@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI27_1@toc@l
; CHECK-NEXT: vcmpgtuh 4, 3, 2		; CHECK-NEXT: vminuh 2, 2, 3
; CHECK-NEXT: xxsel 34, 35, 34, 36
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: vadduhm 2, 2, 3		; CHECK-NEXT: vadduhm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%c = icmp ult <8 x i16> %x, <i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43>		%c = icmp ult <8 x i16> %x, <i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43>
%s = select <8 x i1> %c, <8 x i16> %x, <8 x i16> <i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43>		%s = select <8 x i1> %c, <8 x i16> %x, <8 x i16> <i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43, i16 -43>
%r = add <8 x i16> %s, <i16 42, i16 42, i16 42, i16 42, i16 42, i16 42, i16 42, i16 42>		%r = add <8 x i16> %s, <i16 42, i16 42, i16 42, i16 42, i16 42, i16 42, i16 42, i16 42>
ret <8 x i16> %r		ret <8 x i16> %r
}		}
Show All 38 Lines
define <4 x i32> @unsigned_sat_constant_v4i32_using_min(<4 x i32> %x) {		define <4 x i32> @unsigned_sat_constant_v4i32_using_min(<4 x i32> %x) {
; CHECK-LABEL: unsigned_sat_constant_v4i32_using_min:		; CHECK-LABEL: unsigned_sat_constant_v4i32_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: addis 3, 2, .LCPI30_0@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI30_0@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI30_0@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI30_0@toc@l
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: addis 3, 2, .LCPI30_1@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI30_1@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI30_1@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI30_1@toc@l
; CHECK-NEXT: vcmpgtuw 4, 3, 2		; CHECK-NEXT: vminuw 2, 2, 3
; CHECK-NEXT: xxsel 34, 35, 34, 36
; CHECK-NEXT: lvx 3, 0, 3		; CHECK-NEXT: lvx 3, 0, 3
; CHECK-NEXT: vadduwm 2, 2, 3		; CHECK-NEXT: vadduwm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%c = icmp ult <4 x i32> %x, <i32 -43, i32 -43, i32 -43, i32 -43>		%c = icmp ult <4 x i32> %x, <i32 -43, i32 -43, i32 -43, i32 -43>
%s = select <4 x i1> %c, <4 x i32> %x, <4 x i32> <i32 -43, i32 -43, i32 -43, i32 -43>		%s = select <4 x i1> %c, <4 x i32> %x, <4 x i32> <i32 -43, i32 -43, i32 -43, i32 -43>
%r = add <4 x i32> %s, <i32 42, i32 42, i32 42, i32 42>		%r = add <4 x i32> %s, <i32 42, i32 42, i32 42, i32 42>
ret <4 x i32> %r		ret <4 x i32> %r
}		}
Show All 40 Lines
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: addis 3, 2, .LCPI33_0@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI33_0@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI33_0@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI33_0@toc@l
; CHECK-NEXT: lxvd2x 0, 0, 3		; CHECK-NEXT: lxvd2x 0, 0, 3
; CHECK-NEXT: addis 3, 2, .LCPI33_1@toc@ha		; CHECK-NEXT: addis 3, 2, .LCPI33_1@toc@ha
; CHECK-NEXT: addi 3, 3, .LCPI33_1@toc@l		; CHECK-NEXT: addi 3, 3, .LCPI33_1@toc@l
; CHECK-NEXT: xxswapd 35, 0		; CHECK-NEXT: xxswapd 35, 0
; CHECK-NEXT: lxvd2x 0, 0, 3		; CHECK-NEXT: lxvd2x 0, 0, 3
; CHECK-NEXT: vcmpgtud 4, 3, 2		; CHECK-NEXT: vminud 2, 2, 3
; CHECK-NEXT: xxsel 34, 35, 34, 36
; CHECK-NEXT: xxswapd 35, 0		; CHECK-NEXT: xxswapd 35, 0
; CHECK-NEXT: vaddudm 2, 2, 3		; CHECK-NEXT: vaddudm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%c = icmp ult <2 x i64> %x, <i64 -43, i64 -43>		%c = icmp ult <2 x i64> %x, <i64 -43, i64 -43>
%s = select <2 x i1> %c, <2 x i64> %x, <2 x i64> <i64 -43, i64 -43>		%s = select <2 x i1> %c, <2 x i64> %x, <2 x i64> <i64 -43, i64 -43>
%r = add <2 x i64> %s, <i64 42, i64 42>		%r = add <2 x i64> %s, <i64 42, i64 42>
ret <2 x i64> %r		ret <2 x i64> %r
}		}
Show All 37 Lines	; CHECK-NEXT: blr
%r = select <2 x i1> %c, <2 x i64> <i64 -1, i64 -1>, <2 x i64> %a		%r = select <2 x i1> %c, <2 x i64> <i64 -1, i64 -1>, <2 x i64> %a
ret <2 x i64> %r		ret <2 x i64> %r
}		}

define <16 x i8> @unsigned_sat_variable_v16i8_using_min(<16 x i8> %x, <16 x i8> %y) {		define <16 x i8> @unsigned_sat_variable_v16i8_using_min(<16 x i8> %x, <16 x i8> %y) {
; CHECK-LABEL: unsigned_sat_variable_v16i8_using_min:		; CHECK-LABEL: unsigned_sat_variable_v16i8_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: xxlnor 36, 35, 35		; CHECK-NEXT: xxlnor 36, 35, 35
; CHECK-NEXT: vcmpgtub 5, 4, 2		; CHECK-NEXT: vminub 2, 2, 4
; CHECK-NEXT: xxsel 34, 36, 34, 37
; CHECK-NEXT: vaddubm 2, 2, 3		; CHECK-NEXT: vaddubm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%noty = xor <16 x i8> %y, <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>		%noty = xor <16 x i8> %y, <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>
%c = icmp ult <16 x i8> %x, %noty		%c = icmp ult <16 x i8> %x, %noty
%s = select <16 x i1> %c, <16 x i8> %x, <16 x i8> %noty		%s = select <16 x i1> %c, <16 x i8> %x, <16 x i8> %noty
%r = add <16 x i8> %s, %y		%r = add <16 x i8> %s, %y
ret <16 x i8> %r		ret <16 x i8> %r
}		}
Show All 27 Lines	; CHECK-NEXT: blr
%r = select <16 x i1> %c, <16 x i8> <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>, <16 x i8> %a		%r = select <16 x i1> %c, <16 x i8> <i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1, i8 -1>, <16 x i8> %a
ret <16 x i8> %r		ret <16 x i8> %r
}		}

define <8 x i16> @unsigned_sat_variable_v8i16_using_min(<8 x i16> %x, <8 x i16> %y) {		define <8 x i16> @unsigned_sat_variable_v8i16_using_min(<8 x i16> %x, <8 x i16> %y) {
; CHECK-LABEL: unsigned_sat_variable_v8i16_using_min:		; CHECK-LABEL: unsigned_sat_variable_v8i16_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: xxlnor 36, 35, 35		; CHECK-NEXT: xxlnor 36, 35, 35
; CHECK-NEXT: vcmpgtuh 5, 4, 2		; CHECK-NEXT: vminuh 2, 2, 4
; CHECK-NEXT: xxsel 34, 36, 34, 37
; CHECK-NEXT: vadduhm 2, 2, 3		; CHECK-NEXT: vadduhm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%noty = xor <8 x i16> %y, <i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1>		%noty = xor <8 x i16> %y, <i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1>
%c = icmp ult <8 x i16> %x, %noty		%c = icmp ult <8 x i16> %x, %noty
%s = select <8 x i1> %c, <8 x i16> %x, <8 x i16> %noty		%s = select <8 x i1> %c, <8 x i16> %x, <8 x i16> %noty
%r = add <8 x i16> %s, %y		%r = add <8 x i16> %s, %y
ret <8 x i16> %r		ret <8 x i16> %r
}		}
Show All 27 Lines	; CHECK-NEXT: blr
%r = select <8 x i1> %c, <8 x i16> <i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1>, <8 x i16> %a		%r = select <8 x i1> %c, <8 x i16> <i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1, i16 -1>, <8 x i16> %a
ret <8 x i16> %r		ret <8 x i16> %r
}		}

define <4 x i32> @unsigned_sat_variable_v4i32_using_min(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @unsigned_sat_variable_v4i32_using_min(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: unsigned_sat_variable_v4i32_using_min:		; CHECK-LABEL: unsigned_sat_variable_v4i32_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: xxlnor 36, 35, 35		; CHECK-NEXT: xxlnor 36, 35, 35
; CHECK-NEXT: vcmpgtuw 5, 4, 2		; CHECK-NEXT: vminuw 2, 2, 4
; CHECK-NEXT: xxsel 34, 36, 34, 37
; CHECK-NEXT: vadduwm 2, 2, 3		; CHECK-NEXT: vadduwm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%noty = xor <4 x i32> %y, <i32 -1, i32 -1, i32 -1, i32 -1>		%noty = xor <4 x i32> %y, <i32 -1, i32 -1, i32 -1, i32 -1>
%c = icmp ult <4 x i32> %x, %noty		%c = icmp ult <4 x i32> %x, %noty
%s = select <4 x i1> %c, <4 x i32> %x, <4 x i32> %noty		%s = select <4 x i1> %c, <4 x i32> %x, <4 x i32> %noty
%r = add <4 x i32> %s, %y		%r = add <4 x i32> %s, %y
ret <4 x i32> %r		ret <4 x i32> %r
}		}
Show All 27 Lines	; CHECK-NEXT: blr
%r = select <4 x i1> %c, <4 x i32> <i32 -1, i32 -1, i32 -1, i32 -1>, <4 x i32> %a		%r = select <4 x i1> %c, <4 x i32> <i32 -1, i32 -1, i32 -1, i32 -1>, <4 x i32> %a
ret <4 x i32> %r		ret <4 x i32> %r
}		}

define <2 x i64> @unsigned_sat_variable_v2i64_using_min(<2 x i64> %x, <2 x i64> %y) {		define <2 x i64> @unsigned_sat_variable_v2i64_using_min(<2 x i64> %x, <2 x i64> %y) {
; CHECK-LABEL: unsigned_sat_variable_v2i64_using_min:		; CHECK-LABEL: unsigned_sat_variable_v2i64_using_min:
; CHECK: # %bb.0:		; CHECK: # %bb.0:
; CHECK-NEXT: xxlnor 36, 35, 35		; CHECK-NEXT: xxlnor 36, 35, 35
; CHECK-NEXT: vcmpgtud 5, 4, 2		; CHECK-NEXT: vminud 2, 2, 4
; CHECK-NEXT: xxsel 34, 36, 34, 37
; CHECK-NEXT: vaddudm 2, 2, 3		; CHECK-NEXT: vaddudm 2, 2, 3
; CHECK-NEXT: blr		; CHECK-NEXT: blr
%noty = xor <2 x i64> %y, <i64 -1, i64 -1>		%noty = xor <2 x i64> %y, <i64 -1, i64 -1>
%c = icmp ult <2 x i64> %x, %noty		%c = icmp ult <2 x i64> %x, %noty
%s = select <2 x i1> %c, <2 x i64> %x, <2 x i64> %noty		%s = select <2 x i1> %c, <2 x i64> %x, <2 x i64> %noty
%r = add <2 x i64> %s, %y		%r = add <2 x i64> %s, %y
ret <2 x i64> %r		ret <2 x i64> %r
}		}
Show All 31 Lines

llvm/trunk/test/CodeGen/PowerPC/vec-min-max.ll

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=powerpc64le-unknown-unknown -mcpu=pwr8 \
				; RUN: -verify-machineinstrs \| FileCheck %s
				; RUN: llc < %s -mtriple=powerpc64le-unknown-unknown -mcpu=pwr7 \
				; RUN: -verify-machineinstrs \| FileCheck %s --check-prefix=NOP8VEC
				define <16 x i8> @getsmaxi8(<16 x i8> %a, <16 x i8> %b) {
				; CHECK-LABEL: getsmaxi8:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vmaxsb 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxi8:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vmaxsb 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp sgt <16 x i8> %a, %b
				%1 = select <16 x i1> %0, <16 x i8> %a, <16 x i8> %b
				ret <16 x i8> %1
				}

				define <8 x i16> @getsmaxi16(<8 x i16> %a, <8 x i16> %b) {
				; CHECK-LABEL: getsmaxi16:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vmaxsh 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxi16:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vmaxsh 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp sgt <8 x i16> %a, %b
				%1 = select <8 x i1> %0, <8 x i16> %a, <8 x i16> %b
				ret <8 x i16> %1
				}

				define <4 x i32> @getsmaxi32(<4 x i32> %a, <4 x i32> %b) {
				; CHECK-LABEL: getsmaxi32:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vmaxsw 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxi32:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vmaxsw 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp sgt <4 x i32> %a, %b
				%1 = select <4 x i1> %0, <4 x i32> %a, <4 x i32> %b
				ret <4 x i32> %1
				}

				define <2 x i64> @getsmaxi64(<2 x i64> %a, <2 x i64> %b) {
				; CHECK-LABEL: getsmaxi64:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vmaxsd 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxi64:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xxswapd 0, 35
				; NOP8VEC-NEXT: addi 3, 1, -32
				; NOP8VEC-NEXT: addi 4, 1, -48
				; NOP8VEC-NEXT: xxswapd 1, 34
				; NOP8VEC-NEXT: stxvd2x 0, 0, 3
				; NOP8VEC-NEXT: stxvd2x 1, 0, 4
				; NOP8VEC-NEXT: ld 3, -24(1)
				; NOP8VEC-NEXT: ld 4, -40(1)
				; NOP8VEC-NEXT: cmpd 4, 3
				; NOP8VEC-NEXT: li 3, 0
				; NOP8VEC-NEXT: li 4, -1
				; NOP8VEC-NEXT: isel 5, 4, 3, 1
				; NOP8VEC-NEXT: std 5, -8(1)
				; NOP8VEC-NEXT: ld 5, -32(1)
				; NOP8VEC-NEXT: ld 6, -48(1)
				; NOP8VEC-NEXT: cmpd 6, 5
				; NOP8VEC-NEXT: isel 3, 4, 3, 1
				; NOP8VEC-NEXT: std 3, -16(1)
				; NOP8VEC-NEXT: addi 3, 1, -16
				; NOP8VEC-NEXT: lxvd2x 0, 0, 3
				; NOP8VEC-NEXT: xxswapd 36, 0
				; NOP8VEC-NEXT: xxsel 34, 35, 34, 36
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp sgt <2 x i64> %a, %b
				%1 = select <2 x i1> %0, <2 x i64> %a, <2 x i64> %b
				ret <2 x i64> %1
				}

				define <4 x float> @getsmaxf32(<4 x float> %a, <4 x float> %b) {
				; CHECK-LABEL: getsmaxf32:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: xvmaxsp 34, 34, 35
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxf32:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xvmaxsp 34, 34, 35
				; NOP8VEC-NEXT: blr
				entry:
				%0 = fcmp fast oge <4 x float> %a, %b
				%1 = select <4 x i1> %0, <4 x float> %a, <4 x float> %b
				ret <4 x float> %1
				}

				define <2 x double> @getsmaxf64(<2 x double> %a, <2 x double> %b) {
				; CHECK-LABEL: getsmaxf64:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: xvmaxdp 34, 34, 35
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmaxf64:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xvmaxdp 34, 34, 35
				; NOP8VEC-NEXT: blr
				entry:
				%0 = fcmp fast oge <2 x double> %a, %b
				%1 = select <2 x i1> %0, <2 x double> %a, <2 x double> %b
				ret <2 x double> %1
				}

				define <16 x i8> @getsmini8(<16 x i8> %a, <16 x i8> %b) {
				; CHECK-LABEL: getsmini8:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vminsb 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmini8:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vminsb 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp slt <16 x i8> %a, %b
				%1 = select <16 x i1> %0, <16 x i8> %a, <16 x i8> %b
				ret <16 x i8> %1
				}

				define <8 x i16> @getsmini16(<8 x i16> %a, <8 x i16> %b) {
				; CHECK-LABEL: getsmini16:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vminsh 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmini16:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vminsh 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp slt <8 x i16> %a, %b
				%1 = select <8 x i1> %0, <8 x i16> %a, <8 x i16> %b
				ret <8 x i16> %1
				}

				define <4 x i32> @getsmini32(<4 x i32> %a, <4 x i32> %b) {
				; CHECK-LABEL: getsmini32:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vminsw 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmini32:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: vminsw 2, 2, 3
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp slt <4 x i32> %a, %b
				%1 = select <4 x i1> %0, <4 x i32> %a, <4 x i32> %b
				ret <4 x i32> %1
				}

				define <2 x i64> @getsmini64(<2 x i64> %a, <2 x i64> %b) {
				; CHECK-LABEL: getsmini64:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: vminsd 2, 2, 3
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsmini64:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xxswapd 0, 35
				; NOP8VEC-NEXT: addi 3, 1, -32
				; NOP8VEC-NEXT: addi 4, 1, -48
				; NOP8VEC-NEXT: xxswapd 1, 34
				; NOP8VEC-NEXT: stxvd2x 0, 0, 3
				; NOP8VEC-NEXT: stxvd2x 1, 0, 4
				; NOP8VEC-NEXT: ld 3, -24(1)
				; NOP8VEC-NEXT: ld 4, -40(1)
				; NOP8VEC-NEXT: cmpd 4, 3
				; NOP8VEC-NEXT: li 3, 0
				; NOP8VEC-NEXT: li 4, -1
				; NOP8VEC-NEXT: isel 5, 4, 3, 0
				; NOP8VEC-NEXT: std 5, -8(1)
				; NOP8VEC-NEXT: ld 5, -32(1)
				; NOP8VEC-NEXT: ld 6, -48(1)
				; NOP8VEC-NEXT: cmpd 6, 5
				; NOP8VEC-NEXT: isel 3, 4, 3, 0
				; NOP8VEC-NEXT: std 3, -16(1)
				; NOP8VEC-NEXT: addi 3, 1, -16
				; NOP8VEC-NEXT: lxvd2x 0, 0, 3
				; NOP8VEC-NEXT: xxswapd 36, 0
				; NOP8VEC-NEXT: xxsel 34, 35, 34, 36
				; NOP8VEC-NEXT: blr
				entry:
				%0 = icmp slt <2 x i64> %a, %b
				%1 = select <2 x i1> %0, <2 x i64> %a, <2 x i64> %b
				ret <2 x i64> %1
				}

				define <4 x float> @getsminf32(<4 x float> %a, <4 x float> %b) {
				; CHECK-LABEL: getsminf32:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: xvminsp 34, 34, 35
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsminf32:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xvminsp 34, 34, 35
				; NOP8VEC-NEXT: blr
				entry:
				%0 = fcmp fast ole <4 x float> %a, %b
				%1 = select <4 x i1> %0, <4 x float> %a, <4 x float> %b
				ret <4 x float> %1
				}

				define <2 x double> @getsminf64(<2 x double> %a, <2 x double> %b) {
				; CHECK-LABEL: getsminf64:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: xvmindp 34, 34, 35
				; CHECK-NEXT: blr
				;
				; NOP8VEC-LABEL: getsminf64:
				; NOP8VEC: # %bb.0: # %entry
				; NOP8VEC-NEXT: xvmindp 34, 34, 35
				; NOP8VEC-NEXT: blr
				entry:
				%0 = fcmp fast ole <2 x double> %a, %b
				%1 = select <2 x i1> %0, <2 x double> %a, <2 x double> %b
				ret <2 x double> %1
				}

llvm/trunk/test/CodeGen/PowerPC/vector-constrained-fp-intrinsics.ll

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 656 Lines • ▼ Show 20 Lines
; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload		; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload
; PC64LE9-NEXT: lfd 31, 72(1) # 8-byte Folded Reload		; PC64LE9-NEXT: lfd 31, 72(1) # 8-byte Folded Reload
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 35, 1, 0		; PC64LE9-NEXT: xxmrghd 35, 1, 0
; PC64LE9-NEXT: addi 1, 1, 80		; PC64LE9-NEXT: addi 1, 1, 80
; PC64LE9-NEXT: ld 0, 16(1)		; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0		; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
;
%rem = call <4 x double> @llvm.experimental.constrained.frem.v4f64(		%rem = call <4 x double> @llvm.experimental.constrained.frem.v4f64(
<4 x double> <double 1.000000e+00, double 2.000000e+00,		<4 x double> <double 1.000000e+00, double 2.000000e+00,
double 3.000000e+00, double 4.000000e+00>,		double 3.000000e+00, double 4.000000e+00>,
<4 x double> <double 1.000000e+01, double 1.000000e+01,		<4 x double> <double 1.000000e+01, double 1.000000e+01,
double 1.000000e+01, double 1.000000e+01>,		double 1.000000e+01, double 1.000000e+01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <4 x double> %rem		ret <4 x double> %rem
▲ Show 20 Lines • Show All 816 Lines • ▼ Show 20 Lines
; PC64LE9-NEXT: addi 3, 3, .LCPI29_0@toc@l		; PC64LE9-NEXT: addi 3, 3, .LCPI29_0@toc@l
; PC64LE9-NEXT: lxvx 0, 0, 3		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI29_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI29_1@toc@ha
; PC64LE9-NEXT: addi 3, 3, .LCPI29_1@toc@l		; PC64LE9-NEXT: addi 3, 3, .LCPI29_1@toc@l
; PC64LE9-NEXT: xvsqrtdp 34, 0		; PC64LE9-NEXT: xvsqrtdp 34, 0
; PC64LE9-NEXT: lxvx 0, 0, 3		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: xvsqrtdp 35, 0		; PC64LE9-NEXT: xvsqrtdp 35, 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
;
entry:		entry:
%sqrt = call <4 x double> @llvm.experimental.constrained.sqrt.v4f64(		%sqrt = call <4 x double> @llvm.experimental.constrained.sqrt.v4f64(
<4 x double> <double 42.0, double 42.1,		<4 x double> <double 42.0, double 42.1,
double 42.2, double 42.3>,		double 42.2, double 42.3>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <4 x double> %sqrt		ret <4 x double> %sqrt
}		}
▲ Show 20 Lines • Show All 3,701 Lines • ▼ Show 20 Lines
; PC64LE9-NEXT: xxsldwi 36, 0, 0, 1		; PC64LE9-NEXT: xxsldwi 36, 0, 0, 1
; PC64LE9-NEXT: vperm 2, 4, 2, 3		; PC64LE9-NEXT: vperm 2, 4, 2, 3
; PC64LE9-NEXT: addi 1, 1, 48		; PC64LE9-NEXT: addi 1, 1, 48
; PC64LE9-NEXT: ld 0, 16(1)		; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0		; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: lfd 31, -8(1) # 8-byte Folded Reload		; PC64LE9-NEXT: lfd 31, -8(1) # 8-byte Folded Reload
; PC64LE9-NEXT: lfd 30, -16(1) # 8-byte Folded Reload		; PC64LE9-NEXT: lfd 30, -16(1) # 8-byte Folded Reload
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
;
entry:		entry:
%rint = call <3 x float> @llvm.experimental.constrained.rint.v3f32(		%rint = call <3 x float> @llvm.experimental.constrained.rint.v3f32(
<3 x float> <float 42.0, float 43.0, float 44.0>,		<3 x float> <float 42.0, float 43.0, float 44.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <3 x float> %rint		ret <3 x float> %rint
}		}

▲ Show 20 Lines • Show All 479 Lines • ▼ Show 20 Lines	%max = call <1 x float> @llvm.experimental.constrained.maxnum.v1f32(
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %max		ret <1 x float> %max
}		}

define <2 x double> @constrained_vector_maxnum_v2f64() {		define <2 x double> @constrained_vector_maxnum_v2f64() {
; PC64LE-LABEL: constrained_vector_maxnum_v2f64:		; PC64LE-LABEL: constrained_vector_maxnum_v2f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -64(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 64
; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: addis 3, 2, .LCPI86_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI86_0@toc@ha
; PC64LE-NEXT: addis 4, 2, .LCPI86_1@toc@ha		; PC64LE-NEXT: addis 4, 2, .LCPI86_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI86_0@toc@l(3)		; PC64LE-NEXT: addi 3, 3, .LCPI86_0@toc@l
; PC64LE-NEXT: lfs 2, .LCPI86_1@toc@l(4)		; PC64LE-NEXT: addi 4, 4, .LCPI86_1@toc@l
; PC64LE-NEXT: bl fmax		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: nop		; PC64LE-NEXT: lxvd2x 1, 0, 4
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: xxswapd 0, 0
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: xxswapd 1, 1
; PC64LE-NEXT: addis 4, 2, .LCPI86_3@toc@ha		; PC64LE-NEXT: xvmaxdp 34, 1, 0
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI86_2@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI86_3@toc@l(4)
; PC64LE-NEXT: lfs 1, .LCPI86_2@toc@l(3)
; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: xxmrghd 34, 1, 0
; PC64LE-NEXT: addi 1, 1, 64
; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_maxnum_v2f64:		; PC64LE9-LABEL: constrained_vector_maxnum_v2f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -48(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 48
; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: addis 3, 2, .LCPI86_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI86_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI86_0@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI86_0@toc@l
		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI86_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI86_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI86_1@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI86_1@toc@l
; PC64LE9-NEXT: bl fmax		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: xvmaxdp 34, 1, 0
; PC64LE9-NEXT: addis 3, 2, .LCPI86_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill
; PC64LE9-NEXT: lfs 1, .LCPI86_2@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI86_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI86_3@toc@l(3)
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 34, 1, 0
; PC64LE9-NEXT: addi 1, 1, 48
; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%max = call <2 x double> @llvm.experimental.constrained.maxnum.v2f64(		%max = call <2 x double> @llvm.experimental.constrained.maxnum.v2f64(
<2 x double> <double 43.0, double 42.0>,		<2 x double> <double 43.0, double 42.0>,
<2 x double> <double 41.0, double 40.0>,		<2 x double> <double 41.0, double 40.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %max		ret <2 x double> %max
▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines	entry:
ret <3 x float> %max		ret <3 x float> %max
}		}

define <3 x double> @constrained_vector_max_v3f64() {		define <3 x double> @constrained_vector_max_v3f64() {
; PC64LE-LABEL: constrained_vector_max_v3f64:		; PC64LE-LABEL: constrained_vector_max_v3f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0		; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)		; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -80(1)		; PC64LE-NEXT: stdu 1, -32(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 80		; PC64LE-NEXT: .cfi_def_cfa_offset 32
; PC64LE-NEXT: .cfi_offset lr, 16		; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: .cfi_offset v31, -16
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: addis 4, 2, .LCPI88_1@toc@ha
; PC64LE-NEXT: stxvd2x 63, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI88_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI88_0@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI88_1@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI88_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI88_0@toc@l(3)		; PC64LE-NEXT: lfs 1, .LCPI88_0@toc@l(3)
		; PC64LE-NEXT: lfs 2, .LCPI88_1@toc@l(4)
; PC64LE-NEXT: bl fmax		; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop		; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: addis 4, 2, .LCPI88_3@toc@ha
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI88_2@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI88_2@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI88_3@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI88_3@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI88_2@toc@l(3)
; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: addis 4, 2, .LCPI88_5@toc@ha
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: addis 3, 2, .LCPI88_4@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI88_5@toc@l(4)
; PC64LE-NEXT: xxmrghd 63, 1, 0
; PC64LE-NEXT: lfs 1, .LCPI88_4@toc@l(3)
; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: fmr 3, 1		; PC64LE-NEXT: fmr 3, 1
; PC64LE-NEXT: xxlor 1, 63, 63		; PC64LE-NEXT: addi 3, 3, .LCPI88_2@toc@l
; PC64LE-NEXT: xxlor 2, 63, 63		; PC64LE-NEXT: addi 4, 4, .LCPI88_3@toc@l
; PC64LE-NEXT: lxvd2x 63, 1, 3 # 16-byte Folded Reload		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: addi 1, 1, 80		; PC64LE-NEXT: lxvd2x 2, 0, 4
		; PC64LE-NEXT: xxswapd 0, 0
		; PC64LE-NEXT: xxswapd 2, 2
		; PC64LE-NEXT: xvmaxdp 2, 2, 0
		; PC64LE-NEXT: xxswapd 0, 2
		; PC64LE-NEXT: # kill: def $f2 killed $f2 killed $vsl2
		; PC64LE-NEXT: fmr 1, 0
		; PC64LE-NEXT: addi 1, 1, 32
; PC64LE-NEXT: ld 0, 16(1)		; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0		; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_max_v3f64:		; PC64LE9-LABEL: constrained_vector_max_v3f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0		; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)		; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -64(1)		; PC64LE9-NEXT: stdu 1, -32(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 64		; PC64LE9-NEXT: .cfi_def_cfa_offset 32
; PC64LE9-NEXT: .cfi_offset lr, 16		; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: .cfi_offset v31, -16
; PC64LE9-NEXT: addis 3, 2, .LCPI88_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI88_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI88_0@toc@l(3)		; PC64LE9-NEXT: lfs 1, .LCPI88_0@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI88_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI88_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI88_1@toc@l(3)		; PC64LE9-NEXT: lfs 2, .LCPI88_1@toc@l(3)
; PC64LE9-NEXT: stxv 63, 48(1) # 16-byte Folded Spill
; PC64LE9-NEXT: bl fmax		; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI88_2@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI88_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE9-NEXT: addi 3, 3, .LCPI88_2@toc@l
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: lfs 1, .LCPI88_2@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI88_3@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI88_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI88_3@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI88_3@toc@l
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: addis 3, 2, .LCPI88_4@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 63, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI88_4@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI88_5@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI88_5@toc@l(3)
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: fmr 3, 1		; PC64LE9-NEXT: fmr 3, 1
; PC64LE9-NEXT: xscpsgndp 1, 63, 63		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: xscpsgndp 2, 63, 63		; PC64LE9-NEXT: xvmaxdp 2, 1, 0
; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload		; PC64LE9-NEXT: xxswapd 1, 2
; PC64LE9-NEXT: addi 1, 1, 64		; PC64LE9-NEXT: # kill: def $f1 killed $f1 killed $vsl1
		; PC64LE9-NEXT: # kill: def $f2 killed $f2 killed $vsl2
		; PC64LE9-NEXT: addi 1, 1, 32
; PC64LE9-NEXT: ld 0, 16(1)		; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0		; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%max = call <3 x double> @llvm.experimental.constrained.maxnum.v3f64(		%max = call <3 x double> @llvm.experimental.constrained.maxnum.v3f64(
<3 x double> <double 43.0, double 44.0, double 45.0>,		<3 x double> <double 43.0, double 44.0, double 45.0>,
<3 x double> <double 40.0, double 41.0, double 42.0>,		<3 x double> <double 40.0, double 41.0, double 42.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <3 x double> %max		ret <3 x double> %max
}		}

define <4 x double> @constrained_vector_maxnum_v4f64() {		define <4 x double> @constrained_vector_maxnum_v4f64() {
; PC64LE-LABEL: constrained_vector_maxnum_v4f64:		; PC64LE-LABEL: constrained_vector_maxnum_v4f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -80(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 80
; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: .cfi_offset v31, -16
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: addis 4, 2, .LCPI89_1@toc@ha
; PC64LE-NEXT: stxvd2x 63, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI89_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI89_0@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI89_1@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI89_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI89_0@toc@l(3)		; PC64LE-NEXT: addis 5, 2, .LCPI89_2@toc@ha
; PC64LE-NEXT: bl fmax		; PC64LE-NEXT: addis 6, 2, .LCPI89_3@toc@ha
; PC64LE-NEXT: nop		; PC64LE-NEXT: addi 3, 3, .LCPI89_0@toc@l
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: addi 4, 4, .LCPI89_1@toc@l
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: addis 4, 2, .LCPI89_3@toc@ha		; PC64LE-NEXT: lxvd2x 1, 0, 4
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill		; PC64LE-NEXT: addi 3, 5, .LCPI89_2@toc@l
; PC64LE-NEXT: addis 3, 2, .LCPI89_2@toc@ha		; PC64LE-NEXT: addi 4, 6, .LCPI89_3@toc@l
; PC64LE-NEXT: lfs 2, .LCPI89_3@toc@l(4)		; PC64LE-NEXT: lxvd2x 2, 0, 3
; PC64LE-NEXT: lfs 1, .LCPI89_2@toc@l(3)		; PC64LE-NEXT: lxvd2x 3, 0, 4
; PC64LE-NEXT: bl fmax		; PC64LE-NEXT: xxswapd 0, 0
; PC64LE-NEXT: nop		; PC64LE-NEXT: xxswapd 1, 1
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: xxswapd 2, 2
; PC64LE-NEXT: addis 4, 2, .LCPI89_5@toc@ha		; PC64LE-NEXT: xxswapd 3, 3
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: xvmaxdp 34, 1, 0
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload		; PC64LE-NEXT: xvmaxdp 35, 3, 2
; PC64LE-NEXT: addis 3, 2, .LCPI89_4@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI89_5@toc@l(4)
; PC64LE-NEXT: xxmrghd 63, 1, 0
; PC64LE-NEXT: lfs 1, .LCPI89_4@toc@l(3)
; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: addis 4, 2, .LCPI89_7@toc@ha
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI89_6@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI89_7@toc@l(4)
; PC64LE-NEXT: lfs 1, .LCPI89_6@toc@l(3)
; PC64LE-NEXT: bl fmax
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: vmr 2, 31
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: lxvd2x 63, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: xxmrghd 35, 1, 0
; PC64LE-NEXT: addi 1, 1, 80
; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_maxnum_v4f64:		; PC64LE9-LABEL: constrained_vector_maxnum_v4f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -64(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 64
; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: .cfi_offset v31, -16
; PC64LE9-NEXT: addis 3, 2, .LCPI89_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI89_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI89_0@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI89_0@toc@l
		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI89_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI89_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI89_1@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI89_1@toc@l
; PC64LE9-NEXT: stxv 63, 48(1) # 16-byte Folded Spill		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI89_2@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI89_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE9-NEXT: addi 3, 3, .LCPI89_2@toc@l
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill		; PC64LE9-NEXT: xvmaxdp 34, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI89_2@toc@l(3)		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI89_3@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI89_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI89_3@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI89_3@toc@l
; PC64LE9-NEXT: bl fmax		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: xvmaxdp 35, 1, 0
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: addis 3, 2, .LCPI89_4@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 63, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI89_4@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI89_5@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI89_5@toc@l(3)
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI89_6@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill
; PC64LE9-NEXT: lfs 1, .LCPI89_6@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI89_7@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI89_7@toc@l(3)
; PC64LE9-NEXT: bl fmax
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: vmr 2, 31
; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 35, 1, 0
; PC64LE9-NEXT: addi 1, 1, 64
; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%max = call <4 x double> @llvm.experimental.constrained.maxnum.v4f64(		%max = call <4 x double> @llvm.experimental.constrained.maxnum.v4f64(
<4 x double> <double 44.0, double 45.0,		<4 x double> <double 44.0, double 45.0,
double 46.0, double 47.0>,		double 46.0, double 47.0>,
<4 x double> <double 40.0, double 41.0,		<4 x double> <double 40.0, double 41.0,
double 42.0, double 43.0>,		double 42.0, double 43.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
Show All 36 Lines
; PC64LE9-NEXT: bl fminf		; PC64LE9-NEXT: bl fminf
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: nop
; PC64LE9-NEXT: xscvdpspn 0, 1		; PC64LE9-NEXT: xscvdpspn 0, 1
; PC64LE9-NEXT: xxsldwi 34, 0, 0, 1		; PC64LE9-NEXT: xxsldwi 34, 0, 0, 1
; PC64LE9-NEXT: addi 1, 1, 32		; PC64LE9-NEXT: addi 1, 1, 32
; PC64LE9-NEXT: ld 0, 16(1)		; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0		; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
;
entry:		entry:
%min = call <1 x float> @llvm.experimental.constrained.minnum.v1f32(		%min = call <1 x float> @llvm.experimental.constrained.minnum.v1f32(
<1 x float> <float 42.0>, <1 x float> <float 41.0>,		<1 x float> <float 42.0>, <1 x float> <float 41.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %min		ret <1 x float> %min
}		}

define <2 x double> @constrained_vector_minnum_v2f64() {		define <2 x double> @constrained_vector_minnum_v2f64() {
; PC64LE-LABEL: constrained_vector_minnum_v2f64:		; PC64LE-LABEL: constrained_vector_minnum_v2f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -64(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 64
; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: addis 3, 2, .LCPI91_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI91_0@toc@ha
; PC64LE-NEXT: addis 4, 2, .LCPI91_1@toc@ha		; PC64LE-NEXT: addis 4, 2, .LCPI91_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI91_0@toc@l(3)		; PC64LE-NEXT: addi 3, 3, .LCPI91_0@toc@l
; PC64LE-NEXT: lfs 2, .LCPI91_1@toc@l(4)		; PC64LE-NEXT: addi 4, 4, .LCPI91_1@toc@l
; PC64LE-NEXT: bl fmin		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: nop		; PC64LE-NEXT: lxvd2x 1, 0, 4
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: xxswapd 0, 0
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: xxswapd 1, 1
; PC64LE-NEXT: addis 4, 2, .LCPI91_3@toc@ha		; PC64LE-NEXT: xvmindp 34, 1, 0
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI91_2@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI91_3@toc@l(4)
; PC64LE-NEXT: lfs 1, .LCPI91_2@toc@l(3)
; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: xxmrghd 34, 1, 0
; PC64LE-NEXT: addi 1, 1, 64
; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_minnum_v2f64:		; PC64LE9-LABEL: constrained_vector_minnum_v2f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -48(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 48
; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: addis 3, 2, .LCPI91_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI91_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI91_0@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI91_0@toc@l
		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI91_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI91_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI91_1@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI91_1@toc@l
; PC64LE9-NEXT: bl fmin		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: xvmindp 34, 1, 0
; PC64LE9-NEXT: addis 3, 2, .LCPI91_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill
; PC64LE9-NEXT: lfs 1, .LCPI91_2@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI91_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI91_3@toc@l(3)
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 34, 1, 0
; PC64LE9-NEXT: addi 1, 1, 48
; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%min = call <2 x double> @llvm.experimental.constrained.minnum.v2f64(		%min = call <2 x double> @llvm.experimental.constrained.minnum.v2f64(
<2 x double> <double 43.0, double 42.0>,		<2 x double> <double 43.0, double 42.0>,
<2 x double> <double 41.0, double 40.0>,		<2 x double> <double 41.0, double 40.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %min		ret <2 x double> %min
▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines	entry:
ret <3 x float> %min		ret <3 x float> %min
}		}

define <3 x double> @constrained_vector_min_v3f64() {		define <3 x double> @constrained_vector_min_v3f64() {
; PC64LE-LABEL: constrained_vector_min_v3f64:		; PC64LE-LABEL: constrained_vector_min_v3f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0		; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)		; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -80(1)		; PC64LE-NEXT: stdu 1, -32(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 80		; PC64LE-NEXT: .cfi_def_cfa_offset 32
; PC64LE-NEXT: .cfi_offset lr, 16		; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: .cfi_offset v31, -16
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: addis 4, 2, .LCPI93_1@toc@ha
; PC64LE-NEXT: stxvd2x 63, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI93_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI93_0@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI93_1@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI93_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI93_0@toc@l(3)		; PC64LE-NEXT: lfs 1, .LCPI93_0@toc@l(3)
		; PC64LE-NEXT: lfs 2, .LCPI93_1@toc@l(4)
; PC64LE-NEXT: bl fmin		; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop		; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: addis 4, 2, .LCPI93_3@toc@ha
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI93_2@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI93_2@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI93_3@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI93_3@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI93_2@toc@l(3)
; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: addis 4, 2, .LCPI93_5@toc@ha
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: addis 3, 2, .LCPI93_4@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI93_5@toc@l(4)
; PC64LE-NEXT: xxmrghd 63, 1, 0
; PC64LE-NEXT: lfs 1, .LCPI93_4@toc@l(3)
; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: fmr 3, 1		; PC64LE-NEXT: fmr 3, 1
; PC64LE-NEXT: xxlor 1, 63, 63		; PC64LE-NEXT: addi 3, 3, .LCPI93_2@toc@l
; PC64LE-NEXT: xxlor 2, 63, 63		; PC64LE-NEXT: addi 4, 4, .LCPI93_3@toc@l
; PC64LE-NEXT: lxvd2x 63, 1, 3 # 16-byte Folded Reload		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: addi 1, 1, 80		; PC64LE-NEXT: lxvd2x 2, 0, 4
		; PC64LE-NEXT: xxswapd 0, 0
		; PC64LE-NEXT: xxswapd 2, 2
		; PC64LE-NEXT: xvmindp 2, 2, 0
		; PC64LE-NEXT: xxswapd 0, 2
		; PC64LE-NEXT: # kill: def $f2 killed $f2 killed $vsl2
		; PC64LE-NEXT: fmr 1, 0
		; PC64LE-NEXT: addi 1, 1, 32
; PC64LE-NEXT: ld 0, 16(1)		; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0		; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_min_v3f64:		; PC64LE9-LABEL: constrained_vector_min_v3f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0		; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)		; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -64(1)		; PC64LE9-NEXT: stdu 1, -32(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 64		; PC64LE9-NEXT: .cfi_def_cfa_offset 32
; PC64LE9-NEXT: .cfi_offset lr, 16		; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: .cfi_offset v31, -16
; PC64LE9-NEXT: addis 3, 2, .LCPI93_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI93_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI93_0@toc@l(3)		; PC64LE9-NEXT: lfs 1, .LCPI93_0@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI93_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI93_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI93_1@toc@l(3)		; PC64LE9-NEXT: lfs 2, .LCPI93_1@toc@l(3)
; PC64LE9-NEXT: stxv 63, 48(1) # 16-byte Folded Spill
; PC64LE9-NEXT: bl fmin		; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI93_2@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI93_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE9-NEXT: addi 3, 3, .LCPI93_2@toc@l
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: lfs 1, .LCPI93_2@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI93_3@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI93_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI93_3@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI93_3@toc@l
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: addis 3, 2, .LCPI93_4@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 63, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI93_4@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI93_5@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI93_5@toc@l(3)
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: fmr 3, 1		; PC64LE9-NEXT: fmr 3, 1
; PC64LE9-NEXT: xscpsgndp 1, 63, 63		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: xscpsgndp 2, 63, 63		; PC64LE9-NEXT: xvmindp 2, 1, 0
; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload		; PC64LE9-NEXT: xxswapd 1, 2
; PC64LE9-NEXT: addi 1, 1, 64		; PC64LE9-NEXT: # kill: def $f1 killed $f1 killed $vsl1
		; PC64LE9-NEXT: # kill: def $f2 killed $f2 killed $vsl2
		; PC64LE9-NEXT: addi 1, 1, 32
; PC64LE9-NEXT: ld 0, 16(1)		; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0		; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%min = call <3 x double> @llvm.experimental.constrained.minnum.v3f64(		%min = call <3 x double> @llvm.experimental.constrained.minnum.v3f64(
<3 x double> <double 43.0, double 44.0, double 45.0>,		<3 x double> <double 43.0, double 44.0, double 45.0>,
<3 x double> <double 40.0, double 41.0, double 42.0>,		<3 x double> <double 40.0, double 41.0, double 42.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <3 x double> %min		ret <3 x double> %min
}		}

define <4 x double> @constrained_vector_minnum_v4f64() {		define <4 x double> @constrained_vector_minnum_v4f64() {
; PC64LE-LABEL: constrained_vector_minnum_v4f64:		; PC64LE-LABEL: constrained_vector_minnum_v4f64:
; PC64LE: # %bb.0: # %entry		; PC64LE: # %bb.0: # %entry
; PC64LE-NEXT: mflr 0
; PC64LE-NEXT: std 0, 16(1)
; PC64LE-NEXT: stdu 1, -80(1)
; PC64LE-NEXT: .cfi_def_cfa_offset 80
; PC64LE-NEXT: .cfi_offset lr, 16
; PC64LE-NEXT: .cfi_offset v31, -16
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: addis 4, 2, .LCPI94_1@toc@ha
; PC64LE-NEXT: stxvd2x 63, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI94_0@toc@ha		; PC64LE-NEXT: addis 3, 2, .LCPI94_0@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI94_1@toc@l(4)		; PC64LE-NEXT: addis 4, 2, .LCPI94_1@toc@ha
; PC64LE-NEXT: lfs 1, .LCPI94_0@toc@l(3)		; PC64LE-NEXT: addis 5, 2, .LCPI94_2@toc@ha
; PC64LE-NEXT: bl fmin		; PC64LE-NEXT: addis 6, 2, .LCPI94_3@toc@ha
; PC64LE-NEXT: nop		; PC64LE-NEXT: addi 3, 3, .LCPI94_0@toc@l
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: addi 4, 4, .LCPI94_1@toc@l
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: lxvd2x 0, 0, 3
; PC64LE-NEXT: addis 4, 2, .LCPI94_3@toc@ha		; PC64LE-NEXT: lxvd2x 1, 0, 4
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill		; PC64LE-NEXT: addi 3, 5, .LCPI94_2@toc@l
; PC64LE-NEXT: addis 3, 2, .LCPI94_2@toc@ha		; PC64LE-NEXT: addi 4, 6, .LCPI94_3@toc@l
; PC64LE-NEXT: lfs 2, .LCPI94_3@toc@l(4)		; PC64LE-NEXT: lxvd2x 2, 0, 3
; PC64LE-NEXT: lfs 1, .LCPI94_2@toc@l(3)		; PC64LE-NEXT: lxvd2x 3, 0, 4
; PC64LE-NEXT: bl fmin		; PC64LE-NEXT: xxswapd 0, 0
; PC64LE-NEXT: nop		; PC64LE-NEXT: xxswapd 1, 1
; PC64LE-NEXT: li 3, 48		; PC64LE-NEXT: xxswapd 2, 2
; PC64LE-NEXT: addis 4, 2, .LCPI94_5@toc@ha		; PC64LE-NEXT: xxswapd 3, 3
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE-NEXT: xvmindp 34, 1, 0
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload		; PC64LE-NEXT: xvmindp 35, 3, 2
; PC64LE-NEXT: addis 3, 2, .LCPI94_4@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI94_5@toc@l(4)
; PC64LE-NEXT: xxmrghd 63, 1, 0
; PC64LE-NEXT: lfs 1, .LCPI94_4@toc@l(3)
; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: addis 4, 2, .LCPI94_7@toc@ha
; PC64LE-NEXT: stxvd2x 1, 1, 3 # 16-byte Folded Spill
; PC64LE-NEXT: addis 3, 2, .LCPI94_6@toc@ha
; PC64LE-NEXT: lfs 2, .LCPI94_7@toc@l(4)
; PC64LE-NEXT: lfs 1, .LCPI94_6@toc@l(3)
; PC64LE-NEXT: bl fmin
; PC64LE-NEXT: nop
; PC64LE-NEXT: li 3, 48
; PC64LE-NEXT: vmr 2, 31
; PC64LE-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE-NEXT: lxvd2x 0, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: li 3, 64
; PC64LE-NEXT: lxvd2x 63, 1, 3 # 16-byte Folded Reload
; PC64LE-NEXT: xxmrghd 35, 1, 0
; PC64LE-NEXT: addi 1, 1, 80
; PC64LE-NEXT: ld 0, 16(1)
; PC64LE-NEXT: mtlr 0
; PC64LE-NEXT: blr		; PC64LE-NEXT: blr
;		;
; PC64LE9-LABEL: constrained_vector_minnum_v4f64:		; PC64LE9-LABEL: constrained_vector_minnum_v4f64:
; PC64LE9: # %bb.0: # %entry		; PC64LE9: # %bb.0: # %entry
; PC64LE9-NEXT: mflr 0
; PC64LE9-NEXT: std 0, 16(1)
; PC64LE9-NEXT: stdu 1, -64(1)
; PC64LE9-NEXT: .cfi_def_cfa_offset 64
; PC64LE9-NEXT: .cfi_offset lr, 16
; PC64LE9-NEXT: .cfi_offset v31, -16
; PC64LE9-NEXT: addis 3, 2, .LCPI94_0@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI94_0@toc@ha
; PC64LE9-NEXT: lfs 1, .LCPI94_0@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI94_0@toc@l
		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI94_1@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI94_1@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI94_1@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI94_1@toc@l
; PC64LE9-NEXT: stxv 63, 48(1) # 16-byte Folded Spill		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI94_2@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI94_2@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1		; PC64LE9-NEXT: addi 3, 3, .LCPI94_2@toc@l
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill		; PC64LE9-NEXT: xvmindp 34, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI94_2@toc@l(3)		; PC64LE9-NEXT: lxvx 0, 0, 3
; PC64LE9-NEXT: addis 3, 2, .LCPI94_3@toc@ha		; PC64LE9-NEXT: addis 3, 2, .LCPI94_3@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI94_3@toc@l(3)		; PC64LE9-NEXT: addi 3, 3, .LCPI94_3@toc@l
; PC64LE9-NEXT: bl fmin		; PC64LE9-NEXT: lxvx 1, 0, 3
; PC64LE9-NEXT: nop		; PC64LE9-NEXT: xvmindp 35, 1, 0
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: addis 3, 2, .LCPI94_4@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 63, 1, 0
; PC64LE9-NEXT: lfs 1, .LCPI94_4@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI94_5@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI94_5@toc@l(3)
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: addis 3, 2, .LCPI94_6@toc@ha
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: stxv 1, 32(1) # 16-byte Folded Spill
; PC64LE9-NEXT: lfs 1, .LCPI94_6@toc@l(3)
; PC64LE9-NEXT: addis 3, 2, .LCPI94_7@toc@ha
; PC64LE9-NEXT: lfs 2, .LCPI94_7@toc@l(3)
; PC64LE9-NEXT: bl fmin
; PC64LE9-NEXT: nop
; PC64LE9-NEXT: lxv 0, 32(1) # 16-byte Folded Reload
; PC64LE9-NEXT: vmr 2, 31
; PC64LE9-NEXT: lxv 63, 48(1) # 16-byte Folded Reload
; PC64LE9-NEXT: # kill: def $f1 killed $f1 def $vsl1
; PC64LE9-NEXT: xxmrghd 35, 1, 0
; PC64LE9-NEXT: addi 1, 1, 64
; PC64LE9-NEXT: ld 0, 16(1)
; PC64LE9-NEXT: mtlr 0
; PC64LE9-NEXT: blr		; PC64LE9-NEXT: blr
entry:		entry:
%min = call <4 x double> @llvm.experimental.constrained.minnum.v4f64(		%min = call <4 x double> @llvm.experimental.constrained.minnum.v4f64(
<4 x double> <double 44.0, double 45.0,		<4 x double> <double 44.0, double 45.0,
double 46.0, double 47.0>,		double 46.0, double 47.0>,
<4 x double> <double 40.0, double 41.0,		<4 x double> <double 40.0, double 41.0,
double 42.0, double 43.0>,		double 42.0, double 43.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
▲ Show 20 Lines • Show All 987 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[PowerPC] Exploit the vector min/max instructionsClosedPublic

Details

Diff Detail

Event Timeline