This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/SelectionDAG/
-
CodeGen/
-
SelectionDAG/
1
DAGCombiner.cpp
-
test/CodeGen/
-
CodeGen/
-
AArch64/
-
fpclamptosat.ll
-
fpclamptosat_vec.ll
-
ARM/
-
fpclamptosat.ll
-
RISCV/
-
fpclamptosat.ll
-
Thumb2/
-
mve-fpclamptosat_vec.ll
-
WebAssembly/
-
fpclamptosat.ll
-
fpclamptosat_vec.ll

Differential D114964

[DAG] Create fptoui.sat from clamped fptoui
ClosedPublic

Authored by dmgreen on Dec 2 2021, 8:37 AM.

Download Raw Diff

Details

Reviewers

spatel
RKSimon
craig.topper
tlively
efriedma
SjoerdMeijer
sjarus

Commits

rG57356d6bb72a: [DAG] Create fptoui.sat from clamped fptoui

Summary

This is the unsigned variant of D111976, where we convert a clamped fptoui to a fptoui.sat. Because we are unsigned, the condition this time is only UMIN of UINT_MAX. Similarly to D111976 it handles ISD::UMIN, ISD::SETCC/ISD::SELECT, ISD::VSELECT or ISD::SELECT_CC nodes.

This especially helps on ARM/AArch64 where the vcvt instructions naturally saturate the result.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dmgreen created this revision.Dec 2 2021, 8:37 AM

Herald added a reviewer: sjarus. · View Herald TranscriptDec 2 2021, 8:37 AM

Herald added subscribers: armkevincheng, eric-k256, frasercrmck and 26 others. · View Herald Transcript

dmgreen requested review of this revision.Dec 2 2021, 8:37 AM

Herald added a project: Restricted Project. · View Herald TranscriptDec 2 2021, 8:38 AM

Herald added subscribers: MaskRay, aheejin. · View Herald Transcript

Harbormaster completed remote builds in B137149: Diff 391337.Dec 2 2021, 8:38 AM

This seems fine as an extension of the previous patch.
I haven't been following the progress in this area closely though. What prevents folding these patterns to the saturating intrinsics in IR?
https://llvm.org/docs/LangRef.html#llvm-fptoui-sat-intrinsic

In D114964#3167836, @spatel wrote:

This seems fine as an extension of the previous patch.
I haven't been following the progress in this area closely though. What prevents folding these patterns to the saturating intrinsics in IR?
https://llvm.org/docs/LangRef.html#llvm-fptoui-sat-intrinsic

An fptoui.sat is more defined than a fptoui + umin. For the fptoui any out-of-range value produces poison - for the fptoui.sat the out of range values are defined to saturate. So the transform isn't reversible and on some architectures produces worse code.

More concretely a iN fptoui.sat(X) can be expanded into fptoui(fmax(fmin(X, (float)(2^N)-1), 0)) (or something else with float compares and int selects if fmin/fmax are not available). Plus it needs to handle Nan for fptosi, making sure it becomes 0.

I would actually really like it to be done in IR. We would need to vectorize these if we can, and they are much simpler to vectorize if we already have the scalar instructions. But it needs to be a costed decision, not an inst-combine canonicalization decision. Any ideas of a good place to make that happen?

In D114964#3168031, @dmgreen wrote:

An fptoui.sat is more defined than a fptoui + umin. For the fptoui any out-of-range value produces poison - for the fptoui.sat the out of range values are defined to saturate. So the transform isn't reversible and on some architectures produces worse code.

Ah, right. This is similar to a problem we have with funnel-shift intrinsics. After several improvements in the default expansion, we are able to canonicalize most patterns, but there are still a few that could escape because the safe expansion has more instructions than the unsafe pattern on targets that don't have good shift/rotates.

I would actually really like it to be done in IR. We would need to vectorize these if we can, and they are much simpler to vectorize if we already have the scalar instructions. But it needs to be a costed decision, not an inst-combine canonicalization decision. Any ideas of a good place to make that happen?

Yes, trying to bend the cost models to recognize larger patterns like this is tough. VectorCombine does generic transforms using costs, but this doesn't quite fall into that category...

cc'ing @nikic @bjope based on the history:
https://groups.google.com/g/llvm-dev/c/cgDFaBmCnDQ/m/CZAIMj4IBAAJ
D54749

I'm not sure if there was a plan for these intrinsics to be used from C-like source. Maybe we need an fp-combine pass?

In D114964#3169854, @spatel wrote:

In D114964#3168031, @dmgreen wrote:

An fptoui.sat is more defined than a fptoui + umin. For the fptoui any out-of-range value produces poison - for the fptoui.sat the out of range values are defined to saturate. So the transform isn't reversible and on some architectures produces worse code.

Ah, right. This is similar to a problem we have with funnel-shift intrinsics. After several improvements in the default expansion, we are able to canonicalize most patterns, but there are still a few that could escape because the safe expansion has more instructions than the unsafe pattern on targets that don't have good shift/rotates.

I would actually really like it to be done in IR. We would need to vectorize these if we can, and they are much simpler to vectorize if we already have the scalar instructions. But it needs to be a costed decision, not an inst-combine canonicalization decision. Any ideas of a good place to make that happen?

Yes, trying to bend the cost models to recognize larger patterns like this is tough. VectorCombine does generic transforms using costs, but this doesn't quite fall into that category...

cc'ing @nikic @bjope based on the history:
https://groups.google.com/g/llvm-dev/c/cgDFaBmCnDQ/m/CZAIMj4IBAAJ
D54749

I'm not sure if there was a plan for these intrinsics to be used from C-like source. Maybe we need an fp-combine pass?

My use case for these were to related to the Embedded-C support, to implement conversion between fixed point types and floating point types. This was added in https://reviews.llvm.org/D86632. In that patch the intrinsics are used directly when producing IR in the frontend.

Downstream we set FP_TO_SINT_SAT as "custom" as we can do optimized lowering in some situation (depending on involved types and saturation width). I realize that we probably want to override shouldConvertFpToSat to avoid any conversion to these saturated intrinsics when it isn't beneficial for our target.

My use case for these were to related to the Embedded-C support, to implement conversion between fixed point types and floating point types. This was added in https://reviews.llvm.org/D86632. In that patch the intrinsics are used directly when producing IR in the frontend.

Downstream we set FP_TO_SINT_SAT as "custom" as we can do optimized lowering in some situation (depending on involved types and saturation width). I realize that we probably want to override shouldConvertFpToSat to avoid any conversion to these saturated intrinsics when it isn't beneficial for our target.

I believe they may be used in rust too.

I know that scalable vectors are not supported for all cases (they can try to unroll) - that is on my list of things to look at.

Yeah, making an shouldConvertFpToSat override can be important - for example only converting them when the operations are available like we do on Arm. You may be able to copy one of the existing fpclamptosat.ll test cases as a starting point.

As an outsider, similar to VectorCombine maybe other FooCombine passes (special purpose) with costs. As an addition/extension to InstCombine.

dmgreen mentioned this in D113291: [AggressiveInstCombine] Lower Table Based CTTZ .Jan 18 2022, 2:19 AM

RKSimon added inline comments.Jan 20 2022, 7:11 AM

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
4915	Isn't this (C1 + 1).exactLogBase2()?

Update as per comments. Thanks!

Harbormaster completed remote builds in B144876: Diff 402037.Jan 21 2022, 12:50 PM

LGTM - cheers

This revision is now accepted and ready to land.Jan 25 2022, 8:53 AM

Herald added a subscriber: • pcwang-thead. · View Herald TranscriptJan 25 2022, 8:53 AM

Closed by commit rG57356d6bb72a: [DAG] Create fptoui.sat from clamped fptoui (authored by dmgreen). · Explain WhyJan 26 2022, 12:37 AM

This revision was automatically updated to reflect the committed changes.

dmgreen added a commit: rG57356d6bb72a: [DAG] Create fptoui.sat from clamped fptoui.

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

SelectionDAG/

DAGCombiner.cpp

43 lines

test/

CodeGen/

AArch64/

fpclamptosat.ll

44 lines

fpclamptosat_vec.ll

216 lines

ARM/

fpclamptosat.ll

113 lines

RISCV/

fpclamptosat.ll

148 lines

Thumb2/

mve-fpclamptosat_vec.ll

108 lines

WebAssembly/

fpclamptosat.ll

60 lines

fpclamptosat_vec.ll

244 lines

Diff 403159

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 4,886 Lines • ▼ Show 20 Lines	if (!DAG.getTargetLoweringInfo().shouldConvertFpToSat(NewOpc, FPVT, NewVT))
return SDValue();		return SDValue();
SDLoc DL(Fp);		SDLoc DL(Fp);
SDValue Sat = DAG.getNode(NewOpc, DL, NewVT, Fp.getOperand(0),		SDValue Sat = DAG.getNode(NewOpc, DL, NewVT, Fp.getOperand(0),
DAG.getValueType(NewVT.getScalarType()));		DAG.getValueType(NewVT.getScalarType()));
return Unsigned ? DAG.getZExtOrTrunc(Sat, DL, N2->getValueType(0))		return Unsigned ? DAG.getZExtOrTrunc(Sat, DL, N2->getValueType(0))
: DAG.getSExtOrTrunc(Sat, DL, N2->getValueType(0));		: DAG.getSExtOrTrunc(Sat, DL, N2->getValueType(0));
}		}

		static SDValue PerformUMinFpToSatCombine(SDValue N0, SDValue N1, SDValue N2,
		SDValue N3, ISD::CondCode CC,
		SelectionDAG &DAG) {
		// We are looking for UMIN(FPTOUI(X), (2^n)-1), which may have come via a
		// select/vselect/select_cc. The two operands pairs for the select (N2/N3) may
		// be truncated versions of the the setcc (N0/N1).
		if ((N0 != N2 &&
		(N2.getOpcode() != ISD::TRUNCATE \|\| N0 != N2.getOperand(0))) \|\|
		N0.getOpcode() != ISD::FP_TO_UINT \|\| CC != ISD::SETULT)
		return SDValue();
		ConstantSDNode *N1C = isConstOrConstSplat(N1);
		ConstantSDNode *N3C = isConstOrConstSplat(N3);
		if (!N1C \|\| !N3C)
		return SDValue();
		const APInt &C1 = N1C->getAPIntValue();
		const APInt &C3 = N3C->getAPIntValue();
		if (!(C1 + 1).isPowerOf2() \|\| C1.getBitWidth() < C3.getBitWidth() \|\|
		C1 != C3.zextOrSelf(C1.getBitWidth()))
		return SDValue();

		unsigned BW = (C1 + 1).exactLogBase2();
		RKSimonUnsubmitted Not Done Reply Inline Actions Isn't this (C1 + 1).exactLogBase2()? RKSimon: Isn't this (C1 + 1).exactLogBase2()?
		EVT FPVT = N0.getOperand(0).getValueType();
		EVT NewVT = EVT::getIntegerVT(*DAG.getContext(), BW);
		if (FPVT.isVector())
		NewVT = EVT::getVectorVT(*DAG.getContext(), NewVT,
		FPVT.getVectorElementCount());
		if (!DAG.getTargetLoweringInfo().shouldConvertFpToSat(ISD::FP_TO_UINT_SAT,
		FPVT, NewVT))
		return SDValue();

		SDValue Sat =
		DAG.getNode(ISD::FP_TO_UINT_SAT, SDLoc(N0), NewVT, N0.getOperand(0),
		DAG.getValueType(NewVT.getScalarType()));
		return DAG.getZExtOrTrunc(Sat, SDLoc(N0), N3.getValueType());
		}

SDValue DAGCombiner::visitIMINMAX(SDNode *N) {		SDValue DAGCombiner::visitIMINMAX(SDNode *N) {
SDValue N0 = N->getOperand(0);		SDValue N0 = N->getOperand(0);
SDValue N1 = N->getOperand(1);		SDValue N1 = N->getOperand(1);
EVT VT = N0.getValueType();		EVT VT = N0.getValueType();
unsigned Opcode = N->getOpcode();		unsigned Opcode = N->getOpcode();
SDLoc DL(N);		SDLoc DL(N);

// fold operation with constant operands.		// fold operation with constant operands.
Show All 26 Lines	if (!TLI.isOperationLegal(Opcode, VT) &&
if (TLI.isOperationLegal(AltOpcode, VT))		if (TLI.isOperationLegal(AltOpcode, VT))
return DAG.getNode(AltOpcode, DL, VT, N0, N1);		return DAG.getNode(AltOpcode, DL, VT, N0, N1);
}		}

if (Opcode == ISD::SMIN \|\| Opcode == ISD::SMAX)		if (Opcode == ISD::SMIN \|\| Opcode == ISD::SMAX)
if (SDValue S = PerformMinMaxFpToSatCombine(		if (SDValue S = PerformMinMaxFpToSatCombine(
N0, N1, N0, N1, Opcode == ISD::SMIN ? ISD::SETLT : ISD::SETGT, DAG))		N0, N1, N0, N1, Opcode == ISD::SMIN ? ISD::SETLT : ISD::SETGT, DAG))
return S;		return S;
		if (Opcode == ISD::UMIN)
		if (SDValue S = PerformUMinFpToSatCombine(N0, N1, N0, N1, ISD::SETULT, DAG))
		return S;

// Simplify the operands using demanded-bits information.		// Simplify the operands using demanded-bits information.
if (SimplifyDemandedBits(SDValue(N, 0)))		if (SimplifyDemandedBits(SDValue(N, 0)))
return SDValue(N, 0);		return SDValue(N, 0);

return SDValue();		return SDValue();
}		}

▲ Show 20 Lines • Show All 5,364 Lines • ▼ Show 20 Lines	if (N0.getOpcode() == ISD::SETCC) {
if (N0.hasOneUse() && isLegalToCombineMinNumMaxNum(DAG, LHS, RHS, TLI)) {		if (N0.hasOneUse() && isLegalToCombineMinNumMaxNum(DAG, LHS, RHS, TLI)) {
if (SDValue FMinMax =		if (SDValue FMinMax =
combineMinNumMaxNum(DL, VT, LHS, RHS, N1, N2, CC, TLI, DAG))		combineMinNumMaxNum(DL, VT, LHS, RHS, N1, N2, CC, TLI, DAG))
return FMinMax;		return FMinMax;
}		}

if (SDValue S = PerformMinMaxFpToSatCombine(LHS, RHS, N1, N2, CC, DAG))		if (SDValue S = PerformMinMaxFpToSatCombine(LHS, RHS, N1, N2, CC, DAG))
return S;		return S;
		if (SDValue S = PerformUMinFpToSatCombine(LHS, RHS, N1, N2, CC, DAG))
		return S;

// If this select has a condition (setcc) with narrower operands than the		// If this select has a condition (setcc) with narrower operands than the
// select, try to widen the compare to match the select width.		// select, try to widen the compare to match the select width.
// TODO: This should be extended to handle any constant.		// TODO: This should be extended to handle any constant.
// TODO: This could be extended to handle non-loading patterns, but that		// TODO: This could be extended to handle non-loading patterns, but that
// requires thorough testing to avoid regressions.		// requires thorough testing to avoid regressions.
if (isNullOrNullSplat(RHS)) {		if (isNullOrNullSplat(RHS)) {
EVT NarrowVT = LHS.getValueType();		EVT NarrowVT = LHS.getValueType();
▲ Show 20 Lines • Show All 13,042 Lines • ▼ Show 20 Lines	SDValue ASR = DAG.getNode(
ISD::SRA, DL, CmpOpVT, N0,		ISD::SRA, DL, CmpOpVT, N0,
DAG.getConstant(CmpOpVT.getScalarSizeInBits() - 1, DL, CmpOpVT));		DAG.getConstant(CmpOpVT.getScalarSizeInBits() - 1, DL, CmpOpVT));
return DAG.getNode(ISD::XOR, DL, VT, DAG.getSExtOrTrunc(ASR, DL, VT),		return DAG.getNode(ISD::XOR, DL, VT, DAG.getSExtOrTrunc(ASR, DL, VT),
DAG.getSExtOrTrunc(CC == ISD::SETLT ? N3 : N2, DL, VT));		DAG.getSExtOrTrunc(CC == ISD::SETLT ? N3 : N2, DL, VT));
}		}

if (SDValue S = PerformMinMaxFpToSatCombine(N0, N1, N2, N3, CC, DAG))		if (SDValue S = PerformMinMaxFpToSatCombine(N0, N1, N2, N3, CC, DAG))
return S;		return S;
		if (SDValue S = PerformUMinFpToSatCombine(N0, N1, N2, N3, CC, DAG))
		return S;

return SDValue();		return SDValue();
}		}

/// This is a stub for TargetLowering::SimplifySetCC.		/// This is a stub for TargetLowering::SimplifySetCC.
SDValue DAGCombiner::SimplifySetCC(EVT VT, SDValue N0, SDValue N1,		SDValue DAGCombiner::SimplifySetCC(EVT VT, SDValue N0, SDValue N1,
ISD::CondCode Cond, const SDLoc &DL,		ISD::CondCode Cond, const SDLoc &DL,
bool foldBooleans) {		bool foldBooleans) {
▲ Show 20 Lines • Show All 719 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/fpclamptosat.ll

Show All 16 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32(double %x) {		define i32 @utest_f64i32(double %x) {
; CHECK-LABEL: utest_f64i32:		; CHECK-LABEL: utest_f64i32:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtzu x8, d0		; CHECK-NEXT: fcvtzu w0, d0
; CHECK-NEXT: mov w9, #-1
; CHECK-NEXT: cmp x8, x9
; CHECK-NEXT: csinv w0, w8, wzr, lo
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui double %x to i64		%conv = fptoui double %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
Show All 26 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32(float %x) {		define i32 @utest_f32i32(float %x) {
; CHECK-LABEL: utest_f32i32:		; CHECK-LABEL: utest_f32i32:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtzu x8, s0		; CHECK-NEXT: fcvtzu w0, s0
; CHECK-NEXT: mov w9, #-1
; CHECK-NEXT: cmp x8, x9
; CHECK-NEXT: csinv w0, w8, wzr, lo
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui float %x to i64		%conv = fptoui float %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
Show All 33 Lines	entry:
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utesth_f16i32(half %x) {		define i32 @utesth_f16i32(half %x) {
; CHECK-CVT-LABEL: utesth_f16i32:		; CHECK-CVT-LABEL: utesth_f16i32:
; CHECK-CVT: // %bb.0: // %entry		; CHECK-CVT: // %bb.0: // %entry
; CHECK-CVT-NEXT: fcvt s0, h0		; CHECK-CVT-NEXT: fcvt s0, h0
; CHECK-CVT-NEXT: mov w9, #-1		; CHECK-CVT-NEXT: fcvtzu w0, s0
; CHECK-CVT-NEXT: fcvtzu x8, s0
; CHECK-CVT-NEXT: cmp x8, x9
; CHECK-CVT-NEXT: csinv w0, w8, wzr, lo
; CHECK-CVT-NEXT: ret		; CHECK-CVT-NEXT: ret
;		;
; CHECK-FP16-LABEL: utesth_f16i32:		; CHECK-FP16-LABEL: utesth_f16i32:
; CHECK-FP16: // %bb.0: // %entry		; CHECK-FP16: // %bb.0: // %entry
; CHECK-FP16-NEXT: fcvtzu x8, h0		; CHECK-FP16-NEXT: fcvtzu w0, h0
; CHECK-FP16-NEXT: mov w9, #-1
; CHECK-FP16-NEXT: cmp x8, x9
; CHECK-FP16-NEXT: csinv w0, w8, wzr, lo
; CHECK-FP16-NEXT: ret		; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui half %x to i64		%conv = fptoui half %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
▲ Show 20 Lines • Show All 432 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32_mm(double %x) {		define i32 @utest_f64i32_mm(double %x) {
; CHECK-LABEL: utest_f64i32_mm:		; CHECK-LABEL: utest_f64i32_mm:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtzu x8, d0		; CHECK-NEXT: fcvtzu w0, d0
; CHECK-NEXT: mov w9, #-1
; CHECK-NEXT: cmp x8, x9
; CHECK-NEXT: csel x0, x8, x9, lo
; CHECK-NEXT: // kill: def $w0 killed $w0 killed $x0
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui double %x to i64		%conv = fptoui double %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

Show All 21 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32_mm(float %x) {		define i32 @utest_f32i32_mm(float %x) {
; CHECK-LABEL: utest_f32i32_mm:		; CHECK-LABEL: utest_f32i32_mm:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtzu x8, s0		; CHECK-NEXT: fcvtzu w0, s0
; CHECK-NEXT: mov w9, #-1
; CHECK-NEXT: cmp x8, x9
; CHECK-NEXT: csel x0, x8, x9, lo
; CHECK-NEXT: // kill: def $w0 killed $w0 killed $x0
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui float %x to i64		%conv = fptoui float %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

Show All 28 Lines	entry:
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utesth_f16i32_mm(half %x) {		define i32 @utesth_f16i32_mm(half %x) {
; CHECK-CVT-LABEL: utesth_f16i32_mm:		; CHECK-CVT-LABEL: utesth_f16i32_mm:
; CHECK-CVT: // %bb.0: // %entry		; CHECK-CVT: // %bb.0: // %entry
; CHECK-CVT-NEXT: fcvt s0, h0		; CHECK-CVT-NEXT: fcvt s0, h0
; CHECK-CVT-NEXT: mov w9, #-1		; CHECK-CVT-NEXT: fcvtzu w0, s0
; CHECK-CVT-NEXT: fcvtzu x8, s0
; CHECK-CVT-NEXT: cmp x8, x9
; CHECK-CVT-NEXT: csel x0, x8, x9, lo
; CHECK-CVT-NEXT: // kill: def $w0 killed $w0 killed $x0
; CHECK-CVT-NEXT: ret		; CHECK-CVT-NEXT: ret
;		;
; CHECK-FP16-LABEL: utesth_f16i32_mm:		; CHECK-FP16-LABEL: utesth_f16i32_mm:
; CHECK-FP16: // %bb.0: // %entry		; CHECK-FP16: // %bb.0: // %entry
; CHECK-FP16-NEXT: fcvtzu x8, h0		; CHECK-FP16-NEXT: fcvtzu w0, h0
; CHECK-FP16-NEXT: mov w9, #-1
; CHECK-FP16-NEXT: cmp x8, x9
; CHECK-FP16-NEXT: csel x0, x8, x9, lo
; CHECK-FP16-NEXT: // kill: def $w0 killed $w0 killed $x0
; CHECK-FP16-NEXT: ret		; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui half %x to i64		%conv = fptoui half %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

▲ Show 20 Lines • Show All 397 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/fpclamptosat_vec.ll

Show All 21 Lines	entry:
%spec.store.select7 = select <2 x i1> %1, <2 x i64> %spec.store.select, <2 x i64> <i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <2 x i1> %1, <2 x i64> %spec.store.select, <2 x i64> <i64 -2147483648, i64 -2147483648>
%conv6 = trunc <2 x i64> %spec.store.select7 to <2 x i32>		%conv6 = trunc <2 x i64> %spec.store.select7 to <2 x i32>
ret <2 x i32> %conv6		ret <2 x i32> %conv6
}		}

define <2 x i32> @utest_f64i32(<2 x double> %x) {		define <2 x i32> @utest_f64i32(<2 x double> %x) {
; CHECK-LABEL: utest_f64i32:		; CHECK-LABEL: utest_f64i32:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: movi v1.2d, #0x000000ffffffff		; CHECK-NEXT: mov d1, v0.d[1]
; CHECK-NEXT: fcvtzu v0.2d, v0.2d		; CHECK-NEXT: fcvtzu w8, d0
; CHECK-NEXT: cmhi v1.2d, v1.2d, v0.2d		; CHECK-NEXT: fmov s0, w8
; CHECK-NEXT: and v0.16b, v0.16b, v1.16b		; CHECK-NEXT: fcvtzu w8, d1
; CHECK-NEXT: orn v0.16b, v0.16b, v1.16b		; CHECK-NEXT: mov v0.s[1], w8
; CHECK-NEXT: xtn v0.2s, v0.2d		; CHECK-NEXT: // kill: def $d0 killed $d0 killed $q0
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui <2 x double> %x to <2 x i64>		%conv = fptoui <2 x double> %x to <2 x i64>
%0 = icmp ult <2 x i64> %conv, <i64 4294967295, i64 4294967295>		%0 = icmp ult <2 x i64> %conv, <i64 4294967295, i64 4294967295>
%spec.store.select = select <2 x i1> %0, <2 x i64> %conv, <2 x i64> <i64 4294967295, i64 4294967295>		%spec.store.select = select <2 x i1> %0, <2 x i64> %conv, <2 x i64> <i64 4294967295, i64 4294967295>
%conv6 = trunc <2 x i64> %spec.store.select to <2 x i32>		%conv6 = trunc <2 x i64> %spec.store.select to <2 x i32>
ret <2 x i32> %conv6		ret <2 x i32> %conv6
}		}
Show All 31 Lines	entry:
%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utest_f32i32(<4 x float> %x) {		define <4 x i32> @utest_f32i32(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32:		; CHECK-LABEL: utest_f32i32:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtl2 v2.2d, v0.4s		; CHECK-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-NEXT: fcvtl v0.2d, v0.2s
; CHECK-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-NEXT: fcvtzu v2.2d, v2.2d
; CHECK-NEXT: fcvtzu v0.2d, v0.2d
; CHECK-NEXT: cmhi v3.2d, v1.2d, v2.2d
; CHECK-NEXT: cmhi v1.2d, v1.2d, v0.2d
; CHECK-NEXT: and v2.16b, v2.16b, v3.16b
; CHECK-NEXT: and v0.16b, v0.16b, v1.16b
; CHECK-NEXT: orn v2.16b, v2.16b, v3.16b
; CHECK-NEXT: orn v0.16b, v0.16b, v1.16b
; CHECK-NEXT: uzp1 v0.4s, v0.4s, v2.4s
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}
Show All 25 Lines	entry:
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 2147483647, i64 2147483647, i64 2147483647, i64 2147483647>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 2147483647, i64 2147483647, i64 2147483647, i64 2147483647>
%1 = icmp sgt <4 x i64> %spec.store.select, <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%1 = icmp sgt <4 x i64> %spec.store.select, <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utesth_f16i32(<4 x half> %x) {		define <4 x i32> @utesth_f16i32(<4 x half> %x) {
; CHECK-CVT-LABEL: utesth_f16i32:		; CHECK-LABEL: utesth_f16i32:
; CHECK-CVT: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-CVT-NEXT: // kill: def $d0 killed $d0 def $q0		; CHECK-NEXT: fcvtl v0.4s, v0.4h
; CHECK-CVT-NEXT: mov h2, v0.h[2]		; CHECK-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-CVT-NEXT: mov h3, v0.h[3]		; CHECK-NEXT: ret
; CHECK-CVT-NEXT: mov h4, v0.h[1]
; CHECK-CVT-NEXT: fcvt s0, h0
; CHECK-CVT-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-CVT-NEXT: fcvt s2, h2
; CHECK-CVT-NEXT: fcvt s3, h3
; CHECK-CVT-NEXT: fcvtzu x9, s0
; CHECK-CVT-NEXT: fcvtzu x8, s2
; CHECK-CVT-NEXT: fcvt s2, h4
; CHECK-CVT-NEXT: fmov d0, x8
; CHECK-CVT-NEXT: fcvtzu x8, s3
; CHECK-CVT-NEXT: fmov d3, x9
; CHECK-CVT-NEXT: fcvtzu x9, s2
; CHECK-CVT-NEXT: mov v0.d[1], x8
; CHECK-CVT-NEXT: mov v3.d[1], x9
; CHECK-CVT-NEXT: cmhi v2.2d, v1.2d, v0.2d
; CHECK-CVT-NEXT: cmhi v1.2d, v1.2d, v3.2d
; CHECK-CVT-NEXT: and v0.16b, v0.16b, v2.16b
; CHECK-CVT-NEXT: and v3.16b, v3.16b, v1.16b
; CHECK-CVT-NEXT: orn v0.16b, v0.16b, v2.16b
; CHECK-CVT-NEXT: orn v1.16b, v3.16b, v1.16b
; CHECK-CVT-NEXT: uzp1 v0.4s, v1.4s, v0.4s
; CHECK-CVT-NEXT: ret
;
; CHECK-FP16-LABEL: utesth_f16i32:
; CHECK-FP16: // %bb.0: // %entry
; CHECK-FP16-NEXT: // kill: def $d0 killed $d0 def $q0
; CHECK-FP16-NEXT: mov h2, v0.h[2]
; CHECK-FP16-NEXT: mov h3, v0.h[3]
; CHECK-FP16-NEXT: fcvtzu x9, h0
; CHECK-FP16-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-FP16-NEXT: fcvtzu x8, h2
; CHECK-FP16-NEXT: mov h2, v0.h[1]
; CHECK-FP16-NEXT: fmov d0, x8
; CHECK-FP16-NEXT: fcvtzu x8, h3
; CHECK-FP16-NEXT: fmov d3, x9
; CHECK-FP16-NEXT: fcvtzu x9, h2
; CHECK-FP16-NEXT: mov v0.d[1], x8
; CHECK-FP16-NEXT: mov v3.d[1], x9
; CHECK-FP16-NEXT: cmhi v2.2d, v1.2d, v0.2d
; CHECK-FP16-NEXT: cmhi v1.2d, v1.2d, v3.2d
; CHECK-FP16-NEXT: and v0.16b, v0.16b, v2.16b
; CHECK-FP16-NEXT: and v3.16b, v3.16b, v1.16b
; CHECK-FP16-NEXT: orn v0.16b, v0.16b, v2.16b
; CHECK-FP16-NEXT: orn v1.16b, v3.16b, v1.16b
; CHECK-FP16-NEXT: uzp1 v0.4s, v1.4s, v0.4s
; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui <4 x half> %x to <4 x i64>		%conv = fptoui <4 x half> %x to <4 x i64>
%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	entry:
%spec.store.select = select <8 x i1> %0, <8 x i32> %conv, <8 x i32> <i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767>		%spec.store.select = select <8 x i1> %0, <8 x i32> %conv, <8 x i32> <i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767>
%1 = icmp sgt <8 x i32> %spec.store.select, <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>		%1 = icmp sgt <8 x i32> %spec.store.select, <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>
%spec.store.select7 = select <8 x i1> %1, <8 x i32> %spec.store.select, <8 x i32> <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>		%spec.store.select7 = select <8 x i1> %1, <8 x i32> %spec.store.select, <8 x i32> <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>
%conv6 = trunc <8 x i32> %spec.store.select7 to <8 x i16>		%conv6 = trunc <8 x i32> %spec.store.select7 to <8 x i16>
ret <8 x i16> %conv6		ret <8 x i16> %conv6
}		}

define <8 x i16> @utesth_f16i16(<8 x half> %x) {		define <8 x i16> @utesth_f16i16(<8 x half> %x) {
; CHECK-LABEL: utesth_f16i16:		; CHECK-CVT-LABEL: utesth_f16i16:
; CHECK: // %bb.0: // %entry		; CHECK-CVT: // %bb.0: // %entry
; CHECK-NEXT: fcvtl2 v2.4s, v0.8h		; CHECK-CVT-NEXT: fcvtl2 v2.4s, v0.8h
; CHECK-NEXT: fcvtl v0.4s, v0.4h		; CHECK-CVT-NEXT: fcvtl v0.4s, v0.4h
; CHECK-NEXT: movi v1.2d, #0x00ffff0000ffff		; CHECK-CVT-NEXT: movi v1.2d, #0x00ffff0000ffff
; CHECK-NEXT: fcvtzu v2.4s, v2.4s		; CHECK-CVT-NEXT: fcvtzu v2.4s, v2.4s
; CHECK-NEXT: fcvtzu v0.4s, v0.4s		; CHECK-CVT-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-NEXT: umin v2.4s, v2.4s, v1.4s		; CHECK-CVT-NEXT: umin v2.4s, v2.4s, v1.4s
; CHECK-NEXT: umin v0.4s, v0.4s, v1.4s		; CHECK-CVT-NEXT: umin v0.4s, v0.4s, v1.4s
; CHECK-NEXT: uzp1 v0.8h, v0.8h, v2.8h		; CHECK-CVT-NEXT: uzp1 v0.8h, v0.8h, v2.8h
; CHECK-NEXT: ret		; CHECK-CVT-NEXT: ret
		;
		; CHECK-FP16-LABEL: utesth_f16i16:
		; CHECK-FP16: // %bb.0: // %entry
		; CHECK-FP16-NEXT: fcvtzu v0.8h, v0.8h
		; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui <8 x half> %x to <8 x i32>		%conv = fptoui <8 x half> %x to <8 x i32>
%0 = icmp ult <8 x i32> %conv, <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>		%0 = icmp ult <8 x i32> %conv, <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>
%spec.store.select = select <8 x i1> %0, <8 x i32> %conv, <8 x i32> <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>		%spec.store.select = select <8 x i1> %0, <8 x i32> %conv, <8 x i32> <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>
%conv6 = trunc <8 x i32> %spec.store.select to <8 x i16>		%conv6 = trunc <8 x i32> %spec.store.select to <8 x i16>
ret <8 x i16> %conv6		ret <8 x i16> %conv6
}		}

▲ Show 20 Lines • Show All 393 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call <2 x i64> @llvm.smax.v2i64(<2 x i64> %spec.store.select, <2 x i64> <i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <2 x i64> @llvm.smax.v2i64(<2 x i64> %spec.store.select, <2 x i64> <i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <2 x i64> %spec.store.select7 to <2 x i32>		%conv6 = trunc <2 x i64> %spec.store.select7 to <2 x i32>
ret <2 x i32> %conv6		ret <2 x i32> %conv6
}		}

define <2 x i32> @utest_f64i32_mm(<2 x double> %x) {		define <2 x i32> @utest_f64i32_mm(<2 x double> %x) {
; CHECK-LABEL: utest_f64i32_mm:		; CHECK-LABEL: utest_f64i32_mm:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: movi v1.2d, #0x000000ffffffff		; CHECK-NEXT: mov d1, v0.d[1]
; CHECK-NEXT: fcvtzu v0.2d, v0.2d		; CHECK-NEXT: fcvtzu w8, d0
; CHECK-NEXT: cmhi v1.2d, v1.2d, v0.2d		; CHECK-NEXT: fmov s0, w8
; CHECK-NEXT: and v0.16b, v0.16b, v1.16b		; CHECK-NEXT: fcvtzu w8, d1
; CHECK-NEXT: orn v0.16b, v0.16b, v1.16b		; CHECK-NEXT: mov v0.s[1], w8
; CHECK-NEXT: xtn v0.2s, v0.2d		; CHECK-NEXT: // kill: def $d0 killed $d0 killed $q0
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui <2 x double> %x to <2 x i64>		%conv = fptoui <2 x double> %x to <2 x i64>
%spec.store.select = call <2 x i64> @llvm.umin.v2i64(<2 x i64> %conv, <2 x i64> <i64 4294967295, i64 4294967295>)		%spec.store.select = call <2 x i64> @llvm.umin.v2i64(<2 x i64> %conv, <2 x i64> <i64 4294967295, i64 4294967295>)
%conv6 = trunc <2 x i64> %spec.store.select to <2 x i32>		%conv6 = trunc <2 x i64> %spec.store.select to <2 x i32>
ret <2 x i32> %conv6		ret <2 x i32> %conv6
}		}

Show All 26 Lines	entry:
%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utest_f32i32_mm(<4 x float> %x) {		define <4 x i32> @utest_f32i32_mm(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32_mm:		; CHECK-LABEL: utest_f32i32_mm:
; CHECK: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-NEXT: fcvtl2 v2.2d, v0.4s		; CHECK-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-NEXT: fcvtl v0.2d, v0.2s
; CHECK-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-NEXT: fcvtzu v2.2d, v2.2d
; CHECK-NEXT: fcvtzu v0.2d, v0.2d
; CHECK-NEXT: cmhi v3.2d, v1.2d, v2.2d
; CHECK-NEXT: cmhi v1.2d, v1.2d, v0.2d
; CHECK-NEXT: and v2.16b, v2.16b, v3.16b
; CHECK-NEXT: and v0.16b, v0.16b, v1.16b
; CHECK-NEXT: orn v2.16b, v2.16b, v3.16b
; CHECK-NEXT: orn v0.16b, v0.16b, v1.16b
; CHECK-NEXT: uzp1 v0.4s, v0.4s, v2.4s
; CHECK-NEXT: ret		; CHECK-NEXT: ret
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)		%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

Show All 20 Lines	entry:
%conv = fptosi <4 x half> %x to <4 x i64>		%conv = fptosi <4 x half> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.smin.v4i64(<4 x i64> %conv, <4 x i64> <i64 2147483647, i64 2147483647, i64 2147483647, i64 2147483647>)		%spec.store.select = call <4 x i64> @llvm.smin.v4i64(<4 x i64> %conv, <4 x i64> <i64 2147483647, i64 2147483647, i64 2147483647, i64 2147483647>)
%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utesth_f16i32_mm(<4 x half> %x) {		define <4 x i32> @utesth_f16i32_mm(<4 x half> %x) {
; CHECK-CVT-LABEL: utesth_f16i32_mm:		; CHECK-LABEL: utesth_f16i32_mm:
; CHECK-CVT: // %bb.0: // %entry		; CHECK: // %bb.0: // %entry
; CHECK-CVT-NEXT: // kill: def $d0 killed $d0 def $q0		; CHECK-NEXT: fcvtl v0.4s, v0.4h
; CHECK-CVT-NEXT: mov h2, v0.h[2]		; CHECK-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-CVT-NEXT: mov h3, v0.h[3]		; CHECK-NEXT: ret
; CHECK-CVT-NEXT: mov h4, v0.h[1]
; CHECK-CVT-NEXT: fcvt s0, h0
; CHECK-CVT-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-CVT-NEXT: fcvt s2, h2
; CHECK-CVT-NEXT: fcvt s3, h3
; CHECK-CVT-NEXT: fcvtzu x9, s0
; CHECK-CVT-NEXT: fcvtzu x8, s2
; CHECK-CVT-NEXT: fcvt s2, h4
; CHECK-CVT-NEXT: fmov d0, x8
; CHECK-CVT-NEXT: fcvtzu x8, s3
; CHECK-CVT-NEXT: fmov d3, x9
; CHECK-CVT-NEXT: fcvtzu x9, s2
; CHECK-CVT-NEXT: mov v0.d[1], x8
; CHECK-CVT-NEXT: mov v3.d[1], x9
; CHECK-CVT-NEXT: cmhi v2.2d, v1.2d, v0.2d
; CHECK-CVT-NEXT: cmhi v1.2d, v1.2d, v3.2d
; CHECK-CVT-NEXT: and v0.16b, v0.16b, v2.16b
; CHECK-CVT-NEXT: and v3.16b, v3.16b, v1.16b
; CHECK-CVT-NEXT: orn v0.16b, v0.16b, v2.16b
; CHECK-CVT-NEXT: orn v1.16b, v3.16b, v1.16b
; CHECK-CVT-NEXT: uzp1 v0.4s, v1.4s, v0.4s
; CHECK-CVT-NEXT: ret
;
; CHECK-FP16-LABEL: utesth_f16i32_mm:
; CHECK-FP16: // %bb.0: // %entry
; CHECK-FP16-NEXT: // kill: def $d0 killed $d0 def $q0
; CHECK-FP16-NEXT: mov h2, v0.h[2]
; CHECK-FP16-NEXT: mov h3, v0.h[3]
; CHECK-FP16-NEXT: fcvtzu x9, h0
; CHECK-FP16-NEXT: movi v1.2d, #0x000000ffffffff
; CHECK-FP16-NEXT: fcvtzu x8, h2
; CHECK-FP16-NEXT: mov h2, v0.h[1]
; CHECK-FP16-NEXT: fmov d0, x8
; CHECK-FP16-NEXT: fcvtzu x8, h3
; CHECK-FP16-NEXT: fmov d3, x9
; CHECK-FP16-NEXT: fcvtzu x9, h2
; CHECK-FP16-NEXT: mov v0.d[1], x8
; CHECK-FP16-NEXT: mov v3.d[1], x9
; CHECK-FP16-NEXT: cmhi v2.2d, v1.2d, v0.2d
; CHECK-FP16-NEXT: cmhi v1.2d, v1.2d, v3.2d
; CHECK-FP16-NEXT: and v0.16b, v0.16b, v2.16b
; CHECK-FP16-NEXT: and v3.16b, v3.16b, v1.16b
; CHECK-FP16-NEXT: orn v0.16b, v0.16b, v2.16b
; CHECK-FP16-NEXT: orn v1.16b, v3.16b, v1.16b
; CHECK-FP16-NEXT: uzp1 v0.4s, v1.4s, v0.4s
; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui <4 x half> %x to <4 x i64>		%conv = fptoui <4 x half> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)		%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @ustest_f16i32_mm(<4 x half> %x) {		define <4 x i32> @ustest_f16i32_mm(<4 x half> %x) {
▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	entry:
%conv = fptosi <8 x half> %x to <8 x i32>		%conv = fptosi <8 x half> %x to <8 x i32>
%spec.store.select = call <8 x i32> @llvm.smin.v8i32(<8 x i32> %conv, <8 x i32> <i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767>)		%spec.store.select = call <8 x i32> @llvm.smin.v8i32(<8 x i32> %conv, <8 x i32> <i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767, i32 32767>)
%spec.store.select7 = call <8 x i32> @llvm.smax.v8i32(<8 x i32> %spec.store.select, <8 x i32> <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>)		%spec.store.select7 = call <8 x i32> @llvm.smax.v8i32(<8 x i32> %spec.store.select, <8 x i32> <i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768, i32 -32768>)
%conv6 = trunc <8 x i32> %spec.store.select7 to <8 x i16>		%conv6 = trunc <8 x i32> %spec.store.select7 to <8 x i16>
ret <8 x i16> %conv6		ret <8 x i16> %conv6
}		}

define <8 x i16> @utesth_f16i16_mm(<8 x half> %x) {		define <8 x i16> @utesth_f16i16_mm(<8 x half> %x) {
; CHECK-LABEL: utesth_f16i16_mm:		; CHECK-CVT-LABEL: utesth_f16i16_mm:
; CHECK: // %bb.0: // %entry		; CHECK-CVT: // %bb.0: // %entry
; CHECK-NEXT: fcvtl2 v2.4s, v0.8h		; CHECK-CVT-NEXT: fcvtl2 v2.4s, v0.8h
; CHECK-NEXT: fcvtl v0.4s, v0.4h		; CHECK-CVT-NEXT: fcvtl v0.4s, v0.4h
; CHECK-NEXT: movi v1.2d, #0x00ffff0000ffff		; CHECK-CVT-NEXT: movi v1.2d, #0x00ffff0000ffff
; CHECK-NEXT: fcvtzu v2.4s, v2.4s		; CHECK-CVT-NEXT: fcvtzu v2.4s, v2.4s
; CHECK-NEXT: fcvtzu v0.4s, v0.4s		; CHECK-CVT-NEXT: fcvtzu v0.4s, v0.4s
; CHECK-NEXT: umin v2.4s, v2.4s, v1.4s		; CHECK-CVT-NEXT: umin v2.4s, v2.4s, v1.4s
; CHECK-NEXT: umin v0.4s, v0.4s, v1.4s		; CHECK-CVT-NEXT: umin v0.4s, v0.4s, v1.4s
; CHECK-NEXT: uzp1 v0.8h, v0.8h, v2.8h		; CHECK-CVT-NEXT: uzp1 v0.8h, v0.8h, v2.8h
; CHECK-NEXT: ret		; CHECK-CVT-NEXT: ret
		;
		; CHECK-FP16-LABEL: utesth_f16i16_mm:
		; CHECK-FP16: // %bb.0: // %entry
		; CHECK-FP16-NEXT: fcvtzu v0.8h, v0.8h
		; CHECK-FP16-NEXT: ret
entry:		entry:
%conv = fptoui <8 x half> %x to <8 x i32>		%conv = fptoui <8 x half> %x to <8 x i32>
%spec.store.select = call <8 x i32> @llvm.umin.v8i32(<8 x i32> %conv, <8 x i32> <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>)		%spec.store.select = call <8 x i32> @llvm.umin.v8i32(<8 x i32> %conv, <8 x i32> <i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535, i32 65535>)
%conv6 = trunc <8 x i32> %spec.store.select to <8 x i16>		%conv6 = trunc <8 x i32> %spec.store.select to <8 x i16>
ret <8 x i16> %conv6		ret <8 x i16> %conv6
}		}

define <8 x i16> @ustest_f16i16_mm(<8 x half> %x) {		define <8 x i16> @ustest_f16i16_mm(<8 x half> %x) {
▲ Show 20 Lines • Show All 370 Lines • Show Last 20 Lines

llvm/test/CodeGen/ARM/fpclamptosat.ll

	Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: adds r3, r0, #1			; SOFT-NEXT: adds r3, r0, #1
	; SOFT-NEXT: sbcs r1, r2			; SOFT-NEXT: sbcs r1, r2
	; SOFT-NEXT: blo .LBB1_2			; SOFT-NEXT: blo .LBB1_2
	; SOFT-NEXT: @ %bb.1: @ %entry			; SOFT-NEXT: @ %bb.1: @ %entry
	; SOFT-NEXT: mvns r0, r2			; SOFT-NEXT: mvns r0, r2
	; SOFT-NEXT: .LBB1_2: @ %entry			; SOFT-NEXT: .LBB1_2: @ %entry
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP-LABEL: utest_f64i32:			; VFP2-LABEL: utest_f64i32:
	; VFP: @ %bb.0: @ %entry			; VFP2: @ %bb.0: @ %entry
	; VFP-NEXT: .save {r7, lr}			; VFP2-NEXT: .save {r7, lr}
	; VFP-NEXT: push {r7, lr}			; VFP2-NEXT: push {r7, lr}
	; VFP-NEXT: vmov r0, r1, d0			; VFP2-NEXT: vmov r0, r1, d0
	; VFP-NEXT: bl __aeabi_d2ulz			; VFP2-NEXT: bl __aeabi_d2ulz
	; VFP-NEXT: subs.w r2, r0, #-1			; VFP2-NEXT: subs.w r2, r0, #-1
	; VFP-NEXT: sbcs r1, r1, #0			; VFP2-NEXT: sbcs r1, r1, #0
	; VFP-NEXT: it hs			; VFP2-NEXT: it hs
	; VFP-NEXT: movhs.w r0, #-1			; VFP2-NEXT: movhs.w r0, #-1
	; VFP-NEXT: pop {r7, pc}			; VFP2-NEXT: pop {r7, pc}
				;
				; FULL-LABEL: utest_f64i32:
				; FULL: @ %bb.0: @ %entry
				; FULL-NEXT: vcvt.u32.f64 s0, d0
				; FULL-NEXT: vmov r0, s0
				; FULL-NEXT: bx lr
	entry:			entry:
	%conv = fptoui double %x to i64			%conv = fptoui double %x to i64
	%0 = icmp ult i64 %conv, 4294967295			%0 = icmp ult i64 %conv, 4294967295
	%spec.store.select = select i1 %0, i64 %conv, i64 4294967295			%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	▲ Show 20 Lines • Show All 158 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: blo .LBB4_2			; SOFT-NEXT: blo .LBB4_2
	; SOFT-NEXT: @ %bb.1: @ %entry			; SOFT-NEXT: @ %bb.1: @ %entry
	; SOFT-NEXT: mvns r0, r2			; SOFT-NEXT: mvns r0, r2
	; SOFT-NEXT: .LBB4_2: @ %entry			; SOFT-NEXT: .LBB4_2: @ %entry
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP-LABEL: utest_f32i32:			; VFP-LABEL: utest_f32i32:
	; VFP: @ %bb.0: @ %entry			; VFP: @ %bb.0: @ %entry
	; VFP-NEXT: .save {r7, lr}			; VFP-NEXT: vcvt.u32.f32 s0, s0
	; VFP-NEXT: push {r7, lr}
	; VFP-NEXT: vmov r0, s0			; VFP-NEXT: vmov r0, s0
	; VFP-NEXT: bl __aeabi_f2ulz			; VFP-NEXT: bx lr
	; VFP-NEXT: subs.w r2, r0, #-1
	; VFP-NEXT: sbcs r1, r1, #0
	; VFP-NEXT: it hs
	; VFP-NEXT: movhs.w r0, #-1
	; VFP-NEXT: pop {r7, pc}
	entry:			entry:
	%conv = fptoui float %x to i64			%conv = fptoui float %x to i64
	%0 = icmp ult i64 %conv, 4294967295			%0 = icmp ult i64 %conv, 4294967295
	%spec.store.select = select i1 %0, i64 %conv, i64 4294967295			%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	▲ Show 20 Lines • Show All 152 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP2-LABEL: utesth_f16i32:			; VFP2-LABEL: utesth_f16i32:
	; VFP2: @ %bb.0: @ %entry			; VFP2: @ %bb.0: @ %entry
	; VFP2-NEXT: .save {r7, lr}			; VFP2-NEXT: .save {r7, lr}
	; VFP2-NEXT: push {r7, lr}			; VFP2-NEXT: push {r7, lr}
	; VFP2-NEXT: vmov r0, s0			; VFP2-NEXT: vmov r0, s0
	; VFP2-NEXT: bl __aeabi_h2f			; VFP2-NEXT: bl __aeabi_h2f
	; VFP2-NEXT: bl __aeabi_f2ulz			; VFP2-NEXT: vmov s0, r0
	; VFP2-NEXT: subs.w r2, r0, #-1			; VFP2-NEXT: vcvt.u32.f32 s0, s0
	; VFP2-NEXT: sbcs r1, r1, #0			; VFP2-NEXT: vmov r0, s0
	; VFP2-NEXT: it hs
	; VFP2-NEXT: movhs.w r0, #-1
	; VFP2-NEXT: pop {r7, pc}			; VFP2-NEXT: pop {r7, pc}
	;			;
	; FULL-LABEL: utesth_f16i32:			; FULL-LABEL: utesth_f16i32:
	; FULL: @ %bb.0: @ %entry			; FULL: @ %bb.0: @ %entry
	; FULL-NEXT: .save {r7, lr}			; FULL-NEXT: vcvt.u32.f16 s0, s0
	; FULL-NEXT: push {r7, lr}			; FULL-NEXT: vmov r0, s0
	; FULL-NEXT: vmov.f16 r0, s0			; FULL-NEXT: bx lr
	; FULL-NEXT: vmov s0, r0
	; FULL-NEXT: bl __fixunshfdi
	; FULL-NEXT: subs.w r2, r0, #-1
	; FULL-NEXT: sbcs r1, r1, #0
	; FULL-NEXT: it hs
	; FULL-NEXT: movhs.w r0, #-1
	; FULL-NEXT: pop {r7, pc}
	entry:			entry:
	%conv = fptoui half %x to i64			%conv = fptoui half %x to i64
	%0 = icmp ult i64 %conv, 4294967295			%0 = icmp ult i64 %conv, 4294967295
	%spec.store.select = select i1 %0, i64 %conv, i64 4294967295			%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	▲ Show 20 Lines • Show All 1,739 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: cmp r1, #0			; SOFT-NEXT: cmp r1, #0
	; SOFT-NEXT: beq .LBB28_2			; SOFT-NEXT: beq .LBB28_2
	; SOFT-NEXT: @ %bb.1: @ %entry			; SOFT-NEXT: @ %bb.1: @ %entry
	; SOFT-NEXT: movs r0, #0			; SOFT-NEXT: movs r0, #0
	; SOFT-NEXT: mvns r0, r0			; SOFT-NEXT: mvns r0, r0
	; SOFT-NEXT: .LBB28_2: @ %entry			; SOFT-NEXT: .LBB28_2: @ %entry
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP-LABEL: utest_f64i32_mm:			; VFP2-LABEL: utest_f64i32_mm:
	; VFP: @ %bb.0: @ %entry			; VFP2: @ %bb.0: @ %entry
	; VFP-NEXT: .save {r7, lr}			; VFP2-NEXT: .save {r7, lr}
	; VFP-NEXT: push {r7, lr}			; VFP2-NEXT: push {r7, lr}
	; VFP-NEXT: vmov r0, r1, d0			; VFP2-NEXT: vmov r0, r1, d0
	; VFP-NEXT: bl __aeabi_d2ulz			; VFP2-NEXT: bl __aeabi_d2ulz
	; VFP-NEXT: cmp r1, #0			; VFP2-NEXT: cmp r1, #0
	; VFP-NEXT: it ne			; VFP2-NEXT: it ne
	; VFP-NEXT: movne.w r0, #-1			; VFP2-NEXT: movne.w r0, #-1
	; VFP-NEXT: pop {r7, pc}			; VFP2-NEXT: pop {r7, pc}
				;
				; FULL-LABEL: utest_f64i32_mm:
				; FULL: @ %bb.0: @ %entry
				; FULL-NEXT: vcvt.u32.f64 s0, d0
				; FULL-NEXT: vmov r0, s0
				; FULL-NEXT: bx lr
	entry:			entry:
	%conv = fptoui double %x to i64			%conv = fptoui double %x to i64
	%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)			%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	define i32 @ustest_f64i32_mm(double %x) {			define i32 @ustest_f64i32_mm(double %x) {
	▲ Show 20 Lines • Show All 163 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: @ %bb.1: @ %entry			; SOFT-NEXT: @ %bb.1: @ %entry
	; SOFT-NEXT: movs r0, #0			; SOFT-NEXT: movs r0, #0
	; SOFT-NEXT: mvns r0, r0			; SOFT-NEXT: mvns r0, r0
	; SOFT-NEXT: .LBB31_2: @ %entry			; SOFT-NEXT: .LBB31_2: @ %entry
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP-LABEL: utest_f32i32_mm:			; VFP-LABEL: utest_f32i32_mm:
	; VFP: @ %bb.0: @ %entry			; VFP: @ %bb.0: @ %entry
	; VFP-NEXT: .save {r7, lr}			; VFP-NEXT: vcvt.u32.f32 s0, s0
	; VFP-NEXT: push {r7, lr}
	; VFP-NEXT: vmov r0, s0			; VFP-NEXT: vmov r0, s0
	; VFP-NEXT: bl __aeabi_f2ulz			; VFP-NEXT: bx lr
	; VFP-NEXT: cmp r1, #0
	; VFP-NEXT: it ne
	; VFP-NEXT: movne.w r0, #-1
	; VFP-NEXT: pop {r7, pc}
	entry:			entry:
	%conv = fptoui float %x to i64			%conv = fptoui float %x to i64
	%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)			%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	define i32 @ustest_f32i32_mm(float %x) {			define i32 @ustest_f32i32_mm(float %x) {
	▲ Show 20 Lines • Show All 156 Lines • ▼ Show 20 Lines
	; SOFT-NEXT: pop {r7, pc}			; SOFT-NEXT: pop {r7, pc}
	;			;
	; VFP2-LABEL: utesth_f16i32_mm:			; VFP2-LABEL: utesth_f16i32_mm:
	; VFP2: @ %bb.0: @ %entry			; VFP2: @ %bb.0: @ %entry
	; VFP2-NEXT: .save {r7, lr}			; VFP2-NEXT: .save {r7, lr}
	; VFP2-NEXT: push {r7, lr}			; VFP2-NEXT: push {r7, lr}
	; VFP2-NEXT: vmov r0, s0			; VFP2-NEXT: vmov r0, s0
	; VFP2-NEXT: bl __aeabi_h2f			; VFP2-NEXT: bl __aeabi_h2f
	; VFP2-NEXT: bl __aeabi_f2ulz			; VFP2-NEXT: vmov s0, r0
	; VFP2-NEXT: cmp r1, #0			; VFP2-NEXT: vcvt.u32.f32 s0, s0
	; VFP2-NEXT: it ne			; VFP2-NEXT: vmov r0, s0
	; VFP2-NEXT: movne.w r0, #-1
	; VFP2-NEXT: pop {r7, pc}			; VFP2-NEXT: pop {r7, pc}
	;			;
	; FULL-LABEL: utesth_f16i32_mm:			; FULL-LABEL: utesth_f16i32_mm:
	; FULL: @ %bb.0: @ %entry			; FULL: @ %bb.0: @ %entry
	; FULL-NEXT: .save {r7, lr}			; FULL-NEXT: vcvt.u32.f16 s0, s0
	; FULL-NEXT: push {r7, lr}			; FULL-NEXT: vmov r0, s0
	; FULL-NEXT: vmov.f16 r0, s0			; FULL-NEXT: bx lr
	; FULL-NEXT: vmov s0, r0
	; FULL-NEXT: bl __fixunshfdi
	; FULL-NEXT: cmp r1, #0
	; FULL-NEXT: it ne
	; FULL-NEXT: movne.w r0, #-1
	; FULL-NEXT: pop {r7, pc}
	entry:			entry:
	%conv = fptoui half %x to i64			%conv = fptoui half %x to i64
	%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)			%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
	%conv6 = trunc i64 %spec.store.select to i32			%conv6 = trunc i64 %spec.store.select to i32
	ret i32 %conv6			ret i32 %conv6
	}			}

	define i32 @ustest_f16i32_mm(half %x) {			define i32 @ustest_f16i32_mm(half %x) {
	▲ Show 20 Lines • Show All 2,801 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/fpclamptosat.ll

Show First 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	entry:
%spec.store.select = select i1 %0, i64 %conv, i64 2147483647		%spec.store.select = select i1 %0, i64 %conv, i64 2147483647
%1 = icmp sgt i64 %spec.store.select, -2147483648		%1 = icmp sgt i64 %spec.store.select, -2147483648
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32(double %x) {		define i32 @utest_f64i32(double %x) {
; RV32-LABEL: utest_f64i32:		; RV32IF-LABEL: utest_f64i32:
; RV32: # %bb.0: # %entry		; RV32IF: # %bb.0: # %entry
; RV32-NEXT: addi sp, sp, -16		; RV32IF-NEXT: addi sp, sp, -16
; RV32-NEXT: .cfi_def_cfa_offset 16		; RV32IF-NEXT: .cfi_def_cfa_offset 16
; RV32-NEXT: sw ra, 12(sp) # 4-byte Folded Spill		; RV32IF-NEXT: sw ra, 12(sp) # 4-byte Folded Spill
; RV32-NEXT: .cfi_offset ra, -4		; RV32IF-NEXT: .cfi_offset ra, -4
; RV32-NEXT: call __fixunsdfdi@plt		; RV32IF-NEXT: call __fixunsdfdi@plt
; RV32-NEXT: beqz a1, .LBB1_2		; RV32IF-NEXT: beqz a1, .LBB1_2
; RV32-NEXT: # %bb.1: # %entry		; RV32IF-NEXT: # %bb.1: # %entry
; RV32-NEXT: li a1, 0		; RV32IF-NEXT: li a1, 0
; RV32-NEXT: beqz a1, .LBB1_3		; RV32IF-NEXT: beqz a1, .LBB1_3
; RV32-NEXT: j .LBB1_4		; RV32IF-NEXT: j .LBB1_4
; RV32-NEXT: .LBB1_2:		; RV32IF-NEXT: .LBB1_2:
; RV32-NEXT: addi a1, a0, 1		; RV32IF-NEXT: addi a1, a0, 1
; RV32-NEXT: snez a1, a1		; RV32IF-NEXT: snez a1, a1
; RV32-NEXT: bnez a1, .LBB1_4		; RV32IF-NEXT: bnez a1, .LBB1_4
; RV32-NEXT: .LBB1_3: # %entry		; RV32IF-NEXT: .LBB1_3: # %entry
; RV32-NEXT: li a0, -1		; RV32IF-NEXT: li a0, -1
; RV32-NEXT: .LBB1_4: # %entry		; RV32IF-NEXT: .LBB1_4: # %entry
; RV32-NEXT: lw ra, 12(sp) # 4-byte Folded Reload		; RV32IF-NEXT: lw ra, 12(sp) # 4-byte Folded Reload
; RV32-NEXT: addi sp, sp, 16		; RV32IF-NEXT: addi sp, sp, 16
; RV32-NEXT: ret		; RV32IF-NEXT: ret
;		;
; RV64IF-LABEL: utest_f64i32:		; RV64IF-LABEL: utest_f64i32:
; RV64IF: # %bb.0: # %entry		; RV64IF: # %bb.0: # %entry
; RV64IF-NEXT: addi sp, sp, -16		; RV64IF-NEXT: addi sp, sp, -16
; RV64IF-NEXT: .cfi_def_cfa_offset 16		; RV64IF-NEXT: .cfi_def_cfa_offset 16
; RV64IF-NEXT: sd ra, 8(sp) # 8-byte Folded Spill		; RV64IF-NEXT: sd ra, 8(sp) # 8-byte Folded Spill
; RV64IF-NEXT: .cfi_offset ra, -8		; RV64IF-NEXT: .cfi_offset ra, -8
; RV64IF-NEXT: call __fixunsdfdi@plt		; RV64IF-NEXT: call __fixunsdfdi@plt
; RV64IF-NEXT: li a1, -1		; RV64IF-NEXT: li a1, -1
; RV64IF-NEXT: srli a1, a1, 32		; RV64IF-NEXT: srli a1, a1, 32
; RV64IF-NEXT: bltu a0, a1, .LBB1_2		; RV64IF-NEXT: bltu a0, a1, .LBB1_2
; RV64IF-NEXT: # %bb.1: # %entry		; RV64IF-NEXT: # %bb.1: # %entry
; RV64IF-NEXT: mv a0, a1		; RV64IF-NEXT: mv a0, a1
; RV64IF-NEXT: .LBB1_2: # %entry		; RV64IF-NEXT: .LBB1_2: # %entry
; RV64IF-NEXT: ld ra, 8(sp) # 8-byte Folded Reload		; RV64IF-NEXT: ld ra, 8(sp) # 8-byte Folded Reload
; RV64IF-NEXT: addi sp, sp, 16		; RV64IF-NEXT: addi sp, sp, 16
; RV64IF-NEXT: ret		; RV64IF-NEXT: ret
;		;
		; RV32IFD-LABEL: utest_f64i32:
		; RV32IFD: # %bb.0: # %entry
		; RV32IFD-NEXT: addi sp, sp, -16
		; RV32IFD-NEXT: .cfi_def_cfa_offset 16
		; RV32IFD-NEXT: sw a0, 8(sp)
		; RV32IFD-NEXT: sw a1, 12(sp)
		; RV32IFD-NEXT: fld ft0, 8(sp)
		; RV32IFD-NEXT: feq.d a0, ft0, ft0
		; RV32IFD-NEXT: bnez a0, .LBB1_2
		; RV32IFD-NEXT: # %bb.1: # %entry
		; RV32IFD-NEXT: li a0, 0
		; RV32IFD-NEXT: addi sp, sp, 16
		; RV32IFD-NEXT: ret
		; RV32IFD-NEXT: .LBB1_2:
		; RV32IFD-NEXT: fcvt.wu.d a0, ft0, rtz
		; RV32IFD-NEXT: addi sp, sp, 16
		; RV32IFD-NEXT: ret
		;
; RV64IFD-LABEL: utest_f64i32:		; RV64IFD-LABEL: utest_f64i32:
; RV64IFD: # %bb.0: # %entry		; RV64IFD: # %bb.0: # %entry
; RV64IFD-NEXT: fmv.d.x ft0, a0		; RV64IFD-NEXT: fmv.d.x ft0, a0
; RV64IFD-NEXT: fcvt.lu.d a0, ft0, rtz		; RV64IFD-NEXT: fcvt.lu.d a0, ft0, rtz
; RV64IFD-NEXT: li a1, -1		; RV64IFD-NEXT: li a1, -1
; RV64IFD-NEXT: srli a1, a1, 32		; RV64IFD-NEXT: srli a1, a1, 32
; RV64IFD-NEXT: bltu a0, a1, .LBB1_2		; RV64IFD-NEXT: bltu a0, a1, .LBB1_2
; RV64IFD-NEXT: # %bb.1: # %entry		; RV64IFD-NEXT: # %bb.1: # %entry
▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32(float %x) {		define i32 @utest_f32i32(float %x) {
; RV32-LABEL: utest_f32i32:		; RV32-LABEL: utest_f32i32:
; RV32: # %bb.0: # %entry		; RV32: # %bb.0: # %entry
; RV32-NEXT: addi sp, sp, -16		; RV32-NEXT: fmv.w.x ft0, a0
; RV32-NEXT: .cfi_def_cfa_offset 16		; RV32-NEXT: feq.s a0, ft0, ft0
; RV32-NEXT: sw ra, 12(sp) # 4-byte Folded Spill		; RV32-NEXT: bnez a0, .LBB4_2
; RV32-NEXT: .cfi_offset ra, -4
; RV32-NEXT: call __fixunssfdi@plt
; RV32-NEXT: beqz a1, .LBB4_2
; RV32-NEXT: # %bb.1: # %entry		; RV32-NEXT: # %bb.1: # %entry
; RV32-NEXT: li a1, 0		; RV32-NEXT: li a0, 0
; RV32-NEXT: beqz a1, .LBB4_3		; RV32-NEXT: ret
; RV32-NEXT: j .LBB4_4
; RV32-NEXT: .LBB4_2:		; RV32-NEXT: .LBB4_2:
; RV32-NEXT: addi a1, a0, 1		; RV32-NEXT: fcvt.wu.s a0, ft0, rtz
; RV32-NEXT: snez a1, a1
; RV32-NEXT: bnez a1, .LBB4_4
; RV32-NEXT: .LBB4_3: # %entry
; RV32-NEXT: li a0, -1
; RV32-NEXT: .LBB4_4: # %entry
; RV32-NEXT: lw ra, 12(sp) # 4-byte Folded Reload
; RV32-NEXT: addi sp, sp, 16
; RV32-NEXT: ret		; RV32-NEXT: ret
;		;
; RV64-LABEL: utest_f32i32:		; RV64-LABEL: utest_f32i32:
; RV64: # %bb.0: # %entry		; RV64: # %bb.0: # %entry
; RV64-NEXT: fmv.w.x ft0, a0		; RV64-NEXT: fmv.w.x ft0, a0
; RV64-NEXT: fcvt.lu.s a0, ft0, rtz		; RV64-NEXT: fcvt.lu.s a0, ft0, rtz
; RV64-NEXT: li a1, -1		; RV64-NEXT: li a1, -1
; RV64-NEXT: srli a1, a1, 32		; RV64-NEXT: srli a1, a1, 32
▲ Show 20 Lines • Show All 1,717 Lines • ▼ Show 20 Lines	entry:
%conv = fptosi double %x to i64		%conv = fptosi double %x to i64
%spec.store.select = call i64 @llvm.smin.i64(i64 %conv, i64 2147483647)		%spec.store.select = call i64 @llvm.smin.i64(i64 %conv, i64 2147483647)
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32_mm(double %x) {		define i32 @utest_f64i32_mm(double %x) {
; RV32-LABEL: utest_f64i32_mm:		; RV32IF-LABEL: utest_f64i32_mm:
; RV32: # %bb.0: # %entry		; RV32IF: # %bb.0: # %entry
; RV32-NEXT: addi sp, sp, -16		; RV32IF-NEXT: addi sp, sp, -16
; RV32-NEXT: .cfi_def_cfa_offset 16		; RV32IF-NEXT: .cfi_def_cfa_offset 16
; RV32-NEXT: sw ra, 12(sp) # 4-byte Folded Spill		; RV32IF-NEXT: sw ra, 12(sp) # 4-byte Folded Spill
; RV32-NEXT: .cfi_offset ra, -4		; RV32IF-NEXT: .cfi_offset ra, -4
; RV32-NEXT: call __fixunsdfdi@plt		; RV32IF-NEXT: call __fixunsdfdi@plt
; RV32-NEXT: beqz a1, .LBB28_2		; RV32IF-NEXT: beqz a1, .LBB28_2
; RV32-NEXT: # %bb.1: # %entry		; RV32IF-NEXT: # %bb.1: # %entry
; RV32-NEXT: li a0, -1		; RV32IF-NEXT: li a0, -1
; RV32-NEXT: .LBB28_2: # %entry		; RV32IF-NEXT: .LBB28_2: # %entry
; RV32-NEXT: lw ra, 12(sp) # 4-byte Folded Reload		; RV32IF-NEXT: lw ra, 12(sp) # 4-byte Folded Reload
; RV32-NEXT: addi sp, sp, 16		; RV32IF-NEXT: addi sp, sp, 16
; RV32-NEXT: ret		; RV32IF-NEXT: ret
;		;
; RV64IF-LABEL: utest_f64i32_mm:		; RV64IF-LABEL: utest_f64i32_mm:
; RV64IF: # %bb.0: # %entry		; RV64IF: # %bb.0: # %entry
; RV64IF-NEXT: addi sp, sp, -16		; RV64IF-NEXT: addi sp, sp, -16
; RV64IF-NEXT: .cfi_def_cfa_offset 16		; RV64IF-NEXT: .cfi_def_cfa_offset 16
; RV64IF-NEXT: sd ra, 8(sp) # 8-byte Folded Spill		; RV64IF-NEXT: sd ra, 8(sp) # 8-byte Folded Spill
; RV64IF-NEXT: .cfi_offset ra, -8		; RV64IF-NEXT: .cfi_offset ra, -8
; RV64IF-NEXT: call __fixunsdfdi@plt		; RV64IF-NEXT: call __fixunsdfdi@plt
; RV64IF-NEXT: li a1, -1		; RV64IF-NEXT: li a1, -1
; RV64IF-NEXT: srli a1, a1, 32		; RV64IF-NEXT: srli a1, a1, 32
; RV64IF-NEXT: bltu a0, a1, .LBB28_2		; RV64IF-NEXT: bltu a0, a1, .LBB28_2
; RV64IF-NEXT: # %bb.1: # %entry		; RV64IF-NEXT: # %bb.1: # %entry
; RV64IF-NEXT: mv a0, a1		; RV64IF-NEXT: mv a0, a1
; RV64IF-NEXT: .LBB28_2: # %entry		; RV64IF-NEXT: .LBB28_2: # %entry
; RV64IF-NEXT: ld ra, 8(sp) # 8-byte Folded Reload		; RV64IF-NEXT: ld ra, 8(sp) # 8-byte Folded Reload
; RV64IF-NEXT: addi sp, sp, 16		; RV64IF-NEXT: addi sp, sp, 16
; RV64IF-NEXT: ret		; RV64IF-NEXT: ret
;		;
		; RV32IFD-LABEL: utest_f64i32_mm:
		; RV32IFD: # %bb.0: # %entry
		; RV32IFD-NEXT: addi sp, sp, -16
		; RV32IFD-NEXT: .cfi_def_cfa_offset 16
		; RV32IFD-NEXT: sw a0, 8(sp)
		; RV32IFD-NEXT: sw a1, 12(sp)
		; RV32IFD-NEXT: fld ft0, 8(sp)
		; RV32IFD-NEXT: feq.d a0, ft0, ft0
		; RV32IFD-NEXT: bnez a0, .LBB28_2
		; RV32IFD-NEXT: # %bb.1: # %entry
		; RV32IFD-NEXT: li a0, 0
		; RV32IFD-NEXT: addi sp, sp, 16
		; RV32IFD-NEXT: ret
		; RV32IFD-NEXT: .LBB28_2:
		; RV32IFD-NEXT: fcvt.wu.d a0, ft0, rtz
		; RV32IFD-NEXT: addi sp, sp, 16
		; RV32IFD-NEXT: ret
		;
; RV64IFD-LABEL: utest_f64i32_mm:		; RV64IFD-LABEL: utest_f64i32_mm:
; RV64IFD: # %bb.0: # %entry		; RV64IFD: # %bb.0: # %entry
; RV64IFD-NEXT: fmv.d.x ft0, a0		; RV64IFD-NEXT: fmv.d.x ft0, a0
; RV64IFD-NEXT: fcvt.lu.d a0, ft0, rtz		; RV64IFD-NEXT: fcvt.lu.d a0, ft0, rtz
; RV64IFD-NEXT: li a1, -1		; RV64IFD-NEXT: li a1, -1
; RV64IFD-NEXT: srli a1, a1, 32		; RV64IFD-NEXT: srli a1, a1, 32
; RV64IFD-NEXT: bltu a0, a1, .LBB28_2		; RV64IFD-NEXT: bltu a0, a1, .LBB28_2
; RV64IFD-NEXT: # %bb.1: # %entry		; RV64IFD-NEXT: # %bb.1: # %entry
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32_mm(float %x) {		define i32 @utest_f32i32_mm(float %x) {
; RV32-LABEL: utest_f32i32_mm:		; RV32-LABEL: utest_f32i32_mm:
; RV32: # %bb.0: # %entry		; RV32: # %bb.0: # %entry
; RV32-NEXT: addi sp, sp, -16		; RV32-NEXT: fmv.w.x ft0, a0
; RV32-NEXT: .cfi_def_cfa_offset 16		; RV32-NEXT: feq.s a0, ft0, ft0
; RV32-NEXT: sw ra, 12(sp) # 4-byte Folded Spill		; RV32-NEXT: bnez a0, .LBB31_2
; RV32-NEXT: .cfi_offset ra, -4
; RV32-NEXT: call __fixunssfdi@plt
; RV32-NEXT: beqz a1, .LBB31_2
; RV32-NEXT: # %bb.1: # %entry		; RV32-NEXT: # %bb.1: # %entry
; RV32-NEXT: li a0, -1		; RV32-NEXT: li a0, 0
; RV32-NEXT: .LBB31_2: # %entry		; RV32-NEXT: ret
; RV32-NEXT: lw ra, 12(sp) # 4-byte Folded Reload		; RV32-NEXT: .LBB31_2:
; RV32-NEXT: addi sp, sp, 16		; RV32-NEXT: fcvt.wu.s a0, ft0, rtz
; RV32-NEXT: ret		; RV32-NEXT: ret
;		;
; RV64-LABEL: utest_f32i32_mm:		; RV64-LABEL: utest_f32i32_mm:
; RV64: # %bb.0: # %entry		; RV64: # %bb.0: # %entry
; RV64-NEXT: fmv.w.x ft0, a0		; RV64-NEXT: fmv.w.x ft0, a0
; RV64-NEXT: fcvt.lu.s a0, ft0, rtz		; RV64-NEXT: fcvt.lu.s a0, ft0, rtz
; RV64-NEXT: li a1, -1		; RV64-NEXT: li a1, -1
; RV64-NEXT: srli a1, a1, 32		; RV64-NEXT: srli a1, a1, 32
▲ Show 20 Lines • Show All 2,012 Lines • Show Last 20 Lines

llvm/test/CodeGen/Thumb2/mve-fpclamptosat_vec.ll

Show First 20 Lines • Show All 177 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define arm_aapcs_vfpcc <4 x i32> @utest_f32i32(<4 x float> %x) {		define arm_aapcs_vfpcc <4 x i32> @utest_f32i32(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32:		; CHECK-LABEL: utest_f32i32:
; CHECK: @ %bb.0: @ %entry		; CHECK: @ %bb.0: @ %entry
; CHECK-NEXT: .save {r4, r5, r6, r7, lr}		; CHECK-NEXT: vcvt.u32.f32 q0, q0
; CHECK-NEXT: push {r4, r5, r6, r7, lr}		; CHECK-NEXT: bx lr
; CHECK-NEXT: .pad #4
; CHECK-NEXT: sub sp, #4
; CHECK-NEXT: .vsave {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: vpush {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: vmov q4, q0
; CHECK-NEXT: vmov r0, r4, d9
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: mov r5, r0
; CHECK-NEXT: mov r0, r4
; CHECK-NEXT: mov r6, r1
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: subs.w r2, r5, #-1
; CHECK-NEXT: vmov q0[2], q0[0], r5, r0
; CHECK-NEXT: sbcs r2, r6, #0
; CHECK-NEXT: mov.w r3, #0
; CHECK-NEXT: csetm r2, lo
; CHECK-NEXT: subs.w r0, r0, #-1
; CHECK-NEXT: sbcs r0, r1, #0
; CHECK-NEXT: bfi r3, r2, #0, #8
; CHECK-NEXT: csetm r0, lo
; CHECK-NEXT: vmov.i64 q5, #0xffffffff
; CHECK-NEXT: bfi r3, r0, #8, #8
; CHECK-NEXT: vmov r0, r4, d8
; CHECK-NEXT: vmov q0[3], q0[1], r6, r1
; CHECK-NEXT: vmsr p0, r3
; CHECK-NEXT: movs r7, #0
; CHECK-NEXT: vpsel q6, q0, q5
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: mov r5, r0
; CHECK-NEXT: mov r0, r4
; CHECK-NEXT: mov r6, r1
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: subs.w r2, r5, #-1
; CHECK-NEXT: vmov q0[2], q0[0], r5, r0
; CHECK-NEXT: sbcs r2, r6, #0
; CHECK-NEXT: vmov q0[3], q0[1], r6, r1
; CHECK-NEXT: csetm r2, lo
; CHECK-NEXT: subs.w r0, r0, #-1
; CHECK-NEXT: sbcs r0, r1, #0
; CHECK-NEXT: bfi r7, r2, #0, #8
; CHECK-NEXT: csetm r0, lo
; CHECK-NEXT: bfi r7, r0, #8, #8
; CHECK-NEXT: vmsr p0, r7
; CHECK-NEXT: vpsel q0, q0, q5
; CHECK-NEXT: vmov.f32 s1, s2
; CHECK-NEXT: vmov.f32 s2, s24
; CHECK-NEXT: vmov.f32 s3, s26
; CHECK-NEXT: vpop {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: add sp, #4
; CHECK-NEXT: pop {r4, r5, r6, r7, pc}
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

▲ Show 20 Lines • Show All 1,145 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define arm_aapcs_vfpcc <4 x i32> @utest_f32i32_mm(<4 x float> %x) {		define arm_aapcs_vfpcc <4 x i32> @utest_f32i32_mm(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32_mm:		; CHECK-LABEL: utest_f32i32_mm:
; CHECK: @ %bb.0: @ %entry		; CHECK: @ %bb.0: @ %entry
; CHECK-NEXT: .save {r4, r5, r6, r7, lr}		; CHECK-NEXT: vcvt.u32.f32 q0, q0
; CHECK-NEXT: push {r4, r5, r6, r7, lr}		; CHECK-NEXT: bx lr
; CHECK-NEXT: .pad #4
; CHECK-NEXT: sub sp, #4
; CHECK-NEXT: .vsave {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: vpush {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: vmov q4, q0
; CHECK-NEXT: vmov r0, r4, d9
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: mov r5, r0
; CHECK-NEXT: mov r0, r4
; CHECK-NEXT: mov r6, r1
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: subs.w r2, r5, #-1
; CHECK-NEXT: vmov q0[2], q0[0], r5, r0
; CHECK-NEXT: sbcs r2, r6, #0
; CHECK-NEXT: mov.w r3, #0
; CHECK-NEXT: csetm r2, lo
; CHECK-NEXT: subs.w r0, r0, #-1
; CHECK-NEXT: sbcs r0, r1, #0
; CHECK-NEXT: bfi r3, r2, #0, #8
; CHECK-NEXT: csetm r0, lo
; CHECK-NEXT: vmov.i64 q5, #0xffffffff
; CHECK-NEXT: bfi r3, r0, #8, #8
; CHECK-NEXT: vmov r0, r4, d8
; CHECK-NEXT: vmov q0[3], q0[1], r6, r1
; CHECK-NEXT: vmsr p0, r3
; CHECK-NEXT: movs r7, #0
; CHECK-NEXT: vpsel q6, q0, q5
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: mov r5, r0
; CHECK-NEXT: mov r0, r4
; CHECK-NEXT: mov r6, r1
; CHECK-NEXT: bl __aeabi_f2ulz
; CHECK-NEXT: subs.w r2, r5, #-1
; CHECK-NEXT: vmov q0[2], q0[0], r5, r0
; CHECK-NEXT: sbcs r2, r6, #0
; CHECK-NEXT: vmov q0[3], q0[1], r6, r1
; CHECK-NEXT: csetm r2, lo
; CHECK-NEXT: subs.w r0, r0, #-1
; CHECK-NEXT: sbcs r0, r1, #0
; CHECK-NEXT: bfi r7, r2, #0, #8
; CHECK-NEXT: csetm r0, lo
; CHECK-NEXT: bfi r7, r0, #8, #8
; CHECK-NEXT: vmsr p0, r7
; CHECK-NEXT: vpsel q0, q0, q5
; CHECK-NEXT: vmov.f32 s1, s2
; CHECK-NEXT: vmov.f32 s2, s24
; CHECK-NEXT: vmov.f32 s3, s26
; CHECK-NEXT: vpop {d8, d9, d10, d11, d12, d13}
; CHECK-NEXT: add sp, #4
; CHECK-NEXT: pop {r4, r5, r6, r7, pc}
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)		%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define arm_aapcs_vfpcc <4 x i32> @ustest_f32i32_mm(<4 x float> %x) {		define arm_aapcs_vfpcc <4 x i32> @ustest_f32i32_mm(<4 x float> %x) {
▲ Show 20 Lines • Show All 1,230 Lines • Show Last 20 Lines

llvm/test/CodeGen/WebAssembly/fpclamptosat.ll

Show All 17 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32(double %x) {		define i32 @utest_f64i32(double %x) {
; CHECK-LABEL: utest_f64i32:		; CHECK-LABEL: utest_f64i32:
; CHECK: .functype utest_f64i32 (f64) -> (i32)		; CHECK: .functype utest_f64i32 (f64) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: i64.trunc_sat_f64_u		; CHECK-NEXT: i32.trunc_sat_f64_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui double %x to i64		%conv = fptoui double %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
Show All 30 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32(float %x) {		define i32 @utest_f32i32(float %x) {
; CHECK-LABEL: utest_f32i32:		; CHECK-LABEL: utest_f32i32:
; CHECK: .functype utest_f32i32 (f32) -> (i32)		; CHECK: .functype utest_f32i32 (f32) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui float %x to i64		%conv = fptoui float %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
Show All 32 Lines	entry:
%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648		%spec.store.select7 = select i1 %1, i64 %spec.store.select, i64 -2147483648
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utesth_f16i32(half %x) {		define i32 @utesth_f16i32(half %x) {
; CHECK-LABEL: utesth_f16i32:		; CHECK-LABEL: utesth_f16i32:
; CHECK: .functype utesth_f16i32 (f32) -> (i32)		; CHECK: .functype utesth_f16i32 (f32) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui half %x to i64		%conv = fptoui half %x to i64
%0 = icmp ult i64 %conv, 4294967295		%0 = icmp ult i64 %conv, 4294967295
%spec.store.select = select i1 %0, i64 %conv, i64 4294967295		%spec.store.select = select i1 %0, i64 %conv, i64 4294967295
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}
▲ Show 20 Lines • Show All 642 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f64i32_mm(double %x) {		define i32 @utest_f64i32_mm(double %x) {
; CHECK-LABEL: utest_f64i32_mm:		; CHECK-LABEL: utest_f64i32_mm:
; CHECK: .functype utest_f64i32_mm (f64) -> (i32)		; CHECK: .functype utest_f64i32_mm (f64) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: i64.trunc_sat_f64_u		; CHECK-NEXT: i32.trunc_sat_f64_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui double %x to i64		%conv = fptoui double %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

Show All 25 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utest_f32i32_mm(float %x) {		define i32 @utest_f32i32_mm(float %x) {
; CHECK-LABEL: utest_f32i32_mm:		; CHECK-LABEL: utest_f32i32_mm:
; CHECK: .functype utest_f32i32_mm (f32) -> (i32)		; CHECK: .functype utest_f32i32_mm (f32) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui float %x to i64		%conv = fptoui float %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

Show All 27 Lines	entry:
%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)		%spec.store.select7 = call i64 @llvm.smax.i64(i64 %spec.store.select, i64 -2147483648)
%conv6 = trunc i64 %spec.store.select7 to i32		%conv6 = trunc i64 %spec.store.select7 to i32
ret i32 %conv6		ret i32 %conv6
}		}

define i32 @utesth_f16i32_mm(half %x) {		define i32 @utesth_f16i32_mm(half %x) {
; CHECK-LABEL: utesth_f16i32_mm:		; CHECK-LABEL: utesth_f16i32_mm:
; CHECK: .functype utesth_f16i32_mm (f32) -> (i32)		; CHECK: .functype utesth_f16i32_mm (f32) -> (i32)
; CHECK-NEXT: .local i64
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i32.wrap_i64
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui half %x to i64		%conv = fptoui half %x to i64
%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)		%spec.store.select = call i64 @llvm.umin.i64(i64 %conv, i64 4294967295)
%conv6 = trunc i64 %spec.store.select to i32		%conv6 = trunc i64 %spec.store.select to i32
ret i32 %conv6		ret i32 %conv6
}		}

▲ Show 20 Lines • Show All 627 Lines • Show Last 20 Lines

llvm/test/CodeGen/WebAssembly/fpclamptosat_vec.ll

Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utest_f32i32(<4 x float> %x) {		define <4 x i32> @utest_f32i32(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32:		; CHECK-LABEL: utest_f32i32:
; CHECK: .functype utest_f32i32 (v128) -> (v128)		; CHECK: .functype utest_f32i32 (v128) -> (v128)
; CHECK-NEXT: .local i64, i64, v128
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 0		; CHECK-NEXT: i32x4.trunc_sat_f32x4_u
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 1
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 2
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.const 4294967295, 4294967295
; CHECK-NEXT: local.tee 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 2
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 3
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 2
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: local.get 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: i8x16.shuffle 0, 1, 2, 3, 8, 9, 10, 11, 16, 17, 18, 19, 24, 25, 26, 27
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>		%spec.store.select7 = select <4 x i1> %1, <4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utesth_f16i32(<4 x half> %x) {		define <4 x i32> @utesth_f16i32(<4 x half> %x) {
; CHECK-LABEL: utesth_f16i32:		; CHECK-LABEL: utesth_f16i32:
; CHECK: .functype utesth_f16i32 (f32, f32, f32, f32) -> (v128)		; CHECK: .functype utesth_f16i32 (f32, f32, f32, f32) -> (v128)
; CHECK-NEXT: .local i64, i64, v128
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 3
; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 3
; CHECK-NEXT: local.get 2
; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 2
; CHECK-NEXT: local.get 1		; CHECK-NEXT: local.get 1
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 1		; CHECK-NEXT: local.set 1
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 4		; CHECK-NEXT: i32x4.splat
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 1		; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 5		; CHECK-NEXT: i32x4.replace_lane 1
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.const 4294967295, 4294967295
; CHECK-NEXT: local.tee 6
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 4
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 5
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: local.get 2		; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: local.tee 4		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64x2.splat		; CHECK-NEXT: i32.trunc_sat_f32_u
		; CHECK-NEXT: i32x4.replace_lane 2
; CHECK-NEXT: local.get 3		; CHECK-NEXT: local.get 3
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: local.tee 5		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64x2.replace_lane 1		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.get 6		; CHECK-NEXT: i32x4.replace_lane 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 4
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 5
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: i8x16.shuffle 0, 1, 2, 3, 8, 9, 10, 11, 16, 17, 18, 19, 24, 25, 26, 27
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui <4 x half> %x to <4 x i64>		%conv = fptoui <4 x half> %x to <4 x i64>
%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%0 = icmp ult <4 x i64> %conv, <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>		%spec.store.select = select <4 x i1> %0, <4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}
▲ Show 20 Lines • Show All 1,393 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utest_f32i32_mm(<4 x float> %x) {		define <4 x i32> @utest_f32i32_mm(<4 x float> %x) {
; CHECK-LABEL: utest_f32i32_mm:		; CHECK-LABEL: utest_f32i32_mm:
; CHECK: .functype utest_f32i32_mm (v128) -> (v128)		; CHECK: .functype utest_f32i32_mm (v128) -> (v128)
; CHECK-NEXT: .local i64, i64, v128
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 0		; CHECK-NEXT: i32x4.trunc_sat_f32x4_u
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 1
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 2
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.const 4294967295, 4294967295
; CHECK-NEXT: local.tee 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 2
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 1
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 0
; CHECK-NEXT: f32x4.extract_lane 3
; CHECK-NEXT: i64.trunc_sat_f32_u
; CHECK-NEXT: local.tee 2
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: local.get 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: i8x16.shuffle 0, 1, 2, 3, 8, 9, 10, 11, 16, 17, 18, 19, 24, 25, 26, 27
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui <4 x float> %x to <4 x i64>		%conv = fptoui <4 x float> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)		%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	entry:
%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)		%spec.store.select7 = call <4 x i64> @llvm.smax.v4i64(<4 x i64> %spec.store.select, <4 x i64> <i64 -2147483648, i64 -2147483648, i64 -2147483648, i64 -2147483648>)
%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select7 to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

define <4 x i32> @utesth_f16i32_mm(<4 x half> %x) {		define <4 x i32> @utesth_f16i32_mm(<4 x half> %x) {
; CHECK-LABEL: utesth_f16i32_mm:		; CHECK-LABEL: utesth_f16i32_mm:
; CHECK: .functype utesth_f16i32_mm (f32, f32, f32, f32) -> (v128)		; CHECK: .functype utesth_f16i32_mm (f32, f32, f32, f32) -> (v128)
; CHECK-NEXT: .local i64, i64, v128
; CHECK-NEXT: # %bb.0: # %entry		; CHECK-NEXT: # %bb.0: # %entry
; CHECK-NEXT: local.get 3
; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 3
; CHECK-NEXT: local.get 2
; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 2
; CHECK-NEXT: local.get 1		; CHECK-NEXT: local.get 1
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: local.set 1		; CHECK-NEXT: local.set 1
; CHECK-NEXT: local.get 0		; CHECK-NEXT: local.get 0
; CHECK-NEXT: call __truncsfhf2		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: call __extendhfsf2		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 4		; CHECK-NEXT: i32x4.splat
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: local.get 1		; CHECK-NEXT: local.get 1
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.tee 5		; CHECK-NEXT: i32x4.replace_lane 1
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.const 4294967295, 4294967295
; CHECK-NEXT: local.tee 6
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 4
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 5
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: local.get 2		; CHECK-NEXT: local.get 2
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: local.tee 4		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64x2.splat		; CHECK-NEXT: i32.trunc_sat_f32_u
		; CHECK-NEXT: i32x4.replace_lane 2
; CHECK-NEXT: local.get 3		; CHECK-NEXT: local.get 3
; CHECK-NEXT: i64.trunc_sat_f32_u		; CHECK-NEXT: call __truncsfhf2
; CHECK-NEXT: local.tee 5		; CHECK-NEXT: call __extendhfsf2
; CHECK-NEXT: i64x2.replace_lane 1		; CHECK-NEXT: i32.trunc_sat_f32_u
; CHECK-NEXT: local.get 6		; CHECK-NEXT: i32x4.replace_lane 3
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 4
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.splat
; CHECK-NEXT: i64.const -1
; CHECK-NEXT: i64.const 0
; CHECK-NEXT: local.get 5
; CHECK-NEXT: i64.const 4294967295
; CHECK-NEXT: i64.lt_u
; CHECK-NEXT: i64.select
; CHECK-NEXT: i64x2.replace_lane 1
; CHECK-NEXT: v128.bitselect
; CHECK-NEXT: i8x16.shuffle 0, 1, 2, 3, 8, 9, 10, 11, 16, 17, 18, 19, 24, 25, 26, 27
; CHECK-NEXT: # fallthrough-return		; CHECK-NEXT: # fallthrough-return
entry:		entry:
%conv = fptoui <4 x half> %x to <4 x i64>		%conv = fptoui <4 x half> %x to <4 x i64>
%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)		%spec.store.select = call <4 x i64> @llvm.umin.v4i64(<4 x i64> %conv, <4 x i64> <i64 4294967295, i64 4294967295, i64 4294967295, i64 4294967295>)
%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>		%conv6 = trunc <4 x i64> %spec.store.select to <4 x i32>
ret <4 x i32> %conv6		ret <4 x i32> %conv6
}		}

▲ Show 20 Lines • Show All 1,331 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[DAG] Create fptoui.sat from clamped fptouiClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 403159

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

llvm/test/CodeGen/AArch64/fpclamptosat.ll

llvm/test/CodeGen/AArch64/fpclamptosat_vec.ll

llvm/test/CodeGen/ARM/fpclamptosat.ll

llvm/test/CodeGen/RISCV/fpclamptosat.ll

llvm/test/CodeGen/Thumb2/mve-fpclamptosat_vec.ll

llvm/test/CodeGen/WebAssembly/fpclamptosat.ll

llvm/test/CodeGen/WebAssembly/fpclamptosat_vec.ll

[DAG] Create fptoui.sat from clamped fptoui
ClosedPublic