This is an archive of the discontinued LLVM Phabricator instance.

Was there an RFC for this?
While i agree it likely makes sense to have these for consistency,
i'm not sure why they are *needed* for implementing the Embedded-C fixed point support in Clang.

lebedev.ri added inline comments. Jul 6 2020, 1:25 PM

llvm/lib/IR/Verifier.cpp
5006–5009	I don't think it makes sense to limit these to scalars.

nikic added a subscriber: nikic. Jul 6 2020, 1:39 PM

In D83216#2134130, @lebedev.ri wrote:

Was there an RFC for this?

No explicit RFC for these particular intrinsics, but they have been mentioned in the larger scope of fixed-point support:

I notice now that Leonard's mail does not mention saturated shifts, but my older one does.

While i agree it likely makes sense to have these for consistency,
i'm not sure why they are *needed* for implementing the Embedded-C fixed point support in Clang.

Yes, "needed" might be a stronger wording than necessary. I originally wrote "useful" but was concerned it wasn't strong enough.
Of course, they aren't needed per se, but it becomes more of a hassle to select instructions for the operations if there are no intrinsics.

llvm/lib/IR/Verifier.cpp
5006–5009	The add.sat and sub.sat intrinsics were given vector operands because they were useful for some of x86's vector instructions. I couldn't see any such operations for shifts, but I can add the vector type support for consistency.

Add vector support and TD isel nodes.

Thanks. This looks good to me in principle.

Alive2 support for these intrinsics: https://github.com/AliveToolkit/alive2/pull/448

llvm/include/llvm/CodeGen/ISDOpcodes.h
313–314	I'm not sure what `left shift on 2 integers` means. Perhaps this needs some rewording.
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
818	`Op2Promoted = ZExtPromotedInteger(Op2);`
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
7402–7407	Have you checked if naive `x != ((x << y) u/s>> y)` results in worse lowering?
7419–7420	Why not just change predicate to `ISD::SETUGE`?
llvm/test/CodeGen/X86/sshl_sat.ll
10	Add `i32` test while at it?

Addressed review comments.

ebevhan marked 5 inline comments as done. Jul 8 2020, 4:51 AM

ebevhan added inline comments.

llvm/include/llvm/CodeGen/ISDOpcodes.h
313–314	I lifted it from the other node descriptions, but it doesn't really make sense for shift. I changed it a bit.
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
7402–7407	The CTLZ approach was the one that popped into my head first, so I went with that. But it does turn out that yours works a bit better, at least for sshl.sat, so I swapped it out.
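
The shift-back check discussed above can be sketched in plain C++. This is only an illustration of the idea `x != ((x << y) u/s>> y)`, not the code from the patch: the function names `ushl_sat8` and `sshl_sat8` and the fixed 8-bit width are assumptions made for the example, and an out-of-range shift amount (poison in the IR) is simply asserted against.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical helper: unsigned saturating left shift on 8 bits.
// Shift amounts >= 8 are poison in the IR; here they are rejected.
uint8_t ushl_sat8(uint8_t x, unsigned shift) {
  assert(shift < 8 && "out-of-range shift amount");
  uint8_t shifted = static_cast<uint8_t>(x << shift);
  // Logical shift back: a mismatch means set bits were shifted out.
  uint8_t restored = static_cast<uint8_t>(shifted >> shift);
  return restored != x ? std::numeric_limits<uint8_t>::max() : shifted;
}

// Hypothetical helper: signed saturating left shift on 8 bits.
int8_t sshl_sat8(int8_t x, unsigned shift) {
  assert(shift < 8 && "out-of-range shift amount");
  // Shift the unsigned bit pattern to avoid signed-overflow pitfalls.
  uint8_t bits = static_cast<uint8_t>(static_cast<uint8_t>(x) << shift);
  int8_t shifted = static_cast<int8_t>(bits);
  // Arithmetic shift back (assumes >> on a negative value is arithmetic,
  // which C++20 guarantees and mainstream compilers implement).
  int8_t restored = static_cast<int8_t>(shifted >> shift);
  if (restored == x)
    return shifted;
  // Overflow: saturate towards the sign of the original value.
  return x < 0 ? std::numeric_limits<int8_t>::min()
               : std::numeric_limits<int8_t>::max();
}
```

The signed variant picks the saturation value from the sign of the unshifted operand, since the sign of the shifted result is meaningless once overflow has happened.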

Harbormaster failed remote builds in B63392: Diff 276377! Jul 8 2020, 5:33 AM

Some more thoughts.

llvm/docs/LangRef.rst
14551	This should be

'``llvm.sshl.sat.*``' Intrinsics
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
14570	Here and elsewhere: i strongly suspect it should be `s/saturation/saturating/`
14600	same
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
808	Assert that we only ever get `ISD::USHLSAT`/`ISD::SSHLSAT` ?
817	Actually, why do we need to signext, or even zeroext it? As the comment before the function notes, we want anyext: we don't care about those new high bits, because we are immediately going to shift them out.

----------------------------------------
Name: promote ushl
  %r = ushl_sat i8 %x, %y
  ret i8 %r
=>
  %x_wide = zext i8 %x to i16
  %y_wide = zext i8 %y to i16
  %t0 = shl i16 %x_wide, 8
  %t1 = ushl_sat i16 %t0, %y_wide
  %t2 = lshr i16 %t1, 8
  %r = trunc i16 %t2 to i8
  ret i8 %r

Done: 1
Transformation seems to be correct!

----------------------------------------
Name: promote sshl
  %r = sshl_sat i8 %x, %y
  ret i8 %r
=>
  %x_wide = zext i8 %x to i16
  %y_wide = zext i8 %y to i16
  %t0 = shl i16 %x_wide, 8
  %t1 = sshl_sat i16 %t0, %y_wide
  %t2 = ashr i16 %t1, 8
  %r = trunc i16 %t2 to i8
  ret i8 %r

Done: 1
Transformation seems to be correct!

So i think you want

  SDValue Op1Promoted = GetPromotedInteger(Op1);
  SDValue Op2Promoted = GetPromotedInteger(Op2);
  unsigned ShiftOp = Opcode == ISD::USHLSAT ? ISD::SRL : ISD::SRA;

and maybe get rid of `ShiftOp` variable, or sink it closer to use.
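
The promotion scheme from the Alive2 proofs above can be modelled in C++ and checked exhaustively for i8 promoted to i16. This is a sketch under stated assumptions, not the legalizer code: all function names are hypothetical, the garbage constant `0xA500` stands in for the unspecified high bits of an any-extended value, and only in-range shift amounts (0 to 7) are checked because larger ones are poison.

```cpp
#include <cassert>
#include <cstdint>

// 16-bit saturating shifts, standing in for the legal wide operation.
uint16_t ushl_sat16(uint16_t x, unsigned s) {
  uint16_t shifted = static_cast<uint16_t>(static_cast<uint32_t>(x) << s);
  return static_cast<uint16_t>(shifted >> s) != x ? 0xFFFF : shifted;
}

int16_t sshl_sat16(int16_t x, unsigned s) {
  uint16_t bits = static_cast<uint16_t>(
      static_cast<uint32_t>(static_cast<uint16_t>(x)) << s);
  int16_t shifted = static_cast<int16_t>(bits);
  // Arithmetic shift back (arithmetic >> on negatives is assumed).
  if (static_cast<int16_t>(shifted >> s) == x)
    return shifted;
  return x < 0 ? INT16_MIN : INT16_MAX;
}

// Promoted forms: any-extend the value, zero-extend the shift amount,
// move the value into the high byte, saturate there, then shift back.
uint8_t promoted_ushl_sat8(uint8_t x, uint8_t y) {
  uint16_t x_wide = static_cast<uint16_t>(0xA500u | x); // garbage high bits
  unsigned y_wide = y;                                  // zero-extended
  uint16_t t0 = static_cast<uint16_t>(static_cast<uint32_t>(x_wide) << 8);
  uint16_t t1 = ushl_sat16(t0, y_wide);
  return static_cast<uint8_t>(t1 >> 8);                 // lshr + trunc
}

int8_t promoted_sshl_sat8(int8_t x, uint8_t y) {
  uint16_t x_wide =
      static_cast<uint16_t>(0xA500u | static_cast<uint8_t>(x));
  unsigned y_wide = y;
  int16_t t0 = static_cast<int16_t>(
      static_cast<uint16_t>(static_cast<uint32_t>(x_wide) << 8));
  int16_t t1 = sshl_sat16(t0, y_wide);
  return static_cast<int8_t>(t1 >> 8);                  // ashr + trunc
}

// Reference semantics: exact result in a wider type, then clamp.
uint8_t ref_ushl_sat8(uint8_t x, unsigned y) {
  uint32_t exact = static_cast<uint32_t>(x) << y;
  return exact > 255 ? 255 : static_cast<uint8_t>(exact);
}

int8_t ref_sshl_sat8(int8_t x, unsigned y) {
  int32_t exact = static_cast<int32_t>(x) * (int32_t{1} << y);
  if (exact > 127) return 127;
  if (exact < -128) return -128;
  return static_cast<int8_t>(exact);
}

// Exhaustively compare the promoted forms against the reference.
bool promotion_matches_reference() {
  for (int v = 0; v < 256; ++v) {
    for (unsigned y = 0; y < 8; ++y) {
      uint8_t ux = static_cast<uint8_t>(v);
      int8_t sx = static_cast<int8_t>(ux);
      if (promoted_ushl_sat8(ux, static_cast<uint8_t>(y)) !=
          ref_ushl_sat8(ux, y))
        return false;
      if (promoted_sshl_sat8(sx, static_cast<uint8_t>(y)) !=
          ref_sshl_sat8(sx, y))
        return false;
    }
  }
  return true;
}
```

Because the value is shifted into the high byte before the wide saturating shift, whatever the extension put into the high bits never reaches the result. The shift amount, by contrast, is used as a number, which is why it needs a well-defined extension.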

Addressed review comments.

ebevhan marked 5 inline comments as done. Jul 8 2020, 6:46 AM

ebevhan added inline comments.

llvm/docs/LangRef.rst
14551	I made the change to the rest of the saturating/fixedpoint intrinsics as well.
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
817	Ah, that's true. I grabbed it from the ADDSUBSAT promotion without thinking. That needs the proper extension due to the min/max expansion, I think.

Patch as-is looks good but i'm not sure what's the RFC status here.
If these new intrinsics were already previously proposed as part of some RFC that got accepted,
can you state that in the patch's description? (with link to the thread)

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
817	Err, i was half-right i think. I guess we actually need to zext the shift amount.

Fixed review comment and updated summary.

In D83216#2139098, @ebevhan wrote:

Fixed review comment and updated summary.

(note that updating commit msg does not automatically update description in phab differential)

ebevhan edited the summary of this revision. Jul 8 2020, 7:41 AM

In D83216#2138974, @lebedev.ri wrote:

Patch as-is looks good but i'm not sure what's the RFC status here.
If these new intrinsics were already previously proposed as part of some RFC that got accepted,
can you state that in the patch's description? (with link to the thread)

I added the links to the threads I mentioned earlier.

Looking back at the full discussion, it doesn't really seem like any real consensus regarding how to implement the types was reached, but the prevailing view was that an altogether new IR type was the best approach. I don't think either I or Leonard thought that was the right (or fastest, at least) way to go, though.

The final listing of intrinsics was in http://lists.llvm.org/pipermail/llvm-dev/2018-September/126311.html but the design has diverged a bit from that since then.

In D83216#2139104, @lebedev.ri wrote:

In D83216#2139098, @ebevhan wrote:

Fixed review comment and updated summary.

(note that updating commit msg does not automatically update description in phab differential)

I noticed! I was looking for an arc option to do that but couldn't seem to find one.

In D83216#2139130, @ebevhan wrote:

In D83216#2138974, @lebedev.ri wrote:

Patch as-is looks good but i'm not sure what's the RFC status here.
If these new intrinsics were already previously proposed as part of some RFC that got accepted,
can you state that in the patch's description? (with link to the thread)

I added the links to the threads I mentioned earlier.

Looking back at the full discussion, it doesn't really seem like any real consensus regarding how to implement the types was reached, but the prevailing view was that an altogether new IR type was the best approach. I don't think either I or Leonard thought that was the right (or fastest, at least) way to go, though.

The final listing of intrinsics was in http://lists.llvm.org/pipermail/llvm-dev/2018-September/126311.html but the design has diverged a bit from that since then.

i see.

I think this is fine, but just to be safe, may i suggest to do an RFC for these two intrinsics specifically,
just so we're 100% sure everyone is on the same page about them?

In D83216#2139104, @lebedev.ri wrote:

In D83216#2139098, @ebevhan wrote:

Fixed review comment and updated summary.

(note that updating commit msg does not automatically update description in phab differential)

I noticed! I was looking for an arc option to do that but couldn't seem to find one.

Sorry, it's just a repeating issue in many reviews :/

Harbormaster completed remote builds in B63406: Diff 276410. Jul 8 2020, 8:02 AM

In D83216#2139135, @lebedev.ri wrote:

i see.

I think this is fine, but just to be safe, may i suggest to do an RFC for these two intrinsics specifically,
just so we're 100% sure everyone is on the same page about them?

Sure, I'll send one out.

Harbormaster completed remote builds in B63417: Diff 276432. Jul 8 2020, 8:33 AM

efriedma added a subscriber: efriedma. Jul 8 2020, 11:39 AM

efriedma added inline comments.

llvm/docs/LangRef.rst
14577	Not sure what "must be" means in this context; the shift amount is a variable, so we can't enforce anything about it statically. Is it poison? Or undefined behavior? Or does the shift clamp to the min/max value?

lebedev.ri added inline comments. Jul 8 2020, 11:45 AM

llvm/docs/LangRef.rst

14577	Right, we should spell that out.
IMO this should be consistent with normal shifts, i.e. poison:

If ``b`` is (statically or
dynamically) equal to or larger than the number of bits in
``a``, this instruction returns a :ref:`poison value <poisonvalues>`.
If the arguments are vectors, each vector element of ``a`` is shifted
by the corresponding shift amount in ``b``.

Addressed review comments.

ebevhan marked an inline comment as done. Jul 9 2020, 3:10 AM

Harbormaster completed remote builds in B63561: Diff 276683. Jul 9 2020, 4:00 AM

Any more on this?

Can you also add the GlobalISel instructions etc.

llvm/docs/LangRef.rst
14767	Unrelated changes
15102	Separate change

arsenm added inline comments. Jul 15 2020, 5:48 PM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
795	This code is exactly the same as for ADD/SUB sat and should be shared

ebevhan added inline comments. Jul 16 2020, 4:36 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
795	The code is not exactly the same, but some of it is. I could try factoring it in. I find that code that does a lot is a bit harder to read, though.

In D83216#2152941, @ebevhan wrote:

Any more on this?

Still looks good to me, but i'd like to see some (positive?) feedback in the actual RFC llvm-dev thread from the usual suspects (@lattner @spatel @nikic)

llvm/docs/LangRef.rst
14767	Please just commit these changes to already-existing lines directly.
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
795	While i'm usually against duplication, i'm not very sure that in this case it is better to deduplicate.

Added GlobalISel opcodes. I'm not that familiar with GISEL, though, so I've probably not done it completely right.

Herald added subscribers: kerbowa, nhaehnle, jvesely. Jul 16 2020, 8:23 AM

ebevhan marked 3 inline comments as done. Jul 16 2020, 8:55 AM

ebevhan added inline comments.

llvm/docs/LangRef.rst
14767	I factored out the other fixes into a separate commit.
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
795	I deduplicated anyway. I don't think it ended up too bad.

Harbormaster failed remote builds in B64529: Diff 278494! Jul 16 2020, 9:06 AM

LG apart from the GlobalISel part, which I know nothing about. @arsenm Could you please review the GlobalISel portion?

This revision is now accepted and ready to land. Jul 19 2020, 7:02 AM

Herald added a subscriber: wdng. Jul 19 2020, 7:02 AM

arsenm added inline comments. Jul 20 2020, 9:43 AM

llvm/include/llvm/Target/GenericOpcodes.td
551	The shift amount type doesn't have to match the shift value type, so src2 should use type1
559	Ditto
llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir
18	These all failed to legalize. You need something like this in AMDGPULegalizerInfo

  getActionDefinitionsBuilder({G_SSHLSAT, G_USHLSAT})
    .scalarize(0)
    .clampScalar(0, S32)
    .lower();

to actually trigger any of the legalization code

ebevhan added inline comments. Aug 3 2020, 7:12 AM

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir
18	Okay, I thought something was off. Does this mean that any target that wants these legalized with the default legalization needs to specify this explicitly?

arsenm added inline comments. Aug 3 2020, 7:40 AM

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir
18	Yes, nearly everything is opt-in in GlobalISel

Rebased; added more GlobalISel legalization.

Harbormaster completed remote builds in B66896: Diff 282833. Aug 4 2020, 2:36 AM

GlobalISel parts LGTM

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
5930	Extra blank line here

Still no replies to RFC thread :/

llvm/test/CodeGen/X86/sshl_sat.ll
12	It is best to put vector tests into a separate file from the get-go. See e.g. `uadd_sat_vec.ll`
llvm/test/CodeGen/X86/ushl_sat.ll
8	It seems i8 is the only "basic" bit width missing. It would probably be good to have it.

Added i8 test and moved vector test to separate file.

LGTM, but i would have preferred to see more feedback on the RFC thread.
That being said, unless you are planning on forming calls to these intrinsics in middle-end transform passes
(i.e. only planning on using them in clang codegen), i think this is okay to proceed.

In D83216#2196028, @lebedev.ri wrote:

LGTM, but i would have preferred to see more feedback on the RFC thread.
That being said, unless you are planning on forming calls to these intrinsics in middle-end transform passes
(i.e. only planning on using them in clang codegen), i think this is okay to proceed.

At least for now, I'm not planning to emit them in middleend. I wanted to get most of the basic support in Clang and LLVM before looking at optimization.

However, based on the discussions in D82663, I am planning on sending an RFC for moving the fixed-point semantics and value class to LLVM, and adding a Builder class similar to the MatrixBuilder.

Harbormaster completed remote builds in B67079: Diff 283200. Aug 5 2020, 5:27 AM

ebevhan added a child revision: D85314: [IR] Add FixedPointBuilder. Aug 5 2020, 8:04 AM

In D83216#2196048, @ebevhan wrote:

In D83216#2196028, @lebedev.ri wrote:

LGTM, but i would have preferred to see more feedback on the RFC thread.
That being said, unless you are planning on forming calls to these intrinsics in middle-end transform passes
(i.e. only planning on using them in clang codegen), i think this is okay to proceed.

At least for now, I'm not planning to emit them in middleend. I wanted to get most of the basic support in Clang and LLVM before looking at optimization.

However, based on the discussions in D82663, I am planning on sending an RFC for moving the fixed-point semantics and value class to LLVM, and adding a Builder class similar to the MatrixBuilder.

I think middle-end rewrites such as constant folding, or rewriting (sshl.sat (sshl.sat X, Y), Z) -> (sshl.sat X, Y+Z) is OK.
As Roman said, we should probably think twice before we start rewriting arbitrary things into using these intrinsics (such as rewriting a saturated multiply into a saturated shift).

Still, we probably want to have a canonical form for things like "saturated multiplication by 2" in the future. I think that a lot of these saturated left shifts can be done using a saturated multiplication. Similarly saturated right shifts can often be described using saturated division. At least when the shift count is known. For unknown shift counts these intrinsics will make it much easier for a backend target that has such shifts in the instruction set to do the right thing.
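
The relationship between a saturated left shift by a known count and a saturated multiplication can be illustrated with a small 8-bit model. This is a sketch of the arithmetic only, with hypothetical helper names. It uses the clamp-the-exact-result reading of the semantics and is not a proposed canonicalization. One assumption is worth stating: for the signed case the shift count is limited to W-2 (here 6), because 2^(W-1) cannot be represented as a positive signed multiplier.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical 8-bit models: compute the exact result in 32 bits, then clamp.
uint8_t ushl_sat8(uint8_t x, unsigned k) {
  uint32_t exact = static_cast<uint32_t>(x) << k;
  return static_cast<uint8_t>(std::min<uint32_t>(exact, 255u));
}

uint8_t umul_sat8(uint8_t a, uint8_t b) {
  uint32_t exact = static_cast<uint32_t>(a) * b;
  return static_cast<uint8_t>(std::min<uint32_t>(exact, 255u));
}

int8_t sshl_sat8(int8_t x, unsigned k) {
  int32_t exact = static_cast<int32_t>(x) * (int32_t{1} << k);
  return static_cast<int8_t>(
      std::max<int32_t>(-128, std::min<int32_t>(exact, 127)));
}

int8_t smul_sat8(int8_t a, int8_t b) {
  int32_t exact = static_cast<int32_t>(a) * b;
  return static_cast<int8_t>(
      std::max<int32_t>(-128, std::min<int32_t>(exact, 127)));
}

// Check shl.sat(x, k) == mul.sat(x, 1 << k) for every representable 1 << k.
bool shift_matches_multiply() {
  for (int v = 0; v < 256; ++v) {
    uint8_t ux = static_cast<uint8_t>(v);
    for (unsigned k = 0; k < 8; ++k)  // unsigned: 1 << k fits for k < 8
      if (ushl_sat8(ux, k) != umul_sat8(ux, static_cast<uint8_t>(1u << k)))
        return false;
    int8_t sx = static_cast<int8_t>(v - 128);
    for (unsigned k = 0; k < 7; ++k)  // signed: 1 << k fits only for k < 7
      if (sshl_sat8(sx, k) != smul_sat8(sx, static_cast<int8_t>(1 << k)))
        return false;
  }
  return true;
}
```

The excluded signed case (a shift by W-1) is one place where the shift form cannot be expressed as a multiplication by a power of two in the same type.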

This revision was landed with ongoing or failed builds. Aug 7 2020, 6:10 AM

Closed by commit rG5de6c56f7e86: [Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics. (authored by ebevhan).

This revision was automatically updated to reflect the committed changes.

ebevhan added a commit: rG5de6c56f7e86: [Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics.

spatel mentioned this in D96904: [IR] restrict vector reduction intrinsic types. Feb 19 2021, 9:23 AM

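Revision Contents

Path	Size
llvm/docs/GlobalISel/GenericOpcode.rst	6 lines
llvm/docs/LangRef.rst	102 lines
llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h	5 lines
llvm/include/llvm/CodeGen/ISDOpcodes.h	10 lines
llvm/include/llvm/CodeGen/TargetLowering.h	4 lines
llvm/include/llvm/IR/Intrinsics.td	6 lines
llvm/include/llvm/Support/TargetOpcodes.def	6 lines
llvm/include/llvm/Target/GenericOpcodes.td	16 lines
llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td	2 lines
llvm/include/llvm/Target/TargetSelectionDAG.td	2 lines
llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp	4 lines
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp	61 lines
llvm/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp	4 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp	8 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp	36 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h	3 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp	2 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp	6 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	12 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	2 lines
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp	35 lines
llvm/lib/CodeGen/TargetLoweringBase.cpp	2 lines
llvm/lib/IR/Verifier.cpp	12 lines
llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp	10 lines
llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir	6 lines
llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-sat.ll	158 lines
llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir	952 lines
llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ushlsat.mir	800 lines
llvm/test/CodeGen/X86/	400 lines
llvm/test/CodeGen/X86/	146 lines
llvm/test/CodeGen/X86/	342 lines
llvm/test/CodeGen/X86/	115 lines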

Diff 283889

llvm/docs/GlobalISel/GenericOpcode.rst

	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	These each perform their respective integer arithmetic on a scalar.			These each perform their respective integer arithmetic on a scalar.

	.. code-block:: none			.. code-block:: none

	%2:_(s32) = G_ADD %0:_(s32), %1:_(s32)			%2:_(s32) = G_ADD %0:_(s32), %1:_(s32)

	G_SADDSAT, G_UADDSAT, G_SSUBSAT, G_USUBSAT			G_SADDSAT, G_UADDSAT, G_SSUBSAT, G_USUBSAT, G_SSHLSAT, G_USHLSAT
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	Signed and unsigned addition and subtraction with saturation.			Signed and unsigned addition, subtraction and left shift with saturation.

	.. code-block:: none			.. code-block:: none

	%2:_(s32) = G_SADDSAT %0:_(s32), %1:_(s32)			%2:_(s32) = G_SADDSAT %0:_(s32), %1:_(s32)

	G_SHL, G_LSHR, G_ASHR			G_SHL, G_LSHR, G_ASHR
	^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^


llvm/docs/LangRef.rst

"""""""""		"""""""""

.. code-block:: llvm		.. code-block:: llvm

%res = call i4 @llvm.usub.sat.i4(i4 2, i4 1) ; %res = 1		%res = call i4 @llvm.usub.sat.i4(i4 2, i4 1) ; %res = 1
%res = call i4 @llvm.usub.sat.i4(i4 2, i4 6) ; %res = 0		%res = call i4 @llvm.usub.sat.i4(i4 2, i4 6) ; %res = 0


		'``llvm.sshl.sat.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax
		"""""""

		This is an overloaded intrinsic. You can use ``llvm.sshl.sat``
		on integers or vectors of integers of any bit width.

		::

		declare i16 @llvm.sshl.sat.i16(i16 %a, i16 %b)
		declare i32 @llvm.sshl.sat.i32(i32 %a, i32 %b)
		declare i64 @llvm.sshl.sat.i64(i64 %a, i64 %b)
		declare <4 x i32> @llvm.sshl.sat.v4i32(<4 x i32> %a, <4 x i32> %b)

		Overview
		"""""""""

		The '``llvm.sshl.sat``' family of intrinsic functions perform signed
		saturating left shift on the first argument.

		Arguments
		""""""""""

		The arguments (``%a`` and ``%b``) and the result may be of integer types of any
		bit width, but they must have the same bit width. ``%a`` is the value to be
		shifted, and ``%b`` is the amount to shift by. If ``b`` is (statically or
		dynamically) equal to or larger than the integer bit width of the arguments,
		the result is a :ref:`poison value <poisonvalues>`. If the arguments are
		vectors, each vector element of ``a`` is shifted by the corresponding shift
		amount in ``b``.


		Semantics:
		""""""""""

		The maximum value this operation can clamp to is the largest signed value
		representable by the bit width of the arguments. The minimum value is the
		smallest signed value representable by this bit width.


		Examples
		"""""""""

		.. code-block:: llvm

		%res = call i4 @llvm.sshl.sat.i4(i4 2, i4 1) ; %res = 4
		%res = call i4 @llvm.sshl.sat.i4(i4 2, i4 2) ; %res = 7
		%res = call i4 @llvm.sshl.sat.i4(i4 -5, i4 1) ; %res = -8
		%res = call i4 @llvm.sshl.sat.i4(i4 -1, i4 1) ; %res = -2


		'``llvm.ushl.sat.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax
		"""""""

		This is an overloaded intrinsic. You can use ``llvm.ushl.sat``
		on integers or vectors of integers of any bit width.

		::

		declare i16 @llvm.ushl.sat.i16(i16 %a, i16 %b)
		declare i32 @llvm.ushl.sat.i32(i32 %a, i32 %b)
		declare i64 @llvm.ushl.sat.i64(i64 %a, i64 %b)
		declare <4 x i32> @llvm.ushl.sat.v4i32(<4 x i32> %a, <4 x i32> %b)

		Overview
		"""""""""

		The '``llvm.ushl.sat``' family of intrinsic functions perform unsigned
		saturating left shift on the first argument.

		Arguments
		""""""""""

		The arguments (``%a`` and ``%b``) and the result may be of integer types of any
		bit width, but they must have the same bit width. ``%a`` is the value to be
		shifted, and ``%b`` is the amount to shift by. If ``b`` is (statically or
		dynamically) equal to or larger than the integer bit width of the arguments,
		the result is a :ref:`poison value <poisonvalues>`. If the arguments are
		vectors, each vector element of ``a`` is shifted by the corresponding shift
		amount in ``b``.

		Semantics:
		""""""""""

		The maximum value this operation can clamp to is the largest unsigned value
		representable by the bit width of the arguments.


		Examples
		"""""""""

		.. code-block:: llvm

		%res = call i4 @llvm.ushl.sat.i4(i4 2, i4 1) ; %res = 4
		%res = call i4 @llvm.ushl.sat.i4(i4 3, i4 3) ; %res = 15


Fixed Point Arithmetic Intrinsics		Fixed Point Arithmetic Intrinsics
---------------------------------		---------------------------------

A fixed point number represents a real data type for a number that has a fixed		A fixed point number represents a real data type for a number that has a fixed
number of digits after a radix point (equivalent to the decimal point '.').		number of digits after a radix point (equivalent to the decimal point '.').
The number of digits after the radix point is referred as the `scale`. These		The number of digits after the radix point is referred as the `scale`. These
are useful for representing fractional values to a specific precision. The		are useful for representing fractional values to a specific precision. The
following intrinsics perform fixed point arithmetic operations on 2 operands		following intrinsics perform fixed point arithmetic operations on 2 operands
...
%res = call i4 @llvm.smul.fix.i4(i4 3, i4 2, i32 1) ; %res = 3 (1.5 x 1 = 1.5)		%res = call i4 @llvm.smul.fix.i4(i4 3, i4 2, i32 1) ; %res = 3 (1.5 x 1 = 1.5)
%res = call i4 @llvm.smul.fix.i4(i4 3, i4 -2, i32 1) ; %res = -3 (1.5 x -1 = -1.5)		%res = call i4 @llvm.smul.fix.i4(i4 3, i4 -2, i32 1) ; %res = -3 (1.5 x -1 = -1.5)

; The result in the following could be rounded up to -2 or down to -2.5		; The result in the following could be rounded up to -2 or down to -2.5
%res = call i4 @llvm.smul.fix.i4(i4 3, i4 -3, i32 1) ; %res = -5 (or -4) (1.5 x -1.5 = -2.25)		%res = call i4 @llvm.smul.fix.i4(i4 3, i4 -3, i32 1) ; %res = -5 (or -4) (1.5 x -1.5 = -2.25)


'``llvm.umul.fix.*``' Intrinsics		'``llvm.umul.fix.*``' Intrinsics
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Syntax		Syntax
"""""""		"""""""

This is an overloaded intrinsic. You can use ``llvm.umul.fix``		This is an overloaded intrinsic. You can use ``llvm.umul.fix``
on any integer bit width or vectors of integers.		on any integer bit width or vectors of integers.

::		::
...
declare i32 @llvm.sdiv.fix.sat.i32(i32 %a, i32 %b, i32 %scale)		declare i32 @llvm.sdiv.fix.sat.i32(i32 %a, i32 %b, i32 %scale)
declare i64 @llvm.sdiv.fix.sat.i64(i64 %a, i64 %b, i32 %scale)		declare i64 @llvm.sdiv.fix.sat.i64(i64 %a, i64 %b, i32 %scale)
declare <4 x i32> @llvm.sdiv.fix.sat.v4i32(<4 x i32> %a, <4 x i32> %b, i32 %scale)		declare <4 x i32> @llvm.sdiv.fix.sat.v4i32(<4 x i32> %a, <4 x i32> %b, i32 %scale)

Overview		Overview
"""""""""		"""""""""

The '``llvm.sdiv.fix.sat``' family of intrinsic functions perform signed		The '``llvm.sdiv.fix.sat``' family of intrinsic functions perform signed
fixed point saturating division on 2 arguments of the same scale.		fixed point saturating division on 2 arguments of the same scale.

Arguments		Arguments
""""""""""		""""""""""

The arguments (%a and %b) and the result may be of integer types of any bit		The arguments (%a and %b) and the result may be of integer types of any bit
width, but they must have the same bit width. ``%a`` and ``%b`` are the two		width, but they must have the same bit width. ``%a`` and ``%b`` are the two
values that will undergo signed fixed point division. The argument		values that will undergo signed fixed point division. The argument
``%scale`` represents the scale of both operands, and must be a constant		``%scale`` represents the scale of both operands, and must be a constant

llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h

LegalizeResult		LegalizeResult
widenScalarMergeValues(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);		widenScalarMergeValues(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);
LegalizeResult		LegalizeResult
widenScalarUnmergeValues(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);		widenScalarUnmergeValues(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);
LegalizeResult		LegalizeResult
widenScalarExtract(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);		widenScalarExtract(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);
LegalizeResult		LegalizeResult
widenScalarInsert(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);		widenScalarInsert(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);
LegalizeResult widenScalarAddSubSat(MachineInstr &MI, unsigned TypeIdx,		LegalizeResult
LLT WideTy);		widenScalarAddSubShlSat(MachineInstr &MI, unsigned TypeIdx, LLT WideTy);

/// Helper function to split a wide generic register into bitwise blocks with		/// Helper function to split a wide generic register into bitwise blocks with
/// the given Type (which implies the number of blocks needed). The generic		/// the given Type (which implies the number of blocks needed). The generic
/// registers created are appended to Ops, starting at bit 0 of Reg.		/// registers created are appended to Ops, starting at bit 0 of Reg.
void extractParts(Register Reg, LLT Ty, int NumParts,		void extractParts(Register Reg, LLT Ty, int NumParts,
SmallVectorImpl<Register> &VRegs);		SmallVectorImpl<Register> &VRegs);

/// Version which handles irregular splits.		/// Version which handles irregular splits.
...
LegalizeResult lowerExtractInsertVectorElt(MachineInstr &MI);		LegalizeResult lowerExtractInsertVectorElt(MachineInstr &MI);
LegalizeResult lowerShuffleVector(MachineInstr &MI);		LegalizeResult lowerShuffleVector(MachineInstr &MI);
LegalizeResult lowerDynStackAlloc(MachineInstr &MI);		LegalizeResult lowerDynStackAlloc(MachineInstr &MI);
LegalizeResult lowerExtract(MachineInstr &MI);		LegalizeResult lowerExtract(MachineInstr &MI);
LegalizeResult lowerInsert(MachineInstr &MI);		LegalizeResult lowerInsert(MachineInstr &MI);
LegalizeResult lowerSADDO_SSUBO(MachineInstr &MI);		LegalizeResult lowerSADDO_SSUBO(MachineInstr &MI);
LegalizeResult lowerAddSubSatToMinMax(MachineInstr &MI);		LegalizeResult lowerAddSubSatToMinMax(MachineInstr &MI);
LegalizeResult lowerAddSubSatToAddoSubo(MachineInstr &MI);		LegalizeResult lowerAddSubSatToAddoSubo(MachineInstr &MI);
		LegalizeResult lowerShlSat(MachineInstr &MI);
LegalizeResult lowerBswap(MachineInstr &MI);		LegalizeResult lowerBswap(MachineInstr &MI);
LegalizeResult lowerBitreverse(MachineInstr &MI);		LegalizeResult lowerBitreverse(MachineInstr &MI);
LegalizeResult lowerReadWriteRegister(MachineInstr &MI);		LegalizeResult lowerReadWriteRegister(MachineInstr &MI);
};		};

/// Helper function that creates a libcall to the given \p Name using the given		/// Helper function that creates a libcall to the given \p Name using the given
/// calling convention \p CC.		/// calling convention \p CC.
LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult

llvm/include/llvm/CodeGen/ISDOpcodes.h

/// integers with the same bit width (W). If the true value of LHS - RHS		/// integers with the same bit width (W). If the true value of LHS - RHS
/// exceeds the largest value that can be represented by W bits, the		/// exceeds the largest value that can be represented by W bits, the
/// resulting value is this maximum value. Otherwise, if this value is less		/// resulting value is this maximum value. Otherwise, if this value is less
/// than the smallest value that can be represented by W bits, the		/// than the smallest value that can be represented by W bits, the
/// resulting value is this minimum value.		/// resulting value is this minimum value.
SSUBSAT,		SSUBSAT,
USUBSAT,		USUBSAT,

		/// RESULT = [US]SHLSAT(LHS, RHS) - Perform saturation left shift. The first
		/// operand is the value to be shifted, and the second argument is the amount
		/// to shift by. Both must be integers of the same bit width (W). If the true
		/// value of LHS << RHS exceeds the largest value that can be represented by
		/// W bits, the resulting value is this maximum value. Otherwise, if this
		/// value is less than the smallest value that can be represented by W bits,
		/// the resulting value is this minimum value.
		SSHLSAT,
		USHLSAT,

/// RESULT = [US]MULFIX(LHS, RHS, SCALE) - Perform fixed point multiplication		/// RESULT = [US]MULFIX(LHS, RHS, SCALE) - Perform fixed point multiplication
/// on		/// on
/// 2 integers with the same width and scale. SCALE represents the scale of		/// 2 integers with the same width and scale. SCALE represents the scale of
/// both operands as fixed point numbers. This SCALE parameter must be a		/// both operands as fixed point numbers. This SCALE parameter must be a
/// constant integer. A scale of zero is effectively performing		/// constant integer. A scale of zero is effectively performing
/// multiplication on 2 integers.		/// multiplication on 2 integers.
SMULFIX,		SMULFIX,
UMULFIX,		UMULFIX,
[... 971 lines not shown ...]
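
As a reading aid for the `[US]SHLSAT` node description above, here is a minimal reference model in plain C++. It is not code from the patch: the names `ushl_sat32` and `sshl_sat32` are hypothetical, the width is fixed at 32 bits for brevity, and the sketch assumes that shift amounts equal to or larger than the bit width are not meaningful inputs (they trip an assert). The "true value" of the shift is computed in 64-bit arithmetic and then clamped.

```cpp
#include <cassert>
#include <cstdint>

// Reference model only (hypothetical helper, not an LLVM API):
// unsigned saturating left shift on 32-bit values.
inline uint32_t ushl_sat32(uint32_t x, unsigned amt) {
  assert(amt < 32 && "sketch assumes the shift amount is below the bit width");
  // The true value of x << amt always fits in 64 bits.
  uint64_t wide = static_cast<uint64_t>(x) << amt;
  return wide > UINT32_MAX ? UINT32_MAX : static_cast<uint32_t>(wide);
}

// Reference model only (hypothetical helper, not an LLVM API):
// signed saturating left shift on 32-bit values.
inline int32_t sshl_sat32(int32_t x, unsigned amt) {
  assert(amt < 32 && "sketch assumes the shift amount is below the bit width");
  // Multiply by 2^amt instead of shifting, so negative inputs stay portable.
  int64_t wide = static_cast<int64_t>(x) * (int64_t{1} << amt);
  if (wide > INT32_MAX)
    return INT32_MAX; // clamp to the largest representable value
  if (wide < INT32_MIN)
    return INT32_MIN; // clamp to the smallest representable value
  return static_cast<int32_t>(wide);
}
```

With this model, `ushl_sat32(0x80000000u, 1)` saturates to the all-ones value, while `sshl_sat32(-1, 31)` is exactly representable and is returned unchanged as the minimum value.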

llvm/include/llvm/CodeGen/TargetLowering.h

	[... 4,385 lines not shown ...]
	/// bounds.			/// bounds.
	SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,			SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,
	SDValue Index) const;			SDValue Index) const;

	/// Method for building the DAG expansion of ISD::[US][ADD|SUB]SAT. This			/// Method for building the DAG expansion of ISD::[US][ADD|SUB]SAT. This
	/// method accepts integers as its arguments.			/// method accepts integers as its arguments.
	SDValue expandAddSubSat(SDNode *Node, SelectionDAG &DAG) const;			SDValue expandAddSubSat(SDNode *Node, SelectionDAG &DAG) const;

				/// Method for building the DAG expansion of ISD::[US]SHLSAT. This
				/// method accepts integers as its arguments.
				SDValue expandShlSat(SDNode *Node, SelectionDAG &DAG) const;

	/// Method for building the DAG expansion of ISD::[U|S]MULFIX[SAT]. This			/// Method for building the DAG expansion of ISD::[U|S]MULFIX[SAT]. This
	/// method accepts integers as its arguments.			/// method accepts integers as its arguments.
	SDValue expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const;			SDValue expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const;

	/// Method for building the DAG expansion of ISD::[US]DIVFIX[SAT]. This			/// Method for building the DAG expansion of ISD::[US]DIVFIX[SAT]. This
	/// method accepts integers as its arguments.			/// method accepts integers as its arguments.
	/// Note: This method may fail if the division could not be performed			/// Note: This method may fail if the division could not be performed
	/// within the type. Clients must retry with a wider type if this happens.			/// within the type. Clients must retry with a wider type if this happens.
	[... 120 lines not shown ...]

llvm/include/llvm/IR/Intrinsics.td

[... 993 lines not shown ...]	def int_uadd_sat : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>],		[LLVMMatchType<0>, LLVMMatchType<0>],
[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative]>;		[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative]>;
def int_ssub_sat : Intrinsic<[llvm_anyint_ty],		def int_ssub_sat : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>],		[LLVMMatchType<0>, LLVMMatchType<0>],
[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;		[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;
def int_usub_sat : Intrinsic<[llvm_anyint_ty],		def int_usub_sat : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>],		[LLVMMatchType<0>, LLVMMatchType<0>],
[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;		[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;
		def int_sshl_sat : Intrinsic<[llvm_anyint_ty],
		[LLVMMatchType<0>, LLVMMatchType<0>],
		[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;
		def int_ushl_sat : Intrinsic<[llvm_anyint_ty],
		[LLVMMatchType<0>, LLVMMatchType<0>],
		[IntrNoMem, IntrSpeculatable, IntrWillReturn]>;

//===------------------------- Fixed Point Arithmetic Intrinsics ---------------------===//		//===------------------------- Fixed Point Arithmetic Intrinsics ---------------------===//
//		//
def int_smul_fix : Intrinsic<[llvm_anyint_ty],		def int_smul_fix : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],		[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],
[IntrNoMem, IntrSpeculatable, IntrWillReturn,		[IntrNoMem, IntrSpeculatable, IntrWillReturn,
Commutative, ImmArg<ArgIndex<2>>]>;		Commutative, ImmArg<ArgIndex<2>>]>;

[... 562 lines not shown ...]

llvm/include/llvm/Support/TargetOpcodes.def

	[... 469 lines not shown ...]
	HANDLE_TARGET_OPCODE(G_SADDSAT)			HANDLE_TARGET_OPCODE(G_SADDSAT)

	/// Generic saturating unsigned subtraction.			/// Generic saturating unsigned subtraction.
	HANDLE_TARGET_OPCODE(G_USUBSAT)			HANDLE_TARGET_OPCODE(G_USUBSAT)

	/// Generic saturating signed subtraction.			/// Generic saturating signed subtraction.
	HANDLE_TARGET_OPCODE(G_SSUBSAT)			HANDLE_TARGET_OPCODE(G_SSUBSAT)

				/// Generic saturating unsigned left shift.
				HANDLE_TARGET_OPCODE(G_USHLSAT)

				/// Generic saturating signed left shift.
				HANDLE_TARGET_OPCODE(G_SSHLSAT)

	// Perform signed fixed point multiplication			// Perform signed fixed point multiplication
	HANDLE_TARGET_OPCODE(G_SMULFIX)			HANDLE_TARGET_OPCODE(G_SMULFIX)

	// Perform unsigned fixed point multiplication			// Perform unsigned fixed point multiplication
	HANDLE_TARGET_OPCODE(G_UMULFIX)			HANDLE_TARGET_OPCODE(G_UMULFIX)

	// Perform signed, saturating fixed point multiplication			// Perform signed, saturating fixed point multiplication
	HANDLE_TARGET_OPCODE(G_SMULFIXSAT)			HANDLE_TARGET_OPCODE(G_SMULFIXSAT)
	[... 213 lines not shown ...]

llvm/include/llvm/Target/GenericOpcodes.td

	[... 539 lines not shown ...]
	// Generic saturating signed subtraction.			// Generic saturating signed subtraction.
	def G_SSUBSAT : GenericInstruction {			def G_SSUBSAT : GenericInstruction {
	let OutOperandList = (outs type0:$dst);			let OutOperandList = (outs type0:$dst);
	let InOperandList = (ins type0:$src1, type0:$src2);			let InOperandList = (ins type0:$src1, type0:$src2);
	let hasSideEffects = 0;			let hasSideEffects = 0;
	let isCommutable = 0;			let isCommutable = 0;
	}			}

				// Generic saturating unsigned left shift.
				def G_USHLSAT : GenericInstruction {
				let OutOperandList = (outs type0:$dst);
				let InOperandList = (ins type0:$src1, type1:$src2);
				arsenm: The shift amount type doesn't have to match the shift value type, so src2 should use type1
				let hasSideEffects = 0;
				let isCommutable = 0;
				}

				// Generic saturating signed left shift.
				def G_SSHLSAT : GenericInstruction {
				let OutOperandList = (outs type0:$dst);
				let InOperandList = (ins type0:$src1, type1:$src2);
				arsenm: Ditto
				let hasSideEffects = 0;
				let isCommutable = 0;
				}

	/// RESULT = [US]MULFIX(LHS, RHS, SCALE) - Perform fixed point			/// RESULT = [US]MULFIX(LHS, RHS, SCALE) - Perform fixed point
	/// multiplication on 2 integers with the same width and scale. SCALE			/// multiplication on 2 integers with the same width and scale. SCALE
	/// represents the scale of both operands as fixed point numbers. This			/// represents the scale of both operands as fixed point numbers. This
	/// SCALE parameter must be a constant integer. A scale of zero is			/// SCALE parameter must be a constant integer. A scale of zero is
	/// effectively performing multiplication on 2 integers.			/// effectively performing multiplication on 2 integers.
	def G_SMULFIX : GenericInstruction {			def G_SMULFIX : GenericInstruction {
	let OutOperandList = (outs type0:$dst);			let OutOperandList = (outs type0:$dst);
	let InOperandList = (ins type0:$src0, type0:$src1, untyped_imm_0:$scale);			let InOperandList = (ins type0:$src0, type0:$src1, untyped_imm_0:$scale);
	[... 696 lines not shown ...]

llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td

	[... 67 lines not shown ...]
	def : GINodeEquiv<G_XOR, xor>;			def : GINodeEquiv<G_XOR, xor>;
	def : GINodeEquiv<G_SHL, shl>;			def : GINodeEquiv<G_SHL, shl>;
	def : GINodeEquiv<G_LSHR, srl>;			def : GINodeEquiv<G_LSHR, srl>;
	def : GINodeEquiv<G_ASHR, sra>;			def : GINodeEquiv<G_ASHR, sra>;
	def : GINodeEquiv<G_SADDSAT, saddsat>;			def : GINodeEquiv<G_SADDSAT, saddsat>;
	def : GINodeEquiv<G_UADDSAT, uaddsat>;			def : GINodeEquiv<G_UADDSAT, uaddsat>;
	def : GINodeEquiv<G_SSUBSAT, ssubsat>;			def : GINodeEquiv<G_SSUBSAT, ssubsat>;
	def : GINodeEquiv<G_USUBSAT, usubsat>;			def : GINodeEquiv<G_USUBSAT, usubsat>;
				def : GINodeEquiv<G_SSHLSAT, sshlsat>;
				def : GINodeEquiv<G_USHLSAT, ushlsat>;
	def : GINodeEquiv<G_SMULFIX, smulfix>;			def : GINodeEquiv<G_SMULFIX, smulfix>;
	def : GINodeEquiv<G_UMULFIX, umulfix>;			def : GINodeEquiv<G_UMULFIX, umulfix>;
	def : GINodeEquiv<G_SMULFIXSAT, smulfixsat>;			def : GINodeEquiv<G_SMULFIXSAT, smulfixsat>;
	def : GINodeEquiv<G_UMULFIXSAT, umulfixsat>;			def : GINodeEquiv<G_UMULFIXSAT, umulfixsat>;
	def : GINodeEquiv<G_SDIVFIX, sdivfix>;			def : GINodeEquiv<G_SDIVFIX, sdivfix>;
	def : GINodeEquiv<G_UDIVFIX, udivfix>;			def : GINodeEquiv<G_UDIVFIX, udivfix>;
	def : GINodeEquiv<G_SDIVFIXSAT, sdivfixsat>;			def : GINodeEquiv<G_SDIVFIXSAT, sdivfixsat>;
	def : GINodeEquiv<G_UDIVFIXSAT, udivfixsat>;			def : GINodeEquiv<G_UDIVFIXSAT, udivfixsat>;
	[... 124 lines not shown ...]

llvm/include/llvm/Target/TargetSelectionDAG.td

[... 390 lines not shown ...]	def umin : SDNode<"ISD::UMIN" , SDTIntBinOp,
[SDNPCommutative, SDNPAssociative]>;		[SDNPCommutative, SDNPAssociative]>;
def umax : SDNode<"ISD::UMAX" , SDTIntBinOp,		def umax : SDNode<"ISD::UMAX" , SDTIntBinOp,
[SDNPCommutative, SDNPAssociative]>;		[SDNPCommutative, SDNPAssociative]>;

def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;		def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;
def uaddsat : SDNode<"ISD::UADDSAT" , SDTIntBinOp, [SDNPCommutative]>;		def uaddsat : SDNode<"ISD::UADDSAT" , SDTIntBinOp, [SDNPCommutative]>;
def ssubsat : SDNode<"ISD::SSUBSAT" , SDTIntBinOp>;		def ssubsat : SDNode<"ISD::SSUBSAT" , SDTIntBinOp>;
def usubsat : SDNode<"ISD::USUBSAT" , SDTIntBinOp>;		def usubsat : SDNode<"ISD::USUBSAT" , SDTIntBinOp>;
		def sshlsat : SDNode<"ISD::SSHLSAT" , SDTIntBinOp>;
		def ushlsat : SDNode<"ISD::USHLSAT" , SDTIntBinOp>;

def smulfix : SDNode<"ISD::SMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;		def smulfix : SDNode<"ISD::SMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;
def smulfixsat : SDNode<"ISD::SMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;		def smulfixsat : SDNode<"ISD::SMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;
def umulfix : SDNode<"ISD::UMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;		def umulfix : SDNode<"ISD::UMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;
def umulfixsat : SDNode<"ISD::UMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;		def umulfixsat : SDNode<"ISD::UMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;
def sdivfix : SDNode<"ISD::SDIVFIX" , SDTIntScaledBinOp>;		def sdivfix : SDNode<"ISD::SDIVFIX" , SDTIntScaledBinOp>;
def sdivfixsat : SDNode<"ISD::SDIVFIXSAT", SDTIntScaledBinOp>;		def sdivfixsat : SDNode<"ISD::SDIVFIXSAT", SDTIntScaledBinOp>;
def udivfix : SDNode<"ISD::UDIVFIX" , SDTIntScaledBinOp>;		def udivfix : SDNode<"ISD::UDIVFIX" , SDTIntScaledBinOp>;
[... 1,219 lines not shown ...]

llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp

[... 1,478 lines not shown ...]	bool IRTranslator::translateKnownIntrinsic(const CallInst &CI, Intrinsic::ID ID,
case Intrinsic::uadd_sat:		case Intrinsic::uadd_sat:
return translateBinaryOp(TargetOpcode::G_UADDSAT, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_UADDSAT, CI, MIRBuilder);
case Intrinsic::sadd_sat:		case Intrinsic::sadd_sat:
return translateBinaryOp(TargetOpcode::G_SADDSAT, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SADDSAT, CI, MIRBuilder);
case Intrinsic::usub_sat:		case Intrinsic::usub_sat:
return translateBinaryOp(TargetOpcode::G_USUBSAT, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_USUBSAT, CI, MIRBuilder);
case Intrinsic::ssub_sat:		case Intrinsic::ssub_sat:
return translateBinaryOp(TargetOpcode::G_SSUBSAT, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SSUBSAT, CI, MIRBuilder);
		case Intrinsic::ushl_sat:
		return translateBinaryOp(TargetOpcode::G_USHLSAT, CI, MIRBuilder);
		case Intrinsic::sshl_sat:
		return translateBinaryOp(TargetOpcode::G_SSHLSAT, CI, MIRBuilder);
case Intrinsic::umin:		case Intrinsic::umin:
return translateBinaryOp(TargetOpcode::G_UMIN, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_UMIN, CI, MIRBuilder);
case Intrinsic::umax:		case Intrinsic::umax:
return translateBinaryOp(TargetOpcode::G_UMAX, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_UMAX, CI, MIRBuilder);
case Intrinsic::smin:		case Intrinsic::smin:
return translateBinaryOp(TargetOpcode::G_SMIN, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SMIN, CI, MIRBuilder);
case Intrinsic::smax:		case Intrinsic::smax:
return translateBinaryOp(TargetOpcode::G_SMAX, CI, MIRBuilder);		return translateBinaryOp(TargetOpcode::G_SMAX, CI, MIRBuilder);
[... 1,089 lines not shown ...]

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

[... 1,708 lines not shown ...]	LegalizerHelper::widenScalarInsert(MachineInstr &MI, unsigned TypeIdx,
Observer.changingInstr(MI);		Observer.changingInstr(MI);
widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
widenScalarDst(MI, WideTy);		widenScalarDst(MI, WideTy);
Observer.changedInstr(MI);		Observer.changedInstr(MI);
return Legalized;		return Legalized;
}		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
LegalizerHelper::widenScalarAddSubSat(MachineInstr &MI, unsigned TypeIdx,		LegalizerHelper::widenScalarAddSubShlSat(MachineInstr &MI, unsigned TypeIdx,
LLT WideTy) {		LLT WideTy) {
bool IsSigned = MI.getOpcode() == TargetOpcode::G_SADDSAT ||		bool IsSigned = MI.getOpcode() == TargetOpcode::G_SADDSAT ||
MI.getOpcode() == TargetOpcode::G_SSUBSAT;		MI.getOpcode() == TargetOpcode::G_SSUBSAT ||
		MI.getOpcode() == TargetOpcode::G_SSHLSAT;
		bool IsShift = MI.getOpcode() == TargetOpcode::G_SSHLSAT ||
		MI.getOpcode() == TargetOpcode::G_USHLSAT;
// We can convert this to:		// We can convert this to:
// 1. Any extend iN to iM		// 1. Any extend iN to iM
// 2. SHL by M-N		// 2. SHL by M-N
// 3. [US][ADD|SUB]SAT		// 3. [US][ADD|SUB|SHL]SAT
// 4. L/ASHR by M-N		// 4. L/ASHR by M-N
//		//
// It may be more efficient to lower this to a min and a max operation in		// It may be more efficient to lower this to a min and a max operation in
// the higher precision arithmetic if the promoted operation isn't legal,		// the higher precision arithmetic if the promoted operation isn't legal,
// but this decision is up to the target's lowering request.		// but this decision is up to the target's lowering request.
Register DstReg = MI.getOperand(0).getReg();		Register DstReg = MI.getOperand(0).getReg();

unsigned NewBits = WideTy.getScalarSizeInBits();		unsigned NewBits = WideTy.getScalarSizeInBits();
unsigned SHLAmount = NewBits - MRI.getType(DstReg).getScalarSizeInBits();		unsigned SHLAmount = NewBits - MRI.getType(DstReg).getScalarSizeInBits();

		// Shifts must zero-extend the RHS to preserve the unsigned quantity, and
		// must not left shift the RHS to preserve the shift amount.
auto LHS = MIRBuilder.buildAnyExt(WideTy, MI.getOperand(1));		auto LHS = MIRBuilder.buildAnyExt(WideTy, MI.getOperand(1));
auto RHS = MIRBuilder.buildAnyExt(WideTy, MI.getOperand(2));		auto RHS = IsShift ? MIRBuilder.buildZExt(WideTy, MI.getOperand(2))
		: MIRBuilder.buildAnyExt(WideTy, MI.getOperand(2));
auto ShiftK = MIRBuilder.buildConstant(WideTy, SHLAmount);		auto ShiftK = MIRBuilder.buildConstant(WideTy, SHLAmount);
auto ShiftL = MIRBuilder.buildShl(WideTy, LHS, ShiftK);		auto ShiftL = MIRBuilder.buildShl(WideTy, LHS, ShiftK);
auto ShiftR = MIRBuilder.buildShl(WideTy, RHS, ShiftK);		auto ShiftR = IsShift ? RHS : MIRBuilder.buildShl(WideTy, RHS, ShiftK);

auto WideInst = MIRBuilder.buildInstr(MI.getOpcode(), {WideTy},		auto WideInst = MIRBuilder.buildInstr(MI.getOpcode(), {WideTy},
{ShiftL, ShiftR}, MI.getFlags());		{ShiftL, ShiftR}, MI.getFlags());

// Use a shift that will preserve the number of sign bits when the trunc is		// Use a shift that will preserve the number of sign bits when the trunc is
// folded away.		// folded away.
auto Result = IsSigned ? MIRBuilder.buildAShr(WideTy, WideInst, ShiftK)		auto Result = IsSigned ? MIRBuilder.buildAShr(WideTy, WideInst, ShiftK)
: MIRBuilder.buildLShr(WideTy, WideInst, ShiftK);		: MIRBuilder.buildLShr(WideTy, WideInst, ShiftK);
[... 36 lines not shown ...]	case TargetOpcode::G_USUBO: {
MIRBuilder.buildICmp(CmpInst::ICMP_NE, MI.getOperand(1), NewOp, AndOp);		MIRBuilder.buildICmp(CmpInst::ICMP_NE, MI.getOperand(1), NewOp, AndOp);
// Now trunc the NewOp to the original result.		// Now trunc the NewOp to the original result.
MIRBuilder.buildTrunc(MI.getOperand(0), NewOp);		MIRBuilder.buildTrunc(MI.getOperand(0), NewOp);
MI.eraseFromParent();		MI.eraseFromParent();
return Legalized;		return Legalized;
}		}
case TargetOpcode::G_SADDSAT:		case TargetOpcode::G_SADDSAT:
case TargetOpcode::G_SSUBSAT:		case TargetOpcode::G_SSUBSAT:
		case TargetOpcode::G_SSHLSAT:
case TargetOpcode::G_UADDSAT:		case TargetOpcode::G_UADDSAT:
case TargetOpcode::G_USUBSAT:		case TargetOpcode::G_USUBSAT:
return widenScalarAddSubSat(MI, TypeIdx, WideTy);		case TargetOpcode::G_USHLSAT:
		return widenScalarAddSubShlSat(MI, TypeIdx, WideTy);
case TargetOpcode::G_CTTZ:		case TargetOpcode::G_CTTZ:
case TargetOpcode::G_CTTZ_ZERO_UNDEF:		case TargetOpcode::G_CTTZ_ZERO_UNDEF:
case TargetOpcode::G_CTLZ:		case TargetOpcode::G_CTLZ:
case TargetOpcode::G_CTLZ_ZERO_UNDEF:		case TargetOpcode::G_CTLZ_ZERO_UNDEF:
case TargetOpcode::G_CTPOP: {		case TargetOpcode::G_CTPOP: {
if (TypeIdx == 0) {		if (TypeIdx == 0) {
Observer.changingInstr(MI);		Observer.changingInstr(MI);
widenScalarDst(MI, WideTy, 0);		widenScalarDst(MI, WideTy, 0);
[... 1,138 lines not shown ...]	case G_SSUBSAT: {

// FIXME: It would probably make more sense to see if G_SADDO is preferred,		// FIXME: It would probably make more sense to see if G_SADDO is preferred,
// since it's a shorter expansion. However, we would need to figure out the		// since it's a shorter expansion. However, we would need to figure out the
// preferred boolean type for the carry out for the query.		// preferred boolean type for the carry out for the query.
if (LI.isLegalOrCustom({G_SMIN, Ty}) && LI.isLegalOrCustom({G_SMAX, Ty}))		if (LI.isLegalOrCustom({G_SMIN, Ty}) && LI.isLegalOrCustom({G_SMAX, Ty}))
return lowerAddSubSatToMinMax(MI);		return lowerAddSubSatToMinMax(MI);
return lowerAddSubSatToAddoSubo(MI);		return lowerAddSubSatToAddoSubo(MI);
}		}
		case G_SSHLSAT:
		case G_USHLSAT:
		return lowerShlSat(MI);
}		}
}		}

Align LegalizerHelper::getStackTemporaryAlignment(LLT Ty,		Align LegalizerHelper::getStackTemporaryAlignment(LLT Ty,
Align MinAlign) const {		Align MinAlign) const {
// FIXME: We're missing a way to go back from LLT to llvm::Type to query the		// FIXME: We're missing a way to go back from LLT to llvm::Type to query the
// datalayout for the preferred alignment. Also there should be a target hook		// datalayout for the preferred alignment. Also there should be a target hook
// for this to allow targets to reduce the alignment and ignore the		// for this to allow targets to reduce the alignment and ignore the
[... 862 lines not shown ...]	LegalizerHelper::fewerElementsVector(MachineInstr &MI, unsigned TypeIdx,
case G_SADDSAT:		case G_SADDSAT:
case G_SSUBSAT:		case G_SSUBSAT:
case G_UADDSAT:		case G_UADDSAT:
case G_USUBSAT:		case G_USUBSAT:
return reduceOperationWidth(MI, TypeIdx, NarrowTy);		return reduceOperationWidth(MI, TypeIdx, NarrowTy);
case G_SHL:		case G_SHL:
case G_LSHR:		case G_LSHR:
case G_ASHR:		case G_ASHR:
		case G_SSHLSAT:
		case G_USHLSAT:
case G_CTLZ:		case G_CTLZ:
case G_CTLZ_ZERO_UNDEF:		case G_CTLZ_ZERO_UNDEF:
case G_CTTZ:		case G_CTTZ:
case G_CTTZ_ZERO_UNDEF:		case G_CTTZ_ZERO_UNDEF:
case G_CTPOP:		case G_CTPOP:
case G_FCOPYSIGN:		case G_FCOPYSIGN:
return fewerElementsVectorMultiEltType(MI, TypeIdx, NarrowTy);		return fewerElementsVectorMultiEltType(MI, TypeIdx, NarrowTy);
case G_ZEXT:		case G_ZEXT:
[... 2,010 lines not shown ...]	LegalizerHelper::lowerAddSubSatToAddoSubo(MachineInstr &MI) {
}		}
MIRBuilder.buildSelect(Res, Ov, Clamp, Tmp);		MIRBuilder.buildSelect(Res, Ov, Clamp, Tmp);

MI.eraseFromParent();		MI.eraseFromParent();
return Legalized;		return Legalized;
}		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
		LegalizerHelper::lowerShlSat(MachineInstr &MI) {
		assert((MI.getOpcode() == TargetOpcode::G_SSHLSAT ||
		MI.getOpcode() == TargetOpcode::G_USHLSAT) &&
		"Expected shlsat opcode!");
		bool IsSigned = MI.getOpcode() == TargetOpcode::G_SSHLSAT;
		Register Res = MI.getOperand(0).getReg();
		Register LHS = MI.getOperand(1).getReg();
		Register RHS = MI.getOperand(2).getReg();
		LLT Ty = MRI.getType(Res);
		LLT BoolTy = Ty.changeElementSize(1);

		unsigned BW = Ty.getScalarSizeInBits();
		auto Result = MIRBuilder.buildShl(Ty, LHS, RHS);
		auto Orig = IsSigned ? MIRBuilder.buildAShr(Ty, Result, RHS)
		: MIRBuilder.buildLShr(Ty, Result, RHS);

		MachineInstrBuilder SatVal;
		if (IsSigned) {
		auto SatMin = MIRBuilder.buildConstant(Ty, APInt::getSignedMinValue(BW));
		auto SatMax = MIRBuilder.buildConstant(Ty, APInt::getSignedMaxValue(BW));
		auto Cmp = MIRBuilder.buildICmp(CmpInst::ICMP_SLT, BoolTy, LHS,
		MIRBuilder.buildConstant(Ty, 0));
		SatVal = MIRBuilder.buildSelect(Ty, Cmp, SatMin, SatMax);
		} else {
		SatVal = MIRBuilder.buildConstant(Ty, APInt::getMaxValue(BW));
		}
		auto Ov = MIRBuilder.buildICmp(CmpInst::ICMP_NE, Ty, LHS, Orig);
		MIRBuilder.buildSelect(Res, Ov, SatVal, Result);

		MI.eraseFromParent();
		return Legalized;
		}
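
The expansion in `lowerShlSat` above follows a shift, shift-back, compare pattern. The sketch below restates that pattern on plain 32-bit integers so the overflow test is easier to follow. It is an illustration under stated assumptions, not the patch's code: the function names are hypothetical, the shift amount is assumed to be below the bit width, and the signed variant assumes two's complement conversion and an arithmetic right shift (guaranteed from C++20, implementation-defined before that).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical scalar restatement of the unsigned expansion:
//   Result = LHS << RHS; Orig = Result >>u RHS; overflow iff LHS != Orig.
inline uint32_t ushl_sat_expand(uint32_t lhs, unsigned rhs) {
  assert(rhs < 32 && "sketch assumes an in-range shift amount");
  uint32_t result = lhs << rhs;  // wrapping shift, may drop high bits
  uint32_t orig = result >> rhs; // logical shift back
  bool overflow = lhs != orig;   // bits were lost if the round trip differs
  return overflow ? UINT32_MAX : result;
}

// Hypothetical scalar restatement of the signed expansion:
//   Orig uses an arithmetic shift, and the saturation value is chosen
//   from the sign of LHS.
inline int32_t sshl_sat_expand(int32_t lhs, unsigned rhs) {
  assert(rhs < 32 && "sketch assumes an in-range shift amount");
  // Shift in unsigned arithmetic to avoid signed-overflow undefined behaviour.
  int32_t result = static_cast<int32_t>(static_cast<uint32_t>(lhs) << rhs);
  int32_t orig = result >> rhs; // assumed to be an arithmetic shift
  int32_t sat = lhs < 0 ? INT32_MIN : INT32_MAX;
  return lhs != orig ? sat : result;
}
```

For example, `sshl_sat_expand(1, 31)` wraps to the minimum value, shifts back to -1, fails the comparison against 1, and therefore selects the maximum value.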

		LegalizerHelper::LegalizeResult
LegalizerHelper::lowerBswap(MachineInstr &MI) {		LegalizerHelper::lowerBswap(MachineInstr &MI) {
Register Dst = MI.getOperand(0).getReg();		Register Dst = MI.getOperand(0).getReg();
Register Src = MI.getOperand(1).getReg();		Register Src = MI.getOperand(1).getReg();
const LLT Ty = MRI.getType(Src);		const LLT Ty = MRI.getType(Src);
unsigned SizeInBytes = (Ty.getScalarSizeInBits() + 7) / 8;		unsigned SizeInBytes = (Ty.getScalarSizeInBits() + 7) / 8;
unsigned BaseShiftAmt = (SizeInBytes - 1) * 8;		unsigned BaseShiftAmt = (SizeInBytes - 1) * 8;

// Swap most and least significant byte, set remaining bytes in Res to zero.		// Swap most and least significant byte, set remaining bytes in Res to zero.
[... 14 lines not shown ...]	for (unsigned i = 1; i < SizeInBytes / 2; ++i) {
Res = MIRBuilder.buildOr(Ty, Res, LoShiftedLeft);		Res = MIRBuilder.buildOr(Ty, Res, LoShiftedLeft);
// High byte shifted right to place of low byte: (Src >> ShiftAmt) & Mask.		// High byte shifted right to place of low byte: (Src >> ShiftAmt) & Mask.
auto SrcShiftedRight = MIRBuilder.buildLShr(Ty, Src, ShiftAmt);		auto SrcShiftedRight = MIRBuilder.buildLShr(Ty, Src, ShiftAmt);
auto HiShiftedRight = MIRBuilder.buildAnd(Ty, SrcShiftedRight, Mask);		auto HiShiftedRight = MIRBuilder.buildAnd(Ty, SrcShiftedRight, Mask);
Res = MIRBuilder.buildOr(Ty, Res, HiShiftedRight);		Res = MIRBuilder.buildOr(Ty, Res, HiShiftedRight);
}		}
Res.getInstr()->getOperand(0).setReg(Dst);		Res.getInstr()->getOperand(0).setReg(Dst);

MI.eraseFromParent();		MI.eraseFromParent();
		arsenm: Extra blank line here
return Legalized;		return Legalized;
}		}

//{ (Src & Mask) >> N } | { (Src << N) & Mask }		//{ (Src & Mask) >> N } | { (Src << N) & Mask }
static MachineInstrBuilder SwapN(unsigned N, DstOp Dst, MachineIRBuilder &B,		static MachineInstrBuilder SwapN(unsigned N, DstOp Dst, MachineIRBuilder &B,
MachineInstrBuilder Src, APInt Mask) {		MachineInstrBuilder Src, APInt Mask) {
const LLT Ty = Dst.getLLTTy(*B.getMRI());		const LLT Ty = Dst.getLLTTy(*B.getMRI());
MachineInstrBuilder C_N = B.buildConstant(Ty, N);		MachineInstrBuilder C_N = B.buildConstant(Ty, N);
[... 64 lines not shown ...]
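
The widening steps listed in `widenScalarAddSubShlSat` in this file (extend, SHL by M-N, saturating operation, shift right by M-N) treat the shift amount differently from the add/sub operands: it is zero-extended and left unshifted. A minimal sketch of that idea for an unsigned 8-bit saturating shift carried out in 32 bits follows. All names are hypothetical, `wide_ushl_sat` only stands in for a wide saturating shift that is assumed to be available, and the sketch assumes the shift amount is below the narrow bit width.

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for a 32-bit unsigned saturating shift (hypothetical helper).
inline uint32_t wide_ushl_sat(uint32_t x, unsigned amt) {
  assert(amt < 32 && "sketch assumes an in-range shift amount");
  uint64_t wide = static_cast<uint64_t>(x) << amt;
  return wide > UINT32_MAX ? UINT32_MAX : static_cast<uint32_t>(wide);
}

// Hypothetical illustration of widening an 8-bit unsigned saturating shift.
inline uint8_t ushl_sat8_widened(uint8_t lhs, uint8_t rhs) {
  assert(rhs < 8 && "sketch assumes the amount is below the narrow width");
  const unsigned shl_amount = 32 - 8; // M - N
  // Steps 1 and 2: extend LHS and move it into the top bits, so that
  // saturation in the wide type coincides with saturation in the narrow type.
  uint32_t wide_lhs = static_cast<uint32_t>(lhs) << shl_amount;
  // The shift amount is zero-extended and NOT shifted; it must keep its value.
  uint32_t wide_rhs = rhs;
  // Step 3: saturating shift in the wide type.
  uint32_t wide_res = wide_ushl_sat(wide_lhs, wide_rhs);
  // Step 4: logical shift right by M - N, then truncate.
  return static_cast<uint8_t>(wide_res >> shl_amount);
}
```

A signed variant would use an arithmetic shift right in the last step, matching the `IsSigned` selection between `buildAShr` and `buildLShr` in the patch.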

llvm/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp

[... 971 lines not shown ...]	case TargetOpcode::G_SSUBSAT: {
assert(SrcOps.size() == 2 && "Invalid Srcs");		assert(SrcOps.size() == 2 && "Invalid Srcs");
validateBinaryOp(DstOps[0].getLLTTy(*getMRI()),		validateBinaryOp(DstOps[0].getLLTTy(*getMRI()),
SrcOps[0].getLLTTy(*getMRI()),		SrcOps[0].getLLTTy(*getMRI()),
SrcOps[1].getLLTTy(*getMRI()));		SrcOps[1].getLLTTy(*getMRI()));
break;		break;
}		}
case TargetOpcode::G_SHL:		case TargetOpcode::G_SHL:
case TargetOpcode::G_ASHR:		case TargetOpcode::G_ASHR:
case TargetOpcode::G_LSHR: {		case TargetOpcode::G_LSHR:
		case TargetOpcode::G_USHLSAT:
		case TargetOpcode::G_SSHLSAT: {
assert(DstOps.size() == 1 && "Invalid Dst");		assert(DstOps.size() == 1 && "Invalid Dst");
assert(SrcOps.size() == 2 && "Invalid Srcs");		assert(SrcOps.size() == 2 && "Invalid Srcs");
validateShiftOp(DstOps[0].getLLTTy(*getMRI()),		validateShiftOp(DstOps[0].getLLTTy(*getMRI()),
SrcOps[0].getLLTTy(*getMRI()),		SrcOps[0].getLLTTy(*getMRI()),
SrcOps[1].getLLTTy(*getMRI()));		SrcOps[1].getLLTTy(*getMRI()));
break;		break;
}		}
case TargetOpcode::G_SEXT:		case TargetOpcode::G_SEXT:
[... 188 lines not shown ...]

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

[... 1,112 lines not shown ...]	if (Action == TargetLowering::Expand) {
ReplaceNode(Node, NewVal.getNode());		ReplaceNode(Node, NewVal.getNode());
LegalizeOp(NewVal.getNode());		LegalizeOp(NewVal.getNode());
return;		return;
}		}
break;		break;
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: {		case ISD::USUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT: {
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
}		}
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
case ISD::UMULFIXSAT:		case ISD::UMULFIXSAT:
case ISD::SDIVFIX:		case ISD::SDIVFIX:
[... 2,341 lines not shown ...]	if (TLI.expandROT(Node, Tmp1, DAG))
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
Results.push_back(TLI.expandAddSubSat(Node, DAG));		Results.push_back(TLI.expandAddSubSat(Node, DAG));
break;		break;
		case ISD::SSHLSAT:
		case ISD::USHLSAT:
		Results.push_back(TLI.expandShlSat(Node, DAG));
		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
case ISD::UMULFIXSAT:		case ISD::UMULFIXSAT:
Results.push_back(TLI.expandFixedPointMul(Node, DAG));		Results.push_back(TLI.expandFixedPointMul(Node, DAG));
break;		break;
case ISD::SDIVFIX:		case ISD::SDIVFIX:
case ISD::SDIVFIXSAT:		case ISD::SDIVFIXSAT:
[... 1,412 lines not shown ...]

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

[... 148 lines not shown ...]	#endif
case ISD::ADDE:		case ISD::ADDE:
case ISD::SUBE:		case ISD::SUBE:
case ISD::ADDCARRY:		case ISD::ADDCARRY:
case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;		case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;

case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: Res = PromoteIntRes_ADDSUBSAT(N); break;		case ISD::USUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT: Res = PromoteIntRes_ADDSUBSHLSAT(N); break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
case ISD::UMULFIXSAT: Res = PromoteIntRes_MULFIX(N); break;		case ISD::UMULFIXSAT: Res = PromoteIntRes_MULFIX(N); break;

case ISD::SDIVFIX:		case ISD::SDIVFIX:
case ISD::SDIVFIXSAT:		case ISD::SDIVFIXSAT:
[... 529 lines not shown ...]	SDValue DAGTypeLegalizer::PromoteIntRes_Overflow(SDNode *N) {
// Modified the sum result - switch anything that used the old sum to use		// Modified the sum result - switch anything that used the old sum to use
// the new one.		// the new one.
ReplaceValueWith(SDValue(N, 0), Res);		ReplaceValueWith(SDValue(N, 0), Res);

// Convert to the expected type.		// Convert to the expected type.
return DAG.getBoolExtOrTrunc(Res.getValue(1), dl, NVT, VT);		return DAG.getBoolExtOrTrunc(Res.getValue(1), dl, NVT, VT);
}		}

SDValue DAGTypeLegalizer::PromoteIntRes_ADDSUBSAT(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_ADDSUBSHLSAT(SDNode *N) {
// If the promoted type is legal, we can convert this to:		// If the promoted type is legal, we can convert this to:
// 1. ANY_EXTEND iN to iM		// 1. ANY_EXTEND iN to iM
// 2. SHL by M-N		// 2. SHL by M-N
// 3. [US][ADD|SUB]SAT		// 3. [US][ADD|SUB|SHL]SAT
// 4. L/ASHR by M-N		// 4. L/ASHR by M-N
// Else it is more efficient to convert this to a min and a max		// Else it is more efficient to convert this to a min and a max
// operation in the higher precision arithmetic.		// operation in the higher precision arithmetic.
SDLoc dl(N);		SDLoc dl(N);
SDValue Op1 = N->getOperand(0);		SDValue Op1 = N->getOperand(0);
SDValue Op2 = N->getOperand(1);		SDValue Op2 = N->getOperand(1);
unsigned OldBits = Op1.getScalarValueSizeInBits();		unsigned OldBits = Op1.getScalarValueSizeInBits();

unsigned Opcode = N->getOpcode();		unsigned Opcode = N->getOpcode();
		bool IsShift = Opcode == ISD::USHLSAT || Opcode == ISD::SSHLSAT;

SDValue Op1Promoted, Op2Promoted;		SDValue Op1Promoted, Op2Promoted;
if (Opcode == ISD::UADDSAT || Opcode == ISD::USUBSAT) {		if (IsShift) {
		Op1Promoted = GetPromotedInteger(Op1);
		Op2Promoted = ZExtPromotedInteger(Op2);
		} else if (Opcode == ISD::UADDSAT || Opcode == ISD::USUBSAT) {
Op1Promoted = ZExtPromotedInteger(Op1);		Op1Promoted = ZExtPromotedInteger(Op1);
Op2Promoted = ZExtPromotedInteger(Op2);		Op2Promoted = ZExtPromotedInteger(Op2);
} else {		} else {
Op1Promoted = SExtPromotedInteger(Op1);		Op1Promoted = SExtPromotedInteger(Op1);
Op2Promoted = SExtPromotedInteger(Op2);		Op2Promoted = SExtPromotedInteger(Op2);
}		}
EVT PromotedType = Op1Promoted.getValueType();		EVT PromotedType = Op1Promoted.getValueType();
unsigned NewBits = PromotedType.getScalarSizeInBits();		unsigned NewBits = PromotedType.getScalarSizeInBits();

if (TLI.isOperationLegalOrCustom(Opcode, PromotedType)) {		// Shift cannot use a min/max expansion, we can't detect overflow if all of
		// the bits have been shifted out.
		if (IsShift || TLI.isOperationLegalOrCustom(Opcode, PromotedType)) {
unsigned ShiftOp;		unsigned ShiftOp;
switch (Opcode) {		switch (Opcode) {
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
		case ISD::SSHLSAT:
ShiftOp = ISD::SRA;		ShiftOp = ISD::SRA;
break;		break;
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
		case ISD::USHLSAT:
ShiftOp = ISD::SRL;		ShiftOp = ISD::SRL;
break;		break;
default:		default:
llvm_unreachable("Expected opcode to be signed or unsigned saturation "		llvm_unreachable("Expected opcode to be signed or unsigned saturation "
"addition or subtraction");		"addition, subtraction or left shift");
}		}

unsigned SHLAmount = NewBits - OldBits;		unsigned SHLAmount = NewBits - OldBits;
EVT SHVT = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());		EVT SHVT = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());
SDValue ShiftAmount = DAG.getConstant(SHLAmount, dl, SHVT);		SDValue ShiftAmount = DAG.getConstant(SHLAmount, dl, SHVT);
Op1Promoted =		Op1Promoted =
DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted, ShiftAmount);		DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted, ShiftAmount);
		if (!IsShift)
Op2Promoted =		Op2Promoted =
DAG.getNode(ISD::SHL, dl, PromotedType, Op2Promoted, ShiftAmount);		DAG.getNode(ISD::SHL, dl, PromotedType, Op2Promoted, ShiftAmount);

SDValue Result =		SDValue Result =
DAG.getNode(Opcode, dl, PromotedType, Op1Promoted, Op2Promoted);		DAG.getNode(Opcode, dl, PromotedType, Op1Promoted, Op2Promoted);
return DAG.getNode(ShiftOp, dl, PromotedType, Result, ShiftAmount);		return DAG.getNode(ShiftOp, dl, PromotedType, Result, ShiftAmount);
} else {		} else {
if (Opcode == ISD::USUBSAT) {		if (Opcode == ISD::USUBSAT) {
SDValue Max =		SDValue Max =
DAG.getNode(ISD::UMAX, dl, PromotedType, Op1Promoted, Op2Promoted);		DAG.getNode(ISD::UMAX, dl, PromotedType, Op1Promoted, Op2Promoted);
Show All 16 Lines	if (IsShift \|\| TLI.isOperationLegalOrCustom(Opcode, PromotedType)) {
SDValue Result =		SDValue Result =
DAG.getNode(AddOp, dl, PromotedType, Op1Promoted, Op2Promoted);		DAG.getNode(AddOp, dl, PromotedType, Op1Promoted, Op2Promoted);
Result = DAG.getNode(ISD::SMIN, dl, PromotedType, Result, SatMax);		Result = DAG.getNode(ISD::SMIN, dl, PromotedType, Result, SatMax);
Result = DAG.getNode(ISD::SMAX, dl, PromotedType, Result, SatMin);		Result = DAG.getNode(ISD::SMAX, dl, PromotedType, Result, SatMin);
return Result;		return Result;
}		}
}		}
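
For readers following the promotion path above, here is a minimal scalar sketch of the four-step scheme from the function comment (extend, SHL by M-N, saturating op at the wider width, L/ASHR by M-N), specialised to an i8 saturating shift promoted to i16. It is not part of the patch: the helper names (`ref_ushl_sat`, `ref_sshl_sat`, `promoted_ushl_sat8`, `promoted_sshl_sat8`, `promotion_matches_reference`) are hypothetical, and the sketch assumes shift amounts smaller than the original bit width.

```cpp
#include <cassert>
#include <cstdint>

// Reference unsigned saturating shift on a W-bit value (W <= 16, y < W).
uint64_t ref_ushl_sat(uint64_t x, unsigned y, unsigned W) {
  const uint64_t max = (uint64_t(1) << W) - 1;
  const uint64_t wide = x << y;  // cannot overflow 64 bits for W <= 16
  return wide > max ? max : wide;
}

// Reference signed saturating shift on a W-bit value (W <= 16, y < W).
// Multiplication is used so that no negative value is ever left-shifted.
int64_t ref_sshl_sat(int64_t x, unsigned y, unsigned W) {
  const int64_t max = (int64_t(1) << (W - 1)) - 1;
  const int64_t min = -max - 1;
  const int64_t wide = x * (int64_t(1) << y);
  return wide > max ? max : (wide < min ? min : wide);
}

// i8 ushl.sat carried out at i16, mirroring steps 1-4.
uint8_t promoted_ushl_sat8(uint8_t x, unsigned y) {
  const uint64_t shifted = uint64_t(x) << 8;          // steps 1-2: extend, SHL by 8
  const uint64_t sat = ref_ushl_sat(shifted, y, 16);  // step 3: saturate at i16
  return uint8_t(sat >> 8);                           // step 4: LSHR by 8
}

// i8 sshl.sat carried out at i16, mirroring steps 1-4.
int8_t promoted_sshl_sat8(int8_t x, unsigned y) {
  const int64_t shifted = int64_t(x) * 256;           // steps 1-2: extend, SHL by 8
  const int64_t sat = ref_sshl_sat(shifted, y, 16);   // step 3: saturate at i16
  // Step 4: ASHR by 8, written as floor division by 256.
  const int64_t res = sat >= 0 ? sat / 256 : -((-sat + 255) / 256);
  return int8_t(res);
}

// Exhaustively compares the promoted forms against the i8 reference.
bool promotion_matches_reference() {
  for (int x = 0; x < 256; ++x) {
    for (unsigned y = 0; y < 8; ++y) {
      if (promoted_ushl_sat8(uint8_t(x), y) != ref_ushl_sat(uint64_t(x), y, 8))
        return false;
      const int sx = x - 128;
      if (promoted_sshl_sat8(int8_t(sx), y) != ref_sshl_sat(sx, y, 8))
        return false;
    }
  }
  return true;
}
```

Only the first operand is pre-shifted in this sketch, which matches the `if (!IsShift)` guard on the second `ISD::SHL` in the diff: the shift amount itself must keep its original value.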

SDValue DAGTypeLegalizer::PromoteIntRes_MULFIX(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_MULFIX(SDNode *N) {
		arsenm: This code is exactly the same as for ADD/SUB sat and should be shared.
		ebevhan (author): The code is not exactly the same, but some of it is. I could try factoring it in. I find that code that does a lot is a bit harder to read, though.
		lebedev.ri: While i'm usually against duplication, i'm not very sure that in this case it is better to deduplicate.
		ebevhan (author): I deduplicated anyway. I don't think it ended up too bad.
// Can just promote the operands then continue with operation.		// Can just promote the operands then continue with operation.
SDLoc dl(N);		SDLoc dl(N);
SDValue Op1Promoted, Op2Promoted;		SDValue Op1Promoted, Op2Promoted;
bool Signed =		bool Signed =
N->getOpcode() == ISD::SMULFIX || N->getOpcode() == ISD::SMULFIXSAT;		N->getOpcode() == ISD::SMULFIX || N->getOpcode() == ISD::SMULFIXSAT;
bool Saturating =		bool Saturating =
N->getOpcode() == ISD::SMULFIXSAT || N->getOpcode() == ISD::UMULFIXSAT;		N->getOpcode() == ISD::SMULFIXSAT || N->getOpcode() == ISD::UMULFIXSAT;
if (Signed) {		if (Signed) {
Op1Promoted = SExtPromotedInteger(N->getOperand(0));		Op1Promoted = SExtPromotedInteger(N->getOperand(0));
Op2Promoted = SExtPromotedInteger(N->getOperand(1));		Op2Promoted = SExtPromotedInteger(N->getOperand(1));
} else {		} else {
Op1Promoted = ZExtPromotedInteger(N->getOperand(0));		Op1Promoted = ZExtPromotedInteger(N->getOperand(0));
Op2Promoted = ZExtPromotedInteger(N->getOperand(1));		Op2Promoted = ZExtPromotedInteger(N->getOperand(1));
		lebedev.ri: Assert that we only ever get `ISD::USHLSAT`/`ISD::SSHLSAT`?
}		}
EVT OldType = N->getOperand(0).getValueType();		EVT OldType = N->getOperand(0).getValueType();
EVT PromotedType = Op1Promoted.getValueType();		EVT PromotedType = Op1Promoted.getValueType();
unsigned DiffSize =		unsigned DiffSize =
PromotedType.getScalarSizeInBits() - OldType.getScalarSizeInBits();		PromotedType.getScalarSizeInBits() - OldType.getScalarSizeInBits();

if (Saturating) {		if (Saturating) {
// Promoting the operand and result values changes the saturation width,		// Promoting the operand and result values changes the saturation width,
// which extends the values that we clamp to on saturation. This could be		// which extends the values that we clamp to on saturation. This could be
		lebedev.ri: Actually, why do we need to signext, or even zeroext it? As the comment before function notes, we want anyext, we don't care about those new high bits, because we are immediately going to shift them out.

		----------------------------------------
		Name: promote ushl
		  %r = ushl_sat i8 %x, %y
		  ret i8 %r
		=>
		  %x_wide = zext i8 %x to i16
		  %y_wide = zext i8 %y to i16
		  %t0 = shl i16 %x_wide, 8
		  %t1 = ushl_sat i16 %t0, %y_wide
		  %t2 = lshr i16 %t1, 8
		  %r = trunc i16 %t2 to i8
		  ret i8 %r

		Done: 1
		Transformation seems to be correct!

		----------------------------------------
		Name: promote sshl
		  %r = sshl_sat i8 %x, %y
		  ret i8 %r
		=>
		  %x_wide = zext i8 %x to i16
		  %y_wide = zext i8 %y to i16
		  %t0 = shl i16 %x_wide, 8
		  %t1 = sshl_sat i16 %t0, %y_wide
		  %t2 = ashr i16 %t1, 8
		  %r = trunc i16 %t2 to i8
		  ret i8 %r

		Done: 1
		Transformation seems to be correct!

		So i think you want

		  SDValue Op1Promoted = GetPromotedInteger(Op1);
		  SDValue Op1Promoted = GetPromotedInteger(Op2);
		  unsigned ShiftOp = Opcode == ISD::USHLSAT ? ISD::SRL : ISD::SRA;

		and maybe get rid of `ShiftOp` variable, or sink it closer to use.
		ebevhan (author): Ah, that's true. I grabbed it from the ADDSUBSAT promotion without thinking. That needs the proper extension due to the min/max expansion, I think.
		lebedev.ri: Err, i was half-right i think. I guess we actually need to zext the shift amount.
// resolved by shifting one of the operands the same amount, which would		// resolved by shifting one of the operands the same amount, which would
		lebedev.ri: `Op2Promoted = ZExtPromotedInteger(Op2);`
// also shift the result we compare against, then shifting back.		// also shift the result we compare against, then shifting back.
EVT ShiftTy = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());		EVT ShiftTy = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());
Op1Promoted = DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted,		Op1Promoted = DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted,
DAG.getConstant(DiffSize, dl, ShiftTy));		DAG.getConstant(DiffSize, dl, ShiftTy));
SDValue Result = DAG.getNode(N->getOpcode(), dl, PromotedType, Op1Promoted,		SDValue Result = DAG.getNode(N->getOpcode(), dl, PromotedType, Op1Promoted,
Op2Promoted, N->getOperand(2));		Op2Promoted, N->getOperand(2));
unsigned ShiftOp = Signed ? ISD::SRA : ISD::SRL;		unsigned ShiftOp = Signed ? ISD::SRA : ISD::SRL;
return DAG.getNode(ShiftOp, dl, PromotedType, Result,		return DAG.getNode(ShiftOp, dl, PromotedType, Result,
▲ Show 20 Lines • Show All 1,205 Lines • ▼ Show 20 Lines	#endif
case ISD::UMULO:		case ISD::UMULO:
case ISD::SMULO: ExpandIntRes_XMULO(N, Lo, Hi); break;		case ISD::SMULO: ExpandIntRes_XMULO(N, Lo, Hi); break;

case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: ExpandIntRes_ADDSUBSAT(N, Lo, Hi); break;		case ISD::USUBSAT: ExpandIntRes_ADDSUBSAT(N, Lo, Hi); break;

		case ISD::SSHLSAT:
		case ISD::USHLSAT: ExpandIntRes_SHLSAT(N, Lo, Hi); break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
case ISD::UMULFIXSAT: ExpandIntRes_MULFIX(N, Lo, Hi); break;		case ISD::UMULFIXSAT: ExpandIntRes_MULFIX(N, Lo, Hi); break;

case ISD::SDIVFIX:		case ISD::SDIVFIX:
case ISD::SDIVFIXSAT:		case ISD::SDIVFIXSAT:
case ISD::UDIVFIX:		case ISD::UDIVFIX:
▲ Show 20 Lines • Show All 1,106 Lines • ▼ Show 20 Lines
}		}

void DAGTypeLegalizer::ExpandIntRes_ADDSUBSAT(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::ExpandIntRes_ADDSUBSAT(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
SDValue Result = TLI.expandAddSubSat(N, DAG);		SDValue Result = TLI.expandAddSubSat(N, DAG);
SplitInteger(Result, Lo, Hi);		SplitInteger(Result, Lo, Hi);
}		}

		void DAGTypeLegalizer::ExpandIntRes_SHLSAT(SDNode *N, SDValue &Lo,
		SDValue &Hi) {
		SDValue Result = TLI.expandShlSat(N, DAG);
		SplitInteger(Result, Lo, Hi);
		}

/// This performs an expansion of the integer result for a fixed point		/// This performs an expansion of the integer result for a fixed point
/// multiplication. The default expansion performs rounding down towards		/// multiplication. The default expansion performs rounding down towards
/// negative infinity, though targets that do care about rounding should specify		/// negative infinity, though targets that do care about rounding should specify
/// a target hook for rounding and provide their own expansion or lowering of		/// a target hook for rounding and provide their own expansion or lowering of
/// fixed point multiplication to be consistent with rounding.		/// fixed point multiplication to be consistent with rounding.
void DAGTypeLegalizer::ExpandIntRes_MULFIX(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::ExpandIntRes_MULFIX(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
SDLoc dl(N);		SDLoc dl(N);
▲ Show 20 Lines • Show All 1,455 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

Show First 20 Lines • Show All 335 Lines • ▼ Show 20 Lines	private:
SDValue PromoteIntRes_SRL(SDNode *N);		SDValue PromoteIntRes_SRL(SDNode *N);
SDValue PromoteIntRes_TRUNCATE(SDNode *N);		SDValue PromoteIntRes_TRUNCATE(SDNode *N);
SDValue PromoteIntRes_UADDSUBO(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_UADDSUBO(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_ADDSUBCARRY(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_ADDSUBCARRY(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_UNDEF(SDNode *N);		SDValue PromoteIntRes_UNDEF(SDNode *N);
SDValue PromoteIntRes_VAARG(SDNode *N);		SDValue PromoteIntRes_VAARG(SDNode *N);
SDValue PromoteIntRes_VSCALE(SDNode *N);		SDValue PromoteIntRes_VSCALE(SDNode *N);
SDValue PromoteIntRes_XMULO(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_XMULO(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_ADDSUBSAT(SDNode *N);		SDValue PromoteIntRes_ADDSUBSHLSAT(SDNode *N);
SDValue PromoteIntRes_MULFIX(SDNode *N);		SDValue PromoteIntRes_MULFIX(SDNode *N);
SDValue PromoteIntRes_DIVFIX(SDNode *N);		SDValue PromoteIntRes_DIVFIX(SDNode *N);
SDValue PromoteIntRes_FLT_ROUNDS(SDNode *N);		SDValue PromoteIntRes_FLT_ROUNDS(SDNode *N);
SDValue PromoteIntRes_VECREDUCE(SDNode *N);		SDValue PromoteIntRes_VECREDUCE(SDNode *N);
SDValue PromoteIntRes_ABS(SDNode *N);		SDValue PromoteIntRes_ABS(SDNode *N);

// Integer Operand Promotion.		// Integer Operand Promotion.
bool PromoteIntegerOperand(SDNode *N, unsigned OpNo);		bool PromoteIntegerOperand(SDNode *N, unsigned OpNo);
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	private:
void ExpandIntRes_Shift (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_Shift (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_MINMAX (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_MINMAX (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_SADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_SADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_UADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_UADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_XMULO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_XMULO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_ADDSUBSAT (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_ADDSUBSAT (SDNode *N, SDValue &Lo, SDValue &Hi);
		void ExpandIntRes_SHLSAT (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_MULFIX (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_MULFIX (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_DIVFIX (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_DIVFIX (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_ATOMIC_LOAD (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_ATOMIC_LOAD (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_VECREDUCE (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_VECREDUCE (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandShiftByConstant(SDNode *N, const APInt &Amt,		void ExpandShiftByConstant(SDNode *N, const APInt &Amt,
SDValue &Lo, SDValue &Hi);		SDValue &Lo, SDValue &Hi);
▲ Show 20 Lines • Show All 576 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

Show First 20 Lines • Show All 448 Lines • ▼ Show 20 Lines	#include "llvm/IR/ConstrainedOps.def"
case ISD::USUBO:		case ISD::USUBO:
case ISD::SMULO:		case ISD::SMULO:
case ISD::UMULO:		case ISD::UMULO:
case ISD::FCANONICALIZE:		case ISD::FCANONICALIZE:
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT:
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
case ISD::UMULFIXSAT:		case ISD::UMULFIXSAT:
case ISD::SDIVFIX:		case ISD::SDIVFIX:
case ISD::SDIVFIXSAT:		case ISD::SDIVFIXSAT:
▲ Show 20 Lines • Show All 998 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

Show First 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	#endif
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:

case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT:

case ISD::FPOW:		case ISD::FPOW:
case ISD::FREM:		case ISD::FREM:
case ISD::FSUB:		case ISD::FSUB:
case ISD::MUL:		case ISD::MUL:
case ISD::OR:		case ISD::OR:
case ISD::SDIV:		case ISD::SDIV:
case ISD::SREM:		case ISD::SREM:
▲ Show 20 Lines • Show All 797 Lines • ▼ Show 20 Lines	#endif
case ISD::SMIN:		case ISD::SMIN:
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT:
SplitVecRes_BinOp(N, Lo, Hi);		SplitVecRes_BinOp(N, Lo, Hi);
break;		break;
case ISD::FMA:		case ISD::FMA:
SplitVecRes_TernaryOp(N, Lo, Hi);		SplitVecRes_TernaryOp(N, Lo, Hi);
break;		break;

#define DAG_INSTRUCTION(NAME, NARG, ROUND_MODE, INTRINSIC, DAGN) \		#define DAG_INSTRUCTION(NAME, NARG, ROUND_MODE, INTRINSIC, DAGN) \
case ISD::STRICT_##DAGN:		case ISD::STRICT_##DAGN:
▲ Show 20 Lines • Show All 1,834 Lines • ▼ Show 20 Lines	#endif
case ISD::SMIN:		case ISD::SMIN:
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
		case ISD::SSHLSAT:
		case ISD::USHLSAT:
Res = WidenVecRes_Binary(N);		Res = WidenVecRes_Binary(N);
break;		break;

case ISD::FADD:		case ISD::FADD:
case ISD::FMUL:		case ISD::FMUL:
case ISD::FPOW:		case ISD::FPOW:
case ISD::FSUB:		case ISD::FSUB:
case ISD::FDIV:		case ISD::FDIV:
▲ Show 20 Lines • Show All 2,471 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

Show First 20 Lines • Show All 6,326 Lines • ▼ Show 20 Lines	case Intrinsic::ssub_sat: {
return;		return;
}		}
case Intrinsic::usub_sat: {		case Intrinsic::usub_sat: {
SDValue Op1 = getValue(I.getArgOperand(0));		SDValue Op1 = getValue(I.getArgOperand(0));
SDValue Op2 = getValue(I.getArgOperand(1));		SDValue Op2 = getValue(I.getArgOperand(1));
setValue(&I, DAG.getNode(ISD::USUBSAT, sdl, Op1.getValueType(), Op1, Op2));		setValue(&I, DAG.getNode(ISD::USUBSAT, sdl, Op1.getValueType(), Op1, Op2));
return;		return;
}		}
		case Intrinsic::sshl_sat: {
		SDValue Op1 = getValue(I.getArgOperand(0));
		SDValue Op2 = getValue(I.getArgOperand(1));
		setValue(&I, DAG.getNode(ISD::SSHLSAT, sdl, Op1.getValueType(), Op1, Op2));
		return;
		}
		case Intrinsic::ushl_sat: {
		SDValue Op1 = getValue(I.getArgOperand(0));
		SDValue Op2 = getValue(I.getArgOperand(1));
		setValue(&I, DAG.getNode(ISD::USHLSAT, sdl, Op1.getValueType(), Op1, Op2));
		return;
		}
case Intrinsic::smul_fix:		case Intrinsic::smul_fix:
case Intrinsic::umul_fix:		case Intrinsic::umul_fix:
case Intrinsic::smul_fix_sat:		case Intrinsic::smul_fix_sat:
case Intrinsic::umul_fix_sat: {		case Intrinsic::umul_fix_sat: {
SDValue Op1 = getValue(I.getArgOperand(0));		SDValue Op1 = getValue(I.getArgOperand(0));
SDValue Op2 = getValue(I.getArgOperand(1));		SDValue Op2 = getValue(I.getArgOperand(1));
SDValue Op3 = getValue(I.getArgOperand(2));		SDValue Op3 = getValue(I.getArgOperand(2));
setValue(&I, DAG.getNode(FixedPointIntrinsicToOpcode(Intrinsic), sdl,		setValue(&I, DAG.getNode(FixedPointIntrinsicToOpcode(Intrinsic), sdl,
▲ Show 20 Lines • Show All 4,377 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 304 Lines • ▼ Show 20 Lines	#endif
case ISD::SHL_PARTS: return "shl_parts";		case ISD::SHL_PARTS: return "shl_parts";
case ISD::SRA_PARTS: return "sra_parts";		case ISD::SRA_PARTS: return "sra_parts";
case ISD::SRL_PARTS: return "srl_parts";		case ISD::SRL_PARTS: return "srl_parts";

case ISD::SADDSAT: return "saddsat";		case ISD::SADDSAT: return "saddsat";
case ISD::UADDSAT: return "uaddsat";		case ISD::UADDSAT: return "uaddsat";
case ISD::SSUBSAT: return "ssubsat";		case ISD::SSUBSAT: return "ssubsat";
case ISD::USUBSAT: return "usubsat";		case ISD::USUBSAT: return "usubsat";
		case ISD::SSHLSAT: return "sshlsat";
		case ISD::USHLSAT: return "ushlsat";

case ISD::SMULFIX: return "smulfix";		case ISD::SMULFIX: return "smulfix";
case ISD::SMULFIXSAT: return "smulfixsat";		case ISD::SMULFIXSAT: return "smulfixsat";
case ISD::UMULFIX: return "umulfix";		case ISD::UMULFIX: return "umulfix";
case ISD::UMULFIXSAT: return "umulfixsat";		case ISD::UMULFIXSAT: return "umulfixsat";

case ISD::SDIVFIX: return "sdivfix";		case ISD::SDIVFIX: return "sdivfix";
case ISD::SDIVFIXSAT: return "sdivfixsat";		case ISD::SDIVFIXSAT: return "sdivfixsat";
▲ Show 20 Lines • Show All 673 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

Show First 20 Lines • Show All 7,379 Lines • ▼ Show 20 Lines	if (Opcode == ISD::UADDSAT) {
SDValue SatMin = DAG.getConstant(MinVal, dl, VT);		SDValue SatMin = DAG.getConstant(MinVal, dl, VT);
SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);		SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);
SDValue SumNeg = DAG.getSetCC(dl, BoolVT, SumDiff, Zero, ISD::SETLT);		SDValue SumNeg = DAG.getSetCC(dl, BoolVT, SumDiff, Zero, ISD::SETLT);
Result = DAG.getSelect(dl, VT, SumNeg, SatMax, SatMin);		Result = DAG.getSelect(dl, VT, SumNeg, SatMax, SatMin);
return DAG.getSelect(dl, VT, Overflow, Result, SumDiff);		return DAG.getSelect(dl, VT, Overflow, Result, SumDiff);
}		}
}		}

		SDValue TargetLowering::expandShlSat(SDNode *Node, SelectionDAG &DAG) const {
		unsigned Opcode = Node->getOpcode();
		bool IsSigned = Opcode == ISD::SSHLSAT;
		SDValue LHS = Node->getOperand(0);
		SDValue RHS = Node->getOperand(1);
		EVT VT = LHS.getValueType();
		SDLoc dl(Node);

		assert((Node->getOpcode() == ISD::SSHLSAT ||
		Node->getOpcode() == ISD::USHLSAT) &&
		"Expected a SHLSAT opcode");
		assert(VT == RHS.getValueType() && "Expected operands to be the same type");
		assert(VT.isInteger() && "Expected operands to be integers");

		// If LHS != (LHS << RHS) >> RHS, we have overflow and must saturate.

		unsigned BW = VT.getScalarSizeInBits();
		SDValue Result = DAG.getNode(ISD::SHL, dl, VT, LHS, RHS);
		SDValue Orig =
		DAG.getNode(IsSigned ? ISD::SRA : ISD::SRL, dl, VT, Result, RHS);
		lebedev.ri: Have you checked if naive `x != ((x << y) u/s>> y)` results in worse lowering?
		ebevhan (author): The CTLZ approach was the one that popped into my head first, so I went with that. But it does turn out that yours works a bit better, at least for sshl.sat, so I swapped it out.

		SDValue SatVal;
		if (IsSigned) {
		SDValue SatMin = DAG.getConstant(APInt::getSignedMinValue(BW), dl, VT);
		SDValue SatMax = DAG.getConstant(APInt::getSignedMaxValue(BW), dl, VT);
		SatVal = DAG.getSelectCC(dl, LHS, DAG.getConstant(0, dl, VT),
		SatMin, SatMax, ISD::SETLT);
		} else {
		SatVal = DAG.getConstant(APInt::getMaxValue(BW), dl, VT);
		}
		Result = DAG.getSelectCC(dl, LHS, Orig, SatVal, Result, ISD::SETNE);

		return Result;
		lebedev.ri: Why not just change predicate to `ISD::SETUGE`?
		}
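
As an illustration of the check that `expandShlSat` builds ("If LHS != (LHS << RHS) >> RHS, we have overflow and must saturate"), the following is a scalar model on 8-bit values. It is a sketch under stated assumptions rather than the lowering itself: the function names (`to_signed8`, `ashr`, `expand_ushl_sat8`, `expand_sshl_sat8`, `expansion_matches_reference`) are hypothetical, the width is fixed at 8 so the check can be exhaustive, and shift amounts are assumed to be smaller than the bit width.

```cpp
#include <cassert>
#include <cstdint>

// Reinterprets an 8-bit pattern as a signed value without relying on
// implementation-defined narrowing conversions.
int to_signed8(unsigned bits) {
  return bits < 128 ? int(bits) : int(bits) - 256;
}

// Arithmetic shift right, written as floor division by 2^amt.
int ashr(int v, unsigned amt) {
  const int d = 1 << amt;
  return v >= 0 ? v / d : -((-v + d - 1) / d);
}

// Unsigned case: shift left, shift back logically, compare with the input.
uint8_t expand_ushl_sat8(uint8_t lhs, unsigned rhs) {
  const unsigned result = (unsigned(lhs) << rhs) & 0xFFu;  // models ISD::SHL
  const unsigned orig = result >> rhs;                     // models ISD::SRL
  return uint8_t(lhs != orig ? 0xFFu : result);            // saturate to max
}

// Signed case: shift left, shift back arithmetically, compare with the input.
// On overflow the saturation value follows the sign of the input.
int8_t expand_sshl_sat8(int8_t lhs, unsigned rhs) {
  const unsigned bits = (unsigned(uint8_t(lhs)) << rhs) & 0xFFu;
  const int result = to_signed8(bits);   // models ISD::SHL
  const int orig = ashr(result, rhs);    // models ISD::SRA
  const int sat = lhs < 0 ? -128 : 127;  // SatMin if LHS < 0, else SatMax
  return int8_t(lhs != orig ? sat : result);
}

// Exhaustively compares both models against clamped wide arithmetic.
bool expansion_matches_reference() {
  for (int x = -128; x < 128; ++x) {
    for (unsigned y = 0; y < 8; ++y) {
      const int swide = x * (1 << y);
      const int sref = swide > 127 ? 127 : (swide < -128 ? -128 : swide);
      if (expand_sshl_sat8(int8_t(x), y) != sref)
        return false;
      const unsigned ux = unsigned(x + 128);
      const unsigned uwide = ux << y;
      const unsigned uref = uwide > 255 ? 255 : uwide;
      if (expand_ushl_sat8(uint8_t(ux), y) != uref)
        return false;
    }
  }
  return true;
}
```

The shift-back comparison works because the round trip reproduces the input exactly when no significant bits were lost, so a mismatch is equivalent to overflow for in-range shift amounts.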

SDValue		SDValue
TargetLowering::expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const {		TargetLowering::expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const {
assert((Node->getOpcode() == ISD::SMULFIX ||		assert((Node->getOpcode() == ISD::SMULFIX ||
Node->getOpcode() == ISD::UMULFIX ||		Node->getOpcode() == ISD::UMULFIX ||
Node->getOpcode() == ISD::SMULFIXSAT ||		Node->getOpcode() == ISD::SMULFIXSAT ||
Node->getOpcode() == ISD::UMULFIXSAT) &&		Node->getOpcode() == ISD::UMULFIXSAT) &&
"Expected a fixed point multiplication opcode");		"Expected a fixed point multiplication opcode");

▲ Show 20 Lines • Show All 512 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 651 Lines • ▼ Show 20 Lines	for (MVT VT : MVT::all_valuetypes()) {
setOperationAction(ISD::UMAX, VT, Expand);		setOperationAction(ISD::UMAX, VT, Expand);
setOperationAction(ISD::ABS, VT, Expand);		setOperationAction(ISD::ABS, VT, Expand);
setOperationAction(ISD::FSHL, VT, Expand);		setOperationAction(ISD::FSHL, VT, Expand);
setOperationAction(ISD::FSHR, VT, Expand);		setOperationAction(ISD::FSHR, VT, Expand);
setOperationAction(ISD::SADDSAT, VT, Expand);		setOperationAction(ISD::SADDSAT, VT, Expand);
setOperationAction(ISD::UADDSAT, VT, Expand);		setOperationAction(ISD::UADDSAT, VT, Expand);
setOperationAction(ISD::SSUBSAT, VT, Expand);		setOperationAction(ISD::SSUBSAT, VT, Expand);
setOperationAction(ISD::USUBSAT, VT, Expand);		setOperationAction(ISD::USUBSAT, VT, Expand);
		setOperationAction(ISD::SSHLSAT, VT, Expand);
		setOperationAction(ISD::USHLSAT, VT, Expand);
setOperationAction(ISD::SMULFIX, VT, Expand);		setOperationAction(ISD::SMULFIX, VT, Expand);
setOperationAction(ISD::SMULFIXSAT, VT, Expand);		setOperationAction(ISD::SMULFIXSAT, VT, Expand);
setOperationAction(ISD::UMULFIX, VT, Expand);		setOperationAction(ISD::UMULFIX, VT, Expand);
setOperationAction(ISD::UMULFIXSAT, VT, Expand);		setOperationAction(ISD::UMULFIXSAT, VT, Expand);
setOperationAction(ISD::SDIVFIX, VT, Expand);		setOperationAction(ISD::SDIVFIX, VT, Expand);
setOperationAction(ISD::SDIVFIXSAT, VT, Expand);		setOperationAction(ISD::SDIVFIXSAT, VT, Expand);
setOperationAction(ISD::UDIVFIX, VT, Expand);		setOperationAction(ISD::UDIVFIX, VT, Expand);
setOperationAction(ISD::UDIVFIXSAT, VT, Expand);		setOperationAction(ISD::UDIVFIXSAT, VT, Expand);
▲ Show 20 Lines • Show All 1,528 Lines • Show Last 20 Lines

llvm/lib/IR/Verifier.cpp

Show First 20 Lines • Show All 4,980 Lines • ▼ Show 20 Lines	if (isa<CallInst>(Call)) {
"of the value computed by experimental_deoptimize");		"of the value computed by experimental_deoptimize");
}		}

break;		break;
}		}
case Intrinsic::sadd_sat:		case Intrinsic::sadd_sat:
case Intrinsic::uadd_sat:		case Intrinsic::uadd_sat:
case Intrinsic::ssub_sat:		case Intrinsic::ssub_sat:
case Intrinsic::usub_sat: {		case Intrinsic::usub_sat:
		case Intrinsic::sshl_sat:
		case Intrinsic::ushl_sat: {
Value *Op1 = Call.getArgOperand(0);		Value *Op1 = Call.getArgOperand(0);
Value *Op2 = Call.getArgOperand(1);		Value *Op2 = Call.getArgOperand(1);
Assert(Op1->getType()->isIntOrIntVectorTy(),		Assert(Op1->getType()->isIntOrIntVectorTy(),
"first operand of [us][add|sub]_sat must be an int type or vector "		"first operand of [us][add|sub|shl]_sat must be an int type or "
"of ints");		"vector of ints");
Assert(Op2->getType()->isIntOrIntVectorTy(),		Assert(Op2->getType()->isIntOrIntVectorTy(),
"second operand of [us][add|sub]_sat must be an int type or vector "		"second operand of [us][add|sub|shl]_sat must be an int type or "
"of ints");		"vector of ints");
break;		break;
}		}
case Intrinsic::smul_fix:		case Intrinsic::smul_fix:
case Intrinsic::smul_fix_sat:		case Intrinsic::smul_fix_sat:
case Intrinsic::umul_fix:		case Intrinsic::umul_fix:
case Intrinsic::umul_fix_sat:		case Intrinsic::umul_fix_sat:
case Intrinsic::sdiv_fix:		case Intrinsic::sdiv_fix:
case Intrinsic::sdiv_fix_sat:		case Intrinsic::sdiv_fix_sat:
case Intrinsic::udiv_fix:		case Intrinsic::udiv_fix:
case Intrinsic::udiv_fix_sat: {		case Intrinsic::udiv_fix_sat: {
		lebedev.ri: I don't think it makes sense to limit these to scalars.
		ebevhan (author): The add.sat and sub.sat intrinsics were given vector operands because they were useful for some of x86's vector instructions. I couldn't see any such operations for shifts, but I can add the vector type support for consistency.
Value *Op1 = Call.getArgOperand(0);		Value *Op1 = Call.getArgOperand(0);
Value *Op2 = Call.getArgOperand(1);		Value *Op2 = Call.getArgOperand(1);
Assert(Op1->getType()->isIntOrIntVectorTy(),		Assert(Op1->getType()->isIntOrIntVectorTy(),
"first operand of [us][mul|div]_fix[_sat] must be an int type or "		"first operand of [us][mul|div]_fix[_sat] must be an int type or "
"vector of ints");		"vector of ints");
Assert(Op2->getType()->isIntOrIntVectorTy(),		Assert(Op2->getType()->isIntOrIntVectorTy(),
"second operand of [us][mul|div]_fix[_sat] must be an int type or "		"second operand of [us][mul|div]_fix[_sat] must be an int type or "
"vector of ints");		"vector of ints");
▲ Show 20 Lines • Show All 921 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp

Show First 20 Lines • Show All 1,287 Lines • ▼ Show 20 Lines	Shifts.widenScalarIf(
const LLT AmountTy = Query.Types[1];		const LLT AmountTy = Query.Types[1];
return ValTy.getSizeInBits() <= 16 &&		return ValTy.getSizeInBits() <= 16 &&
AmountTy.getSizeInBits() < 16;		AmountTy.getSizeInBits() < 16;
}, changeTo(1, S16));		}, changeTo(1, S16));
Shifts.maxScalarIf(typeIs(0, S16), 1, S16);		Shifts.maxScalarIf(typeIs(0, S16), 1, S16);
Shifts.clampScalar(1, S32, S32);		Shifts.clampScalar(1, S32, S32);
Shifts.clampScalar(0, S16, S64);		Shifts.clampScalar(0, S16, S64);
Shifts.widenScalarToNextPow2(0, 16);		Shifts.widenScalarToNextPow2(0, 16);

		getActionDefinitionsBuilder({G_SSHLSAT, G_USHLSAT})
		.minScalar(0, S16)
		.scalarize(0)
		.lower();
} else {		} else {
// Make sure we legalize the shift amount type first, as the general		// Make sure we legalize the shift amount type first, as the general
// expansion for the shifted type will produce much worse code if it hasn't		// expansion for the shifted type will produce much worse code if it hasn't
// been truncated already.		// been truncated already.
Shifts.clampScalar(1, S32, S32);		Shifts.clampScalar(1, S32, S32);
Shifts.clampScalar(0, S32, S64);		Shifts.clampScalar(0, S32, S64);
Shifts.widenScalarToNextPow2(0, 32);		Shifts.widenScalarToNextPow2(0, 32);

		getActionDefinitionsBuilder({G_SSHLSAT, G_USHLSAT})
		.minScalar(0, S32)
		.scalarize(0)
		.lower();
}		}
Shifts.scalarize(0);		Shifts.scalarize(0);

for (unsigned Op : {G_EXTRACT_VECTOR_ELT, G_INSERT_VECTOR_ELT}) {		for (unsigned Op : {G_EXTRACT_VECTOR_ELT, G_INSERT_VECTOR_ELT}) {
unsigned VecTypeIdx = Op == G_EXTRACT_VECTOR_ELT ? 1 : 0;		unsigned VecTypeIdx = Op == G_EXTRACT_VECTOR_ELT ? 1 : 0;
unsigned EltTypeIdx = Op == G_EXTRACT_VECTOR_ELT ? 0 : 1;		unsigned EltTypeIdx = Op == G_EXTRACT_VECTOR_ELT ? 0 : 1;
unsigned IdxTypeIdx = 2;		unsigned IdxTypeIdx = 2;

▲ Show 20 Lines • Show All 3,198 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

	Show First 20 Lines • Show All 338 Lines • ▼ Show 20 Lines
	# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
	# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
	# DEBUG-NEXT: G_USUBSAT (opcode {{[0-9]+}}): 1 type index, 0 imm indices			# DEBUG-NEXT: G_USUBSAT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
	# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
	# DEBUG-NEXT: G_SSUBSAT (opcode {{[0-9]+}}): 1 type index, 0 imm indices			# DEBUG-NEXT: G_SSUBSAT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
	# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_USHLSAT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_SSHLSAT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
# DEBUG-NEXT: G_SMULFIX (opcode {{[0-9]+}}): 1 type index, 1 imm index
# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
# DEBUG-NEXT: G_UMULFIX (opcode {{[0-9]+}}): 1 type index, 1 imm index
# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
# DEBUG-NEXT: G_SMULFIXSAT (opcode {{[0-9]+}}): 1 type index, 1 imm index
# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined

llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-sat.ll

define <2 x i32> @ssubsat_v2i32(<2 x i32> %lhs, <2 x i32> %rhs) {
; CHECK: $vgpr0 = COPY [[UV]](s32)
; CHECK: $vgpr1 = COPY [[UV1]](s32)
; CHECK: [[COPY5:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY4]]
; CHECK: S_SETPC_B64_return [[COPY5]], implicit $vgpr0, implicit $vgpr1
%res = call <2 x i32> @llvm.ssub.sat.v2i32(<2 x i32> %lhs, <2 x i32> %rhs)
ret <2 x i32> %res
}
declare <2 x i32> @llvm.ssub.sat.v2i32(<2 x i32>, <2 x i32>)

		define i16 @ushlsat_i16(i16 %lhs, i16 %rhs) {
		; CHECK-LABEL: name: ushlsat_i16
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
		; CHECK: [[COPY2:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[TRUNC]], [[TRUNC1]]
		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[USHLSAT]](s16)
		; CHECK: $vgpr0 = COPY [[ANYEXT]](s32)
		; CHECK: [[COPY3:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY2]]
		; CHECK: S_SETPC_B64_return [[COPY3]], implicit $vgpr0
		%res = call i16 @llvm.ushl.sat.i16(i16 %lhs, i16 %rhs)
		ret i16 %res
		}
		declare i16 @llvm.ushl.sat.i16(i16, i16)

		define i32 @ushlsat_i32(i32 %lhs, i32 %rhs) {
		; CHECK-LABEL: name: ushlsat_i32
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[USHLSAT:%[0-9]+]]:_(s32) = G_USHLSAT [[COPY]], [[COPY1]]
		; CHECK: $vgpr0 = COPY [[USHLSAT]](s32)
		; CHECK: [[COPY3:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY2]]
		; CHECK: S_SETPC_B64_return [[COPY3]], implicit $vgpr0
		%res = call i32 @llvm.ushl.sat.i32(i32 %lhs, i32 %rhs)
		ret i32 %res
		}
		declare i32 @llvm.ushl.sat.i32(i32, i32)

		define i64 @ushlsat_i64(i64 %lhs, i64 %rhs) {
		; CHECK-LABEL: name: ushlsat_i64
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $vgpr2, $vgpr3, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY $vgpr2
		; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY $vgpr3
		; CHECK: [[COPY4:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[MV:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[COPY]](s32), [[COPY1]](s32)
		; CHECK: [[MV1:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[COPY2]](s32), [[COPY3]](s32)
		; CHECK: [[USHLSAT:%[0-9]+]]:_(s64) = G_USHLSAT [[MV]], [[MV1]]
		; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[USHLSAT]](s64)
		; CHECK: $vgpr0 = COPY [[UV]](s32)
		; CHECK: $vgpr1 = COPY [[UV1]](s32)
		; CHECK: [[COPY5:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY4]]
		; CHECK: S_SETPC_B64_return [[COPY5]], implicit $vgpr0, implicit $vgpr1
		%res = call i64 @llvm.ushl.sat.i64(i64 %lhs, i64 %rhs)
		ret i64 %res
		}
		declare i64 @llvm.ushl.sat.i64(i64, i64)

		define <2 x i32> @ushlsat_v2i32(<2 x i32> %lhs, <2 x i32> %rhs) {
		; CHECK-LABEL: name: ushlsat_v2i32
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $vgpr2, $vgpr3, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY $vgpr2
		; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY $vgpr3
		; CHECK: [[COPY4:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY]](s32), [[COPY1]](s32)
		; CHECK: [[BUILD_VECTOR1:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY2]](s32), [[COPY3]](s32)
		; CHECK: [[USHLSAT:%[0-9]+]]:_(<2 x s32>) = G_USHLSAT [[BUILD_VECTOR]], [[BUILD_VECTOR1]]
		; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[USHLSAT]](<2 x s32>)
		; CHECK: $vgpr0 = COPY [[UV]](s32)
		; CHECK: $vgpr1 = COPY [[UV1]](s32)
		; CHECK: [[COPY5:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY4]]
		; CHECK: S_SETPC_B64_return [[COPY5]], implicit $vgpr0, implicit $vgpr1
		%res = call <2 x i32> @llvm.ushl.sat.v2i32(<2 x i32> %lhs, <2 x i32> %rhs)
		ret <2 x i32> %res
		}
		declare <2 x i32> @llvm.ushl.sat.v2i32(<2 x i32>, <2 x i32>)

		define i16 @sshlsat_i16(i16 %lhs, i16 %rhs) {
		; CHECK-LABEL: name: sshlsat_i16
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
		; CHECK: [[COPY2:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[TRUNC]], [[TRUNC1]]
		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[SSHLSAT]](s16)
		; CHECK: $vgpr0 = COPY [[ANYEXT]](s32)
		; CHECK: [[COPY3:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY2]]
		; CHECK: S_SETPC_B64_return [[COPY3]], implicit $vgpr0
		%res = call i16 @llvm.sshl.sat.i16(i16 %lhs, i16 %rhs)
		ret i16 %res
		}
		declare i16 @llvm.sshl.sat.i16(i16, i16)

		define i32 @sshlsat_i32(i32 %lhs, i32 %rhs) {
		; CHECK-LABEL: name: sshlsat_i32
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[SSHLSAT:%[0-9]+]]:_(s32) = G_SSHLSAT [[COPY]], [[COPY1]]
		; CHECK: $vgpr0 = COPY [[SSHLSAT]](s32)
		; CHECK: [[COPY3:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY2]]
		; CHECK: S_SETPC_B64_return [[COPY3]], implicit $vgpr0
		%res = call i32 @llvm.sshl.sat.i32(i32 %lhs, i32 %rhs)
		ret i32 %res
		}
		declare i32 @llvm.sshl.sat.i32(i32, i32)

		define i64 @sshlsat_i64(i64 %lhs, i64 %rhs) {
		; CHECK-LABEL: name: sshlsat_i64
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $vgpr2, $vgpr3, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY $vgpr2
		; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY $vgpr3
		; CHECK: [[COPY4:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[MV:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[COPY]](s32), [[COPY1]](s32)
		; CHECK: [[MV1:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[COPY2]](s32), [[COPY3]](s32)
		; CHECK: [[SSHLSAT:%[0-9]+]]:_(s64) = G_SSHLSAT [[MV]], [[MV1]]
		; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[SSHLSAT]](s64)
		; CHECK: $vgpr0 = COPY [[UV]](s32)
		; CHECK: $vgpr1 = COPY [[UV1]](s32)
		; CHECK: [[COPY5:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY4]]
		; CHECK: S_SETPC_B64_return [[COPY5]], implicit $vgpr0, implicit $vgpr1
		%res = call i64 @llvm.sshl.sat.i64(i64 %lhs, i64 %rhs)
		ret i64 %res
		}
		declare i64 @llvm.sshl.sat.i64(i64, i64)

		define <2 x i32> @sshlsat_v2i32(<2 x i32> %lhs, <2 x i32> %rhs) {
		; CHECK-LABEL: name: sshlsat_v2i32
		; CHECK: bb.1 (%ir-block.0):
		; CHECK: liveins: $vgpr0, $vgpr1, $vgpr2, $vgpr3, $sgpr30_sgpr31
		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
		; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY $vgpr2
		; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY $vgpr3
		; CHECK: [[COPY4:%[0-9]+]]:sgpr_64 = COPY $sgpr30_sgpr31
		; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY]](s32), [[COPY1]](s32)
		; CHECK: [[BUILD_VECTOR1:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY2]](s32), [[COPY3]](s32)
		; CHECK: [[SSHLSAT:%[0-9]+]]:_(<2 x s32>) = G_SSHLSAT [[BUILD_VECTOR]], [[BUILD_VECTOR1]]
		; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[SSHLSAT]](<2 x s32>)
		; CHECK: $vgpr0 = COPY [[UV]](s32)
		; CHECK: $vgpr1 = COPY [[UV1]](s32)
		; CHECK: [[COPY5:%[0-9]+]]:ccr_sgpr_64 = COPY [[COPY4]]
		; CHECK: S_SETPC_B64_return [[COPY5]], implicit $vgpr0, implicit $vgpr1
		%res = call <2 x i32> @llvm.sshl.sat.v2i32(<2 x i32> %lhs, <2 x i32> %rhs)
		ret <2 x i32> %res
		}
		declare <2 x i32> @llvm.sshl.sat.v2i32(<2 x i32>, <2 x i32>)
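
For readers following the tests above, here is a minimal C++ sketch of what a saturating left shift computes at 16 bits: shift left, and clamp to the largest (or, for the signed form, smallest) representable value if the result does not fit. The function names `ushl_sat16` and `sshl_sat16` are hypothetical and not part of the patch. The sketch assumes the shift amount is smaller than the bit width, and asserts this rather than modelling out-of-range amounts.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical reference model of an unsigned saturating left shift on 16 bits.
// Assumption: amt < 16.
uint16_t ushl_sat16(uint16_t x, unsigned amt) {
  assert(amt < 16);
  // Compute in a wider type so that no bits are lost before the range check.
  const uint32_t wide = static_cast<uint32_t>(x) << amt;
  const uint32_t max = std::numeric_limits<uint16_t>::max();
  if (wide > max)
    return static_cast<uint16_t>(max);
  return static_cast<uint16_t>(wide);
}

// Hypothetical reference model of a signed saturating left shift on 16 bits.
// Assumption: amt < 16.
int16_t sshl_sat16(int16_t x, unsigned amt) {
  assert(amt < 16);
  // Multiply by 2^amt in 64 bits; this avoids shifting a negative value.
  const int64_t wide = static_cast<int64_t>(x) * (int64_t{1} << amt);
  const int64_t max = std::numeric_limits<int16_t>::max();
  const int64_t min = std::numeric_limits<int16_t>::min();
  if (wide > max)
    return static_cast<int16_t>(max);
  if (wide < min)
    return static_cast<int16_t>(min);
  return static_cast<int16_t>(wide);
}
```

With this model, positive signed inputs that overflow clamp to the maximum and negative ones clamp to the minimum, which is the behaviour the `G_SELECT` between the two constants in the legalized output is meant to reproduce.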

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=tahiti -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX6 %s
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=fiji -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX8 %s
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=gfx900 -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX9 %s

				---
				name: sshlsat_s7
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_s7
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 127
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				arsenm (inline comment): These all failed to legalize. You need something like this in AMDGPULegalizerInfo to actually trigger any of the legalization code: `getActionDefinitionsBuilder({G_SSHLSAT, G_USHLSAT}) .scalarize(0) .clampScalar(0, S32) .lower();`
				ebevhan (author, inline comment): Okay, I thought something was off. Does this mean that any target that wants these legalized with the default legalization needs to specify this explicitly?
				arsenm (inline comment): Yes, nearly everything is opt-in in GlobalISel.
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 25
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: sshlsat_s7
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s7) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s7) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s7)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s7)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 9
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX8: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s7) = G_TRUNC [[ASHR1]](s16)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s7)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: sshlsat_s7
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s7) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s7) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s7)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s7)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 9
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX9: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s7) = G_TRUNC [[ASHR1]](s16)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s7)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s7) = G_TRUNC %0
				%3:_(s7) = G_TRUNC %1
				%4:_(s7) = G_SSHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...

				---
				name: sshlsat_s8
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_s8
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: sshlsat_s8
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s8)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s8)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX8: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[ASHR1]](s16)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s8)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: sshlsat_s8
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s8)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s8)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX9: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[ASHR1]](s16)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s8)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s8) = G_TRUNC %0
				%3:_(s8) = G_TRUNC %1
				%4:_(s8) = G_SSHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...

				---
				name: sshlsat_v2s8
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_v2s8
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 8
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C1]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C2]](s32)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C]](s32)
				; GFX6: [[LSHR4:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C1]](s32)
				; GFX6: [[LSHR5:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C2]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C3]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C2]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C6:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C6]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C4]], [[C5]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C2]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR3]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C3]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C2]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL2]](s32), [[C6]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C4]], [[C5]]
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[ASHR2]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL3]]
				; GFX6: [[ASHR3:%[0-9]+]]:_(s32) = G_ASHR [[SELECT3]], [[C2]](s32)
				; GFX6: [[C7:%[0-9]+]]:_(s16) = G_CONSTANT i16 255
				; GFX6: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[ASHR1]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s16) = G_AND [[TRUNC]], [[C7]]
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[C]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[ASHR3]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C3]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[AND3]], [[COPY6]](s32)
				; GFX6: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[SHL4]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s16) = G_OR [[AND2]], [[TRUNC1]]
				; GFX6: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[OR]](s16)
				; GFX6: $vgpr0 = COPY [[ANYEXT]](s32)
				; GFX8-LABEL: name: sshlsat_v2s8
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[BITCAST:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC]](s16)
				; GFX8: [[BITCAST1:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC1]](s16)
				; GFX8: [[UV:%[0-9]+]]:_(s8), [[UV1:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST]](<2 x s8>)
				; GFX8: [[UV2:%[0-9]+]]:_(s8), [[UV3:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST1]](<2 x s8>)
				; GFX8: [[SSHLSAT:%[0-9]+]]:_(s8) = G_SSHLSAT [[UV]], [[UV2]](s8)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[UV1]](s8)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[UV3]](s8)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX8: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[ASHR1]](s16)
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[SSHLSAT]](s8), [[TRUNC2]](s8)
				; GFX8: [[UV4:%[0-9]+]]:_(s8), [[UV5:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BUILD_VECTOR]](<2 x s8>)
				; GFX8: [[MV:%[0-9]+]]:_(s16) = G_MERGE_VALUES [[UV4]](s8), [[UV5]](s8)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[MV]](s16)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: sshlsat_v2s8
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[BITCAST:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC]](s16)
				; GFX9: [[BITCAST1:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC1]](s16)
				; GFX9: [[UV:%[0-9]+]]:_(s8), [[UV1:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST]](<2 x s8>)
				; GFX9: [[UV2:%[0-9]+]]:_(s8), [[UV3:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST1]](<2 x s8>)
				; GFX9: [[SSHLSAT:%[0-9]+]]:_(s8) = G_SSHLSAT [[UV]], [[UV2]](s8)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[UV1]](s8)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[UV3]](s8)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C3:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s16), [[C3]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C1]], [[C2]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL1]]
				; GFX9: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[SELECT1]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[ASHR1]](s16)
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[SSHLSAT]](s8), [[TRUNC2]](s8)
				; GFX9: [[UV4:%[0-9]+]]:_(s8), [[UV5:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BUILD_VECTOR]](<2 x s8>)
				; GFX9: [[MV:%[0-9]+]]:_(s16) = G_MERGE_VALUES [[UV4]](s8), [[UV5]](s8)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[MV]](s16)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s16) = G_TRUNC %0
				%3:_(s16) = G_TRUNC %1
				%4:_(<2 x s8>) = G_BITCAST %2
				%5:_(<2 x s8>) = G_BITCAST %3
				%6:_(<2 x s8>) = G_SSHLSAT %4, %5
				%7:_(s16) = G_BITCAST %6
				%8:_(s32) = G_ANYEXT %7
				$vgpr0 = COPY %8
				...

				---
				name: sshlsat_s16
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_s16
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: sshlsat_s16
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC]], [[TRUNC1]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[TRUNC]](s16), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[TRUNC]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[SELECT1]](s16)
				; GFX8: $vgpr0 = COPY [[ANYEXT]](s32)
				; GFX9-LABEL: name: sshlsat_s16
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC]], [[TRUNC1]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[TRUNC]](s16), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[TRUNC]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[SELECT1]](s16)
				; GFX9: $vgpr0 = COPY [[ANYEXT]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s16) = G_TRUNC %0
				%3:_(s16) = G_TRUNC %1
				%4:_(s16) = G_SSHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...
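
The GFX8/GFX9 CHECK lines for `sshlsat_s16` and `sshlsat_s8` above appear to encode a shift, shift-back and compare expansion, plus a pre-shift into the high bits for types narrower than 16 bits. The following is a rough C++ sketch of that sequence, not the legalizer's actual code. The function names are hypothetical. It assumes the shift amount is smaller than the bit width, and it relies on two's-complement narrowing and arithmetic right shift of negative values (guaranteed since C++20, and the behaviour of common compilers before that).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical mirror of the s16 expansion: G_SHL, G_ASHR, two G_ICMPs and
// two G_SELECTs. Assumption: amt < 16.
int16_t sshl_sat16_expanded(int16_t x, unsigned amt) {
  assert(amt < 16);
  // G_SHL: plain 16-bit shift, bits shifted past bit 15 are discarded.
  const int16_t shl = static_cast<int16_t>(static_cast<uint16_t>(
      static_cast<uint32_t>(static_cast<uint16_t>(x)) << amt));
  // G_ASHR: shift back; this equals x only if no significant bits were lost.
  const int16_t ashr = static_cast<int16_t>(shl >> amt);
  // G_ICMP slt + G_SELECT: choose the saturation value from the sign of x.
  const int16_t sat = x < 0 ? INT16_MIN : INT16_MAX;
  // G_ICMP ne + G_SELECT: saturate on overflow, otherwise keep the shift.
  return x != ashr ? sat : shl;
}

// Hypothetical mirror of the promoted s8 case: place the value in the top
// 8 bits of a 16-bit register, do the 16-bit saturating shift, then shift
// the result back down. Assumption: amt < 8.
int8_t sshl_sat8_promoted(int8_t x, unsigned amt) {
  assert(amt < 8);
  // G_SHL by (16 - 8): the low 8 bits become zero.
  const int16_t hi = static_cast<int16_t>(static_cast<uint16_t>(
      static_cast<uint32_t>(static_cast<uint8_t>(x)) << 8));
  const int16_t r = sshl_sat16_expanded(hi, amt);
  // G_ASHR by (16 - 8), then G_TRUNC back to 8 bits.
  return static_cast<int8_t>(r >> 8);
}
```

Pre-shifting into the high bits makes 8-bit overflow coincide with 16-bit overflow, so the wider saturation constants (-32768 and 32767) turn into -128 and 127 after the final arithmetic shift right.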

				---
				name: sshlsat_v2s16
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_v2s16
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[COPY]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[COPY1]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL2]](s32), [[C4]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[ASHR2]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL3]]
				; GFX6: [[ASHR3:%[0-9]+]]:_(s32) = G_ASHR [[SELECT3]], [[C]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY6]], [[C1]]
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[ASHR3]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[AND3]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND2]], [[SHL4]]
				; GFX6: [[BITCAST2:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: $vgpr0 = COPY [[BITCAST2]](<2 x s16>)
				; GFX8-LABEL: name: sshlsat_v2s16
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX8: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<2 x s16>)
				; GFX8: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<2 x s16>)
				; GFX8: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV]], [[UV2]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV1]], [[UV3]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV3]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s16), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV1]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SELECT1]](s16)
				; GFX8: $vgpr0 = COPY [[BUILD_VECTOR]](<2 x s16>)
				; GFX9-LABEL: name: sshlsat_v2s16
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX9: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<2 x s16>)
				; GFX9: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<2 x s16>)
				; GFX9: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV]], [[UV2]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV1]], [[UV3]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV3]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s16), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV1]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SELECT1]](s16)
				; GFX9: $vgpr0 = COPY [[BUILD_VECTOR]](<2 x s16>)
				%0:_(<2 x s16>) = COPY $vgpr0
				%1:_(<2 x s16>) = COPY $vgpr1
				%2:_(<2 x s16>) = G_SSHLSAT %0, %1
				$vgpr0 = COPY %2
				...

				---
				name: sshlsat_v3s16
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1_vgpr2

				; GFX6-LABEL: name: sshlsat_v3s16
				; GFX6: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX6: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX6: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX6: [[INSERT:%[0-9]+]]:_(<4 x s16>) = G_INSERT [[DEF]], [[UV]](<3 x s16>), 0
				; GFX6: [[UV2:%[0-9]+]]:_(<2 x s16>), [[UV3:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[INSERT]](<4 x s16>)
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[UV2]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[UV3]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[INSERT1:%[0-9]+]]:_(<4 x s16>) = G_INSERT [[DEF]], [[UV1]](<3 x s16>), 0
				; GFX6: [[UV4:%[0-9]+]]:_(<2 x s16>), [[UV5:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[INSERT1]](<4 x s16>)
				; GFX6: [[BITCAST2:%[0-9]+]]:_(s32) = G_BITCAST [[UV4]](<2 x s16>)
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST2]], [[C]](s32)
				; GFX6: [[BITCAST3:%[0-9]+]]:_(s32) = G_BITCAST [[UV5]](<2 x s16>)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST3]], [[C]](s32)
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST2]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR2]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY4]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL2]](s32), [[C4]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[ASHR2]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL3]]
				; GFX6: [[ASHR3:%[0-9]+]]:_(s32) = G_ASHR [[SELECT3]], [[C]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[BITCAST3]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY6]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[COPY5]], [[C]](s32)
				; GFX6: [[SHL5:%[0-9]+]]:_(s32) = G_SHL [[SHL4]], [[AND2]](s32)
				; GFX6: [[ASHR4:%[0-9]+]]:_(s32) = G_ASHR [[SHL5]], [[AND2]](s32)
				; GFX6: [[ICMP4:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL4]](s32), [[C4]]
				; GFX6: [[SELECT4:%[0-9]+]]:_(s32) = G_SELECT [[ICMP4]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP5:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL4]](s32), [[ASHR4]]
				; GFX6: [[SELECT5:%[0-9]+]]:_(s32) = G_SELECT [[ICMP5]](s32), [[SELECT4]], [[SHL5]]
				; GFX6: [[ASHR5:%[0-9]+]]:_(s32) = G_ASHR [[SELECT5]], [[C]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[COPY8:%[0-9]+]]:_(s32) = COPY [[ASHR3]](s32)
				; GFX6: [[AND4:%[0-9]+]]:_(s32) = G_AND [[COPY8]], [[C1]]
				; GFX6: [[SHL6:%[0-9]+]]:_(s32) = G_SHL [[AND4]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND3]], [[SHL6]]
				; GFX6: [[BITCAST4:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: [[COPY9:%[0-9]+]]:_(s32) = COPY [[ASHR5]](s32)
				; GFX6: [[AND5:%[0-9]+]]:_(s32) = G_AND [[COPY9]], [[C1]]
				; GFX6: [[COPY10:%[0-9]+]]:_(s32) = COPY [[C4]](s32)
				; GFX6: [[SHL7:%[0-9]+]]:_(s32) = G_SHL [[COPY10]], [[C]](s32)
				; GFX6: [[OR1:%[0-9]+]]:_(s32) = G_OR [[AND5]], [[SHL7]]
				; GFX6: [[BITCAST5:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR1]](s32)
				; GFX6: [[CONCAT_VECTORS:%[0-9]+]]:_(<4 x s16>) = G_CONCAT_VECTORS [[BITCAST4]](<2 x s16>), [[BITCAST5]](<2 x s16>)
				; GFX6: [[EXTRACT:%[0-9]+]]:_(<3 x s16>) = G_EXTRACT [[CONCAT_VECTORS]](<4 x s16>), 0
				; GFX6: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX6: [[CONCAT_VECTORS1:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF1]](<4 x s16>), [[DEF]](<4 x s16>), [[DEF]](<4 x s16>)
				; GFX6: [[UV6:%[0-9]+]]:_(<3 x s16>), [[UV7:%[0-9]+]]:_(<3 x s16>), [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS1]](<12 x s16>)
				; GFX6: [[CONCAT_VECTORS2:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[EXTRACT]](<3 x s16>), [[UV6]](<3 x s16>)
				; GFX6: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS2]](<6 x s16>)
				; GFX8-LABEL: name: sshlsat_v3s16
				; GFX8: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX8: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX8: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16), [[UV4:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV]](<3 x s16>)
				; GFX8: [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV1]](<3 x s16>)
				; GFX8: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV2]], [[UV5]](s16)
				; GFX8: [[SSHLSAT1:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV3]], [[UV6]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV4]], [[UV7]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV7]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV4]](s16), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV4]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SSHLSAT1]](s16), [[SELECT1]](s16)
				; GFX8: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX8: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX8: [[CONCAT_VECTORS:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF]](<4 x s16>), [[DEF1]](<4 x s16>), [[DEF1]](<4 x s16>)
				; GFX8: [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>), [[UV10:%[0-9]+]]:_(<3 x s16>), [[UV11:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS]](<12 x s16>)
				; GFX8: [[CONCAT_VECTORS1:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[BUILD_VECTOR]](<3 x s16>), [[UV8]](<3 x s16>)
				; GFX8: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS1]](<6 x s16>)
				; GFX9-LABEL: name: sshlsat_v3s16
				; GFX9: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX9: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX9: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16), [[UV4:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV]](<3 x s16>)
				; GFX9: [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV1]](<3 x s16>)
				; GFX9: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV2]], [[UV5]](s16)
				; GFX9: [[SSHLSAT1:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV3]], [[UV6]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV4]], [[UV7]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV7]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV4]](s16), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV4]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SSHLSAT1]](s16), [[SELECT1]](s16)
				; GFX9: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX9: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX9: [[CONCAT_VECTORS:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF]](<4 x s16>), [[DEF1]](<4 x s16>), [[DEF1]](<4 x s16>)
				; GFX9: [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>), [[UV10:%[0-9]+]]:_(<3 x s16>), [[UV11:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS]](<12 x s16>)
				; GFX9: [[CONCAT_VECTORS1:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[BUILD_VECTOR]](<3 x s16>), [[UV8]](<3 x s16>)
				; GFX9: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS1]](<6 x s16>)
				%0:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				%1:_(<3 x s16>), %2:_(<3 x s16>) = G_UNMERGE_VALUES %0
				%3:_(<3 x s16>) = G_SSHLSAT %1, %2
				%4:_(<3 x s16>) = G_IMPLICIT_DEF
				%5:_(<6 x s16>) = G_CONCAT_VECTORS %3, %4
				$vgpr0_vgpr1_vgpr2 = COPY %5
				...

				---
				name: sshlsat_v4s16
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: sshlsat_v4s16
				; GFX6: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX6: [[UV:%[0-9]+]]:_(<2 x s16>), [[UV1:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[UV]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[UV1]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[UV2:%[0-9]+]]:_(<2 x s16>), [[UV3:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX6: [[BITCAST2:%[0-9]+]]:_(s32) = G_BITCAST [[UV2]](<2 x s16>)
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST2]], [[C]](s32)
				; GFX6: [[BITCAST3:%[0-9]+]]:_(s32) = G_BITCAST [[UV3]](<2 x s16>)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST3]], [[C]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[BITCAST2]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL]](s32), [[C4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL1]]
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR2]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL2]](s32), [[C4]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[ASHR2]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL3]]
				; GFX6: [[ASHR3:%[0-9]+]]:_(s32) = G_ASHR [[SELECT3]], [[C]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[BITCAST3]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[COPY6]], [[C]](s32)
				; GFX6: [[SHL5:%[0-9]+]]:_(s32) = G_SHL [[SHL4]], [[AND2]](s32)
				; GFX6: [[ASHR4:%[0-9]+]]:_(s32) = G_ASHR [[SHL5]], [[AND2]](s32)
				; GFX6: [[ICMP4:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL4]](s32), [[C4]]
				; GFX6: [[SELECT4:%[0-9]+]]:_(s32) = G_SELECT [[ICMP4]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP5:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL4]](s32), [[ASHR4]]
				; GFX6: [[SELECT5:%[0-9]+]]:_(s32) = G_SELECT [[ICMP5]](s32), [[SELECT4]], [[SHL5]]
				; GFX6: [[ASHR5:%[0-9]+]]:_(s32) = G_ASHR [[SELECT5]], [[C]](s32)
				; GFX6: [[COPY8:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: [[COPY9:%[0-9]+]]:_(s32) = COPY [[LSHR3]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY9]], [[C1]]
				; GFX6: [[SHL6:%[0-9]+]]:_(s32) = G_SHL [[COPY8]], [[C]](s32)
				; GFX6: [[SHL7:%[0-9]+]]:_(s32) = G_SHL [[SHL6]], [[AND3]](s32)
				; GFX6: [[ASHR6:%[0-9]+]]:_(s32) = G_ASHR [[SHL7]], [[AND3]](s32)
				; GFX6: [[ICMP6:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[SHL6]](s32), [[C4]]
				; GFX6: [[SELECT6:%[0-9]+]]:_(s32) = G_SELECT [[ICMP6]](s1), [[C2]], [[C3]]
				; GFX6: [[ICMP7:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL6]](s32), [[ASHR6]]
				; GFX6: [[SELECT7:%[0-9]+]]:_(s32) = G_SELECT [[ICMP7]](s32), [[SELECT6]], [[SHL7]]
				; GFX6: [[ASHR7:%[0-9]+]]:_(s32) = G_ASHR [[SELECT7]], [[C]](s32)
				; GFX6: [[COPY10:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
				; GFX6: [[AND4:%[0-9]+]]:_(s32) = G_AND [[COPY10]], [[C1]]
				; GFX6: [[COPY11:%[0-9]+]]:_(s32) = COPY [[ASHR3]](s32)
				; GFX6: [[AND5:%[0-9]+]]:_(s32) = G_AND [[COPY11]], [[C1]]
				; GFX6: [[SHL8:%[0-9]+]]:_(s32) = G_SHL [[AND5]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND4]], [[SHL8]]
				; GFX6: [[BITCAST4:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: [[COPY12:%[0-9]+]]:_(s32) = COPY [[ASHR5]](s32)
				; GFX6: [[AND6:%[0-9]+]]:_(s32) = G_AND [[COPY12]], [[C1]]
				; GFX6: [[COPY13:%[0-9]+]]:_(s32) = COPY [[ASHR7]](s32)
				; GFX6: [[AND7:%[0-9]+]]:_(s32) = G_AND [[COPY13]], [[C1]]
				; GFX6: [[SHL9:%[0-9]+]]:_(s32) = G_SHL [[AND7]], [[C]](s32)
				; GFX6: [[OR1:%[0-9]+]]:_(s32) = G_OR [[AND6]], [[SHL9]]
				; GFX6: [[BITCAST5:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR1]](s32)
				; GFX6: [[CONCAT_VECTORS:%[0-9]+]]:_(<4 x s16>) = G_CONCAT_VECTORS [[BITCAST4]](<2 x s16>), [[BITCAST5]](<2 x s16>)
				; GFX6: $vgpr0_vgpr1 = COPY [[CONCAT_VECTORS]](<4 x s16>)
				; GFX8-LABEL: name: sshlsat_v4s16
				; GFX8: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX8: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16), [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX8: [[UV4:%[0-9]+]]:_(s16), [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX8: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV]], [[UV4]](s16)
				; GFX8: [[SSHLSAT1:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV1]], [[UV5]](s16)
				; GFX8: [[SSHLSAT2:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV2]], [[UV6]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV3]], [[UV7]](s16)
				; GFX8: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV7]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX8: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV3]](s16), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV3]](s16), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<4 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SSHLSAT1]](s16), [[SSHLSAT2]](s16), [[SELECT1]](s16)
				; GFX8: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<4 x s16>)
				; GFX9-LABEL: name: sshlsat_v4s16
				; GFX9: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX9: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16), [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX9: [[UV4:%[0-9]+]]:_(s16), [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX9: [[SSHLSAT:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV]], [[UV4]](s16)
				; GFX9: [[SSHLSAT1:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV1]], [[UV5]](s16)
				; GFX9: [[SSHLSAT2:%[0-9]+]]:_(s16) = G_SSHLSAT [[UV2]], [[UV6]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV3]], [[UV7]](s16)
				; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[UV7]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -32768
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 32767
				; GFX9: [[C2:%[0-9]+]]:_(s16) = G_CONSTANT i16 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV3]](s16), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV3]](s16), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s16) = G_SELECT [[ICMP1]](s16), [[SELECT]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<4 x s16>) = G_BUILD_VECTOR [[SSHLSAT]](s16), [[SSHLSAT1]](s16), [[SSHLSAT2]](s16), [[SELECT1]](s16)
				; GFX9: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<4 x s16>)
				%0:_(<4 x s16>) = COPY $vgpr0_vgpr1
				%1:_(<4 x s16>) = COPY $vgpr2_vgpr3
				%2:_(<4 x s16>) = G_SSHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: sshlsat_s32
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: sshlsat_s32
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[COPY1]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s32), [[C2]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX6: $vgpr0 = COPY [[SELECT1]](s32)
				; GFX8-LABEL: name: sshlsat_s32
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX8: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[COPY1]](s32)
				; GFX8: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX8: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX8: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s32), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX8: $vgpr0 = COPY [[SELECT1]](s32)
				; GFX9-LABEL: name: sshlsat_s32
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX9: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[COPY1]](s32)
				; GFX9: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX9: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX9: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s32), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX9: $vgpr0 = COPY [[SELECT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s32) = G_SSHLSAT %0, %1
				$vgpr0 = COPY %2
				...

				---
				name: sshlsat_v2s32
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: sshlsat_v2s32
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX6: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX6: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX6: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[UV2]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV]](s32), [[C2]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX6: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[UV3]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s32), [[C2]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C]], [[C1]]
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[ASHR1]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL1]]
				; GFX6: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT1]](s32), [[SELECT3]](s32)
				; GFX6: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				; GFX8-LABEL: name: sshlsat_v2s32
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX8: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX8: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX8: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX8: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[UV2]](s32)
				; GFX8: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX8: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX8: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV]](s32), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX8: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX8: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[UV3]](s32)
				; GFX8: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s32), [[C2]]
				; GFX8: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[ASHR1]]
				; GFX8: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL1]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT1]](s32), [[SELECT3]](s32)
				; GFX8: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				; GFX9-LABEL: name: sshlsat_v2s32
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX9: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX9: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX9: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX9: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[UV2]](s32)
				; GFX9: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2147483648
				; GFX9: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 2147483647
				; GFX9: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV]](s32), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[SELECT]], [[SHL]]
				; GFX9: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX9: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[UV3]](s32)
				; GFX9: [[ICMP2:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s32), [[C2]]
				; GFX9: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[ASHR1]]
				; GFX9: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[SELECT2]], [[SHL1]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT1]](s32), [[SELECT3]](s32)
				; GFX9: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				%0:_(<2 x s32>) = COPY $vgpr0_vgpr1
				%1:_(<2 x s32>) = COPY $vgpr2_vgpr3
				%2:_(<2 x s32>) = G_SSHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: sshlsat_s64
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: sshlsat_s64
				; GFX6: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX6: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX6: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[COPY1]](s64)
				; GFX6: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX6: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX6: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s64), [[C2]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX6: $vgpr0_vgpr1 = COPY [[SELECT1]](s64)
				; GFX8-LABEL: name: sshlsat_s64
				; GFX8: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX8: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX8: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[COPY1]](s64)
				; GFX8: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX8: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX8: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s64), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX8: $vgpr0_vgpr1 = COPY [[SELECT1]](s64)
				; GFX9-LABEL: name: sshlsat_s64
				; GFX9: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX9: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX9: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[COPY1]](s64)
				; GFX9: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX9: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX9: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[COPY]](s64), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX9: $vgpr0_vgpr1 = COPY [[SELECT1]](s64)
				%0:_(s64) = COPY $vgpr0_vgpr1
				%1:_(s64) = COPY $vgpr2_vgpr3
				%2:_(s64) = G_SSHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: sshlsat_v2s64
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1_vgpr2_vgpr3, $vgpr4_vgpr5_vgpr6_vgpr7

				; GFX6-LABEL: name: sshlsat_v2s64
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX6: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX6: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX6: [[SSHLSAT:%[0-9]+]]:_(s64) = G_SSHLSAT [[UV]], [[UV2]](s64)
				; GFX6: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX6: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[UV3]](s64)
				; GFX6: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX6: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX6: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX6: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s64), [[C2]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX6: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[ASHR]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX6: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[SSHLSAT]](s64), [[SELECT1]](s64)
				; GFX6: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				; GFX8-LABEL: name: sshlsat_v2s64
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX8: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX8: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX8: [[SSHLSAT:%[0-9]+]]:_(s64) = G_SSHLSAT [[UV]], [[UV2]](s64)
				; GFX8: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX8: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[UV3]](s64)
				; GFX8: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX8: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX8: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX8: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s64), [[C2]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX8: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[ASHR]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[SSHLSAT]](s64), [[SELECT1]](s64)
				; GFX8: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				; GFX9-LABEL: name: sshlsat_v2s64
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX9: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX9: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX9: [[SSHLSAT:%[0-9]+]]:_(s64) = G_SSHLSAT [[UV]], [[UV2]](s64)
				; GFX9: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX9: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[UV3]](s64)
				; GFX9: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -9223372036854775808
				; GFX9: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 9223372036854775807
				; GFX9: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
				; GFX9: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(slt), [[UV1]](s64), [[C2]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s1), [[C]], [[C1]]
				; GFX9: [[ICMP1:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[ASHR]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s64) = G_SELECT [[ICMP1]](s64), [[SELECT]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[SSHLSAT]](s64), [[SELECT1]](s64)
				; GFX9: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				%0:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				%1:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				%2:_(<2 x s64>) = G_SSHLSAT %0, %1
				$vgpr0_vgpr1_vgpr2_vgpr3 = COPY %2
				...
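
For readers skimming the checks above: the s64 expansion of G_SSHLSAT shifts left, shifts back arithmetically, and saturates when the round trip does not reproduce the input. Below is a minimal C++ sketch of that sequence. The helper name `sshlsat64` is hypothetical and not part of the patch. The sketch assumes shift amounts below the bit width, and it relies on two's-complement conversion and arithmetic right shift of signed values, which C++20 guarantees and mainstream compilers provide in earlier modes.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical helper (not from the patch): models the expanded form of
// G_SSHLSAT on s64 as it appears in the CHECK lines.
int64_t sshlsat64(int64_t x, uint64_t amt) {
  // Assumption: shift amounts >= the bit width are not modelled here.
  assert(amt < 64 && "shift amount out of range for this sketch");

  // G_SHL: shift in the unsigned domain to avoid signed-overflow UB.
  int64_t shl = static_cast<int64_t>(static_cast<uint64_t>(x) << amt);
  // G_ASHR: arithmetic shift back by the same amount.
  int64_t ashr = shl >> amt;

  // G_ICMP slt + G_SELECT: choose the saturation value from the sign of x.
  int64_t sat = x < 0 ? std::numeric_limits<int64_t>::min()
                      : std::numeric_limits<int64_t>::max();

  // G_ICMP ne + G_SELECT: bits were lost if the round trip changed x.
  return x != ashr ? sat : shl;
}
```

For example, under these assumptions `sshlsat64(1, 63)` saturates to the maximum value, while `sshlsat64(-1, 63)` is exactly representable and returns the minimum value without saturating.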

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ushlsat.mir

This file was added.
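
In the GFX6 checks in this file, narrow operands are widened to s32 before the unsigned saturating shift is expanded. The value is pre-shifted into the top bits, the shl/lshr round trip is compared at s32, and the result is shifted back down. Below is a minimal C++ sketch of the s16 case. The helper name `ushlsat16` is hypothetical and not part of the patch, and the sketch assumes the masked shift amount stays below 16.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the patch): models the GFX6 expansion of
// G_USHLSAT on s16, which is performed at s32.
uint16_t ushlsat16(uint16_t x, uint16_t y) {
  // G_AND with 65535: zero-extend the shift amount.
  uint32_t amt = static_cast<uint32_t>(y) & 0xFFFFu;
  // Assumption: larger shift amounts are not modelled in this sketch.
  assert(amt < 16 && "shift amount out of range for this sketch");

  // G_SHL by 16: move the 16-bit value into the top half of the register.
  uint32_t hi = static_cast<uint32_t>(x) << 16;
  // G_SHL / G_LSHR: shift left, then logically shift back.
  uint32_t shl = hi << amt;
  uint32_t back = shl >> amt;

  // G_ICMP ne + G_SELECT: all-ones if the round trip lost bits.
  uint32_t sel = (hi != back) ? 0xFFFFFFFFu : shl;

  // G_LSHR by 16: move the result back to the low half.
  return static_cast<uint16_t>(sel >> 16);
}
```

Placing the value in the top bits means overflow out of the 16-bit range shows up as overflow out of the 32-bit register, so the same compare-and-select works at the wider width. The s7 and s8 checks follow the same pattern with pre-shift amounts of 25 and 24.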

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=tahiti -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX6 %s
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=fiji -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX8 %s
				# RUN: llc -global-isel-abort=0 -march=amdgcn -mcpu=gfx900 -run-pass=legalizer %s -o - | FileCheck -check-prefix=GFX9 %s

				---
				name: ushlsat_s7
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_s7
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 127
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 25
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: ushlsat_s7
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s7) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s7) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s7)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s7)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 9
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX8: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s7) = G_TRUNC [[LSHR1]](s16)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s7)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: ushlsat_s7
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s7) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s7) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s7)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s7)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 9
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX9: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s7) = G_TRUNC [[LSHR1]](s16)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s7)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s7) = G_TRUNC %0
				%3:_(s7) = G_TRUNC %1
				%4:_(s7) = G_USHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...

				---
				name: ushlsat_s8
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_s8
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: ushlsat_s8
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s8)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s8)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX8: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[LSHR1]](s16)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s8)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: ushlsat_s8
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[TRUNC]](s8)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[TRUNC1]](s8)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX9: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[LSHR1]](s16)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[TRUNC2]](s8)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s8) = G_TRUNC %0
				%3:_(s8) = G_TRUNC %1
				%4:_(s8) = G_USHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...

				---
				name: ushlsat_v2s8
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_v2s8
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 8
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C1]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[COPY]], [[C2]](s32)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C]](s32)
				; GFX6: [[LSHR4:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C1]](s32)
				; GFX6: [[LSHR5:%[0-9]+]]:_(s32) = G_LSHR [[COPY1]], [[C2]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C3]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C2]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR6:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR6]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C4]], [[SHL1]]
				; GFX6: [[LSHR7:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C2]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR3]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C3]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C2]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[LSHR8:%[0-9]+]]:_(s32) = G_LSHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[LSHR8]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C4]], [[SHL3]]
				; GFX6: [[LSHR9:%[0-9]+]]:_(s32) = G_LSHR [[SELECT1]], [[C2]](s32)
				; GFX6: [[C5:%[0-9]+]]:_(s16) = G_CONSTANT i16 255
				; GFX6: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[LSHR7]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s16) = G_AND [[TRUNC]], [[C5]]
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[C]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[LSHR9]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C3]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[AND3]], [[COPY6]](s32)
				; GFX6: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[SHL4]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s16) = G_OR [[AND2]], [[TRUNC1]]
				; GFX6: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[OR]](s16)
				; GFX6: $vgpr0 = COPY [[ANYEXT]](s32)
				; GFX8-LABEL: name: ushlsat_v2s8
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[BITCAST:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC]](s16)
				; GFX8: [[BITCAST1:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC1]](s16)
				; GFX8: [[UV:%[0-9]+]]:_(s8), [[UV1:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST]](<2 x s8>)
				; GFX8: [[UV2:%[0-9]+]]:_(s8), [[UV3:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST1]](<2 x s8>)
				; GFX8: [[USHLSAT:%[0-9]+]]:_(s8) = G_USHLSAT [[UV]], [[UV2]](s8)
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[UV1]](s8)
				; GFX8: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[UV3]](s8)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX8: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX8: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX8: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX8: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[LSHR1]](s16)
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[USHLSAT]](s8), [[TRUNC2]](s8)
				; GFX8: [[UV4:%[0-9]+]]:_(s8), [[UV5:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BUILD_VECTOR]](<2 x s8>)
				; GFX8: [[MV:%[0-9]+]]:_(s16) = G_MERGE_VALUES [[UV4]](s8), [[UV5]](s8)
				; GFX8: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[MV]](s16)
				; GFX8: $vgpr0 = COPY [[ANYEXT1]](s32)
				; GFX9-LABEL: name: ushlsat_v2s8
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[BITCAST:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC]](s16)
				; GFX9: [[BITCAST1:%[0-9]+]]:_(<2 x s8>) = G_BITCAST [[TRUNC1]](s16)
				; GFX9: [[UV:%[0-9]+]]:_(s8), [[UV1:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST]](<2 x s8>)
				; GFX9: [[UV2:%[0-9]+]]:_(s8), [[UV3:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BITCAST1]](<2 x s8>)
				; GFX9: [[USHLSAT:%[0-9]+]]:_(s8) = G_USHLSAT [[UV]], [[UV2]](s8)
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s16) = G_ANYEXT [[UV1]](s8)
				; GFX9: [[ZEXT:%[0-9]+]]:_(s16) = G_ZEXT [[UV3]](s8)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[ANYEXT]], [[C]](s16)
				; GFX9: [[SHL1:%[0-9]+]]:_(s16) = G_SHL [[SHL]], [[ZEXT]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL1]], [[ZEXT]](s16)
				; GFX9: [[C1:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[SHL]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C1]], [[SHL1]]
				; GFX9: [[LSHR1:%[0-9]+]]:_(s16) = G_LSHR [[SELECT]], [[C]](s16)
				; GFX9: [[TRUNC2:%[0-9]+]]:_(s8) = G_TRUNC [[LSHR1]](s16)
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[USHLSAT]](s8), [[TRUNC2]](s8)
				; GFX9: [[UV4:%[0-9]+]]:_(s8), [[UV5:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[BUILD_VECTOR]](<2 x s8>)
				; GFX9: [[MV:%[0-9]+]]:_(s16) = G_MERGE_VALUES [[UV4]](s8), [[UV5]](s8)
				; GFX9: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[MV]](s16)
				; GFX9: $vgpr0 = COPY [[ANYEXT1]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s16) = G_TRUNC %0
				%3:_(s16) = G_TRUNC %1
				%4:_(<2 x s8>) = G_BITCAST %2
				%5:_(<2 x s8>) = G_BITCAST %3
				%6:_(<2 x s8>) = G_USHLSAT %4, %5
				%7:_(s16) = G_BITCAST %6
				%8:_(s32) = G_ANYEXT %7
				$vgpr0 = COPY %8
				...

				---
				name: ushlsat_s16
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_s16
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C1]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C1]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: $vgpr0 = COPY [[COPY4]](s32)
				; GFX8-LABEL: name: ushlsat_s16
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX8: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC]], [[TRUNC1]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[TRUNC1]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[TRUNC]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX8: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[SELECT]](s16)
				; GFX8: $vgpr0 = COPY [[ANYEXT]](s32)
				; GFX9-LABEL: name: ushlsat_s16
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
				; GFX9: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC]], [[TRUNC1]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[TRUNC1]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[TRUNC]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX9: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[SELECT]](s16)
				; GFX9: $vgpr0 = COPY [[ANYEXT]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s16) = G_TRUNC %0
				%3:_(s16) = G_TRUNC %1
				%4:_(s16) = G_USHLSAT %2, %3
				%5:_(s32) = G_ANYEXT %4
				$vgpr0 = COPY %5
				...

				---
				name: ushlsat_v2s16
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_v2s16
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[COPY]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[COPY1]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR2]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[LSHR4:%[0-9]+]]:_(s32) = G_LSHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[LSHR4]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C2]], [[SHL3]]
				; GFX6: [[LSHR5:%[0-9]+]]:_(s32) = G_LSHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[LSHR3]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY6]], [[C1]]
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[LSHR5]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[AND3]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND2]], [[SHL4]]
				; GFX6: [[BITCAST2:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: $vgpr0 = COPY [[BITCAST2]](<2 x s16>)
				; GFX8-LABEL: name: ushlsat_v2s16
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX8: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<2 x s16>)
				; GFX8: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<2 x s16>)
				; GFX8: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV]], [[UV2]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV1]], [[UV3]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV3]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV1]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[SELECT]](s16)
				; GFX8: $vgpr0 = COPY [[BUILD_VECTOR]](<2 x s16>)
				; GFX9-LABEL: name: ushlsat_v2s16
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s16>) = COPY $vgpr1
				; GFX9: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<2 x s16>)
				; GFX9: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<2 x s16>)
				; GFX9: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV]], [[UV2]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV1]], [[UV3]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV3]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV1]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[SELECT]](s16)
				; GFX9: $vgpr0 = COPY [[BUILD_VECTOR]](<2 x s16>)
				%0:_(<2 x s16>) = COPY $vgpr0
				%1:_(<2 x s16>) = COPY $vgpr1
				%2:_(<2 x s16>) = G_USHLSAT %0, %1
				$vgpr0 = COPY %2
				...

				---
				name: ushlsat_v3s16
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1_vgpr2

				; GFX6-LABEL: name: ushlsat_v3s16
				; GFX6: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX6: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX6: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX6: [[INSERT:%[0-9]+]]:_(<4 x s16>) = G_INSERT [[DEF]], [[UV]](<3 x s16>), 0
				; GFX6: [[UV2:%[0-9]+]]:_(<2 x s16>), [[UV3:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[INSERT]](<4 x s16>)
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[UV2]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[UV3]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[INSERT1:%[0-9]+]]:_(<4 x s16>) = G_INSERT [[DEF]], [[UV1]](<3 x s16>), 0
				; GFX6: [[UV4:%[0-9]+]]:_(<2 x s16>), [[UV5:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[INSERT1]](<4 x s16>)
				; GFX6: [[BITCAST2:%[0-9]+]]:_(s32) = G_BITCAST [[UV4]](<2 x s16>)
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST2]], [[C]](s32)
				; GFX6: [[BITCAST3:%[0-9]+]]:_(s32) = G_BITCAST [[UV5]](<2 x s16>)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST3]], [[C]](s32)
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST2]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR4:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR5:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C]](s32)
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR2]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY4]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[LSHR6:%[0-9]+]]:_(s32) = G_LSHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[LSHR6]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C2]], [[SHL3]]
				; GFX6: [[LSHR7:%[0-9]+]]:_(s32) = G_LSHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[BITCAST3]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY6]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[COPY5]], [[C]](s32)
				; GFX6: [[SHL5:%[0-9]+]]:_(s32) = G_SHL [[SHL4]], [[AND2]](s32)
				; GFX6: [[LSHR8:%[0-9]+]]:_(s32) = G_LSHR [[SHL5]], [[AND2]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL4]](s32), [[LSHR8]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s32), [[C2]], [[SHL5]]
				; GFX6: [[LSHR9:%[0-9]+]]:_(s32) = G_LSHR [[SELECT2]], [[C]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[LSHR5]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[COPY8:%[0-9]+]]:_(s32) = COPY [[LSHR7]](s32)
				; GFX6: [[AND4:%[0-9]+]]:_(s32) = G_AND [[COPY8]], [[C1]]
				; GFX6: [[SHL6:%[0-9]+]]:_(s32) = G_SHL [[AND4]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND3]], [[SHL6]]
				; GFX6: [[BITCAST4:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: [[COPY9:%[0-9]+]]:_(s32) = COPY [[LSHR9]](s32)
				; GFX6: [[AND5:%[0-9]+]]:_(s32) = G_AND [[COPY9]], [[C1]]
				; GFX6: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
				; GFX6: [[SHL7:%[0-9]+]]:_(s32) = G_SHL [[C3]], [[C]](s32)
				; GFX6: [[OR1:%[0-9]+]]:_(s32) = G_OR [[AND5]], [[SHL7]]
				; GFX6: [[BITCAST5:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR1]](s32)
				; GFX6: [[CONCAT_VECTORS:%[0-9]+]]:_(<4 x s16>) = G_CONCAT_VECTORS [[BITCAST4]](<2 x s16>), [[BITCAST5]](<2 x s16>)
				; GFX6: [[EXTRACT:%[0-9]+]]:_(<3 x s16>) = G_EXTRACT [[CONCAT_VECTORS]](<4 x s16>), 0
				; GFX6: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX6: [[CONCAT_VECTORS1:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF1]](<4 x s16>), [[DEF]](<4 x s16>), [[DEF]](<4 x s16>)
				; GFX6: [[UV6:%[0-9]+]]:_(<3 x s16>), [[UV7:%[0-9]+]]:_(<3 x s16>), [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS1]](<12 x s16>)
				; GFX6: [[CONCAT_VECTORS2:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[EXTRACT]](<3 x s16>), [[UV6]](<3 x s16>)
				; GFX6: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS2]](<6 x s16>)
				; GFX8-LABEL: name: ushlsat_v3s16
				; GFX8: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX8: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX8: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16), [[UV4:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV]](<3 x s16>)
				; GFX8: [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV1]](<3 x s16>)
				; GFX8: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV2]], [[UV5]](s16)
				; GFX8: [[USHLSAT1:%[0-9]+]]:_(s16) = G_USHLSAT [[UV3]], [[UV6]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV4]], [[UV7]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV7]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV4]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[USHLSAT1]](s16), [[SELECT]](s16)
				; GFX8: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX8: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX8: [[CONCAT_VECTORS:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF]](<4 x s16>), [[DEF1]](<4 x s16>), [[DEF1]](<4 x s16>)
				; GFX8: [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>), [[UV10:%[0-9]+]]:_(<3 x s16>), [[UV11:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS]](<12 x s16>)
				; GFX8: [[CONCAT_VECTORS1:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[BUILD_VECTOR]](<3 x s16>), [[UV8]](<3 x s16>)
				; GFX8: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS1]](<6 x s16>)
				; GFX9-LABEL: name: ushlsat_v3s16
				; GFX9: [[COPY:%[0-9]+]]:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				; GFX9: [[UV:%[0-9]+]]:_(<3 x s16>), [[UV1:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[COPY]](<6 x s16>)
				; GFX9: [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16), [[UV4:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV]](<3 x s16>)
				; GFX9: [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[UV1]](<3 x s16>)
				; GFX9: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV2]], [[UV5]](s16)
				; GFX9: [[USHLSAT1:%[0-9]+]]:_(s16) = G_USHLSAT [[UV3]], [[UV6]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV4]], [[UV7]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV7]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV4]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[USHLSAT1]](s16), [[SELECT]](s16)
				; GFX9: [[DEF:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX9: [[DEF1:%[0-9]+]]:_(<4 x s16>) = G_IMPLICIT_DEF
				; GFX9: [[CONCAT_VECTORS:%[0-9]+]]:_(<12 x s16>) = G_CONCAT_VECTORS [[DEF]](<4 x s16>), [[DEF1]](<4 x s16>), [[DEF1]](<4 x s16>)
				; GFX9: [[UV8:%[0-9]+]]:_(<3 x s16>), [[UV9:%[0-9]+]]:_(<3 x s16>), [[UV10:%[0-9]+]]:_(<3 x s16>), [[UV11:%[0-9]+]]:_(<3 x s16>) = G_UNMERGE_VALUES [[CONCAT_VECTORS]](<12 x s16>)
				; GFX9: [[CONCAT_VECTORS1:%[0-9]+]]:_(<6 x s16>) = G_CONCAT_VECTORS [[BUILD_VECTOR]](<3 x s16>), [[UV8]](<3 x s16>)
				; GFX9: $vgpr0_vgpr1_vgpr2 = COPY [[CONCAT_VECTORS1]](<6 x s16>)
				%0:_(<6 x s16>) = COPY $vgpr0_vgpr1_vgpr2
				%1:_(<3 x s16>), %2:_(<3 x s16>) = G_UNMERGE_VALUES %0
				%3:_(<3 x s16>) = G_USHLSAT %1, %2
				%4:_(<3 x s16>) = G_IMPLICIT_DEF
				%5:_(<6 x s16>) = G_CONCAT_VECTORS %3, %4
				$vgpr0_vgpr1_vgpr2 = COPY %5
				...

				---
				name: ushlsat_v4s16
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: ushlsat_v4s16
				; GFX6: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX6: [[UV:%[0-9]+]]:_(<2 x s16>), [[UV1:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX6: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[UV]](<2 x s16>)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST]], [[C]](s32)
				; GFX6: [[BITCAST1:%[0-9]+]]:_(s32) = G_BITCAST [[UV1]](<2 x s16>)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST1]], [[C]](s32)
				; GFX6: [[UV2:%[0-9]+]]:_(<2 x s16>), [[UV3:%[0-9]+]]:_(<2 x s16>) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX6: [[BITCAST2:%[0-9]+]]:_(s32) = G_BITCAST [[UV2]](<2 x s16>)
				; GFX6: [[LSHR2:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST2]], [[C]](s32)
				; GFX6: [[BITCAST3:%[0-9]+]]:_(s32) = G_BITCAST [[UV3]](<2 x s16>)
				; GFX6: [[LSHR3:%[0-9]+]]:_(s32) = G_LSHR [[BITCAST3]], [[C]](s32)
				; GFX6: [[COPY2:%[0-9]+]]:_(s32) = COPY [[BITCAST]](s32)
				; GFX6: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
				; GFX6: [[COPY3:%[0-9]+]]:_(s32) = COPY [[BITCAST2]](s32)
				; GFX6: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C1]]
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[SHL]], [[AND]](s32)
				; GFX6: [[LSHR4:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[AND]](s32)
				; GFX6: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL]](s32), [[LSHR4]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C2]], [[SHL1]]
				; GFX6: [[LSHR5:%[0-9]+]]:_(s32) = G_LSHR [[SELECT]], [[C]](s32)
				; GFX6: [[COPY4:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
				; GFX6: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR2]](s32)
				; GFX6: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C1]]
				; GFX6: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
				; GFX6: [[SHL3:%[0-9]+]]:_(s32) = G_SHL [[SHL2]], [[AND1]](s32)
				; GFX6: [[LSHR6:%[0-9]+]]:_(s32) = G_LSHR [[SHL3]], [[AND1]](s32)
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL2]](s32), [[LSHR6]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C2]], [[SHL3]]
				; GFX6: [[LSHR7:%[0-9]+]]:_(s32) = G_LSHR [[SELECT1]], [[C]](s32)
				; GFX6: [[COPY6:%[0-9]+]]:_(s32) = COPY [[BITCAST1]](s32)
				; GFX6: [[COPY7:%[0-9]+]]:_(s32) = COPY [[BITCAST3]](s32)
				; GFX6: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C1]]
				; GFX6: [[SHL4:%[0-9]+]]:_(s32) = G_SHL [[COPY6]], [[C]](s32)
				; GFX6: [[SHL5:%[0-9]+]]:_(s32) = G_SHL [[SHL4]], [[AND2]](s32)
				; GFX6: [[LSHR8:%[0-9]+]]:_(s32) = G_LSHR [[SHL5]], [[AND2]](s32)
				; GFX6: [[ICMP2:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL4]](s32), [[LSHR8]]
				; GFX6: [[SELECT2:%[0-9]+]]:_(s32) = G_SELECT [[ICMP2]](s32), [[C2]], [[SHL5]]
				; GFX6: [[LSHR9:%[0-9]+]]:_(s32) = G_LSHR [[SELECT2]], [[C]](s32)
				; GFX6: [[COPY8:%[0-9]+]]:_(s32) = COPY [[LSHR1]](s32)
				; GFX6: [[COPY9:%[0-9]+]]:_(s32) = COPY [[LSHR3]](s32)
				; GFX6: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY9]], [[C1]]
				; GFX6: [[SHL6:%[0-9]+]]:_(s32) = G_SHL [[COPY8]], [[C]](s32)
				; GFX6: [[SHL7:%[0-9]+]]:_(s32) = G_SHL [[SHL6]], [[AND3]](s32)
				; GFX6: [[LSHR10:%[0-9]+]]:_(s32) = G_LSHR [[SHL7]], [[AND3]](s32)
				; GFX6: [[ICMP3:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[SHL6]](s32), [[LSHR10]]
				; GFX6: [[SELECT3:%[0-9]+]]:_(s32) = G_SELECT [[ICMP3]](s32), [[C2]], [[SHL7]]
				; GFX6: [[LSHR11:%[0-9]+]]:_(s32) = G_LSHR [[SELECT3]], [[C]](s32)
				; GFX6: [[COPY10:%[0-9]+]]:_(s32) = COPY [[LSHR5]](s32)
				; GFX6: [[AND4:%[0-9]+]]:_(s32) = G_AND [[COPY10]], [[C1]]
				; GFX6: [[COPY11:%[0-9]+]]:_(s32) = COPY [[LSHR7]](s32)
				; GFX6: [[AND5:%[0-9]+]]:_(s32) = G_AND [[COPY11]], [[C1]]
				; GFX6: [[SHL8:%[0-9]+]]:_(s32) = G_SHL [[AND5]], [[C]](s32)
				; GFX6: [[OR:%[0-9]+]]:_(s32) = G_OR [[AND4]], [[SHL8]]
				; GFX6: [[BITCAST4:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR]](s32)
				; GFX6: [[COPY12:%[0-9]+]]:_(s32) = COPY [[LSHR9]](s32)
				; GFX6: [[AND6:%[0-9]+]]:_(s32) = G_AND [[COPY12]], [[C1]]
				; GFX6: [[COPY13:%[0-9]+]]:_(s32) = COPY [[LSHR11]](s32)
				; GFX6: [[AND7:%[0-9]+]]:_(s32) = G_AND [[COPY13]], [[C1]]
				; GFX6: [[SHL9:%[0-9]+]]:_(s32) = G_SHL [[AND7]], [[C]](s32)
				; GFX6: [[OR1:%[0-9]+]]:_(s32) = G_OR [[AND6]], [[SHL9]]
				; GFX6: [[BITCAST5:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[OR1]](s32)
				; GFX6: [[CONCAT_VECTORS:%[0-9]+]]:_(<4 x s16>) = G_CONCAT_VECTORS [[BITCAST4]](<2 x s16>), [[BITCAST5]](<2 x s16>)
				; GFX6: $vgpr0_vgpr1 = COPY [[CONCAT_VECTORS]](<4 x s16>)
				; GFX8-LABEL: name: ushlsat_v4s16
				; GFX8: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX8: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16), [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX8: [[UV4:%[0-9]+]]:_(s16), [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX8: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV]], [[UV4]](s16)
				; GFX8: [[USHLSAT1:%[0-9]+]]:_(s16) = G_USHLSAT [[UV1]], [[UV5]](s16)
				; GFX8: [[USHLSAT2:%[0-9]+]]:_(s16) = G_USHLSAT [[UV2]], [[UV6]](s16)
				; GFX8: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV3]], [[UV7]](s16)
				; GFX8: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV7]](s16)
				; GFX8: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV3]](s16), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<4 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[USHLSAT1]](s16), [[USHLSAT2]](s16), [[SELECT]](s16)
				; GFX8: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<4 x s16>)
				; GFX9-LABEL: name: ushlsat_v4s16
				; GFX9: [[COPY:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $vgpr2_vgpr3
				; GFX9: [[UV:%[0-9]+]]:_(s16), [[UV1:%[0-9]+]]:_(s16), [[UV2:%[0-9]+]]:_(s16), [[UV3:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY]](<4 x s16>)
				; GFX9: [[UV4:%[0-9]+]]:_(s16), [[UV5:%[0-9]+]]:_(s16), [[UV6:%[0-9]+]]:_(s16), [[UV7:%[0-9]+]]:_(s16) = G_UNMERGE_VALUES [[COPY1]](<4 x s16>)
				; GFX9: [[USHLSAT:%[0-9]+]]:_(s16) = G_USHLSAT [[UV]], [[UV4]](s16)
				; GFX9: [[USHLSAT1:%[0-9]+]]:_(s16) = G_USHLSAT [[UV1]], [[UV5]](s16)
				; GFX9: [[USHLSAT2:%[0-9]+]]:_(s16) = G_USHLSAT [[UV2]], [[UV6]](s16)
				; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[UV3]], [[UV7]](s16)
				; GFX9: [[LSHR:%[0-9]+]]:_(s16) = G_LSHR [[SHL]], [[UV7]](s16)
				; GFX9: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s16) = G_ICMP intpred(ne), [[UV3]](s16), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s16) = G_SELECT [[ICMP]](s16), [[C]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<4 x s16>) = G_BUILD_VECTOR [[USHLSAT]](s16), [[USHLSAT1]](s16), [[USHLSAT2]](s16), [[SELECT]](s16)
				; GFX9: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<4 x s16>)
				%0:_(<4 x s16>) = COPY $vgpr0_vgpr1
				%1:_(<4 x s16>) = COPY $vgpr2_vgpr3
				%2:_(<4 x s16>) = G_USHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: ushlsat_s32
				body: |
				bb.0:
				liveins: $vgpr0, $vgpr1

				; GFX6-LABEL: name: ushlsat_s32
				; GFX6: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX6: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[COPY1]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX6: $vgpr0 = COPY [[SELECT]](s32)
				; GFX8-LABEL: name: ushlsat_s32
				; GFX8: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX8: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX8: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX8: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[COPY1]](s32)
				; GFX8: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX8: $vgpr0 = COPY [[SELECT]](s32)
				; GFX9-LABEL: name: ushlsat_s32
				; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
				; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
				; GFX9: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[COPY1]](s32)
				; GFX9: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[COPY1]](s32)
				; GFX9: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[COPY]](s32), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX9: $vgpr0 = COPY [[SELECT]](s32)
				%0:_(s32) = COPY $vgpr0
				%1:_(s32) = COPY $vgpr1
				%2:_(s32) = G_USHLSAT %0, %1
				$vgpr0 = COPY %2
				...

				---
				name: ushlsat_v2s32
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: ushlsat_v2s32
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX6: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX6: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX6: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX6: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[UV2]](s32)
				; GFX6: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX6: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX6: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[UV3]](s32)
				; GFX6: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[LSHR1]]
				; GFX6: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C]], [[SHL1]]
				; GFX6: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT]](s32), [[SELECT1]](s32)
				; GFX6: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				; GFX8-LABEL: name: ushlsat_v2s32
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX8: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX8: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX8: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX8: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[UV2]](s32)
				; GFX8: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX8: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX8: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[UV3]](s32)
				; GFX8: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[LSHR1]]
				; GFX8: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C]], [[SHL1]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT]](s32), [[SELECT1]](s32)
				; GFX8: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				; GFX9-LABEL: name: ushlsat_v2s32
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr2_vgpr3
				; GFX9: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
				; GFX9: [[UV2:%[0-9]+]]:_(s32), [[UV3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
				; GFX9: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[UV2]](s32)
				; GFX9: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[SHL]], [[UV2]](s32)
				; GFX9: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV]](s32), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s32) = G_SELECT [[ICMP]](s32), [[C]], [[SHL]]
				; GFX9: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[UV3]](s32)
				; GFX9: [[LSHR1:%[0-9]+]]:_(s32) = G_LSHR [[SHL1]], [[UV3]](s32)
				; GFX9: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ne), [[UV1]](s32), [[LSHR1]]
				; GFX9: [[SELECT1:%[0-9]+]]:_(s32) = G_SELECT [[ICMP1]](s32), [[C]], [[SHL1]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[SELECT]](s32), [[SELECT1]](s32)
				; GFX9: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
				%0:_(<2 x s32>) = COPY $vgpr0_vgpr1
				%1:_(<2 x s32>) = COPY $vgpr2_vgpr3
				%2:_(<2 x s32>) = G_USHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: ushlsat_s64
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1, $vgpr2_vgpr3

				; GFX6-LABEL: name: ushlsat_s64
				; GFX6: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX6: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX6: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX6: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[COPY1]](s64)
				; GFX6: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX6: $vgpr0_vgpr1 = COPY [[SELECT]](s64)
				; GFX8-LABEL: name: ushlsat_s64
				; GFX8: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX8: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX8: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX8: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[COPY1]](s64)
				; GFX8: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX8: $vgpr0_vgpr1 = COPY [[SELECT]](s64)
				; GFX9-LABEL: name: ushlsat_s64
				; GFX9: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
				; GFX9: [[COPY1:%[0-9]+]]:_(s64) = COPY $vgpr2_vgpr3
				; GFX9: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[COPY1]](s64)
				; GFX9: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[COPY1]](s64)
				; GFX9: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[COPY]](s64), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX9: $vgpr0_vgpr1 = COPY [[SELECT]](s64)
				%0:_(s64) = COPY $vgpr0_vgpr1
				%1:_(s64) = COPY $vgpr2_vgpr3
				%2:_(s64) = G_USHLSAT %0, %1
				$vgpr0_vgpr1 = COPY %2
				...

				---
				name: ushlsat_v2s64
				body: |
				bb.0:
				liveins: $vgpr0_vgpr1_vgpr2_vgpr3, $vgpr4_vgpr5_vgpr6_vgpr7

				; GFX6-LABEL: name: ushlsat_v2s64
				; GFX6: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX6: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX6: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX6: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX6: [[USHLSAT:%[0-9]+]]:_(s64) = G_USHLSAT [[UV]], [[UV2]](s64)
				; GFX6: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX6: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[UV3]](s64)
				; GFX6: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX6: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[LSHR]]
				; GFX6: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX6: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[USHLSAT]](s64), [[SELECT]](s64)
				; GFX6: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				; GFX8-LABEL: name: ushlsat_v2s64
				; GFX8: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX8: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX8: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX8: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX8: [[USHLSAT:%[0-9]+]]:_(s64) = G_USHLSAT [[UV]], [[UV2]](s64)
				; GFX8: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX8: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[UV3]](s64)
				; GFX8: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX8: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[LSHR]]
				; GFX8: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX8: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[USHLSAT]](s64), [[SELECT]](s64)
				; GFX8: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				; GFX9-LABEL: name: ushlsat_v2s64
				; GFX9: [[COPY:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				; GFX9: [[COPY1:%[0-9]+]]:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				; GFX9: [[UV:%[0-9]+]]:_(s64), [[UV1:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY]](<2 x s64>)
				; GFX9: [[UV2:%[0-9]+]]:_(s64), [[UV3:%[0-9]+]]:_(s64) = G_UNMERGE_VALUES [[COPY1]](<2 x s64>)
				; GFX9: [[USHLSAT:%[0-9]+]]:_(s64) = G_USHLSAT [[UV]], [[UV2]](s64)
				; GFX9: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[UV1]], [[UV3]](s64)
				; GFX9: [[LSHR:%[0-9]+]]:_(s64) = G_LSHR [[SHL]], [[UV3]](s64)
				; GFX9: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -1
				; GFX9: [[ICMP:%[0-9]+]]:_(s64) = G_ICMP intpred(ne), [[UV1]](s64), [[LSHR]]
				; GFX9: [[SELECT:%[0-9]+]]:_(s64) = G_SELECT [[ICMP]](s64), [[C]], [[SHL]]
				; GFX9: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[USHLSAT]](s64), [[SELECT]](s64)
				; GFX9: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
				%0:_(<2 x s64>) = COPY $vgpr0_vgpr1_vgpr2_vgpr3
				%1:_(<2 x s64>) = COPY $vgpr4_vgpr5_vgpr6_vgpr7
				%2:_(<2 x s64>) = G_USHLSAT %0, %1
				$vgpr0_vgpr1_vgpr2_vgpr3 = COPY %2
				...
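
Reading the legalized output in the checks above, wherever `G_USHLSAT` is expanded the same scalar pattern shows up: a `G_SHL`, a `G_LSHR` by the same amount, a `G_ICMP ne` against the original operand, and a `G_SELECT` of an all-ones constant. The following is a minimal Python sketch of that pattern, not the legalizer's code. The name `ushl_sat` and the explicit `bits` parameter are assumptions made for illustration, and shift amounts at or above the bit width are deliberately not modelled.

```python
def ushl_sat(x: int, y: int, bits: int) -> int:
    """Hypothetical reference model of an unsigned saturating left shift."""
    # Assumption: only shift amounts below the bit width are modelled here.
    assert 0 <= y < bits
    mask = (1 << bits) - 1
    x &= mask
    shifted = (x << y) & mask      # G_SHL, wrapped to the type width
    round_trip = shifted >> y      # G_LSHR, logical shift back
    # G_ICMP ne + G_SELECT: if bits were shifted out, clamp to all-ones.
    return mask if round_trip != x else shifted
```

For example, `ushl_sat(0x4000, 1, 16)` stays in range and yields `0x8000`, while `ushl_sat(0x4000, 2, 16)` loses the top bit and clamps to `0xFFFF`.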

llvm/test/CodeGen/X86/sshl_sat.ll

This file was added.
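
The X86 checks in this file appear to follow one recipe: shift left, arithmetic-shift the result back, compare with the original value, and on a mismatch pick the signed maximum or minimum according to the sign of the input (the `sets` plus `addl $32767` sequence produces `0x7FFF` or `0x8000`). The sketch below models that recipe in Python under stated assumptions. `sshl_sat`, `to_signed` and the `bits` parameter are hypothetical names, and shift amounts at or above the bit width are not modelled.

```python
def sshl_sat(x: int, y: int, bits: int) -> int:
    """Hypothetical reference model of a signed saturating left shift."""
    # Assumption: only shift amounts below the bit width are modelled here.
    assert 0 <= y < bits
    mask = (1 << bits) - 1
    sign = 1 << (bits - 1)

    def to_signed(v: int) -> int:
        # Reinterpret the low `bits` bits of v as a two's-complement value.
        v &= mask
        return v - (1 << bits) if v & sign else v

    x = to_signed(x)
    shifted = to_signed(x << y)    # shl, wrapped to the type width
    round_trip = shifted >> y      # sar: Python's >> is arithmetic on ints
    if round_trip == x:
        return shifted             # no bits lost, keep the shifted value
    # Overflow: saturate toward the sign of the original operand.
    return -sign if x < 0 else sign - 1
```

With 16-bit operands, `sshl_sat(0x4000, 1, 16)` overflows and saturates to `32767`, whereas `sshl_sat(-1, 15, 16)` fits exactly and returns `-32768` without saturating.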

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux | FileCheck %s --check-prefix=X64
				; RUN: llc < %s -mtriple=i686 -mattr=cmov | FileCheck %s --check-prefix=X86

				declare i4 @llvm.sshl.sat.i4 (i4, i4)
				declare i8 @llvm.sshl.sat.i8 (i8, i8)
				declare i15 @llvm.sshl.sat.i15 (i15, i15)
				declare i16 @llvm.sshl.sat.i16 (i16, i16)
				declare i18 @llvm.sshl.sat.i18 (i18, i18)
				declare i32 @llvm.sshl.sat.i32 (i32, i32)
				lebedev.ri (Done): Add `i32` test while at it?
				declare i64 @llvm.sshl.sat.i64 (i64, i64)

				lebedev.ri (Not Done): It is best to put vector tests into a separate file from the get-go. See e.g. `uadd_sat_vec.ll`.
				define i16 @func(i16 %x, i16 %y) nounwind {
				; X64-LABEL: func:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movswl %dx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %eax, %eax
				; X64-NEXT: testw %di, %di
				; X64-NEXT: sets %al
				; X64-NEXT: addl $32767, %eax # imm = 0x7FFF
				; X64-NEXT: cmpw %si, %di
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func:
				; X86: # %bb.0:
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movswl %si, %edi
				; X86-NEXT: sarl %cl, %edi
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testw %dx, %dx
				; X86-NEXT: sets %al
				; X86-NEXT: addl $32767, %eax # imm = 0x7FFF
				; X86-NEXT: cmpw %di, %dx
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: retl
				%tmp = call i16 @llvm.sshl.sat.i16(i16 %x, i16 %y)
				ret i16 %tmp
				}

				define i16 @func2(i8 %x, i8 %y) nounwind {
				; X64-LABEL: func2:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movsbl %dil, %eax
				; X64-NEXT: addl %eax, %eax
				; X64-NEXT: xorl %edx, %edx
				; X64-NEXT: testw %ax, %ax
				; X64-NEXT: sets %dl
				; X64-NEXT: addl $32767, %edx # imm = 0x7FFF
				; X64-NEXT: movl %eax, %esi
				; X64-NEXT: shll %cl, %esi
				; X64-NEXT: movswl %si, %edi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %edi
				; X64-NEXT: cmpw %di, %ax
				; X64-NEXT: cmovnel %edx, %esi
				; X64-NEXT: movswl %si, %eax
				; X64-NEXT: shrl %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func2:
				; X86: # %bb.0:
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movsbl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: addl %edx, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movswl %si, %edi
				; X86-NEXT: sarl %cl, %edi
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testw %dx, %dx
				; X86-NEXT: sets %al
				; X86-NEXT: addl $32767, %eax # imm = 0x7FFF
				; X86-NEXT: cmpw %di, %dx
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: cwtl
				; X86-NEXT: shrl %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: retl
				%x2 = sext i8 %x to i15
				%y2 = sext i8 %y to i15
				%tmp = call i15 @llvm.sshl.sat.i15(i15 %x2, i15 %y2)
				%tmp2 = sext i15 %tmp to i16
				ret i16 %tmp2
				}

				define i16 @func3(i15 %x, i8 %y) nounwind {
				; X64-LABEL: func3:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: shll $7, %ecx
				; X64-NEXT: addl %edi, %edi
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shll %cl, %eax
				; X64-NEXT: movswl %ax, %edx
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %edx
				; X64-NEXT: xorl %ecx, %ecx
				; X64-NEXT: testw %di, %di
				; X64-NEXT: sets %cl
				; X64-NEXT: addl $32767, %ecx # imm = 0x7FFF
				; X64-NEXT: cmpw %dx, %di
				; X64-NEXT: cmovel %eax, %ecx
				; X64-NEXT: movswl %cx, %eax
				; X64-NEXT: shrl %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func3:
				; X86: # %bb.0:
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: shll $7, %ecx
				; X86-NEXT: addl %edx, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movswl %si, %edi
				; X86-NEXT: # kill: def $cl killed $cl killed $ecx
				; X86-NEXT: sarl %cl, %edi
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testw %dx, %dx
				; X86-NEXT: sets %al
				; X86-NEXT: addl $32767, %eax # imm = 0x7FFF
				; X86-NEXT: cmpw %di, %dx
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: cwtl
				; X86-NEXT: shrl %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: retl
				%y2 = sext i8 %y to i15
				%y3 = shl i15 %y2, 7
				%tmp = call i15 @llvm.sshl.sat.i15(i15 %x, i15 %y3)
				%tmp2 = sext i15 %tmp to i16
				ret i16 %tmp2
				}

				define i4 @func4(i4 %x, i4 %y) nounwind {
				; X64-LABEL: func4:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: andb $15, %cl
				; X64-NEXT: shlb $4, %dil
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shlb %cl, %al
				; X64-NEXT: movzbl %al, %esi
				; X64-NEXT: movl %esi, %edx
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarb %cl, %dl
				; X64-NEXT: xorl %eax, %eax
				; X64-NEXT: testb %dil, %dil
				; X64-NEXT: sets %al
				; X64-NEXT: addl $127, %eax
				; X64-NEXT: cmpb %dl, %dil
				; X64-NEXT: cmovel %esi, %eax
				; X64-NEXT: sarb $4, %al
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func4:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: andb $15, %cl
				; X86-NEXT: movb {{[0-9]+}}(%esp), %dl
				; X86-NEXT: shlb $4, %dl
				; X86-NEXT: movb %dl, %ch
				; X86-NEXT: shlb %cl, %ch
				; X86-NEXT: movzbl %ch, %esi
				; X86-NEXT: sarb %cl, %ch
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testb %dl, %dl
				; X86-NEXT: sets %al
				; X86-NEXT: addl $127, %eax
				; X86-NEXT: cmpb %ch, %dl
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: sarb $4, %al
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i4 @llvm.sshl.sat.i4(i4 %x, i4 %y)
				ret i4 %tmp
				}

				define i64 @func5(i64 %x, i64 %y) nounwind {
				; X64-LABEL: func5:
				; X64: # %bb.0:
				; X64-NEXT: movq %rsi, %rcx
				; X64-NEXT: xorl %edx, %edx
				; X64-NEXT: testq %rdi, %rdi
				; X64-NEXT: sets %dl
				; X64-NEXT: movabsq $9223372036854775807, %rax # imm = 0x7FFFFFFFFFFFFFFF
				; X64-NEXT: addq %rdx, %rax
				; X64-NEXT: movq %rdi, %rdx
				; X64-NEXT: shlq %cl, %rdx
				; X64-NEXT: movq %rdx, %rsi
				; X64-NEXT: # kill: def $cl killed $cl killed $rcx
				; X64-NEXT: sarq %cl, %rsi
				; X64-NEXT: cmpq %rsi, %rdi
				; X64-NEXT: cmoveq %rdx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func5:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebx
				; X86-NEXT: movl %eax, %ebp
				; X86-NEXT: shll %cl, %ebp
				; X86-NEXT: shldl %cl, %eax, %ebx
				; X86-NEXT: xorl %edx, %edx
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovnel %ebp, %ebx
				; X86-NEXT: cmovnel %edx, %ebp
				; X86-NEXT: movl %ebx, %edx
				; X86-NEXT: sarl %cl, %edx
				; X86-NEXT: movl %ebx, %edi
				; X86-NEXT: sarl $31, %edi
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovel %edx, %edi
				; X86-NEXT: movl %ebp, %esi
				; X86-NEXT: shrdl %cl, %ebx, %esi
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovnel %edx, %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: xorl %ecx, %edi
				; X86-NEXT: xorl %eax, %esi
				; X86-NEXT: xorl %edx, %edx
				; X86-NEXT: testl %ecx, %ecx
				; X86-NEXT: movl $-1, %eax
				; X86-NEXT: movl $0, %ecx
				; X86-NEXT: cmovsl %ecx, %eax
				; X86-NEXT: sets %dl
				; X86-NEXT: addl $2147483647, %edx # imm = 0x7FFFFFFF
				; X86-NEXT: orl %edi, %esi
				; X86-NEXT: cmovel %ebp, %eax
				; X86-NEXT: cmovel %ebx, %edx
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl
				%tmp = call i64 @llvm.sshl.sat.i64(i64 %x, i64 %y)
				ret i64 %tmp
				}

				define i18 @func6(i16 %x, i16 %y) nounwind {
				; X64-LABEL: func6:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movswl %di, %edx
				; X64-NEXT: shll $14, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: shll %cl, %esi
				; X64-NEXT: movl %esi, %edi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %edi
				; X64-NEXT: xorl %eax, %eax
				; X64-NEXT: testl %edx, %edx
				; X64-NEXT: sets %al
				; X64-NEXT: addl $2147483647, %eax # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %edi, %edx
				; X64-NEXT: cmovel %esi, %eax
				; X64-NEXT: sarl $14, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func6:
				; X86: # %bb.0:
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movswl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: shll $14, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movl %esi, %edi
				; X86-NEXT: sarl %cl, %edi
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testl %edx, %edx
				; X86-NEXT: sets %al
				; X86-NEXT: addl $2147483647, %eax # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %edi, %edx
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: sarl $14, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: retl
				%x2 = sext i16 %x to i18
				%y2 = sext i16 %y to i18
				%tmp = call i18 @llvm.sshl.sat.i18(i18 %x2, i18 %y2)
				ret i18 %tmp
				}

				define i32 @func7(i32 %x, i32 %y) nounwind {
				; X64-LABEL: func7:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %eax, %eax
				; X64-NEXT: testl %edi, %edi
				; X64-NEXT: sets %al
				; X64-NEXT: addl $2147483647, %eax # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %esi, %edi
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func7:
				; X86: # %bb.0:
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movl %esi, %edi
				; X86-NEXT: sarl %cl, %edi
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testl %edx, %edx
				; X86-NEXT: sets %al
				; X86-NEXT: addl $2147483647, %eax # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %edi, %edx
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: retl
				%tmp = call i32 @llvm.sshl.sat.i32(i32 %x, i32 %y)
				ret i32 %tmp
				}

				define i8 @func8(i8 %x, i8 %y) nounwind {
				; X64-LABEL: func8:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shlb %cl, %al
				; X64-NEXT: movzbl %al, %esi
				; X64-NEXT: movl %esi, %edx
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarb %cl, %dl
				; X64-NEXT: xorl %eax, %eax
				; X64-NEXT: testb %dil, %dil
				; X64-NEXT: sets %al
				; X64-NEXT: addl $127, %eax
				; X64-NEXT: cmpb %dl, %dil
				; X64-NEXT: cmovel %esi, %eax
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func8:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movb {{[0-9]+}}(%esp), %dl
				; X86-NEXT: movb %dl, %ch
				; X86-NEXT: shlb %cl, %ch
				; X86-NEXT: movzbl %ch, %esi
				; X86-NEXT: sarb %cl, %ch
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testb %dl, %dl
				; X86-NEXT: sets %al
				; X86-NEXT: addl $127, %eax
				; X86-NEXT: cmpb %ch, %dl
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i8 @llvm.sshl.sat.i8(i8 %x, i8 %y)
				ret i8 %tmp
				}
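The signed scalar checks above all follow one pattern: shift left, arithmetic-shift the result back, and compare it with the original operand. On a mismatch the result is replaced by the signed maximum or minimum, chosen by the sign of `x` (the `sets` / `addl $2147483647` pair). The following is a minimal C++ sketch of that pattern for `i32`, not the code from the patch. The name `sshl_sat_i32` is hypothetical, and the sketch assumes a shift amount below the bit width and C++20 two's-complement shift behavior.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical reference model of llvm.sshl.sat.i32 that mirrors the
// shift / shift-back / compare sequence in the X64 checks for func7.
// Assumption: amt < 32, and signed shifts behave as in C++20.
std::int32_t sshl_sat_i32(std::int32_t x, std::uint32_t amt) {
  assert(amt < 32 && "shift amount must be below the bit width");
  // shll: shift the unsigned bit pattern so that overflow simply wraps.
  auto shifted =
      static_cast<std::int32_t>(static_cast<std::uint32_t>(x) << amt);
  // sarl: arithmetic shift back; this equals x only if no significant
  // bits were shifted out.
  std::int32_t round_trip = shifted >> amt;
  // sets + addl 0x7FFFFFFF: INT32_MAX for x >= 0, INT32_MIN for x < 0.
  std::int32_t saturated = x < 0 ? std::numeric_limits<std::int32_t>::min()
                                 : std::numeric_limits<std::int32_t>::max();
  // cmovel: keep the shifted value when the round trip reproduces x.
  return round_trip == x ? shifted : saturated;
}
```

For example, `sshl_sat_i32(1, 31)` saturates to the signed maximum, while `sshl_sat_i32(-1, 31)` is representable and returns the signed minimum unchanged.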

llvm/test/CodeGen/X86/sshl_sat_vec.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux | FileCheck %s --check-prefix=X64
				; RUN: llc < %s -mtriple=i686 -mattr=cmov | FileCheck %s --check-prefix=X86

				declare <4 x i32> @llvm.sshl.sat.v4i32(<4 x i32>, <4 x i32>)

				define <4 x i32> @vec(<4 x i32> %x, <4 x i32> %y) nounwind {
				; X64-LABEL: vec:
				; X64: # %bb.0:
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm0[3,3,3,3]
				; X64-NEXT: movd %xmm2, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm1[3,3,3,3]
				; X64-NEXT: movd %xmm2, %ecx
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %ecx, %ecx
				; X64-NEXT: testl %eax, %eax
				; X64-NEXT: sets %cl
				; X64-NEXT: addl $2147483647, %ecx # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: cmovel %edx, %ecx
				; X64-NEXT: movd %ecx, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm0[2,3,2,3]
				; X64-NEXT: movd %xmm3, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm1[2,3,2,3]
				; X64-NEXT: movd %xmm3, %ecx
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %ecx, %ecx
				; X64-NEXT: testl %eax, %eax
				; X64-NEXT: sets %cl
				; X64-NEXT: addl $2147483647, %ecx # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: cmovel %edx, %ecx
				; X64-NEXT: movd %ecx, %xmm3
				; X64-NEXT: punpckldq {{.*#+}} xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1]
				; X64-NEXT: movd %xmm0, %eax
				; X64-NEXT: movd %xmm1, %ecx
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %ecx, %ecx
				; X64-NEXT: testl %eax, %eax
				; X64-NEXT: sets %cl
				; X64-NEXT: addl $2147483647, %ecx # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: cmovel %edx, %ecx
				; X64-NEXT: movd %ecx, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm0[1,1,1,1]
				; X64-NEXT: movd %xmm0, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm1[1,1,1,1]
				; X64-NEXT: movd %xmm0, %ecx
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: sarl %cl, %esi
				; X64-NEXT: xorl %ecx, %ecx
				; X64-NEXT: testl %eax, %eax
				; X64-NEXT: sets %cl
				; X64-NEXT: addl $2147483647, %ecx # imm = 0x7FFFFFFF
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: cmovel %edx, %ecx
				; X64-NEXT: movd %ecx, %xmm0
				; X64-NEXT: punpckldq {{.*#+}} xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1]
				; X64-NEXT: punpcklqdq {{.*#+}} xmm2 = xmm2[0],xmm3[0]
				; X64-NEXT: movdqa %xmm2, %xmm0
				; X64-NEXT: retq
				;
				; X86-LABEL: vec:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %ch
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movl %edx, %edi
				; X86-NEXT: shll %cl, %edi
				; X86-NEXT: movl %edi, %ebp
				; X86-NEXT: sarl %cl, %ebp
				; X86-NEXT: xorl %ebx, %ebx
				; X86-NEXT: testl %edx, %edx
				; X86-NEXT: sets %bl
				; X86-NEXT: addl $2147483647, %ebx # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %ebp, %edx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebp
				; X86-NEXT: cmovel %edi, %ebx
				; X86-NEXT: movl %ebp, %edi
				; X86-NEXT: movb %ch, %cl
				; X86-NEXT: shll %cl, %edi
				; X86-NEXT: movl %edi, %eax
				; X86-NEXT: sarl %cl, %eax
				; X86-NEXT: xorl %edx, %edx
				; X86-NEXT: testl %ebp, %ebp
				; X86-NEXT: sets %dl
				; X86-NEXT: addl $2147483647, %edx # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %eax, %ebp
				; X86-NEXT: cmovel %edi, %edx
				; X86-NEXT: movl %esi, %edi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: shll %cl, %edi
				; X86-NEXT: movl %edi, %ebp
				; X86-NEXT: sarl %cl, %ebp
				; X86-NEXT: xorl %eax, %eax
				; X86-NEXT: testl %esi, %esi
				; X86-NEXT: sets %al
				; X86-NEXT: addl $2147483647, %eax # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %ebp, %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: cmovel %edi, %eax
				; X86-NEXT: movl %esi, %edi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: shll %cl, %edi
				; X86-NEXT: movl %edi, %ebp
				; X86-NEXT: sarl %cl, %ebp
				; X86-NEXT: xorl %ecx, %ecx
				; X86-NEXT: testl %esi, %esi
				; X86-NEXT: sets %cl
				; X86-NEXT: addl $2147483647, %ecx # imm = 0x7FFFFFFF
				; X86-NEXT: cmpl %ebp, %esi
				; X86-NEXT: cmovel %edi, %ecx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movl %ecx, 12(%esi)
				; X86-NEXT: movl %eax, 8(%esi)
				; X86-NEXT: movl %edx, 4(%esi)
				; X86-NEXT: movl %ebx, (%esi)
				; X86-NEXT: movl %esi, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl $4
				%tmp = call <4 x i32> @llvm.sshl.sat.v4i32(<4 x i32> %x, <4 x i32> %y)
				ret <4 x i32> %tmp
				}
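In the `vec` checks above no packed saturating-shift instruction appears: each lane is moved into a general-purpose register, run through the scalar sequence, and repacked. The sketch below models that element-wise behavior. It deliberately uses a different scalar formulation (widen to 64 bits, then clamp) to state the intended result independently of the shift-back trick. The names `sshl_sat_lane` and `sshl_sat_v4i32` are hypothetical, and shift amounts are assumed to be below 32.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>

// Hypothetical scalar model: compute x * 2^amt exactly in 64 bits,
// then clamp to the i32 range. Assumption: amt < 32, so the product
// cannot overflow int64_t.
static std::int32_t sshl_sat_lane(std::int32_t x, std::uint32_t amt) {
  assert(amt < 32 && "shift amount must be below the bit width");
  std::int64_t wide = static_cast<std::int64_t>(x) * (std::int64_t{1} << amt);
  std::int64_t lo = std::numeric_limits<std::int32_t>::min();
  std::int64_t hi = std::numeric_limits<std::int32_t>::max();
  return static_cast<std::int32_t>(std::clamp(wide, lo, hi));
}

// Hypothetical model of the <4 x i32> form: each lane is shifted by the
// corresponding lane of the shift-amount vector.
std::array<std::int32_t, 4> sshl_sat_v4i32(
    const std::array<std::int32_t, 4> &x,
    const std::array<std::uint32_t, 4> &amt) {
  std::array<std::int32_t, 4> out{};
  for (std::size_t i = 0; i < out.size(); ++i)
    out[i] = sshl_sat_lane(x[i], amt[i]);
  return out;
}
```

Lanes are independent, so one lane saturating has no effect on its neighbors.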

llvm/test/CodeGen/X86/ushl_sat.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux | FileCheck %s --check-prefix=X64
				; RUN: llc < %s -mtriple=i686 -mattr=cmov | FileCheck %s --check-prefix=X86

				declare i4 @llvm.ushl.sat.i4 (i4, i4)
				declare i8 @llvm.ushl.sat.i8 (i8, i8)
				declare i15 @llvm.ushl.sat.i15 (i15, i15)
				declare i16 @llvm.ushl.sat.i16 (i16, i16)
				lebedev.ri: It seems i8 is the only "basic" bit width missing. It would probably be good to have it.
				declare i18 @llvm.ushl.sat.i18 (i18, i18)
				declare i32 @llvm.ushl.sat.i32 (i32, i32)
				declare i64 @llvm.ushl.sat.i64 (i64, i64)

				define i16 @func(i16 %x, i16 %y) nounwind {
				; X64-LABEL: func:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shll %cl, %eax
				; X64-NEXT: movzwl %ax, %edx
				; X64-NEXT: movl %edx, %eax
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %eax
				; X64-NEXT: cmpw %ax, %di
				; X64-NEXT: movl $65535, %eax # imm = 0xFFFF
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shll %cl, %edx
				; X86-NEXT: movzwl %dx, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpw %si, %ax
				; X86-NEXT: movl $65535, %eax # imm = 0xFFFF
				; X86-NEXT: cmovel %edx, %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i16 @llvm.ushl.sat.i16(i16 %x, i16 %y)
				ret i16 %tmp
				}

				define i16 @func2(i8 %x, i8 %y) nounwind {
				; X64-LABEL: func2:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movsbl %dil, %eax
				; X64-NEXT: addl %eax, %eax
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movzwl %dx, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %esi
				; X64-NEXT: cmpw %si, %ax
				; X64-NEXT: movl $65535, %eax # imm = 0xFFFF
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: cwtl
				; X64-NEXT: shrl %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func2:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movsbl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: addl %eax, %eax
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shll %cl, %edx
				; X86-NEXT: movzwl %dx, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpw %si, %ax
				; X86-NEXT: movl $65535, %eax # imm = 0xFFFF
				; X86-NEXT: cmovel %edx, %eax
				; X86-NEXT: cwtl
				; X86-NEXT: shrl %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%x2 = sext i8 %x to i15
				%y2 = sext i8 %y to i15
				%tmp = call i15 @llvm.ushl.sat.i15(i15 %x2, i15 %y2)
				%tmp2 = sext i15 %tmp to i16
				ret i16 %tmp2
				}

				define i16 @func3(i15 %x, i8 %y) nounwind {
				; X64-LABEL: func3:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: shll $7, %ecx
				; X64-NEXT: addl %edi, %edi
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shll %cl, %eax
				; X64-NEXT: movzwl %ax, %eax
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %edx
				; X64-NEXT: cmpw %dx, %di
				; X64-NEXT: movl $65535, %ecx # imm = 0xFFFF
				; X64-NEXT: cmovel %eax, %ecx
				; X64-NEXT: movswl %cx, %eax
				; X64-NEXT: shrl %eax
				; X64-NEXT: # kill: def $ax killed $ax killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func3:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: shll $7, %ecx
				; X86-NEXT: addl %eax, %eax
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shll %cl, %edx
				; X86-NEXT: movzwl %dx, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: # kill: def $cl killed $cl killed $ecx
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpw %si, %ax
				; X86-NEXT: movl $65535, %eax # imm = 0xFFFF
				; X86-NEXT: cmovel %edx, %eax
				; X86-NEXT: cwtl
				; X86-NEXT: shrl %eax
				; X86-NEXT: # kill: def $ax killed $ax killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%y2 = sext i8 %y to i15
				%y3 = shl i15 %y2, 7
				%tmp = call i15 @llvm.ushl.sat.i15(i15 %x, i15 %y3)
				%tmp2 = sext i15 %tmp to i16
				ret i16 %tmp2
				}

				define i4 @func4(i4 %x, i4 %y) nounwind {
				; X64-LABEL: func4:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: andb $15, %cl
				; X64-NEXT: shlb $4, %dil
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shlb %cl, %al
				; X64-NEXT: movzbl %al, %edx
				; X64-NEXT: movl %edx, %eax
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrb %cl, %al
				; X64-NEXT: cmpb %al, %dil
				; X64-NEXT: movl $255, %eax
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: shrb $4, %al
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func4:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: andb $15, %cl
				; X86-NEXT: movb {{[0-9]+}}(%esp), %al
				; X86-NEXT: shlb $4, %al
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shlb %cl, %dl
				; X86-NEXT: movzbl %dl, %esi
				; X86-NEXT: shrb %cl, %dl
				; X86-NEXT: cmpb %dl, %al
				; X86-NEXT: movl $255, %eax
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: shrb $4, %al
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i4 @llvm.ushl.sat.i4(i4 %x, i4 %y)
				ret i4 %tmp
				}

				define i64 @func5(i64 %x, i64 %y) nounwind {
				; X64-LABEL: func5:
				; X64: # %bb.0:
				; X64-NEXT: movq %rsi, %rcx
				; X64-NEXT: movq %rdi, %rdx
				; X64-NEXT: shlq %cl, %rdx
				; X64-NEXT: movq %rdx, %rax
				; X64-NEXT: # kill: def $cl killed $cl killed $rcx
				; X64-NEXT: shrq %cl, %rax
				; X64-NEXT: cmpq %rax, %rdi
				; X64-NEXT: movq $-1, %rax
				; X64-NEXT: cmoveq %rdx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func5:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movl %edi, %esi
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: shldl %cl, %edi, %edx
				; X86-NEXT: xorl %ebx, %ebx
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovnel %esi, %edx
				; X86-NEXT: cmovnel %ebx, %esi
				; X86-NEXT: movl %edx, %ebp
				; X86-NEXT: shrl %cl, %ebp
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovel %ebp, %ebx
				; X86-NEXT: movl %esi, %eax
				; X86-NEXT: shrdl %cl, %edx, %eax
				; X86-NEXT: testb $32, %cl
				; X86-NEXT: cmovnel %ebp, %eax
				; X86-NEXT: xorl %edi, %eax
				; X86-NEXT: xorl {{[0-9]+}}(%esp), %ebx
				; X86-NEXT: orl %eax, %ebx
				; X86-NEXT: movl $-1, %eax
				; X86-NEXT: cmovnel %eax, %esi
				; X86-NEXT: cmovnel %eax, %edx
				; X86-NEXT: movl %esi, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl
				%tmp = call i64 @llvm.ushl.sat.i64(i64 %x, i64 %y)
				ret i64 %tmp
				}

				define i18 @func6(i16 %x, i16 %y) nounwind {
				; X64-LABEL: func6:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movswl %di, %eax
				; X64-NEXT: shll $14, %eax
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %esi
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: movl $-1, %eax
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: shrl $14, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func6:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movswl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: shll $14, %eax
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shll %cl, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpl %esi, %eax
				; X86-NEXT: movl $-1, %eax
				; X86-NEXT: cmovel %edx, %eax
				; X86-NEXT: shrl $14, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%x2 = sext i16 %x to i18
				%y2 = sext i16 %y to i18
				%tmp = call i18 @llvm.ushl.sat.i18(i18 %x2, i18 %y2)
				ret i18 %tmp
				}

				define i32 @func7(i32 %x, i32 %y) nounwind {
				; X64-LABEL: func7:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %eax
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %eax
				; X64-NEXT: cmpl %eax, %edi
				; X64-NEXT: movl $-1, %eax
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func7:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shll %cl, %edx
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpl %esi, %eax
				; X86-NEXT: movl $-1, %eax
				; X86-NEXT: cmovel %edx, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i32 @llvm.ushl.sat.i32(i32 %x, i32 %y)
				ret i32 %tmp
				}

				define i8 @func8(i8 %x, i8 %y) nounwind {
				; X64-LABEL: func8:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %ecx
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: shlb %cl, %al
				; X64-NEXT: movzbl %al, %edx
				; X64-NEXT: movl %edx, %eax
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrb %cl, %al
				; X64-NEXT: cmpb %al, %dil
				; X64-NEXT: movl $255, %eax
				; X64-NEXT: cmovel %edx, %eax
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func8:
				; X86: # %bb.0:
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movb {{[0-9]+}}(%esp), %al
				; X86-NEXT: movl %eax, %edx
				; X86-NEXT: shlb %cl, %dl
				; X86-NEXT: movzbl %dl, %esi
				; X86-NEXT: shrb %cl, %dl
				; X86-NEXT: cmpb %dl, %al
				; X86-NEXT: movl $255, %eax
				; X86-NEXT: cmovel %esi, %eax
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: popl %esi
				; X86-NEXT: retl
				%tmp = call i8 @llvm.ushl.sat.i8(i8 %x, i8 %y)
				ret i8 %tmp
				}
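The unsigned checks above use a simpler variant of the same idea: shift left, logical-shift back, compare, and select all-ones on a mismatch. For the non-native `i4` case (`func4`), the operand is first moved into the high bits of an `i8`, so that overflow out of the narrow type becomes overflow out of the wide one, and the result is shifted back down afterwards. The following C++ sketch illustrates both steps. The names `ushl_sat_i8` and `ushl_sat_i4` are hypothetical, and shift amounts are assumed to be below the respective bit widths.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of llvm.ushl.sat.i8, following the func8 checks.
// Assumption: amt < 8.
std::uint8_t ushl_sat_i8(std::uint8_t x, unsigned amt) {
  assert(amt < 8 && "shift amount must be below the bit width");
  auto shifted = static_cast<std::uint8_t>(x << amt);           // shlb
  auto round_trip = static_cast<std::uint8_t>(shifted >> amt);  // shrb
  // movl $255 + cmovel: all-ones unless the round trip reproduces x.
  return round_trip == x ? shifted : std::uint8_t{0xFF};
}

// Hypothetical model of llvm.ushl.sat.i4 promoted to i8, following the
// func4 checks. Assumption: x < 16 and amt < 4.
std::uint8_t ushl_sat_i4(std::uint8_t x, std::uint8_t amt) {
  assert(x < 16 && amt < 4 && "operands must fit the i4 contract");
  auto widened = static_cast<std::uint8_t>(x << 4);   // shlb $4
  std::uint8_t r = ushl_sat_i8(widened, amt & 0x0F);  // andb $15
  return static_cast<std::uint8_t>(r >> 4);           // shrb $4
}
```

With this placement the saturated `i8` value `0xFF` becomes `0xF` after the final shift, which is the `i4` maximum, so no separate clamp is needed for the narrow type.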

llvm/test/CodeGen/X86/ushl_sat_vec.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux | FileCheck %s --check-prefix=X64
				; RUN: llc < %s -mtriple=i686 -mattr=cmov | FileCheck %s --check-prefix=X86

				declare <4 x i32> @llvm.ushl.sat.v4i32(<4 x i32>, <4 x i32>)

				define <4 x i32> @vec(<4 x i32> %x, <4 x i32> %y) nounwind {
				; X64-LABEL: vec:
				; X64: # %bb.0:
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm0[3,3,3,3]
				; X64-NEXT: movd %xmm2, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm1[3,3,3,3]
				; X64-NEXT: movd %xmm2, %ecx
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shll %cl, %edx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %esi
				; X64-NEXT: cmpl %esi, %eax
				; X64-NEXT: movl $-1, %eax
				; X64-NEXT: cmovnel %eax, %edx
				; X64-NEXT: movd %edx, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm0[2,3,2,3]
				; X64-NEXT: movd %xmm3, %edx
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm1[2,3,2,3]
				; X64-NEXT: movd %xmm3, %ecx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: shll %cl, %esi
				; X64-NEXT: movl %esi, %edi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %edi
				; X64-NEXT: cmpl %edi, %edx
				; X64-NEXT: cmovnel %eax, %esi
				; X64-NEXT: movd %esi, %xmm3
				; X64-NEXT: punpckldq {{.*#+}} xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1]
				; X64-NEXT: movd %xmm0, %edx
				; X64-NEXT: movd %xmm1, %ecx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: shll %cl, %esi
				; X64-NEXT: movl %esi, %edi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %edi
				; X64-NEXT: cmpl %edi, %edx
				; X64-NEXT: cmovnel %eax, %esi
				; X64-NEXT: movd %esi, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm0[1,1,1,1]
				; X64-NEXT: movd %xmm0, %edx
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm1[1,1,1,1]
				; X64-NEXT: movd %xmm0, %ecx
				; X64-NEXT: movl %edx, %esi
				; X64-NEXT: shll %cl, %esi
				; X64-NEXT: movl %esi, %edi
				; X64-NEXT: # kill: def $cl killed $cl killed $ecx
				; X64-NEXT: shrl %cl, %edi
				; X64-NEXT: cmpl %edi, %edx
				; X64-NEXT: cmovnel %eax, %esi
				; X64-NEXT: movd %esi, %xmm0
				; X64-NEXT: punpckldq {{.*#+}} xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1]
				; X64-NEXT: punpcklqdq {{.*#+}} xmm2 = xmm2[0],xmm3[0]
				; X64-NEXT: movdqa %xmm2, %xmm0
				; X64-NEXT: retq
				;
				; X86-LABEL: vec:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %al
				; X86-NEXT: movb {{[0-9]+}}(%esp), %ch
				; X86-NEXT: movb {{[0-9]+}}(%esp), %ah
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edi
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edx
				; X86-NEXT: movl %edx, %ebp
				; X86-NEXT: shll %cl, %ebp
				; X86-NEXT: movl %ebp, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpl %esi, %edx
				; X86-NEXT: movl $-1, %edx
				; X86-NEXT: cmovnel %edx, %ebp
				; X86-NEXT: movl %edi, %ebx
				; X86-NEXT: movb %ah, %cl
				; X86-NEXT: shll %cl, %ebx
				; X86-NEXT: movl %ebx, %esi
				; X86-NEXT: shrl %cl, %esi
				; X86-NEXT: cmpl %esi, %edi
				; X86-NEXT: cmovnel %edx, %ebx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movb %ch, %cl
				; X86-NEXT: shll %cl, %esi
				; X86-NEXT: movl %esi, %edi
				; X86-NEXT: shrl %cl, %edi
				; X86-NEXT: cmpl %edi, {{[0-9]+}}(%esp)
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edi
				; X86-NEXT: cmovnel %edx, %esi
				; X86-NEXT: movl %eax, %ecx
				; X86-NEXT: shll %cl, %edi
				; X86-NEXT: movl %edi, %eax
				; X86-NEXT: shrl %cl, %eax
				; X86-NEXT: cmpl %eax, {{[0-9]+}}(%esp)
				; X86-NEXT: cmovnel %edx, %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movl %edi, 12(%eax)
				; X86-NEXT: movl %esi, 8(%eax)
				; X86-NEXT: movl %ebx, 4(%eax)
				; X86-NEXT: movl %ebp, (%eax)
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl $4
				%tmp = call <4 x i32> @llvm.ushl.sat.v4i32(<4 x i32> %x, <4 x i32> %y)
				ret <4 x i32> %tmp
				}

[Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics.
Closed, Public

Diff 283889

llvm/docs/GlobalISel/GenericOpcode.rst

llvm/docs/LangRef.rst

llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/CodeGen/TargetLowering.h

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/Support/TargetOpcodes.def

llvm/include/llvm/Target/GenericOpcodes.td

llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td

llvm/include/llvm/Target/TargetSelectionDAG.td

llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

llvm/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/lib/IR/Verifier.cpp

llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-sat.ll

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sshlsat.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ushlsat.mir

llvm/test/CodeGen/X86/sshl_sat.ll

llvm/test/CodeGen/X86/sshl_sat_vec.ll

llvm/test/CodeGen/X86/ushl_sat.ll

llvm/test/CodeGen/X86/ushl_sat_vec.ll
