This is an archive of the discontinued LLVM Phabricator instance.

Differential D103427

[X86] Fix handling of maskmovdqu in X32
ClosedPublic

Authored by hvdijk on May 31 2021, 2:42 PM.

Details

Reviewers

craig.topper
MaskRay
RKSimon

Commits

rGa8ad91705439: [X86] Fix handling of maskmovdqu in X32

Summary

The maskmovdqu instruction has a 32-bit and a 64-bit variant, the former using EDI and the latter RDI, but the register is used implicitly. In 64-bit mode, a 0x67 prefix selects the variant that uses EDI, but assembly syntax has no way to express this in a single instruction; the only way is with an explicit addr32.

This change adds support for the instruction. When generating assembly text, that explicit addr32 will be added. When not generating assembly text, it will be kept as a single instruction and will be emitted with that 0x67 prefix. When parsing assembly text, it will be re-parsed as ADDR32 followed by MASKMOVDQU64, which still results in the correct bytes when converted to machine code.

The same applies to vmaskmovdqu as well.
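
As a rough illustration of the byte-level difference described in the summary, the sketch below builds the encodings by hand. `encodeMaskmovdqu` is a hypothetical helper written for this note, not an LLVM API, and it only handles xmm0 through xmm7 so that no REX or extended VEX bits are needed.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of LLVM): encode (v)maskmovdqu for xmm0..xmm7.
// 'src' and 'mask' are register numbers in Intel operand order
// (maskmovdqu src, mask); AT&T syntax lists them in the reverse order.
std::vector<uint8_t> encodeMaskmovdqu(unsigned src, unsigned mask,
                                      bool addr32, bool vex) {
  assert(src < 8 && mask < 8 && "sketch handles xmm0..xmm7 only");
  std::vector<uint8_t> bytes;
  if (addr32)
    bytes.push_back(0x67); // address-size override: EDI instead of RDI
  if (vex) {
    bytes.push_back(0xC5); // two-byte VEX prefix
    bytes.push_back(0xF9); // inverted R=1, vvvv=1111, L=0, pp=01 (66)
  } else {
    bytes.push_back(0x66); // mandatory 66 prefix
    bytes.push_back(0x0F); // two-byte opcode escape
  }
  bytes.push_back(0xF7);   // opcode
  // ModRM with mod=11: reg field holds src, r/m field holds mask.
  bytes.push_back(static_cast<uint8_t>(0xC0 | (src << 3) | mask));
  return bytes;
}
```

Under these assumptions, `addr32 maskmovdqu %xmm1, %xmm0` (AT&T order, so the source is xmm0 and the mask is xmm1) comes out as 67 66 0F F7 C1, and the VEX form as 67 C5 F9 F7 C1.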

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hvdijk created this revision. · May 31 2021, 2:42 PM

Herald added subscribers: pengfei, hiraditya. · May 31 2021, 2:42 PM

hvdijk requested review of this revision. · May 31 2021, 2:42 PM

Herald added a project: Restricted Project. · May 31 2021, 2:42 PM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B106959: Diff 348864. · May 31 2021, 3:16 PM

craig.topper added inline comments. · May 31 2021, 9:18 PM

llvm/lib/Target/X86/X86InstrSSE.td
4015	Can we make this CodeGenOnly=1 so the disassembler tables don't need to be updated? Especially since there are no disassembler tests in this patch.

hvdijk added inline comments. · May 31 2021, 11:29 PM

llvm/lib/Target/X86/X86InstrSSE.td
4015	We do need that for correct disassembly. It seems like we have rather limited testing of the disassembler in general; I could add a new test specifically for this that only tests these instructions, but if there's some approach that lets us test the disassembler for all (most) instructions in one go that might be better.

After adding a new test for the disassembly of these instructions, I found that I had not fully tested my previous version's attempt at disassembling: it only correctly handled the disassembly of addr32 maskmovdqu, not that of addr32 vmaskmovdqu. This version fixes it and adds tests for it. The instruction classes confuse me, so I may have missed a simpler way of achieving the same result.

Harbormaster completed remote builds in B107831: Diff 350081. · Jun 5 2021, 4:18 PM

ping, re-tested today, still applies and passes tests.

In D103427#2881558, @hvdijk wrote:

ping, re-tested today, still applies and passes tests.

I don't see a disassembler test. Did I miss it?

In D103427#2881563, @craig.topper wrote:

I don't see a disassembler test. Did I miss it?

The disassembly is tested in maskmovdqu64.s.

LGTM

This revision is now accepted and ready to land. · Jul 15 2021, 2:50 PM

This revision was landed with ongoing or failed builds. · Jul 15 2021, 2:56 PM

Closed by commit rGa8ad91705439: [X86] Fix handling of maskmovdqu in X32 (authored by hvdijk).

This revision was automatically updated to reflect the committed changes.

hvdijk added a commit: rGa8ad91705439: [X86] Fix handling of maskmovdqu in X32.

I'm sorry to say that this patch introduced a serious regression in the disassembler. Almost all VEX instructions with an address-size prefix can no longer be decoded because of this change. The context IC_64BIT_VEX_OPSIZE_ADSIZE should never have been added, because an addr32 prefix can always be added to any VEX instruction with a memory operand. I think the best way to support maskmovdqu in X32 is to support something like ExplicitVEXPrefix in X86InstrFormats.td.
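
To make the reported failure mode concrete, here is a deliberately simplified toy model. It is not LLVM's actual table layout: `ToyCtx`, `ToyTable`, and `decodeToy` are invented for this note, and the entry names are only labels. It assumes that once a 0x67 prefix selects a dedicated context, any opcode without an entry in that context fails to decode.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>

// Toy decoder contexts (assumption: two contexts are enough to show the idea).
enum class ToyCtx { VexOpSize, VexOpSizeAdSize };
using ToyTable = std::map<std::pair<ToyCtx, unsigned>, std::string>;

// Only opcode 0xF7 receives an entry in the address-size context.
ToyTable makeToyTable() {
  ToyTable t;
  t[std::make_pair(ToyCtx::VexOpSize, 0x6Fu)] = "other VEX.66 instruction";
  t[std::make_pair(ToyCtx::VexOpSize, 0xF7u)] = "VMASKMOVDQU64";
  t[std::make_pair(ToyCtx::VexOpSizeAdSize, 0xF7u)] = "VMASKMOVDQUX32";
  return t;
}

// Returns the matched entry, or an empty string when decoding fails.
// 'dedicatedAdSizeContext' models whether 0x67 switches to its own context.
std::string decodeToy(const ToyTable &t, unsigned opcode, bool has67,
                      bool dedicatedAdSizeContext) {
  ToyCtx ctx = (has67 && dedicatedAdSizeContext) ? ToyCtx::VexOpSizeAdSize
                                                 : ToyCtx::VexOpSize;
  auto it = t.find(std::make_pair(ctx, opcode));
  return it == t.end() ? std::string() : it->second;
}

// Usage: with the dedicated context, a prefixed 0x6F no longer decodes.
void demoToyDecode() {
  ToyTable t = makeToyTable();
  assert(decodeToy(t, 0x6F, true, true).empty());
  assert(!decodeToy(t, 0x6F, true, false).empty());
}
```

In this model the prefixed 0xF7 still decodes either way, which matches the observation that the problem only showed up on other instructions.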

Herald added a project: Restricted Project. · Mar 24 2022, 6:27 PM

Herald added a subscriber: StephenFan.

Opened an issue here: https://github.com/llvm/llvm-project/issues/54540

skan added a reverting change: D122448: Revert "[X86] Fix handling of maskmovdqu in X32". · Mar 24 2022, 6:41 PM

skan added inline comments. · Mar 24 2022, 7:31 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	BTW, why do we need to add an instruction like `MASKMOVDQUX32` when we already have `MASKMOVDQU`? They have the same encoding and decoding except for an address-size prefix. We can definitely encode a `0x67` during encoding and print "addr32" during decoding according to the mode without adding any new instruction. If removing `Not64BitMode` from `MASKMOVDQU` causes an ISEL issue, we should fix it in ISEL.

craig.topper added inline comments. · Mar 24 2022, 7:55 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	The address register is implicit in the instruction. Our addr32 emission is normally based on the register class of the address registers. But we don't have that here. Are you suggesting to hardcode a special case for MASKMOVDQU in the encoder?

skan added inline comments. · Mar 24 2022, 8:10 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	Hardcoding is one solution. Another is to add `X86::IP_HAS_AD_SIZE` to the `Flags` of the `MCInst` when translating a `MachineInstr` to an `MCInst`, so that a 0x67 will be emitted. `MASKMOVDQU` accesses memory and has an implicit use of `EDI`, and we can get that information from the `MachineInstr`.
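
The flag-based idea in this comment can be sketched with toy types. Everything below is hypothetical and written only for this note: `ToyMachineInstr`, `ToyMCInst`, and `kToyHasAdSize` stand in for `MachineInstr`, `MCInst`, and `X86::IP_HAS_AD_SIZE`, and the flag value is arbitrary rather than LLVM's.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Toy stand-ins for this note only; none of these are LLVM types.
enum class ToyReg { EDI, RDI };
constexpr unsigned kToyHasAdSize = 1u << 0; // plays the role of IP_HAS_AD_SIZE

struct ToyMachineInstr {
  bool isMaskmovdqu;                // true for (V)MASKMOVDQU
  std::vector<ToyReg> implicitUses; // implicit register uses
};

struct ToyMCInst {
  unsigned flags = 0;
};

// Lowering step: in 64-bit mode, an implicit use of EDI marks the
// instruction as needing an address-size prefix.
ToyMCInst lowerToy(const ToyMachineInstr &mi, bool in64BitMode) {
  ToyMCInst out;
  if (in64BitMode && mi.isMaskmovdqu)
    for (ToyReg r : mi.implicitUses)
      if (r == ToyReg::EDI)
        out.flags |= kToyHasAdSize;
  return out;
}

// Encoder step: emit 0x67 whenever the flag is set.
std::vector<uint8_t> prefixBytes(const ToyMCInst &inst) {
  std::vector<uint8_t> bytes;
  if (inst.flags & kToyHasAdSize)
    bytes.push_back(0x67);
  return bytes;
}

// Usage: the EDI form gets the prefix in 64-bit mode, the RDI form does not.
void demoToyLower() {
  assert(prefixBytes(lowerToy({true, {ToyReg::EDI}}, true)).size() == 1);
  assert(prefixBytes(lowerToy({true, {ToyReg::RDI}}, true)).empty());
}
```

The point of the design is that the encoder needs no new instruction definition; the decision is made once during lowering and carried as a flag.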

craig.topper added inline comments. · Mar 24 2022, 9:22 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	The implicit def will always be RDI because it's part of the tablegen Uses. I think. So SelectionDAG will always create the MachineInstr with it.

skan added inline comments. · Mar 24 2022, 9:37 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	Do you mean MASKMOVDQU uses `RDI` rather than `EDI` in the MachineInstr?

craig.topper added inline comments. · Mar 24 2022, 9:41 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	Oh nevermind, I should have looked more carefully. I didn't realize we had 3 instructions and you want to reduce to 2.

hvdijk added inline comments. · Mar 26 2022, 6:52 PM

llvm/lib/Target/X86/X86InstrSSE.td

4032–4039

> We can definitely encode a `0x67` during encoding and print "addr32" during decoding according to the mode without adding any new instruction.

The 0x67 during encoding is handled automatically if we mark the instruction AdSize32, but we also need to print the addr32 when printing in text form, which is what I had trouble with originally. However, trying again to reuse the existing MASKMOVDQU and VMASKMOVDQU, we can actually get that working with something along these lines instead:

--- a/llvm/lib/Target/X86/MCTargetDesc/X86ATTInstPrinter.cpp
+++ b/llvm/lib/Target/X86/MCTargetDesc/X86ATTInstPrinter.cpp
@@ -69,8 +69,12 @@ void X86ATTInstPrinter::printInst(const MCInst *MI, uint64_t Address,
     OS << "\tdata32";
   }
   // Try to print any aliases first.
-  else if (!printAliasInstr(MI, Address, OS) && !printVecCompareInstr(MI, OS))
+  else if (!printAliasInstr(MI, Address, OS) && !printVecCompareInstr(MI, OS)) {
+    if ((MI->getOpcode() == X86::MASKMOVDQU || MI->getOpcode() == X86::VMASKMOVDQU) &&
+        STI.getFeatureBits()[X86::Is64Bit])
+      OS << "\taddr32\n";
     printInstruction(MI, Address, OS);
+  }
 
   // Next always print the annotation.
   printAnnotation(OS, Annot);

Is that about what you had in mind too? I'll continue with this approach and see if it passes your and my tests.
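
For readers without the LLVM sources at hand, the control flow of that printer change can be mimicked with a standalone sketch. `ToyOpcode` and `printToyInst` are invented for this note and only imitate the shape of the diff above; they are not the real `X86ATTInstPrinter` interface.

```cpp
#include <cassert>
#include <string>

// Toy opcode set (assumption: only the two special-cased opcodes matter).
enum class ToyOpcode { MASKMOVDQU, VMASKMOVDQU, Other };

// Print an already-formatted instruction body, preceded by an explicit
// addr32 line when a 32-bit-address maskmovdqu is printed in 64-bit mode.
std::string printToyInst(ToyOpcode op, bool is64Bit, const std::string &body) {
  std::string out;
  if ((op == ToyOpcode::MASKMOVDQU || op == ToyOpcode::VMASKMOVDQU) && is64Bit)
    out += "\taddr32\n";
  out += body;
  return out;
}

// Usage: the prefix line appears only for the special-cased opcodes.
void demoToyPrint() {
  const std::string body = "\tmaskmovdqu\t%xmm1, %xmm0";
  assert(printToyInst(ToyOpcode::MASKMOVDQU, true, body) == "\taddr32\n" + body);
  assert(printToyInst(ToyOpcode::Other, true, body) == body);
}
```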

hvdijk marked an inline comment as not done. · Mar 26 2022, 7:22 PM

hvdijk added inline comments.

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	That should be handled in `printInstFlags` instead, which already checks whether `addr32` needs to be printed but does not handle this case. The important part of the question, however, is whether this should be a special hardcoded exception for (V)MASKMOVDQU, or whether there should be something to automatically detect the need for the prefix.

skan added inline comments. · Mar 26 2022, 8:19 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	Exactly not... Let me propose a patch to illustrate that.

skan mentioned this in D122537: [X86] Support maskmovdqu,vmaskmovdqu in X32. · Mar 26 2022, 8:41 PM

@hvdijk D122448 + D122537 can fix both bugs.

hvdijk marked an inline comment as not done. · Mar 26 2022, 8:46 PM

hvdijk added inline comments.

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	Looking at your D122537: so they do need to be special cased, it's just that you moved the special casing into `X86MCInstLower::Lower`. I had already come up with that independently as well while cleaning up my own special casing. I have a little bit more than what you put in D122537 though, let me check whether that is still needed.

skan added inline comments. · Mar 26 2022, 8:57 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	No. The two functions are in different stages. The decoding for `maskmovdqu/vmaskmovdqu` is already correct if we revert this change; no further fix is needed. `printInstFlags` or `printInst` is used mostly for the disassembler, so we do need to touch it. `X86MCInstLower::Lower` is used to translate a MachineInstr to an MCInst, and is where the address-size prefix is added when lowering from MIR.

hvdijk added inline comments. · Mar 26 2022, 9:00 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	So, yes. You're saying exactly what I'm saying. I don't have a change in `printInstFlags` or `printInst` any longer either because like I said, I moved the special casing to `X86MCInstLower::Lower` exactly like you did.

skan added inline comments. · Mar 26 2022, 9:02 PM

llvm/lib/Target/X86/X86InstrSSE.td

4032–4039

> No. The two functions are in different stages. The decoding for `maskmovdqu/vmaskmovdqu` is already correct if we revert this change; no further fix is needed. `printInstFlags` or `printInst` is used mostly for the disassembler, so we do need to touch it. `X86MCInstLower::Lower` is used to translate a MachineInstr to an MCInst, and is where the address-size prefix is added when lowering from MIR.

so we do need to touch it -> so we don't need to touch it

4032–4039

> Looking at your D122537: so they do need to be special cased, it's just that you moved the special casing into `X86MCInstLower::Lower`. I had already come up with that independently as well while cleaning up my own special casing. I have a little bit more than what you put in D122537 though, let me check whether that is still needed.

As I commented before, it's not necessary to add a special case for `maskmovdqu` because we can check the implicit operand of the instruction. However, there are so many existing special cases in `X86MCInstLower::Lower` that I think it's okay to hardcode it.

hvdijk added inline comments. · Mar 26 2022, 9:13 PM

llvm/lib/Target/X86/X86InstrSSE.td
4032–4039	I went back to read your previous comments to see whether I missed it and am not seeing where you said so. Regardless, the discussion here has well ceased to be constructive so I propose we drop that.

skan added inline comments. · Mar 26 2022, 9:21 PM

llvm/lib/Target/X86/X86InstrSSE.td

4032–4039

> Hardcoding is one solution. Another is to add `X86::IP_HAS_AD_SIZE` to the `Flags` of the `MCInst` when translating a `MachineInstr` to an `MCInst`, so that a 0x67 will be emitted. `MASKMOVDQU` accesses memory and has an implicit use of `EDI`, and we can get that information from the `MachineInstr`.

I commented here.

4032–4039	Agree

skan mentioned this in D122540: [X86] Fix handling of maskmovdqu in x32 differently. · Mar 28 2022, 6:53 PM

hvdijk mentioned this in rG3337f50625a3: [X86] Fix handling of maskmovdqu in x32 differently. · Apr 12 2022, 10:32 AM

Revision Contents

Path	Size
llvm/include/llvm/Support/X86DisassemblerDecoderCommon.h	2 lines
llvm/lib/Target/X86/Disassembler/X86Disassembler.cpp	4 lines
llvm/lib/Target/X86/X86InstrSSE.td	20 lines
llvm/lib/Target/X86/X86ScheduleBtVer2.td	4 lines
llvm/test/CodeGen/X86/maskmovdqu.ll	15 lines
llvm/test/CodeGen/X86/sse2-intrinsics-fast-isel.ll	1278 lines
llvm/test/MC/X86/maskmovdqu.s	15 lines
llvm/test/MC/X86/maskmovdqu64.s	27 lines
llvm/utils/TableGen/X86DisassemblerTables.cpp	31 lines
llvm/utils/TableGen/X86RecognizableInstr.cpp	13 lines

Diff 359138

llvm/include/llvm/Support/X86DisassemblerDecoderCommon.h

@@ 110 lines hidden @@ ENUM_ENTRY(IC_64BIT_REXW_XD, 7, "Just as meaningful as " \
     "IC_64BIT_REXW_XS") \
   ENUM_ENTRY(IC_64BIT_REXW_OPSIZE, 8, "The Dynamic Duo! Prefer over all " \
     "else because this changes most " \
     "operands' meaning") \
   ENUM_ENTRY(IC_VEX, 1, "requires a VEX prefix") \
   ENUM_ENTRY(IC_VEX_XS, 2, "requires VEX and the XS prefix") \
   ENUM_ENTRY(IC_VEX_XD, 2, "requires VEX and the XD prefix") \
   ENUM_ENTRY(IC_VEX_OPSIZE, 2, "requires VEX and the OpSize prefix") \
+  ENUM_ENTRY(IC_64BIT_VEX_OPSIZE, 4, "requires 64-bit mode and VEX") \
+  ENUM_ENTRY(IC_64BIT_VEX_OPSIZE_ADSIZE, 5, "requires 64-bit mode, VEX, and AdSize")\
   ENUM_ENTRY(IC_VEX_W, 3, "requires VEX and the W prefix") \
   ENUM_ENTRY(IC_VEX_W_XS, 4, "requires VEX, W, and XS prefix") \
   ENUM_ENTRY(IC_VEX_W_XD, 4, "requires VEX, W, and XD prefix") \
   ENUM_ENTRY(IC_VEX_W_OPSIZE, 4, "requires VEX, W, and OpSize") \
   ENUM_ENTRY(IC_VEX_L, 3, "requires VEX and the L prefix") \
   ENUM_ENTRY(IC_VEX_L_XS, 4, "requires VEX and the L and XS prefix")\
   ENUM_ENTRY(IC_VEX_L_XD, 4, "requires VEX and the L and XD prefix")\
   ENUM_ENTRY(IC_VEX_L_OPSIZE, 4, "requires VEX, L, and OpSize") \
@@ 344 lines hidden @@

llvm/lib/Target/X86/Disassembler/X86Disassembler.cpp

@@ 1,113 lines hidden @@ if (insn->vectorExtensionType == TYPE_EVEX) {
       }
 
       if (lFromVEX3of3(insn->vectorExtensionPrefix[2]))
         attrMask |= ATTR_VEXL;
     } else if (insn->vectorExtensionType == TYPE_VEX_2B) {
       switch (ppFromVEX2of2(insn->vectorExtensionPrefix[1])) {
       case VEX_PREFIX_66:
         attrMask |= ATTR_OPSIZE;
+        if (insn->hasAdSize)
+          attrMask |= ATTR_ADSIZE;
         break;
       case VEX_PREFIX_F3:
         attrMask |= ATTR_XS;
         break;
       case VEX_PREFIX_F2:
         attrMask |= ATTR_XD;
         break;
       }
@@ 40 lines hidden @@ case 0xf2:
       attrMask |= ATTR_XD;
       break;
     case 0xf3:
       attrMask |= ATTR_XS;
       break;
     case 0x66:
       if (insn->mode != MODE_16BIT)
         attrMask |= ATTR_OPSIZE;
+      if (insn->hasAdSize)
+        attrMask |= ATTR_ADSIZE;
       break;
     case 0x67:
       attrMask |= ATTR_ADSIZE;
       break;
     }
   }
 
   if (insn->rexPrefix & 0x08) {
@@ 1,178 lines hidden @@

llvm/lib/Target/X86/X86InstrSSE.td

@@ 4,005 lines hidden @@ def VMASKMOVDQU : VPDI<0xF7, MRMSrcReg, (outs),
            "maskmovdqu\t{$mask, $src|$src, $mask}",
            [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, EDI)]>,
            VEX, VEX_WIG;
 let Uses = [RDI], Predicates = [HasAVX,In64BitMode] in
 def VMASKMOVDQU64 : VPDI<0xF7, MRMSrcReg, (outs),
            (ins VR128:$src, VR128:$mask),
            "maskmovdqu\t{$mask, $src|$src, $mask}",
            [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, RDI)]>,
-           VEX, VEX_WIG;
+           VEX, VEX_WIG, AdSize64;
+let Uses = [EDI], Predicates = [HasAVX,In64BitMode] in
+def VMASKMOVDQUX32 : VPDI<0xF7, MRMSrcReg, (outs),
+           (ins VR128:$src, VR128:$mask), "",
+           [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, EDI)]>,
+           VEX, VEX_WIG, AdSize32 {
+  let AsmString = "addr32 vmaskmovdqu\t{$mask, $src|$src, $mask}";
+  let AsmVariantName = "NonParsable";
+}
 
 let Uses = [EDI], Predicates = [UseSSE2,Not64BitMode] in
 def MASKMOVDQU : PDI<0xF7, MRMSrcReg, (outs), (ins VR128:$src, VR128:$mask),
            "maskmovdqu\t{$mask, $src|$src, $mask}",
            [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, EDI)]>;
 let Uses = [RDI], Predicates = [UseSSE2,In64BitMode] in
 def MASKMOVDQU64 : PDI<0xF7, MRMSrcReg, (outs), (ins VR128:$src, VR128:$mask),
            "maskmovdqu\t{$mask, $src|$src, $mask}",
-           [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, RDI)]>;
+           [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, RDI)]>,
+           AdSize64;
+let Uses = [EDI], Predicates = [UseSSE2,In64BitMode] in
+def MASKMOVDQUX32 : PDI<0xF7, MRMSrcReg, (outs), (ins VR128:$src, VR128:$mask),
+           "addr32 maskmovdqu\t{$mask, $src|$src, $mask}",
+           [(int_x86_sse2_maskmov_dqu VR128:$src, VR128:$mask, EDI)]>,
+           AdSize32 {
+  let AsmVariantName = "NonParsable";
+}

 } // ExeDomain = SSEPackedInt
 
 //===---------------------------------------------------------------------===//
 // SSE2 - Move Doubleword/Quadword
 //===---------------------------------------------------------------------===//
 
 //===---------------------------------------------------------------------===//
@@ 3,965 lines hidden @@

llvm/lib/Target/X86/X86ScheduleBtVer2.td

@@ 829 lines hidden @@
 // SSE2/AVX Store Selected Bytes of Double Quadword - (V)MASKMOVDQ
 ///////////////////////////////////////////////////////////////////////////////
 
 def JWriteMASKMOVDQU: SchedWriteRes<[JFPU0, JFPA, JFPU1, JSTC, JLAGU, JSAGU, JALU01]> {
   let Latency = 34;
   let ResourceCycles = [1, 1, 2, 2, 2, 16, 42];
   let NumMicroOps = 63;
 }
-def : InstRW<[JWriteMASKMOVDQU], (instrs MASKMOVDQU, MASKMOVDQU64,
-                                         VMASKMOVDQU, VMASKMOVDQU64)>;
+def : InstRW<[JWriteMASKMOVDQU], (instrs MASKMOVDQU, MASKMOVDQU64, MASKMOVDQUX32,
+                                         VMASKMOVDQU, VMASKMOVDQU64, VMASKMOVDQUX32)>;
 
 ///////////////////////////////////////////////////////////////////////////////
 // SchedWriteVariant definitions.
 ///////////////////////////////////////////////////////////////////////////////
 
 def JWriteZeroLatency : SchedWriteRes<[]> {
   let Latency = 0;
 }
@@ 204 lines hidden @@

llvm/test/CodeGen/X86/maskmovdqu.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=i686-- -mattr=+sse2,-avx \| FileCheck %s --check-prefix=i686_SSE2			; RUN: llc < %s -mtriple=i686-- -mattr=+sse2,-avx \| FileCheck %s --check-prefix=i686_SSE2
	; RUN: llc < %s -mtriple=x86_64-- -mattr=+sse2,-avx \| FileCheck %s --check-prefix=x86_64_SSE2			; RUN: llc < %s -mtriple=x86_64-- -mattr=+sse2,-avx \| FileCheck %s --check-prefix=x86_64_SSE2
				; RUN: llc < %s -mtriple=x86_64--gnux32 -mattr=+sse2,-avx \| FileCheck %s --check-prefix=x86_x32_SSE2
	; RUN: llc < %s -mtriple=i686-- -mattr=+avx \| FileCheck %s --check-prefix=i686_AVX			; RUN: llc < %s -mtriple=i686-- -mattr=+avx \| FileCheck %s --check-prefix=i686_AVX
	; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx \| FileCheck %s --check-prefix=x86_64_AVX			; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx \| FileCheck %s --check-prefix=x86_64_AVX
				; RUN: llc < %s -mtriple=x86_64--gnux32 -mattr=+avx \| FileCheck %s --check-prefix=x86_x32_AVX
	; rdar://6573467			; rdar://6573467

	define void @test(<16 x i8> %a, <16 x i8> %b, i32 %dummy, i8* %c) nounwind {			define void @test(<16 x i8> %a, <16 x i8> %b, i32 %dummy, i8* %c) nounwind {
	; i686_SSE2-LABEL: test:			; i686_SSE2-LABEL: test:
	; i686_SSE2: # %bb.0: # %entry			; i686_SSE2: # %bb.0: # %entry
	; i686_SSE2-NEXT: pushl %edi			; i686_SSE2-NEXT: pushl %edi
	; i686_SSE2-NEXT: movl {{[0-9]+}}(%esp), %edi			; i686_SSE2-NEXT: movl {{[0-9]+}}(%esp), %edi
	; i686_SSE2-NEXT: maskmovdqu %xmm1, %xmm0			; i686_SSE2-NEXT: maskmovdqu %xmm1, %xmm0
	; i686_SSE2-NEXT: popl %edi			; i686_SSE2-NEXT: popl %edi
	; i686_SSE2-NEXT: retl			; i686_SSE2-NEXT: retl
	;			;
	; x86_64_SSE2-LABEL: test:			; x86_64_SSE2-LABEL: test:
	; x86_64_SSE2: # %bb.0: # %entry			; x86_64_SSE2: # %bb.0: # %entry
	; x86_64_SSE2-NEXT: movq %rsi, %rdi			; x86_64_SSE2-NEXT: movq %rsi, %rdi
	; x86_64_SSE2-NEXT: maskmovdqu %xmm1, %xmm0			; x86_64_SSE2-NEXT: maskmovdqu %xmm1, %xmm0
	; x86_64_SSE2-NEXT: retq			; x86_64_SSE2-NEXT: retq
	;			;
				; x86_x32_SSE2-LABEL: test:
				; x86_x32_SSE2: # %bb.0: # %entry
				; x86_x32_SSE2-NEXT: movq %rsi, %rdi
				; x86_x32_SSE2-NEXT: # kill: def $edi killed $edi killed $rdi
				; x86_x32_SSE2-NEXT: addr32 maskmovdqu %xmm1, %xmm0
				; x86_x32_SSE2-NEXT: retq
				;
	; i686_AVX-LABEL: test:			; i686_AVX-LABEL: test:
	; i686_AVX: # %bb.0: # %entry			; i686_AVX: # %bb.0: # %entry
	; i686_AVX-NEXT: pushl %edi			; i686_AVX-NEXT: pushl %edi
	; i686_AVX-NEXT: movl {{[0-9]+}}(%esp), %edi			; i686_AVX-NEXT: movl {{[0-9]+}}(%esp), %edi
	; i686_AVX-NEXT: vmaskmovdqu %xmm1, %xmm0			; i686_AVX-NEXT: vmaskmovdqu %xmm1, %xmm0
	; i686_AVX-NEXT: popl %edi			; i686_AVX-NEXT: popl %edi
	; i686_AVX-NEXT: retl			; i686_AVX-NEXT: retl
	;			;
	; x86_64_AVX-LABEL: test:			; x86_64_AVX-LABEL: test:
	; x86_64_AVX: # %bb.0: # %entry			; x86_64_AVX: # %bb.0: # %entry
	; x86_64_AVX-NEXT: movq %rsi, %rdi			; x86_64_AVX-NEXT: movq %rsi, %rdi
	; x86_64_AVX-NEXT: vmaskmovdqu %xmm1, %xmm0			; x86_64_AVX-NEXT: vmaskmovdqu %xmm1, %xmm0
	; x86_64_AVX-NEXT: retq			; x86_64_AVX-NEXT: retq
				; x86_x32_AVX-LABEL: test:
				; x86_x32_AVX: # %bb.0: # %entry
				; x86_x32_AVX-NEXT: movq %rsi, %rdi
				; x86_x32_AVX-NEXT: # kill: def $edi killed $edi killed $rdi
				; x86_x32_AVX-NEXT: addr32 vmaskmovdqu %xmm1, %xmm0
				; x86_x32_AVX-NEXT: retq
	entry:			entry:
	tail call void @llvm.x86.sse2.maskmov.dqu( <16 x i8> %a, <16 x i8> %b, i8* %c )			tail call void @llvm.x86.sse2.maskmov.dqu( <16 x i8> %a, <16 x i8> %b, i8* %c )
	ret void			ret void
	}			}

	declare void @llvm.x86.sse2.maskmov.dqu(<16 x i8>, <16 x i8>, i8*) nounwind			declare void @llvm.x86.sse2.maskmov.dqu(<16 x i8>, <16 x i8>, i8*) nounwind
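
The x32 checks in this test differ from the x86_64 ones only in the `addr32` text in front of the mnemonic, and the summary says the machine-code form carries a leading 0x67 byte. A minimal Python sketch of that relationship follows. The helper names `expected_asm` and `with_addr32` are hypothetical and are not part of LLVM; this models the expected test output, not the actual printer or encoder.

```python
# Hypothetical helpers that model the checks in this diff; not LLVM code.

ADDR32_PREFIX = 0x67  # address-size override byte named in the summary


def expected_asm(mnemonic, operands, is_x32):
    """Return the assembly text the checks expect for (v)maskmovdqu."""
    # Only the x32 run lines expect the explicit addr32 prefix.
    prefix = "addr32 " if is_x32 else ""
    return f"{prefix}{mnemonic} {operands}"


def with_addr32(encoding):
    """Return a copy of an encoding with the 0x67 byte prepended."""
    return [ADDR32_PREFIX] + list(encoding)


if __name__ == "__main__":
    print(expected_asm("maskmovdqu", "%xmm1, %xmm0", is_x32=False))
    print(expected_asm("maskmovdqu", "%xmm1, %xmm0", is_x32=True))
    print(expected_asm("vmaskmovdqu", "%xmm1, %xmm0", is_x32=True))
```

The same prefix byte shows up in the `-show-mc-encoding` checks of the next file: the X64 `clflush (%rdi)` bytes `[0x0f,0xae,0x3f]` become `[0x67,0x0f,0xae,0x3f]` for the X32 `clflush (%edi)`, which is what `with_addr32` reproduces.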

llvm/test/CodeGen/X86/sse2-intrinsics-fast-isel.ll


	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+sse2 | FileCheck %s --check-prefixes=CHECK,X86,SSE,X86-SSE			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+sse2 | FileCheck %s --check-prefixes=CHECK,X86,SSE,X86-SSE
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+avx | FileCheck %s --check-prefixes=CHECK,X86,AVX,X86-AVX,AVX1,X86-AVX1			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+avx | FileCheck %s --check-prefixes=CHECK,X86,AVX,X86-AVX,AVX1,X86-AVX1
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+avx512f,+avx512bw,+avx512dq,+avx512vl | FileCheck %s --check-prefixes=CHECK,X86,AVX,X86-AVX,AVX512,X86-AVX512			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=i386-unknown-unknown -mattr=+avx512f,+avx512bw,+avx512dq,+avx512vl | FileCheck %s --check-prefixes=CHECK,X86,AVX,X86-AVX,AVX512,X86-AVX512
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+sse2 | FileCheck %s --check-prefixes=CHECK,X64,SSE,X64-SSE			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+sse2 | FileCheck %s --check-prefixes=CHECK,X64,SSE,X64-SSE
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+avx | FileCheck %s --check-prefixes=CHECK,X64,AVX,X64-AVX,AVX1,X64-AVX1			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+avx | FileCheck %s --check-prefixes=CHECK,X64,AVX,X64-AVX,AVX1,X64-AVX1
	; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+avx512f,+avx512bw,+avx512dq,+avx512vl | FileCheck %s --check-prefixes=CHECK,X64,AVX,X64-AVX,AVX512,X64-AVX512			; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown -mattr=+avx512f,+avx512bw,+avx512dq,+avx512vl | FileCheck %s --check-prefixes=CHECK,X64,AVX,X64-AVX,AVX512,X64-AVX512
				; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown-gnux32 -mattr=+sse2 | FileCheck %s --check-prefixes=CHECK,X32,SSE,X32-SSE
				; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown-gnux32 -mattr=+avx | FileCheck %s --check-prefixes=CHECK,X32,AVX,X32-AVX,AVX1,X32-AVX1
				; RUN: llc < %s -show-mc-encoding -fast-isel -mtriple=x86_64-unknown-unknown-gnux32 -mattr=+avx512f,+avx512bw,+avx512dq,+avx512vl | FileCheck %s --check-prefixes=CHECK,X32,AVX,X32-AVX,AVX512,X32-AVX512

	; NOTE: This should use IR equivalent to what is generated by clang/test/CodeGen/sse2-builtins.c			; NOTE: This should use IR equivalent to what is generated by clang/test/CodeGen/sse2-builtins.c

	define <2 x i64> @test_mm_add_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {			define <2 x i64> @test_mm_add_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {
	; SSE-LABEL: test_mm_add_epi8:			; SSE-LABEL: test_mm_add_epi8:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	; SSE-NEXT: paddb %xmm1, %xmm0 # encoding: [0x66,0x0f,0xfc,0xc1]			; SSE-NEXT: paddb %xmm1, %xmm0 # encoding: [0x66,0x0f,0xfc,0xc1]
	; SSE-NEXT: ret{{[l|q]}} # encoding: [0xc3]			; SSE-NEXT: ret{{[l|q]}} # encoding: [0xc3]
	[452 lines not shown]
	; X86-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	; X86-NEXT: clflush (%eax) # encoding: [0x0f,0xae,0x38]			; X86-NEXT: clflush (%eax) # encoding: [0x0f,0xae,0x38]
	; X86-NEXT: retl # encoding: [0xc3]			; X86-NEXT: retl # encoding: [0xc3]
	;			;
	; X64-LABEL: test_mm_clflush:			; X64-LABEL: test_mm_clflush:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: clflush (%rdi) # encoding: [0x0f,0xae,0x3f]			; X64-NEXT: clflush (%rdi) # encoding: [0x0f,0xae,0x3f]
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
				;
				; X32-LABEL: test_mm_clflush:
				; X32: # %bb.0:
				; X32-NEXT: clflush (%edi) # encoding: [0x67,0x0f,0xae,0x3f]
				; X32-NEXT: retq # encoding: [0xc3]
	call void @llvm.x86.sse2.clflush(i8* %a0)			call void @llvm.x86.sse2.clflush(i8* %a0)
	ret void			ret void
	}			}
	declare void @llvm.x86.sse2.clflush(i8*) nounwind readnone			declare void @llvm.x86.sse2.clflush(i8*) nounwind readnone

	define <2 x i64> @test_mm_cmpeq_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {			define <2 x i64> @test_mm_cmpeq_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {
	; SSE-LABEL: test_mm_cmpeq_epi8:			; SSE-LABEL: test_mm_cmpeq_epi8:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	[1,008 lines not shown]
	; X86-AVX512-NEXT: fldl (%esp) # encoding: [0xdd,0x04,0x24]			; X86-AVX512-NEXT: fldl (%esp) # encoding: [0xdd,0x04,0x24]
	; X86-AVX512-NEXT: movl %ebp, %esp # encoding: [0x89,0xec]			; X86-AVX512-NEXT: movl %ebp, %esp # encoding: [0x89,0xec]
	; X86-AVX512-NEXT: popl %ebp # encoding: [0x5d]			; X86-AVX512-NEXT: popl %ebp # encoding: [0x5d]
	; X86-AVX512-NEXT: retl # encoding: [0xc3]			; X86-AVX512-NEXT: retl # encoding: [0xc3]
	;			;
	; X64-LABEL: test_mm_cvtsd_f64:			; X64-LABEL: test_mm_cvtsd_f64:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
				;
				; X32-LABEL: test_mm_cvtsd_f64:
				; X32: # %bb.0:
				; X32-NEXT: retq # encoding: [0xc3]
	%res = extractelement <2 x double> %a0, i32 0			%res = extractelement <2 x double> %a0, i32 0
	ret double %res			ret double %res
	}			}

	define i32 @test_mm_cvtsd_si32(<2 x double> %a0) nounwind {			define i32 @test_mm_cvtsd_si32(<2 x double> %a0) nounwind {
	; SSE-LABEL: test_mm_cvtsd_si32:			; SSE-LABEL: test_mm_cvtsd_si32:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	; SSE-NEXT: cvtsd2si %xmm0, %eax # encoding: [0xf2,0x0f,0x2d,0xc0]			; SSE-NEXT: cvtsd2si %xmm0, %eax # encoding: [0xf2,0x0f,0x2d,0xc0]
	[61 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vcvtsd2ss (%rdi), %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x5a,0x07]			; X64-AVX1-NEXT: vcvtsd2ss (%rdi), %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x5a,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_cvtsd_ss_load:			; X64-AVX512-LABEL: test_mm_cvtsd_ss_load:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vcvtsd2ss (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x5a,0x07]			; X64-AVX512-NEXT: vcvtsd2ss (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x5a,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_cvtsd_ss_load:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: cvtsd2ss (%edi), %xmm0 # encoding: [0x67,0xf2,0x0f,0x5a,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_cvtsd_ss_load:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vcvtsd2ss (%edi), %xmm0, %xmm0 # encoding: [0x67,0xc5,0xfb,0x5a,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_cvtsd_ss_load:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vcvtsd2ss (%edi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x5a,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%a1 = load <2 x double>, <2 x double>* %p1			%a1 = load <2 x double>, <2 x double>* %p1
	%res = call <4 x float> @llvm.x86.sse2.cvtsd2ss(<4 x float> %a0, <2 x double> %a1)			%res = call <4 x float> @llvm.x86.sse2.cvtsd2ss(<4 x float> %a0, <2 x double> %a1)
	ret <4 x float> %res			ret <4 x float> %res
	}			}

	define i32 @test_mm_cvtsi128_si32(<2 x i64> %a0) nounwind {			define i32 @test_mm_cvtsi128_si32(<2 x i64> %a0) nounwind {
	; SSE-LABEL: test_mm_cvtsi128_si32:			; SSE-LABEL: test_mm_cvtsi128_si32:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	[39 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x2a,0xc7]			; X64-AVX1-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x2a,0xc7]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_cvtsi32_sd:			; X64-AVX512-LABEL: test_mm_cvtsi32_sd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x2a,0xc7]			; X64-AVX512-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x2a,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_cvtsi32_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: cvtsi2sd %edi, %xmm0 # encoding: [0xf2,0x0f,0x2a,0xc7]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_cvtsi32_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x2a,0xc7]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_cvtsi32_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vcvtsi2sd %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x2a,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%cvt = sitofp i32 %a1 to double			%cvt = sitofp i32 %a1 to double
	%res = insertelement <2 x double> %a0, double %cvt, i32 0			%res = insertelement <2 x double> %a0, double %cvt, i32 0
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x i64> @test_mm_cvtsi32_si128(i32 %a0) nounwind {			define <2 x i64> @test_mm_cvtsi32_si128(i32 %a0) nounwind {
	; X86-SSE-LABEL: test_mm_cvtsi32_si128:			; X86-SSE-LABEL: test_mm_cvtsi32_si128:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[22 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]			; X64-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_cvtsi32_si128:			; X64-AVX512-LABEL: test_mm_cvtsi32_si128:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]			; X64-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_cvtsi32_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_cvtsi32_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_cvtsi32_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <4 x i32> undef, i32 %a0, i32 0			%res0 = insertelement <4 x i32> undef, i32 %a0, i32 0
	%res1 = insertelement <4 x i32> %res0, i32 0, i32 1			%res1 = insertelement <4 x i32> %res0, i32 0, i32 1
	%res2 = insertelement <4 x i32> %res1, i32 0, i32 2			%res2 = insertelement <4 x i32> %res1, i32 0, i32 2
	%res3 = insertelement <4 x i32> %res2, i32 0, i32 3			%res3 = insertelement <4 x i32> %res2, i32 0, i32 3
	%res = bitcast <4 x i32> %res3 to <2 x i64>			%res = bitcast <4 x i32> %res3 to <2 x i64>
	ret <2 x i64> %res			ret <2 x i64> %res
	}			}

	[173 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc7,0x01]			; X64-AVX1-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc7,0x01]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_insert_epi16:			; X64-AVX512-LABEL: test_mm_insert_epi16:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x01]			; X64-AVX512-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x01]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_insert_epi16:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: pinsrw $1, %edi, %xmm0 # encoding: [0x66,0x0f,0xc4,0xc7,0x01]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_insert_epi16:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc7,0x01]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_insert_epi16:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpinsrw $1, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x01]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast <2 x i64> %a0 to <8 x i16>			%arg0 = bitcast <2 x i64> %a0 to <8 x i16>
	%res = insertelement <8 x i16> %arg0, i16 %a1,i32 1			%res = insertelement <8 x i16> %arg0, i16 %a1,i32 1
	%bc = bitcast <8 x i16> %res to <2 x i64>			%bc = bitcast <8 x i16> %res to <2 x i64>
	ret <2 x i64> %bc			ret <2 x i64> %bc
	}			}

	define void @test_mm_lfence() nounwind {			define void @test_mm_lfence() nounwind {
	; CHECK-LABEL: test_mm_lfence:			; CHECK-LABEL: test_mm_lfence:
	[33 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovaps (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x28,0x07]			; X64-AVX1-NEXT: vmovaps (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x28,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_load_pd:			; X64-AVX512-LABEL: test_mm_load_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovaps (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x28,0x07]			; X64-AVX512-NEXT: vmovaps (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x28,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_load_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movaps (%edi), %xmm0 # encoding: [0x67,0x0f,0x28,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_load_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovaps (%edi), %xmm0 # encoding: [0x67,0xc5,0xf8,0x28,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_load_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovaps (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x28,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	%res = load <2 x double>, <2 x double>* %arg0, align 16			%res = load <2 x double>, <2 x double>* %arg0, align 16
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x double> @test_mm_load_sd(double* %a0) nounwind {			define <2 x double> @test_mm_load_sd(double* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_load_sd:			; X86-SSE-LABEL: test_mm_load_sd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[28 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0],zero			; X64-AVX1-NEXT: # xmm0 = mem[0],zero
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_load_sd:			; X64-AVX512-LABEL: test_mm_load_sd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]			; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0],zero			; X64-AVX512-NEXT: # xmm0 = mem[0],zero
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_load_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movsd (%edi), %xmm0 # encoding: [0x67,0xf2,0x0f,0x10,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0],zero
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_load_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovsd (%edi), %xmm0 # encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0],zero
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_load_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovsd (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0],zero
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ld = load double, double* %a0, align 1			%ld = load double, double* %a0, align 1
	%res0 = insertelement <2 x double> undef, double %ld, i32 0			%res0 = insertelement <2 x double> undef, double %ld, i32 0
	%res1 = insertelement <2 x double> %res0, double 0.0, i32 1			%res1 = insertelement <2 x double> %res0, double 0.0, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x i64> @test_mm_load_si128(<2 x i64>* %a0) nounwind {			define <2 x i64> @test_mm_load_si128(<2 x i64>* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_load_si128:			; X86-SSE-LABEL: test_mm_load_si128:
	[23 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovaps (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x28,0x07]			; X64-AVX1-NEXT: vmovaps (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x28,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_load_si128:			; X64-AVX512-LABEL: test_mm_load_si128:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovaps (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x28,0x07]			; X64-AVX512-NEXT: vmovaps (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x28,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_load_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movaps (%edi), %xmm0 # encoding: [0x67,0x0f,0x28,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_load_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovaps (%edi), %xmm0 # encoding: [0x67,0xc5,0xf8,0x28,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_load_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovaps (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x28,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res = load <2 x i64>, <2 x i64>* %a0, align 16			%res = load <2 x i64>, <2 x i64>* %a0, align 16
	ret <2 x i64> %res			ret <2 x i64> %res
	}			}

	define <2 x double> @test_mm_load1_pd(double* %a0) nounwind {			define <2 x double> @test_mm_load1_pd(double* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_load1_pd:			; X86-SSE-LABEL: test_mm_load1_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	[31 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0,0]			; X64-AVX1-NEXT: # xmm0 = mem[0,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_load1_pd:			; X64-AVX512-LABEL: test_mm_load1_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovddup (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0x07]			; X64-AVX512-NEXT: vmovddup (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0,0]			; X64-AVX512-NEXT: # xmm0 = mem[0,0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_load1_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movsd (%edi), %xmm0 # encoding: [0x67,0xf2,0x0f,0x10,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0],zero
				; X32-SSE-NEXT: movlhps %xmm0, %xmm0 # encoding: [0x0f,0x16,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_load1_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovddup (%edi), %xmm0 # encoding: [0x67,0xc5,0xfb,0x12,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_load1_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovddup (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x12,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0,0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ld = load double, double* %a0, align 8			%ld = load double, double* %a0, align 8
	%res0 = insertelement <2 x double> undef, double %ld, i32 0			%res0 = insertelement <2 x double> undef, double %ld, i32 0
	%res1 = insertelement <2 x double> %res0, double %ld, i32 1			%res1 = insertelement <2 x double> %res0, double %ld, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x double> @test_mm_loadh_pd(<2 x double> %a0, double* %a1) nounwind {			define <2 x double> @test_mm_loadh_pd(<2 x double> %a0, double* %a1) nounwind {
	; X86-SSE-LABEL: test_mm_loadh_pd:			; X86-SSE-LABEL: test_mm_loadh_pd:
	[29 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,1],mem[0,1]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,1],mem[0,1]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadh_pd:			; X64-AVX512-LABEL: test_mm_loadh_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovhps (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x16,0x07]			; X64-AVX512-NEXT: vmovhps (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x16,0x07]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0,1],mem[0,1]			; X64-AVX512-NEXT: # xmm0 = xmm0[0,1],mem[0,1]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadh_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movhps (%edi), %xmm0 # encoding: [0x67,0x0f,0x16,0x07]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,1],mem[0,1]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadh_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovhps (%edi), %xmm0, %xmm0 # encoding: [0x67,0xc5,0xf8,0x16,0x07]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,1],mem[0,1]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadh_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovhps (%edi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x16,0x07]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0,1],mem[0,1]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ld = load double, double* %a1, align 8			%ld = load double, double* %a1, align 8
	%res = insertelement <2 x double> %a0, double %ld, i32 1			%res = insertelement <2 x double> %a0, double %ld, i32 1
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x i64> @test_mm_loadl_epi64(<2 x i64> %a0, <2 x i64>* %a1) nounwind {			define <2 x i64> @test_mm_loadl_epi64(<2 x i64> %a0, <2 x i64>* %a1) nounwind {
	; X86-SSE-LABEL: test_mm_loadl_epi64:			; X86-SSE-LABEL: test_mm_loadl_epi64:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[28 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0],zero			; X64-AVX1-NEXT: # xmm0 = mem[0],zero
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadl_epi64:			; X64-AVX512-LABEL: test_mm_loadl_epi64:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]			; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0],zero			; X64-AVX512-NEXT: # xmm0 = mem[0],zero
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadl_epi64:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movsd (%edi), %xmm0 # encoding: [0x67,0xf2,0x0f,0x10,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0],zero
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadl_epi64:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovsd (%edi), %xmm0 # encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0],zero
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadl_epi64:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovsd (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0],zero
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%bc = bitcast <2 x i64>* %a1 to i64*			%bc = bitcast <2 x i64>* %a1 to i64*
	%ld = load i64, i64* %bc, align 1			%ld = load i64, i64* %bc, align 1
	%res0 = insertelement <2 x i64> undef, i64 %ld, i32 0			%res0 = insertelement <2 x i64> undef, i64 %ld, i32 0
	%res1 = insertelement <2 x i64> %res0, i64 0, i32 1			%res1 = insertelement <2 x i64> %res0, i64 0, i32 1
	ret <2 x i64> %res1			ret <2 x i64> %res1
	}			}

	define <2 x double> @test_mm_loadl_pd(<2 x double> %a0, double* %a1) nounwind {			define <2 x double> @test_mm_loadl_pd(<2 x double> %a0, double* %a1) nounwind {
	[30 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0,1],xmm0[2,3]			; X64-AVX1-NEXT: # xmm0 = mem[0,1],xmm0[2,3]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadl_pd:			; X64-AVX512-LABEL: test_mm_loadl_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovlps (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x12,0x07]			; X64-AVX512-NEXT: vmovlps (%rdi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x12,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0,1],xmm0[2,3]			; X64-AVX512-NEXT: # xmm0 = mem[0,1],xmm0[2,3]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadl_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlps (%edi), %xmm0 # encoding: [0x67,0x0f,0x12,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0,1],xmm0[2,3]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadl_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovlps (%edi), %xmm0, %xmm0 # encoding: [0x67,0xc5,0xf8,0x12,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0,1],xmm0[2,3]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadl_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovlps (%edi), %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x12,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0,1],xmm0[2,3]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ld = load double, double* %a1, align 8			%ld = load double, double* %a1, align 8
	%res = insertelement <2 x double> %a0, double %ld, i32 0			%res = insertelement <2 x double> %a0, double %ld, i32 0
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x double> @test_mm_loadr_pd(double* %a0) nounwind {			define <2 x double> @test_mm_loadr_pd(double* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_loadr_pd:			; X86-SSE-LABEL: test_mm_loadr_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[30 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[1,0]			; X64-AVX1-NEXT: # xmm0 = mem[1,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadr_pd:			; X64-AVX512-LABEL: test_mm_loadr_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpermilpd $1, (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0x07,0x01]			; X64-AVX512-NEXT: vpermilpd $1, (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0x07,0x01]
	; X64-AVX512-NEXT: # xmm0 = mem[1,0]			; X64-AVX512-NEXT: # xmm0 = mem[1,0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadr_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movaps (%edi), %xmm0 # encoding: [0x67,0x0f,0x28,0x07]
				; X32-SSE-NEXT: shufps $78, %xmm0, %xmm0 # encoding: [0x0f,0xc6,0xc0,0x4e]
				; X32-SSE-NEXT: # xmm0 = xmm0[2,3,0,1]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadr_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vpermilpd $1, (%edi), %xmm0 # encoding: [0x67,0xc4,0xe3,0x79,0x05,0x07,0x01]
				; X32-AVX1-NEXT: # xmm0 = mem[1,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadr_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpermilpd $1, (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc4,0xe3,0x79,0x05,0x07,0x01]
				; X32-AVX512-NEXT: # xmm0 = mem[1,0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	%ld = load <2 x double>, <2 x double>* %arg0, align 16			%ld = load <2 x double>, <2 x double>* %arg0, align 16
	%res = shufflevector <2 x double> %ld, <2 x double> undef, <2 x i32> <i32 1, i32 0>			%res = shufflevector <2 x double> %ld, <2 x double> undef, <2 x i32> <i32 1, i32 0>
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x double> @test_mm_loadu_pd(double* %a0) nounwind {			define <2 x double> @test_mm_loadu_pd(double* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_loadu_pd:			; X86-SSE-LABEL: test_mm_loadu_pd:
	[23 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovups (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x10,0x07]			; X64-AVX1-NEXT: vmovups (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x10,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadu_pd:			; X64-AVX512-LABEL: test_mm_loadu_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovups (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x10,0x07]			; X64-AVX512-NEXT: vmovups (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x10,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadu_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movups (%edi), %xmm0 # encoding: [0x67,0x0f,0x10,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadu_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovups (%edi), %xmm0 # encoding: [0x67,0xc5,0xf8,0x10,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadu_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovups (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x10,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	%res = load <2 x double>, <2 x double>* %arg0, align 1			%res = load <2 x double>, <2 x double>* %arg0, align 1
	ret <2 x double> %res			ret <2 x double> %res
	}			}

	define <2 x i64> @test_mm_loadu_si128(<2 x i64>* %a0) nounwind {			define <2 x i64> @test_mm_loadu_si128(<2 x i64>* %a0) nounwind {
	; X86-SSE-LABEL: test_mm_loadu_si128:			; X86-SSE-LABEL: test_mm_loadu_si128:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[22 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovups (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x10,0x07]			; X64-AVX1-NEXT: vmovups (%rdi), %xmm0 # encoding: [0xc5,0xf8,0x10,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadu_si128:			; X64-AVX512-LABEL: test_mm_loadu_si128:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovups (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x10,0x07]			; X64-AVX512-NEXT: vmovups (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x10,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadu_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movups (%edi), %xmm0 # encoding: [0x67,0x0f,0x10,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadu_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovups (%edi), %xmm0 # encoding: [0x67,0xc5,0xf8,0x10,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadu_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovups (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x10,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res = load <2 x i64>, <2 x i64>* %a0, align 1			%res = load <2 x i64>, <2 x i64>* %a0, align 1
	ret <2 x i64> %res			ret <2 x i64> %res
	}			}

	define <2 x i64> @test_mm_loadu_si64(i8* nocapture readonly %A) {			define <2 x i64> @test_mm_loadu_si64(i8* nocapture readonly %A) {
	; X86-SSE-LABEL: test_mm_loadu_si64:			; X86-SSE-LABEL: test_mm_loadu_si64:
	; X86-SSE: # %bb.0: # %entry			; X86-SSE: # %bb.0: # %entry
	; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	[27 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0],zero			; X64-AVX1-NEXT: # xmm0 = mem[0],zero
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadu_si64:			; X64-AVX512-LABEL: test_mm_loadu_si64:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]			; X64-AVX512-NEXT: vmovsd (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x10,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0],zero			; X64-AVX512-NEXT: # xmm0 = mem[0],zero
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadu_si64:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movsd (%edi), %xmm0 # encoding: [0x67,0xf2,0x0f,0x10,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0],zero
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadu_si64:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: vmovsd (%edi), %xmm0 # encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0],zero
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadu_si64:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: vmovsd (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x10,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0],zero
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%__v.i = bitcast i8* %A to i64*			%__v.i = bitcast i8* %A to i64*
	%0 = load i64, i64* %__v.i, align 1			%0 = load i64, i64* %__v.i, align 1
	%vecinit1.i = insertelement <2 x i64> <i64 undef, i64 0>, i64 %0, i32 0			%vecinit1.i = insertelement <2 x i64> <i64 undef, i64 0>, i64 %0, i32 0
	ret <2 x i64> %vecinit1.i			ret <2 x i64> %vecinit1.i
	}			}

	define <2 x i64> @test_mm_loadu_si32(i8* nocapture readonly %A) {			define <2 x i64> @test_mm_loadu_si32(i8* nocapture readonly %A) {
	[30 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = mem[0],zero,zero,zero			; X64-AVX1-NEXT: # xmm0 = mem[0],zero,zero,zero
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadu_si32:			; X64-AVX512-LABEL: test_mm_loadu_si32:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: vmovss (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfa,0x10,0x07]			; X64-AVX512-NEXT: vmovss (%rdi), %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfa,0x10,0x07]
	; X64-AVX512-NEXT: # xmm0 = mem[0],zero,zero,zero			; X64-AVX512-NEXT: # xmm0 = mem[0],zero,zero,zero
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadu_si32:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movss (%edi), %xmm0 # encoding: [0x67,0xf3,0x0f,0x10,0x07]
				; X32-SSE-NEXT: # xmm0 = mem[0],zero,zero,zero
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadu_si32:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: vmovss (%edi), %xmm0 # encoding: [0x67,0xc5,0xfa,0x10,0x07]
				; X32-AVX1-NEXT: # xmm0 = mem[0],zero,zero,zero
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadu_si32:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: vmovss (%edi), %xmm0 # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfa,0x10,0x07]
				; X32-AVX512-NEXT: # xmm0 = mem[0],zero,zero,zero
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%__v.i = bitcast i8* %A to i32*			%__v.i = bitcast i8* %A to i32*
	%0 = load i32, i32* %__v.i, align 1			%0 = load i32, i32* %__v.i, align 1
	%vecinit3.i = insertelement <4 x i32> <i32 undef, i32 0, i32 0, i32 0>, i32 %0, i32 0			%vecinit3.i = insertelement <4 x i32> <i32 undef, i32 0, i32 0, i32 0>, i32 %0, i32 0
	%1 = bitcast <4 x i32> %vecinit3.i to <2 x i64>			%1 = bitcast <4 x i32> %vecinit3.i to <2 x i64>
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	[31 lines not shown]
	; X64-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]			; X64-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_loadu_si16:			; X64-AVX512-LABEL: test_mm_loadu_si16:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: movzwl (%rdi), %eax # encoding: [0x0f,0xb7,0x07]			; X64-AVX512-NEXT: movzwl (%rdi), %eax # encoding: [0x0f,0xb7,0x07]
	; X64-AVX512-NEXT: vmovd %eax, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc0]			; X64-AVX512-NEXT: vmovd %eax, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_loadu_si16:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movzwl (%edi), %eax # encoding: [0x67,0x0f,0xb7,0x07]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_loadu_si16:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: movzwl (%edi), %eax # encoding: [0x67,0x0f,0xb7,0x07]
				; X32-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_loadu_si16:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: movzwl (%edi), %eax # encoding: [0x67,0x0f,0xb7,0x07]
				; X32-AVX512-NEXT: vmovd %eax, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%__v.i = bitcast i8* %A to i16*			%__v.i = bitcast i8* %A to i16*
	%0 = load i16, i16* %__v.i, align 1			%0 = load i16, i16* %__v.i, align 1
	%vecinit7.i = insertelement <8 x i16> <i16 undef, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 %0, i32 0			%vecinit7.i = insertelement <8 x i16> <i16 undef, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 %0, i32 0
	%1 = bitcast <8 x i16> %vecinit7.i to <2 x i64>			%1 = bitcast <8 x i16> %vecinit7.i to <2 x i64>
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	[41 lines not shown]
	; X64-SSE: # %bb.0:			; X64-SSE: # %bb.0:
	; X64-SSE-NEXT: maskmovdqu %xmm1, %xmm0 # encoding: [0x66,0x0f,0xf7,0xc1]			; X64-SSE-NEXT: maskmovdqu %xmm1, %xmm0 # encoding: [0x66,0x0f,0xf7,0xc1]
	; X64-SSE-NEXT: retq # encoding: [0xc3]			; X64-SSE-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX-LABEL: test_mm_maskmoveu_si128:			; X64-AVX-LABEL: test_mm_maskmoveu_si128:
	; X64-AVX: # %bb.0:			; X64-AVX: # %bb.0:
	; X64-AVX-NEXT: vmaskmovdqu %xmm1, %xmm0 # encoding: [0xc5,0xf9,0xf7,0xc1]			; X64-AVX-NEXT: vmaskmovdqu %xmm1, %xmm0 # encoding: [0xc5,0xf9,0xf7,0xc1]
	; X64-AVX-NEXT: retq # encoding: [0xc3]			; X64-AVX-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_maskmoveu_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: # kill: def $edi killed $edi killed $rdi
				; X32-SSE-NEXT: addr32 maskmovdqu %xmm1, %xmm0 # encoding: [0x67,0x66,0x0f,0xf7,0xc1]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX-LABEL: test_mm_maskmoveu_si128:
				; X32-AVX: # %bb.0:
				; X32-AVX-NEXT: # kill: def $edi killed $edi killed $rdi
				; X32-AVX-NEXT: addr32 vmaskmovdqu %xmm1, %xmm0 # encoding: [0x67,0xc5,0xf9,0xf7,0xc1]
				; X32-AVX-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast <2 x i64> %a0 to <16 x i8>			%arg0 = bitcast <2 x i64> %a0 to <16 x i8>
	%arg1 = bitcast <2 x i64> %a1 to <16 x i8>			%arg1 = bitcast <2 x i64> %a1 to <16 x i8>
	call void @llvm.x86.sse2.maskmov.dqu(<16 x i8> %arg0, <16 x i8> %arg1, i8* %a2)			call void @llvm.x86.sse2.maskmov.dqu(<16 x i8> %arg0, <16 x i8> %arg1, i8* %a2)
	ret void			ret void
	}			}
	declare void @llvm.x86.sse2.maskmov.dqu(<16 x i8>, <16 x i8>, i8*) nounwind			declare void @llvm.x86.sse2.maskmov.dqu(<16 x i8>, <16 x i8>, i8*) nounwind

	define <2 x i64> @test_mm_max_epi16(<2 x i64> %a0, <2 x i64> %a1) nounwind {			define <2 x i64> @test_mm_max_epi16(<2 x i64> %a0, <2 x i64> %a1) nounwind {
	[798 lines not shown]
	; X64-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]			; X64-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
	; X64-AVX512-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]			; X64-AVX512-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
	; X64-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]			; X64-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
	; X64-AVX512-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]			; X64-AVX512-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
	; X64-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]			; X64-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
	; X64-AVX512-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]			; X64-AVX512-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
	; X64-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]			; X64-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_epi8:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-SSE-NEXT: movd %eax, %xmm1 # encoding: [0x66,0x0f,0x6e,0xc8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm1 # encoding: [0x66,0x0f,0x60,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3],xmm1[4],xmm0[4],xmm1[5],xmm0[5],xmm1[6],xmm0[6],xmm1[7],xmm0[7]
				; X32-SSE-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm1, %xmm2 # encoding: [0x66,0x0f,0x61,0xd1]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm1[0],xmm2[1],xmm1[1],xmm2[2],xmm1[2],xmm2[3],xmm1[3]
				; X32-SSE-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-SSE-NEXT: movd %eax, %xmm3 # encoding: [0x66,0x0f,0x6e,0xd8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm3 # encoding: [0x66,0x0f,0x60,0xd8]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm0[0],xmm3[1],xmm0[1],xmm3[2],xmm0[2],xmm3[3],xmm0[3],xmm3[4],xmm0[4],xmm3[5],xmm0[5],xmm3[6],xmm0[6],xmm3[7],xmm0[7]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-SSE-NEXT: movd %eax, %xmm1 # encoding: [0x66,0x0f,0x6e,0xc8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm1 # encoding: [0x66,0x0f,0x60,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3],xmm1[4],xmm0[4],xmm1[5],xmm0[5],xmm1[6],xmm0[6],xmm1[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm3, %xmm1 # encoding: [0x66,0x0f,0x61,0xcb]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm3[0],xmm1[1],xmm3[1],xmm1[2],xmm3[2],xmm1[3],xmm3[3]
				; X32-SSE-NEXT: punpckldq %xmm2, %xmm1 # encoding: [0x66,0x0f,0x62,0xca]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm2[0],xmm1[1],xmm2[1]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-SSE-NEXT: movd %eax, %xmm3 # encoding: [0x66,0x0f,0x6e,0xd8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm3 # encoding: [0x66,0x0f,0x60,0xd8]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm0[0],xmm3[1],xmm0[1],xmm3[2],xmm0[2],xmm3[3],xmm0[3],xmm3[4],xmm0[4],xmm3[5],xmm0[5],xmm3[6],xmm0[6],xmm3[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm2, %xmm3 # encoding: [0x66,0x0f,0x61,0xda]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1],xmm3[2],xmm2[2],xmm3[3],xmm2[3]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x48]
				; X32-SSE-NEXT: movd %eax, %xmm4 # encoding: [0x66,0x0f,0x6e,0xe0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: punpcklbw %xmm4, %xmm0 # encoding: [0x66,0x0f,0x60,0xc4]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm4[0],xmm0[1],xmm4[1],xmm0[2],xmm4[2],xmm0[3],xmm4[3],xmm0[4],xmm4[4],xmm0[5],xmm4[5],xmm0[6],xmm4[6],xmm0[7],xmm4[7]
				; X32-SSE-NEXT: punpcklwd %xmm2, %xmm0 # encoding: [0x66,0x0f,0x61,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0],xmm0[1],xmm2[1],xmm0[2],xmm2[2],xmm0[3],xmm2[3]
				; X32-SSE-NEXT: punpckldq %xmm3, %xmm0 # encoding: [0x66,0x0f,0x62,0xc3]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm3[0],xmm0[1],xmm3[1]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_epi8:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb6,0x54,0x24,0x48]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX1-NEXT: vpinsrb $1, %r10d, %xmm0, %xmm0 # encoding: [0xc4,0xc3,0x79,0x20,0xc2,0x01]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-AVX1-NEXT: vpinsrb $2, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x02]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-AVX1-NEXT: vpinsrb $3, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x03]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-AVX1-NEXT: vpinsrb $4, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x04]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-AVX1-NEXT: vpinsrb $5, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x05]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-AVX1-NEXT: vpinsrb $6, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x06]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-AVX1-NEXT: vpinsrb $7, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x07]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-AVX1-NEXT: vpinsrb $8, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x08]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-AVX1-NEXT: vpinsrb $9, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x09]
				; X32-AVX1-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-AVX1-NEXT: vpinsrb $10, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0a]
				; X32-AVX1-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-AVX1-NEXT: vpinsrb $11, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0b]
				; X32-AVX1-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-AVX1-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
				; X32-AVX1-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-AVX1-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
				; X32-AVX1-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-AVX1-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
				; X32-AVX1-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-AVX1-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_epi8:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb6,0x54,0x24,0x48]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-AVX512-NEXT: vmovd %eax, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX512-NEXT: vpinsrb $1, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc3,0x79,0x20,0xc2,0x01]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-AVX512-NEXT: vpinsrb $2, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x02]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-AVX512-NEXT: vpinsrb $3, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x03]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-AVX512-NEXT: vpinsrb $4, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x04]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-AVX512-NEXT: vpinsrb $5, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x05]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-AVX512-NEXT: vpinsrb $6, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x06]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-AVX512-NEXT: vpinsrb $7, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x07]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-AVX512-NEXT: vpinsrb $8, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x08]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-AVX512-NEXT: vpinsrb $9, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x09]
				; X32-AVX512-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-AVX512-NEXT: vpinsrb $10, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0a]
				; X32-AVX512-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-AVX512-NEXT: vpinsrb $11, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0b]
				; X32-AVX512-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
				; X32-AVX512-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
				; X32-AVX512-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
				; X32-AVX512-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <16 x i8> undef, i8 %a15, i32 0			%res0 = insertelement <16 x i8> undef, i8 %a15, i32 0
	%res1 = insertelement <16 x i8> %res0, i8 %a14, i32 1			%res1 = insertelement <16 x i8> %res0, i8 %a14, i32 1
	%res2 = insertelement <16 x i8> %res1, i8 %a13, i32 2			%res2 = insertelement <16 x i8> %res1, i8 %a13, i32 2
	%res3 = insertelement <16 x i8> %res2, i8 %a12, i32 3			%res3 = insertelement <16 x i8> %res2, i8 %a12, i32 3
	%res4 = insertelement <16 x i8> %res3, i8 %a11, i32 4			%res4 = insertelement <16 x i8> %res3, i8 %a11, i32 4
	%res5 = insertelement <16 x i8> %res4, i8 %a10, i32 5			%res5 = insertelement <16 x i8> %res4, i8 %a10, i32 5
	%res6 = insertelement <16 x i8> %res5, i8 %a9 , i32 6			%res6 = insertelement <16 x i8> %res5, i8 %a9 , i32 6
	%res7 = insertelement <16 x i8> %res6, i8 %a8 , i32 7			%res7 = insertelement <16 x i8> %res6, i8 %a8 , i32 7
	[134 lines not shown]
	; X64-AVX512-NEXT: vpinsrw $1, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x01]			; X64-AVX512-NEXT: vpinsrw $1, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x01]
	; X64-AVX512-NEXT: vpinsrw $2, %r9d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x02]			; X64-AVX512-NEXT: vpinsrw $2, %r9d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x02]
	; X64-AVX512-NEXT: vpinsrw $3, %r8d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x03]			; X64-AVX512-NEXT: vpinsrw $3, %r8d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x03]
	; X64-AVX512-NEXT: vpinsrw $4, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc1,0x04]			; X64-AVX512-NEXT: vpinsrw $4, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc1,0x04]
	; X64-AVX512-NEXT: vpinsrw $5, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc2,0x05]			; X64-AVX512-NEXT: vpinsrw $5, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc2,0x05]
	; X64-AVX512-NEXT: vpinsrw $6, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc6,0x06]			; X64-AVX512-NEXT: vpinsrw $6, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc6,0x06]
	; X64-AVX512-NEXT: vpinsrw $7, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x07]			; X64-AVX512-NEXT: vpinsrw $7, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_epi16:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x10]
				; X32-SSE-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x08]
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: movd %esi, %xmm1 # encoding: [0x66,0x0f,0x6e,0xce]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm1 # encoding: [0x66,0x0f,0x61,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3]
				; X32-SSE-NEXT: movd %edx, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc2]
				; X32-SSE-NEXT: movd %ecx, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd1]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm2 # encoding: [0x66,0x0f,0x61,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3]
				; X32-SSE-NEXT: punpckldq %xmm1, %xmm2 # encoding: [0x66,0x0f,0x62,0xd1]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm1[0],xmm2[1],xmm1[1]
				; X32-SSE-NEXT: movd %r8d, %xmm0 # encoding: [0x66,0x41,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movd %r9d, %xmm1 # encoding: [0x66,0x41,0x0f,0x6e,0xc9]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm1 # encoding: [0x66,0x0f,0x61,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3]
				; X32-SSE-NEXT: movd %eax, %xmm3 # encoding: [0x66,0x0f,0x6e,0xd8]
				; X32-SSE-NEXT: movd %r10d, %xmm0 # encoding: [0x66,0x41,0x0f,0x6e,0xc2]
				; X32-SSE-NEXT: punpcklwd %xmm3, %xmm0 # encoding: [0x66,0x0f,0x61,0xc3]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm3[0],xmm0[1],xmm3[1],xmm0[2],xmm3[2],xmm0[3],xmm3[3]
				; X32-SSE-NEXT: punpckldq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x62,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1]
				; X32-SSE-NEXT: punpcklqdq %xmm2, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_epi16:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x10]
				; X32-AVX1-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x08]
				; X32-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX1-NEXT: vpinsrw $1, %r10d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x01]
				; X32-AVX1-NEXT: vpinsrw $2, %r9d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x02]
				; X32-AVX1-NEXT: vpinsrw $3, %r8d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x03]
				; X32-AVX1-NEXT: vpinsrw $4, %ecx, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc1,0x04]
				; X32-AVX1-NEXT: vpinsrw $5, %edx, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc2,0x05]
				; X32-AVX1-NEXT: vpinsrw $6, %esi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc6,0x06]
				; X32-AVX1-NEXT: vpinsrw $7, %edi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc7,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_epi16:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x10]
				; X32-AVX512-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x08]
				; X32-AVX512-NEXT: vmovd %eax, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX512-NEXT: vpinsrw $1, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x01]
				; X32-AVX512-NEXT: vpinsrw $2, %r9d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x02]
				; X32-AVX512-NEXT: vpinsrw $3, %r8d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x03]
				; X32-AVX512-NEXT: vpinsrw $4, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc1,0x04]
				; X32-AVX512-NEXT: vpinsrw $5, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc2,0x05]
				; X32-AVX512-NEXT: vpinsrw $6, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc6,0x06]
				; X32-AVX512-NEXT: vpinsrw $7, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc7,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <8 x i16> undef, i16 %a7, i32 0			%res0 = insertelement <8 x i16> undef, i16 %a7, i32 0
	%res1 = insertelement <8 x i16> %res0, i16 %a6, i32 1			%res1 = insertelement <8 x i16> %res0, i16 %a6, i32 1
	%res2 = insertelement <8 x i16> %res1, i16 %a5, i32 2			%res2 = insertelement <8 x i16> %res1, i16 %a5, i32 2
	%res3 = insertelement <8 x i16> %res2, i16 %a4, i32 3			%res3 = insertelement <8 x i16> %res2, i16 %a4, i32 3
	%res4 = insertelement <8 x i16> %res3, i16 %a3, i32 4			%res4 = insertelement <8 x i16> %res3, i16 %a3, i32 4
	%res5 = insertelement <8 x i16> %res4, i16 %a2, i32 5			%res5 = insertelement <8 x i16> %res4, i16 %a2, i32 5
	%res6 = insertelement <8 x i16> %res5, i16 %a1, i32 6			%res6 = insertelement <8 x i16> %res5, i16 %a1, i32 6
	%res7 = insertelement <8 x i16> %res6, i16 %a0, i32 7			%res7 = insertelement <8 x i16> %res6, i16 %a0, i32 7
	[62 lines not shown]
	;			;
	; X64-AVX512-LABEL: test_mm_set_epi32:			; X64-AVX512-LABEL: test_mm_set_epi32:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovd %ecx, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc1]			; X64-AVX512-NEXT: vmovd %ecx, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc1]
	; X64-AVX512-NEXT: vpinsrd $1, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x01]			; X64-AVX512-NEXT: vpinsrd $1, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x01]
	; X64-AVX512-NEXT: vpinsrd $2, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x02]			; X64-AVX512-NEXT: vpinsrd $2, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x02]
	; X64-AVX512-NEXT: vpinsrd $3, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc7,0x03]			; X64-AVX512-NEXT: vpinsrd $3, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc7,0x03]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_epi32:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: movd %esi, %xmm1 # encoding: [0x66,0x0f,0x6e,0xce]
				; X32-SSE-NEXT: punpckldq %xmm0, %xmm1 # encoding: [0x66,0x0f,0x62,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1]
				; X32-SSE-NEXT: movd %edx, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd2]
				; X32-SSE-NEXT: movd %ecx, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc1]
				; X32-SSE-NEXT: punpckldq %xmm2, %xmm0 # encoding: [0x66,0x0f,0x62,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0],xmm0[1],xmm2[1]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_epi32:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovd %ecx, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc1]
				; X32-AVX1-NEXT: vpinsrd $1, %edx, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x01]
				; X32-AVX1-NEXT: vpinsrd $2, %esi, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x02]
				; X32-AVX1-NEXT: vpinsrd $3, %edi, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc7,0x03]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_epi32:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovd %ecx, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc1]
				; X32-AVX512-NEXT: vpinsrd $1, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x01]
				; X32-AVX512-NEXT: vpinsrd $2, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x02]
				; X32-AVX512-NEXT: vpinsrd $3, %edi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc7,0x03]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <4 x i32> undef, i32 %a3, i32 0			%res0 = insertelement <4 x i32> undef, i32 %a3, i32 0
	%res1 = insertelement <4 x i32> %res0, i32 %a2, i32 1			%res1 = insertelement <4 x i32> %res0, i32 %a2, i32 1
	%res2 = insertelement <4 x i32> %res1, i32 %a1, i32 2			%res2 = insertelement <4 x i32> %res1, i32 %a1, i32 2
	%res3 = insertelement <4 x i32> %res2, i32 %a0, i32 3			%res3 = insertelement <4 x i32> %res2, i32 %a0, i32 3
	%res = bitcast <4 x i32> %res3 to <2 x i64>			%res = bitcast <4 x i32> %res3 to <2 x i64>
	ret <2 x i64> %res			ret <2 x i64> %res
	}			}

	[54 lines not shown]
	;			;
	; X64-AVX512-LABEL: test_mm_set_epi64x:			; X64-AVX512-LABEL: test_mm_set_epi64x:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovq %rdi, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xc7]			; X64-AVX512-NEXT: vmovq %rdi, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xc7]
	; X64-AVX512-NEXT: vmovq %rsi, %xmm1 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xce]			; X64-AVX512-NEXT: vmovq %rsi, %xmm1 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xce]
	; X64-AVX512-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf1,0x6c,0xc0]			; X64-AVX512-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf1,0x6c,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]			; X64-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_epi64x:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movq %rdi, %xmm1 # encoding: [0x66,0x48,0x0f,0x6e,0xcf]
				; X32-SSE-NEXT: movq %rsi, %xmm0 # encoding: [0x66,0x48,0x0f,0x6e,0xc6]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_epi64x:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovq %rdi, %xmm0 # encoding: [0xc4,0xe1,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vmovq %rsi, %xmm1 # encoding: [0xc4,0xe1,0xf9,0x6e,0xce]
				; X32-AVX1-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # encoding: [0xc5,0xf1,0x6c,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_epi64x:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovq %rdi, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xc7]
				; X32-AVX512-NEXT: vmovq %rsi, %xmm1 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xce]
				; X32-AVX512-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf1,0x6c,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x i64> undef, i64 %a1, i32 0			%res0 = insertelement <2 x i64> undef, i64 %a1, i32 0
	%res1 = insertelement <2 x i64> %res0, i64 %a0, i32 1			%res1 = insertelement <2 x i64> %res0, i64 %a0, i32 1
	ret <2 x i64> %res1			ret <2 x i64> %res1
	}			}

	define <2 x double> @test_mm_set_pd(double %a0, double %a1) nounwind {			define <2 x double> @test_mm_set_pd(double %a0, double %a1) nounwind {
	; X86-SSE-LABEL: test_mm_set_pd:			; X86-SSE-LABEL: test_mm_set_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[38 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = xmm1[0],xmm0[0]			; X64-AVX1-NEXT: # xmm0 = xmm1[0],xmm0[0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set_pd:			; X64-AVX512-LABEL: test_mm_set_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovlhps %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf0,0x16,0xc0]			; X64-AVX512-NEXT: vmovlhps %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf0,0x16,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]			; X64-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm0, %xmm1 # encoding: [0x0f,0x16,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0]
				; X32-SSE-NEXT: movaps %xmm1, %xmm0 # encoding: [0x0f,0x28,0xc1]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovlhps %xmm0, %xmm1, %xmm0 # encoding: [0xc5,0xf0,0x16,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovlhps %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf0,0x16,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x double> undef, double %a1, i32 0			%res0 = insertelement <2 x double> undef, double %a1, i32 0
	%res1 = insertelement <2 x double> %res0, double %a0, i32 1			%res1 = insertelement <2 x double> %res0, double %a0, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x double> @test_mm_set_pd1(double %a0) nounwind {			define <2 x double> @test_mm_set_pd1(double %a0) nounwind {
	; X86-SSE-LABEL: test_mm_set_pd1:			; X86-SSE-LABEL: test_mm_set_pd1:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[31 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,0]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set_pd1:			; X64-AVX512-LABEL: test_mm_set_pd1:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]			; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]			; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_pd1:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm0, %xmm0 # encoding: [0x0f,0x16,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_pd1:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovddup %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_pd1:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x double> undef, double %a0, i32 0			%res0 = insertelement <2 x double> undef, double %a0, i32 0
	%res1 = insertelement <2 x double> %res0, double %a0, i32 1			%res1 = insertelement <2 x double> %res0, double %a0, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x double> @test_mm_set_sd(double %a0) nounwind {			define <2 x double> @test_mm_set_sd(double %a0) nounwind {
	; X86-SSE-LABEL: test_mm_set_sd:			; X86-SSE-LABEL: test_mm_set_sd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[31 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0],zero			; X64-AVX1-NEXT: # xmm0 = xmm0[0],zero
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set_sd:			; X64-AVX512-LABEL: test_mm_set_sd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovq %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfa,0x7e,0xc0]			; X64-AVX512-NEXT: vmovq %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfa,0x7e,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0],zero			; X64-AVX512-NEXT: # xmm0 = xmm0[0],zero
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movq %xmm0, %xmm0 # encoding: [0xf3,0x0f,0x7e,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],zero
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovq %xmm0, %xmm0 # encoding: [0xc5,0xfa,0x7e,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0],zero
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovq %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfa,0x7e,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0],zero
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x double> undef, double %a0, i32 0			%res0 = insertelement <2 x double> undef, double %a0, i32 0
	%res1 = insertelement <2 x double> %res0, double 0.0, i32 1			%res1 = insertelement <2 x double> %res0, double 0.0, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x i64> @test_mm_set1_epi8(i8 %a0) nounwind {			define <2 x i64> @test_mm_set1_epi8(i8 %a0) nounwind {
	; X86-SSE-LABEL: test_mm_set1_epi8:			; X86-SSE-LABEL: test_mm_set1_epi8:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[40 lines not shown]
	; X64-AVX1-NEXT: vpxor %xmm1, %xmm1, %xmm1 # encoding: [0xc5,0xf1,0xef,0xc9]			; X64-AVX1-NEXT: vpxor %xmm1, %xmm1, %xmm1 # encoding: [0xc5,0xf1,0xef,0xc9]
	; X64-AVX1-NEXT: vpshufb %xmm1, %xmm0, %xmm0 # encoding: [0xc4,0xe2,0x79,0x00,0xc1]			; X64-AVX1-NEXT: vpshufb %xmm1, %xmm0, %xmm0 # encoding: [0xc4,0xe2,0x79,0x00,0xc1]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set1_epi8:			; X64-AVX512-LABEL: test_mm_set1_epi8:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpbroadcastb %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7a,0xc7]			; X64-AVX512-NEXT: vpbroadcastb %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7a,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set1_epi8:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm0 # encoding: [0x66,0x0f,0x60,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,1,1,2,2,3,3,4,4,5,5,6,6,7,7]
				; X32-SSE-NEXT: pshuflw $0, %xmm0, %xmm0 # encoding: [0xf2,0x0f,0x70,0xc0,0x00]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,0,0,4,5,6,7]
				; X32-SSE-NEXT: pshufd $0, %xmm0, %xmm0 # encoding: [0x66,0x0f,0x70,0xc0,0x00]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set1_epi8:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-AVX1-NEXT: vmovd %eax, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc0]
				; X32-AVX1-NEXT: vpxor %xmm1, %xmm1, %xmm1 # encoding: [0xc5,0xf1,0xef,0xc9]
				; X32-AVX1-NEXT: vpshufb %xmm1, %xmm0, %xmm0 # encoding: [0xc4,0xe2,0x79,0x00,0xc1]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set1_epi8:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpbroadcastb %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7a,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <16 x i8> undef, i8 %a0, i32 0			%res0 = insertelement <16 x i8> undef, i8 %a0, i32 0
	%res1 = insertelement <16 x i8> %res0, i8 %a0, i32 1			%res1 = insertelement <16 x i8> %res0, i8 %a0, i32 1
	%res2 = insertelement <16 x i8> %res1, i8 %a0, i32 2			%res2 = insertelement <16 x i8> %res1, i8 %a0, i32 2
	%res3 = insertelement <16 x i8> %res2, i8 %a0, i32 3			%res3 = insertelement <16 x i8> %res2, i8 %a0, i32 3
	%res4 = insertelement <16 x i8> %res3, i8 %a0, i32 4			%res4 = insertelement <16 x i8> %res3, i8 %a0, i32 4
	%res5 = insertelement <16 x i8> %res4, i8 %a0, i32 5			%res5 = insertelement <16 x i8> %res4, i8 %a0, i32 5
	%res6 = insertelement <16 x i8> %res5, i8 %a0, i32 6			%res6 = insertelement <16 x i8> %res5, i8 %a0, i32 6
	%res7 = insertelement <16 x i8> %res6, i8 %a0, i32 7			%res7 = insertelement <16 x i8> %res6, i8 %a0, i32 7
	[53 lines not shown]
	; X64-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]			; X64-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set1_epi16:			; X64-AVX512-LABEL: test_mm_set1_epi16:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpbroadcastw %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7b,0xc7]			; X64-AVX512-NEXT: vpbroadcastw %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7b,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set1_epi16:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: pshuflw $0, %xmm0, %xmm0 # encoding: [0xf2,0x0f,0x70,0xc0,0x00]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,0,0,4,5,6,7]
				; X32-SSE-NEXT: pshufd $0, %xmm0, %xmm0 # encoding: [0x66,0x0f,0x70,0xc0,0x00]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set1_epi16:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vpshuflw $0, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x70,0xc0,0x00]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0,4,5,6,7]
				; X32-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set1_epi16:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpbroadcastw %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7b,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <8 x i16> undef, i16 %a0, i32 0			%res0 = insertelement <8 x i16> undef, i16 %a0, i32 0
	%res1 = insertelement <8 x i16> %res0, i16 %a0, i32 1			%res1 = insertelement <8 x i16> %res0, i16 %a0, i32 1
	%res2 = insertelement <8 x i16> %res1, i16 %a0, i32 2			%res2 = insertelement <8 x i16> %res1, i16 %a0, i32 2
	%res3 = insertelement <8 x i16> %res2, i16 %a0, i32 3			%res3 = insertelement <8 x i16> %res2, i16 %a0, i32 3
	%res4 = insertelement <8 x i16> %res3, i16 %a0, i32 4			%res4 = insertelement <8 x i16> %res3, i16 %a0, i32 4
	%res5 = insertelement <8 x i16> %res4, i16 %a0, i32 5			%res5 = insertelement <8 x i16> %res4, i16 %a0, i32 5
	%res6 = insertelement <8 x i16> %res5, i16 %a0, i32 6			%res6 = insertelement <8 x i16> %res5, i16 %a0, i32 6
	%res7 = insertelement <8 x i16> %res6, i16 %a0, i32 7			%res7 = insertelement <8 x i16> %res6, i16 %a0, i32 7
	[37 lines not shown]
	; X64-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]			; X64-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set1_epi32:			; X64-AVX512-LABEL: test_mm_set1_epi32:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpbroadcastd %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7c,0xc7]			; X64-AVX512-NEXT: vpbroadcastd %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7c,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set1_epi32:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: pshufd $0, %xmm0, %xmm0 # encoding: [0x66,0x0f,0x70,0xc0,0x00]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0,0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set1_epi32:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vpshufd $0, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x00]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0,0,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set1_epi32:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpbroadcastd %edi, %xmm0 # encoding: [0x62,0xf2,0x7d,0x08,0x7c,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <4 x i32> undef, i32 %a0, i32 0			%res0 = insertelement <4 x i32> undef, i32 %a0, i32 0
	%res1 = insertelement <4 x i32> %res0, i32 %a0, i32 1			%res1 = insertelement <4 x i32> %res0, i32 %a0, i32 1
	%res2 = insertelement <4 x i32> %res1, i32 %a0, i32 2			%res2 = insertelement <4 x i32> %res1, i32 %a0, i32 2
	%res3 = insertelement <4 x i32> %res2, i32 %a0, i32 3			%res3 = insertelement <4 x i32> %res2, i32 %a0, i32 3
	%res = bitcast <4 x i32> %res3 to <2 x i64>			%res = bitcast <4 x i32> %res3 to <2 x i64>
	ret <2 x i64> %res			ret <2 x i64> %res
	}			}

	[42 lines not shown]
	; X64-AVX1-NEXT: vpshufd $68, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x44]			; X64-AVX1-NEXT: vpshufd $68, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x44]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,1,0,1]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,1,0,1]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set1_epi64x:			; X64-AVX512-LABEL: test_mm_set1_epi64x:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpbroadcastq %rdi, %xmm0 # encoding: [0x62,0xf2,0xfd,0x08,0x7c,0xc7]			; X64-AVX512-NEXT: vpbroadcastq %rdi, %xmm0 # encoding: [0x62,0xf2,0xfd,0x08,0x7c,0xc7]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set1_epi64x:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movq %rdi, %xmm0 # encoding: [0x66,0x48,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: pshufd $68, %xmm0, %xmm0 # encoding: [0x66,0x0f,0x70,0xc0,0x44]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,1,0,1]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set1_epi64x:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovq %rdi, %xmm0 # encoding: [0xc4,0xe1,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vpshufd $68, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0x70,0xc0,0x44]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,1,0,1]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set1_epi64x:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpbroadcastq %rdi, %xmm0 # encoding: [0x62,0xf2,0xfd,0x08,0x7c,0xc7]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x i64> undef, i64 %a0, i32 0			%res0 = insertelement <2 x i64> undef, i64 %a0, i32 0
	%res1 = insertelement <2 x i64> %res0, i64 %a0, i32 1			%res1 = insertelement <2 x i64> %res0, i64 %a0, i32 1
	ret <2 x i64> %res1			ret <2 x i64> %res1
	}			}

	define <2 x double> @test_mm_set1_pd(double %a0) nounwind {			define <2 x double> @test_mm_set1_pd(double %a0) nounwind {
	; X86-SSE-LABEL: test_mm_set1_pd:			; X86-SSE-LABEL: test_mm_set1_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[31 lines not shown]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0,0]			; X64-AVX1-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_set1_pd:			; X64-AVX512-LABEL: test_mm_set1_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]			; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]			; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_set1_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm0, %xmm0 # encoding: [0x0f,0x16,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_set1_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovddup %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_set1_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x double> undef, double %a0, i32 0			%res0 = insertelement <2 x double> undef, double %a0, i32 0
	%res1 = insertelement <2 x double> %res0, double %a0, i32 1			%res1 = insertelement <2 x double> %res0, double %a0, i32 1
	ret <2 x double> %res1			ret <2 x double> %res1
	}			}

	define <2 x i64> @test_mm_setr_epi8(i8 %a0, i8 %a1, i8 %a2, i8 %a3, i8 %a4, i8 %a5, i8 %a6, i8 %a7, i8 %a8, i8 %a9, i8 %a10, i8 %a11, i8 %a12, i8 %a13, i8 %a14, i8 %a15) nounwind {			define <2 x i64> @test_mm_setr_epi8(i8 %a0, i8 %a1, i8 %a2, i8 %a3, i8 %a4, i8 %a5, i8 %a6, i8 %a7, i8 %a8, i8 %a9, i8 %a10, i8 %a11, i8 %a12, i8 %a13, i8 %a14, i8 %a15) nounwind {
	; X86-SSE-LABEL: test_mm_setr_epi8:			; X86-SSE-LABEL: test_mm_setr_epi8:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[265 lines not shown]
	; X64-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]			; X64-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
	; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x40]			; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x40]
	; X64-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]			; X64-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
	; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x48]			; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x48]
	; X64-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]			; X64-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
	; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x50]			; X64-AVX512-NEXT: movzbl {{[0-9]+}}(%rsp), %eax # encoding: [0x0f,0xb6,0x44,0x24,0x50]
	; X64-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]			; X64-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_setr_epi8:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x48]
				; X32-SSE-NEXT: movd %eax, %xmm1 # encoding: [0x66,0x0f,0x6e,0xc8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm1 # encoding: [0x66,0x0f,0x60,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3],xmm1[4],xmm0[4],xmm1[5],xmm0[5],xmm1[6],xmm0[6],xmm1[7],xmm0[7]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm1, %xmm2 # encoding: [0x66,0x0f,0x61,0xd1]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm1[0],xmm2[1],xmm1[1],xmm2[2],xmm1[2],xmm2[3],xmm1[3]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-SSE-NEXT: movd %eax, %xmm3 # encoding: [0x66,0x0f,0x6e,0xd8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm3 # encoding: [0x66,0x0f,0x60,0xd8]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm0[0],xmm3[1],xmm0[1],xmm3[2],xmm0[2],xmm3[3],xmm0[3],xmm3[4],xmm0[4],xmm3[5],xmm0[5],xmm3[6],xmm0[6],xmm3[7],xmm0[7]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-SSE-NEXT: movd %eax, %xmm1 # encoding: [0x66,0x0f,0x6e,0xc8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm1 # encoding: [0x66,0x0f,0x60,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3],xmm1[4],xmm0[4],xmm1[5],xmm0[5],xmm1[6],xmm0[6],xmm1[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm3, %xmm1 # encoding: [0x66,0x0f,0x61,0xcb]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm3[0],xmm1[1],xmm3[1],xmm1[2],xmm3[2],xmm1[3],xmm3[3]
				; X32-SSE-NEXT: punpckldq %xmm2, %xmm1 # encoding: [0x66,0x0f,0x62,0xca]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm2[0],xmm1[1],xmm2[1]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-SSE-NEXT: movd %eax, %xmm3 # encoding: [0x66,0x0f,0x6e,0xd8]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm3 # encoding: [0x66,0x0f,0x60,0xd8]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm0[0],xmm3[1],xmm0[1],xmm3[2],xmm0[2],xmm3[3],xmm0[3],xmm3[4],xmm0[4],xmm3[5],xmm0[5],xmm3[6],xmm0[6],xmm3[7],xmm0[7]
				; X32-SSE-NEXT: punpcklwd %xmm2, %xmm3 # encoding: [0x66,0x0f,0x61,0xda]
				; X32-SSE-NEXT: # xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1],xmm3[2],xmm2[2],xmm3[3],xmm2[3]
				; X32-SSE-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-SSE-NEXT: movd %eax, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklbw %xmm0, %xmm2 # encoding: [0x66,0x0f,0x60,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3],xmm2[4],xmm0[4],xmm2[5],xmm0[5],xmm2[6],xmm0[6],xmm2[7],xmm0[7]
				; X32-SSE-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-SSE-NEXT: movd %eax, %xmm4 # encoding: [0x66,0x0f,0x6e,0xe0]
				; X32-SSE-NEXT: movzbl %dil, %eax # encoding: [0x40,0x0f,0xb6,0xc7]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: punpcklbw %xmm4, %xmm0 # encoding: [0x66,0x0f,0x60,0xc4]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm4[0],xmm0[1],xmm4[1],xmm0[2],xmm4[2],xmm0[3],xmm4[3],xmm0[4],xmm4[4],xmm0[5],xmm4[5],xmm0[6],xmm4[6],xmm0[7],xmm4[7]
				; X32-SSE-NEXT: punpcklwd %xmm2, %xmm0 # encoding: [0x66,0x0f,0x61,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0],xmm0[1],xmm2[1],xmm0[2],xmm2[2],xmm0[3],xmm2[3]
				; X32-SSE-NEXT: punpckldq %xmm3, %xmm0 # encoding: [0x66,0x0f,0x62,0xc3]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm3[0],xmm0[1],xmm3[1]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_setr_epi8:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-AVX1-NEXT: movzbl %dil, %esi # encoding: [0x40,0x0f,0xb6,0xf7]
				; X32-AVX1-NEXT: vmovd %esi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc6]
				; X32-AVX1-NEXT: vpinsrb $1, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x01]
				; X32-AVX1-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-AVX1-NEXT: vpinsrb $2, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x02]
				; X32-AVX1-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-AVX1-NEXT: vpinsrb $3, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x03]
				; X32-AVX1-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-AVX1-NEXT: vpinsrb $4, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x04]
				; X32-AVX1-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-AVX1-NEXT: vpinsrb $5, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x05]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-AVX1-NEXT: vpinsrb $6, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x06]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-AVX1-NEXT: vpinsrb $7, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x07]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-AVX1-NEXT: vpinsrb $8, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x08]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-AVX1-NEXT: vpinsrb $9, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x09]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-AVX1-NEXT: vpinsrb $10, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0a]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-AVX1-NEXT: vpinsrb $11, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0b]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-AVX1-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-AVX1-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x48]
				; X32-AVX1-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
				; X32-AVX1-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-AVX1-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_setr_epi8:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: movzbl %sil, %eax # encoding: [0x40,0x0f,0xb6,0xc6]
				; X32-AVX512-NEXT: movzbl %dil, %esi # encoding: [0x40,0x0f,0xb6,0xf7]
				; X32-AVX512-NEXT: vmovd %esi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc6]
				; X32-AVX512-NEXT: vpinsrb $1, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x01]
				; X32-AVX512-NEXT: movzbl %dl, %eax # encoding: [0x0f,0xb6,0xc2]
				; X32-AVX512-NEXT: vpinsrb $2, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x02]
				; X32-AVX512-NEXT: movzbl %cl, %eax # encoding: [0x0f,0xb6,0xc1]
				; X32-AVX512-NEXT: vpinsrb $3, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x03]
				; X32-AVX512-NEXT: movzbl %r8b, %eax # encoding: [0x41,0x0f,0xb6,0xc0]
				; X32-AVX512-NEXT: vpinsrb $4, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x04]
				; X32-AVX512-NEXT: movzbl %r9b, %eax # encoding: [0x41,0x0f,0xb6,0xc1]
				; X32-AVX512-NEXT: vpinsrb $5, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x05]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x08]
				; X32-AVX512-NEXT: vpinsrb $6, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x06]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x10]
				; X32-AVX512-NEXT: vpinsrb $7, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x07]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x18]
				; X32-AVX512-NEXT: vpinsrb $8, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x08]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x20]
				; X32-AVX512-NEXT: vpinsrb $9, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x09]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x28]
				; X32-AVX512-NEXT: vpinsrb $10, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0a]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x30]
				; X32-AVX512-NEXT: vpinsrb $11, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0b]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x38]
				; X32-AVX512-NEXT: vpinsrb $12, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0c]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x40]
				; X32-AVX512-NEXT: vpinsrb $13, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0d]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x48]
				; X32-AVX512-NEXT: vpinsrb $14, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0e]
				; X32-AVX512-NEXT: movzbl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb6,0x44,0x24,0x50]
				; X32-AVX512-NEXT: vpinsrb $15, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x20,0xc0,0x0f]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <16 x i8> undef, i8 %a0 , i32 0
	%res1 = insertelement <16 x i8> %res0, i8 %a1 , i32 1
	%res2 = insertelement <16 x i8> %res1, i8 %a2 , i32 2
	%res3 = insertelement <16 x i8> %res2, i8 %a3 , i32 3
	%res4 = insertelement <16 x i8> %res3, i8 %a4 , i32 4
	%res5 = insertelement <16 x i8> %res4, i8 %a5 , i32 5
	%res6 = insertelement <16 x i8> %res5, i8 %a6 , i32 6
	%res7 = insertelement <16 x i8> %res6, i8 %a7 , i32 7
	[... 134 lines not shown ...]
	; X64-AVX512-NEXT: vpinsrw $1, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc6,0x01]
	; X64-AVX512-NEXT: vpinsrw $2, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc2,0x02]
	; X64-AVX512-NEXT: vpinsrw $3, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc1,0x03]
	; X64-AVX512-NEXT: vpinsrw $4, %r8d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x04]
	; X64-AVX512-NEXT: vpinsrw $5, %r9d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x05]
	; X64-AVX512-NEXT: vpinsrw $6, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc0,0x06]
	; X64-AVX512-NEXT: vpinsrw $7, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_setr_epi16:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x10]
				; X32-SSE-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x08]
				; X32-SSE-NEXT: movd %eax, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc0]
				; X32-SSE-NEXT: movd %r10d, %xmm1 # encoding: [0x66,0x41,0x0f,0x6e,0xca]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm1 # encoding: [0x66,0x0f,0x61,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3]
				; X32-SSE-NEXT: movd %r9d, %xmm0 # encoding: [0x66,0x41,0x0f,0x6e,0xc1]
				; X32-SSE-NEXT: movd %r8d, %xmm2 # encoding: [0x66,0x41,0x0f,0x6e,0xd0]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm2 # encoding: [0x66,0x0f,0x61,0xd0]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1],xmm2[2],xmm0[2],xmm2[3],xmm0[3]
				; X32-SSE-NEXT: punpckldq %xmm1, %xmm2 # encoding: [0x66,0x0f,0x62,0xd1]
				; X32-SSE-NEXT: # xmm2 = xmm2[0],xmm1[0],xmm2[1],xmm1[1]
				; X32-SSE-NEXT: movd %ecx, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc1]
				; X32-SSE-NEXT: movd %edx, %xmm1 # encoding: [0x66,0x0f,0x6e,0xca]
				; X32-SSE-NEXT: punpcklwd %xmm0, %xmm1 # encoding: [0x66,0x0f,0x61,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1],xmm1[2],xmm0[2],xmm1[3],xmm0[3]
				; X32-SSE-NEXT: movd %esi, %xmm3 # encoding: [0x66,0x0f,0x6e,0xde]
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: punpcklwd %xmm3, %xmm0 # encoding: [0x66,0x0f,0x61,0xc3]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm3[0],xmm0[1],xmm3[1],xmm0[2],xmm3[2],xmm0[3],xmm3[3]
				; X32-SSE-NEXT: punpckldq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x62,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1]
				; X32-SSE-NEXT: punpcklqdq %xmm2, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_setr_epi16:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x10]
				; X32-AVX1-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x08]
				; X32-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vpinsrw $1, %esi, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc6,0x01]
				; X32-AVX1-NEXT: vpinsrw $2, %edx, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc2,0x02]
				; X32-AVX1-NEXT: vpinsrw $3, %ecx, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc1,0x03]
				; X32-AVX1-NEXT: vpinsrw $4, %r8d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x04]
				; X32-AVX1-NEXT: vpinsrw $5, %r9d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x05]
				; X32-AVX1-NEXT: vpinsrw $6, %eax, %xmm0, %xmm0 # encoding: [0xc5,0xf9,0xc4,0xc0,0x06]
				; X32-AVX1-NEXT: vpinsrw $7, %r10d, %xmm0, %xmm0 # encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_setr_epi16:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: movzwl {{[0-9]+}}(%esp), %r10d # encoding: [0x67,0x44,0x0f,0xb7,0x54,0x24,0x10]
				; X32-AVX512-NEXT: movzwl {{[0-9]+}}(%esp), %eax # encoding: [0x67,0x0f,0xb7,0x44,0x24,0x08]
				; X32-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX512-NEXT: vpinsrw $1, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc6,0x01]
				; X32-AVX512-NEXT: vpinsrw $2, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc2,0x02]
				; X32-AVX512-NEXT: vpinsrw $3, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc1,0x03]
				; X32-AVX512-NEXT: vpinsrw $4, %r8d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc0,0x04]
				; X32-AVX512-NEXT: vpinsrw $5, %r9d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc1,0x05]
				; X32-AVX512-NEXT: vpinsrw $6, %eax, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0xc4,0xc0,0x06]
				; X32-AVX512-NEXT: vpinsrw $7, %r10d, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xc1,0x79,0xc4,0xc2,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <8 x i16> undef, i16 %a0, i32 0
	%res1 = insertelement <8 x i16> %res0, i16 %a1, i32 1
	%res2 = insertelement <8 x i16> %res1, i16 %a2, i32 2
	%res3 = insertelement <8 x i16> %res2, i16 %a3, i32 3
	%res4 = insertelement <8 x i16> %res3, i16 %a4, i32 4
	%res5 = insertelement <8 x i16> %res4, i16 %a5, i32 5
	%res6 = insertelement <8 x i16> %res5, i16 %a6, i32 6
	%res7 = insertelement <8 x i16> %res6, i16 %a7, i32 7
	[... 62 lines not shown ...]
	;
	; X64-AVX512-LABEL: test_mm_setr_epi32:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]
	; X64-AVX512-NEXT: vpinsrd $1, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x01]
	; X64-AVX512-NEXT: vpinsrd $2, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x02]
	; X64-AVX512-NEXT: vpinsrd $3, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc1,0x03]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_setr_epi32:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movd %ecx, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc1]
				; X32-SSE-NEXT: movd %edx, %xmm1 # encoding: [0x66,0x0f,0x6e,0xca]
				; X32-SSE-NEXT: punpckldq %xmm0, %xmm1 # encoding: [0x66,0x0f,0x62,0xc8]
				; X32-SSE-NEXT: # xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1]
				; X32-SSE-NEXT: movd %esi, %xmm2 # encoding: [0x66,0x0f,0x6e,0xd6]
				; X32-SSE-NEXT: movd %edi, %xmm0 # encoding: [0x66,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: punpckldq %xmm2, %xmm0 # encoding: [0x66,0x0f,0x62,0xc2]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm2[0],xmm0[1],xmm2[1]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_setr_epi32:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovd %edi, %xmm0 # encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX1-NEXT: vpinsrd $1, %esi, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x01]
				; X32-AVX1-NEXT: vpinsrd $2, %edx, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x02]
				; X32-AVX1-NEXT: vpinsrd $3, %ecx, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x22,0xc1,0x03]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_setr_epi32:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovd %edi, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x6e,0xc7]
				; X32-AVX512-NEXT: vpinsrd $1, %esi, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc6,0x01]
				; X32-AVX512-NEXT: vpinsrd $2, %edx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc2,0x02]
				; X32-AVX512-NEXT: vpinsrd $3, %ecx, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x22,0xc1,0x03]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <4 x i32> undef, i32 %a0, i32 0
	%res1 = insertelement <4 x i32> %res0, i32 %a1, i32 1
	%res2 = insertelement <4 x i32> %res1, i32 %a2, i32 2
	%res3 = insertelement <4 x i32> %res2, i32 %a3, i32 3
	%res = bitcast <4 x i32> %res3 to <2 x i64>
	ret <2 x i64> %res
	}

	[... 54 lines not shown ...]
	;
	; X64-AVX512-LABEL: test_mm_setr_epi64x:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovq %rsi, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xc6]
	; X64-AVX512-NEXT: vmovq %rdi, %xmm1 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xcf]
	; X64-AVX512-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf1,0x6c,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_setr_epi64x:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movq %rsi, %xmm1 # encoding: [0x66,0x48,0x0f,0x6e,0xce]
				; X32-SSE-NEXT: movq %rdi, %xmm0 # encoding: [0x66,0x48,0x0f,0x6e,0xc7]
				; X32-SSE-NEXT: punpcklqdq %xmm1, %xmm0 # encoding: [0x66,0x0f,0x6c,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_setr_epi64x:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovq %rsi, %xmm0 # encoding: [0xc4,0xe1,0xf9,0x6e,0xc6]
				; X32-AVX1-NEXT: vmovq %rdi, %xmm1 # encoding: [0xc4,0xe1,0xf9,0x6e,0xcf]
				; X32-AVX1-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # encoding: [0xc5,0xf1,0x6c,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_setr_epi64x:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovq %rsi, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xc6]
				; X32-AVX512-NEXT: vmovq %rdi, %xmm1 # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x6e,0xcf]
				; X32-AVX512-NEXT: vpunpcklqdq %xmm0, %xmm1, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf1,0x6c,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm1[0],xmm0[0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x i64> undef, i64 %a0, i32 0
	%res1 = insertelement <2 x i64> %res0, i64 %a1, i32 1
	ret <2 x i64> %res1
	}

	define <2 x double> @test_mm_setr_pd(double %a0, double %a1) nounwind {
	; X86-SSE-LABEL: test_mm_setr_pd:
	; X86-SSE: # %bb.0:
	[... 37 lines not shown ...]
	; X64-AVX1-NEXT: # xmm0 = xmm0[0],xmm1[0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_setr_pd:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovlhps %xmm1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x16,0xc1]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0],xmm1[0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_setr_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm1, %xmm0 # encoding: [0x0f,0x16,0xc1]
				; X32-SSE-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_setr_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovlhps %xmm1, %xmm0, %xmm0 # encoding: [0xc5,0xf8,0x16,0xc1]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_setr_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovlhps %xmm1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x16,0xc1]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0],xmm1[0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%res0 = insertelement <2 x double> undef, double %a0, i32 0
	%res1 = insertelement <2 x double> %res0, double %a1, i32 1
	ret <2 x double> %res1
	}

	define <2 x double> @test_mm_setzero_pd() {
	; SSE-LABEL: test_mm_setzero_pd:
	; SSE: # %bb.0:
	[... 376 lines not shown ...]
	; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vsqrtsd %xmm0, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x51,0xc0]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_sqrt_sd_scalar:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vsqrtsd %xmm0, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x51,0xc0]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_sqrt_sd_scalar:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: sqrtsd %xmm0, %xmm0 # encoding: [0xf2,0x0f,0x51,0xc0]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_sqrt_sd_scalar:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vsqrtsd %xmm0, %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x51,0xc0]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_sqrt_sd_scalar:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vsqrtsd %xmm0, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x51,0xc0]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%sqrt = call double @llvm.sqrt.f64(double %a0)
	ret double %sqrt
	}

	define <2 x i64> @test_mm_sra_epi16(<2 x i64> %a0, <2 x i64> %a1) {
	; SSE-LABEL: test_mm_sra_epi16:
	; SSE: # %bb.0:
	; SSE-NEXT: psraw %xmm1, %xmm0 # encoding: [0x66,0x0f,0xe1,0xc1]
	[... 265 lines not shown ...]
	; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovaps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_store_pd:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovaps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_store_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movaps %xmm0, (%edi) # encoding: [0x67,0x0f,0x29,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_store_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovaps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_store_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovaps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*
	store <2 x double> %a1, <2 x double>* %arg0, align 16
	ret void
	}

	define void @test_mm_store_pd1(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_store_pd1:
	; X86-SSE: # %bb.0:
	[... 34 lines not shown ...]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_store_pd1:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX512-NEXT: vmovaps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_store_pd1:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm0, %xmm0 # encoding: [0x0f,0x16,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0]
				; X32-SSE-NEXT: movaps %xmm0, (%edi) # encoding: [0x67,0x0f,0x29,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_store_pd1:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovddup %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX1-NEXT: vmovaps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_store_pd1:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX512-NEXT: vmovaps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double * %a0 to <2 x double>*
	%shuf = shufflevector <2 x double> %a1, <2 x double> undef, <2 x i32> zeroinitializer
	store <2 x double> %shuf, <2 x double>* %arg0, align 16
	ret void
	}

	define void @test_mm_store_sd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_store_sd:
	[... 23 lines not shown ...]
	; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovsd %xmm0, (%rdi) # encoding: [0xc5,0xfb,0x11,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_store_sd:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovsd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x11,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_store_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movsd %xmm0, (%edi) # encoding: [0x67,0xf2,0x0f,0x11,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_store_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovsd %xmm0, (%edi) # encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_store_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovsd %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ext = extractelement <2 x double> %a1, i32 0
	store double %ext, double* %a0, align 1
	ret void
	}

	define void @test_mm_store_si128(<2 x i64> *%a0, <2 x i64> %a1) {
	; X86-SSE-LABEL: test_mm_store_si128:
	; X86-SSE: # %bb.0:
	[... 22 lines not shown ...]
	; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovaps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;
	; X64-AVX512-LABEL: test_mm_store_si128:
	; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovaps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_store_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movaps %xmm0, (%edi) # encoding: [0x67,0x0f,0x29,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_store_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovaps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_store_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovaps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	store <2 x i64> %a1, <2 x i64>* %a0, align 16			store <2 x i64> %a1, <2 x i64>* %a0, align 16
	ret void			ret void
	}			}

	define void @test_mm_store1_pd(double *%a0, <2 x double> %a1) {			define void @test_mm_store1_pd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_store1_pd:			; X86-SSE-LABEL: test_mm_store1_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	[33 lines not shown]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_store1_pd:			; X64-AVX512-LABEL: test_mm_store1_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]			; X64-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
	; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]			; X64-AVX512-NEXT: # xmm0 = xmm0[0,0]
	; X64-AVX512-NEXT: vmovaps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x29,0x07]			; X64-AVX512-NEXT: vmovaps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x29,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_store1_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movlhps %xmm0, %xmm0 # encoding: [0x0f,0x16,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[0,0]
				; X32-SSE-NEXT: movaps %xmm0, (%edi) # encoding: [0x67,0x0f,0x29,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_store1_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovddup %xmm0, %xmm0 # encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX1-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX1-NEXT: vmovaps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_store1_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovddup %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x12,0xc0]
				; X32-AVX512-NEXT: # xmm0 = xmm0[0,0]
				; X32-AVX512-NEXT: vmovaps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x29,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double * %a0 to <2 x double>*			%arg0 = bitcast double * %a0 to <2 x double>*
	%shuf = shufflevector <2 x double> %a1, <2 x double> undef, <2 x i32> zeroinitializer			%shuf = shufflevector <2 x double> %a1, <2 x double> undef, <2 x i32> zeroinitializer
	store <2 x double> %shuf, <2 x double>* %arg0, align 16			store <2 x double> %shuf, <2 x double>* %arg0, align 16
	ret void			ret void
	}			}

	define void @test_mm_storeh_sd(double *%a0, <2 x double> %a1) {			define void @test_mm_storeh_sd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_storeh_sd:			; X86-SSE-LABEL: test_mm_storeh_sd:
	[35 lines not shown]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeh_sd:			; X64-AVX512-LABEL: test_mm_storeh_sd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]			; X64-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
	; X64-AVX512-NEXT: # xmm0 = xmm0[1,0]			; X64-AVX512-NEXT: # xmm0 = xmm0[1,0]
	; X64-AVX512-NEXT: vmovsd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x11,0x07]			; X64-AVX512-NEXT: vmovsd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x11,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeh_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movhlps %xmm0, %xmm0 # encoding: [0x0f,0x12,0xc0]
				; X32-SSE-NEXT: # xmm0 = xmm0[1,1]
				; X32-SSE-NEXT: movsd %xmm0, (%edi) # encoding: [0x67,0xf2,0x0f,0x11,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeh_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vpermilpd $1, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
				; X32-AVX1-NEXT: # xmm0 = xmm0[1,0]
				; X32-AVX1-NEXT: vmovsd %xmm0, (%edi) # encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeh_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
				; X32-AVX512-NEXT: # xmm0 = xmm0[1,0]
				; X32-AVX512-NEXT: vmovsd %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ext = extractelement <2 x double> %a1, i32 1			%ext = extractelement <2 x double> %a1, i32 1
	store double %ext, double* %a0, align 8			store double %ext, double* %a0, align 8
	ret void			ret void
	}			}

	define void @test_mm_storel_epi64(<2 x i64> *%a0, <2 x i64> %a1) {			define void @test_mm_storel_epi64(<2 x i64> *%a0, <2 x i64> %a1) {
	; X86-SSE-LABEL: test_mm_storel_epi64:			; X86-SSE-LABEL: test_mm_storel_epi64:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[25 lines not shown]
	; X64-AVX1-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]			; X64-AVX1-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storel_epi64:			; X64-AVX512-LABEL: test_mm_storel_epi64:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]			; X64-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
	; X64-AVX512-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]			; X64-AVX512-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storel_epi64:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movq %xmm0, %rax # encoding: [0x66,0x48,0x0f,0x7e,0xc0]
				; X32-SSE-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storel_epi64:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovq %xmm0, %rax # encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
				; X32-AVX1-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storel_epi64:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
				; X32-AVX512-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ext = extractelement <2 x i64> %a1, i32 0			%ext = extractelement <2 x i64> %a1, i32 0
	%bc = bitcast <2 x i64> *%a0 to i64*			%bc = bitcast <2 x i64> *%a0 to i64*
	store i64 %ext, i64* %bc, align 8			store i64 %ext, i64* %bc, align 8
	ret void			ret void
	}			}

	define void @test_mm_storel_sd(double *%a0, <2 x double> %a1) {			define void @test_mm_storel_sd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_storel_sd:			; X86-SSE-LABEL: test_mm_storel_sd:
	[23 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovsd %xmm0, (%rdi) # encoding: [0xc5,0xfb,0x11,0x07]			; X64-AVX1-NEXT: vmovsd %xmm0, (%rdi) # encoding: [0xc5,0xfb,0x11,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storel_sd:			; X64-AVX512-LABEL: test_mm_storel_sd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovsd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x11,0x07]			; X64-AVX512-NEXT: vmovsd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xfb,0x11,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storel_sd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movsd %xmm0, (%edi) # encoding: [0x67,0xf2,0x0f,0x11,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storel_sd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovsd %xmm0, (%edi) # encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storel_sd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovsd %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xfb,0x11,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%ext = extractelement <2 x double> %a1, i32 0			%ext = extractelement <2 x double> %a1, i32 0
	store double %ext, double* %a0, align 8			store double %ext, double* %a0, align 8
	ret void			ret void
	}			}

	define void @test_mm_storer_pd(double *%a0, <2 x double> %a1) {			define void @test_mm_storer_pd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_storer_pd:			; X86-SSE-LABEL: test_mm_storer_pd:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[34 lines not shown]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storer_pd:			; X64-AVX512-LABEL: test_mm_storer_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]			; X64-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
	; X64-AVX512-NEXT: # xmm0 = xmm0[1,0]			; X64-AVX512-NEXT: # xmm0 = xmm0[1,0]
	; X64-AVX512-NEXT: vmovapd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x29,0x07]			; X64-AVX512-NEXT: vmovapd %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x29,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storer_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: shufps $78, %xmm0, %xmm0 # encoding: [0x0f,0xc6,0xc0,0x4e]
				; X32-SSE-NEXT: # xmm0 = xmm0[2,3,0,1]
				; X32-SSE-NEXT: movaps %xmm0, (%edi) # encoding: [0x67,0x0f,0x29,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storer_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vpermilpd $1, %xmm0, %xmm0 # encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
				; X32-AVX1-NEXT: # xmm0 = xmm0[1,0]
				; X32-AVX1-NEXT: vmovapd %xmm0, (%edi) # encoding: [0x67,0xc5,0xf9,0x29,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storer_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vpermilpd $1, %xmm0, %xmm0 # EVEX TO VEX Compression encoding: [0xc4,0xe3,0x79,0x05,0xc0,0x01]
				; X32-AVX512-NEXT: # xmm0 = xmm0[1,0]
				; X32-AVX512-NEXT: vmovapd %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf9,0x29,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	%shuf = shufflevector <2 x double> %a1, <2 x double> undef, <2 x i32> <i32 1, i32 0>			%shuf = shufflevector <2 x double> %a1, <2 x double> undef, <2 x i32> <i32 1, i32 0>
	store <2 x double> %shuf, <2 x double>* %arg0, align 16			store <2 x double> %shuf, <2 x double>* %arg0, align 16
	ret void			ret void
	}			}

	define void @test_mm_storeu_pd(double *%a0, <2 x double> %a1) {			define void @test_mm_storeu_pd(double *%a0, <2 x double> %a1) {
	; X86-SSE-LABEL: test_mm_storeu_pd:			; X86-SSE-LABEL: test_mm_storeu_pd:
	[23 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovups %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x11,0x07]			; X64-AVX1-NEXT: vmovups %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x11,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeu_pd:			; X64-AVX512-LABEL: test_mm_storeu_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovups %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x11,0x07]			; X64-AVX512-NEXT: vmovups %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x11,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeu_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movups %xmm0, (%edi) # encoding: [0x67,0x0f,0x11,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeu_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovups %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x11,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeu_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovups %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x11,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	store <2 x double> %a1, <2 x double>* %arg0, align 1			store <2 x double> %a1, <2 x double>* %arg0, align 1
	ret void			ret void
	}			}

	define void @test_mm_storeu_si128(<2 x i64> *%a0, <2 x i64> %a1) {			define void @test_mm_storeu_si128(<2 x i64> *%a0, <2 x i64> %a1) {
	; X86-SSE-LABEL: test_mm_storeu_si128:			; X86-SSE-LABEL: test_mm_storeu_si128:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	[22 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovups %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x11,0x07]			; X64-AVX1-NEXT: vmovups %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x11,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeu_si128:			; X64-AVX512-LABEL: test_mm_storeu_si128:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovups %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x11,0x07]			; X64-AVX512-NEXT: vmovups %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x11,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeu_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movups %xmm0, (%edi) # encoding: [0x67,0x0f,0x11,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeu_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovups %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x11,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeu_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovups %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x11,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	store <2 x i64> %a1, <2 x i64>* %a0, align 1			store <2 x i64> %a1, <2 x i64>* %a0, align 1
	ret void			ret void
	}			}

	define void @test_mm_storeu_si64(i8* nocapture %A, <2 x i64> %B) {			define void @test_mm_storeu_si64(i8* nocapture %A, <2 x i64> %B) {
	; X86-SSE-LABEL: test_mm_storeu_si64:			; X86-SSE-LABEL: test_mm_storeu_si64:
	; X86-SSE: # %bb.0: # %entry			; X86-SSE: # %bb.0: # %entry
	; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	[24 lines not shown]
	; X64-AVX1-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]			; X64-AVX1-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeu_si64:			; X64-AVX512-LABEL: test_mm_storeu_si64:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]			; X64-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
	; X64-AVX512-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]			; X64-AVX512-NEXT: movq %rax, (%rdi) # encoding: [0x48,0x89,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeu_si64:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movq %xmm0, %rax # encoding: [0x66,0x48,0x0f,0x7e,0xc0]
				; X32-SSE-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeu_si64:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: vmovq %xmm0, %rax # encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
				; X32-AVX1-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeu_si64:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: vmovq %xmm0, %rax # EVEX TO VEX Compression encoding: [0xc4,0xe1,0xf9,0x7e,0xc0]
				; X32-AVX512-NEXT: movq %rax, (%edi) # encoding: [0x67,0x48,0x89,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%vecext.i = extractelement <2 x i64> %B, i32 0			%vecext.i = extractelement <2 x i64> %B, i32 0
	%__v.i = bitcast i8* %A to i64*			%__v.i = bitcast i8* %A to i64*
	store i64 %vecext.i, i64* %__v.i, align 1			store i64 %vecext.i, i64* %__v.i, align 1
	ret void			ret void
	}			}

	define void @test_mm_storeu_si32(i8* nocapture %A, <2 x i64> %B) {			define void @test_mm_storeu_si32(i8* nocapture %A, <2 x i64> %B) {
	[30 lines not shown]
	; X64-AVX1-NEXT: movl %eax, (%rdi) # encoding: [0x89,0x07]			; X64-AVX1-NEXT: movl %eax, (%rdi) # encoding: [0x89,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeu_si32:			; X64-AVX512-LABEL: test_mm_storeu_si32:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]			; X64-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]
	; X64-AVX512-NEXT: movl %eax, (%rdi) # encoding: [0x89,0x07]			; X64-AVX512-NEXT: movl %eax, (%rdi) # encoding: [0x89,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeu_si32:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movd %xmm0, %eax # encoding: [0x66,0x0f,0x7e,0xc0]
				; X32-SSE-NEXT: movl %eax, (%edi) # encoding: [0x67,0x89,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeu_si32:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: vmovd %xmm0, %eax # encoding: [0xc5,0xf9,0x7e,0xc0]
				; X32-AVX1-NEXT: movl %eax, (%edi) # encoding: [0x67,0x89,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeu_si32:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]
				; X32-AVX512-NEXT: movl %eax, (%edi) # encoding: [0x67,0x89,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%0 = bitcast <2 x i64> %B to <4 x i32>			%0 = bitcast <2 x i64> %B to <4 x i32>
	%vecext.i = extractelement <4 x i32> %0, i32 0			%vecext.i = extractelement <4 x i32> %0, i32 0
	%__v.i = bitcast i8* %A to i32*			%__v.i = bitcast i8* %A to i32*
	store i32 %vecext.i, i32* %__v.i, align 1			store i32 %vecext.i, i32* %__v.i, align 1
	ret void			ret void
	}			}

	[31 lines not shown]
	; X64-AVX1-NEXT: movw %ax, (%rdi) # encoding: [0x66,0x89,0x07]			; X64-AVX1-NEXT: movw %ax, (%rdi) # encoding: [0x66,0x89,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_storeu_si16:			; X64-AVX512-LABEL: test_mm_storeu_si16:
	; X64-AVX512: # %bb.0: # %entry			; X64-AVX512: # %bb.0: # %entry
	; X64-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]			; X64-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]
	; X64-AVX512-NEXT: movw %ax, (%rdi) # encoding: [0x66,0x89,0x07]			; X64-AVX512-NEXT: movw %ax, (%rdi) # encoding: [0x66,0x89,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_storeu_si16:
				; X32-SSE: # %bb.0: # %entry
				; X32-SSE-NEXT: movd %xmm0, %eax # encoding: [0x66,0x0f,0x7e,0xc0]
				; X32-SSE-NEXT: movw %ax, (%edi) # encoding: [0x67,0x66,0x89,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_storeu_si16:
				; X32-AVX1: # %bb.0: # %entry
				; X32-AVX1-NEXT: vmovd %xmm0, %eax # encoding: [0xc5,0xf9,0x7e,0xc0]
				; X32-AVX1-NEXT: movw %ax, (%edi) # encoding: [0x67,0x66,0x89,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_storeu_si16:
				; X32-AVX512: # %bb.0: # %entry
				; X32-AVX512-NEXT: vmovd %xmm0, %eax # EVEX TO VEX Compression encoding: [0xc5,0xf9,0x7e,0xc0]
				; X32-AVX512-NEXT: movw %ax, (%edi) # encoding: [0x67,0x66,0x89,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	entry:			entry:
	%0 = bitcast <2 x i64> %B to <8 x i16>			%0 = bitcast <2 x i64> %B to <8 x i16>
	%vecext.i = extractelement <8 x i16> %0, i32 0			%vecext.i = extractelement <8 x i16> %0, i32 0
	%__v.i = bitcast i8* %A to i16*			%__v.i = bitcast i8* %A to i16*
	store i16 %vecext.i, i16* %__v.i, align 1			store i16 %vecext.i, i16* %__v.i, align 1
	ret void			ret void
	}			}

	[25 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovntps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x2b,0x07]			; X64-AVX1-NEXT: vmovntps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x2b,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_stream_pd:			; X64-AVX512-LABEL: test_mm_stream_pd:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovntps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x2b,0x07]			; X64-AVX512-NEXT: vmovntps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x2b,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_stream_pd:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movntps %xmm0, (%edi) # encoding: [0x67,0x0f,0x2b,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_stream_pd:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovntps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x2b,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_stream_pd:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovntps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x2b,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	%arg0 = bitcast double* %a0 to <2 x double>*			%arg0 = bitcast double* %a0 to <2 x double>*
	store <2 x double> %a1, <2 x double>* %arg0, align 16, !nontemporal !0			store <2 x double> %a1, <2 x double>* %arg0, align 16, !nontemporal !0
	ret void			ret void
	}			}

	define void @test_mm_stream_si32(i32 *%a0, i32 %a1) {			define void @test_mm_stream_si32(i32 *%a0, i32 %a1) {
	; X86-LABEL: test_mm_stream_si32:			; X86-LABEL: test_mm_stream_si32:
	; X86: # %bb.0:			; X86: # %bb.0:
	; X86-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x08]			; X86-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x08]
	; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx # encoding: [0x8b,0x4c,0x24,0x04]			; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx # encoding: [0x8b,0x4c,0x24,0x04]
	; X86-NEXT: movntil %eax, (%ecx) # encoding: [0x0f,0xc3,0x01]			; X86-NEXT: movntil %eax, (%ecx) # encoding: [0x0f,0xc3,0x01]
	; X86-NEXT: retl # encoding: [0xc3]			; X86-NEXT: retl # encoding: [0xc3]
	;			;
	; X64-LABEL: test_mm_stream_si32:			; X64-LABEL: test_mm_stream_si32:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: movntil %esi, (%rdi) # encoding: [0x0f,0xc3,0x37]			; X64-NEXT: movntil %esi, (%rdi) # encoding: [0x0f,0xc3,0x37]
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
				;
				; X32-LABEL: test_mm_stream_si32:
				; X32: # %bb.0:
				; X32-NEXT: movntil %esi, (%edi) # encoding: [0x67,0x0f,0xc3,0x37]
				; X32-NEXT: retq # encoding: [0xc3]
	store i32 %a1, i32* %a0, align 1, !nontemporal !0			store i32 %a1, i32* %a0, align 1, !nontemporal !0
	ret void			ret void
	}			}

	define void @test_mm_stream_si128(<2 x i64> *%a0, <2 x i64> %a1) {			define void @test_mm_stream_si128(<2 x i64> *%a0, <2 x i64> %a1) {
	; X86-SSE-LABEL: test_mm_stream_si128:			; X86-SSE-LABEL: test_mm_stream_si128:
	; X86-SSE: # %bb.0:			; X86-SSE: # %bb.0:
	; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]			; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax # encoding: [0x8b,0x44,0x24,0x04]
	[21 lines not shown]
	; X64-AVX1: # %bb.0:			; X64-AVX1: # %bb.0:
	; X64-AVX1-NEXT: vmovntps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x2b,0x07]			; X64-AVX1-NEXT: vmovntps %xmm0, (%rdi) # encoding: [0xc5,0xf8,0x2b,0x07]
	; X64-AVX1-NEXT: retq # encoding: [0xc3]			; X64-AVX1-NEXT: retq # encoding: [0xc3]
	;			;
	; X64-AVX512-LABEL: test_mm_stream_si128:			; X64-AVX512-LABEL: test_mm_stream_si128:
	; X64-AVX512: # %bb.0:			; X64-AVX512: # %bb.0:
	; X64-AVX512-NEXT: vmovntps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x2b,0x07]			; X64-AVX512-NEXT: vmovntps %xmm0, (%rdi) # EVEX TO VEX Compression encoding: [0xc5,0xf8,0x2b,0x07]
	; X64-AVX512-NEXT: retq # encoding: [0xc3]			; X64-AVX512-NEXT: retq # encoding: [0xc3]
				;
				; X32-SSE-LABEL: test_mm_stream_si128:
				; X32-SSE: # %bb.0:
				; X32-SSE-NEXT: movntps %xmm0, (%edi) # encoding: [0x67,0x0f,0x2b,0x07]
				; X32-SSE-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX1-LABEL: test_mm_stream_si128:
				; X32-AVX1: # %bb.0:
				; X32-AVX1-NEXT: vmovntps %xmm0, (%edi) # encoding: [0x67,0xc5,0xf8,0x2b,0x07]
				; X32-AVX1-NEXT: retq # encoding: [0xc3]
				;
				; X32-AVX512-LABEL: test_mm_stream_si128:
				; X32-AVX512: # %bb.0:
				; X32-AVX512-NEXT: vmovntps %xmm0, (%edi) # EVEX TO VEX Compression encoding: [0x67,0xc5,0xf8,0x2b,0x07]
				; X32-AVX512-NEXT: retq # encoding: [0xc3]
	store <2 x i64> %a1, <2 x i64>* %a0, align 16, !nontemporal !0			store <2 x i64> %a1, <2 x i64>* %a0, align 16, !nontemporal !0
	ret void			ret void
	}			}

	define <2 x i64> @test_mm_sub_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {			define <2 x i64> @test_mm_sub_epi8(<2 x i64> %a0, <2 x i64> %a1) nounwind {
	; SSE-LABEL: test_mm_sub_epi8:			; SSE-LABEL: test_mm_sub_epi8:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	; SSE-NEXT: psubb %xmm1, %xmm0 # encoding: [0x66,0x0f,0xf8,0xc1]			; SSE-NEXT: psubb %xmm1, %xmm0 # encoding: [0x66,0x0f,0xf8,0xc1]
	[677 lines not shown]

llvm/test/MC/X86/maskmovdqu.s

This file was added.

				// RUN: llvm-mc -triple i386-- --show-encoding %s |\
				// RUN: FileCheck %s --check-prefixes=CHECK,ENCODING

				// RUN: llvm-mc -triple i386-- -filetype=obj %s |\
				// RUN: llvm-objdump -d - | FileCheck %s

				// CHECK-NOT: addr32
				// CHECK: maskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0x66,0x0f,0xf7,0xc1]
				maskmovdqu %xmm1, %xmm0

				// CHECK-NOT: addr32
				// CHECK: vmaskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0xc5,0xf9,0xf7,0xc1]
				vmaskmovdqu %xmm1, %xmm0

llvm/test/MC/X86/maskmovdqu64.s

This file was added.

				// RUN: llvm-mc -triple x86_64-- --show-encoding %s |\
				// RUN: FileCheck %s --check-prefixes=CHECK,ENCODING

				// RUN: llvm-mc -triple x86_64-- -filetype=obj %s |\
				// RUN: llvm-objdump -d - | FileCheck %s

				// CHECK-NOT: addr32
				// CHECK: maskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0x66,0x0f,0xf7,0xc1]
				maskmovdqu %xmm1, %xmm0

				// CHECK-NOT: addr32
				// CHECK: vmaskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0xc5,0xf9,0xf7,0xc1]
				vmaskmovdqu %xmm1, %xmm0

				// CHECK: addr32
				// ENCODING: encoding: [0x67]
				// CHECK: maskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0x66,0x0f,0xf7,0xc1]
				addr32 maskmovdqu %xmm1, %xmm0

				// CHECK: addr32
				// ENCODING: encoding: [0x67]
				// CHECK: vmaskmovdqu %xmm1, %xmm0
				// ENCODING: encoding: [0xc5,0xf9,0xf7,0xc1]
				addr32 vmaskmovdqu %xmm1, %xmm0
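
As a rough illustration of what the last two checks in maskmovdqu64.s expect, the sketch below concatenates the `addr32` prefix byte with the instruction bytes. The byte values are copied from the ENCODING lines above; the helper names (`with_addr32`, `format_encoding`) are hypothetical and are not part of LLVM.

```python
# Byte values copied from the ENCODING lines in maskmovdqu64.s.
ADDR32_PREFIX = bytes([0x67])
MASKMOVDQU = bytes([0x66, 0x0F, 0xF7, 0xC1])
VMASKMOVDQU = bytes([0xC5, 0xF9, 0xF7, 0xC1])


def with_addr32(encoding: bytes) -> bytes:
    """Return the bytes for `addr32` followed by the given instruction.

    Hypothetical helper: it only models the concatenation that the test
    observes as two consecutive encodings, [0x67] and then the instruction.
    """
    return ADDR32_PREFIX + encoding


def format_encoding(encoding: bytes) -> str:
    """Render bytes in the `[0x..,0x..]` style used by --show-encoding."""
    return "[" + ",".join(f"0x{b:02x}" for b in encoding) + "]"


if __name__ == "__main__":
    print(format_encoding(with_addr32(MASKMOVDQU)))
    print(format_encoding(with_addr32(VMASKMOVDQU)))
```

The point of the sketch is only that the prefix and the instruction, emitted separately in the assembly path, add up to the same byte sequence as a single prefixed instruction.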

llvm/utils/TableGen/X86DisassemblerTables.cpp

[96 lines not shown]
return inheritsFrom(child, IC_64BIT_OPSIZE) ||		return inheritsFrom(child, IC_64BIT_OPSIZE) ||
inheritsFrom(child, IC_OPSIZE_ADSIZE);		inheritsFrom(child, IC_OPSIZE_ADSIZE);
case IC_ADSIZE:		case IC_ADSIZE:
return (noPrefix && inheritsFrom(child, IC_OPSIZE_ADSIZE, noPrefix));		return (noPrefix && inheritsFrom(child, IC_OPSIZE_ADSIZE, noPrefix));
case IC_OPSIZE_ADSIZE:		case IC_OPSIZE_ADSIZE:
return false;		return false;
case IC_64BIT_ADSIZE:		case IC_64BIT_ADSIZE:
return (noPrefix && inheritsFrom(child, IC_64BIT_OPSIZE_ADSIZE, noPrefix));		return (noPrefix && inheritsFrom(child, IC_64BIT_OPSIZE_ADSIZE, noPrefix));
case IC_64BIT_OPSIZE_ADSIZE:		case IC_64BIT_OPSIZE_ADSIZE:
return false;		return (noPrefix &&
		inheritsFrom(child, IC_64BIT_VEX_OPSIZE_ADSIZE, noPrefix));
case IC_XD:		case IC_XD:
return inheritsFrom(child, IC_64BIT_XD);		return inheritsFrom(child, IC_64BIT_XD);
case IC_XS:		case IC_XS:
return inheritsFrom(child, IC_64BIT_XS);		return inheritsFrom(child, IC_64BIT_XS);
case IC_XD_OPSIZE:		case IC_XD_OPSIZE:
return inheritsFrom(child, IC_64BIT_XD_OPSIZE);		return inheritsFrom(child, IC_64BIT_XD_OPSIZE);
case IC_XS_OPSIZE:		case IC_XS_OPSIZE:
return inheritsFrom(child, IC_64BIT_XS_OPSIZE);		return inheritsFrom(child, IC_64BIT_XS_OPSIZE);
case IC_XD_ADSIZE:		case IC_XD_ADSIZE:
return inheritsFrom(child, IC_64BIT_XD_ADSIZE);		return inheritsFrom(child, IC_64BIT_XD_ADSIZE);
case IC_XS_ADSIZE:		case IC_XS_ADSIZE:
return inheritsFrom(child, IC_64BIT_XS_ADSIZE);		return inheritsFrom(child, IC_64BIT_XS_ADSIZE);
case IC_64BIT_REXW:		case IC_64BIT_REXW:
return((noPrefix && inheritsFrom(child, IC_64BIT_REXW_XS, noPrefix)) ||		return((noPrefix && inheritsFrom(child, IC_64BIT_REXW_XS, noPrefix)) ||
(noPrefix && inheritsFrom(child, IC_64BIT_REXW_XD, noPrefix)) ||		(noPrefix && inheritsFrom(child, IC_64BIT_REXW_XD, noPrefix)) ||
(noPrefix && inheritsFrom(child, IC_64BIT_REXW_OPSIZE, noPrefix)) ||		(noPrefix && inheritsFrom(child, IC_64BIT_REXW_OPSIZE, noPrefix)) ||
(!AdSize64 && inheritsFrom(child, IC_64BIT_REXW_ADSIZE)));		(!AdSize64 && inheritsFrom(child, IC_64BIT_REXW_ADSIZE)));
case IC_64BIT_OPSIZE:		case IC_64BIT_OPSIZE:
return inheritsFrom(child, IC_64BIT_REXW_OPSIZE) ||		return inheritsFrom(child, IC_64BIT_REXW_OPSIZE) ||
(!AdSize64 && inheritsFrom(child, IC_64BIT_OPSIZE_ADSIZE)) ||		(!AdSize64 && inheritsFrom(child, IC_64BIT_OPSIZE_ADSIZE)) ||
(!AdSize64 && inheritsFrom(child, IC_64BIT_REXW_ADSIZE));		(!AdSize64 && inheritsFrom(child, IC_64BIT_REXW_ADSIZE)) ||
		(!AdSize64 && inheritsFrom(child, IC_64BIT_VEX_OPSIZE_ADSIZE));
case IC_64BIT_XD:		case IC_64BIT_XD:
return(inheritsFrom(child, IC_64BIT_REXW_XD) \|\|		return (inheritsFrom(child, IC_64BIT_REXW_XD) \|\|
(!AdSize64 && inheritsFrom(child, IC_64BIT_XD_ADSIZE)));		(!AdSize64 && inheritsFrom(child, IC_64BIT_XD_ADSIZE)));
case IC_64BIT_XS:		case IC_64BIT_XS:
return(inheritsFrom(child, IC_64BIT_REXW_XS) \|\|		return(inheritsFrom(child, IC_64BIT_REXW_XS) \|\|
(!AdSize64 && inheritsFrom(child, IC_64BIT_XS_ADSIZE)));		(!AdSize64 && inheritsFrom(child, IC_64BIT_XS_ADSIZE)));
case IC_64BIT_XD_OPSIZE:		case IC_64BIT_XD_OPSIZE:
case IC_64BIT_XS_OPSIZE:		case IC_64BIT_XS_OPSIZE:
return false;		return false;
case IC_64BIT_XD_ADSIZE:		case IC_64BIT_XD_ADSIZE:
case IC_64BIT_XS_ADSIZE:		case IC_64BIT_XS_ADSIZE:
Show All 13 Lines	return (VEX_LIG && VEX_WIG && inheritsFrom(child, IC_VEX_L_W_XS)) ||
(VEX_LIG && inheritsFrom(child, IC_VEX_L_XS));		(VEX_LIG && inheritsFrom(child, IC_VEX_L_XS));
case IC_VEX_XD:		case IC_VEX_XD:
return (VEX_LIG && VEX_WIG && inheritsFrom(child, IC_VEX_L_W_XD)) ||		return (VEX_LIG && VEX_WIG && inheritsFrom(child, IC_VEX_L_W_XD)) ||
(VEX_WIG && inheritsFrom(child, IC_VEX_W_XD)) ||		(VEX_WIG && inheritsFrom(child, IC_VEX_W_XD)) ||
(VEX_LIG && inheritsFrom(child, IC_VEX_L_XD));		(VEX_LIG && inheritsFrom(child, IC_VEX_L_XD));
case IC_VEX_OPSIZE:		case IC_VEX_OPSIZE:
return (VEX_LIG && VEX_WIG && inheritsFrom(child, IC_VEX_L_W_OPSIZE)) ||		return (VEX_LIG && VEX_WIG && inheritsFrom(child, IC_VEX_L_W_OPSIZE)) ||
(VEX_WIG && inheritsFrom(child, IC_VEX_W_OPSIZE)) ||		(VEX_WIG && inheritsFrom(child, IC_VEX_W_OPSIZE)) ||
(VEX_LIG && inheritsFrom(child, IC_VEX_L_OPSIZE));		(VEX_LIG && inheritsFrom(child, IC_VEX_L_OPSIZE)) ||
		inheritsFrom(child, IC_64BIT_VEX_OPSIZE);
		case IC_64BIT_VEX_OPSIZE:
		return inheritsFrom(child, IC_64BIT_VEX_OPSIZE_ADSIZE);
		case IC_64BIT_VEX_OPSIZE_ADSIZE:
		return false;
case IC_VEX_W:		case IC_VEX_W:
return VEX_LIG && inheritsFrom(child, IC_VEX_L_W);		return VEX_LIG && inheritsFrom(child, IC_VEX_L_W);
case IC_VEX_W_XS:		case IC_VEX_W_XS:
return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_XS);		return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_XS);
case IC_VEX_W_XD:		case IC_VEX_W_XD:
return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_XD);		return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_XD);
case IC_VEX_W_OPSIZE:		case IC_VEX_W_OPSIZE:
return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_OPSIZE);		return VEX_LIG && inheritsFrom(child, IC_VEX_L_W_OPSIZE);
▲ Show 20 Lines • Show All 708 Lines • ▼ Show 20 Lines	void DisassemblerTables::emitContextTable(raw_ostream &o, unsigned &i) const {
i++;		i++;

for (unsigned index = 0; index < ATTR_max; ++index) {		for (unsigned index = 0; index < ATTR_max; ++index) {
o.indent(i * 2);		o.indent(i * 2);

if ((index & ATTR_EVEX) || (index & ATTR_VEX) || (index & ATTR_VEXL)) {		if ((index & ATTR_EVEX) || (index & ATTR_VEX) || (index & ATTR_VEXL)) {
if (index & ATTR_EVEX)		if (index & ATTR_EVEX)
o << "IC_EVEX";		o << "IC_EVEX";
		else if ((index & (ATTR_64BIT | ATTR_VEXL | ATTR_REXW | ATTR_OPSIZE)) ==
		(ATTR_64BIT | ATTR_OPSIZE))
		o << "IC_64BIT_VEX";
else		else
o << "IC_VEX";		o << "IC_VEX";

if ((index & ATTR_EVEX) && (index & ATTR_EVEXL2))		if ((index & ATTR_EVEX) && (index & ATTR_EVEXL2))
o << "_L2";		o << "_L2";
else if (index & ATTR_VEXL)		else if (index & ATTR_VEXL)
o << "_L";		o << "_L";

if (index & ATTR_REXW)		if (index & ATTR_REXW)
o << "_W";		o << "_W";

if (index & ATTR_OPSIZE)		if (index & ATTR_OPSIZE) {
o << "_OPSIZE";		o << "_OPSIZE";
else if (index & ATTR_XD)		if ((index & (ATTR_64BIT | ATTR_EVEX | ATTR_VEX | ATTR_VEXL |
		ATTR_REXW | ATTR_ADSIZE)) ==
		(ATTR_64BIT | ATTR_VEX | ATTR_ADSIZE))
		o << "_ADSIZE";
		} else if (index & ATTR_XD)
o << "_XD";		o << "_XD";
else if (index & ATTR_XS)		else if (index & ATTR_XS)
o << "_XS";		o << "_XS";

if ((index & ATTR_EVEX)) {		if ((index & ATTR_EVEX)) {
if (index & ATTR_EVEXKZ)		if (index & ATTR_EVEXKZ)
o << "_KZ";		o << "_KZ";
else if (index & ATTR_EVEXK)		else if (index & ATTR_EVEXK)
o << "_K";		o << "_K";

if (index & ATTR_EVEXB)		if (index & ATTR_EVEXB)
o << "_B";		o << "_B";
}		}
}		} else if ((index & ATTR_64BIT) && (index & ATTR_REXW) && (index & ATTR_XS))
else if ((index & ATTR_64BIT) && (index & ATTR_REXW) && (index & ATTR_XS))
o << "IC_64BIT_REXW_XS";		o << "IC_64BIT_REXW_XS";
else if ((index & ATTR_64BIT) && (index & ATTR_REXW) && (index & ATTR_XD))		else if ((index & ATTR_64BIT) && (index & ATTR_REXW) && (index & ATTR_XD))
o << "IC_64BIT_REXW_XD";		o << "IC_64BIT_REXW_XD";
else if ((index & ATTR_64BIT) && (index & ATTR_REXW) &&		else if ((index & ATTR_64BIT) && (index & ATTR_REXW) &&
(index & ATTR_OPSIZE))		(index & ATTR_OPSIZE))
o << "IC_64BIT_REXW_OPSIZE";		o << "IC_64BIT_REXW_OPSIZE";
else if ((index & ATTR_64BIT) && (index & ATTR_REXW) &&		else if ((index & ATTR_64BIT) && (index & ATTR_REXW) &&
(index & ATTR_ADSIZE))		(index & ATTR_ADSIZE))
▲ Show 20 Lines • Show All 168 Lines • Show Last 20 Lines
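
As a reading aid for the emitContextTable change above, here is a minimal standalone sketch of how the new VEX branch turns an attribute mask into a context name. It covers only the non-EVEX path. The function name vexContextName and the bit values assigned to the ATTR_* constants are hypothetical and chosen for illustration; the real constants live in X86DisassemblerDecoderCommon.h and may differ.

```cpp
#include <string>

// Hypothetical bit assignments, for illustration only. The real ATTR_*
// values are defined in X86DisassemblerDecoderCommon.h.
enum : unsigned {
  ATTR_64BIT = 1u << 0,
  ATTR_XS = 1u << 1,
  ATTR_XD = 1u << 2,
  ATTR_REXW = 1u << 3,
  ATTR_OPSIZE = 1u << 4,
  ATTR_ADSIZE = 1u << 5,
  ATTR_VEX = 1u << 6,
  ATTR_VEXL = 1u << 7,
  ATTR_EVEX = 1u << 8,
};

// Mirrors the non-EVEX part of the VEX branch in the patched
// emitContextTable. The caller is assumed to pass an index that has
// ATTR_VEX or ATTR_VEXL set and ATTR_EVEX clear.
std::string vexContextName(unsigned index) {
  std::string name;

  // 64-bit mode with OPSIZE, and neither VEX.L nor REX.W, selects the
  // new IC_64BIT_VEX family.
  if ((index & (ATTR_64BIT | ATTR_VEXL | ATTR_REXW | ATTR_OPSIZE)) ==
      (ATTR_64BIT | ATTR_OPSIZE))
    name = "IC_64BIT_VEX";
  else
    name = "IC_VEX";

  if (index & ATTR_VEXL)
    name += "_L";
  if (index & ATTR_REXW)
    name += "_W";

  if (index & ATTR_OPSIZE) {
    name += "_OPSIZE";
    // _ADSIZE is appended only for plain 64-bit VEX with the address-size
    // attribute set, matching the mask test added by the patch.
    if ((index & (ATTR_64BIT | ATTR_EVEX | ATTR_VEX | ATTR_VEXL | ATTR_REXW |
                  ATTR_ADSIZE)) == (ATTR_64BIT | ATTR_VEX | ATTR_ADSIZE))
      name += "_ADSIZE";
  } else if (index & ATTR_XD) {
    name += "_XD";
  } else if (index & ATTR_XS) {
    name += "_XS";
  }
  return name;
}
```

Under these assumptions, a mask with ATTR_64BIT, ATTR_VEX, ATTR_OPSIZE and ATTR_ADSIZE set maps to IC_64BIT_VEX_OPSIZE_ADSIZE, while adding ATTR_VEXL falls back to IC_VEX_L_OPSIZE because both mask comparisons require VEX.L to be clear.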

llvm/utils/TableGen/X86RecognizableInstr.cpp

Show First 20 Lines • Show All 119 Lines • ▼ Show 20 Lines	for (unsigned i = 0, e = Predicates.size(); i != e; ++i) {
}		}
}		}

if (Form == X86Local::Pseudo || (IsCodeGenOnly && !ForceDisassemble)) {		if (Form == X86Local::Pseudo || (IsCodeGenOnly && !ForceDisassemble)) {
ShouldBeEmitted = false;		ShouldBeEmitted = false;
return;		return;
}		}

// Special case since there is no attribute class for 64-bit and VEX
if (Name == "VMASKMOVDQU64") {
ShouldBeEmitted = false;
return;
}

ShouldBeEmitted = true;		ShouldBeEmitted = true;
}		}

void RecognizableInstr::processInstr(DisassemblerTables &tables,		void RecognizableInstr::processInstr(DisassemblerTables &tables,
const CodeGenInstruction &insn,		const CodeGenInstruction &insn,
InstrUID uid)		InstrUID uid)
{		{
// Ignore "asm parser only" instructions.		// Ignore "asm parser only" instructions.
if (insn.TheDef->getValueAsBit("isAsmParserOnly"))		if (insn.TheDef->getValueAsBit("isAsmParserOnly"))
▲ Show 20 Lines • Show All 119 Lines • ▼ Show 20 Lines	if (HasVEX_LPrefix && HasVEX_W) {
else {		else {
errs() << "Instruction does not use a prefix: " << Name << "\n";		errs() << "Instruction does not use a prefix: " << Name << "\n";
llvm_unreachable("Invalid prefix");		llvm_unreachable("Invalid prefix");
}		}
} else if (OpPrefix == X86Local::PD && HasVEX_LPrefix)		} else if (OpPrefix == X86Local::PD && HasVEX_LPrefix)
insnContext = IC_VEX_L_OPSIZE;		insnContext = IC_VEX_L_OPSIZE;
else if (OpPrefix == X86Local::PD && HasVEX_W)		else if (OpPrefix == X86Local::PD && HasVEX_W)
insnContext = IC_VEX_W_OPSIZE;		insnContext = IC_VEX_W_OPSIZE;
		else if (OpPrefix == X86Local::PD && Is64Bit &&
		AdSize == X86Local::AdSize32)
		insnContext = IC_64BIT_VEX_OPSIZE_ADSIZE;
		else if (OpPrefix == X86Local::PD && Is64Bit)
		insnContext = IC_64BIT_VEX_OPSIZE;
else if (OpPrefix == X86Local::PD)		else if (OpPrefix == X86Local::PD)
insnContext = IC_VEX_OPSIZE;		insnContext = IC_VEX_OPSIZE;
else if (HasVEX_LPrefix && OpPrefix == X86Local::XS)		else if (HasVEX_LPrefix && OpPrefix == X86Local::XS)
insnContext = IC_VEX_L_XS;		insnContext = IC_VEX_L_XS;
else if (HasVEX_LPrefix && OpPrefix == X86Local::XD)		else if (HasVEX_LPrefix && OpPrefix == X86Local::XD)
insnContext = IC_VEX_L_XD;		insnContext = IC_VEX_L_XD;
else if (HasVEX_W && OpPrefix == X86Local::XS)		else if (HasVEX_W && OpPrefix == X86Local::XS)
insnContext = IC_VEX_W_XS;		insnContext = IC_VEX_W_XS;
▲ Show 20 Lines • Show All 990 Lines • Show Last 20 Lines

Revision Contents

Diff 359138
