This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/CodeGen/SystemZ/
-
test/
-
CodeGen/
-
SystemZ/
-
builtins-systemz-zvector-constrained.c
-
builtins-systemz-zvector.c
-
builtins-systemz-zvector2-constrained.c
-
builtins-systemz-zvector2.c
-
llvm/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
1
InstCombineSimplifyDemanded.cpp
-
test/
-
CodeGen/AMDGPU/
-
AMDGPU/
-
vector-alloca-bitcast.ll
-
Transforms/
-
InstCombine/
-
AArch64/
-
tbl1.ll
-
ARM/
-
tbl1.ll
-
X86/
-
clmulqdq.ll
-
x86-addsub-inseltpoison.ll
-
x86-addsub.ll
-
x86-avx512-inseltpoison.ll
-
x86-avx512.ll
-
x86-insertps.ll
-
x86-masked-memops.ll
-
x86-muldq.ll
-
x86-pack-inseltpoison.ll
-
x86-pack.ll
-
x86-pshufb.ll
-
x86-sse-inseltpoison.ll
-
x86-sse.ll
-
x86-sse2-inseltpoison.ll
-
x86-sse2.ll
-
x86-sse41-inseltpoison.ll
-
x86-sse41.ll
-
x86-sse4a.ll
-
x86-vec_demanded_elts-inseltpoison.ll
-
x86-vec_demanded_elts.ll
-
x86-vector-shifts-inseltpoison.ll
-
x86-vector-shifts.ll
-
x86-vpermil.ll
-
x86-xop-inseltpoison.ll
-
x86-xop.ll
-
bitcast-bigendian.ll
-
bitcast-inseltpoison.ll
-
bitcast-vec-canon-inseltpoison.ll
-
bitcast-vec-canon.ll
-
bitcast.ll
-
broadcast-inseltpoison.ll
-
broadcast.ll
-
cast.ll
-
getelementptr.ll
-
insert-const-shuf.ll
-
insert-extract-shuffle-inseltpoison.ll
-
insert-extract-shuffle.ll
-
masked_intrinsics-inseltpoison.ll
-
masked_intrinsics.ll
-
minmax-fold.ll
-
pr2645-0.ll
-
shuffle_select.ll
-
shufflevector-div-rem-inseltpoison.ll
-
shufflevector-div-rem.ll
-
sub-of-negatible.ll
-
trunc.ll
-
vec_demanded_elts-inseltpoison.ll
-
vec_demanded_elts.ll
-
vec_shuffle-inseltpoison.ll
-
vec_shuffle.ll
-
vector-casts-inseltpoison.ll
-
vector-casts.ll
-
vector_insertelt_shuffle-inseltpoison.ll
-
vector_insertelt_shuffle.ll
-
LoopVectorize/
-
X86/
-
invariant-load-gather.ll
-
induction.ll
-
reduction-inloop-pred.ll
-
reduction-inloop.ll
-
PhaseOrdering/X86/
-
X86/
-
horiz-math-inseltpoison.ll
-
horiz-math.ll
-
SLPVectorizer/
-
AArch64/
-
getelementptr.ll
-
X86/
-
alternate-cast-inseltpoison.ll
-
alternate-cast.ll
-
alternate-int-inseltpoison.ll
-
blending-shuffle-inseltpoison.ll

Differential D93586

[InstCombine] use poison as placeholder for undemanded elems
ClosedPublic

Authored by aqjune on Dec 19 2020, 1:03 PM.

Download Raw Diff

Details

Reviewers

spatel
lebedev.ri
efriedma
nlopes
regehr
RKSimon
zhengyangl
nikic
hfinkel

Commits

rG9d70dbdc2bf2: [InstCombine] use poison as placeholder for undemanded elems

Summary

Currently undef is used as a don’t-care vector when constructing a vector using a series of insertelement.
However, this is problematic because undef isn’t undefined enough.
Especially, a sequence of insertelement can be optimized to shufflevector, but using undef as its placeholder makes shufflevector a poison-blocking instruction because undef cannot be optimized to poison.
This makes a few straightforward optimizations incorrect, such as:

;  https://bugs.llvm.org/show_bug.cgi?id=44185 

define <4 x float> @insert_not_undef_shuffle_translate_commute(float %x, <4 x float> %y, <4 x float> %q) {
  %xv = insertelement <4 x float> %q, float %x, i32 2
  %r = shufflevector <4 x float> %y, <4 x float> %xv, <4 x i32> { 0, 6, 2, undef }
  ret <4 x float> %r ; %r[3] is undef
}
=>
define <4 x float> @insert_not_undef_shuffle_translate_commute(float %x, <4 x float> %y, <4 x float> %q) {
  %r = insertelement <4 x float> %y, float %x, i32 1
  ret <4 x float> %r ; %r[3] = %y[3], incorrect if %y[3] = poison
}

Transformation doesn't verify!
ERROR: Target is more poisonous than source

I’d like to suggest

Using poison as insertelement’s placeholder value (IRBuilder::CreateVectorSplat should be patched too)
Updating shufflevector’s semantics to return poison element if mask is undef

Note that poison is currently lowered into UNDEF in SelDag, so codegen part is okay.
m_Undef() matches PoisonValue as well, so existing optimizations will still fire.

The only concern is hidden miscompilations that will go incorrect when poison constant is given.
A conservative way is copying all tests having insertelement undef & replacing it with insertelement poison & run Alive2 on it, but it will create many tests and people won’t like it. :(

Instead, I’ll simply locally maintain the tests and run Alive2.
If there is any bug found, I’ll report it.

Relevant links: https://bugs.llvm.org/show_bug.cgi?id=43958 , http://lists.llvm.org/pipermail/llvm-dev/2019-November/137242.html

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aqjune created this revision.Dec 19 2020, 1:03 PM

Herald added subscribers: kerbowa, hiraditya, nhaehnle, jvesely. · View Herald TranscriptDec 19 2020, 1:03 PM

aqjune requested review of this revision.Dec 19 2020, 1:03 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptDec 19 2020, 1:03 PM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

aqjune mentioned this in D93229: [VectorCombine] allow peeking through GEPs when creating a vector load.Dec 19 2020, 1:28 PM

Harbormaster completed remote builds in B83054: Diff 312947.Dec 19 2020, 1:45 PM

pengfei added a subscriber: pengfei.Dec 20 2020, 4:53 AM

Thank you for working on this! Looking back at the bug comments (and adding reviewers based on those comments), this is a step towards killing undef that has been discussed for a long time now. :)

Besides changing IRBuilder and shufflevector's definition, I think we'll also need updates in the vectorizers, InstSimplify, and other places in InstCombine that use UndefValue for InsertElement and shuffle transforms.

A conservative way is copying all tests having insertelement undef & replacing it with insertelement poison & run Alive2 on it, but it will create many tests and people won’t like it. :(

Do you have an estimate of how many tests are out there? If it's a temporary increase and not huge, I don't think there is much downside. But if we think the transition will take a long time, then maybe we don't want the duplication.

In D93586#2464841, @spatel wrote:

Besides changing IRBuilder and shufflevector's definition, I think we'll also need updates in the vectorizers, InstSimplify, and other places in InstCombine that use UndefValue for InsertElement and shuffle transforms.

Thank you for the pointer!

Do you have an estimate of how many tests are out there? If it's a temporary increase and not huge, I don't think there is much downside. But if we think the transition will take a long time, then maybe we don't want the duplication.

Using this command (which counts insertelement <..> undef in non-checks):

grep -R -E '^[^;]*insertelement <.*> undef,' . | cut -d":" -f1 | uniq | wc -l

There are 792 files in llvm/test, most of which are in test/Transform (119) and test/CodeGen(655).
The transition will be swiftly done (if there's no other issue hopefully) by the next weekend.

My concern is that some tests aren't using utils/update_test_checks.py, and this makes things complicated. :(
Simply replacing insertelement undef at CHECK: isn't enough (ex: Transforms/SLPVectorizer/X86/alternate-cast.ll).
What do you think?

In D93586#2466284, @aqjune wrote:

The transition will be swiftly done (if there's no other issue hopefully) by the next weekend.

Oops, I meant this weekend hopefully.

There are 792 files in llvm/test, most of which are in test/Transform (119) and test/CodeGen(655).
The transition will be swiftly done (if there's no other issue hopefully) by the next weekend.

Thinking about these again, do we need to make a poison copy for test/CodeGen? I don't think so, since the backend won't be changed anyway.

I'm sorry I've only just started looking at this - are you saying that you want to handle all "insertelement undef" cases in one go and not just a series of patcches after this one?

In D93586#2467844, @RKSimon wrote:

I'm sorry I've only just started looking at this - are you saying that you want to handle all "insertelement undef" cases in one go and not just a series of patcches after this one?

It will be treated in a series of patches. There are places other than SimplifyDemandedVectorElts that create insertelement undef. I'll create these and link them as children.

In D93586#2467248, @aqjune wrote:

There are 792 files in llvm/test, most of which are in test/Transform (119) and test/CodeGen(655).
The transition will be swiftly done (if there's no other issue hopefully) by the next weekend.

Thinking about these again, do we need to make a poison copy for test/CodeGen? I don't think so, since the backend won't be changed anyway.

It would be good to update those for consistency; the codegen tests are supposed to be representative of what comes out of the IR optimizer. IIUC, we could do the substitution on those files already, and it would not change anything. But let's sort out the IR changes first?

In D93586#2466284, @aqjune wrote:

My concern is that some tests aren't using utils/update_test_checks.py, and this makes things complicated. :(
Simply replacing insertelement undef at CHECK: isn't enough (ex: Transforms/SLPVectorizer/X86/alternate-cast.ll).

Yes, tests that don't have scripted CHECK lines require more work to understand. That SLP test file is scripted though. Is there another problem with that one?

aqjune mentioned this in rGdb7a2f347f13: Precommit transform tests that have poison as insertelement's placeholder.Dec 23 2020, 6:46 PM

Rebase after adding poison tests
(rGdb7a2f347f13 , rG303654724896)

In D93586#2468350, @spatel wrote:

It would be good to update those for consistency; the codegen tests are supposed to be representative of what comes out of the IR optimizer. IIUC, we could do the substitution on those files already, and it would not change anything. But let's sort out the IR changes first?

Yep, let's start with IR changes first. I made undef->poison test commits already.

In D93586#2466284, @aqjune wrote:

Yes, tests that don't have scripted CHECK lines require more work to understand. That SLP test file is scripted though. Is there another problem with that one?

Oh, I simply meant that replacing "insertelement undef" with "insertelement poison" won't work when updating CHECK part of non-scripted tests.
It seems things are okay because there aren't too many tests needed to be fixed.

Harbormaster completed remote builds in B83451: Diff 313645.Dec 23 2020, 8:17 PM

aqjune added a child revision: D93793: [IR] Let IRBuilder's CreateVectorSplat/CreateShuffleVector use poison as placeholder.Dec 23 2020, 9:03 PM

I'll make two more patches - the instsimplify/vectorizer/.... changes that make insertelement poison, and the langref update to shufflevector.

aqjune added a child revision: D93817: [InstCombine] Update transformations to use poison for insertelement/shufflevector's placeholder value (2/2).Dec 25 2020, 3:15 PM

aqjune mentioned this in D93818: [LangRef] Update shufflevector's semantics to return poison if the mask is undef.Dec 25 2020, 3:30 PM

aqjune added a child revision: D93818: [LangRef] Update shufflevector's semantics to return poison if the mask is undef.Dec 25 2020, 3:31 PM

yubing added a subscriber: yubing.Dec 26 2020, 5:54 AM

LGTM

llvm/lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp
1072–1073	undef -> poison

This revision is now accepted and ready to land.Dec 27 2020, 1:15 PM

There are 3 more patches:

https://reviews.llvm.org/D93793 (IRBuilder's CreateVectorSplat)
https://reviews.llvm.org/D93817 (Other transformations)
https://reviews.llvm.org/D93818 (LangRef)

Would it be desirable if I land all of these at once as well as this (93586) when they are accepted, or is incrementally landing accepted patches okay?

In D93586#2472314, @aqjune wrote:

There are 3 more patches:

https://reviews.llvm.org/D93793 (IRBuilder's CreateVectorSplat)
https://reviews.llvm.org/D93817 (Other transformations)
https://reviews.llvm.org/D93818 (LangRef)

Would it be desirable if I land all of these at once as well as this (93586) when they are accepted, or is incrementally landing accepted patches okay?

Unless I'm missing something, this patch in particular is a pure optimization improvement, that does not change semantics, so I don't see a problem with landing it right away.

Okay, I'll gently land this.
I was just wondering whether insertelements having heterogenous placeholder would be problematic (this patch makes some of insertelement use poison, but not all), but it may not matter very much.

Address a comment

This revision was landed with ongoing or failed builds.Dec 27 2020, 3:58 PM

Closed by commit rG9d70dbdc2bf2: [InstCombine] use poison as placeholder for undemanded elems (authored by aqjune). · Explain Why

This revision was automatically updated to reflect the committed changes.

aqjune added a commit: rG9d70dbdc2bf2: [InstCombine] use poison as placeholder for undemanded elems.

Harbormaster completed remote builds in B83553: Diff 313807.Dec 27 2020, 4:28 PM

sdesmalen mentioned this in D107254: [VectorUtils] Teach findScalarElement to return splat value..Aug 2 2021, 3:08 AM

Revision Contents

Path

Size

clang/

test/

CodeGen/

SystemZ/

builtins-systemz-zvector-constrained.c

2 lines

builtins-systemz-zvector.c

18 lines

builtins-systemz-zvector2-constrained.c

4 lines

builtins-systemz-zvector2.c

4 lines

llvm/

lib/

Transforms/

InstCombine/

InstCombineSimplifyDemanded.cpp

23 lines

test/

CodeGen/

AMDGPU/

vector-alloca-bitcast.ll

2 lines

Transforms/

InstCombine/

AArch64/

tbl1.ll

2 lines

ARM/

tbl1.ll

2 lines

X86/

clmulqdq.ll

18 lines

x86-addsub-inseltpoison.ll

4 lines

x86-addsub.ll

4 lines

x86-avx512-inseltpoison.ll

40 lines

40 lines

4 lines

16 lines

4 lines

x86-pack-inseltpoison.ll

4 lines

x86-pack.ll

4 lines

x86-pshufb.ll

72 lines

x86-sse-inseltpoison.ll

70 lines

x86-sse.ll

70 lines

x86-sse2-inseltpoison.ll

66 lines

x86-sse2.ll

66 lines

x86-sse41-inseltpoison.ll

18 lines

x86-sse41.ll

18 lines

x86-sse4a.ll

18 lines

x86-vec_demanded_elts-inseltpoison.ll

22 lines

x86-vec_demanded_elts.ll

43 lines

x86-vector-shifts-inseltpoison.ll

20 lines

x86-vector-shifts.ll

20 lines

x86-vpermil.ll

2 lines

x86-xop-inseltpoison.ll

4 lines

x86-xop.ll

4 lines

bitcast-bigendian.ll

6 lines

bitcast-inseltpoison.ll

6 lines

bitcast-vec-canon-inseltpoison.ll

2 lines

bitcast-vec-canon.ll

4 lines

bitcast.ll

6 lines

broadcast-inseltpoison.ll

4 lines

4 lines

8 lines

4 lines

18 lines

insert-extract-shuffle-inseltpoison.ll

14 lines

insert-extract-shuffle.ll

14 lines

masked_intrinsics-inseltpoison.ll

8 lines

8 lines

2 lines

3 lines

22 lines

shufflevector-div-rem-inseltpoison.ll

2 lines

shufflevector-div-rem.ll

2 lines

sub-of-negatible.ll

2 lines

trunc.ll

2 lines

vec_demanded_elts-inseltpoison.ll

44 lines

vec_demanded_elts.ll

44 lines

vec_shuffle-inseltpoison.ll

55 lines

vec_shuffle.ll

52 lines

vector-casts-inseltpoison.ll

2 lines

vector-casts.ll

2 lines

vector_insertelt_shuffle-inseltpoison.ll

4 lines

vector_insertelt_shuffle.ll

22 lines

LoopVectorize/

X86/

invariant-load-gather.ll

4 lines

induction.ll

2 lines

reduction-inloop-pred.ll

2 lines

reduction-inloop.ll

2 lines

PhaseOrdering/

X86/

horiz-math-inseltpoison.ll

2 lines

horiz-math.ll

2 lines

SLPVectorizer/

AArch64/

getelementptr.ll

6 lines

X86/

alternate-cast-inseltpoison.ll

4 lines

alternate-cast.ll

4 lines

alternate-int-inseltpoison.ll

2 lines

blending-shuffle-inseltpoison.ll

2 lines

Diff 313808

clang/test/CodeGen/SystemZ/builtins-systemz-zvector-constrained.c

Show All 35 Lines	void test_core(void) {
// CHECK: insertelement <2 x double> %{{.}}, double %{{.}}, i32 %{{.*}}		// CHECK: insertelement <2 x double> %{{.}}, double %{{.}}, i32 %{{.*}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg

vd = vec_promote(d, idx);		vd = vec_promote(d, idx);
// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}		// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg

vd = vec_insert_and_zero(cptrd);		vd = vec_insert_and_zero(cptrd);
// CHECK: [[ZVEC:%[^ ]+]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double {{.*}}, i32 0		// CHECK: [[ZVEC:%[^ ]+]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double {{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg

vd = vec_revb(vd);		vd = vec_revb(vd);
// CHECK-ASM: vperm		// CHECK-ASM: vperm

vd = vec_reve(vd);		vd = vec_reve(vd);
// CHECK-ASM: {{vperm\|vpdi}}		// CHECK-ASM: {{vperm\|vpdi}}

▲ Show 20 Lines • Show All 265 Lines • Show Last 20 Lines

clang/test/CodeGen/SystemZ/builtins-systemz-zvector.c

Show First 20 Lines • Show All 188 Lines • ▼ Show 20 Lines	void test_core(void) {
vul = vec_promote(ul, idx);		vul = vec_promote(ul, idx);
// CHECK: insertelement <2 x i64> undef, i64 %{{.}}, i32 %{{.}}		// CHECK: insertelement <2 x i64> undef, i64 %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg
vd = vec_promote(d, idx);		vd = vec_promote(d, idx);
// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}		// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg

vsc = vec_insert_and_zero(cptrsc);		vsc = vec_insert_and_zero(cptrsc);
// CHECK: insertelement <16 x i8> <i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 undef, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>, i8 %{{.*}}, i32 7		// CHECK: insertelement <16 x i8> <i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 poison, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>, i8 %{{.*}}, i32 7
// CHECK-ASM: vllezb		// CHECK-ASM: vllezb
vuc = vec_insert_and_zero(cptruc);		vuc = vec_insert_and_zero(cptruc);
// CHECK: insertelement <16 x i8> <i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 undef, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>, i8 %{{.*}}, i32 7		// CHECK: insertelement <16 x i8> <i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 poison, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>, i8 %{{.*}}, i32 7
// CHECK-ASM: vllezb		// CHECK-ASM: vllezb
vss = vec_insert_and_zero(cptrss);		vss = vec_insert_and_zero(cptrss);
// CHECK: insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 0>, i16 %{{.*}}, i32 3		// CHECK: insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 0>, i16 %{{.*}}, i32 3
// CHECK-ASM: vllezh		// CHECK-ASM: vllezh
vus = vec_insert_and_zero(cptrus);		vus = vec_insert_and_zero(cptrus);
// CHECK: insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 0>, i16 %{{.*}}, i32 3		// CHECK: insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 0>, i16 %{{.*}}, i32 3
// CHECK-ASM: vllezh		// CHECK-ASM: vllezh
vsi = vec_insert_and_zero(cptrsi);		vsi = vec_insert_and_zero(cptrsi);
// CHECK: insertelement <4 x i32> <i32 0, i32 undef, i32 0, i32 0>, i32 %{{.*}}, i32 1		// CHECK: insertelement <4 x i32> <i32 0, i32 poison, i32 0, i32 0>, i32 %{{.*}}, i32 1
// CHECK-ASM: vllezf		// CHECK-ASM: vllezf
vui = vec_insert_and_zero(cptrui);		vui = vec_insert_and_zero(cptrui);
// CHECK: insertelement <4 x i32> <i32 0, i32 undef, i32 0, i32 0>, i32 %{{.*}}, i32 1		// CHECK: insertelement <4 x i32> <i32 0, i32 poison, i32 0, i32 0>, i32 %{{.*}}, i32 1
// CHECK-ASM: vllezf		// CHECK-ASM: vllezf
vsl = vec_insert_and_zero(cptrsl);		vsl = vec_insert_and_zero(cptrsl);
// CHECK: insertelement <2 x i64> <i64 undef, i64 0>, i64 %{{.*}}, i32 0		// CHECK: insertelement <2 x i64> <i64 poison, i64 0>, i64 %{{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg
vul = vec_insert_and_zero(cptrul);		vul = vec_insert_and_zero(cptrul);
// CHECK: insertelement <2 x i64> <i64 undef, i64 0>, i64 %{{.*}}, i32 0		// CHECK: insertelement <2 x i64> <i64 poison, i64 0>, i64 %{{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg
vd = vec_insert_and_zero(cptrd);		vd = vec_insert_and_zero(cptrd);
// CHECK: insertelement <2 x double> <double undef, double 0.000000e+00>, double %{{.*}}, i32 0		// CHECK: insertelement <2 x double> <double poison, double 0.000000e+00>, double %{{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg

vsc = vec_perm(vsc, vsc, vuc);		vsc = vec_perm(vsc, vsc, vuc);
// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})		// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})
// CHECK-ASM: vperm		// CHECK-ASM: vperm
vuc = vec_perm(vuc, vuc, vuc);		vuc = vec_perm(vuc, vuc, vuc);
// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})		// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})
// CHECK-ASM: vperm		// CHECK-ASM: vperm
▲ Show 20 Lines • Show All 4,393 Lines • Show Last 20 Lines

clang/test/CodeGen/SystemZ/builtins-systemz-zvector2-constrained.c

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	void test_core(void) {
vf = vec_promote(f, idx);		vf = vec_promote(f, idx);
// CHECK: insertelement <4 x float> undef, float %{{.}}, i32 %{{.}}		// CHECK: insertelement <4 x float> undef, float %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgf		// CHECK-ASM: vlvgf
vd = vec_promote(d, idx);		vd = vec_promote(d, idx);
// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}		// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg

vf = vec_insert_and_zero(cptrf);		vf = vec_insert_and_zero(cptrf);
// CHECK: insertelement <4 x float> <float 0.000000e+00, float undef, float 0.000000e+00, float 0.000000e+00>, float {{.*}}, i32 1		// CHECK: insertelement <4 x float> <float 0.000000e+00, float poison, float 0.000000e+00, float 0.000000e+00>, float {{.*}}, i32 1
// CHECK-ASM: vllezf		// CHECK-ASM: vllezf
vd = vec_insert_and_zero(cptrd);		vd = vec_insert_and_zero(cptrd);
// CHECK: insertelement <2 x double> <double undef, double 0.000000e+00>, double %{{.*}}, i32 0		// CHECK: insertelement <2 x double> <double poison, double 0.000000e+00>, double %{{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg

vf = vec_revb(vf);		vf = vec_revb(vf);
// CHECK-ASM: vperm		// CHECK-ASM: vperm
vd = vec_revb(vd);		vd = vec_revb(vd);
// CHECK-ASM: vperm		// CHECK-ASM: vperm

vf = vec_reve(vf);		vf = vec_reve(vf);
▲ Show 20 Lines • Show All 452 Lines • Show Last 20 Lines

clang/test/CodeGen/SystemZ/builtins-systemz-zvector2.c

Show First 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	void test_core(void) {
vf = vec_promote(f, idx);		vf = vec_promote(f, idx);
// CHECK: insertelement <4 x float> undef, float %{{.}}, i32 %{{.}}		// CHECK: insertelement <4 x float> undef, float %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgf		// CHECK-ASM: vlvgf
vd = vec_promote(d, idx);		vd = vec_promote(d, idx);
// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}		// CHECK: insertelement <2 x double> undef, double %{{.}}, i32 %{{.}}
// CHECK-ASM: vlvgg		// CHECK-ASM: vlvgg

vf = vec_insert_and_zero(cptrf);		vf = vec_insert_and_zero(cptrf);
// CHECK: insertelement <4 x float> <float 0.000000e+00, float undef, float 0.000000e+00, float 0.000000e+00>, float %{{.*}}, i32 1		// CHECK: insertelement <4 x float> <float 0.000000e+00, float poison, float 0.000000e+00, float 0.000000e+00>, float %{{.*}}, i32 1
// CHECK-ASM: vllezf		// CHECK-ASM: vllezf
vd = vec_insert_and_zero(cptrd);		vd = vec_insert_and_zero(cptrd);
// CHECK: insertelement <2 x double> <double undef, double 0.000000e+00>, double %{{.*}}, i32 0		// CHECK: insertelement <2 x double> <double poison, double 0.000000e+00>, double %{{.*}}, i32 0
// CHECK-ASM: vllezg		// CHECK-ASM: vllezg

vf = vec_perm(vf, vf, vuc);		vf = vec_perm(vf, vf, vuc);
// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})		// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})
// CHECK-ASM: vperm		// CHECK-ASM: vperm
vd = vec_perm(vd, vd, vuc);		vd = vec_perm(vd, vd, vuc);
// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})		// CHECK: call <16 x i8> @llvm.s390.vperm(<16 x i8> %{{.}}, <16 x i8> %{{.}}, <16 x i8> %{{.*}})
// CHECK-ASM: vperm		// CHECK-ASM: vperm
▲ Show 20 Lines • Show All 723 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp

Show First 20 Lines • Show All 1,015 Lines • ▼ Show 20 Lines	if ((BitMask1 & DemandedMask) == (BitMask2 & DemandedMask)) {

return InsertNewInstWith(New, *Shl);		return InsertNewInstWith(New, *Shl);
}		}

return nullptr;		return nullptr;
}		}

/// The specified value produces a vector with any number of elements.		/// The specified value produces a vector with any number of elements.
/// This method analyzes which elements of the operand are undef and returns		/// This method analyzes which elements of the operand are undef or poison and
/// that information in UndefElts.		/// returns that information in UndefElts.
///		///
/// DemandedElts contains the set of elements that are actually used by the		/// DemandedElts contains the set of elements that are actually used by the
/// caller, and by default (AllowMultipleUsers equals false) the value is		/// caller, and by default (AllowMultipleUsers equals false) the value is
/// simplified only if it has a single caller. If AllowMultipleUsers is set		/// simplified only if it has a single caller. If AllowMultipleUsers is set
/// to true, DemandedElts refers to the union of sets of elements that are		/// to true, DemandedElts refers to the union of sets of elements that are
/// used by all callers.		/// used by all callers.
///		///
/// If the information about demanded elements can be used to simplify the		/// If the information about demanded elements can be used to simplify the
Show All 9 Lines	Value InstCombinerImpl::SimplifyDemandedVectorElts(Value V,
if (isa<ScalableVectorType>(V->getType()))		if (isa<ScalableVectorType>(V->getType()))
return nullptr;		return nullptr;

unsigned VWidth = cast<FixedVectorType>(V->getType())->getNumElements();		unsigned VWidth = cast<FixedVectorType>(V->getType())->getNumElements();
APInt EltMask(APInt::getAllOnesValue(VWidth));		APInt EltMask(APInt::getAllOnesValue(VWidth));
assert((DemandedElts & ~EltMask) == 0 && "Invalid DemandedElts!");		assert((DemandedElts & ~EltMask) == 0 && "Invalid DemandedElts!");

if (isa<UndefValue>(V)) {		if (isa<UndefValue>(V)) {
// If the entire vector is undefined, just return this info.		// If the entire vector is undef or poison, just return this info.
UndefElts = EltMask;		UndefElts = EltMask;
return nullptr;		return nullptr;
}		}

if (DemandedElts.isNullValue()) { // If nothing is demanded, provide undef.		if (DemandedElts.isNullValue()) { // If nothing is demanded, provide poison.
UndefElts = EltMask;		UndefElts = EltMask;
return UndefValue::get(V->getType());		return PoisonValue::get(V->getType());
}		}

UndefElts = 0;		UndefElts = 0;

if (auto *C = dyn_cast<Constant>(V)) {		if (auto *C = dyn_cast<Constant>(V)) {
// Check if this is identity. If so, return 0 since we are not simplifying		// Check if this is identity. If so, return 0 since we are not simplifying
// anything.		// anything.
if (DemandedElts.isAllOnesValue())		if (DemandedElts.isAllOnesValue())
return nullptr;		return nullptr;

Type *EltTy = cast<VectorType>(V->getType())->getElementType();		Type *EltTy = cast<VectorType>(V->getType())->getElementType();
Constant *Undef = UndefValue::get(EltTy);		Constant *Poison = PoisonValue::get(EltTy);
SmallVector<Constant*, 16> Elts;		SmallVector<Constant*, 16> Elts;
for (unsigned i = 0; i != VWidth; ++i) {		for (unsigned i = 0; i != VWidth; ++i) {
if (!DemandedElts[i]) { // If not demanded, set to undef.		if (!DemandedElts[i]) { // If not demanded, set to poison.
		nikicUnsubmitted Not Done Reply Inline Actions undef -> poison nikic: undef -> poison
Elts.push_back(Undef);		Elts.push_back(Poison);
UndefElts.setBit(i);		UndefElts.setBit(i);
continue;		continue;
}		}

Constant *Elt = C->getAggregateElement(i);		Constant *Elt = C->getAggregateElement(i);
if (!Elt) return nullptr;		if (!Elt) return nullptr;

if (isa<UndefValue>(Elt)) { // Already undef.
Elts.push_back(Undef);
UndefElts.setBit(i);
} else { // Otherwise, defined.
Elts.push_back(Elt);		Elts.push_back(Elt);
}		if (isa<UndefValue>(Elt)) // Already undef or poison.
		UndefElts.setBit(i);
}		}

// If we changed the constant, return it.		// If we changed the constant, return it.
Constant *NewCV = ConstantVector::get(Elts);		Constant *NewCV = ConstantVector::get(Elts);
return NewCV != C ? NewCV : nullptr;		return NewCV != C ? NewCV : nullptr;
}		}

// Limit search depth.		// Limit search depth.
▲ Show 20 Lines • Show All 439 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/vector-alloca-bitcast.ll

	Show First 20 Lines • Show All 278 Lines • ▼ Show 20 Lines

	; TODO: llvm.assume can be ingored			; TODO: llvm.assume can be ingored

	; OPT-LABEL: @vector_read_alloca_bitcast_assume(			; OPT-LABEL: @vector_read_alloca_bitcast_assume(
	; OPT: %tmp = alloca <4 x i32>, align 16, addrspace(5)			; OPT: %tmp = alloca <4 x i32>, align 16, addrspace(5)
	; OPT-NEXT: %x = getelementptr inbounds <4 x i32>, <4 x i32> addrspace(5)* %tmp, i64 0, i64 0			; OPT-NEXT: %x = getelementptr inbounds <4 x i32>, <4 x i32> addrspace(5)* %tmp, i64 0, i64 0
	; OPT-NEXT: store i32 0, i32 addrspace(5)* %x, align 16			; OPT-NEXT: store i32 0, i32 addrspace(5)* %x, align 16
	; OPT-NEXT: %0 = load <4 x i32>, <4 x i32> addrspace(5)* %tmp, align 16			; OPT-NEXT: %0 = load <4 x i32>, <4 x i32> addrspace(5)* %tmp, align 16
	; OPT-NEXT: %1 = shufflevector <4 x i32> %0, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>			; OPT-NEXT: %1 = shufflevector <4 x i32> %0, <4 x i32> <i32 poison, i32 1, i32 2, i32 3>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>
	; OPT-NEXT: store <4 x i32> %1, <4 x i32> addrspace(5)* %tmp, align 16			; OPT-NEXT: store <4 x i32> %1, <4 x i32> addrspace(5)* %tmp, align 16
	; OPT-NEXT: %2 = extractelement <4 x i32> %1, i32 %index			; OPT-NEXT: %2 = extractelement <4 x i32> %1, i32 %index
	; OPT-NEXT: store i32 %2, i32 addrspace(1)* %out, align 4			; OPT-NEXT: store i32 %2, i32 addrspace(1)* %out, align 4

	; GCN-LABEL: {{^}}vector_read_alloca_bitcast_assume:			; GCN-LABEL: {{^}}vector_read_alloca_bitcast_assume:
	; GCN-COUNT-4: buffer_store_dword			; GCN-COUNT-4: buffer_store_dword

	define amdgpu_kernel void @vector_read_alloca_bitcast_assume(i32 addrspace(1)* %out, i32 %index) {			define amdgpu_kernel void @vector_read_alloca_bitcast_assume(i32 addrspace(1)* %out, i32 %index) {
	▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/AArch64/tbl1.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
	target triple = "aarch64-arm-none-eabi"			target triple = "aarch64-arm-none-eabi"

	; Turning a table lookup intrinsic into a shuffle vector instruction			; Turning a table lookup intrinsic into a shuffle vector instruction
	; can be beneficial. If the mask used for the lookup is the constant			; can be beneficial. If the mask used for the lookup is the constant
	; vector {7,6,5,4,3,2,1,0}, then the back-end generates rev64			; vector {7,6,5,4,3,2,1,0}, then the back-end generates rev64
	; instructions instead.			; instructions instead.

	define <8 x i8> @tbl1_8x8(<16 x i8> %vec) {			define <8 x i8> @tbl1_8x8(<16 x i8> %vec) {
	; CHECK-LABEL: @tbl1_8x8(			; CHECK-LABEL: @tbl1_8x8(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = shufflevector <16 x i8> [[VEC:%.]], <16 x i8> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>			; CHECK-NEXT: [[TMP0:%.]] = shufflevector <16 x i8> [[VEC:%.]], <16 x i8> poison, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
	; CHECK-NEXT: ret <8 x i8> [[TMP0]]			; CHECK-NEXT: ret <8 x i8> [[TMP0]]
	;			;
	entry:			entry:
	%tbl1 = call <8 x i8> @llvm.aarch64.neon.tbl1.v8i8(<16 x i8> %vec, <8 x i8> <i8 7, i8 6, i8 5, i8 4, i8 3, i8 2, i8 1, i8 0>)			%tbl1 = call <8 x i8> @llvm.aarch64.neon.tbl1.v8i8(<16 x i8> %vec, <8 x i8> <i8 7, i8 6, i8 5, i8 4, i8 3, i8 2, i8 1, i8 0>)
	ret <8 x i8> %tbl1			ret <8 x i8> %tbl1
	}			}

	; Bail the optimization if a mask index is out of range.			; Bail the optimization if a mask index is out of range.
	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/ARM/tbl1.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	target datalayout = "e-m:e-p:32:32-i64:64-v128:64:128-a:0:32-n32-S64"			target datalayout = "e-m:e-p:32:32-i64:64-v128:64:128-a:0:32-n32-S64"
	target triple = "armv8-arm-none-eabi"			target triple = "armv8-arm-none-eabi"

	; Turning a table lookup intrinsic into a shuffle vector instruction			; Turning a table lookup intrinsic into a shuffle vector instruction
	; can be beneficial. If the mask used for the lookup is the constant			; can be beneficial. If the mask used for the lookup is the constant
	; vector {7,6,5,4,3,2,1,0}, then the back-end generates rev64			; vector {7,6,5,4,3,2,1,0}, then the back-end generates rev64
	; instructions instead.			; instructions instead.

	define <8 x i8> @tbl1_8x8(<8 x i8> %vec) {			define <8 x i8> @tbl1_8x8(<8 x i8> %vec) {
	; CHECK-LABEL: @tbl1_8x8(			; CHECK-LABEL: @tbl1_8x8(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = shufflevector <8 x i8> [[VEC:%.]], <8 x i8> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>			; CHECK-NEXT: [[TMP0:%.]] = shufflevector <8 x i8> [[VEC:%.]], <8 x i8> poison, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
	; CHECK-NEXT: ret <8 x i8> [[TMP0]]			; CHECK-NEXT: ret <8 x i8> [[TMP0]]
	;			;
	entry:			entry:
	%vtbl1 = call <8 x i8> @llvm.arm.neon.vtbl1(<8 x i8> %vec, <8 x i8> <i8 7, i8 6, i8 5, i8 4, i8 3, i8 2, i8 1, i8 0>)			%vtbl1 = call <8 x i8> @llvm.arm.neon.vtbl1(<8 x i8> %vec, <8 x i8> <i8 7, i8 6, i8 5, i8 4, i8 3, i8 2, i8 1, i8 0>)
	ret <8 x i8> %vtbl1			ret <8 x i8> %vtbl1
	}			}

	; Bail the optimization if a mask index is out of range.			; Bail the optimization if a mask index is out of range.
	Show All 12 Lines

llvm/test/Transforms/InstCombine/X86/clmulqdq.ll

Show All 12 Lines	;
%1 = insertelement <2 x i64> %a0, i64 1, i64 1		%1 = insertelement <2 x i64> %a0, i64 1, i64 1
%2 = insertelement <2 x i64> %a1, i64 1, i64 1		%2 = insertelement <2 x i64> %a1, i64 1, i64 1
%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 0)		%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 0)
ret <2 x i64> %3		ret <2 x i64> %3
}		}

define <2 x i64> @test_demanded_elts_pclmulqdq_1(<2 x i64> %a0, <2 x i64> %a1) {		define <2 x i64> @test_demanded_elts_pclmulqdq_1(<2 x i64> %a0, <2 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_1(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_1(
; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> <i64 undef, i64 1>, <2 x i64> [[A1:%.]], i8 1)		; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> <i64 poison, i64 1>, <2 x i64> [[A1:%.]], i8 1)
; CHECK-NEXT: ret <2 x i64> [[TMP1]]		; CHECK-NEXT: ret <2 x i64> [[TMP1]]
;		;
%1 = insertelement <2 x i64> %a0, i64 1, i64 1		%1 = insertelement <2 x i64> %a0, i64 1, i64 1
%2 = insertelement <2 x i64> %a1, i64 1, i64 1		%2 = insertelement <2 x i64> %a1, i64 1, i64 1
%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 1)		%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 1)
ret <2 x i64> %3		ret <2 x i64> %3
}		}

define <2 x i64> @test_demanded_elts_pclmulqdq_16(<2 x i64> %a0, <2 x i64> %a1) {		define <2 x i64> @test_demanded_elts_pclmulqdq_16(<2 x i64> %a0, <2 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_16(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_16(
; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> [[A0:%.]], <2 x i64> <i64 undef, i64 1>, i8 16)		; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> [[A0:%.]], <2 x i64> <i64 poison, i64 1>, i8 16)
; CHECK-NEXT: ret <2 x i64> [[TMP1]]		; CHECK-NEXT: ret <2 x i64> [[TMP1]]
;		;
%1 = insertelement <2 x i64> %a0, i64 1, i64 1		%1 = insertelement <2 x i64> %a0, i64 1, i64 1
%2 = insertelement <2 x i64> %a1, i64 1, i64 1		%2 = insertelement <2 x i64> %a1, i64 1, i64 1
%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 16)		%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 16)
ret <2 x i64> %3		ret <2 x i64> %3
}		}

define <2 x i64> @test_demanded_elts_pclmulqdq_17(<2 x i64> %a0, <2 x i64> %a1) {		define <2 x i64> @test_demanded_elts_pclmulqdq_17(<2 x i64> %a0, <2 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_17(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_17(
; CHECK-NEXT: [[TMP1:%.*]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> <i64 undef, i64 1>, <2 x i64> <i64 undef, i64 1>, i8 17)		; CHECK-NEXT: [[TMP1:%.*]] = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> <i64 poison, i64 1>, <2 x i64> <i64 poison, i64 1>, i8 17)
; CHECK-NEXT: ret <2 x i64> [[TMP1]]		; CHECK-NEXT: ret <2 x i64> [[TMP1]]
;		;
%1 = insertelement <2 x i64> %a0, i64 1, i64 1		%1 = insertelement <2 x i64> %a0, i64 1, i64 1
%2 = insertelement <2 x i64> %a1, i64 1, i64 1		%2 = insertelement <2 x i64> %a1, i64 1, i64 1
%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 17)		%3 = call <2 x i64> @llvm.x86.pclmulqdq(<2 x i64> %1, <2 x i64> %2, i8 17)
ret <2 x i64> %3		ret <2 x i64> %3
}		}

Show All 39 Lines	;
%3 = insertelement <4 x i64> %1, i64 1, i64 3		%3 = insertelement <4 x i64> %1, i64 1, i64 3
%4 = insertelement <4 x i64> %2, i64 1, i64 3		%4 = insertelement <4 x i64> %2, i64 1, i64 3
%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 0)		%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 0)
ret <4 x i64> %res		ret <4 x i64> %res
}		}

define <4 x i64> @test_demanded_elts_pclmulqdq_256_1(<4 x i64> %a0, <4 x i64> %a1) {		define <4 x i64> @test_demanded_elts_pclmulqdq_256_1(<4 x i64> %a0, <4 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_1(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_1(
; CHECK-NEXT: [[RES:%.]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> <i64 undef, i64 1, i64 undef, i64 1>, <4 x i64> [[A1:%.]], i8 1)		; CHECK-NEXT: [[RES:%.]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> <i64 poison, i64 1, i64 poison, i64 1>, <4 x i64> [[A1:%.]], i8 1)
; CHECK-NEXT: ret <4 x i64> [[RES]]		; CHECK-NEXT: ret <4 x i64> [[RES]]
;		;
%1 = insertelement <4 x i64> %a0, i64 1, i64 1		%1 = insertelement <4 x i64> %a0, i64 1, i64 1
%2 = insertelement <4 x i64> %a1, i64 1, i64 1		%2 = insertelement <4 x i64> %a1, i64 1, i64 1
%3 = insertelement <4 x i64> %1, i64 1, i64 3		%3 = insertelement <4 x i64> %1, i64 1, i64 3
%4 = insertelement <4 x i64> %2, i64 1, i64 3		%4 = insertelement <4 x i64> %2, i64 1, i64 3
%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 1)		%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 1)
ret <4 x i64> %res		ret <4 x i64> %res
}		}

define <4 x i64> @test_demanded_elts_pclmulqdq_256_16(<4 x i64> %a0, <4 x i64> %a1) {		define <4 x i64> @test_demanded_elts_pclmulqdq_256_16(<4 x i64> %a0, <4 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_16(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_16(
; CHECK-NEXT: [[RES:%.]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> [[A0:%.]], <4 x i64> <i64 undef, i64 1, i64 undef, i64 1>, i8 16)		; CHECK-NEXT: [[RES:%.]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> [[A0:%.]], <4 x i64> <i64 poison, i64 1, i64 poison, i64 1>, i8 16)
; CHECK-NEXT: ret <4 x i64> [[RES]]		; CHECK-NEXT: ret <4 x i64> [[RES]]
;		;
%1 = insertelement <4 x i64> %a0, i64 1, i64 1		%1 = insertelement <4 x i64> %a0, i64 1, i64 1
%2 = insertelement <4 x i64> %a1, i64 1, i64 1		%2 = insertelement <4 x i64> %a1, i64 1, i64 1
%3 = insertelement <4 x i64> %1, i64 1, i64 3		%3 = insertelement <4 x i64> %1, i64 1, i64 3
%4 = insertelement <4 x i64> %2, i64 1, i64 3		%4 = insertelement <4 x i64> %2, i64 1, i64 3
%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 16)		%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 16)
ret <4 x i64> %res		ret <4 x i64> %res
}		}

define <4 x i64> @test_demanded_elts_pclmulqdq_256_17(<4 x i64> %a0, <4 x i64> %a1) {		define <4 x i64> @test_demanded_elts_pclmulqdq_256_17(<4 x i64> %a0, <4 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_17(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_256_17(
; CHECK-NEXT: [[RES:%.*]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> <i64 undef, i64 1, i64 undef, i64 1>, <4 x i64> <i64 undef, i64 1, i64 undef, i64 1>, i8 17)		; CHECK-NEXT: [[RES:%.*]] = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> <i64 poison, i64 1, i64 poison, i64 1>, <4 x i64> <i64 poison, i64 1, i64 poison, i64 1>, i8 17)
; CHECK-NEXT: ret <4 x i64> [[RES]]		; CHECK-NEXT: ret <4 x i64> [[RES]]
;		;
%1 = insertelement <4 x i64> %a0, i64 1, i64 1		%1 = insertelement <4 x i64> %a0, i64 1, i64 1
%2 = insertelement <4 x i64> %a1, i64 1, i64 1		%2 = insertelement <4 x i64> %a1, i64 1, i64 1
%3 = insertelement <4 x i64> %1, i64 1, i64 3		%3 = insertelement <4 x i64> %1, i64 1, i64 3
%4 = insertelement <4 x i64> %2, i64 1, i64 3		%4 = insertelement <4 x i64> %2, i64 1, i64 3
%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 17)		%res = call <4 x i64> @llvm.x86.pclmulqdq.256(<4 x i64> %3, <4 x i64> %4, i8 17)
ret <4 x i64> %res		ret <4 x i64> %res
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	;
%7 = insertelement <8 x i64> %5, i64 1, i64 7		%7 = insertelement <8 x i64> %5, i64 1, i64 7
%8 = insertelement <8 x i64> %6, i64 1, i64 7		%8 = insertelement <8 x i64> %6, i64 1, i64 7
%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 0)		%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 0)
ret <8 x i64> %res		ret <8 x i64> %res
}		}

define <8 x i64> @test_demanded_elts_pclmulqdq_512_1(<8 x i64> %a0, <8 x i64> %a1) {		define <8 x i64> @test_demanded_elts_pclmulqdq_512_1(<8 x i64> %a0, <8 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_1(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_1(
; CHECK-NEXT: [[RES:%.]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> <i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1>, <8 x i64> [[A1:%.]], i8 1)		; CHECK-NEXT: [[RES:%.]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> <i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1>, <8 x i64> [[A1:%.]], i8 1)
; CHECK-NEXT: ret <8 x i64> [[RES]]		; CHECK-NEXT: ret <8 x i64> [[RES]]
;		;
%1 = insertelement <8 x i64> %a0, i64 1, i64 1		%1 = insertelement <8 x i64> %a0, i64 1, i64 1
%2 = insertelement <8 x i64> %a1, i64 1, i64 1		%2 = insertelement <8 x i64> %a1, i64 1, i64 1
%3 = insertelement <8 x i64> %1, i64 1, i64 3		%3 = insertelement <8 x i64> %1, i64 1, i64 3
%4 = insertelement <8 x i64> %2, i64 1, i64 3		%4 = insertelement <8 x i64> %2, i64 1, i64 3
%5 = insertelement <8 x i64> %3, i64 1, i64 5		%5 = insertelement <8 x i64> %3, i64 1, i64 5
%6 = insertelement <8 x i64> %4, i64 1, i64 5		%6 = insertelement <8 x i64> %4, i64 1, i64 5
%7 = insertelement <8 x i64> %5, i64 1, i64 7		%7 = insertelement <8 x i64> %5, i64 1, i64 7
%8 = insertelement <8 x i64> %6, i64 1, i64 7		%8 = insertelement <8 x i64> %6, i64 1, i64 7
%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 1)		%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 1)
ret <8 x i64> %res		ret <8 x i64> %res
}		}

define <8 x i64> @test_demanded_elts_pclmulqdq_512_16(<8 x i64> %a0, <8 x i64> %a1) {		define <8 x i64> @test_demanded_elts_pclmulqdq_512_16(<8 x i64> %a0, <8 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_16(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_16(
; CHECK-NEXT: [[RES:%.]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> [[A0:%.]], <8 x i64> <i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1>, i8 16)		; CHECK-NEXT: [[RES:%.]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> [[A0:%.]], <8 x i64> <i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1>, i8 16)
; CHECK-NEXT: ret <8 x i64> [[RES]]		; CHECK-NEXT: ret <8 x i64> [[RES]]
;		;
%1 = insertelement <8 x i64> %a0, i64 1, i64 1		%1 = insertelement <8 x i64> %a0, i64 1, i64 1
%2 = insertelement <8 x i64> %a1, i64 1, i64 1		%2 = insertelement <8 x i64> %a1, i64 1, i64 1
%3 = insertelement <8 x i64> %1, i64 1, i64 3		%3 = insertelement <8 x i64> %1, i64 1, i64 3
%4 = insertelement <8 x i64> %2, i64 1, i64 3		%4 = insertelement <8 x i64> %2, i64 1, i64 3
%5 = insertelement <8 x i64> %3, i64 1, i64 5		%5 = insertelement <8 x i64> %3, i64 1, i64 5
%6 = insertelement <8 x i64> %4, i64 1, i64 5		%6 = insertelement <8 x i64> %4, i64 1, i64 5
%7 = insertelement <8 x i64> %5, i64 1, i64 7		%7 = insertelement <8 x i64> %5, i64 1, i64 7
%8 = insertelement <8 x i64> %6, i64 1, i64 7		%8 = insertelement <8 x i64> %6, i64 1, i64 7
%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 16)		%res = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> %7, <8 x i64> %8, i8 16)
ret <8 x i64> %res		ret <8 x i64> %res
}		}

define <8 x i64> @test_demanded_elts_pclmulqdq_512_17(<8 x i64> %a0, <8 x i64> %a1) {		define <8 x i64> @test_demanded_elts_pclmulqdq_512_17(<8 x i64> %a0, <8 x i64> %a1) {
; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_17(		; CHECK-LABEL: @test_demanded_elts_pclmulqdq_512_17(
; CHECK-NEXT: [[RES:%.*]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> <i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1>, <8 x i64> <i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1, i64 undef, i64 1>, i8 17)		; CHECK-NEXT: [[RES:%.*]] = call <8 x i64> @llvm.x86.pclmulqdq.512(<8 x i64> <i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1>, <8 x i64> <i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1, i64 poison, i64 1>, i8 17)
; CHECK-NEXT: ret <8 x i64> [[RES]]		; CHECK-NEXT: ret <8 x i64> [[RES]]
;		;
%1 = insertelement <8 x i64> %a0, i64 1, i64 1		%1 = insertelement <8 x i64> %a0, i64 1, i64 1
%2 = insertelement <8 x i64> %a1, i64 1, i64 1		%2 = insertelement <8 x i64> %a1, i64 1, i64 1
%3 = insertelement <8 x i64> %1, i64 1, i64 3		%3 = insertelement <8 x i64> %1, i64 1, i64 3
%4 = insertelement <8 x i64> %2, i64 1, i64 3		%4 = insertelement <8 x i64> %2, i64 1, i64 3
%5 = insertelement <8 x i64> %3, i64 1, i64 5		%5 = insertelement <8 x i64> %3, i64 1, i64 5
%6 = insertelement <8 x i64> %4, i64 1, i64 5		%6 = insertelement <8 x i64> %4, i64 1, i64 5
Show All 37 Lines

llvm/test/Transforms/InstCombine/X86/x86-addsub-inseltpoison.ll

Show First 20 Lines • Show All 161 Lines • ▼ Show 20 Lines	;
store float %12, float* %5, align 4		store float %12, float* %5, align 4
%14 = extractelement <4 x float> %11, i32 1		%14 = extractelement <4 x float> %11, i32 1
store float %14, float* %13, align 4		store float %14, float* %13, align 4
ret void		ret void
}		}

define double @PR48476_fsub(<2 x double> %x) {		define double @PR48476_fsub(<2 x double> %x) {
; CHECK-LABEL: @PR48476_fsub(		; CHECK-LABEL: @PR48476_fsub(
; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x double> <double 0.000000e+00, double undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x double> <double 0.000000e+00, double poison>, [[X:%.]]
; CHECK-NEXT: [[T2:%.*]] = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[X]], i8 6)		; CHECK-NEXT: [[T2:%.*]] = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[X]], i8 6)
; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[T2]], i32 0		; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[T2]], i32 0
; CHECK-NEXT: ret double [[VECEXT]]		; CHECK-NEXT: ret double [[VECEXT]]
;		;
%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)		%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)
%t2 = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %t1, <2 x double> %x, i8 6)		%t2 = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %t1, <2 x double> %x, i8 6)
%vecext = extractelement <2 x double> %t2, i32 0		%vecext = extractelement <2 x double> %t2, i32 0
ret double %vecext		ret double %vecext
}		}

define double @PR48476_fadd_fsub(<2 x double> %x) {		define double @PR48476_fadd_fsub(<2 x double> %x) {
; CHECK-LABEL: @PR48476_fadd_fsub(		; CHECK-LABEL: @PR48476_fadd_fsub(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x double> [[X:%.]], <double undef, double 0.000000e+00>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x double> [[X:%.]], <double poison, double 0.000000e+00>
; CHECK-NEXT: [[S:%.*]] = shufflevector <2 x double> [[TMP1]], <2 x double> undef, <2 x i32> <i32 1, i32 undef>		; CHECK-NEXT: [[S:%.*]] = shufflevector <2 x double> [[TMP1]], <2 x double> undef, <2 x i32> <i32 1, i32 undef>
; CHECK-NEXT: [[TMP2:%.*]] = fsub <2 x double> [[S]], [[X]]		; CHECK-NEXT: [[TMP2:%.*]] = fsub <2 x double> [[S]], [[X]]
; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[VECEXT]]		; CHECK-NEXT: ret double [[VECEXT]]
;		;
%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)		%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)
%s = shufflevector <2 x double> %t1, <2 x double> undef, <2 x i32> <i32 1, i32 0>		%s = shufflevector <2 x double> %t1, <2 x double> undef, <2 x i32> <i32 1, i32 0>
%t2 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> %s, <2 x double> %x)		%t2 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> %s, <2 x double> %x)
%vecext = extractelement <2 x double> %t2, i32 0		%vecext = extractelement <2 x double> %t2, i32 0
ret double %vecext		ret double %vecext
}		}

llvm/test/Transforms/InstCombine/X86/x86-addsub.ll

Show First 20 Lines • Show All 161 Lines • ▼ Show 20 Lines	;
store float %12, float* %5, align 4		store float %12, float* %5, align 4
%14 = extractelement <4 x float> %11, i32 1		%14 = extractelement <4 x float> %11, i32 1
store float %14, float* %13, align 4		store float %14, float* %13, align 4
ret void		ret void
}		}

define double @PR48476_fsub(<2 x double> %x) {		define double @PR48476_fsub(<2 x double> %x) {
; CHECK-LABEL: @PR48476_fsub(		; CHECK-LABEL: @PR48476_fsub(
; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x double> <double 0.000000e+00, double undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x double> <double 0.000000e+00, double poison>, [[X:%.]]
; CHECK-NEXT: [[T2:%.*]] = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[X]], i8 6)		; CHECK-NEXT: [[T2:%.*]] = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[X]], i8 6)
; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[T2]], i32 0		; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[T2]], i32 0
; CHECK-NEXT: ret double [[VECEXT]]		; CHECK-NEXT: ret double [[VECEXT]]
;		;
%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)		%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)
%t2 = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %t1, <2 x double> %x, i8 6)		%t2 = call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %t1, <2 x double> %x, i8 6)
%vecext = extractelement <2 x double> %t2, i32 0		%vecext = extractelement <2 x double> %t2, i32 0
ret double %vecext		ret double %vecext
}		}

define double @PR48476_fadd_fsub(<2 x double> %x) {		define double @PR48476_fadd_fsub(<2 x double> %x) {
; CHECK-LABEL: @PR48476_fadd_fsub(		; CHECK-LABEL: @PR48476_fadd_fsub(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x double> [[X:%.]], <double undef, double 0.000000e+00>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x double> [[X:%.]], <double poison, double 0.000000e+00>
; CHECK-NEXT: [[S:%.*]] = shufflevector <2 x double> [[TMP1]], <2 x double> undef, <2 x i32> <i32 1, i32 undef>		; CHECK-NEXT: [[S:%.*]] = shufflevector <2 x double> [[TMP1]], <2 x double> undef, <2 x i32> <i32 1, i32 undef>
; CHECK-NEXT: [[TMP2:%.*]] = fsub <2 x double> [[S]], [[X]]		; CHECK-NEXT: [[TMP2:%.*]] = fsub <2 x double> [[S]], [[X]]
; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[VECEXT:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[VECEXT]]		; CHECK-NEXT: ret double [[VECEXT]]
;		;
%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)		%t1 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> zeroinitializer, <2 x double> %x)
%s = shufflevector <2 x double> %t1, <2 x double> undef, <2 x i32> <i32 1, i32 0>		%s = shufflevector <2 x double> %t1, <2 x double> undef, <2 x i32> <i32 1, i32 0>
%t2 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> %s, <2 x double> %x)		%t2 = call <2 x double> @llvm.x86.sse3.addsub.pd(<2 x double> %s, <2 x double> %x)
%vecext = extractelement <2 x double> %t2, i32 0		%vecext = extractelement <2 x double> %t2, i32 0
ret double %vecext		ret double %vecext
}		}

llvm/test/Transforms/InstCombine/X86/x86-avx512-inseltpoison.ll

Show First 20 Lines • Show All 752 Lines • ▼ Show 20 Lines	;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1
%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%3 = tail call i8 @llvm.x86.avx512.mask.cmp.sd(<2 x double> %1, <2 x double> %2, i32 3, i8 %mask, i32 4)		%3 = tail call i8 @llvm.x86.avx512.mask.cmp.sd(<2 x double> %1, <2 x double> %2, i32 3, i8 %mask, i32 4)
ret i8 %3		ret i8 %3
}		}

define i64 @test(float %f, double %d) {		define i64 @test(float %f, double %d) {
; CHECK-LABEL: @test(		; CHECK-LABEL: @test(
; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> undef, float [[F:%.]], i32 0		; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> poison, float [[F:%.]], i32 0
; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2si32(<4 x float> [[V03]], i32 4)		; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2si32(<4 x float> [[V03]], i32 4)
; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2si64(<4 x float> [[V13]], i32 4)		; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2si64(<4 x float> [[V13]], i32 4)
; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2si(<4 x float> [[V23]], i32 4)		; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2si(<4 x float> [[V23]], i32 4)
; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2si64(<4 x float> [[V33]], i32 4)		; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2si64(<4 x float> [[V33]], i32 4)
; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> undef, double [[D:%.]], i32 0		; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> poison, double [[D:%.]], i32 0
; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double> [[V41]], i32 4)		; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double> [[V41]], i32 4)
; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double> [[V51]], i32 4)		; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double> [[V51]], i32 4)
; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2si(<2 x double> [[V61]], i32 4)		; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2si(<2 x double> [[V61]], i32 4)
; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2si64(<2 x double> [[V71]], i32 4)		; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2si64(<2 x double> [[V71]], i32 4)
; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]		; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]
; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]		; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]
; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]		; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]
; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64		; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64
; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]		; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]
; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]		; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]
; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]		; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
declare i64 @llvm.x86.avx512.cvttss2si64(<4 x float>, i32)		declare i64 @llvm.x86.avx512.cvttss2si64(<4 x float>, i32)
declare i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double>, i32)		declare i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double>, i32)
declare i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double>, i32)		declare i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double>, i32)
declare i32 @llvm.x86.avx512.cvttsd2si(<2 x double>, i32)		declare i32 @llvm.x86.avx512.cvttsd2si(<2 x double>, i32)
declare i64 @llvm.x86.avx512.cvttsd2si64(<2 x double>, i32)		declare i64 @llvm.x86.avx512.cvttsd2si64(<2 x double>, i32)

define i64 @test2(float %f, double %d) {		define i64 @test2(float %f, double %d) {
; CHECK-LABEL: @test2(		; CHECK-LABEL: @test2(
; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> undef, float [[F:%.]], i32 0		; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> poison, float [[F:%.]], i32 0
; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2usi32(<4 x float> [[V03]], i32 4)		; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2usi32(<4 x float> [[V03]], i32 4)
; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2usi64(<4 x float> [[V13]], i32 4)		; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2usi64(<4 x float> [[V13]], i32 4)
; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2usi(<4 x float> [[V23]], i32 4)		; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2usi(<4 x float> [[V23]], i32 4)
; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2usi64(<4 x float> [[V33]], i32 4)		; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2usi64(<4 x float> [[V33]], i32 4)
; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> undef, double [[D:%.]], i32 0		; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> poison, double [[D:%.]], i32 0
; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2usi32(<2 x double> [[V41]], i32 4)		; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2usi32(<2 x double> [[V41]], i32 4)
; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2usi64(<2 x double> [[V51]], i32 4)		; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2usi64(<2 x double> [[V51]], i32 4)
; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2usi(<2 x double> [[V61]], i32 4)		; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2usi(<2 x double> [[V61]], i32 4)
; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2usi64(<2 x double> [[V71]], i32 4)		; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2usi64(<2 x double> [[V71]], i32 4)
; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]		; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]
; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]		; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]
; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]		; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]
; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64		; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64
; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]		; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]
; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]		; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]
; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]		; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]
▲ Show 20 Lines • Show All 2,503 Lines • ▼ Show 20 Lines	;
%3 = select <8 x i1> %2, <8 x double> %1, <8 x double> %c		%3 = select <8 x i1> %2, <8 x double> %1, <8 x double> %c
ret <8 x double> %3		ret <8 x double> %3
}		}

declare i32 @llvm.x86.avx512.vcomi.ss(<4 x float>, <4 x float>, i32, i32)		declare i32 @llvm.x86.avx512.vcomi.ss(<4 x float>, <4 x float>, i32, i32)

define i32 @test_comi_ss_0(float %a, float %b) {		define i32 @test_comi_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comi_ss_0(		; CHECK-LABEL: @test_comi_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i32 0, i32 4)		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i32 0, i32 4)
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> %4, <4 x float> %8, i32 0, i32 4)		%9 = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> %4, <4 x float> %8, i32 0, i32 4)
ret i32 %9		ret i32 %9
}		}

declare i32 @llvm.x86.avx512.vcomi.sd(<2 x double>, <2 x double>, i32, i32)		declare i32 @llvm.x86.avx512.vcomi.sd(<2 x double>, <2 x double>, i32, i32)

define i32 @test_comi_sd_0(double %a, double %b) {		define i32 @test_comi_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comi_sd_0(		; CHECK-LABEL: @test_comi_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i32 0, i32 4)		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i32 0, i32 4)
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> %2, <2 x double> %4, i32 0, i32 4)		%5 = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> %2, <2 x double> %4, i32 0, i32 4)
ret i32 %5		ret i32 %5
}		}

llvm/test/Transforms/InstCombine/X86/x86-avx512.ll

Show First 20 Lines • Show All 752 Lines • ▼ Show 20 Lines	;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1
%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%3 = tail call i8 @llvm.x86.avx512.mask.cmp.sd(<2 x double> %1, <2 x double> %2, i32 3, i8 %mask, i32 4)		%3 = tail call i8 @llvm.x86.avx512.mask.cmp.sd(<2 x double> %1, <2 x double> %2, i32 3, i8 %mask, i32 4)
ret i8 %3		ret i8 %3
}		}

define i64 @test(float %f, double %d) {		define i64 @test(float %f, double %d) {
; CHECK-LABEL: @test(		; CHECK-LABEL: @test(
; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> undef, float [[F:%.]], i32 0		; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> poison, float [[F:%.]], i32 0
; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2si32(<4 x float> [[V03]], i32 4)		; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2si32(<4 x float> [[V03]], i32 4)
; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2si64(<4 x float> [[V13]], i32 4)		; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2si64(<4 x float> [[V13]], i32 4)
; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2si(<4 x float> [[V23]], i32 4)		; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2si(<4 x float> [[V23]], i32 4)
; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2si64(<4 x float> [[V33]], i32 4)		; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2si64(<4 x float> [[V33]], i32 4)
; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> undef, double [[D:%.]], i32 0		; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> poison, double [[D:%.]], i32 0
; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double> [[V41]], i32 4)		; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double> [[V41]], i32 4)
; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double> [[V51]], i32 4)		; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double> [[V51]], i32 4)
; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2si(<2 x double> [[V61]], i32 4)		; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2si(<2 x double> [[V61]], i32 4)
; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2si64(<2 x double> [[V71]], i32 4)		; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2si64(<2 x double> [[V71]], i32 4)
; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]		; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]
; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]		; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]
; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]		; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]
; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64		; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64
; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]		; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]
; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]		; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]
; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]		; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
declare i64 @llvm.x86.avx512.cvttss2si64(<4 x float>, i32)		declare i64 @llvm.x86.avx512.cvttss2si64(<4 x float>, i32)
declare i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double>, i32)		declare i32 @llvm.x86.avx512.vcvtsd2si32(<2 x double>, i32)
declare i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double>, i32)		declare i64 @llvm.x86.avx512.vcvtsd2si64(<2 x double>, i32)
declare i32 @llvm.x86.avx512.cvttsd2si(<2 x double>, i32)		declare i32 @llvm.x86.avx512.cvttsd2si(<2 x double>, i32)
declare i64 @llvm.x86.avx512.cvttsd2si64(<2 x double>, i32)		declare i64 @llvm.x86.avx512.cvttsd2si64(<2 x double>, i32)

define i64 @test2(float %f, double %d) {		define i64 @test2(float %f, double %d) {
; CHECK-LABEL: @test2(		; CHECK-LABEL: @test2(
; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> undef, float [[F:%.]], i32 0		; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> poison, float [[F:%.]], i32 0
; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2usi32(<4 x float> [[V03]], i32 4)		; CHECK-NEXT: [[T0:%.*]] = tail call i32 @llvm.x86.avx512.vcvtss2usi32(<4 x float> [[V03]], i32 4)
; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2usi64(<4 x float> [[V13]], i32 4)		; CHECK-NEXT: [[T1:%.*]] = tail call i64 @llvm.x86.avx512.vcvtss2usi64(<4 x float> [[V13]], i32 4)
; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2usi(<4 x float> [[V23]], i32 4)		; CHECK-NEXT: [[T2:%.*]] = tail call i32 @llvm.x86.avx512.cvttss2usi(<4 x float> [[V23]], i32 4)
; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> undef, float [[F]], i32 0		; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2usi64(<4 x float> [[V33]], i32 4)		; CHECK-NEXT: [[T3:%.*]] = tail call i64 @llvm.x86.avx512.cvttss2usi64(<4 x float> [[V33]], i32 4)
; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> undef, double [[D:%.]], i32 0		; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> poison, double [[D:%.]], i32 0
; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2usi32(<2 x double> [[V41]], i32 4)		; CHECK-NEXT: [[T4:%.*]] = tail call i32 @llvm.x86.avx512.vcvtsd2usi32(<2 x double> [[V41]], i32 4)
; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2usi64(<2 x double> [[V51]], i32 4)		; CHECK-NEXT: [[T5:%.*]] = tail call i64 @llvm.x86.avx512.vcvtsd2usi64(<2 x double> [[V51]], i32 4)
; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2usi(<2 x double> [[V61]], i32 4)		; CHECK-NEXT: [[T6:%.*]] = tail call i32 @llvm.x86.avx512.cvttsd2usi(<2 x double> [[V61]], i32 4)
; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> undef, double [[D]], i32 0		; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2usi64(<2 x double> [[V71]], i32 4)		; CHECK-NEXT: [[T7:%.*]] = tail call i64 @llvm.x86.avx512.cvttsd2usi64(<2 x double> [[V71]], i32 4)
; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]		; CHECK-NEXT: [[T8:%.*]] = add i32 [[T0]], [[T2]]
; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]		; CHECK-NEXT: [[T9:%.*]] = add i32 [[T4]], [[T6]]
; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]		; CHECK-NEXT: [[T10:%.*]] = add i32 [[T8]], [[T9]]
; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64		; CHECK-NEXT: [[T11:%.*]] = sext i32 [[T10]] to i64
; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]		; CHECK-NEXT: [[T12:%.*]] = add i64 [[T1]], [[T3]]
; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]		; CHECK-NEXT: [[T13:%.*]] = add i64 [[T5]], [[T7]]
; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]		; CHECK-NEXT: [[T14:%.*]] = add i64 [[T12]], [[T13]]
▲ Show 20 Lines • Show All 2,503 Lines • ▼ Show 20 Lines	;
%3 = select <8 x i1> %2, <8 x double> %1, <8 x double> %c		%3 = select <8 x i1> %2, <8 x double> %1, <8 x double> %c
ret <8 x double> %3		ret <8 x double> %3
}		}

declare i32 @llvm.x86.avx512.vcomi.ss(<4 x float>, <4 x float>, i32, i32)		declare i32 @llvm.x86.avx512.vcomi.ss(<4 x float>, <4 x float>, i32, i32)

define i32 @test_comi_ss_0(float %a, float %b) {		define i32 @test_comi_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comi_ss_0(		; CHECK-LABEL: @test_comi_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i32 0, i32 4)		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i32 0, i32 4)
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> %4, <4 x float> %8, i32 0, i32 4)		%9 = tail call i32 @llvm.x86.avx512.vcomi.ss(<4 x float> %4, <4 x float> %8, i32 0, i32 4)
ret i32 %9		ret i32 %9
}		}

declare i32 @llvm.x86.avx512.vcomi.sd(<2 x double>, <2 x double>, i32, i32)		declare i32 @llvm.x86.avx512.vcomi.sd(<2 x double>, <2 x double>, i32, i32)

define i32 @test_comi_sd_0(double %a, double %b) {		define i32 @test_comi_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comi_sd_0(		; CHECK-LABEL: @test_comi_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i32 0, i32 4)		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i32 0, i32 4)
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> %2, <2 x double> %4, i32 0, i32 4)		%5 = tail call i32 @llvm.x86.avx512.vcomi.sd(<2 x double> %2, <2 x double> %4, i32 0, i32 4)
ret i32 %5		ret i32 %5
}		}

llvm/test/Transforms/InstCombine/X86/x86-insertps.ll

Show All 30 Lines	;
%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v2, i8 12)		%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v2, i8 12)
ret <4 x float> %res		ret <4 x float> %res
}		}

; ...unless both input vectors are the same operand.		; ...unless both input vectors are the same operand.

define <4 x float> @insertps_0x15_single_input(<4 x float> %v1) {		define <4 x float> @insertps_0x15_single_input(<4 x float> %v1) {
; CHECK-LABEL: @insertps_0x15_single_input(		; CHECK-LABEL: @insertps_0x15_single_input(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x float> [[V1:%.]], <4 x float> <float 0.000000e+00, float undef, float 0.000000e+00, float undef>, <4 x i32> <i32 4, i32 0, i32 6, i32 3>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x float> [[V1:%.]], <4 x float> <float 0.000000e+00, float poison, float 0.000000e+00, float poison>, <4 x i32> <i32 4, i32 0, i32 6, i32 3>
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v1, i8 21)		%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v1, i8 21)
ret <4 x float> %res		ret <4 x float> %res
}		}

; The zero mask overrides the insertion lane.		; The zero mask overrides the insertion lane.

define <4 x float> @insertps_0x1a_single_input(<4 x float> %v1) {		define <4 x float> @insertps_0x1a_single_input(<4 x float> %v1) {
; CHECK-LABEL: @insertps_0x1a_single_input(		; CHECK-LABEL: @insertps_0x1a_single_input(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x float> [[V1:%.]], <4 x float> <float undef, float 0.000000e+00, float undef, float 0.000000e+00>, <4 x i32> <i32 0, i32 5, i32 2, i32 7>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x float> [[V1:%.]], <4 x float> <float poison, float 0.000000e+00, float poison, float 0.000000e+00>, <4 x i32> <i32 0, i32 5, i32 2, i32 7>
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v1, i8 26)		%res = call <4 x float> @llvm.x86.sse41.insertps(<4 x float> %v1, <4 x float> %v1, i8 26)
ret <4 x float> %res		ret <4 x float> %res
}		}

; The zero mask overrides the insertion lane, so the second input vector is not used.		; The zero mask overrides the insertion lane, so the second input vector is not used.

▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-masked-memops.ll

Show First 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	;
ret <4 x float> %ld		ret <4 x float> %ld
}		}

; It's a constant mask, so convert to an LLVM intrinsic. The backend should optimize further.		; It's a constant mask, so convert to an LLVM intrinsic. The backend should optimize further.

define <4 x float> @mload_one_one(i8* %f) {		define <4 x float> @mload_one_one(i8* %f) {
; CHECK-LABEL: @mload_one_one(		; CHECK-LABEL: @mload_one_one(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x float>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x float>
; CHECK-NEXT: [[TMP1:%.]] = call <4 x float> @llvm.masked.load.v4f32.p0v4f32(<4 x float> [[CASTVEC]], i32 1, <4 x i1> <i1 false, i1 false, i1 false, i1 true>, <4 x float> <float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float undef>)		; CHECK-NEXT: [[TMP1:%.]] = call <4 x float> @llvm.masked.load.v4f32.p0v4f32(<4 x float> [[CASTVEC]], i32 1, <4 x i1> <i1 false, i1 false, i1 false, i1 true>, <4 x float> <float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float poison>)
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%ld = tail call <4 x float> @llvm.x86.avx.maskload.ps(i8* %f, <4 x i32> <i32 0, i32 0, i32 0, i32 -1>)		%ld = tail call <4 x float> @llvm.x86.avx.maskload.ps(i8* %f, <4 x i32> <i32 0, i32 0, i32 0, i32 -1>)
ret <4 x float> %ld		ret <4 x float> %ld
}		}

; Try doubles.		; Try doubles.

define <2 x double> @mload_one_one_double(i8* %f) {		define <2 x double> @mload_one_one_double(i8* %f) {
; CHECK-LABEL: @mload_one_one_double(		; CHECK-LABEL: @mload_one_one_double(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <2 x double>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <2 x double>
; CHECK-NEXT: [[TMP1:%.]] = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double> [[CASTVEC]], i32 1, <2 x i1> <i1 true, i1 false>, <2 x double> <double undef, double 0.000000e+00>)		; CHECK-NEXT: [[TMP1:%.]] = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double> [[CASTVEC]], i32 1, <2 x i1> <i1 true, i1 false>, <2 x double> <double poison, double 0.000000e+00>)
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%ld = tail call <2 x double> @llvm.x86.avx.maskload.pd(i8* %f, <2 x i64> <i64 -1, i64 0>)		%ld = tail call <2 x double> @llvm.x86.avx.maskload.pd(i8* %f, <2 x i64> <i64 -1, i64 0>)
ret <2 x double> %ld		ret <2 x double> %ld
}		}

; Try 256-bit FP ops.		; Try 256-bit FP ops.

define <8 x float> @mload_v8f32(i8* %f) {		define <8 x float> @mload_v8f32(i8* %f) {
; CHECK-LABEL: @mload_v8f32(		; CHECK-LABEL: @mload_v8f32(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <8 x float>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <8 x float>
; CHECK-NEXT: [[TMP1:%.]] = call <8 x float> @llvm.masked.load.v8f32.p0v8f32(<8 x float> [[CASTVEC]], i32 1, <8 x i1> <i1 false, i1 false, i1 false, i1 true, i1 false, i1 false, i1 false, i1 false>, <8 x float> <float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>)		; CHECK-NEXT: [[TMP1:%.]] = call <8 x float> @llvm.masked.load.v8f32.p0v8f32(<8 x float> [[CASTVEC]], i32 1, <8 x i1> <i1 false, i1 false, i1 false, i1 true, i1 false, i1 false, i1 false, i1 false>, <8 x float> <float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>)
; CHECK-NEXT: ret <8 x float> [[TMP1]]		; CHECK-NEXT: ret <8 x float> [[TMP1]]
;		;
%ld = tail call <8 x float> @llvm.x86.avx.maskload.ps.256(i8* %f, <8 x i32> <i32 0, i32 0, i32 0, i32 -1, i32 0, i32 0, i32 0, i32 0>)		%ld = tail call <8 x float> @llvm.x86.avx.maskload.ps.256(i8* %f, <8 x i32> <i32 0, i32 0, i32 0, i32 -1, i32 0, i32 0, i32 0, i32 0>)
ret <8 x float> %ld		ret <8 x float> %ld
}		}

define <8 x float> @mload_v8f32_cmp(i8* %f, <8 x float> %src0, <8 x float> %src1) {		define <8 x float> @mload_v8f32_cmp(i8* %f, <8 x float> %src0, <8 x float> %src1) {
; CHECK-LABEL: @mload_v8f32_cmp(		; CHECK-LABEL: @mload_v8f32_cmp(
Show All 11 Lines	;
%mask = and <8 x i32> %ext0, %ext1		%mask = and <8 x i32> %ext0, %ext1
%ld = tail call <8 x float> @llvm.x86.avx.maskload.ps.256(i8* %f, <8 x i32> %mask)		%ld = tail call <8 x float> @llvm.x86.avx.maskload.ps.256(i8* %f, <8 x i32> %mask)
ret <8 x float> %ld		ret <8 x float> %ld
}		}

define <4 x double> @mload_v4f64(i8* %f) {		define <4 x double> @mload_v4f64(i8* %f) {
; CHECK-LABEL: @mload_v4f64(		; CHECK-LABEL: @mload_v4f64(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x double>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x double>
; CHECK-NEXT: [[TMP1:%.]] = call <4 x double> @llvm.masked.load.v4f64.p0v4f64(<4 x double> [[CASTVEC]], i32 1, <4 x i1> <i1 true, i1 false, i1 false, i1 false>, <4 x double> <double undef, double 0.000000e+00, double 0.000000e+00, double 0.000000e+00>)		; CHECK-NEXT: [[TMP1:%.]] = call <4 x double> @llvm.masked.load.v4f64.p0v4f64(<4 x double> [[CASTVEC]], i32 1, <4 x i1> <i1 true, i1 false, i1 false, i1 false>, <4 x double> <double poison, double 0.000000e+00, double 0.000000e+00, double 0.000000e+00>)
; CHECK-NEXT: ret <4 x double> [[TMP1]]		; CHECK-NEXT: ret <4 x double> [[TMP1]]
;		;
%ld = tail call <4 x double> @llvm.x86.avx.maskload.pd.256(i8* %f, <4 x i64> <i64 -1, i64 0, i64 0, i64 0>)		%ld = tail call <4 x double> @llvm.x86.avx.maskload.pd.256(i8* %f, <4 x i64> <i64 -1, i64 0, i64 0, i64 0>)
ret <4 x double> %ld		ret <4 x double> %ld
}		}

; Try the AVX2 variants.		; Try the AVX2 variants.

define <4 x i32> @mload_v4i32(i8* %f) {		define <4 x i32> @mload_v4i32(i8* %f) {
; CHECK-LABEL: @mload_v4i32(		; CHECK-LABEL: @mload_v4i32(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x i32>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x i32>
; CHECK-NEXT: [[TMP1:%.]] = call <4 x i32> @llvm.masked.load.v4i32.p0v4i32(<4 x i32> [[CASTVEC]], i32 1, <4 x i1> <i1 false, i1 false, i1 false, i1 true>, <4 x i32> <i32 0, i32 0, i32 0, i32 undef>)		; CHECK-NEXT: [[TMP1:%.]] = call <4 x i32> @llvm.masked.load.v4i32.p0v4i32(<4 x i32> [[CASTVEC]], i32 1, <4 x i1> <i1 false, i1 false, i1 false, i1 true>, <4 x i32> <i32 0, i32 0, i32 0, i32 poison>)
; CHECK-NEXT: ret <4 x i32> [[TMP1]]		; CHECK-NEXT: ret <4 x i32> [[TMP1]]
;		;
%ld = tail call <4 x i32> @llvm.x86.avx2.maskload.d(i8* %f, <4 x i32> <i32 0, i32 0, i32 0, i32 -1>)		%ld = tail call <4 x i32> @llvm.x86.avx2.maskload.d(i8* %f, <4 x i32> <i32 0, i32 0, i32 0, i32 -1>)
ret <4 x i32> %ld		ret <4 x i32> %ld
}		}

define <2 x i64> @mload_v2i64(i8* %f) {		define <2 x i64> @mload_v2i64(i8* %f) {
; CHECK-LABEL: @mload_v2i64(		; CHECK-LABEL: @mload_v2i64(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <2 x i64>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <2 x i64>
; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.masked.load.v2i64.p0v2i64(<2 x i64> [[CASTVEC]], i32 1, <2 x i1> <i1 true, i1 false>, <2 x i64> <i64 undef, i64 0>)		; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.masked.load.v2i64.p0v2i64(<2 x i64> [[CASTVEC]], i32 1, <2 x i1> <i1 true, i1 false>, <2 x i64> <i64 poison, i64 0>)
; CHECK-NEXT: ret <2 x i64> [[TMP1]]		; CHECK-NEXT: ret <2 x i64> [[TMP1]]
;		;
%ld = tail call <2 x i64> @llvm.x86.avx2.maskload.q(i8* %f, <2 x i64> <i64 -1, i64 0>)		%ld = tail call <2 x i64> @llvm.x86.avx2.maskload.q(i8* %f, <2 x i64> <i64 -1, i64 0>)
ret <2 x i64> %ld		ret <2 x i64> %ld
}		}

define <8 x i32> @mload_v8i32(i8* %f) {		define <8 x i32> @mload_v8i32(i8* %f) {
; CHECK-LABEL: @mload_v8i32(		; CHECK-LABEL: @mload_v8i32(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <8 x i32>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <8 x i32>
; CHECK-NEXT: [[TMP1:%.]] = call <8 x i32> @llvm.masked.load.v8i32.p0v8i32(<8 x i32> [[CASTVEC]], i32 1, <8 x i1> <i1 false, i1 false, i1 false, i1 true, i1 false, i1 false, i1 false, i1 false>, <8 x i32> <i32 0, i32 0, i32 0, i32 undef, i32 0, i32 0, i32 0, i32 0>)		; CHECK-NEXT: [[TMP1:%.]] = call <8 x i32> @llvm.masked.load.v8i32.p0v8i32(<8 x i32> [[CASTVEC]], i32 1, <8 x i1> <i1 false, i1 false, i1 false, i1 true, i1 false, i1 false, i1 false, i1 false>, <8 x i32> <i32 0, i32 0, i32 0, i32 poison, i32 0, i32 0, i32 0, i32 0>)
; CHECK-NEXT: ret <8 x i32> [[TMP1]]		; CHECK-NEXT: ret <8 x i32> [[TMP1]]
;		;
%ld = tail call <8 x i32> @llvm.x86.avx2.maskload.d.256(i8* %f, <8 x i32> <i32 0, i32 0, i32 0, i32 -1, i32 0, i32 0, i32 0, i32 0>)		%ld = tail call <8 x i32> @llvm.x86.avx2.maskload.d.256(i8* %f, <8 x i32> <i32 0, i32 0, i32 0, i32 -1, i32 0, i32 0, i32 0, i32 0>)
ret <8 x i32> %ld		ret <8 x i32> %ld
}		}

define <4 x i64> @mload_v4i64(i8* %f) {		define <4 x i64> @mload_v4i64(i8* %f) {
; CHECK-LABEL: @mload_v4i64(		; CHECK-LABEL: @mload_v4i64(
; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x i64>		; CHECK-NEXT: [[CASTVEC:%.]] = bitcast i8 [[F:%.]] to <4 x i64>
; CHECK-NEXT: [[TMP1:%.]] = call <4 x i64> @llvm.masked.load.v4i64.p0v4i64(<4 x i64> [[CASTVEC]], i32 1, <4 x i1> <i1 true, i1 false, i1 false, i1 false>, <4 x i64> <i64 undef, i64 0, i64 0, i64 0>)		; CHECK-NEXT: [[TMP1:%.]] = call <4 x i64> @llvm.masked.load.v4i64.p0v4i64(<4 x i64> [[CASTVEC]], i32 1, <4 x i1> <i1 true, i1 false, i1 false, i1 false>, <4 x i64> <i64 poison, i64 0, i64 0, i64 0>)
; CHECK-NEXT: ret <4 x i64> [[TMP1]]		; CHECK-NEXT: ret <4 x i64> [[TMP1]]
;		;
%ld = tail call <4 x i64> @llvm.x86.avx2.maskload.q.256(i8* %f, <4 x i64> <i64 -1, i64 0, i64 0, i64 0>)		%ld = tail call <4 x i64> @llvm.x86.avx2.maskload.q.256(i8* %f, <4 x i64> <i64 -1, i64 0, i64 0, i64 0>)
ret <4 x i64> %ld		ret <4 x i64> %ld
}		}

define <4 x i64> @mload_v4i64_cmp(i8* %f, <4 x i64> %src) {		define <4 x i64> @mload_v4i64_cmp(i8* %f, <4 x i64> %src) {
; CHECK-LABEL: @mload_v4i64_cmp(		; CHECK-LABEL: @mload_v4i64_cmp(
▲ Show 20 Lines • Show All 223 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-muldq.ll

	Show First 20 Lines • Show All 157 Lines • ▼ Show 20 Lines
	;			;

	define <2 x i64> @test_demanded_elts_pmuludq_128(<4 x i32> %a0, <4 x i32> %a1) {			define <2 x i64> @test_demanded_elts_pmuludq_128(<4 x i32> %a0, <4 x i32> %a1) {
	; CHECK-LABEL: @test_demanded_elts_pmuludq_128(			; CHECK-LABEL: @test_demanded_elts_pmuludq_128(
	; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i32> [[A0:%.]], <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i32> [[A0:%.]], <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP2:%.]] = shufflevector <4 x i32> [[A1:%.]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.]] = shufflevector <4 x i32> [[A1:%.]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <4 x i32> [[TMP1]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <4 x i32> [[TMP1]] to <2 x i64>
	; CHECK-NEXT: [[TMP4:%.*]] = bitcast <4 x i32> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP4:%.*]] = bitcast <4 x i32> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: [[TMP5:%.*]] = and <2 x i64> [[TMP3]], <i64 4294967295, i64 undef>			; CHECK-NEXT: [[TMP5:%.*]] = and <2 x i64> [[TMP3]], <i64 4294967295, i64 poison>
	; CHECK-NEXT: [[TMP6:%.*]] = and <2 x i64> [[TMP4]], <i64 4294967295, i64 undef>			; CHECK-NEXT: [[TMP6:%.*]] = and <2 x i64> [[TMP4]], <i64 4294967295, i64 poison>
	; CHECK-NEXT: [[TMP7:%.*]] = mul <2 x i64> [[TMP5]], [[TMP6]]			; CHECK-NEXT: [[TMP7:%.*]] = mul <2 x i64> [[TMP5]], [[TMP6]]
	; CHECK-NEXT: [[TMP8:%.*]] = shufflevector <2 x i64> [[TMP7]], <2 x i64> undef, <2 x i32> zeroinitializer			; CHECK-NEXT: [[TMP8:%.*]] = shufflevector <2 x i64> [[TMP7]], <2 x i64> undef, <2 x i32> zeroinitializer
	; CHECK-NEXT: ret <2 x i64> [[TMP8]]			; CHECK-NEXT: ret <2 x i64> [[TMP8]]
	;			;
	%1 = shufflevector <4 x i32> %a0, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 2, i32 2>			%1 = shufflevector <4 x i32> %a0, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 2, i32 2>
	%2 = shufflevector <4 x i32> %a1, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 3, i32 3>			%2 = shufflevector <4 x i32> %a1, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 3, i32 3>
	%3 = call <2 x i64> @llvm.x86.sse2.pmulu.dq(<4 x i32> %1, <4 x i32> %2)			%3 = call <2 x i64> @llvm.x86.sse2.pmulu.dq(<4 x i32> %1, <4 x i32> %2)
	%4 = shufflevector <2 x i64> %3, <2 x i64> undef, <2 x i32> zeroinitializer			%4 = shufflevector <2 x i64> %3, <2 x i64> undef, <2 x i32> zeroinitializer
	▲ Show 20 Lines • Show All 106 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-pack-inseltpoison.ll

Show First 20 Lines • Show All 263 Lines • ▼ Show 20 Lines	;
%3 = call <16 x i16> @llvm.x86.avx2.packssdw(<8 x i32> %1, <8 x i32> %2)		%3 = call <16 x i16> @llvm.x86.avx2.packssdw(<8 x i32> %1, <8 x i32> %2)
%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15>		%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15>
ret <16 x i16> %4		ret <16 x i16> %4
}		}

define <16 x i16> @elts_packusdw_256(<8 x i32> %a0, <8 x i32> %a1) {		define <16 x i16> @elts_packusdw_256(<8 x i32> %a0, <8 x i32> %a1) {
; CHECK-LABEL: @elts_packusdw_256(		; CHECK-LABEL: @elts_packusdw_256(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x i32> [[A1:%.]], <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x i32> [[A1:%.]], <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
; CHECK-NEXT: [[TMP2:%.*]] = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> undef, <8 x i32> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> poison, <8 x i32> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <16 x i16> [[TMP2]], <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <16 x i16> [[TMP2]], <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: ret <16 x i16> [[TMP3]]		; CHECK-NEXT: ret <16 x i16> [[TMP3]]
;		;
%1 = shufflevector <8 x i32> %a0, <8 x i32> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>		%1 = shufflevector <8 x i32> %a0, <8 x i32> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>
%2 = shufflevector <8 x i32> %a1, <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		%2 = shufflevector <8 x i32> %a1, <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
%3 = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> %1, <8 x i32> %2)		%3 = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> %1, <8 x i32> %2)
%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>		%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>
ret <16 x i16> %4		ret <16 x i16> %4
Show All 32 Lines	;
%3 = call <32 x i16> @llvm.x86.avx512.packssdw.512(<16 x i32> %1, <16 x i32> %2)		%3 = call <32 x i16> @llvm.x86.avx512.packssdw.512(<16 x i32> %1, <16 x i32> %2)
%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15, i32 undef, i32 undef, i32 18, i32 19, i32 20, i32 undef, i32 undef, i32 23, i32 24, i32 undef, i32 undef, i32 27, i32 28, i32 undef, i32 undef, i32 31>		%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15, i32 undef, i32 undef, i32 18, i32 19, i32 20, i32 undef, i32 undef, i32 23, i32 24, i32 undef, i32 undef, i32 27, i32 28, i32 undef, i32 undef, i32 31>
ret <32 x i16> %4		ret <32 x i16> %4
}		}

define <32 x i16> @elts_packusdw_512(<16 x i32> %a0, <16 x i32> %a1) {		define <32 x i16> @elts_packusdw_512(<16 x i32> %a0, <16 x i32> %a1) {
; CHECK-LABEL: @elts_packusdw_512(		; CHECK-LABEL: @elts_packusdw_512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i32> [[A1:%.]], <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i32> [[A1:%.]], <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>
; CHECK-NEXT: [[TMP2:%.*]] = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> undef, <16 x i32> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> poison, <16 x i32> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <32 x i16> [[TMP2]], <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <32 x i16> [[TMP2]], <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: ret <32 x i16> [[TMP3]]		; CHECK-NEXT: ret <32 x i16> [[TMP3]]
;		;
%1 = shufflevector <16 x i32> %a0, <16 x i32> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>		%1 = shufflevector <16 x i32> %a0, <16 x i32> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>
%2 = shufflevector <16 x i32> %a1, <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>		%2 = shufflevector <16 x i32> %a1, <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>
%3 = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> %1, <16 x i32> %2)		%3 = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> %1, <16 x i32> %2)
%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>		%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>
ret <32 x i16> %4		ret <32 x i16> %4
▲ Show 20 Lines • Show All 306 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-pack.ll

Show First 20 Lines • Show All 263 Lines • ▼ Show 20 Lines	;
%3 = call <16 x i16> @llvm.x86.avx2.packssdw(<8 x i32> %1, <8 x i32> %2)		%3 = call <16 x i16> @llvm.x86.avx2.packssdw(<8 x i32> %1, <8 x i32> %2)
%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15>		%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15>
ret <16 x i16> %4		ret <16 x i16> %4
}		}

define <16 x i16> @elts_packusdw_256(<8 x i32> %a0, <8 x i32> %a1) {		define <16 x i16> @elts_packusdw_256(<8 x i32> %a0, <8 x i32> %a1) {
; CHECK-LABEL: @elts_packusdw_256(		; CHECK-LABEL: @elts_packusdw_256(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x i32> [[A1:%.]], <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x i32> [[A1:%.]], <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
; CHECK-NEXT: [[TMP2:%.*]] = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> undef, <8 x i32> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> poison, <8 x i32> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <16 x i16> [[TMP2]], <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <16 x i16> [[TMP2]], <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: ret <16 x i16> [[TMP3]]		; CHECK-NEXT: ret <16 x i16> [[TMP3]]
;		;
%1 = shufflevector <8 x i32> %a0, <8 x i32> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>		%1 = shufflevector <8 x i32> %a0, <8 x i32> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>
%2 = shufflevector <8 x i32> %a1, <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		%2 = shufflevector <8 x i32> %a1, <8 x i32> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
%3 = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> %1, <8 x i32> %2)		%3 = call <16 x i16> @llvm.x86.avx2.packusdw(<8 x i32> %1, <8 x i32> %2)
%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>		%4 = shufflevector <16 x i16> %3, <16 x i16> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef>
ret <16 x i16> %4		ret <16 x i16> %4
Show All 32 Lines	;
%3 = call <32 x i16> @llvm.x86.avx512.packssdw.512(<16 x i32> %1, <16 x i32> %2)		%3 = call <32 x i16> @llvm.x86.avx512.packssdw.512(<16 x i32> %1, <16 x i32> %2)
%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15, i32 undef, i32 undef, i32 18, i32 19, i32 20, i32 undef, i32 undef, i32 23, i32 24, i32 undef, i32 undef, i32 27, i32 28, i32 undef, i32 undef, i32 31>		%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 4, i32 undef, i32 undef, i32 7, i32 8, i32 undef, i32 undef, i32 11, i32 12, i32 undef, i32 undef, i32 15, i32 undef, i32 undef, i32 18, i32 19, i32 20, i32 undef, i32 undef, i32 23, i32 24, i32 undef, i32 undef, i32 27, i32 28, i32 undef, i32 undef, i32 31>
ret <32 x i16> %4		ret <32 x i16> %4
}		}

define <32 x i16> @elts_packusdw_512(<16 x i32> %a0, <16 x i32> %a1) {		define <32 x i16> @elts_packusdw_512(<16 x i32> %a0, <16 x i32> %a1) {
; CHECK-LABEL: @elts_packusdw_512(		; CHECK-LABEL: @elts_packusdw_512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i32> [[A1:%.]], <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i32> [[A1:%.]], <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>
; CHECK-NEXT: [[TMP2:%.*]] = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> undef, <16 x i32> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> poison, <16 x i32> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <32 x i16> [[TMP2]], <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <32 x i16> [[TMP2]], <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: ret <32 x i16> [[TMP3]]		; CHECK-NEXT: ret <32 x i16> [[TMP3]]
;		;
%1 = shufflevector <16 x i32> %a0, <16 x i32> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>		%1 = shufflevector <16 x i32> %a0, <16 x i32> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>
%2 = shufflevector <16 x i32> %a1, <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>		%2 = shufflevector <16 x i32> %a1, <16 x i32> undef, <16 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0, i32 15, i32 14, i32 13, i32 12, i32 11, i32 10, i32 9, i32 8>
%3 = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> %1, <16 x i32> %2)		%3 = call <32 x i16> @llvm.x86.avx512.packusdw.512(<16 x i32> %1, <16 x i32> %2)
%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>		%4 = shufflevector <32 x i16> %3, <32 x i16> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 12, i32 13, i32 14, i32 15, i32 undef, i32 undef, i32 undef, i32 undef, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 28, i32 29, i32 30, i32 31, i32 undef, i32 undef, i32 undef, i32 undef>
ret <32 x i16> %4		ret <32 x i16> %4
▲ Show 20 Lines • Show All 306 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-pshufb.ll

Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines

; In the test case below, elements in the low 128-bit lane of the result		; In the test case below, elements in the low 128-bit lane of the result
; vector are equal to the lower byte of %InVec (shuffle index 0).		; vector are equal to the lower byte of %InVec (shuffle index 0).
; Elements in the high 128-bit lane of the result vector are equal to		; Elements in the high 128-bit lane of the result vector are equal to
; the lower byte in the high 128-bit lane of %InVec (shuffle index 16).		; the lower byte in the high 128-bit lane of %InVec (shuffle index 16).

define <32 x i8> @splat_test_avx2(<32 x i8> %InVec) {		define <32 x i8> @splat_test_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @splat_test_avx2(		; CHECK-LABEL: @splat_test_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> undef, <32 x i32> <i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> poison, <32 x i32> <i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> zeroinitializer)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> zeroinitializer)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @splat_test_avx512(<64 x i8> %InVec) {		define <64 x i8> @splat_test_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @splat_test_avx512(		; CHECK-LABEL: @splat_test_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> undef, <64 x i32> <i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> poison, <64 x i32> <i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> zeroinitializer)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> zeroinitializer)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

; Each of the byte shuffles in the following tests is equivalent to a blend between		; Each of the byte shuffles in the following tests is equivalent to a blend between
; vector %InVec and a vector of all zeroes.		; vector %InVec and a vector of all zeroes.

define <16 x i8> @blend1(<16 x i8> %InVec) {		define <16 x i8> @blend1(<16 x i8> %InVec) {
; CHECK-LABEL: @blend1(		; CHECK-LABEL: @blend1(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 16, i32 1, i32 16, i32 3, i32 16, i32 5, i32 16, i32 7, i32 16, i32 9, i32 16, i32 11, i32 16, i32 13, i32 16, i32 15>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 16, i32 1, i32 16, i32 3, i32 16, i32 5, i32 16, i32 7, i32 16, i32 9, i32 16, i32 11, i32 16, i32 13, i32 16, i32 15>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @blend2(<16 x i8> %InVec) {		define <16 x i8> @blend2(<16 x i8> %InVec) {
; CHECK-LABEL: @blend2(		; CHECK-LABEL: @blend2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 16, i32 16, i32 2, i32 3, i32 16, i32 16, i32 6, i32 7, i32 16, i32 16, i32 10, i32 11, i32 16, i32 16, i32 14, i32 15>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 16, i32 16, i32 2, i32 3, i32 16, i32 16, i32 6, i32 7, i32 16, i32 16, i32 10, i32 11, i32 16, i32 16, i32 14, i32 15>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @blend3(<16 x i8> %InVec) {		define <16 x i8> @blend3(<16 x i8> %InVec) {
; CHECK-LABEL: @blend3(		; CHECK-LABEL: @blend3(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 16, i32 16, i32 16, i32 16, i32 4, i32 5, i32 6, i32 7, i32 16, i32 16, i32 16, i32 16, i32 12, i32 13, i32 14, i32 15>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 16, i32 16, i32 16, i32 16, i32 4, i32 5, i32 6, i32 7, i32 16, i32 16, i32 16, i32 16, i32 12, i32 13, i32 14, i32 15>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @blend4(<16 x i8> %InVec) {		define <16 x i8> @blend4(<16 x i8> %InVec) {
; CHECK-LABEL: @blend4(		; CHECK-LABEL: @blend4(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @blend5(<16 x i8> %InVec) {		define <16 x i8> @blend5(<16 x i8> %InVec) {
; CHECK-LABEL: @blend5(		; CHECK-LABEL: @blend5(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @blend6(<16 x i8> %InVec) {		define <16 x i8> @blend6(<16 x i8> %InVec) {
; CHECK-LABEL: @blend6(		; CHECK-LABEL: @blend6(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 0, i32 1, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 0, i32 1, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <32 x i8> @blend1_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend1_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend1_avx2(		; CHECK-LABEL: @blend1_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 32, i32 1, i32 32, i32 3, i32 32, i32 5, i32 32, i32 7, i32 32, i32 9, i32 32, i32 11, i32 32, i32 13, i32 32, i32 15, i32 48, i32 17, i32 48, i32 19, i32 48, i32 21, i32 48, i32 23, i32 48, i32 25, i32 48, i32 27, i32 48, i32 29, i32 48, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 32, i32 1, i32 32, i32 3, i32 32, i32 5, i32 32, i32 7, i32 32, i32 9, i32 32, i32 11, i32 32, i32 13, i32 32, i32 15, i32 48, i32 17, i32 48, i32 19, i32 48, i32 21, i32 48, i32 23, i32 48, i32 25, i32 48, i32 27, i32 48, i32 29, i32 48, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @blend2_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend2_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend2_avx2(		; CHECK-LABEL: @blend2_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 32, i32 32, i32 2, i32 3, i32 32, i32 32, i32 6, i32 7, i32 32, i32 32, i32 10, i32 11, i32 32, i32 32, i32 14, i32 15, i32 48, i32 48, i32 18, i32 19, i32 48, i32 48, i32 22, i32 23, i32 48, i32 48, i32 26, i32 27, i32 48, i32 48, i32 30, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 32, i32 32, i32 2, i32 3, i32 32, i32 32, i32 6, i32 7, i32 32, i32 32, i32 10, i32 11, i32 32, i32 32, i32 14, i32 15, i32 48, i32 48, i32 18, i32 19, i32 48, i32 48, i32 22, i32 23, i32 48, i32 48, i32 26, i32 27, i32 48, i32 48, i32 30, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @blend3_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend3_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend3_avx2(		; CHECK-LABEL: @blend3_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 32, i32 32, i32 32, i32 32, i32 4, i32 5, i32 6, i32 7, i32 32, i32 32, i32 32, i32 32, i32 12, i32 13, i32 14, i32 15, i32 48, i32 48, i32 48, i32 48, i32 20, i32 21, i32 22, i32 23, i32 48, i32 48, i32 48, i32 48, i32 28, i32 29, i32 30, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 32, i32 32, i32 32, i32 32, i32 4, i32 5, i32 6, i32 7, i32 32, i32 32, i32 32, i32 32, i32 12, i32 13, i32 14, i32 15, i32 48, i32 48, i32 48, i32 48, i32 20, i32 21, i32 22, i32 23, i32 48, i32 48, i32 48, i32 48, i32 28, i32 29, i32 30, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @blend4_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend4_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend4_avx2(		; CHECK-LABEL: @blend4_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @blend5_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend5_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend5_avx2(		; CHECK-LABEL: @blend5_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 16, i32 17, i32 18, i32 19, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 16, i32 17, i32 18, i32 19, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @blend6_avx2(<32 x i8> %InVec) {		define <32 x i8> @blend6_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @blend6_avx2(		; CHECK-LABEL: @blend6_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 0, i32 1, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 16, i32 17, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 0, i32 1, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 16, i32 17, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @blend1_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend1_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend1_avx512(		; CHECK-LABEL: @blend1_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 64, i32 1, i32 64, i32 3, i32 64, i32 5, i32 64, i32 7, i32 64, i32 9, i32 64, i32 11, i32 64, i32 13, i32 64, i32 15, i32 80, i32 17, i32 80, i32 19, i32 80, i32 21, i32 80, i32 23, i32 80, i32 25, i32 80, i32 27, i32 80, i32 29, i32 80, i32 31, i32 96, i32 33, i32 96, i32 35, i32 96, i32 37, i32 96, i32 39, i32 96, i32 41, i32 96, i32 43, i32 96, i32 45, i32 96, i32 47, i32 112, i32 49, i32 112, i32 51, i32 112, i32 53, i32 112, i32 55, i32 112, i32 57, i32 112, i32 59, i32 112, i32 61, i32 112, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 64, i32 1, i32 64, i32 3, i32 64, i32 5, i32 64, i32 7, i32 64, i32 9, i32 64, i32 11, i32 64, i32 13, i32 64, i32 15, i32 80, i32 17, i32 80, i32 19, i32 80, i32 21, i32 80, i32 23, i32 80, i32 25, i32 80, i32 27, i32 80, i32 29, i32 80, i32 31, i32 96, i32 33, i32 96, i32 35, i32 96, i32 37, i32 96, i32 39, i32 96, i32 41, i32 96, i32 43, i32 96, i32 45, i32 96, i32 47, i32 112, i32 49, i32 112, i32 51, i32 112, i32 53, i32 112, i32 55, i32 112, i32 57, i32 112, i32 59, i32 112, i32 61, i32 112, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15, i8 -128, i8 1, i8 -128, i8 3, i8 -128, i8 5, i8 -128, i8 7, i8 -128, i8 9, i8 -128, i8 11, i8 -128, i8 13, i8 -128, i8 15>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @blend2_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend2_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend2_avx512(		; CHECK-LABEL: @blend2_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 64, i32 64, i32 2, i32 3, i32 64, i32 64, i32 6, i32 7, i32 64, i32 64, i32 10, i32 11, i32 64, i32 64, i32 14, i32 15, i32 80, i32 80, i32 18, i32 19, i32 80, i32 80, i32 22, i32 23, i32 80, i32 80, i32 26, i32 27, i32 80, i32 80, i32 30, i32 31, i32 96, i32 96, i32 34, i32 35, i32 96, i32 96, i32 38, i32 39, i32 96, i32 96, i32 42, i32 43, i32 96, i32 96, i32 46, i32 47, i32 112, i32 112, i32 50, i32 51, i32 112, i32 112, i32 54, i32 55, i32 112, i32 112, i32 58, i32 59, i32 112, i32 112, i32 62, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 64, i32 64, i32 2, i32 3, i32 64, i32 64, i32 6, i32 7, i32 64, i32 64, i32 10, i32 11, i32 64, i32 64, i32 14, i32 15, i32 80, i32 80, i32 18, i32 19, i32 80, i32 80, i32 22, i32 23, i32 80, i32 80, i32 26, i32 27, i32 80, i32 80, i32 30, i32 31, i32 96, i32 96, i32 34, i32 35, i32 96, i32 96, i32 38, i32 39, i32 96, i32 96, i32 42, i32 43, i32 96, i32 96, i32 46, i32 47, i32 112, i32 112, i32 50, i32 51, i32 112, i32 112, i32 54, i32 55, i32 112, i32 112, i32 58, i32 59, i32 112, i32 112, i32 62, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15, i8 -128, i8 -128, i8 2, i8 3, i8 -128, i8 -128, i8 6, i8 7, i8 -128, i8 -128, i8 10, i8 11, i8 -128, i8 -128, i8 14, i8 15>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @blend3_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend3_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend3_avx512(		; CHECK-LABEL: @blend3_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 64, i32 64, i32 64, i32 64, i32 4, i32 5, i32 6, i32 7, i32 64, i32 64, i32 64, i32 64, i32 12, i32 13, i32 14, i32 15, i32 80, i32 80, i32 80, i32 80, i32 20, i32 21, i32 22, i32 23, i32 80, i32 80, i32 80, i32 80, i32 28, i32 29, i32 30, i32 31, i32 96, i32 96, i32 96, i32 96, i32 36, i32 37, i32 38, i32 39, i32 96, i32 96, i32 96, i32 96, i32 44, i32 45, i32 46, i32 47, i32 112, i32 112, i32 112, i32 112, i32 52, i32 53, i32 54, i32 55, i32 112, i32 112, i32 112, i32 112, i32 60, i32 61, i32 62, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 64, i32 64, i32 64, i32 64, i32 4, i32 5, i32 6, i32 7, i32 64, i32 64, i32 64, i32 64, i32 12, i32 13, i32 14, i32 15, i32 80, i32 80, i32 80, i32 80, i32 20, i32 21, i32 22, i32 23, i32 80, i32 80, i32 80, i32 80, i32 28, i32 29, i32 30, i32 31, i32 96, i32 96, i32 96, i32 96, i32 36, i32 37, i32 38, i32 39, i32 96, i32 96, i32 96, i32 96, i32 44, i32 45, i32 46, i32 47, i32 112, i32 112, i32 112, i32 112, i32 52, i32 53, i32 54, i32 55, i32 112, i32 112, i32 112, i32 112, i32 60, i32 61, i32 62, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 12, i8 13, i8 14, i8 15>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @blend4_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend4_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend4_avx512(		; CHECK-LABEL: @blend4_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 40, i32 41, i32 42, i32 43, i32 44, i32 45, i32 46, i32 47, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 56, i32 57, i32 58, i32 59, i32 60, i32 61, i32 62, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 40, i32 41, i32 42, i32 43, i32 44, i32 45, i32 46, i32 47, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 56, i32 57, i32 58, i32 59, i32 60, i32 61, i32 62, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 8, i8 9, i8 10, i8 11, i8 12, i8 13, i8 14, i8 15>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @blend5_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend5_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend5_avx512(		; CHECK-LABEL: @blend5_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 16, i32 17, i32 18, i32 19, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 32, i32 33, i32 34, i32 35, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 48, i32 49, i32 50, i32 51, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 16, i32 17, i32 18, i32 19, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 32, i32 33, i32 34, i32 35, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 48, i32 49, i32 50, i32 51, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 2, i8 3, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @blend6_avx512(<64 x i8> %InVec) {		define <64 x i8> @blend6_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @blend6_avx512(		; CHECK-LABEL: @blend6_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 0, i32 1, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 16, i32 17, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 32, i32 33, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 48, i32 49, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 0, i32 1, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 16, i32 17, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 32, i32 33, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 48, i32 49, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128,i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128,i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 0, i8 1, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

; movq idiom.		; movq idiom.
define <16 x i8> @movq_idiom(<16 x i8> %InVec) {		define <16 x i8> @movq_idiom(<16 x i8> %InVec) {
; CHECK-LABEL: @movq_idiom(		; CHECK-LABEL: @movq_idiom(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16, i32 16>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <32 x i8> @movq_idiom_avx2(<32 x i8> %InVec) {		define <32 x i8> @movq_idiom_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @movq_idiom_avx2(		; CHECK-LABEL: @movq_idiom_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 32, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48, i32 48>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @movq_idiom_avx512(<64 x i8> %InVec) {		define <64 x i8> @movq_idiom_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @movq_idiom_avx512(		; CHECK-LABEL: @movq_idiom_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 64, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 80, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 96, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112, i32 112>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

; Vector permutations using byte shuffles.		; Vector permutations using byte shuffles.

define <16 x i8> @permute1(<16 x i8> %InVec) {		define <16 x i8> @permute1(<16 x i8> %InVec) {
; CHECK-LABEL: @permute1(		; CHECK-LABEL: @permute1(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> undef, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> poison, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <16 x i8> @permute2(<16 x i8> %InVec) {		define <16 x i8> @permute2(<16 x i8> %InVec) {
; CHECK-LABEL: @permute2(		; CHECK-LABEL: @permute2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <32 x i8> @permute1_avx2(<32 x i8> %InVec) {		define <32 x i8> @permute1_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @permute1_avx2(		; CHECK-LABEL: @permute1_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> poison, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <32 x i8> @permute2_avx2(<32 x i8> %InVec) {		define <32 x i8> @permute2_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @permute2_avx2(		; CHECK-LABEL: @permute2_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> undef, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> poison, <32 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @permute1_avx512(<64 x i8> %InVec) {		define <64 x i8> @permute1_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @permute1_avx512(		; CHECK-LABEL: @permute1_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> undef, <64 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31, i32 36, i32 37, i32 38, i32 39, i32 36, i32 37, i32 38, i32 39, i32 44, i32 45, i32 46, i32 47, i32 44, i32 45, i32 46, i32 47, i32 52, i32 53, i32 54, i32 55, i32 52, i32 53, i32 54, i32 55, i32 60, i32 61, i32 62, i32 63, i32 60, i32 61, i32 62, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> poison, <64 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31, i32 36, i32 37, i32 38, i32 39, i32 36, i32 37, i32 38, i32 39, i32 44, i32 45, i32 46, i32 47, i32 44, i32 45, i32 46, i32 47, i32 52, i32 53, i32 54, i32 55, i32 52, i32 53, i32 54, i32 55, i32 60, i32 61, i32 62, i32 63, i32 60, i32 61, i32 62, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15, i8 4, i8 5, i8 6, i8 7, i8 4, i8 5, i8 6, i8 7, i8 12, i8 13, i8 14, i8 15, i8 12, i8 13, i8 14, i8 15>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <64 x i8> @permute2_avx512(<64 x i8> %InVec) {		define <64 x i8> @permute2_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @permute2_avx512(		; CHECK-LABEL: @permute2_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> undef, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 32, i32 33, i32 34, i32 35, i32 36, i32 37, i32 38, i32 39, i32 32, i32 33, i32 34, i32 35, i32 36, i32 37, i32 38, i32 39, i32 48, i32 49, i32 50, i32 51, i32 52, i32 53, i32 54, i32 55, i32 48, i32 49, i32 50, i32 51, i32 52, i32 53, i32 54, i32 55>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> poison, <64 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 32, i32 33, i32 34, i32 35, i32 36, i32 37, i32 38, i32 39, i32 32, i32 33, i32 34, i32 35, i32 36, i32 37, i32 38, i32 39, i32 48, i32 49, i32 50, i32 51, i32 52, i32 53, i32 54, i32 55, i32 48, i32 49, i32 50, i32 51, i32 52, i32 53, i32 54, i32 55>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

; Test that instcombine correctly folds a pshufb with values that		; Test that instcombine correctly folds a pshufb with values that
; are not -128 and that are not encoded in four bits.		; are not -128 and that are not encoded in four bits.
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines
; CHECK-NEXT: ret <64 x i8> zeroinitializer		; CHECK-NEXT: ret <64 x i8> zeroinitializer
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -127, i8 -1, i8 -53, i8 -32, i8 -4, i8 -7, i8 -33, i8 -66, i8 -99, i8 -120, i8 -100, i8 -22, i8 -17, i8 -1, i8 -11, i8 -15, i8 -126, i8 -2, i8 -52, i8 -31, i8 -5, i8 -8, i8 -34, i8 -67, i8 -100, i8 -119, i8 -101, i8 -23, i8 -16, i8 -2, i8 -12, i8 -16, i8 -125, i8 -3, i8 -51, i8 -30, i8 -6, i8 -9, i8 -35, i8 -68, i8 -101, i8 -118, i8 -102, i8 -24, i8 -15, i8 -3, i8 -13, i8 -17, i8 -124, i8 -4, i8 -56, i8 -29, i8 -7, i8 -10, i8 -36, i8 -69, i8 -102, i8 -117, i8 -103, i8 -25, i8 -14, i8 -4, i8 -14, i8 -18>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 -127, i8 -1, i8 -53, i8 -32, i8 -4, i8 -7, i8 -33, i8 -66, i8 -99, i8 -120, i8 -100, i8 -22, i8 -17, i8 -1, i8 -11, i8 -15, i8 -126, i8 -2, i8 -52, i8 -31, i8 -5, i8 -8, i8 -34, i8 -67, i8 -100, i8 -119, i8 -101, i8 -23, i8 -16, i8 -2, i8 -12, i8 -16, i8 -125, i8 -3, i8 -51, i8 -30, i8 -6, i8 -9, i8 -35, i8 -68, i8 -101, i8 -118, i8 -102, i8 -24, i8 -15, i8 -3, i8 -13, i8 -17, i8 -124, i8 -4, i8 -56, i8 -29, i8 -7, i8 -10, i8 -36, i8 -69, i8 -102, i8 -117, i8 -103, i8 -25, i8 -14, i8 -4, i8 -14, i8 -18>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <16 x i8> @permute3(<16 x i8> %InVec) {		define <16 x i8> @permute3(<16 x i8> %InVec) {
; CHECK-LABEL: @permute3(		; CHECK-LABEL: @permute3(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 48, i8 17, i8 34, i8 51, i8 20, i8 37, i8 54, i8 23, i8 16, i8 49, i8 66, i8 19, i8 52, i8 69, i8 22, i8 55>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 48, i8 17, i8 34, i8 51, i8 20, i8 37, i8 54, i8 23, i8 16, i8 49, i8 66, i8 19, i8 52, i8 69, i8 22, i8 55>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <32 x i8> @permute3_avx2(<32 x i8> %InVec) {		define <32 x i8> @permute3_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @permute3_avx2(		; CHECK-LABEL: @permute3_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> undef, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> poison, <32 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 92, i8 93, i8 94, i8 95, i8 108, i8 109, i8 110, i8 111>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 92, i8 93, i8 94, i8 95, i8 108, i8 109, i8 110, i8 111>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @permute3_avx512(<64 x i8> %InVec) {		define <64 x i8> @permute3_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @permute3_avx512(		; CHECK-LABEL: @permute3_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> undef, <64 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31, i32 36, i32 37, i32 38, i32 39, i32 36, i32 37, i32 38, i32 39, i32 44, i32 45, i32 46, i32 47, i32 44, i32 45, i32 46, i32 47, i32 52, i32 53, i32 54, i32 55, i32 52, i32 53, i32 54, i32 55, i32 60, i32 61, i32 62, i32 63, i32 60, i32 61, i32 62, i32 63>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> poison, <64 x i32> <i32 4, i32 5, i32 6, i32 7, i32 4, i32 5, i32 6, i32 7, i32 12, i32 13, i32 14, i32 15, i32 12, i32 13, i32 14, i32 15, i32 20, i32 21, i32 22, i32 23, i32 20, i32 21, i32 22, i32 23, i32 28, i32 29, i32 30, i32 31, i32 28, i32 29, i32 30, i32 31, i32 36, i32 37, i32 38, i32 39, i32 36, i32 37, i32 38, i32 39, i32 44, i32 45, i32 46, i32 47, i32 44, i32 45, i32 46, i32 47, i32 52, i32 53, i32 54, i32 55, i32 52, i32 53, i32 54, i32 55, i32 60, i32 61, i32 62, i32 63, i32 60, i32 61, i32 62, i32 63>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 92, i8 93, i8 94, i8 95, i8 108, i8 109, i8 110, i8 111, i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 108, i8 109, i8 110, i8 111, i8 124, i8 125, i8 126, i8 127>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 92, i8 93, i8 94, i8 95, i8 108, i8 109, i8 110, i8 111, i8 52, i8 21, i8 38, i8 55, i8 20, i8 37, i8 54, i8 23, i8 28, i8 61, i8 78, i8 31, i8 60, i8 29, i8 30, i8 79, i8 52, i8 21, i8 38, i8 55, i8 20, i8 53, i8 102, i8 23, i8 108, i8 109, i8 110, i8 111, i8 124, i8 125, i8 126, i8 127>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

; FIXME: Verify that instcombine is able to fold constant byte shuffles with undef mask elements.		; FIXME: Verify that instcombine is able to fold constant byte shuffles with undef mask elements.

define <16 x i8> @fold_with_undef_elts(<16 x i8> %InVec) {		define <16 x i8> @fold_with_undef_elts(<16 x i8> %InVec) {
; CHECK-LABEL: @fold_with_undef_elts(		; CHECK-LABEL: @fold_with_undef_elts(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 0, i32 16, i32 undef, i32 16, i32 1, i32 16, i32 undef, i32 16, i32 2, i32 16, i32 undef, i32 16, i32 3, i32 16, i32 undef, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <16 x i8> [[INVEC:%.]], <16 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 0, i32 16, i32 undef, i32 16, i32 1, i32 16, i32 undef, i32 16, i32 2, i32 16, i32 undef, i32 16, i32 3, i32 16, i32 undef, i32 16>
; CHECK-NEXT: ret <16 x i8> [[TMP1]]		; CHECK-NEXT: ret <16 x i8> [[TMP1]]
;		;
%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)		%1 = tail call <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8> %InVec, <16 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)
ret <16 x i8> %1		ret <16 x i8> %1
}		}

define <32 x i8> @fold_with_undef_elts_avx2(<32 x i8> %InVec) {		define <32 x i8> @fold_with_undef_elts_avx2(<32 x i8> %InVec) {
; CHECK-LABEL: @fold_with_undef_elts_avx2(		; CHECK-LABEL: @fold_with_undef_elts_avx2(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <32 x i32> <i32 0, i32 32, i32 undef, i32 32, i32 1, i32 32, i32 undef, i32 32, i32 2, i32 32, i32 undef, i32 32, i32 3, i32 32, i32 undef, i32 32, i32 16, i32 48, i32 undef, i32 48, i32 17, i32 48, i32 undef, i32 48, i32 18, i32 48, i32 undef, i32 48, i32 19, i32 48, i32 undef, i32 48>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <32 x i8> [[INVEC:%.]], <32 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <32 x i32> <i32 0, i32 32, i32 undef, i32 32, i32 1, i32 32, i32 undef, i32 32, i32 2, i32 32, i32 undef, i32 32, i32 3, i32 32, i32 undef, i32 32, i32 16, i32 48, i32 undef, i32 48, i32 17, i32 48, i32 undef, i32 48, i32 18, i32 48, i32 undef, i32 48, i32 19, i32 48, i32 undef, i32 48>
; CHECK-NEXT: ret <32 x i8> [[TMP1]]		; CHECK-NEXT: ret <32 x i8> [[TMP1]]
;		;
%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)		%1 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)
ret <32 x i8> %1		ret <32 x i8> %1
}		}

define <64 x i8> @fold_with_undef_elts_avx512(<64 x i8> %InVec) {		define <64 x i8> @fold_with_undef_elts_avx512(<64 x i8> %InVec) {
; CHECK-LABEL: @fold_with_undef_elts_avx512(		; CHECK-LABEL: @fold_with_undef_elts_avx512(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <64 x i32> <i32 0, i32 64, i32 undef, i32 64, i32 1, i32 64, i32 undef, i32 64, i32 2, i32 64, i32 undef, i32 64, i32 3, i32 64, i32 undef, i32 64, i32 16, i32 80, i32 undef, i32 80, i32 17, i32 80, i32 undef, i32 80, i32 18, i32 80, i32 undef, i32 80, i32 19, i32 80, i32 undef, i32 80, i32 32, i32 96, i32 undef, i32 96, i32 33, i32 96, i32 undef, i32 96, i32 34, i32 96, i32 undef, i32 96, i32 35, i32 96, i32 undef, i32 96, i32 48, i32 112, i32 undef, i32 112, i32 49, i32 112, i32 undef, i32 112, i32 50, i32 112, i32 undef, i32 112, i32 51, i32 112, i32 undef, i32 112>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <64 x i8> [[INVEC:%.]], <64 x i8> <i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <64 x i32> <i32 0, i32 64, i32 undef, i32 64, i32 1, i32 64, i32 undef, i32 64, i32 2, i32 64, i32 undef, i32 64, i32 3, i32 64, i32 undef, i32 64, i32 16, i32 80, i32 undef, i32 80, i32 17, i32 80, i32 undef, i32 80, i32 18, i32 80, i32 undef, i32 80, i32 19, i32 80, i32 undef, i32 80, i32 32, i32 96, i32 undef, i32 96, i32 33, i32 96, i32 undef, i32 96, i32 34, i32 96, i32 undef, i32 96, i32 35, i32 96, i32 undef, i32 96, i32 48, i32 112, i32 undef, i32 112, i32 49, i32 112, i32 undef, i32 112, i32 50, i32 112, i32 undef, i32 112, i32 51, i32 112, i32 undef, i32 112>
; CHECK-NEXT: ret <64 x i8> [[TMP1]]		; CHECK-NEXT: ret <64 x i8> [[TMP1]]
;		;
%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)		%1 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> <i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128, i8 0, i8 -128, i8 undef, i8 -128, i8 1, i8 -128, i8 undef, i8 -128, i8 2, i8 -128, i8 undef, i8 -128, i8 3, i8 -128, i8 undef, i8 -128>)
ret <64 x i8> %1		ret <64 x i8> %1
}		}

define <16 x i8> @fold_with_allundef_elts(<16 x i8> %InVec) {		define <16 x i8> @fold_with_allundef_elts(<16 x i8> %InVec) {
; CHECK-LABEL: @fold_with_allundef_elts(		; CHECK-LABEL: @fold_with_allundef_elts(
▲ Show 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	;
%2 = insertelement <32 x i8> %1, i8 %M22, i32 22		%2 = insertelement <32 x i8> %1, i8 %M22, i32 22
%3 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> %2)		%3 = tail call <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8> %InVec, <32 x i8> %2)
%4 = shufflevector <32 x i8> %3, <32 x i8> undef, <32 x i32> <i32 undef, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 undef, i32 23, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31>		%4 = shufflevector <32 x i8> %3, <32 x i8> undef, <32 x i32> <i32 undef, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15, i32 16, i32 17, i32 18, i32 19, i32 20, i32 21, i32 undef, i32 23, i32 24, i32 25, i32 26, i32 27, i32 28, i32 29, i32 30, i32 31>
ret <32 x i8> %4		ret <32 x i8> %4
}		}

define <64 x i8> @demanded_elts_insertion_avx512(<64 x i8> %InVec, <64 x i8> %BaseMask, i8 %M0, i8 %M30) {		define <64 x i8> @demanded_elts_insertion_avx512(<64 x i8> %InVec, <64 x i8> %BaseMask, i8 %M0, i8 %M30) {
; CHECK-LABEL: @demanded_elts_insertion_avx512(		; CHECK-LABEL: @demanded_elts_insertion_avx512(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <64 x i8> undef, i8 [[M0:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <64 x i8> poison, i8 [[M0:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> [[INVEC:%.]], <64 x i8> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.]] = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> [[INVEC:%.]], <64 x i8> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <64 x i8> [[TMP2]], <64 x i8> undef, <64 x i32> zeroinitializer		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <64 x i8> [[TMP2]], <64 x i8> undef, <64 x i32> zeroinitializer
; CHECK-NEXT: ret <64 x i8> [[TMP3]]		; CHECK-NEXT: ret <64 x i8> [[TMP3]]
;		;
%1 = insertelement <64 x i8> %BaseMask, i8 %M0, i32 0		%1 = insertelement <64 x i8> %BaseMask, i8 %M0, i32 0
%2 = insertelement <64 x i8> %1, i8 %M30, i32 30		%2 = insertelement <64 x i8> %1, i8 %M30, i32 30
%3 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> %2)		%3 = tail call <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8> %InVec, <64 x i8> %2)
%4 = shufflevector <64 x i8> %3, <64 x i8> undef, <64 x i32> zeroinitializer		%4 = shufflevector <64 x i8> %3, <64 x i8> undef, <64 x i32> zeroinitializer
ret <64 x i8> %4		ret <64 x i8> %4
}		}

declare <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8>, <16 x i8>)		declare <16 x i8> @llvm.x86.ssse3.pshuf.b.128(<16 x i8>, <16 x i8>)
declare <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8>, <32 x i8>)		declare <32 x i8> @llvm.x86.avx2.pshuf.b(<32 x i8>, <32 x i8>)
declare <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8>, <64 x i8>)		declare <64 x i8> @llvm.x86.avx512.pshuf.b.512(<64 x i8>, <64 x i8>)

llvm/test/Transforms/InstCombine/X86/x86-sse-inseltpoison.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s		; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"		target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

define float @test_rcp_ss_0(float %a) {		define float @test_rcp_ss_0(float %a) {
; CHECK-LABEL: @test_rcp_ss_0(		; CHECK-LABEL: @test_rcp_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rcp.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rcp.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 39 Lines	;
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = tail call <4 x float> @llvm.x86.sse.sqrt.ss(<4 x float> %4)		%5 = tail call <4 x float> @llvm.x86.sse.sqrt.ss(<4 x float> %4)
%6 = extractelement <4 x float> %5, i32 2		%6 = extractelement <4 x float> %5, i32 2
ret float %6		ret float %6
}		}

define float @test_rsqrt_ss_0(float %a) {		define float @test_rsqrt_ss_0(float %a) {
; CHECK-LABEL: @test_rsqrt_ss_0(		; CHECK-LABEL: @test_rsqrt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rsqrt.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rsqrt.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
▲ Show 20 Lines • Show All 196 Lines • ▼ Show 20 Lines	;
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %a, <4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %a, <4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_min_ss_0(float %a, float %b) {		define float @test_min_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_min_ss_0(		; CHECK-LABEL: @test_min_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[TMP4]]		; CHECK-NEXT: ret float [[TMP4]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %4, <4 x float> %5)		%6 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %4, <4 x float> %5)
%7 = extractelement <4 x float> %6, i32 2		%7 = extractelement <4 x float> %6, i32 2
ret float %7		ret float %7
}		}

define float @test_min_ss_3(float %a) {		define float @test_min_ss_3(float %a) {
; CHECK-LABEL: @test_min_ss_3(		; CHECK-LABEL: @test_min_ss_3(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %1, <4 x float> %1)		%2 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %1, <4 x float> %1)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define <4 x float> @test_max_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_max_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_max_ss(		; CHECK-LABEL: @test_max_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]])
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %a, <4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %a, <4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_max_ss_0(float %a, float %b) {		define float @test_max_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_max_ss_0(		; CHECK-LABEL: @test_max_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[TMP4]]		; CHECK-NEXT: ret float [[TMP4]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %4, <4 x float> %5)		%6 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %4, <4 x float> %5)
%7 = extractelement <4 x float> %6, i32 3		%7 = extractelement <4 x float> %6, i32 3
ret float %7		ret float %7
}		}

define float @test_max_ss_4(float %a) {		define float @test_max_ss_4(float %a) {
; CHECK-LABEL: @test_max_ss_4(		; CHECK-LABEL: @test_max_ss_4(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %1, <4 x float> %1)		%2 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %1, <4 x float> %1)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define <4 x float> @test_cmp_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_cmp_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_cmp_ss(		; CHECK-LABEL: @test_cmp_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]], i8 0)		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]], i8 0)
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %a, <4 x float> %3, i8 0)		%4 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %a, <4 x float> %3, i8 0)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_cmp_ss_0(float %a, float %b) {		define float @test_cmp_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_cmp_ss_0(		; CHECK-LABEL: @test_cmp_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i8 0)		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i8 0)
; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[R]]		; CHECK-NEXT: ret float [[R]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %4, <4 x float> %5, i8 0)		%6 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %4, <4 x float> %5, i8 0)
%7 = extractelement <4 x float> %6, i32 1		%7 = extractelement <4 x float> %6, i32 1
ret float %7		ret float %7
}		}

define float @test_cmp_ss_2(float %a) {		define float @test_cmp_ss_2(float %a) {
; CHECK-LABEL: @test_cmp_ss_2(		; CHECK-LABEL: @test_cmp_ss_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]], i8 3)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]], i8 3)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %1, <4 x float> %1, i8 3)		%2 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %1, <4 x float> %1, i8 3)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define i32 @test_comieq_ss_0(float %a, float %b) {		define i32 @test_comieq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comieq_ss_0(		; CHECK-LABEL: @test_comieq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comige_ss_0(float %a, float %b) {		define i32 @test_comige_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comige_ss_0(		; CHECK-LABEL: @test_comige_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comigt_ss_0(float %a, float %b) {		define i32 @test_comigt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comigt_ss_0(		; CHECK-LABEL: @test_comigt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comile_ss_0(float %a, float %b) {		define i32 @test_comile_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comile_ss_0(		; CHECK-LABEL: @test_comile_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comilt_ss_0(float %a, float %b) {		define i32 @test_comilt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comilt_ss_0(		; CHECK-LABEL: @test_comilt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comineq_ss_0(float %a, float %b) {		define i32 @test_comineq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comineq_ss_0(		; CHECK-LABEL: @test_comineq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomieq_ss_0(float %a, float %b) {		define i32 @test_ucomieq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomieq_ss_0(		; CHECK-LABEL: @test_ucomieq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomige_ss_0(float %a, float %b) {		define i32 @test_ucomige_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomige_ss_0(		; CHECK-LABEL: @test_ucomige_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomigt_ss_0(float %a, float %b) {		define i32 @test_ucomigt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomigt_ss_0(		; CHECK-LABEL: @test_ucomigt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomile_ss_0(float %a, float %b) {		define i32 @test_ucomile_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomile_ss_0(		; CHECK-LABEL: @test_ucomile_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomilt_ss_0(float %a, float %b) {		define i32 @test_ucomilt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomilt_ss_0(		; CHECK-LABEL: @test_ucomilt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomineq_ss_0(float %a, float %b) {		define i32 @test_ucomineq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomineq_ss_0(		; CHECK-LABEL: @test_ucomineq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
Show All 32 Lines

llvm/test/Transforms/InstCombine/X86/x86-sse.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s		; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"		target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

define float @test_rcp_ss_0(float %a) {		define float @test_rcp_ss_0(float %a) {
; CHECK-LABEL: @test_rcp_ss_0(		; CHECK-LABEL: @test_rcp_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rcp.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rcp.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 39 Lines	;
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = tail call <4 x float> @llvm.x86.sse.sqrt.ss(<4 x float> %4)		%5 = tail call <4 x float> @llvm.x86.sse.sqrt.ss(<4 x float> %4)
%6 = extractelement <4 x float> %5, i32 2		%6 = extractelement <4 x float> %5, i32 2
ret float %6		ret float %6
}		}

define float @test_rsqrt_ss_0(float %a) {		define float @test_rsqrt_ss_0(float %a) {
; CHECK-LABEL: @test_rsqrt_ss_0(		; CHECK-LABEL: @test_rsqrt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rsqrt.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.rsqrt.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
▲ Show 20 Lines • Show All 196 Lines • ▼ Show 20 Lines	;
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %a, <4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %a, <4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_min_ss_0(float %a, float %b) {		define float @test_min_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_min_ss_0(		; CHECK-LABEL: @test_min_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[TMP4]]		; CHECK-NEXT: ret float [[TMP4]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %4, <4 x float> %5)		%6 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %4, <4 x float> %5)
%7 = extractelement <4 x float> %6, i32 2		%7 = extractelement <4 x float> %6, i32 2
ret float %7		ret float %7
}		}

define float @test_min_ss_3(float %a) {		define float @test_min_ss_3(float %a) {
; CHECK-LABEL: @test_min_ss_3(		; CHECK-LABEL: @test_min_ss_3(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %1, <4 x float> %1)		%2 = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> %1, <4 x float> %1)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define <4 x float> @test_max_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_max_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_max_ss(		; CHECK-LABEL: @test_max_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]])
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %a, <4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %a, <4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_max_ss_0(float %a, float %b) {		define float @test_max_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_max_ss_0(		; CHECK-LABEL: @test_max_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[TMP4]]		; CHECK-NEXT: ret float [[TMP4]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %4, <4 x float> %5)		%6 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %4, <4 x float> %5)
%7 = extractelement <4 x float> %6, i32 3		%7 = extractelement <4 x float> %6, i32 3
ret float %7		ret float %7
}		}

define float @test_max_ss_4(float %a) {		define float @test_max_ss_4(float %a) {
; CHECK-LABEL: @test_max_ss_4(		; CHECK-LABEL: @test_max_ss_4(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %1, <4 x float> %1)		%2 = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> %1, <4 x float> %1)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define <4 x float> @test_cmp_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_cmp_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_cmp_ss(		; CHECK-LABEL: @test_cmp_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]], i8 0)		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[A:%.]], <4 x float> [[B:%.*]], i8 0)
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %a, <4 x float> %3, i8 0)		%4 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %a, <4 x float> %3, i8 0)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_cmp_ss_0(float %a, float %b) {		define float @test_cmp_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_cmp_ss_0(		; CHECK-LABEL: @test_cmp_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i8 0)		; CHECK-NEXT: [[TMP3:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]], i8 0)
; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP3]], i32 0		; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP3]], i32 0
; CHECK-NEXT: ret float [[R]]		; CHECK-NEXT: ret float [[R]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
Show All 17 Lines	;
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %4, <4 x float> %5, i8 0)		%6 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %4, <4 x float> %5, i8 0)
%7 = extractelement <4 x float> %6, i32 1		%7 = extractelement <4 x float> %6, i32 1
ret float %7		ret float %7
}		}

define float @test_cmp_ss_2(float %a) {		define float @test_cmp_ss_2(float %a) {
; CHECK-LABEL: @test_cmp_ss_2(		; CHECK-LABEL: @test_cmp_ss_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]], i8 3)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> [[TMP1]], <4 x float> [[TMP1]], i8 3)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %1, <4 x float> %1, i8 3)		%2 = tail call <4 x float> @llvm.x86.sse.cmp.ss(<4 x float> %1, <4 x float> %1, i8 3)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

define i32 @test_comieq_ss_0(float %a, float %b) {		define i32 @test_comieq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comieq_ss_0(		; CHECK-LABEL: @test_comieq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comieq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comige_ss_0(float %a, float %b) {		define i32 @test_comige_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comige_ss_0(		; CHECK-LABEL: @test_comige_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comige.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comigt_ss_0(float %a, float %b) {		define i32 @test_comigt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comigt_ss_0(		; CHECK-LABEL: @test_comigt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comigt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comile_ss_0(float %a, float %b) {		define i32 @test_comile_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comile_ss_0(		; CHECK-LABEL: @test_comile_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comile.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comilt_ss_0(float %a, float %b) {		define i32 @test_comilt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comilt_ss_0(		; CHECK-LABEL: @test_comilt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comilt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_comineq_ss_0(float %a, float %b) {		define i32 @test_comineq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_comineq_ss_0(		; CHECK-LABEL: @test_comineq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.comineq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomieq_ss_0(float %a, float %b) {		define i32 @test_ucomieq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomieq_ss_0(		; CHECK-LABEL: @test_ucomieq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomieq.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomige_ss_0(float %a, float %b) {		define i32 @test_ucomige_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomige_ss_0(		; CHECK-LABEL: @test_ucomige_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomige.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomigt_ss_0(float %a, float %b) {		define i32 @test_ucomigt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomigt_ss_0(		; CHECK-LABEL: @test_ucomigt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomigt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomile_ss_0(float %a, float %b) {		define i32 @test_ucomile_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomile_ss_0(		; CHECK-LABEL: @test_ucomile_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomile.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomilt_ss_0(float %a, float %b) {		define i32 @test_ucomilt_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomilt_ss_0(		; CHECK-LABEL: @test_ucomilt_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1		%6 = insertelement <4 x float> %5, float 4.000000e+00, i32 1
%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2		%7 = insertelement <4 x float> %6, float 5.000000e+00, i32 2
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> %4, <4 x float> %8)		%9 = tail call i32 @llvm.x86.sse.ucomilt.ss(<4 x float> %4, <4 x float> %8)
ret i32 %9		ret i32 %9
}		}

define i32 @test_ucomineq_ss_0(float %a, float %b) {		define i32 @test_ucomineq_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_ucomineq_ss_0(		; CHECK-LABEL: @test_ucomineq_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse.ucomineq.ss(<4 x float> [[TMP1]], <4 x float> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
Show All 32 Lines

llvm/test/Transforms/InstCombine/X86/x86-sse2-inseltpoison.ll

Show First 20 Lines • Show All 183 Lines • ▼ Show 20 Lines
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %a, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %a, <2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_min_sd_0(double %a, double %b) {		define double @test_min_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_min_sd_0(		; CHECK-LABEL: @test_min_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %2, <2 x double> %4)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_min_sd_2(double %a) {		define double @test_min_sd_2(double %a) {
; CHECK-LABEL: @test_min_sd_2(		; CHECK-LABEL: @test_min_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %1, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %1, <2 x double> %1)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <2 x double> @test_max_sd(<2 x double> %a, <2 x double> %b) {		define <2 x double> @test_max_sd(<2 x double> %a, <2 x double> %b) {
; CHECK-LABEL: @test_max_sd(		; CHECK-LABEL: @test_max_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]])
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %a, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %a, <2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_max_sd_0(double %a, double %b) {		define double @test_max_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_max_sd_0(		; CHECK-LABEL: @test_max_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %2, <2 x double> %4)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_max_sd_2(double %a) {		define double @test_max_sd_2(double %a) {
; CHECK-LABEL: @test_max_sd_2(		; CHECK-LABEL: @test_max_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %1, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %1, <2 x double> %1)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <2 x double> @test_cmp_sd(<2 x double> %a, <2 x double> %b) {		define <2 x double> @test_cmp_sd(<2 x double> %a, <2 x double> %b) {
; CHECK-LABEL: @test_cmp_sd(		; CHECK-LABEL: @test_cmp_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]], i8 0)		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]], i8 0)
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %a, <2 x double> %1, i8 0)		%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %a, <2 x double> %1, i8 0)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_cmp_sd_0(double %a, double %b) {		define double @test_cmp_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_cmp_sd_0(		; CHECK-LABEL: @test_cmp_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i8 0)		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i8 0)
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %2, <2 x double> %4, i8 0)		%5 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %2, <2 x double> %4, i8 0)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_cmp_sd_2(double %a) {		define double @test_cmp_sd_2(double %a) {
; CHECK-LABEL: @test_cmp_sd_2(		; CHECK-LABEL: @test_cmp_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]], i8 3)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]], i8 3)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %1, <2 x double> %1, i8 3)		%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %1, <2 x double> %1, i8 3)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define i32 @test_comieq_sd_0(double %a, double %b) {		define i32 @test_comieq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comieq_sd_0(		; CHECK-LABEL: @test_comieq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comige_sd_0(double %a, double %b) {		define i32 @test_comige_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comige_sd_0(		; CHECK-LABEL: @test_comige_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comigt_sd_0(double %a, double %b) {		define i32 @test_comigt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comigt_sd_0(		; CHECK-LABEL: @test_comigt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comile_sd_0(double %a, double %b) {		define i32 @test_comile_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comile_sd_0(		; CHECK-LABEL: @test_comile_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comilt_sd_0(double %a, double %b) {		define i32 @test_comilt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comilt_sd_0(		; CHECK-LABEL: @test_comilt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comineq_sd_0(double %a, double %b) {		define i32 @test_comineq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comineq_sd_0(		; CHECK-LABEL: @test_comineq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomieq_sd_0(double %a, double %b) {		define i32 @test_ucomieq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomieq_sd_0(		; CHECK-LABEL: @test_ucomieq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomige_sd_0(double %a, double %b) {		define i32 @test_ucomige_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomige_sd_0(		; CHECK-LABEL: @test_ucomige_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomigt_sd_0(double %a, double %b) {		define i32 @test_ucomigt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomigt_sd_0(		; CHECK-LABEL: @test_ucomigt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomile_sd_0(double %a, double %b) {		define i32 @test_ucomile_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomile_sd_0(		; CHECK-LABEL: @test_ucomile_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomilt_sd_0(double %a, double %b) {		define i32 @test_ucomilt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomilt_sd_0(		; CHECK-LABEL: @test_ucomilt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomineq_sd_0(double %a, double %b) {		define i32 @test_ucomineq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomineq_sd_0(		; CHECK-LABEL: @test_ucomineq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> %2, <2 x double> %4)
Show All 26 Lines

llvm/test/Transforms/InstCombine/X86/x86-sse2.ll

Show First 20 Lines • Show All 183 Lines • ▼ Show 20 Lines
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %a, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %a, <2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_min_sd_0(double %a, double %b) {		define double @test_min_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_min_sd_0(		; CHECK-LABEL: @test_min_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %2, <2 x double> %4)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_min_sd_2(double %a) {		define double @test_min_sd_2(double %a) {
; CHECK-LABEL: @test_min_sd_2(		; CHECK-LABEL: @test_min_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %1, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.min.sd(<2 x double> %1, <2 x double> %1)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <2 x double> @test_max_sd(<2 x double> %a, <2 x double> %b) {		define <2 x double> @test_max_sd(<2 x double> %a, <2 x double> %b) {
; CHECK-LABEL: @test_max_sd(		; CHECK-LABEL: @test_max_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]])
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %a, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %a, <2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_max_sd_0(double %a, double %b) {		define double @test_max_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_max_sd_0(		; CHECK-LABEL: @test_max_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %2, <2 x double> %4)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_max_sd_2(double %a) {		define double @test_max_sd_2(double %a) {
; CHECK-LABEL: @test_max_sd_2(		; CHECK-LABEL: @test_max_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %1, <2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.sse2.max.sd(<2 x double> %1, <2 x double> %1)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <2 x double> @test_cmp_sd(<2 x double> %a, <2 x double> %b) {		define <2 x double> @test_cmp_sd(<2 x double> %a, <2 x double> %b) {
; CHECK-LABEL: @test_cmp_sd(		; CHECK-LABEL: @test_cmp_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]], i8 0)		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[A:%.]], <2 x double> [[B:%.*]], i8 0)
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%1 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %a, <2 x double> %1, i8 0)		%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %a, <2 x double> %1, i8 0)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_cmp_sd_0(double %a, double %b) {		define double @test_cmp_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_cmp_sd_0(		; CHECK-LABEL: @test_cmp_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i8 0)		; CHECK-NEXT: [[TMP3:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]], i8 0)
; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0		; CHECK-NEXT: [[TMP4:%.*]] = extractelement <2 x double> [[TMP3]], i32 0
; CHECK-NEXT: ret double [[TMP4]]		; CHECK-NEXT: ret double [[TMP4]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
Show All 12 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %2, <2 x double> %4, i8 0)		%5 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %2, <2 x double> %4, i8 0)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_cmp_sd_2(double %a) {		define double @test_cmp_sd_2(double %a) {
; CHECK-LABEL: @test_cmp_sd_2(		; CHECK-LABEL: @test_cmp_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]], i8 3)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> [[TMP1]], <2 x double> [[TMP1]], i8 3)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %1, <2 x double> %1, i8 3)		%2 = tail call <2 x double> @llvm.x86.sse2.cmp.sd(<2 x double> %1, <2 x double> %1, i8 3)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define i32 @test_comieq_sd_0(double %a, double %b) {		define i32 @test_comieq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comieq_sd_0(		; CHECK-LABEL: @test_comieq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comieq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comige_sd_0(double %a, double %b) {		define i32 @test_comige_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comige_sd_0(		; CHECK-LABEL: @test_comige_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comige.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comigt_sd_0(double %a, double %b) {		define i32 @test_comigt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comigt_sd_0(		; CHECK-LABEL: @test_comigt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comigt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comile_sd_0(double %a, double %b) {		define i32 @test_comile_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comile_sd_0(		; CHECK-LABEL: @test_comile_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comile.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comilt_sd_0(double %a, double %b) {		define i32 @test_comilt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comilt_sd_0(		; CHECK-LABEL: @test_comilt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comilt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_comineq_sd_0(double %a, double %b) {		define i32 @test_comineq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_comineq_sd_0(		; CHECK-LABEL: @test_comineq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.comineq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomieq_sd_0(double %a, double %b) {		define i32 @test_ucomieq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomieq_sd_0(		; CHECK-LABEL: @test_ucomieq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomieq.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomige_sd_0(double %a, double %b) {		define i32 @test_ucomige_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomige_sd_0(		; CHECK-LABEL: @test_ucomige_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomige.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomigt_sd_0(double %a, double %b) {		define i32 @test_ucomigt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomigt_sd_0(		; CHECK-LABEL: @test_ucomigt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomigt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomile_sd_0(double %a, double %b) {		define i32 @test_ucomile_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomile_sd_0(		; CHECK-LABEL: @test_ucomile_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomile.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomilt_sd_0(double %a, double %b) {		define i32 @test_ucomilt_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomilt_sd_0(		; CHECK-LABEL: @test_ucomilt_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomilt.sd(<2 x double> %2, <2 x double> %4)
ret i32 %5		ret i32 %5
}		}

define i32 @test_ucomineq_sd_0(double %a, double %b) {		define i32 @test_ucomineq_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_ucomineq_sd_0(		; CHECK-LABEL: @test_ucomineq_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])		; CHECK-NEXT: [[TMP3:%.*]] = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> [[TMP1]], <2 x double> [[TMP2]])
; CHECK-NEXT: ret i32 [[TMP3]]		; CHECK-NEXT: ret i32 [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> %2, <2 x double> %4)		%5 = tail call i32 @llvm.x86.sse2.ucomineq.sd(<2 x double> %2, <2 x double> %4)
Show All 26 Lines

llvm/test/Transforms/InstCombine/X86/x86-sse41-inseltpoison.ll

Show All 9 Lines	;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 0		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 0
%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%3 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %2, i32 10)		%3 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %2, i32 10)
ret <2 x double> %3		ret <2 x double> %3
}		}

define double @test_round_sd_0(double %a, double %b) {		define double @test_round_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_round_sd_0(		; CHECK-LABEL: @test_round_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> undef, <2 x double> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> poison, <2 x double> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> poison, double %b, i32 0		%3 = insertelement <2 x double> poison, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)		%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)
Show All 11 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)		%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_round_sd_2(double %a) {		define double @test_round_sd_2(double %a) {
; CHECK-LABEL: @test_round_sd_2(		; CHECK-LABEL: @test_round_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> undef, <2 x double> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> poison, <2 x double> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %1, i32 10)		%2 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %1, i32 10)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <4 x float> @test_round_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_round_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_round_ss(		; CHECK-LABEL: @test_round_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float 3.000000e+00>, <4 x float> [[B:%.]], i32 10)		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float 3.000000e+00>, <4 x float> [[B:%.]], i32 10)
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %a, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %a, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%4 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%5 = insertelement <4 x float> %4, float 2.000000e+00, i32 2		%5 = insertelement <4 x float> %4, float 2.000000e+00, i32 2
%6 = insertelement <4 x float> %5, float 3.000000e+00, i32 3		%6 = insertelement <4 x float> %5, float 3.000000e+00, i32 3
%7 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %3, <4 x float> %6, i32 10)		%7 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %3, <4 x float> %6, i32 10)
ret <4 x float> %7		ret <4 x float> %7
}		}

define float @test_round_ss_0(float %a, float %b) {		define float @test_round_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_round_ss_0(		; CHECK-LABEL: @test_round_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> undef, <4 x float> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> poison, <4 x float> [[TMP1]], i32 10)
; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[R]]		; CHECK-NEXT: ret float [[R]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> poison, float %b, i32 0		%5 = insertelement <4 x float> poison, float %b, i32 0
Show All 19 Lines	;
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %4, <4 x float> %8, i32 10)		%9 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %4, <4 x float> %8, i32 10)
%r = extractelement <4 x float> %9, i32 2		%r = extractelement <4 x float> %9, i32 2
ret float %r		ret float %r
}		}

define float @test_round_ss_3(float %a) {		define float @test_round_ss_3(float %a) {
; CHECK-LABEL: @test_round_ss_3(		; CHECK-LABEL: @test_round_ss_3(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> undef, <4 x float> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> poison, <4 x float> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %1, <4 x float> %1, i32 10)		%2 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %1, <4 x float> %1, i32 10)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

declare <2 x double> @llvm.x86.sse41.round.sd(<2 x double>, <2 x double>, i32) nounwind readnone		declare <2 x double> @llvm.x86.sse41.round.sd(<2 x double>, <2 x double>, i32) nounwind readnone
declare <4 x float> @llvm.x86.sse41.round.ss(<4 x float>, <4 x float>, i32) nounwind readnone		declare <4 x float> @llvm.x86.sse41.round.ss(<4 x float>, <4 x float>, i32) nounwind readnone

llvm/test/Transforms/InstCombine/X86/x86-sse41.ll

Show All 9 Lines	;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 0		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 0
%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1		%2 = insertelement <2 x double> %b, double 2.000000e+00, i32 1
%3 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %2, i32 10)		%3 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %2, i32 10)
ret <2 x double> %3		ret <2 x double> %3
}		}

define double @test_round_sd_0(double %a, double %b) {		define double @test_round_sd_0(double %a, double %b) {
; CHECK-LABEL: @test_round_sd_0(		; CHECK-LABEL: @test_round_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> undef, <2 x double> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> poison, <2 x double> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = insertelement <2 x double> undef, double %b, i32 0		%3 = insertelement <2 x double> undef, double %b, i32 0
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)		%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)
Show All 11 Lines	;
%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1		%4 = insertelement <2 x double> %3, double 2.000000e+00, i32 1
%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)		%5 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %2, <2 x double> %4, i32 10)
%6 = extractelement <2 x double> %5, i32 1		%6 = extractelement <2 x double> %5, i32 1
ret double %6		ret double %6
}		}

define double @test_round_sd_2(double %a) {		define double @test_round_sd_2(double %a) {
; CHECK-LABEL: @test_round_sd_2(		; CHECK-LABEL: @test_round_sd_2(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> undef, <2 x double> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> poison, <2 x double> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0		%1 = insertelement <2 x double> zeroinitializer, double %a, i32 0
%2 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %1, i32 10)		%2 = tail call <2 x double> @llvm.x86.sse41.round.sd(<2 x double> %1, <2 x double> %1, i32 10)
%3 = extractelement <2 x double> %2, i32 0		%3 = extractelement <2 x double> %2, i32 0
ret double %3		ret double %3
}		}

define <4 x float> @test_round_ss(<4 x float> %a, <4 x float> %b) {		define <4 x float> @test_round_ss(<4 x float> %a, <4 x float> %b) {
; CHECK-LABEL: @test_round_ss(		; CHECK-LABEL: @test_round_ss(
; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float 3.000000e+00>, <4 x float> [[B:%.]], i32 10)		; CHECK-NEXT: [[TMP1:%.]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float 3.000000e+00>, <4 x float> [[B:%.]], i32 10)
; CHECK-NEXT: ret <4 x float> [[TMP1]]		; CHECK-NEXT: ret <4 x float> [[TMP1]]
;		;
%1 = insertelement <4 x float> %a, float 1.000000e+00, i32 1		%1 = insertelement <4 x float> %a, float 1.000000e+00, i32 1
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = insertelement <4 x float> %b, float 1.000000e+00, i32 1		%4 = insertelement <4 x float> %b, float 1.000000e+00, i32 1
%5 = insertelement <4 x float> %4, float 2.000000e+00, i32 2		%5 = insertelement <4 x float> %4, float 2.000000e+00, i32 2
%6 = insertelement <4 x float> %5, float 3.000000e+00, i32 3		%6 = insertelement <4 x float> %5, float 3.000000e+00, i32 3
%7 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %3, <4 x float> %6, i32 10)		%7 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %3, <4 x float> %6, i32 10)
ret <4 x float> %7		ret <4 x float> %7
}		}

define float @test_round_ss_0(float %a, float %b) {		define float @test_round_ss_0(float %a, float %b) {
; CHECK-LABEL: @test_round_ss_0(		; CHECK-LABEL: @test_round_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> undef, <4 x float> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> poison, <4 x float> [[TMP1]], i32 10)
; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[R:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[R]]		; CHECK-NEXT: ret float [[R]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
%5 = insertelement <4 x float> undef, float %b, i32 0		%5 = insertelement <4 x float> undef, float %b, i32 0
Show All 19 Lines	;
%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3		%8 = insertelement <4 x float> %7, float 6.000000e+00, i32 3
%9 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %4, <4 x float> %8, i32 10)		%9 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %4, <4 x float> %8, i32 10)
%r = extractelement <4 x float> %9, i32 2		%r = extractelement <4 x float> %9, i32 2
ret float %r		ret float %r
}		}

define float @test_round_ss_3(float %a) {		define float @test_round_ss_3(float %a) {
; CHECK-LABEL: @test_round_ss_3(		; CHECK-LABEL: @test_round_ss_3(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> undef, <4 x float> [[TMP1]], i32 10)		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> poison, <4 x float> [[TMP1]], i32 10)
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0		%1 = insertelement <4 x float> zeroinitializer, float %a, i32 0
%2 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %1, <4 x float> %1, i32 10)		%2 = tail call <4 x float> @llvm.x86.sse41.round.ss(<4 x float> %1, <4 x float> %1, i32 10)
%3 = extractelement <4 x float> %2, i32 0		%3 = extractelement <4 x float> %2, i32 0
ret float %3		ret float %3
}		}

declare <2 x double> @llvm.x86.sse41.round.sd(<2 x double>, <2 x double>, i32) nounwind readnone		declare <2 x double> @llvm.x86.sse41.round.sd(<2 x double>, <2 x double>, i32) nounwind readnone
declare <4 x float> @llvm.x86.sse41.round.ss(<4 x float>, <4 x float>, i32) nounwind readnone		declare <4 x float> @llvm.x86.sse41.round.ss(<4 x float>, <4 x float>, i32) nounwind readnone

llvm/test/Transforms/InstCombine/X86/x86-sse4a.ll

	Show All 19 Lines
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> zeroinitializer, <16 x i8> %y) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> zeroinitializer, <16 x i8> %y) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrq_zero_arg1(<2 x i64> %x, <16 x i8> %y) {			define <2 x i64> @test_extrq_zero_arg1(<2 x i64> %x, <16 x i8> %y) {
	; CHECK-LABEL: @test_extrq_zero_arg1(			; CHECK-LABEL: @test_extrq_zero_arg1(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> %x, <16 x i8> zeroinitializer) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> %x, <16 x i8> zeroinitializer) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrq_to_extqi(<2 x i64> %x, <16 x i8> %y) {			define <2 x i64> @test_extrq_to_extqi(<2 x i64> %x, <16 x i8> %y) {
	Show All 19 Lines
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> <i64 -1, i64 undef>, <16 x i8> <i8 16, i8 15, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> <i64 -1, i64 undef>, <16 x i8> <i8 16, i8 15, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0>) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrq_call_constexpr(<2 x i64> %x) {			define <2 x i64> @test_extrq_call_constexpr(<2 x i64> %x) {
	; CHECK-LABEL: @test_extrq_call_constexpr(			; CHECK-LABEL: @test_extrq_call_constexpr(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> %x, <16 x i8> bitcast (<2 x i64> <i64 0, i64 undef> to <16 x i8>))			%1 = call <2 x i64> @llvm.x86.sse4a.extrq(<2 x i64> %x, <16 x i8> bitcast (<2 x i64> <i64 0, i64 undef> to <16 x i8>))
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	;			;
	; EXTRQI			; EXTRQI
	;			;

	define <2 x i64> @test_extrqi_call(<2 x i64> %x) {			define <2 x i64> @test_extrqi_call(<2 x i64> %x) {
	; CHECK-LABEL: @test_extrqi_call(			; CHECK-LABEL: @test_extrqi_call(
	; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> [[X:%.]], i8 8, i8 23)			; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> [[X:%.]], i8 8, i8 23)
	; CHECK-NEXT: ret <2 x i64> [[TMP1]]			; CHECK-NEXT: ret <2 x i64> [[TMP1]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 8, i8 23)			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 8, i8 23)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrqi_shuffle_1zuu(<2 x i64> %x) {			define <2 x i64> @test_extrqi_shuffle_1zuu(<2 x i64> %x) {
	; CHECK-LABEL: @test_extrqi_shuffle_1zuu(			; CHECK-LABEL: @test_extrqi_shuffle_1zuu(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> <i8 undef, i8 undef, i8 undef, i8 undef, i8 0, i8 0, i8 0, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> <i8 poison, i8 poison, i8 poison, i8 poison, i8 0, i8 0, i8 0, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 4, i32 5, i32 6, i32 7, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 32, i8 32)			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 32, i8 32)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrqi_shuffle_2zzzzzzzuuuuuuuu(<2 x i64> %x) {			define <2 x i64> @test_extrqi_shuffle_2zzzzzzzuuuuuuuu(<2 x i64> %x) {
	; CHECK-LABEL: @test_extrqi_shuffle_2zzzzzzzuuuuuuuu(			; CHECK-LABEL: @test_extrqi_shuffle_2zzzzzzzuuuuuuuu(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[X:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> <i8 undef, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef, i8 undef>, <16 x i32> <i32 2, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> <i8 poison, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 0, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison, i8 poison>, <16 x i32> <i32 2, i32 17, i32 18, i32 19, i32 20, i32 21, i32 22, i32 23, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 8, i8 16)			%1 = tail call <2 x i64> @llvm.x86.sse4a.extrqi(<2 x i64> %x, i8 8, i8 16)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_extrqi_undef(<2 x i64> %x) {			define <2 x i64> @test_extrqi_undef(<2 x i64> %x) {
	▲ Show 20 Lines • Show All 46 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: ret <2 x i64> [[TMP1]]			; CHECK-NEXT: ret <2 x i64> [[TMP1]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> %y) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> %y) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_insertq_to_insertqi(<2 x i64> %x, <2 x i64> %y) {			define <2 x i64> @test_insertq_to_insertqi(<2 x i64> %x, <2 x i64> %y) {
	; CHECK-LABEL: @test_insertq_to_insertqi(			; CHECK-LABEL: @test_insertq_to_insertqi(
	; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 8, i64 undef>, i8 18, i8 2)			; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 8, i64 poison>, i8 18, i8 2)
	; CHECK-NEXT: ret <2 x i64> [[TMP1]]			; CHECK-NEXT: ret <2 x i64> [[TMP1]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> <i64 8, i64 658>) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> <i64 8, i64 658>) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_insertq_constant(<2 x i64> %x, <2 x i64> %y) {			define <2 x i64> @test_insertq_constant(<2 x i64> %x, <2 x i64> %y) {
	; CHECK-LABEL: @test_insertq_constant(			; CHECK-LABEL: @test_insertq_constant(
	; CHECK-NEXT: ret <2 x i64> <i64 32, i64 undef>			; CHECK-NEXT: ret <2 x i64> <i64 32, i64 undef>
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> <i64 0, i64 0>, <2 x i64> <i64 8, i64 658>) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> <i64 0, i64 0>, <2 x i64> <i64 8, i64 658>) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_insertq_constant_undef(<2 x i64> %x, <2 x i64> %y) {			define <2 x i64> @test_insertq_constant_undef(<2 x i64> %x, <2 x i64> %y) {
	; CHECK-LABEL: @test_insertq_constant_undef(			; CHECK-LABEL: @test_insertq_constant_undef(
	; CHECK-NEXT: ret <2 x i64> <i64 33, i64 undef>			; CHECK-NEXT: ret <2 x i64> <i64 33, i64 undef>
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> <i64 1, i64 undef>, <2 x i64> <i64 8, i64 658>) nounwind			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> <i64 1, i64 undef>, <2 x i64> <i64 8, i64 658>) nounwind
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_insertq_call_constexpr(<2 x i64> %x) {			define <2 x i64> @test_insertq_call_constexpr(<2 x i64> %x) {
	; CHECK-LABEL: @test_insertq_call_constexpr(			; CHECK-LABEL: @test_insertq_call_constexpr(
	; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 0, i64 undef>, i8 2, i8 0)			; CHECK-NEXT: [[TMP1:%.]] = call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 0, i64 poison>, i8 2, i8 0)
	; CHECK-NEXT: ret <2 x i64> [[TMP1]]			; CHECK-NEXT: ret <2 x i64> [[TMP1]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> bitcast (<16 x i8> trunc (<16 x i16> bitcast (<4 x i64> <i64 0, i64 undef, i64 2, i64 undef> to <16 x i16>) to <16 x i8>) to <2 x i64>))			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertq(<2 x i64> %x, <2 x i64> bitcast (<16 x i8> trunc (<16 x i16> bitcast (<4 x i64> <i64 0, i64 undef, i64 2, i64 undef> to <16 x i16>) to <16 x i8>) to <2 x i64>))
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	;			;
	; INSERTQI			; INSERTQI
	Show All 28 Lines
	; CHECK-NEXT: ret <2 x i64> <i64 -131055, i64 undef>			; CHECK-NEXT: ret <2 x i64> <i64 -131055, i64 undef>
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> <i64 -1, i64 -1>, <2 x i64> <i64 8, i64 0>, i8 16, i8 1)			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> <i64 -1, i64 -1>, <2 x i64> <i64 8, i64 0>, i8 16, i8 1)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @test_insertqi_call_constexpr(<2 x i64> %x) {			define <2 x i64> @test_insertqi_call_constexpr(<2 x i64> %x) {
	; CHECK-LABEL: @test_insertqi_call_constexpr(			; CHECK-LABEL: @test_insertqi_call_constexpr(
	; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 0, i64 undef>, i8 48, i8 3)			; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> [[X:%.]], <2 x i64> <i64 0, i64 poison>, i8 48, i8 3)
	; CHECK-NEXT: ret <2 x i64> [[TMP1]]			; CHECK-NEXT: ret <2 x i64> [[TMP1]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %x, <2 x i64> bitcast (<16 x i8> trunc (<16 x i16> bitcast (<4 x i64> <i64 0, i64 undef, i64 2, i64 undef> to <16 x i16>) to <16 x i8>) to <2 x i64>), i8 48, i8 3)			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %x, <2 x i64> bitcast (<16 x i8> trunc (<16 x i16> bitcast (<4 x i64> <i64 0, i64 undef, i64 2, i64 undef> to <16 x i16>) to <16 x i8>) to <2 x i64>), i8 48, i8 3)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	; The result of this insert is the second arg, since the top 64 bits of			; The result of this insert is the second arg, since the top 64 bits of
	; the result are undefined, and we copy the bottom 64 bits from the			; the result are undefined, and we copy the bottom 64 bits from the
	; second arg			; second arg
	define <2 x i64> @testInsert64Bits(<2 x i64> %v, <2 x i64> %i) {			define <2 x i64> @testInsert64Bits(<2 x i64> %v, <2 x i64> %i) {
	; CHECK-LABEL: @testInsert64Bits(			; CHECK-LABEL: @testInsert64Bits(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[I:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[I:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %v, <2 x i64> %i, i8 64, i8 0)			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %v, <2 x i64> %i, i8 64, i8 0)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @testZeroLength(<2 x i64> %v, <2 x i64> %i) {			define <2 x i64> @testZeroLength(<2 x i64> %v, <2 x i64> %i) {
	; CHECK-LABEL: @testZeroLength(			; CHECK-LABEL: @testZeroLength(
	; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[I:%.]] to <16 x i8>			; CHECK-NEXT: [[TMP1:%.]] = bitcast <2 x i64> [[I:%.]] to <16 x i8>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> undef, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <16 x i8> [[TMP1]], <16 x i8> poison, <16 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef>
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>			; CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i8> [[TMP2]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %v, <2 x i64> %i, i8 0, i8 0)			%1 = tail call <2 x i64> @llvm.x86.sse4a.insertqi(<2 x i64> %v, <2 x i64> %i, i8 0, i8 0)
	ret <2 x i64> %1			ret <2 x i64> %1
	}			}

	define <2 x i64> @testUndefinedInsertq_1(<2 x i64> %v, <2 x i64> %i) {			define <2 x i64> @testUndefinedInsertq_1(<2 x i64> %v, <2 x i64> %i) {
	▲ Show 20 Lines • Show All 156 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-vec_demanded_elts-inseltpoison.ll

	; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s			; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	define i16 @test1(float %f) {			define i16 @test1(float %f) {
	; CHECK-LABEL: @test1(			; CHECK-LABEL: @test1(
	; CHECK-NEXT: [[TMP281:%.*]] = fadd float %f, -1.000000e+00			; CHECK-NEXT: [[TMP281:%.*]] = fadd float %f, -1.000000e+00
	; CHECK-NEXT: [[TMP373:%.*]] = fmul float [[TMP281]], 5.000000e-01			; CHECK-NEXT: [[TMP373:%.*]] = fmul float [[TMP281]], 5.000000e-01
	; CHECK-NEXT: [[TMP374:%.*]] = insertelement <4 x float> undef, float [[TMP373]], i32 0			; CHECK-NEXT: [[TMP374:%.*]] = insertelement <4 x float> poison, float [[TMP373]], i32 0
	; CHECK-NEXT: [[TMP48:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP374]], <4 x float> <float 6.553500e+04, float undef, float undef, float undef>)			; CHECK-NEXT: [[TMP48:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP374]], <4 x float> <float 6.553500e+04, float poison, float poison, float poison>)
	; CHECK-NEXT: [[TMP59:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP48]], <4 x float> <float 0.000000e+00, float undef, float undef, float undef>)			; CHECK-NEXT: [[TMP59:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP48]], <4 x float> <float 0.000000e+00, float poison, float poison, float poison>)
	; CHECK-NEXT: [[TMP_UPGRD_1:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[TMP59]])			; CHECK-NEXT: [[TMP_UPGRD_1:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[TMP59]])
	; CHECK-NEXT: [[TMP69:%.*]] = trunc i32 [[TMP_UPGRD_1]] to i16			; CHECK-NEXT: [[TMP69:%.*]] = trunc i32 [[TMP_UPGRD_1]] to i16
	; CHECK-NEXT: ret i16 [[TMP69]]			; CHECK-NEXT: ret i16 [[TMP69]]
	;			;
	%tmp = insertelement <4 x float> poison, float %f, i32 0			%tmp = insertelement <4 x float> poison, float %f, i32 0
	%tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, i32 1			%tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, i32 1
	%tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, i32 2			%tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, i32 2
	%tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, i32 3			%tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, i32 3
	%tmp28 = tail call <4 x float> @llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp28 = tail call <4 x float> @llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp37 = tail call <4 x float> @llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp37 = tail call <4 x float> @llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp48 = tail call <4 x float> @llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp48 = tail call <4 x float> @llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp59 = tail call <4 x float> @llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer )			%tmp59 = tail call <4 x float> @llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer )
	%tmp.upgrd.1 = tail call i32 @llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )			%tmp.upgrd.1 = tail call i32 @llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )
	%tmp69 = trunc i32 %tmp.upgrd.1 to i16			%tmp69 = trunc i32 %tmp.upgrd.1 to i16
	ret i16 %tmp69			ret i16 %tmp69
	}			}

	define i64 @test3(float %f, double %d) {			define i64 @test3(float %f, double %d) {
	; CHECK-LABEL: @test3(			; CHECK-LABEL: @test3(
	; CHECK-NEXT: [[V00:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V00:%.*]] = insertelement <4 x float> poison, float %f, i32 0
	; CHECK-NEXT: [[TMP0:%.*]] = tail call i32 @llvm.x86.sse.cvtss2si(<4 x float> [[V00]])			; CHECK-NEXT: [[TMP0:%.*]] = tail call i32 @llvm.x86.sse.cvtss2si(<4 x float> [[V00]])
	; CHECK-NEXT: [[V10:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V10:%.*]] = insertelement <4 x float> poison, float %f, i32 0
	; CHECK-NEXT: [[TMP1:%.*]] = tail call i64 @llvm.x86.sse.cvtss2si64(<4 x float> [[V10]])			; CHECK-NEXT: [[TMP1:%.*]] = tail call i64 @llvm.x86.sse.cvtss2si64(<4 x float> [[V10]])
	; CHECK-NEXT: [[V20:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V20:%.*]] = insertelement <4 x float> poison, float %f, i32 0
	; CHECK-NEXT: [[TMP2:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[V20]])			; CHECK-NEXT: [[TMP2:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[V20]])
	; CHECK-NEXT: [[V30:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V30:%.*]] = insertelement <4 x float> poison, float %f, i32 0
	; CHECK-NEXT: [[TMP3:%.*]] = tail call i64 @llvm.x86.sse.cvttss2si64(<4 x float> [[V30]])			; CHECK-NEXT: [[TMP3:%.*]] = tail call i64 @llvm.x86.sse.cvttss2si64(<4 x float> [[V30]])
	; CHECK-NEXT: [[V40:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V40:%.*]] = insertelement <2 x double> poison, double %d, i32 0
	; CHECK-NEXT: [[TMP4:%.*]] = tail call i32 @llvm.x86.sse2.cvtsd2si(<2 x double> [[V40]])			; CHECK-NEXT: [[TMP4:%.*]] = tail call i32 @llvm.x86.sse2.cvtsd2si(<2 x double> [[V40]])
	; CHECK-NEXT: [[V50:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V50:%.*]] = insertelement <2 x double> poison, double %d, i32 0
	; CHECK-NEXT: [[TMP5:%.*]] = tail call i64 @llvm.x86.sse2.cvtsd2si64(<2 x double> [[V50]])			; CHECK-NEXT: [[TMP5:%.*]] = tail call i64 @llvm.x86.sse2.cvtsd2si64(<2 x double> [[V50]])
	; CHECK-NEXT: [[V60:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V60:%.*]] = insertelement <2 x double> poison, double %d, i32 0
	; CHECK-NEXT: [[TMP6:%.*]] = tail call i32 @llvm.x86.sse2.cvttsd2si(<2 x double> [[V60]])			; CHECK-NEXT: [[TMP6:%.*]] = tail call i32 @llvm.x86.sse2.cvttsd2si(<2 x double> [[V60]])
	; CHECK-NEXT: [[V70:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V70:%.*]] = insertelement <2 x double> poison, double %d, i32 0
	; CHECK-NEXT: [[TMP7:%.*]] = tail call i64 @llvm.x86.sse2.cvttsd2si64(<2 x double> [[V70]])			; CHECK-NEXT: [[TMP7:%.*]] = tail call i64 @llvm.x86.sse2.cvttsd2si64(<2 x double> [[V70]])
	; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[TMP0]], [[TMP2]]			; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[TMP0]], [[TMP2]]
	; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[TMP4]], [[TMP6]]			; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[TMP4]], [[TMP6]]
	; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[TMP8]], [[TMP9]]			; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[TMP8]], [[TMP9]]
	; CHECK-NEXT: [[TMP11:%.*]] = sext i32 [[TMP10]] to i64			; CHECK-NEXT: [[TMP11:%.*]] = sext i32 [[TMP10]] to i64
	; CHECK-NEXT: [[TMP12:%.*]] = add i64 [[TMP1]], [[TMP3]]			; CHECK-NEXT: [[TMP12:%.*]] = add i64 [[TMP1]], [[TMP3]]
	; CHECK-NEXT: [[TMP13:%.*]] = add i64 [[TMP5]], [[TMP7]]			; CHECK-NEXT: [[TMP13:%.*]] = add i64 [[TMP5]], [[TMP7]]
	; CHECK-NEXT: [[TMP14:%.*]] = add i64 [[TMP12]], [[TMP13]]			; CHECK-NEXT: [[TMP14:%.*]] = add i64 [[TMP12]], [[TMP13]]
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-vec_demanded_elts.ll

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s			; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	define i16 @test1(float %f) {			define i16 @test1(float %f) {
	; CHECK-LABEL: @test1(			; CHECK-LABEL: @test1(
	; CHECK-NEXT: [[TMP281:%.*]] = fadd float %f, -1.000000e+00			; CHECK-NEXT: [[TMP1:%.]] = fadd float [[F:%.]], -1.000000e+00
	; CHECK-NEXT: [[TMP373:%.*]] = fmul float [[TMP281]], 5.000000e-01			; CHECK-NEXT: [[TMP2:%.*]] = fmul float [[TMP1]], 5.000000e-01
	; CHECK-NEXT: [[TMP374:%.*]] = insertelement <4 x float> undef, float [[TMP373]], i32 0			; CHECK-NEXT: [[TMP3:%.*]] = insertelement <4 x float> poison, float [[TMP2]], i32 0
	; CHECK-NEXT: [[TMP48:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP374]], <4 x float> <float 6.553500e+04, float undef, float undef, float undef>)			; CHECK-NEXT: [[TMP48:%.*]] = tail call <4 x float> @llvm.x86.sse.min.ss(<4 x float> [[TMP3]], <4 x float> <float 6.553500e+04, float poison, float poison, float poison>)
	; CHECK-NEXT: [[TMP59:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP48]], <4 x float> <float 0.000000e+00, float undef, float undef, float undef>)			; CHECK-NEXT: [[TMP59:%.*]] = tail call <4 x float> @llvm.x86.sse.max.ss(<4 x float> [[TMP48]], <4 x float> <float 0.000000e+00, float poison, float poison, float poison>)
	; CHECK-NEXT: [[TMP_UPGRD_1:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[TMP59]])			; CHECK-NEXT: [[TMP_UPGRD_1:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[TMP59]])
	; CHECK-NEXT: [[TMP69:%.*]] = trunc i32 [[TMP_UPGRD_1]] to i16			; CHECK-NEXT: [[TMP69:%.*]] = trunc i32 [[TMP_UPGRD_1]] to i16
	; CHECK-NEXT: ret i16 [[TMP69]]			; CHECK-NEXT: ret i16 [[TMP69]]
	;			;
	%tmp = insertelement <4 x float> undef, float %f, i32 0			%tmp = insertelement <4 x float> undef, float %f, i32 0
	%tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, i32 1			%tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, i32 1
	%tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, i32 2			%tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, i32 2
	%tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, i32 3			%tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, i32 3
	%tmp28 = tail call <4 x float> @llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp28 = tail call <4 x float> @llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp37 = tail call <4 x float> @llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp37 = tail call <4 x float> @llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp48 = tail call <4 x float> @llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )			%tmp48 = tail call <4 x float> @llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )
	%tmp59 = tail call <4 x float> @llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer )			%tmp59 = tail call <4 x float> @llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer )
	%tmp.upgrd.1 = tail call i32 @llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )			%tmp.upgrd.1 = tail call i32 @llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )
	%tmp69 = trunc i32 %tmp.upgrd.1 to i16			%tmp69 = trunc i32 %tmp.upgrd.1 to i16
	ret i16 %tmp69			ret i16 %tmp69
	}			}

	define i64 @test3(float %f, double %d) {			define i64 @test3(float %f, double %d) {
	; CHECK-LABEL: @test3(			; CHECK-LABEL: @test3(
	; CHECK-NEXT: [[V00:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V03:%.]] = insertelement <4 x float> poison, float [[F:%.]], i32 0
	; CHECK-NEXT: [[TMP0:%.*]] = tail call i32 @llvm.x86.sse.cvtss2si(<4 x float> [[V00]])			; CHECK-NEXT: [[TMP0:%.*]] = tail call i32 @llvm.x86.sse.cvtss2si(<4 x float> [[V03]])
	; CHECK-NEXT: [[V10:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V13:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
	; CHECK-NEXT: [[TMP1:%.*]] = tail call i64 @llvm.x86.sse.cvtss2si64(<4 x float> [[V10]])			; CHECK-NEXT: [[TMP1:%.*]] = tail call i64 @llvm.x86.sse.cvtss2si64(<4 x float> [[V13]])
	; CHECK-NEXT: [[V20:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V23:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
	; CHECK-NEXT: [[TMP2:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[V20]])			; CHECK-NEXT: [[TMP2:%.*]] = tail call i32 @llvm.x86.sse.cvttss2si(<4 x float> [[V23]])
	; CHECK-NEXT: [[V30:%.*]] = insertelement <4 x float> undef, float %f, i32 0			; CHECK-NEXT: [[V33:%.*]] = insertelement <4 x float> poison, float [[F]], i32 0
	; CHECK-NEXT: [[TMP3:%.*]] = tail call i64 @llvm.x86.sse.cvttss2si64(<4 x float> [[V30]])			; CHECK-NEXT: [[TMP3:%.*]] = tail call i64 @llvm.x86.sse.cvttss2si64(<4 x float> [[V33]])
	; CHECK-NEXT: [[V40:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V41:%.]] = insertelement <2 x double> poison, double [[D:%.]], i32 0
	; CHECK-NEXT: [[TMP4:%.*]] = tail call i32 @llvm.x86.sse2.cvtsd2si(<2 x double> [[V40]])			; CHECK-NEXT: [[TMP4:%.*]] = tail call i32 @llvm.x86.sse2.cvtsd2si(<2 x double> [[V41]])
	; CHECK-NEXT: [[V50:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V51:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
	; CHECK-NEXT: [[TMP5:%.*]] = tail call i64 @llvm.x86.sse2.cvtsd2si64(<2 x double> [[V50]])			; CHECK-NEXT: [[TMP5:%.*]] = tail call i64 @llvm.x86.sse2.cvtsd2si64(<2 x double> [[V51]])
	; CHECK-NEXT: [[V60:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V61:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
	; CHECK-NEXT: [[TMP6:%.*]] = tail call i32 @llvm.x86.sse2.cvttsd2si(<2 x double> [[V60]])			; CHECK-NEXT: [[TMP6:%.*]] = tail call i32 @llvm.x86.sse2.cvttsd2si(<2 x double> [[V61]])
	; CHECK-NEXT: [[V70:%.*]] = insertelement <2 x double> undef, double %d, i32 0			; CHECK-NEXT: [[V71:%.*]] = insertelement <2 x double> poison, double [[D]], i32 0
	; CHECK-NEXT: [[TMP7:%.*]] = tail call i64 @llvm.x86.sse2.cvttsd2si64(<2 x double> [[V70]])			; CHECK-NEXT: [[TMP7:%.*]] = tail call i64 @llvm.x86.sse2.cvttsd2si64(<2 x double> [[V71]])
	; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[TMP0]], [[TMP2]]			; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[TMP0]], [[TMP2]]
	; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[TMP4]], [[TMP6]]			; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[TMP4]], [[TMP6]]
	; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[TMP8]], [[TMP9]]			; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[TMP8]], [[TMP9]]
	; CHECK-NEXT: [[TMP11:%.*]] = sext i32 [[TMP10]] to i64			; CHECK-NEXT: [[TMP11:%.*]] = sext i32 [[TMP10]] to i64
	; CHECK-NEXT: [[TMP12:%.*]] = add i64 [[TMP1]], [[TMP3]]			; CHECK-NEXT: [[TMP12:%.*]] = add i64 [[TMP1]], [[TMP3]]
	; CHECK-NEXT: [[TMP13:%.*]] = add i64 [[TMP5]], [[TMP7]]			; CHECK-NEXT: [[TMP13:%.*]] = add i64 [[TMP5]], [[TMP7]]
	; CHECK-NEXT: [[TMP14:%.*]] = add i64 [[TMP12]], [[TMP13]]			; CHECK-NEXT: [[TMP14:%.*]] = add i64 [[TMP12]], [[TMP13]]
	; CHECK-NEXT: [[TMP15:%.*]] = add i64 [[TMP14]], [[TMP11]]			; CHECK-NEXT: [[TMP15:%.*]] = add i64 [[TMP14]], [[TMP11]]
	▲ Show 20 Lines • Show All 57 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-vector-shifts-inseltpoison.ll

	Show First 20 Lines • Show All 2,674 Lines • ▼ Show 20 Lines
	}			}

	;			;
	; Vector Masked Shift Amounts			; Vector Masked Shift Amounts
	;			;

	define <8 x i16> @sse2_psra_w_128_masked(<8 x i16> %v, <8 x i16> %a) {			define <8 x i16> @sse2_psra_w_128_masked(<8 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @sse2_psra_w_128_masked(			; CHECK-LABEL: @sse2_psra_w_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i16> [[TMP3]]			; CHECK-NEXT: ret <8 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <8 x i16> @llvm.x86.sse2.psra.w(<8 x i16> %v, <8 x i16> %1)			%2 = tail call <8 x i16> @llvm.x86.sse2.psra.w(<8 x i16> %v, <8 x i16> %1)
	ret <8 x i16> %2			ret <8 x i16> %2
	}			}

	define <8 x i32> @avx2_psra_d_256_masked(<8 x i32> %v, <4 x i32> %a) {			define <8 x i32> @avx2_psra_d_256_masked(<8 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx2_psra_d_256_masked(			; CHECK-LABEL: @avx2_psra_d_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i32> [[TMP3]]			; CHECK-NEXT: ret <8 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <8 x i32> @llvm.x86.avx2.psra.d(<8 x i32> %v, <4 x i32> %1)			%2 = tail call <8 x i32> @llvm.x86.avx2.psra.d(<8 x i32> %v, <4 x i32> %1)
	ret <8 x i32> %2			ret <8 x i32> %2
	}			}

	define <8 x i64> @avx512_psra_q_512_masked(<8 x i64> %v, <2 x i64> %a) {			define <8 x i64> @avx512_psra_q_512_masked(<8 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @avx512_psra_q_512_masked(			; CHECK-LABEL: @avx512_psra_q_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i64> [[TMP3]]			; CHECK-NEXT: ret <8 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <8 x i64> @llvm.x86.avx512.psra.q.512(<8 x i64> %v, <2 x i64> %1)			%2 = tail call <8 x i64> @llvm.x86.avx512.psra.q.512(<8 x i64> %v, <2 x i64> %1)
	ret <8 x i64> %2			ret <8 x i64> %2
	}			}

	define <4 x i32> @sse2_psrl_d_128_masked(<4 x i32> %v, <4 x i32> %a) {			define <4 x i32> @sse2_psrl_d_128_masked(<4 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @sse2_psrl_d_128_masked(			; CHECK-LABEL: @sse2_psrl_d_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i32> [[TMP3]]			; CHECK-NEXT: ret <4 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <4 x i32> @llvm.x86.sse2.psrl.d(<4 x i32> %v, <4 x i32> %1)			%2 = tail call <4 x i32> @llvm.x86.sse2.psrl.d(<4 x i32> %v, <4 x i32> %1)
	ret <4 x i32> %2			ret <4 x i32> %2
	}			}

	define <4 x i64> @avx2_psrl_q_256_masked(<4 x i64> %v, <2 x i64> %a) {			define <4 x i64> @avx2_psrl_q_256_masked(<4 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @avx2_psrl_q_256_masked(			; CHECK-LABEL: @avx2_psrl_q_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <4 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <4 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i64> [[TMP3]]			; CHECK-NEXT: ret <4 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <4 x i64> @llvm.x86.avx2.psrl.q(<4 x i64> %v, <2 x i64> %1)			%2 = tail call <4 x i64> @llvm.x86.avx2.psrl.q(<4 x i64> %v, <2 x i64> %1)
	ret <4 x i64> %2			ret <4 x i64> %2
	}			}

	define <32 x i16> @avx512_psrl_w_512_masked(<32 x i16> %v, <8 x i16> %a) {			define <32 x i16> @avx512_psrl_w_512_masked(<32 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @avx512_psrl_w_512_masked(			; CHECK-LABEL: @avx512_psrl_w_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <32 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <32 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <32 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <32 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <32 x i16> [[TMP3]]			; CHECK-NEXT: ret <32 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <32 x i16> @llvm.x86.avx512.psrl.w.512(<32 x i16> %v, <8 x i16> %1)			%2 = tail call <32 x i16> @llvm.x86.avx512.psrl.w.512(<32 x i16> %v, <8 x i16> %1)
	ret <32 x i16> %2			ret <32 x i16> %2
	}			}

	define <2 x i64> @sse2_psll_q_128_masked(<2 x i64> %v, <2 x i64> %a) {			define <2 x i64> @sse2_psll_q_128_masked(<2 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @sse2_psll_q_128_masked(			; CHECK-LABEL: @sse2_psll_q_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <2 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <2 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <2 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <2 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <2 x i64> @llvm.x86.sse2.psll.q(<2 x i64> %v, <2 x i64> %1)			%2 = tail call <2 x i64> @llvm.x86.sse2.psll.q(<2 x i64> %v, <2 x i64> %1)
	ret <2 x i64> %2			ret <2 x i64> %2
	}			}

	define <16 x i16> @avx2_psll_w_256_masked(<16 x i16> %v, <8 x i16> %a) {			define <16 x i16> @avx2_psll_w_256_masked(<16 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @avx2_psll_w_256_masked(			; CHECK-LABEL: @avx2_psll_w_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <16 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <16 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <16 x i16> [[TMP3]]			; CHECK-NEXT: ret <16 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <16 x i16> @llvm.x86.avx2.psll.w(<16 x i16> %v, <8 x i16> %1)			%2 = tail call <16 x i16> @llvm.x86.avx2.psll.w(<16 x i16> %v, <8 x i16> %1)
	ret <16 x i16> %2			ret <16 x i16> %2
	}			}

	define <16 x i32> @avx512_psll_d_512_masked(<16 x i32> %v, <4 x i32> %a) {			define <16 x i32> @avx512_psll_d_512_masked(<16 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx512_psll_d_512_masked(			; CHECK-LABEL: @avx512_psll_d_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <16 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <16 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <16 x i32> [[TMP3]]			; CHECK-NEXT: ret <16 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <16 x i32> @llvm.x86.avx512.psll.d.512(<16 x i32> %v, <4 x i32> %1)			%2 = tail call <16 x i32> @llvm.x86.avx512.psll.d.512(<16 x i32> %v, <4 x i32> %1)
	ret <16 x i32> %2			ret <16 x i32> %2
	}			}
	▲ Show 20 Lines • Show All 129 Lines • ▼ Show 20 Lines
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 31, i32 31, i32 31>			%1 = and <4 x i32> %a, <i32 31, i32 31, i32 31, i32 31>
	%2 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %1)			%2 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %1)
	ret <4 x i32> %2			ret <4 x i32> %2
	}			}

	define <4 x i32> @avx2_psrav_d_128_masked_shuffle(<4 x i32> %v, <4 x i32> %a) {			define <4 x i32> @avx2_psrav_d_128_masked_shuffle(<4 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx2_psrav_d_128_masked_shuffle(			; CHECK-LABEL: @avx2_psrav_d_128_masked_shuffle(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 undef, i32 undef, i32 15, i32 31>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 poison, i32 poison, i32 15, i32 31>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>
	; CHECK-NEXT: [[TMP3:%.]] = ashr <4 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <4 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i32> [[TMP3]]			; CHECK-NEXT: ret <4 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 undef, i32 undef, i32 15, i32 31>			%1 = and <4 x i32> %a, <i32 undef, i32 undef, i32 15, i32 31>
	%2 = shufflevector <4 x i32> %1, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>			%2 = shufflevector <4 x i32> %1, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>
	%3 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %2)			%3 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %2)
	ret <4 x i32> %3			ret <4 x i32> %3
	▲ Show 20 Lines • Show All 850 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-vector-shifts.ll

	Show First 20 Lines • Show All 2,674 Lines • ▼ Show 20 Lines
	}			}

	;			;
	; Vector Masked Shift Amounts			; Vector Masked Shift Amounts
	;			;

	define <8 x i16> @sse2_psra_w_128_masked(<8 x i16> %v, <8 x i16> %a) {			define <8 x i16> @sse2_psra_w_128_masked(<8 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @sse2_psra_w_128_masked(			; CHECK-LABEL: @sse2_psra_w_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i16> [[TMP3]]			; CHECK-NEXT: ret <8 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <8 x i16> @llvm.x86.sse2.psra.w(<8 x i16> %v, <8 x i16> %1)			%2 = tail call <8 x i16> @llvm.x86.sse2.psra.w(<8 x i16> %v, <8 x i16> %1)
	ret <8 x i16> %2			ret <8 x i16> %2
	}			}

	define <8 x i32> @avx2_psra_d_256_masked(<8 x i32> %v, <4 x i32> %a) {			define <8 x i32> @avx2_psra_d_256_masked(<8 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx2_psra_d_256_masked(			; CHECK-LABEL: @avx2_psra_d_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i32> [[TMP3]]			; CHECK-NEXT: ret <8 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <8 x i32> @llvm.x86.avx2.psra.d(<8 x i32> %v, <4 x i32> %1)			%2 = tail call <8 x i32> @llvm.x86.avx2.psra.d(<8 x i32> %v, <4 x i32> %1)
	ret <8 x i32> %2			ret <8 x i32> %2
	}			}

	define <8 x i64> @avx512_psra_q_512_masked(<8 x i64> %v, <2 x i64> %a) {			define <8 x i64> @avx512_psra_q_512_masked(<8 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @avx512_psra_q_512_masked(			; CHECK-LABEL: @avx512_psra_q_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <8 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <8 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <8 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <8 x i64> [[TMP3]]			; CHECK-NEXT: ret <8 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <8 x i64> @llvm.x86.avx512.psra.q.512(<8 x i64> %v, <2 x i64> %1)			%2 = tail call <8 x i64> @llvm.x86.avx512.psra.q.512(<8 x i64> %v, <2 x i64> %1)
	ret <8 x i64> %2			ret <8 x i64> %2
	}			}

	define <4 x i32> @sse2_psrl_d_128_masked(<4 x i32> %v, <4 x i32> %a) {			define <4 x i32> @sse2_psrl_d_128_masked(<4 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @sse2_psrl_d_128_masked(			; CHECK-LABEL: @sse2_psrl_d_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i32> [[TMP3]]			; CHECK-NEXT: ret <4 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <4 x i32> @llvm.x86.sse2.psrl.d(<4 x i32> %v, <4 x i32> %1)			%2 = tail call <4 x i32> @llvm.x86.sse2.psrl.d(<4 x i32> %v, <4 x i32> %1)
	ret <4 x i32> %2			ret <4 x i32> %2
	}			}

	define <4 x i64> @avx2_psrl_q_256_masked(<4 x i64> %v, <2 x i64> %a) {			define <4 x i64> @avx2_psrl_q_256_masked(<4 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @avx2_psrl_q_256_masked(			; CHECK-LABEL: @avx2_psrl_q_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <4 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <4 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <4 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i64> [[TMP3]]			; CHECK-NEXT: ret <4 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <4 x i64> @llvm.x86.avx2.psrl.q(<4 x i64> %v, <2 x i64> %1)			%2 = tail call <4 x i64> @llvm.x86.avx2.psrl.q(<4 x i64> %v, <2 x i64> %1)
	ret <4 x i64> %2			ret <4 x i64> %2
	}			}

	define <32 x i16> @avx512_psrl_w_512_masked(<32 x i16> %v, <8 x i16> %a) {			define <32 x i16> @avx512_psrl_w_512_masked(<32 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @avx512_psrl_w_512_masked(			; CHECK-LABEL: @avx512_psrl_w_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <32 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <32 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = lshr <32 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = lshr <32 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <32 x i16> [[TMP3]]			; CHECK-NEXT: ret <32 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <32 x i16> @llvm.x86.avx512.psrl.w.512(<32 x i16> %v, <8 x i16> %1)			%2 = tail call <32 x i16> @llvm.x86.avx512.psrl.w.512(<32 x i16> %v, <8 x i16> %1)
	ret <32 x i16> %2			ret <32 x i16> %2
	}			}

	define <2 x i64> @sse2_psll_q_128_masked(<2 x i64> %v, <2 x i64> %a) {			define <2 x i64> @sse2_psll_q_128_masked(<2 x i64> %v, <2 x i64> %a) {
	; CHECK-LABEL: @sse2_psll_q_128_masked(			; CHECK-LABEL: @sse2_psll_q_128_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[A:%.]], <i64 63, i64 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <2 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x i64> [[TMP1]], <2 x i64> undef, <2 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <2 x i64> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <2 x i64> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <2 x i64> [[TMP3]]			; CHECK-NEXT: ret <2 x i64> [[TMP3]]
	;			;
	%1 = and <2 x i64> %a, <i64 63, i64 undef>			%1 = and <2 x i64> %a, <i64 63, i64 undef>
	%2 = tail call <2 x i64> @llvm.x86.sse2.psll.q(<2 x i64> %v, <2 x i64> %1)			%2 = tail call <2 x i64> @llvm.x86.sse2.psll.q(<2 x i64> %v, <2 x i64> %1)
	ret <2 x i64> %2			ret <2 x i64> %2
	}			}

	define <16 x i16> @avx2_psll_w_256_masked(<16 x i16> %v, <8 x i16> %a) {			define <16 x i16> @avx2_psll_w_256_masked(<16 x i16> %v, <8 x i16> %a) {
	; CHECK-LABEL: @avx2_psll_w_256_masked(			; CHECK-LABEL: @avx2_psll_w_256_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef, i16 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <8 x i16> [[A:%.]], <i16 15, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison, i16 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <16 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x i16> [[TMP1]], <8 x i16> undef, <16 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i16> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i16> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <16 x i16> [[TMP3]]			; CHECK-NEXT: ret <16 x i16> [[TMP3]]
	;			;
	%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>			%1 = and <8 x i16> %a, <i16 15, i16 0, i16 0, i16 0, i16 undef, i16 undef, i16 undef, i16 undef>
	%2 = tail call <16 x i16> @llvm.x86.avx2.psll.w(<16 x i16> %v, <8 x i16> %1)			%2 = tail call <16 x i16> @llvm.x86.avx2.psll.w(<16 x i16> %v, <8 x i16> %1)
	ret <16 x i16> %2			ret <16 x i16> %2
	}			}

	define <16 x i32> @avx512_psll_d_512_masked(<16 x i32> %v, <4 x i32> %a) {			define <16 x i32> @avx512_psll_d_512_masked(<16 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx512_psll_d_512_masked(			; CHECK-LABEL: @avx512_psll_d_512_masked(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 undef, i32 undef, i32 undef>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 31, i32 poison, i32 poison, i32 poison>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <16 x i32> zeroinitializer			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <16 x i32> zeroinitializer
	; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = shl <16 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <16 x i32> [[TMP3]]			; CHECK-NEXT: ret <16 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>			%1 = and <4 x i32> %a, <i32 31, i32 0, i32 undef, i32 undef>
	%2 = tail call <16 x i32> @llvm.x86.avx512.psll.d.512(<16 x i32> %v, <4 x i32> %1)			%2 = tail call <16 x i32> @llvm.x86.avx512.psll.d.512(<16 x i32> %v, <4 x i32> %1)
	ret <16 x i32> %2			ret <16 x i32> %2
	}			}
	▲ Show 20 Lines • Show All 129 Lines • ▼ Show 20 Lines
	;			;
	%1 = and <4 x i32> %a, <i32 31, i32 31, i32 31, i32 31>			%1 = and <4 x i32> %a, <i32 31, i32 31, i32 31, i32 31>
	%2 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %1)			%2 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %1)
	ret <4 x i32> %2			ret <4 x i32> %2
	}			}

	define <4 x i32> @avx2_psrav_d_128_masked_shuffle(<4 x i32> %v, <4 x i32> %a) {			define <4 x i32> @avx2_psrav_d_128_masked_shuffle(<4 x i32> %v, <4 x i32> %a) {
	; CHECK-LABEL: @avx2_psrav_d_128_masked_shuffle(			; CHECK-LABEL: @avx2_psrav_d_128_masked_shuffle(
	; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 undef, i32 undef, i32 15, i32 31>			; CHECK-NEXT: [[TMP1:%.]] = and <4 x i32> [[A:%.]], <i32 poison, i32 poison, i32 15, i32 31>
	; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>			; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>
	; CHECK-NEXT: [[TMP3:%.]] = ashr <4 x i32> [[V:%.]], [[TMP2]]			; CHECK-NEXT: [[TMP3:%.]] = ashr <4 x i32> [[V:%.]], [[TMP2]]
	; CHECK-NEXT: ret <4 x i32> [[TMP3]]			; CHECK-NEXT: ret <4 x i32> [[TMP3]]
	;			;
	%1 = and <4 x i32> %a, <i32 undef, i32 undef, i32 15, i32 31>			%1 = and <4 x i32> %a, <i32 undef, i32 undef, i32 15, i32 31>
	%2 = shufflevector <4 x i32> %1, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>			%2 = shufflevector <4 x i32> %1, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3>
	%3 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %2)			%3 = tail call <4 x i32> @llvm.x86.avx2.psrav.d(<4 x i32> %v, <4 x i32> %2)
	ret <4 x i32> %3			ret <4 x i32> %3
	▲ Show 20 Lines • Show All 850 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-vpermil.ll

Show First 20 Lines • Show All 275 Lines • ▼ Show 20 Lines	;
%1 = shufflevector <4 x i64> <i64 0, i64 2, i64 0, i64 2>, <4 x i64> %a1, <4 x i32> <i32 1, i32 2, i32 3, i32 4>		%1 = shufflevector <4 x i64> <i64 0, i64 2, i64 0, i64 2>, <4 x i64> %a1, <4 x i32> <i32 1, i32 2, i32 3, i32 4>
%2 = tail call <4 x double> @llvm.x86.avx.vpermilvar.pd.256(<4 x double> %a0, <4 x i64> %1)		%2 = tail call <4 x double> @llvm.x86.avx.vpermilvar.pd.256(<4 x double> %a0, <4 x i64> %1)
%3 = shufflevector <4 x double> %2, <4 x double> undef, <4 x i32> <i32 0, i32 1, i32 2, i32 undef>		%3 = shufflevector <4 x double> %2, <4 x double> undef, <4 x i32> <i32 0, i32 1, i32 2, i32 undef>
ret <4 x double> %3		ret <4 x double> %3
}		}

define <8 x double> @elts_test_vpermilvar_pd_512(<8 x double> %a0, <8 x i64> %a1, i64 %a2) {		define <8 x double> @elts_test_vpermilvar_pd_512(<8 x double> %a0, <8 x i64> %a1, i64 %a2) {
; CHECK-LABEL: @elts_test_vpermilvar_pd_512(		; CHECK-LABEL: @elts_test_vpermilvar_pd_512(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <8 x i64> undef, i64 [[A2:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <8 x i64> poison, i64 [[A2:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = tail call <8 x double> @llvm.x86.avx512.vpermilvar.pd.512(<8 x double> [[A0:%.]], <8 x i64> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.]] = tail call <8 x double> @llvm.x86.avx512.vpermilvar.pd.512(<8 x double> [[A0:%.]], <8 x i64> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <8 x double> [[TMP2]], <8 x double> undef, <8 x i32> zeroinitializer		; CHECK-NEXT: [[TMP3:%.*]] = shufflevector <8 x double> [[TMP2]], <8 x double> undef, <8 x i32> zeroinitializer
; CHECK-NEXT: ret <8 x double> [[TMP3]]		; CHECK-NEXT: ret <8 x double> [[TMP3]]
;		;
%1 = insertelement <8 x i64> %a1, i64 %a2, i32 0		%1 = insertelement <8 x i64> %a1, i64 %a2, i32 0
%2 = tail call <8 x double> @llvm.x86.avx512.vpermilvar.pd.512(<8 x double> %a0, <8 x i64> %1)		%2 = tail call <8 x double> @llvm.x86.avx512.vpermilvar.pd.512(<8 x double> %a0, <8 x i64> %1)
%3 = shufflevector <8 x double> %2, <8 x double> undef, <8 x i32> zeroinitializer		%3 = shufflevector <8 x double> %2, <8 x double> undef, <8 x i32> zeroinitializer
ret <8 x double> %3		ret <8 x double> %3
Show All 9 Lines

llvm/test/Transforms/InstCombine/X86/x86-xop-inseltpoison.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s		; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s

define <2 x double> @test_vfrcz_sd(<2 x double> %a) {		define <2 x double> @test_vfrcz_sd(<2 x double> %a) {
; CHECK-LABEL: @test_vfrcz_sd(		; CHECK-LABEL: @test_vfrcz_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[A:%.]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[A:%.]])
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_vfrcz_sd_0(double %a) {		define double @test_vfrcz_sd_0(double %a) {
; CHECK-LABEL: @test_vfrcz_sd_0(		; CHECK-LABEL: @test_vfrcz_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> poison, double %a, i32 0		%1 = insertelement <2 x double> poison, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %2)		%3 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %2)
%4 = extractelement <2 x double> %3, i32 0		%4 = extractelement <2 x double> %3, i32 0
Show All 20 Lines	;
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_vfrcz_ss_0(float %a) {		define float @test_vfrcz_ss_0(float %a) {
; CHECK-LABEL: @test_vfrcz_ss_0(		; CHECK-LABEL: @test_vfrcz_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> poison, float %a, i32 0		%1 = insertelement <4 x float> poison, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
▲ Show 20 Lines • Show All 244 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/X86/x86-xop.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s		; RUN: opt < %s -instcombine -mtriple=x86_64-unknown-unknown -S \| FileCheck %s

define <2 x double> @test_vfrcz_sd(<2 x double> %a) {		define <2 x double> @test_vfrcz_sd(<2 x double> %a) {
; CHECK-LABEL: @test_vfrcz_sd(		; CHECK-LABEL: @test_vfrcz_sd(
; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[A:%.]])		; CHECK-NEXT: [[TMP1:%.]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[A:%.]])
; CHECK-NEXT: ret <2 x double> [[TMP1]]		; CHECK-NEXT: ret <2 x double> [[TMP1]]
;		;
%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1		%1 = insertelement <2 x double> %a, double 1.000000e+00, i32 1
%2 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %1)		%2 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %1)
ret <2 x double> %2		ret <2 x double> %2
}		}

define double @test_vfrcz_sd_0(double %a) {		define double @test_vfrcz_sd_0(double %a) {
; CHECK-LABEL: @test_vfrcz_sd_0(		; CHECK-LABEL: @test_vfrcz_sd_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> undef, double [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> poison, double [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <2 x double> [[TMP2]], i32 0
; CHECK-NEXT: ret double [[TMP3]]		; CHECK-NEXT: ret double [[TMP3]]
;		;
%1 = insertelement <2 x double> undef, double %a, i32 0		%1 = insertelement <2 x double> undef, double %a, i32 0
%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1		%2 = insertelement <2 x double> %1, double 1.000000e+00, i32 1
%3 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %2)		%3 = tail call <2 x double> @llvm.x86.xop.vfrcz.sd(<2 x double> %2)
%4 = extractelement <2 x double> %3, i32 0		%4 = extractelement <2 x double> %3, i32 0
Show All 20 Lines	;
%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2		%2 = insertelement <4 x float> %1, float 2.000000e+00, i32 2
%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3		%3 = insertelement <4 x float> %2, float 3.000000e+00, i32 3
%4 = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> %3)		%4 = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> %3)
ret <4 x float> %4		ret <4 x float> %4
}		}

define float @test_vfrcz_ss_0(float %a) {		define float @test_vfrcz_ss_0(float %a) {
; CHECK-LABEL: @test_vfrcz_ss_0(		; CHECK-LABEL: @test_vfrcz_ss_0(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> [[TMP1]])		; CHECK-NEXT: [[TMP2:%.*]] = tail call <4 x float> @llvm.x86.xop.vfrcz.ss(<4 x float> [[TMP1]])
; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0		; CHECK-NEXT: [[TMP3:%.*]] = extractelement <4 x float> [[TMP2]], i32 0
; CHECK-NEXT: ret float [[TMP3]]		; CHECK-NEXT: ret float [[TMP3]]
;		;
%1 = insertelement <4 x float> undef, float %a, i32 0		%1 = insertelement <4 x float> undef, float %a, i32 0
%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1		%2 = insertelement <4 x float> %1, float 1.000000e+00, i32 1
%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2		%3 = insertelement <4 x float> %2, float 2.000000e+00, i32 2
%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3		%4 = insertelement <4 x float> %3, float 3.000000e+00, i32 3
▲ Show 20 Lines • Show All 244 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/bitcast-bigendian.ll

Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	;
%tmp4 = bitcast i32 %tmp2 to float		%tmp4 = bitcast i32 %tmp2 to float

%add = fadd float %tmp24, %tmp4		%add = fadd float %tmp24, %tmp4
ret float %add		ret float %add
}		}

define <2 x i32> @test4(i32 %A, i32 %B){		define <2 x i32> @test4(i32 %A, i32 %B){
; CHECK-LABEL: @test4(		; CHECK-LABEL: @test4(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> undef, i32 [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> poison, i32 [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[A:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[A:%.]], i32 1
; CHECK-NEXT: ret <2 x i32> [[TMP2]]		; CHECK-NEXT: ret <2 x i32> [[TMP2]]
;		;
%tmp38 = zext i32 %A to i64		%tmp38 = zext i32 %A to i64
%tmp32 = zext i32 %B to i64		%tmp32 = zext i32 %B to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x i32>		%tmp43 = bitcast i64 %ins35 to <2 x i32>
ret <2 x i32> %tmp43		ret <2 x i32> %tmp43
}		}

define <2 x float> @test5(float %A, float %B) {		define <2 x float> @test5(float %A, float %B) {
; CHECK-LABEL: @test5(		; CHECK-LABEL: @test5(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> undef, float [[B:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> poison, float [[B:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[A:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[A:%.]], i32 1
; CHECK-NEXT: ret <2 x float> [[TMP2]]		; CHECK-NEXT: ret <2 x float> [[TMP2]]
;		;
%tmp37 = bitcast float %A to i32		%tmp37 = bitcast float %A to i32
%tmp38 = zext i32 %tmp37 to i64		%tmp38 = zext i32 %tmp37 to i64
%tmp31 = bitcast float %B to i32		%tmp31 = bitcast float %B to i32
%tmp32 = zext i32 %tmp31 to i64		%tmp32 = zext i32 %tmp31 to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x float>		%tmp43 = bitcast i64 %ins35 to <2 x float>
ret <2 x float> %tmp43		ret <2 x float> %tmp43
}		}

define <2 x float> @test6(float %A){		define <2 x float> @test6(float %A){
; CHECK-LABEL: @test6(		; CHECK-LABEL: @test6(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float undef, float 4.200000e+01>, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float poison, float 4.200000e+01>, float [[A:%.]], i32 0
; CHECK-NEXT: ret <2 x float> [[TMP1]]		; CHECK-NEXT: ret <2 x float> [[TMP1]]
;		;
%tmp23 = bitcast float %A to i32		%tmp23 = bitcast float %A to i32
%tmp24 = zext i32 %tmp23 to i64		%tmp24 = zext i32 %tmp23 to i64
%tmp25 = shl i64 %tmp24, 32		%tmp25 = shl i64 %tmp24, 32
%mask20 = or i64 %tmp25, 1109917696		%mask20 = or i64 %tmp25, 1109917696
%tmp35 = bitcast i64 %mask20 to <2 x float>		%tmp35 = bitcast i64 %mask20 to <2 x float>
ret <2 x float> %tmp35		ret <2 x float> %tmp35
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/bitcast-inseltpoison.ll

Show First 20 Lines • Show All 383 Lines • ▼ Show 20 Lines	;
%bc1 = bitcast i128 %A to <2 x i64>		%bc1 = bitcast i128 %A to <2 x i64>
%ext = extractelement <2 x i64> %bc1, i32 0		%ext = extractelement <2 x i64> %bc1, i32 0
%bc2 = bitcast i64 %ext to double		%bc2 = bitcast i64 %ext to double
ret double %bc2		ret double %bc2
}		}

define <2 x i32> @test4(i32 %A, i32 %B){		define <2 x i32> @test4(i32 %A, i32 %B){
; CHECK-LABEL: @test4(		; CHECK-LABEL: @test4(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> undef, i32 [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> poison, i32 [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[B:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[B:%.]], i32 1
; CHECK-NEXT: ret <2 x i32> [[TMP2]]		; CHECK-NEXT: ret <2 x i32> [[TMP2]]
;		;
%tmp38 = zext i32 %A to i64		%tmp38 = zext i32 %A to i64
%tmp32 = zext i32 %B to i64		%tmp32 = zext i32 %B to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x i32>		%tmp43 = bitcast i64 %ins35 to <2 x i32>
ret <2 x i32> %tmp43		ret <2 x i32> %tmp43
}		}

; rdar://8360454		; rdar://8360454
define <2 x float> @test5(float %A, float %B) {		define <2 x float> @test5(float %A, float %B) {
; CHECK-LABEL: @test5(		; CHECK-LABEL: @test5(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[B:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[B:%.]], i32 1
; CHECK-NEXT: ret <2 x float> [[TMP2]]		; CHECK-NEXT: ret <2 x float> [[TMP2]]
;		;
%tmp37 = bitcast float %A to i32		%tmp37 = bitcast float %A to i32
%tmp38 = zext i32 %tmp37 to i64		%tmp38 = zext i32 %tmp37 to i64
%tmp31 = bitcast float %B to i32		%tmp31 = bitcast float %B to i32
%tmp32 = zext i32 %tmp31 to i64		%tmp32 = zext i32 %tmp31 to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x float>		%tmp43 = bitcast i64 %ins35 to <2 x float>
ret <2 x float> %tmp43		ret <2 x float> %tmp43
}		}

define <2 x float> @test6(float %A){		define <2 x float> @test6(float %A){
; CHECK-LABEL: @test6(		; CHECK-LABEL: @test6(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float 4.200000e+01, float undef>, float [[A:%.]], i32 1		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float 4.200000e+01, float poison>, float [[A:%.]], i32 1
; CHECK-NEXT: ret <2 x float> [[TMP1]]		; CHECK-NEXT: ret <2 x float> [[TMP1]]
;		;
%tmp23 = bitcast float %A to i32		%tmp23 = bitcast float %A to i32
%tmp24 = zext i32 %tmp23 to i64		%tmp24 = zext i32 %tmp23 to i64
%tmp25 = shl i64 %tmp24, 32		%tmp25 = shl i64 %tmp24, 32
%mask20 = or i64 %tmp25, 1109917696		%mask20 = or i64 %tmp25, 1109917696
%tmp35 = bitcast i64 %mask20 to <2 x float>		%tmp35 = bitcast i64 %mask20 to <2 x float>
ret <2 x float> %tmp35		ret <2 x float> %tmp35
▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/bitcast-vec-canon-inseltpoison.ll

	Show All 25 Lines
	; CHECK-NEXT: ret <1 x i64> [[C]]			; CHECK-NEXT: ret <1 x i64> [[C]]
	;			;
	%c = bitcast double %y to <1 x i64>			%c = bitcast double %y to <1 x i64>
	ret <1 x i64> %c			ret <1 x i64> %c
	}			}

	define <1 x i64> @d(i64 %y) {			define <1 x i64> @d(i64 %y) {
	; CHECK-LABEL: @d(			; CHECK-LABEL: @d(
	; CHECK-NEXT: [[TMP1:%.]] = insertelement <1 x i64> undef, i64 [[Y:%.]], i32 0			; CHECK-NEXT: [[TMP1:%.]] = insertelement <1 x i64> poison, i64 [[Y:%.]], i32 0
	; CHECK-NEXT: ret <1 x i64> [[TMP1]]			; CHECK-NEXT: ret <1 x i64> [[TMP1]]
	;			;
	%c = bitcast i64 %y to <1 x i64>			%c = bitcast i64 %y to <1 x i64>
	ret <1 x i64> %c			ret <1 x i64> %c
	}			}

	define x86_mmx @e(<1 x i64> %y) {			define x86_mmx @e(<1 x i64> %y) {
	; CHECK-LABEL: @e(			; CHECK-LABEL: @e(
	▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/bitcast-vec-canon.ll

	Show All 25 Lines
	; CHECK-NEXT: ret <1 x i64> [[C]]			; CHECK-NEXT: ret <1 x i64> [[C]]
	;			;
	%c = bitcast double %y to <1 x i64>			%c = bitcast double %y to <1 x i64>
	ret <1 x i64> %c			ret <1 x i64> %c
	}			}

	define <1 x i64> @d(i64 %y) {			define <1 x i64> @d(i64 %y) {
	; CHECK-LABEL: @d(			; CHECK-LABEL: @d(
	; CHECK-NEXT: [[C:%.]] = insertelement <1 x i64> undef, i64 [[Y:%.]], i32 0			; CHECK-NEXT: [[TMP1:%.]] = insertelement <1 x i64> poison, i64 [[Y:%.]], i32 0
	; CHECK-NEXT: ret <1 x i64> [[C]]			; CHECK-NEXT: ret <1 x i64> [[TMP1]]
	;			;
	%c = bitcast i64 %y to <1 x i64>			%c = bitcast i64 %y to <1 x i64>
	ret <1 x i64> %c			ret <1 x i64> %c
	}			}

	define x86_mmx @e(<1 x i64> %y) {			define x86_mmx @e(<1 x i64> %y) {
	; CHECK-LABEL: @e(			; CHECK-LABEL: @e(
	; CHECK-NEXT: [[TMP1:%.]] = extractelement <1 x i64> [[Y:%.]], i32 0			; CHECK-NEXT: [[TMP1:%.]] = extractelement <1 x i64> [[Y:%.]], i32 0
	▲ Show 20 Lines • Show All 124 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/bitcast.ll

Show First 20 Lines • Show All 383 Lines • ▼ Show 20 Lines	;
%bc1 = bitcast i128 %A to <2 x i64>		%bc1 = bitcast i128 %A to <2 x i64>
%ext = extractelement <2 x i64> %bc1, i32 0		%ext = extractelement <2 x i64> %bc1, i32 0
%bc2 = bitcast i64 %ext to double		%bc2 = bitcast i64 %ext to double
ret double %bc2		ret double %bc2
}		}

define <2 x i32> @test4(i32 %A, i32 %B){		define <2 x i32> @test4(i32 %A, i32 %B){
; CHECK-LABEL: @test4(		; CHECK-LABEL: @test4(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> undef, i32 [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x i32> poison, i32 [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[B:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x i32> [[TMP1]], i32 [[B:%.]], i32 1
; CHECK-NEXT: ret <2 x i32> [[TMP2]]		; CHECK-NEXT: ret <2 x i32> [[TMP2]]
;		;
%tmp38 = zext i32 %A to i64		%tmp38 = zext i32 %A to i64
%tmp32 = zext i32 %B to i64		%tmp32 = zext i32 %B to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x i32>		%tmp43 = bitcast i64 %ins35 to <2 x i32>
ret <2 x i32> %tmp43		ret <2 x i32> %tmp43
}		}

; rdar://8360454		; rdar://8360454
define <2 x float> @test5(float %A, float %B) {		define <2 x float> @test5(float %A, float %B) {
; CHECK-LABEL: @test5(		; CHECK-LABEL: @test5(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> undef, float [[A:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> poison, float [[A:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[B:%.]], i32 1		; CHECK-NEXT: [[TMP2:%.]] = insertelement <2 x float> [[TMP1]], float [[B:%.]], i32 1
; CHECK-NEXT: ret <2 x float> [[TMP2]]		; CHECK-NEXT: ret <2 x float> [[TMP2]]
;		;
%tmp37 = bitcast float %A to i32		%tmp37 = bitcast float %A to i32
%tmp38 = zext i32 %tmp37 to i64		%tmp38 = zext i32 %tmp37 to i64
%tmp31 = bitcast float %B to i32		%tmp31 = bitcast float %B to i32
%tmp32 = zext i32 %tmp31 to i64		%tmp32 = zext i32 %tmp31 to i64
%tmp33 = shl i64 %tmp32, 32		%tmp33 = shl i64 %tmp32, 32
%ins35 = or i64 %tmp33, %tmp38		%ins35 = or i64 %tmp33, %tmp38
%tmp43 = bitcast i64 %ins35 to <2 x float>		%tmp43 = bitcast i64 %ins35 to <2 x float>
ret <2 x float> %tmp43		ret <2 x float> %tmp43
}		}

define <2 x float> @test6(float %A){		define <2 x float> @test6(float %A){
; CHECK-LABEL: @test6(		; CHECK-LABEL: @test6(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float 4.200000e+01, float undef>, float [[A:%.]], i32 1		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float 4.200000e+01, float poison>, float [[A:%.]], i32 1
; CHECK-NEXT: ret <2 x float> [[TMP1]]		; CHECK-NEXT: ret <2 x float> [[TMP1]]
;		;
%tmp23 = bitcast float %A to i32		%tmp23 = bitcast float %A to i32
%tmp24 = zext i32 %tmp23 to i64		%tmp24 = zext i32 %tmp23 to i64
%tmp25 = shl i64 %tmp24, 32		%tmp25 = shl i64 %tmp24, 32
%mask20 = or i64 %tmp25, 1109917696		%mask20 = or i64 %tmp25, 1109917696
%tmp35 = bitcast i64 %mask20 to <2 x float>		%tmp35 = bitcast i64 %mask20 to <2 x float>
ret <2 x float> %tmp35		ret <2 x float> %tmp35
▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/broadcast-inseltpoison.ll

Show All 23 Lines	;
%t4 = insertelement <4 x float> %t, float %arg, i32 2		%t4 = insertelement <4 x float> %t, float %arg, i32 2
%t5 = insertelement <4 x float> %t4, float %arg, i32 0		%t5 = insertelement <4 x float> %t4, float %arg, i32 0
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
ret <4 x float> %t6		ret <4 x float> %t6
}		}

define <4 x float> @good3(float %arg) {		define <4 x float> @good3(float %arg) {
; CHECK-LABEL: @good3(		; CHECK-LABEL: @good3(
; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> undef, float [[ARG:%.]], i32 0		; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> poison, float [[ARG:%.]], i32 0
; CHECK-NEXT: [[T6:%.*]] = shufflevector <4 x float> [[T]], <4 x float> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[T6:%.*]] = shufflevector <4 x float> [[T]], <4 x float> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x float> [[T6]]		; CHECK-NEXT: ret <4 x float> [[T6]]
;		;
%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0		%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0
%t4 = insertelement <4 x float> %t, float %arg, i32 1		%t4 = insertelement <4 x float> %t, float %arg, i32 1
%t5 = insertelement <4 x float> %t4, float %arg, i32 2		%t5 = insertelement <4 x float> %t4, float %arg, i32 2
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
ret <4 x float> %t6		ret <4 x float> %t6
}		}

define <4 x float> @good4(float %arg) {		define <4 x float> @good4(float %arg) {
; CHECK-LABEL: @good4(		; CHECK-LABEL: @good4(
; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> undef, float [[ARG:%.]], i32 0		; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> poison, float [[ARG:%.]], i32 0
; CHECK-NEXT: [[TMP1:%.*]] = fadd <4 x float> [[T]], [[T]]		; CHECK-NEXT: [[TMP1:%.*]] = fadd <4 x float> [[T]], [[T]]
; CHECK-NEXT: [[T7:%.*]] = shufflevector <4 x float> [[TMP1]], <4 x float> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[T7:%.*]] = shufflevector <4 x float> [[TMP1]], <4 x float> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x float> [[T7]]		; CHECK-NEXT: ret <4 x float> [[T7]]
;		;
%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0		%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0
%t4 = insertelement <4 x float> %t, float %arg, i32 1		%t4 = insertelement <4 x float> %t, float %arg, i32 1
%t5 = insertelement <4 x float> %t4, float %arg, i32 2		%t5 = insertelement <4 x float> %t4, float %arg, i32 2
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
▲ Show 20 Lines • Show All 126 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/broadcast.ll

Show All 23 Lines	;
%t4 = insertelement <4 x float> %t, float %arg, i32 2		%t4 = insertelement <4 x float> %t, float %arg, i32 2
%t5 = insertelement <4 x float> %t4, float %arg, i32 0		%t5 = insertelement <4 x float> %t4, float %arg, i32 0
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
ret <4 x float> %t6		ret <4 x float> %t6
}		}

define <4 x float> @good3(float %arg) {		define <4 x float> @good3(float %arg) {
; CHECK-LABEL: @good3(		; CHECK-LABEL: @good3(
; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> undef, float [[ARG:%.]], i32 0		; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> poison, float [[ARG:%.]], i32 0
; CHECK-NEXT: [[T6:%.*]] = shufflevector <4 x float> [[T]], <4 x float> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[T6:%.*]] = shufflevector <4 x float> [[T]], <4 x float> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x float> [[T6]]		; CHECK-NEXT: ret <4 x float> [[T6]]
;		;
%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0		%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0
%t4 = insertelement <4 x float> %t, float %arg, i32 1		%t4 = insertelement <4 x float> %t, float %arg, i32 1
%t5 = insertelement <4 x float> %t4, float %arg, i32 2		%t5 = insertelement <4 x float> %t4, float %arg, i32 2
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
ret <4 x float> %t6		ret <4 x float> %t6
}		}

define <4 x float> @good4(float %arg) {		define <4 x float> @good4(float %arg) {
; CHECK-LABEL: @good4(		; CHECK-LABEL: @good4(
; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> undef, float [[ARG:%.]], i32 0		; CHECK-NEXT: [[T:%.]] = insertelement <4 x float> poison, float [[ARG:%.]], i32 0
; CHECK-NEXT: [[TMP1:%.*]] = fadd <4 x float> [[T]], [[T]]		; CHECK-NEXT: [[TMP1:%.*]] = fadd <4 x float> [[T]], [[T]]
; CHECK-NEXT: [[T7:%.*]] = shufflevector <4 x float> [[TMP1]], <4 x float> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[T7:%.*]] = shufflevector <4 x float> [[TMP1]], <4 x float> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x float> [[T7]]		; CHECK-NEXT: ret <4 x float> [[T7]]
;		;
%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0		%t = insertelement <4 x float> zeroinitializer, float %arg, i32 0
%t4 = insertelement <4 x float> %t, float %arg, i32 1		%t4 = insertelement <4 x float> %t, float %arg, i32 1
%t5 = insertelement <4 x float> %t4, float %arg, i32 2		%t5 = insertelement <4 x float> %t4, float %arg, i32 2
%t6 = insertelement <4 x float> %t5, float %arg, i32 3		%t6 = insertelement <4 x float> %t5, float %arg, i32 3
▲ Show 20 Lines • Show All 126 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/cast.ll

Show First 20 Lines • Show All 868 Lines • ▼ Show 20 Lines	;
%p9 = trunc i128 %p11 to i96		%p9 = trunc i128 %p11 to i96
%p10 = bitcast i96 %p9 to <3 x i32>		%p10 = bitcast i96 %p9 to <3 x i32>
ret <3 x i32> %p10		ret <3 x i32> %p10

}		}

define <4 x i32> @test61(<3 x i32> %call4) {		define <4 x i32> @test61(<3 x i32> %call4) {
; BE-LABEL: @test61(		; BE-LABEL: @test61(
; BE-NEXT: [[P10:%.]] = shufflevector <3 x i32> [[CALL4:%.]], <3 x i32> <i32 0, i32 undef, i32 undef>, <4 x i32> <i32 3, i32 0, i32 1, i32 2>		; BE-NEXT: [[P10:%.]] = shufflevector <3 x i32> [[CALL4:%.]], <3 x i32> <i32 0, i32 poison, i32 poison>, <4 x i32> <i32 3, i32 0, i32 1, i32 2>
; BE-NEXT: ret <4 x i32> [[P10]]		; BE-NEXT: ret <4 x i32> [[P10]]
;		;
; LE-LABEL: @test61(		; LE-LABEL: @test61(
; LE-NEXT: [[P10:%.]] = shufflevector <3 x i32> [[CALL4:%.]], <3 x i32> <i32 0, i32 undef, i32 undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 3>		; LE-NEXT: [[P10:%.]] = shufflevector <3 x i32> [[CALL4:%.]], <3 x i32> <i32 0, i32 poison, i32 poison>, <4 x i32> <i32 0, i32 1, i32 2, i32 3>
; LE-NEXT: ret <4 x i32> [[P10]]		; LE-NEXT: ret <4 x i32> [[P10]]
;		;
%p11 = bitcast <3 x i32> %call4 to i96		%p11 = bitcast <3 x i32> %call4 to i96
%p9 = zext i96 %p11 to i128		%p9 = zext i96 %p11 to i128
%p10 = bitcast i128 %p9 to <4 x i32>		%p10 = bitcast i128 %p9 to <4 x i32>
ret <4 x i32> %p10		ret <4 x i32> %p10
}		}

define <4 x i32> @test62(<3 x float> %call4) {		define <4 x i32> @test62(<3 x float> %call4) {
; BE-LABEL: @test62(		; BE-LABEL: @test62(
; BE-NEXT: [[TMP1:%.]] = bitcast <3 x float> [[CALL4:%.]] to <3 x i32>		; BE-NEXT: [[TMP1:%.]] = bitcast <3 x float> [[CALL4:%.]] to <3 x i32>
; BE-NEXT: [[P10:%.*]] = shufflevector <3 x i32> [[TMP1]], <3 x i32> <i32 0, i32 undef, i32 undef>, <4 x i32> <i32 3, i32 0, i32 1, i32 2>		; BE-NEXT: [[P10:%.*]] = shufflevector <3 x i32> [[TMP1]], <3 x i32> <i32 0, i32 poison, i32 poison>, <4 x i32> <i32 3, i32 0, i32 1, i32 2>
; BE-NEXT: ret <4 x i32> [[P10]]		; BE-NEXT: ret <4 x i32> [[P10]]
;		;
; LE-LABEL: @test62(		; LE-LABEL: @test62(
; LE-NEXT: [[TMP1:%.]] = bitcast <3 x float> [[CALL4:%.]] to <3 x i32>		; LE-NEXT: [[TMP1:%.]] = bitcast <3 x float> [[CALL4:%.]] to <3 x i32>
; LE-NEXT: [[P10:%.*]] = shufflevector <3 x i32> [[TMP1]], <3 x i32> <i32 0, i32 undef, i32 undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 3>		; LE-NEXT: [[P10:%.*]] = shufflevector <3 x i32> [[TMP1]], <3 x i32> <i32 0, i32 poison, i32 poison>, <4 x i32> <i32 0, i32 1, i32 2, i32 3>
; LE-NEXT: ret <4 x i32> [[P10]]		; LE-NEXT: ret <4 x i32> [[P10]]
;		;
%p11 = bitcast <3 x float> %call4 to i96		%p11 = bitcast <3 x float> %call4 to i96
%p9 = zext i96 %p11 to i128		%p9 = zext i96 %p11 to i128
%p10 = bitcast i128 %p9 to <4 x i32>		%p10 = bitcast i128 %p9 to <4 x i32>
ret <4 x i32> %p10		ret <4 x i32> %p10
}		}

▲ Show 20 Lines • Show All 1,272 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/getelementptr.ll

Show First 20 Lines • Show All 211 Lines • ▼ Show 20 Lines	;
%C = icmp eq <2 x i32*> %A, %B		%C = icmp eq <2 x i32*> %A, %B
ret <2 x i1> %C		ret <2 x i1> %C
}		}

define <2 x i1> @test13_vector2(i64 %X, <2 x %S*> %P) nounwind {		define <2 x i1> @test13_vector2(i64 %X, <2 x %S*> %P) nounwind {
; CHECK-LABEL: @test13_vector2(		; CHECK-LABEL: @test13_vector2(
; CHECK-NEXT: [[DOTSPLATINSERT:%.]] = insertelement <2 x i64> undef, i64 [[X:%.]], i32 0		; CHECK-NEXT: [[DOTSPLATINSERT:%.]] = insertelement <2 x i64> undef, i64 [[X:%.]], i32 0
; CHECK-NEXT: [[TMP1:%.*]] = shl <2 x i64> [[DOTSPLATINSERT]], <i64 2, i64 0>		; CHECK-NEXT: [[TMP1:%.*]] = shl <2 x i64> [[DOTSPLATINSERT]], <i64 2, i64 0>
; CHECK-NEXT: [[TMP2:%.*]] = icmp eq <2 x i64> [[TMP1]], <i64 -4, i64 undef>		; CHECK-NEXT: [[TMP2:%.*]] = icmp eq <2 x i64> [[TMP1]], <i64 -4, i64 poison>
; CHECK-NEXT: [[C:%.*]] = shufflevector <2 x i1> [[TMP2]], <2 x i1> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[C:%.*]] = shufflevector <2 x i1> [[TMP2]], <2 x i1> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i1> [[C]]		; CHECK-NEXT: ret <2 x i1> [[C]]
;		;
%A = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> zeroinitializer, <2 x i32> <i32 1, i32 1>, i64 %X		%A = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> zeroinitializer, <2 x i32> <i32 1, i32 1>, i64 %X
%B = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> <i64 0, i64 0>, <2 x i32> <i32 0, i32 0>		%B = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> <i64 0, i64 0>, <2 x i32> <i32 0, i32 0>
%C = icmp eq <2 x i32*> %A, %B		%C = icmp eq <2 x i32*> %A, %B
ret <2 x i1> %C		ret <2 x i1> %C
}		}

; This is a test of icmp + shl nuw in disguise - 4611... is 0x3fff...		; This is a test of icmp + shl nuw in disguise - 4611... is 0x3fff...
define <2 x i1> @test13_vector3(i64 %X, <2 x %S*> %P) nounwind {		define <2 x i1> @test13_vector3(i64 %X, <2 x %S*> %P) nounwind {
; CHECK-LABEL: @test13_vector3(		; CHECK-LABEL: @test13_vector3(
; CHECK-NEXT: [[DOTSPLATINSERT:%.]] = insertelement <2 x i64> undef, i64 [[X:%.]], i32 0		; CHECK-NEXT: [[DOTSPLATINSERT:%.]] = insertelement <2 x i64> undef, i64 [[X:%.]], i32 0
; CHECK-NEXT: [[TMP1:%.*]] = shl <2 x i64> [[DOTSPLATINSERT]], <i64 2, i64 0>		; CHECK-NEXT: [[TMP1:%.*]] = shl <2 x i64> [[DOTSPLATINSERT]], <i64 2, i64 0>
; CHECK-NEXT: [[TMP2:%.*]] = icmp eq <2 x i64> [[TMP1]], <i64 4, i64 undef>		; CHECK-NEXT: [[TMP2:%.*]] = icmp eq <2 x i64> [[TMP1]], <i64 4, i64 poison>
; CHECK-NEXT: [[C:%.*]] = shufflevector <2 x i1> [[TMP2]], <2 x i1> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[C:%.*]] = shufflevector <2 x i1> [[TMP2]], <2 x i1> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i1> [[C]]		; CHECK-NEXT: ret <2 x i1> [[C]]
;		;
%A = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> zeroinitializer, <2 x i32> <i32 1, i32 1>, i64 %X		%A = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> zeroinitializer, <2 x i32> <i32 1, i32 1>, i64 %X
%B = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> <i64 0, i64 0>, <2 x i32> <i32 1, i32 1>, i64 1		%B = getelementptr inbounds %S, <2 x %S*> %P, <2 x i64> <i64 0, i64 0>, <2 x i32> <i32 1, i32 1>, i64 1
%C = icmp eq <2 x i32*> %A, %B		%C = icmp eq <2 x i32*> %A, %B
ret <2 x i1> %C		ret <2 x i1> %C
}		}
▲ Show 20 Lines • Show All 1,033 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/insert-const-shuf.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -instcombine %s \| FileCheck %s			; RUN: opt -S -instcombine %s \| FileCheck %s

	; Eliminate the insertelement.			; Eliminate the insertelement.

	define <4 x float> @PR29126(<4 x float> %x) {			define <4 x float> @PR29126(<4 x float> %x) {
	; CHECK-LABEL: @PR29126(			; CHECK-LABEL: @PR29126(
	; CHECK-NEXT: [[INS:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float 4.200000e+01>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>			; CHECK-NEXT: [[INS:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float 4.200000e+01>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>
	; CHECK-NEXT: ret <4 x float> [[INS]]			; CHECK-NEXT: ret <4 x float> [[INS]]
	;			;
	%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>			%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
	%ins = insertelement <4 x float> %shuf, float 42.0, i32 3			%ins = insertelement <4 x float> %shuf, float 42.0, i32 3
	ret <4 x float> %ins			ret <4 x float> %ins
	}			}

	; A chain of inserts should collapse.			; A chain of inserts should collapse.

	define <4 x float> @twoInserts(<4 x float> %x) {			define <4 x float> @twoInserts(<4 x float> %x) {
	; CHECK-LABEL: @twoInserts(			; CHECK-LABEL: @twoInserts(
	; CHECK-NEXT: [[INS2:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float undef, float 0.000000e+00, float 4.200000e+01, float 1.100000e+01>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>			; CHECK-NEXT: [[INS2:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 0.000000e+00, float 4.200000e+01, float 1.100000e+01>, <4 x i32> <i32 0, i32 5, i32 6, i32 7>
	; CHECK-NEXT: ret <4 x float> [[INS2]]			; CHECK-NEXT: ret <4 x float> [[INS2]]
	;			;
	%shuf = shufflevector <4 x float> %x, <4 x float> zeroinitializer, <4 x i32> <i32 0, i32 5, i32 6, i32 3>			%shuf = shufflevector <4 x float> %x, <4 x float> zeroinitializer, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
	%ins1 = insertelement <4 x float> %shuf, float 42.0, i32 2			%ins1 = insertelement <4 x float> %shuf, float 42.0, i32 2
	%ins2 = insertelement <4 x float> %ins1, float 11.0, i32 3			%ins2 = insertelement <4 x float> %ins1, float 11.0, i32 3
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	define <4 x i32> @shuffleRetain(<4 x i32> %base) {			define <4 x i32> @shuffleRetain(<4 x i32> %base) {
	; CHECK-LABEL: @shuffleRetain(			; CHECK-LABEL: @shuffleRetain(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x i32> [[BASE:%.]], <4 x i32> <i32 undef, i32 undef, i32 undef, i32 1>, <4 x i32> <i32 1, i32 2, i32 undef, i32 7>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x i32> [[BASE:%.]], <4 x i32> <i32 poison, i32 poison, i32 poison, i32 1>, <4 x i32> <i32 1, i32 2, i32 undef, i32 7>
	; CHECK-NEXT: ret <4 x i32> [[SHUF]]			; CHECK-NEXT: ret <4 x i32> [[SHUF]]
	;			;
	%shuf = shufflevector <4 x i32> %base, <4 x i32> <i32 4, i32 3, i32 2, i32 1>, <4 x i32> <i32 1, i32 2, i32 undef, i32 7>			%shuf = shufflevector <4 x i32> %base, <4 x i32> <i32 4, i32 3, i32 2, i32 1>, <4 x i32> <i32 1, i32 2, i32 undef, i32 7>
	ret <4 x i32> %shuf			ret <4 x i32> %shuf
	}			}

	; TODO: Transform an arbitrary shuffle with constant into a shuffle that is equivalant to a vector select.			; TODO: Transform an arbitrary shuffle with constant into a shuffle that is equivalant to a vector select.

	define <4 x float> @disguisedSelect(<4 x float> %x) {			define <4 x float> @disguisedSelect(<4 x float> %x) {
	; CHECK-LABEL: @disguisedSelect(			; CHECK-LABEL: @disguisedSelect(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 undef, i32 6, i32 5, i32 3>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 undef, i32 6, i32 5, i32 3>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 0			; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 0
	; CHECK-NEXT: ret <4 x float> [[INS]]			; CHECK-NEXT: ret <4 x float> [[INS]]
	;			;
	%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float 3.0>, <4 x i32> <i32 7, i32 6, i32 5, i32 3>			%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float 3.0>, <4 x i32> <i32 7, i32 6, i32 5, i32 3>
	%ins = insertelement <4 x float> %shuf, float 4.0, i32 0			%ins = insertelement <4 x float> %shuf, float 4.0, i32 0
	ret <4 x float> %ins			ret <4 x float> %ins
	}			}

	; TODO: Fold arbitrary (non-select-equivalent) shuffles if the new shuffle would have the same shuffle mask.			; TODO: Fold arbitrary (non-select-equivalent) shuffles if the new shuffle would have the same shuffle mask.

	define <4 x float> @notSelectButNoMaskDifference(<4 x float> %x) {			define <4 x float> @notSelectButNoMaskDifference(<4 x float> %x) {
	; CHECK-LABEL: @notSelectButNoMaskDifference(			; CHECK-LABEL: @notSelectButNoMaskDifference(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 1, i32 5, i32 6, i32 undef>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 1, i32 5, i32 6, i32 undef>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 3			; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 3
	; CHECK-NEXT: ret <4 x float> [[INS]]			; CHECK-NEXT: ret <4 x float> [[INS]]
	;			;
	%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float 3.0>, <4 x i32> <i32 1, i32 5, i32 6, i32 3>			%shuf = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.0, float 2.0, float 3.0>, <4 x i32> <i32 1, i32 5, i32 6, i32 3>
	%ins = insertelement <4 x float> %shuf, float 4.0, i32 3			%ins = insertelement <4 x float> %shuf, float 4.0, i32 3
	ret <4 x float> %ins			ret <4 x float> %ins
	}			}

	; We purposely do not touch arbitrary (non-select-equivalent) shuffles because folding the insert may create a more expensive shuffle.			; We purposely do not touch arbitrary (non-select-equivalent) shuffles because folding the insert may create a more expensive shuffle.

	define <4 x float> @tooRisky(<4 x float> %x) {			define <4 x float> @tooRisky(<4 x float> %x) {
	; CHECK-LABEL: @tooRisky(			; CHECK-LABEL: @tooRisky(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float 1.000000e+00, float undef, float undef, float undef>, <4 x i32> <i32 1, i32 4, i32 4, i32 undef>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float 1.000000e+00, float poison, float poison, float poison>, <4 x i32> <i32 1, i32 4, i32 4, i32 undef>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 3			; CHECK-NEXT: [[INS:%.*]] = insertelement <4 x float> [[SHUF]], float 4.000000e+00, i32 3
	; CHECK-NEXT: ret <4 x float> [[INS]]			; CHECK-NEXT: ret <4 x float> [[INS]]
	;			;
	%shuf = shufflevector <4 x float> %x, <4 x float> <float 1.0, float undef, float undef, float undef>, <4 x i32> <i32 1, i32 4, i32 4, i32 4>			%shuf = shufflevector <4 x float> %x, <4 x float> <float 1.0, float undef, float undef, float undef>, <4 x i32> <i32 1, i32 4, i32 4, i32 4>
	%ins = insertelement <4 x float> %shuf, float 4.0, i32 3			%ins = insertelement <4 x float> %shuf, float 4.0, i32 3
	ret <4 x float> %ins			ret <4 x float> %ins
	}			}

	; Don't transform insert to shuffle if the original shuffle is not removed.			; Don't transform insert to shuffle if the original shuffle is not removed.
	; TODO: Ease the one-use restriction if the insert scalar would simplify the shuffle to a full vector constant?			; TODO: Ease the one-use restriction if the insert scalar would simplify the shuffle to a full vector constant?

	define <3 x float> @twoShufUses(<3 x float> %x) {			define <3 x float> @twoShufUses(<3 x float> %x) {
	; CHECK-LABEL: @twoShufUses(			; CHECK-LABEL: @twoShufUses(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <3 x float> [[X:%.]], <3 x float> <float undef, float 1.000000e+00, float 2.000000e+00>, <3 x i32> <i32 0, i32 4, i32 5>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <3 x float> [[X:%.]], <3 x float> <float poison, float 1.000000e+00, float 2.000000e+00>, <3 x i32> <i32 0, i32 4, i32 5>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <3 x float> [[SHUF]], float 4.200000e+01, i2 1			; CHECK-NEXT: [[INS:%.*]] = insertelement <3 x float> [[SHUF]], float 4.200000e+01, i2 1
	; CHECK-NEXT: [[ADD:%.*]] = fadd <3 x float> [[SHUF]], [[INS]]			; CHECK-NEXT: [[ADD:%.*]] = fadd <3 x float> [[SHUF]], [[INS]]
	; CHECK-NEXT: ret <3 x float> [[ADD]]			; CHECK-NEXT: ret <3 x float> [[ADD]]
	;			;
	%shuf = shufflevector <3 x float> %x, <3 x float> <float undef, float 1.0, float 2.0>, <3 x i32> <i32 0, i32 4, i32 5>			%shuf = shufflevector <3 x float> %x, <3 x float> <float undef, float 1.0, float 2.0>, <3 x i32> <i32 0, i32 4, i32 5>
	%ins = insertelement <3 x float> %shuf, float 42.0, i2 1			%ins = insertelement <3 x float> %shuf, float 42.0, i2 1
	%add = fadd <3 x float> %shuf, %ins			%add = fadd <3 x float> %shuf, %ins
	ret <3 x float> %add			ret <3 x float> %add
	}			}

	; The inserted scalar constant index is out-of-bounds for the shuffle vector constant.			; The inserted scalar constant index is out-of-bounds for the shuffle vector constant.

	define <5 x i8> @longerMask(<3 x i8> %x) {			define <5 x i8> @longerMask(<3 x i8> %x) {
	; CHECK-LABEL: @longerMask(			; CHECK-LABEL: @longerMask(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <3 x i8> [[X:%.]], <3 x i8> <i8 undef, i8 1, i8 undef>, <5 x i32> <i32 2, i32 1, i32 4, i32 undef, i32 undef>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <3 x i8> [[X:%.]], <3 x i8> <i8 poison, i8 1, i8 poison>, <5 x i32> <i32 2, i32 1, i32 4, i32 undef, i32 undef>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <5 x i8> [[SHUF]], i8 42, i17 4			; CHECK-NEXT: [[INS:%.*]] = insertelement <5 x i8> [[SHUF]], i8 42, i17 4
	; CHECK-NEXT: ret <5 x i8> [[INS]]			; CHECK-NEXT: ret <5 x i8> [[INS]]
	;			;
	%shuf = shufflevector <3 x i8> %x, <3 x i8> <i8 undef, i8 1, i8 2>, <5 x i32> <i32 2, i32 1, i32 4, i32 3, i32 0>			%shuf = shufflevector <3 x i8> %x, <3 x i8> <i8 undef, i8 1, i8 2>, <5 x i32> <i32 2, i32 1, i32 4, i32 3, i32 0>
	%ins = insertelement <5 x i8> %shuf, i8 42, i17 4			%ins = insertelement <5 x i8> %shuf, i8 42, i17 4
	ret <5 x i8> %ins			ret <5 x i8> %ins
	}			}

	; TODO: The inserted constant could get folded into the shuffle vector constant.			; TODO: The inserted constant could get folded into the shuffle vector constant.

	define <3 x i8> @shorterMask(<5 x i8> %x) {			define <3 x i8> @shorterMask(<5 x i8> %x) {
	; CHECK-LABEL: @shorterMask(			; CHECK-LABEL: @shorterMask(
	; CHECK-NEXT: [[SHUF:%.]] = shufflevector <5 x i8> [[X:%.]], <5 x i8> undef, <3 x i32> <i32 undef, i32 1, i32 4>			; CHECK-NEXT: [[SHUF:%.]] = shufflevector <5 x i8> [[X:%.]], <5 x i8> poison, <3 x i32> <i32 undef, i32 1, i32 4>
	; CHECK-NEXT: [[INS:%.*]] = insertelement <3 x i8> [[SHUF]], i8 42, i21 0			; CHECK-NEXT: [[INS:%.*]] = insertelement <3 x i8> [[SHUF]], i8 42, i21 0
	; CHECK-NEXT: ret <3 x i8> [[INS]]			; CHECK-NEXT: ret <3 x i8> [[INS]]
	;			;
	%shuf = shufflevector <5 x i8> %x, <5 x i8> <i8 undef, i8 1, i8 2, i8 3, i8 4>, <3 x i32> <i32 2, i32 1, i32 4>			%shuf = shufflevector <5 x i8> %x, <5 x i8> <i8 undef, i8 1, i8 2, i8 3, i8 4>, <3 x i32> <i32 2, i32 1, i32 4>
	%ins = insertelement <3 x i8> %shuf, i8 42, i21 0			%ins = insertelement <3 x i8> %shuf, i8 42, i21 0
	ret <3 x i8> %ins			ret <3 x i8> %ins
	}			}

llvm/test/Transforms/InstCombine/insert-extract-shuffle-inseltpoison.ll

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
}		}

; PR26015: https://llvm.org/bugs/show_bug.cgi?id=26015		; PR26015: https://llvm.org/bugs/show_bug.cgi?id=26015
; The widening shuffle must be inserted before any uses.		; The widening shuffle must be inserted before any uses.

define <8 x i16> @pr26015(<4 x i16> %t0) {		define <8 x i16> @pr26015(<4 x i16> %t0) {
; CHECK-LABEL: @pr26015(		; CHECK-LABEL: @pr26015(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i16> [[T0:%.]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i16> [[T0:%.]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 undef>, <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 10, i32 4, i32 5, i32 6, i32 11>		; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 poison>, <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 10, i32 4, i32 5, i32 6, i32 11>
; CHECK-NEXT: ret <8 x i16> [[T5]]		; CHECK-NEXT: ret <8 x i16> [[T5]]
;		;
%t1 = extractelement <4 x i16> %t0, i32 2		%t1 = extractelement <4 x i16> %t0, i32 2
%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3		%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3
%t3 = insertelement <8 x i16> %t2, i16 0, i32 6		%t3 = insertelement <8 x i16> %t2, i16 0, i32 6
%t4 = extractelement <4 x i16> %t0, i32 3		%t4 = extractelement <4 x i16> %t0, i32 3
%t5 = insertelement <8 x i16> %t3, i16 %t4, i32 7		%t5 = insertelement <8 x i16> %t3, i16 %t4, i32 7
ret <8 x i16> %t5		ret <8 x i16> %t5
}		}

; PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999		; PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999
; TODO: The widening shuffle could be inserted at the start of the function to allow the first extract to use it.		; TODO: The widening shuffle could be inserted at the start of the function to allow the first extract to use it.

define <8 x i16> @pr25999(<4 x i16> %t0, i1 %b) {		define <8 x i16> @pr25999(<4 x i16> %t0, i1 %b) {
; CHECK-LABEL: @pr25999(		; CHECK-LABEL: @pr25999(
; CHECK-NEXT: [[T1:%.]] = extractelement <4 x i16> [[T0:%.]], i32 2		; CHECK-NEXT: [[T1:%.]] = extractelement <4 x i16> [[T0:%.]], i32 2
; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]		; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]
; CHECK: if:		; CHECK: if:
; CHECK-NEXT: [[TMP1:%.*]] = shufflevector <4 x i16> [[T0]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 undef, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.*]] = shufflevector <4 x i16> [[T0]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 undef, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: [[T3:%.*]] = insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 undef>, i16 [[T1]], i32 3		; CHECK-NEXT: [[T3:%.*]] = insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 poison>, i16 [[T1]], i32 3
; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> [[T3]], <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 11>		; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> [[T3]], <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 11>
; CHECK-NEXT: ret <8 x i16> [[T5]]		; CHECK-NEXT: ret <8 x i16> [[T5]]
; CHECK: end:		; CHECK: end:
; CHECK-NEXT: [[A1:%.*]] = add i16 [[T1]], 4		; CHECK-NEXT: [[A1:%.*]] = add i16 [[T1]], 4
; CHECK-NEXT: [[T6:%.*]] = insertelement <8 x i16> <i16 undef, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 [[A1]], i32 0		; CHECK-NEXT: [[T6:%.*]] = insertelement <8 x i16> <i16 poison, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 [[A1]], i32 0
; CHECK-NEXT: ret <8 x i16> [[T6]]		; CHECK-NEXT: ret <8 x i16> [[T6]]
;		;

%t1 = extractelement <4 x i16> %t0, i32 2		%t1 = extractelement <4 x i16> %t0, i32 2
br i1 %b, label %if, label %end		br i1 %b, label %if, label %end

if:		if:
%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3		%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines

define double @pr26354(<2 x double>* %tmp, i1 %B) {		define double @pr26354(<2 x double>* %tmp, i1 %B) {
; CHECK-LABEL: @pr26354(		; CHECK-LABEL: @pr26354(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[LD:%.]] = load <2 x double>, <2 x double> [[TMP:%.*]], align 16		; CHECK-NEXT: [[LD:%.]] = load <2 x double>, <2 x double> [[TMP:%.*]], align 16
; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]		; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]
; CHECK: if:		; CHECK: if:
; CHECK-NEXT: [[E2:%.*]] = extractelement <2 x double> [[LD]], i32 1		; CHECK-NEXT: [[E2:%.*]] = extractelement <2 x double> [[LD]], i32 1
; CHECK-NEXT: [[I1:%.*]] = insertelement <4 x double> <double 0.000000e+00, double 0.000000e+00, double 0.000000e+00, double undef>, double [[E2]], i32 3		; CHECK-NEXT: [[I1:%.*]] = insertelement <4 x double> <double 0.000000e+00, double 0.000000e+00, double 0.000000e+00, double poison>, double [[E2]], i32 3
; CHECK-NEXT: br label [[END]]		; CHECK-NEXT: br label [[END]]
; CHECK: end:		; CHECK: end:
; CHECK-NEXT: [[PH:%.]] = phi <4 x double> [ undef, [[ENTRY:%.]] ], [ [[I1]], [[IF]] ]		; CHECK-NEXT: [[PH:%.]] = phi <4 x double> [ undef, [[ENTRY:%.]] ], [ [[I1]], [[IF]] ]
; CHECK-NEXT: [[E1:%.*]] = extractelement <2 x double> [[LD]], i32 0		; CHECK-NEXT: [[E1:%.*]] = extractelement <2 x double> [[LD]], i32 0
; CHECK-NEXT: [[E3:%.*]] = extractelement <4 x double> [[PH]], i32 1		; CHECK-NEXT: [[E3:%.*]] = extractelement <4 x double> [[PH]], i32 1
; CHECK-NEXT: [[MU:%.*]] = fmul double [[E1]], [[E3]]		; CHECK-NEXT: [[MU:%.*]] = fmul double [[E1]], [[E3]]
; CHECK-NEXT: ret double [[MU]]		; CHECK-NEXT: ret double [[MU]]
;		;
Show All 21 Lines
define <4 x float> @PR30923(<2 x float> %x) {		define <4 x float> @PR30923(<2 x float> %x) {
; CHECK-LABEL: @PR30923(		; CHECK-LABEL: @PR30923(
; CHECK-NEXT: bb1:		; CHECK-NEXT: bb1:
; CHECK-NEXT: [[EXT1:%.]] = extractelement <2 x float> [[X:%.]], i32 1		; CHECK-NEXT: [[EXT1:%.]] = extractelement <2 x float> [[X:%.]], i32 1
; CHECK-NEXT: store float [[EXT1]], float* undef, align 4		; CHECK-NEXT: store float [[EXT1]], float* undef, align 4
; CHECK-NEXT: br label [[BB2:%.*]]		; CHECK-NEXT: br label [[BB2:%.*]]
; CHECK: bb2:		; CHECK: bb2:
; CHECK-NEXT: [[EXT2:%.*]] = extractelement <2 x float> [[X]], i32 0		; CHECK-NEXT: [[EXT2:%.*]] = extractelement <2 x float> [[X]], i32 0
; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> <float 0.000000e+00, float 0.000000e+00, float undef, float undef>, float [[EXT2]], i32 2		; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> <float 0.000000e+00, float 0.000000e+00, float poison, float poison>, float [[EXT2]], i32 2
; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float [[EXT1]], i32 3		; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float [[EXT1]], i32 3
; CHECK-NEXT: ret <4 x float> [[INS2]]		; CHECK-NEXT: ret <4 x float> [[INS2]]
;		;
bb1:		bb1:
%ext1 = extractelement <2 x float> %x, i32 1		%ext1 = extractelement <2 x float> %x, i32 1
store float %ext1, float* undef, align 4		store float %ext1, float* undef, align 4
br label %bb2		br label %bb2

bb2:		bb2:
%widen = shufflevector <2 x float> %x, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>		%widen = shufflevector <2 x float> %x, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
%ext2 = extractelement <4 x float> %widen, i32 0		%ext2 = extractelement <4 x float> %widen, i32 0
%ins1 = insertelement <4 x float> <float 0.0, float 0.0, float undef, float undef>, float %ext2, i32 2		%ins1 = insertelement <4 x float> <float 0.0, float 0.0, float undef, float undef>, float %ext2, i32 2
%ins2 = insertelement <4 x float> %ins1, float %ext1, i32 3		%ins2 = insertelement <4 x float> %ins1, float %ext1, i32 3
ret <4 x float> %ins2		ret <4 x float> %ins2
}		}

; Don't insert extractelements from the wider vector before the def of the index operand.		; Don't insert extractelements from the wider vector before the def of the index operand.

define <4 x i32> @extractelt_insertion(<2 x i32> %x, i32 %y) {		define <4 x i32> @extractelt_insertion(<2 x i32> %x, i32 %y) {
; CHECK-LABEL: @extractelt_insertion(		; CHECK-LABEL: @extractelt_insertion(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP0:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
; CHECK-NEXT: [[B:%.*]] = shufflevector <4 x i32> <i32 0, i32 0, i32 0, i32 undef>, <4 x i32> [[TMP0]], <4 x i32> <i32 0, i32 1, i32 2, i32 5>		; CHECK-NEXT: [[B:%.*]] = shufflevector <4 x i32> <i32 0, i32 0, i32 0, i32 poison>, <4 x i32> [[TMP0]], <4 x i32> <i32 0, i32 1, i32 2, i32 5>
; CHECK-NEXT: [[C:%.]] = add i32 [[Y:%.]], 3		; CHECK-NEXT: [[C:%.]] = add i32 [[Y:%.]], 3
; CHECK-NEXT: [[TMP1:%.*]] = extractelement <4 x i32> [[TMP0]], i32 [[C]]		; CHECK-NEXT: [[TMP1:%.*]] = extractelement <4 x i32> [[TMP0]], i32 [[C]]
; CHECK-NEXT: [[E:%.*]] = icmp eq i32 [[TMP1]], 0		; CHECK-NEXT: [[E:%.*]] = icmp eq i32 [[TMP1]], 0
; CHECK-NEXT: [[RET:%.*]] = select i1 [[E]], <4 x i32> [[B]], <4 x i32> zeroinitializer		; CHECK-NEXT: [[RET:%.*]] = select i1 [[E]], <4 x i32> [[B]], <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x i32> [[RET]]		; CHECK-NEXT: ret <4 x i32> [[RET]]
;		;
entry:		entry:
%a = extractelement <2 x i32> %x, i32 1		%a = extractelement <2 x i32> %x, i32 1
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	;
%r = shufflevector <4 x float> %y, <4 x float> %xv, <4 x i32> <i32 6, i32 undef, i32 2, i32 3>		%r = shufflevector <4 x float> %y, <4 x float> %xv, <4 x i32> <i32 6, i32 undef, i32 2, i32 3>
ret <4 x float> %r		ret <4 x float> %r
}		}

; Negative test - size-changing shuffle.		; Negative test - size-changing shuffle.

define <5 x float> @insert_not_undef_shuffle_translate_commute_lengthen(float %x, <4 x float> %y, <4 x float> %q) {		define <5 x float> @insert_not_undef_shuffle_translate_commute_lengthen(float %x, <4 x float> %y, <4 x float> %q) {
; CHECK-LABEL: @insert_not_undef_shuffle_translate_commute_lengthen(		; CHECK-LABEL: @insert_not_undef_shuffle_translate_commute_lengthen(
; CHECK-NEXT: [[XV:%.]] = insertelement <4 x float> undef, float [[X:%.]], i32 2		; CHECK-NEXT: [[XV:%.]] = insertelement <4 x float> poison, float [[X:%.]], i32 2
; CHECK-NEXT: [[R:%.]] = shufflevector <4 x float> [[Y:%.]], <4 x float> [[XV]], <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>		; CHECK-NEXT: [[R:%.]] = shufflevector <4 x float> [[Y:%.]], <4 x float> [[XV]], <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>
; CHECK-NEXT: ret <5 x float> [[R]]		; CHECK-NEXT: ret <5 x float> [[R]]
;		;
%xv = insertelement <4 x float> %q, float %x, i32 2		%xv = insertelement <4 x float> %q, float %x, i32 2
%r = shufflevector <4 x float> %y, <4 x float> %xv, <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>		%r = shufflevector <4 x float> %y, <4 x float> %xv, <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>
ret <5 x float> %r		ret <5 x float> %r
}		}

▲ Show 20 Lines • Show All 308 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/insert-extract-shuffle.ll

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
}		}

; PR26015: https://llvm.org/bugs/show_bug.cgi?id=26015		; PR26015: https://llvm.org/bugs/show_bug.cgi?id=26015
; The widening shuffle must be inserted before any uses.		; The widening shuffle must be inserted before any uses.

define <8 x i16> @pr26015(<4 x i16> %t0) {		define <8 x i16> @pr26015(<4 x i16> %t0) {
; CHECK-LABEL: @pr26015(		; CHECK-LABEL: @pr26015(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i16> [[T0:%.]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <4 x i16> [[T0:%.]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 2, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 undef>, <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 10, i32 4, i32 5, i32 6, i32 11>		; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 poison>, <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 10, i32 4, i32 5, i32 6, i32 11>
; CHECK-NEXT: ret <8 x i16> [[T5]]		; CHECK-NEXT: ret <8 x i16> [[T5]]
;		;
%t1 = extractelement <4 x i16> %t0, i32 2		%t1 = extractelement <4 x i16> %t0, i32 2
%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3		%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3
%t3 = insertelement <8 x i16> %t2, i16 0, i32 6		%t3 = insertelement <8 x i16> %t2, i16 0, i32 6
%t4 = extractelement <4 x i16> %t0, i32 3		%t4 = extractelement <4 x i16> %t0, i32 3
%t5 = insertelement <8 x i16> %t3, i16 %t4, i32 7		%t5 = insertelement <8 x i16> %t3, i16 %t4, i32 7
ret <8 x i16> %t5		ret <8 x i16> %t5
}		}

; PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999		; PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999
; TODO: The widening shuffle could be inserted at the start of the function to allow the first extract to use it.		; TODO: The widening shuffle could be inserted at the start of the function to allow the first extract to use it.

define <8 x i16> @pr25999(<4 x i16> %t0, i1 %b) {		define <8 x i16> @pr25999(<4 x i16> %t0, i1 %b) {
; CHECK-LABEL: @pr25999(		; CHECK-LABEL: @pr25999(
; CHECK-NEXT: [[T1:%.]] = extractelement <4 x i16> [[T0:%.]], i32 2		; CHECK-NEXT: [[T1:%.]] = extractelement <4 x i16> [[T0:%.]], i32 2
; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]		; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]
; CHECK: if:		; CHECK: if:
; CHECK-NEXT: [[TMP1:%.*]] = shufflevector <4 x i16> [[T0]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 undef, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.*]] = shufflevector <4 x i16> [[T0]], <4 x i16> undef, <8 x i32> <i32 undef, i32 undef, i32 undef, i32 3, i32 undef, i32 undef, i32 undef, i32 undef>
; CHECK-NEXT: [[T3:%.*]] = insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 undef, i16 0, i16 0, i16 0, i16 undef>, i16 [[T1]], i32 3		; CHECK-NEXT: [[T3:%.*]] = insertelement <8 x i16> <i16 0, i16 0, i16 0, i16 poison, i16 0, i16 0, i16 0, i16 poison>, i16 [[T1]], i32 3
; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> [[T3]], <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 11>		; CHECK-NEXT: [[T5:%.*]] = shufflevector <8 x i16> [[T3]], <8 x i16> [[TMP1]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 11>
; CHECK-NEXT: ret <8 x i16> [[T5]]		; CHECK-NEXT: ret <8 x i16> [[T5]]
; CHECK: end:		; CHECK: end:
; CHECK-NEXT: [[A1:%.*]] = add i16 [[T1]], 4		; CHECK-NEXT: [[A1:%.*]] = add i16 [[T1]], 4
; CHECK-NEXT: [[T6:%.*]] = insertelement <8 x i16> <i16 undef, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 [[A1]], i32 0		; CHECK-NEXT: [[T6:%.*]] = insertelement <8 x i16> <i16 poison, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0>, i16 [[A1]], i32 0
; CHECK-NEXT: ret <8 x i16> [[T6]]		; CHECK-NEXT: ret <8 x i16> [[T6]]
;		;

%t1 = extractelement <4 x i16> %t0, i32 2		%t1 = extractelement <4 x i16> %t0, i32 2
br i1 %b, label %if, label %end		br i1 %b, label %if, label %end

if:		if:
%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3		%t2 = insertelement <8 x i16> zeroinitializer, i16 %t1, i32 3
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines

define double @pr26354(<2 x double>* %tmp, i1 %B) {		define double @pr26354(<2 x double>* %tmp, i1 %B) {
; CHECK-LABEL: @pr26354(		; CHECK-LABEL: @pr26354(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[LD:%.]] = load <2 x double>, <2 x double> [[TMP:%.*]], align 16		; CHECK-NEXT: [[LD:%.]] = load <2 x double>, <2 x double> [[TMP:%.*]], align 16
; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]		; CHECK-NEXT: br i1 [[B:%.]], label [[IF:%.]], label [[END:%.*]]
; CHECK: if:		; CHECK: if:
; CHECK-NEXT: [[E2:%.*]] = extractelement <2 x double> [[LD]], i32 1		; CHECK-NEXT: [[E2:%.*]] = extractelement <2 x double> [[LD]], i32 1
; CHECK-NEXT: [[I1:%.*]] = insertelement <4 x double> <double 0.000000e+00, double 0.000000e+00, double 0.000000e+00, double undef>, double [[E2]], i32 3		; CHECK-NEXT: [[I1:%.*]] = insertelement <4 x double> <double 0.000000e+00, double 0.000000e+00, double 0.000000e+00, double poison>, double [[E2]], i32 3
; CHECK-NEXT: br label [[END]]		; CHECK-NEXT: br label [[END]]
; CHECK: end:		; CHECK: end:
; CHECK-NEXT: [[PH:%.]] = phi <4 x double> [ undef, [[ENTRY:%.]] ], [ [[I1]], [[IF]] ]		; CHECK-NEXT: [[PH:%.]] = phi <4 x double> [ undef, [[ENTRY:%.]] ], [ [[I1]], [[IF]] ]
; CHECK-NEXT: [[E1:%.*]] = extractelement <2 x double> [[LD]], i32 0		; CHECK-NEXT: [[E1:%.*]] = extractelement <2 x double> [[LD]], i32 0
; CHECK-NEXT: [[E3:%.*]] = extractelement <4 x double> [[PH]], i32 1		; CHECK-NEXT: [[E3:%.*]] = extractelement <4 x double> [[PH]], i32 1
; CHECK-NEXT: [[MU:%.*]] = fmul double [[E1]], [[E3]]		; CHECK-NEXT: [[MU:%.*]] = fmul double [[E1]], [[E3]]
; CHECK-NEXT: ret double [[MU]]		; CHECK-NEXT: ret double [[MU]]
;		;
Show All 21 Lines
define <4 x float> @PR30923(<2 x float> %x) {		define <4 x float> @PR30923(<2 x float> %x) {
; CHECK-LABEL: @PR30923(		; CHECK-LABEL: @PR30923(
; CHECK-NEXT: bb1:		; CHECK-NEXT: bb1:
; CHECK-NEXT: [[EXT1:%.]] = extractelement <2 x float> [[X:%.]], i32 1		; CHECK-NEXT: [[EXT1:%.]] = extractelement <2 x float> [[X:%.]], i32 1
; CHECK-NEXT: store float [[EXT1]], float* undef, align 4		; CHECK-NEXT: store float [[EXT1]], float* undef, align 4
; CHECK-NEXT: br label [[BB2:%.*]]		; CHECK-NEXT: br label [[BB2:%.*]]
; CHECK: bb2:		; CHECK: bb2:
; CHECK-NEXT: [[EXT2:%.*]] = extractelement <2 x float> [[X]], i32 0		; CHECK-NEXT: [[EXT2:%.*]] = extractelement <2 x float> [[X]], i32 0
; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> <float 0.000000e+00, float 0.000000e+00, float undef, float undef>, float [[EXT2]], i32 2		; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> <float 0.000000e+00, float 0.000000e+00, float poison, float poison>, float [[EXT2]], i32 2
; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float [[EXT1]], i32 3		; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float [[EXT1]], i32 3
; CHECK-NEXT: ret <4 x float> [[INS2]]		; CHECK-NEXT: ret <4 x float> [[INS2]]
;		;
bb1:		bb1:
%ext1 = extractelement <2 x float> %x, i32 1		%ext1 = extractelement <2 x float> %x, i32 1
store float %ext1, float* undef, align 4		store float %ext1, float* undef, align 4
br label %bb2		br label %bb2

bb2:		bb2:
%widen = shufflevector <2 x float> %x, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>		%widen = shufflevector <2 x float> %x, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
%ext2 = extractelement <4 x float> %widen, i32 0		%ext2 = extractelement <4 x float> %widen, i32 0
%ins1 = insertelement <4 x float> <float 0.0, float 0.0, float undef, float undef>, float %ext2, i32 2		%ins1 = insertelement <4 x float> <float 0.0, float 0.0, float undef, float undef>, float %ext2, i32 2
%ins2 = insertelement <4 x float> %ins1, float %ext1, i32 3		%ins2 = insertelement <4 x float> %ins1, float %ext1, i32 3
ret <4 x float> %ins2		ret <4 x float> %ins2
}		}

; Don't insert extractelements from the wider vector before the def of the index operand.		; Don't insert extractelements from the wider vector before the def of the index operand.

define <4 x i32> @extractelt_insertion(<2 x i32> %x, i32 %y) {		define <4 x i32> @extractelt_insertion(<2 x i32> %x, i32 %y) {
; CHECK-LABEL: @extractelt_insertion(		; CHECK-LABEL: @extractelt_insertion(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP0:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
; CHECK-NEXT: [[B:%.*]] = shufflevector <4 x i32> <i32 0, i32 0, i32 0, i32 undef>, <4 x i32> [[TMP0]], <4 x i32> <i32 0, i32 1, i32 2, i32 5>		; CHECK-NEXT: [[B:%.*]] = shufflevector <4 x i32> <i32 0, i32 0, i32 0, i32 poison>, <4 x i32> [[TMP0]], <4 x i32> <i32 0, i32 1, i32 2, i32 5>
; CHECK-NEXT: [[C:%.]] = add i32 [[Y:%.]], 3		; CHECK-NEXT: [[C:%.]] = add i32 [[Y:%.]], 3
; CHECK-NEXT: [[TMP1:%.*]] = extractelement <4 x i32> [[TMP0]], i32 [[C]]		; CHECK-NEXT: [[TMP1:%.*]] = extractelement <4 x i32> [[TMP0]], i32 [[C]]
; CHECK-NEXT: [[E:%.*]] = icmp eq i32 [[TMP1]], 0		; CHECK-NEXT: [[E:%.*]] = icmp eq i32 [[TMP1]], 0
; CHECK-NEXT: [[RET:%.*]] = select i1 [[E]], <4 x i32> [[B]], <4 x i32> zeroinitializer		; CHECK-NEXT: [[RET:%.*]] = select i1 [[E]], <4 x i32> [[B]], <4 x i32> zeroinitializer
; CHECK-NEXT: ret <4 x i32> [[RET]]		; CHECK-NEXT: ret <4 x i32> [[RET]]
;		;
entry:		entry:
%a = extractelement <2 x i32> %x, i32 1		%a = extractelement <2 x i32> %x, i32 1
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	;
%r = shufflevector <4 x float> %y, <4 x float> %xv, <4 x i32> <i32 6, i32 undef, i32 2, i32 3>		%r = shufflevector <4 x float> %y, <4 x float> %xv, <4 x i32> <i32 6, i32 undef, i32 2, i32 3>
ret <4 x float> %r		ret <4 x float> %r
}		}

; Negative test - size-changing shuffle.		; Negative test - size-changing shuffle.

define <5 x float> @insert_not_undef_shuffle_translate_commute_lengthen(float %x, <4 x float> %y, <4 x float> %q) {		define <5 x float> @insert_not_undef_shuffle_translate_commute_lengthen(float %x, <4 x float> %y, <4 x float> %q) {
; CHECK-LABEL: @insert_not_undef_shuffle_translate_commute_lengthen(		; CHECK-LABEL: @insert_not_undef_shuffle_translate_commute_lengthen(
; CHECK-NEXT: [[XV:%.]] = insertelement <4 x float> undef, float [[X:%.]], i32 2		; CHECK-NEXT: [[XV:%.]] = insertelement <4 x float> poison, float [[X:%.]], i32 2
; CHECK-NEXT: [[R:%.]] = shufflevector <4 x float> [[Y:%.]], <4 x float> [[XV]], <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>		; CHECK-NEXT: [[R:%.]] = shufflevector <4 x float> [[Y:%.]], <4 x float> [[XV]], <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>
; CHECK-NEXT: ret <5 x float> [[R]]		; CHECK-NEXT: ret <5 x float> [[R]]
;		;
%xv = insertelement <4 x float> %q, float %x, i32 2		%xv = insertelement <4 x float> %q, float %x, i32 2
%r = shufflevector <4 x float> %y, <4 x float> %xv, <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>		%r = shufflevector <4 x float> %y, <4 x float> %xv, <5 x i32> <i32 0, i32 6, i32 2, i32 undef, i32 undef>
ret <5 x float> %r		ret <5 x float> %r
}		}

▲ Show 20 Lines • Show All 308 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/masked_intrinsics-inseltpoison.ll

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	;
%ptv1 = insertelement <2 x double> poison, double %pt, i64 0		%ptv1 = insertelement <2 x double> poison, double %pt, i64 0
%ptv2 = insertelement <2 x double> %ptv1, double %pt, i64 1		%ptv2 = insertelement <2 x double> %ptv1, double %pt, i64 1
%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %ptr, i32 2, <2 x i1> <i1 true, i1 false>, <2 x double> %ptv2)		%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %ptr, i32 2, <2 x i1> <i1 true, i1 false>, <2 x double> %ptv2)
ret <2 x double> %res		ret <2 x double> %res
}		}

define double @load_all(double* %base, double %pt) {		define double @load_all(double* %base, double %pt) {
; CHECK-LABEL: @load_all(		; CHECK-LABEL: @load_all(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 0, i64 undef, i64 2, i64 3>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 0, i64 poison, i64 2, i64 3>
; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)		; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)
; CHECK-NEXT: [[ELT:%.*]] = extractelement <4 x double> [[RES]], i64 2		; CHECK-NEXT: [[ELT:%.*]] = extractelement <4 x double> [[RES]], i64 2
; CHECK-NEXT: ret double [[ELT]]		; CHECK-NEXT: ret double [[ELT]]
;		;
%ptrs = getelementptr double, double* %base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>		%ptrs = getelementptr double, double* %base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>
%res = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double*> %ptrs, i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)		%res = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double*> %ptrs, i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)
%elt = extractelement <4 x double> %res, i64 2		%elt = extractelement <4 x double> %res, i64 2
ret double %elt		ret double %elt
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines
;		;
%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> zeroinitializer, <2 x double> %passthru)		%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> zeroinitializer, <2 x double> %passthru)
ret <2 x double> %res		ret <2 x double> %res
}		}


define <2 x double> @gather_onemask(<2 x double*> %ptrs, <2 x double> %passthru) {		define <2 x double> @gather_onemask(<2 x double*> %ptrs, <2 x double> %passthru) {
; CHECK-LABEL: @gather_onemask(		; CHECK-LABEL: @gather_onemask(
; CHECK-NEXT: [[RES:%.]] = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double> [[PTRS:%.*]], i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> undef)		; CHECK-NEXT: [[RES:%.]] = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double> [[PTRS:%.*]], i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> poison)
; CHECK-NEXT: ret <2 x double> [[RES]]		; CHECK-NEXT: ret <2 x double> [[RES]]
;		;
%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> %passthru)		%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> %passthru)
ret <2 x double> %res		ret <2 x double> %res
}		}

define <4 x double> @gather_lane2(double* %base, double %pt) {		define <4 x double> @gather_lane2(double* %base, double %pt) {
; CHECK-LABEL: @gather_lane2(		; CHECK-LABEL: @gather_lane2(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 undef, i64 undef, i64 2, i64 undef>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 poison, i64 poison, i64 2, i64 poison>
; CHECK-NEXT: [[PT_V1:%.]] = insertelement <4 x double> poison, double [[PT:%.]], i64 0		; CHECK-NEXT: [[PT_V1:%.]] = insertelement <4 x double> poison, double [[PT:%.]], i64 0
; CHECK-NEXT: [[PT_V2:%.*]] = shufflevector <4 x double> [[PT_V1]], <4 x double> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 0>		; CHECK-NEXT: [[PT_V2:%.*]] = shufflevector <4 x double> [[PT_V1]], <4 x double> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 0>
; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 false, i1 false, i1 true, i1 false>, <4 x double> [[PT_V2]])		; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 false, i1 false, i1 true, i1 false>, <4 x double> [[PT_V2]])
; CHECK-NEXT: ret <4 x double> [[RES]]		; CHECK-NEXT: ret <4 x double> [[RES]]
;		;
%ptrs = getelementptr double, double *%base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>		%ptrs = getelementptr double, double *%base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>
%pt_v1 = insertelement <4 x double> poison, double %pt, i64 0		%pt_v1 = insertelement <4 x double> poison, double %pt, i64 0
%pt_v2 = shufflevector <4 x double> %pt_v1, <4 x double> undef, <4 x i32> zeroinitializer		%pt_v2 = shufflevector <4 x double> %pt_v1, <4 x double> undef, <4 x i32> zeroinitializer
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %val, <2 x double*> %ptrs, i32 8, <2 x i1> zeroinitializer)		call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %val, <2 x double*> %ptrs, i32 8, <2 x i1> zeroinitializer)
ret void		ret void
}		}

define void @scatter_demandedelts(double* %ptr, double %val) {		define void @scatter_demandedelts(double* %ptr, double %val) {
; CHECK-LABEL: @scatter_demandedelts(		; CHECK-LABEL: @scatter_demandedelts(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[PTR:%.*]], <2 x i64> <i64 0, i64 undef>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[PTR:%.*]], <2 x i64> <i64 0, i64 poison>
; CHECK-NEXT: [[VALVEC1:%.]] = insertelement <2 x double> poison, double [[VAL:%.]], i32 0		; CHECK-NEXT: [[VALVEC1:%.]] = insertelement <2 x double> poison, double [[VAL:%.]], i32 0
; CHECK-NEXT: call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> [[VALVEC1]], <2 x double*> [[PTRS]], i32 8, <2 x i1> <i1 true, i1 false>)		; CHECK-NEXT: call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> [[VALVEC1]], <2 x double*> [[PTRS]], i32 8, <2 x i1> <i1 true, i1 false>)
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
%ptrs = getelementptr double, double* %ptr, <2 x i64> <i64 0, i64 1>		%ptrs = getelementptr double, double* %ptr, <2 x i64> <i64 0, i64 1>
%valvec1 = insertelement <2 x double> poison, double %val, i32 0		%valvec1 = insertelement <2 x double> poison, double %val, i32 0
%valvec2 = insertelement <2 x double> %valvec1, double %val, i32 1		%valvec2 = insertelement <2 x double> %valvec1, double %val, i32 1
call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %valvec2, <2 x double*> %ptrs, i32 8, <2 x i1> <i1 true, i1 false>)		call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %valvec2, <2 x double*> %ptrs, i32 8, <2 x i1> <i1 true, i1 false>)
ret void		ret void
}		}

llvm/test/Transforms/InstCombine/masked_intrinsics.ll

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	;
%ptv1 = insertelement <2 x double> undef, double %pt, i64 0		%ptv1 = insertelement <2 x double> undef, double %pt, i64 0
%ptv2 = insertelement <2 x double> %ptv1, double %pt, i64 1		%ptv2 = insertelement <2 x double> %ptv1, double %pt, i64 1
%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %ptr, i32 2, <2 x i1> <i1 true, i1 false>, <2 x double> %ptv2)		%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %ptr, i32 2, <2 x i1> <i1 true, i1 false>, <2 x double> %ptv2)
ret <2 x double> %res		ret <2 x double> %res
}		}

define double @load_all(double* %base, double %pt) {		define double @load_all(double* %base, double %pt) {
; CHECK-LABEL: @load_all(		; CHECK-LABEL: @load_all(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 0, i64 undef, i64 2, i64 3>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 0, i64 poison, i64 2, i64 3>
; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)		; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)
; CHECK-NEXT: [[ELT:%.*]] = extractelement <4 x double> [[RES]], i64 2		; CHECK-NEXT: [[ELT:%.*]] = extractelement <4 x double> [[RES]], i64 2
; CHECK-NEXT: ret double [[ELT]]		; CHECK-NEXT: ret double [[ELT]]
;		;
%ptrs = getelementptr double, double* %base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>		%ptrs = getelementptr double, double* %base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>
%res = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double*> %ptrs, i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)		%res = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double*> %ptrs, i32 4, <4 x i1> <i1 true, i1 false, i1 true, i1 true>, <4 x double> undef)
%elt = extractelement <4 x double> %res, i64 2		%elt = extractelement <4 x double> %res, i64 2
ret double %elt		ret double %elt
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines
;		;
%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> zeroinitializer, <2 x double> %passthru)		%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> zeroinitializer, <2 x double> %passthru)
ret <2 x double> %res		ret <2 x double> %res
}		}


define <2 x double> @gather_onemask(<2 x double*> %ptrs, <2 x double> %passthru) {		define <2 x double> @gather_onemask(<2 x double*> %ptrs, <2 x double> %passthru) {
; CHECK-LABEL: @gather_onemask(		; CHECK-LABEL: @gather_onemask(
; CHECK-NEXT: [[RES:%.]] = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double> [[PTRS:%.*]], i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> undef)		; CHECK-NEXT: [[RES:%.]] = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double> [[PTRS:%.*]], i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> poison)
; CHECK-NEXT: ret <2 x double> [[RES]]		; CHECK-NEXT: ret <2 x double> [[RES]]
;		;
%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> %passthru)		%res = call <2 x double> @llvm.masked.gather.v2f64.v2p0f64(<2 x double*> %ptrs, i32 4, <2 x i1> <i1 true, i1 true>, <2 x double> %passthru)
ret <2 x double> %res		ret <2 x double> %res
}		}

define <4 x double> @gather_lane2(double* %base, double %pt) {		define <4 x double> @gather_lane2(double* %base, double %pt) {
; CHECK-LABEL: @gather_lane2(		; CHECK-LABEL: @gather_lane2(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 undef, i64 undef, i64 2, i64 undef>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[BASE:%.*]], <4 x i64> <i64 poison, i64 poison, i64 2, i64 poison>
; CHECK-NEXT: [[PT_V1:%.]] = insertelement <4 x double> undef, double [[PT:%.]], i64 0		; CHECK-NEXT: [[PT_V1:%.]] = insertelement <4 x double> undef, double [[PT:%.]], i64 0
; CHECK-NEXT: [[PT_V2:%.*]] = shufflevector <4 x double> [[PT_V1]], <4 x double> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 0>		; CHECK-NEXT: [[PT_V2:%.*]] = shufflevector <4 x double> [[PT_V1]], <4 x double> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 0>
; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 false, i1 false, i1 true, i1 false>, <4 x double> [[PT_V2]])		; CHECK-NEXT: [[RES:%.]] = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64(<4 x double> [[PTRS]], i32 4, <4 x i1> <i1 false, i1 false, i1 true, i1 false>, <4 x double> [[PT_V2]])
; CHECK-NEXT: ret <4 x double> [[RES]]		; CHECK-NEXT: ret <4 x double> [[RES]]
;		;
%ptrs = getelementptr double, double *%base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>		%ptrs = getelementptr double, double *%base, <4 x i64> <i64 0, i64 1, i64 2, i64 3>
%pt_v1 = insertelement <4 x double> undef, double %pt, i64 0		%pt_v1 = insertelement <4 x double> undef, double %pt, i64 0
%pt_v2 = shufflevector <4 x double> %pt_v1, <4 x double> undef, <4 x i32> zeroinitializer		%pt_v2 = shufflevector <4 x double> %pt_v1, <4 x double> undef, <4 x i32> zeroinitializer
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %val, <2 x double*> %ptrs, i32 8, <2 x i1> zeroinitializer)		call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %val, <2 x double*> %ptrs, i32 8, <2 x i1> zeroinitializer)
ret void		ret void
}		}

define void @scatter_demandedelts(double* %ptr, double %val) {		define void @scatter_demandedelts(double* %ptr, double %val) {
; CHECK-LABEL: @scatter_demandedelts(		; CHECK-LABEL: @scatter_demandedelts(
; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[PTR:%.*]], <2 x i64> <i64 0, i64 undef>		; CHECK-NEXT: [[PTRS:%.]] = getelementptr double, double [[PTR:%.*]], <2 x i64> <i64 0, i64 poison>
; CHECK-NEXT: [[VALVEC1:%.]] = insertelement <2 x double> undef, double [[VAL:%.]], i32 0		; CHECK-NEXT: [[VALVEC1:%.]] = insertelement <2 x double> undef, double [[VAL:%.]], i32 0
; CHECK-NEXT: call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> [[VALVEC1]], <2 x double*> [[PTRS]], i32 8, <2 x i1> <i1 true, i1 false>)		; CHECK-NEXT: call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> [[VALVEC1]], <2 x double*> [[PTRS]], i32 8, <2 x i1> <i1 true, i1 false>)
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
%ptrs = getelementptr double, double* %ptr, <2 x i64> <i64 0, i64 1>		%ptrs = getelementptr double, double* %ptr, <2 x i64> <i64 0, i64 1>
%valvec1 = insertelement <2 x double> undef, double %val, i32 0		%valvec1 = insertelement <2 x double> undef, double %val, i32 0
%valvec2 = insertelement <2 x double> %valvec1, double %val, i32 1		%valvec2 = insertelement <2 x double> %valvec1, double %val, i32 1
call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %valvec2, <2 x double*> %ptrs, i32 8, <2 x i1> <i1 true, i1 false>)		call void @llvm.masked.scatter.v2f64.v2p0f64(<2 x double> %valvec2, <2 x double*> %ptrs, i32 8, <2 x i1> <i1 true, i1 false>)
ret void		ret void
}		}

llvm/test/Transforms/InstCombine/minmax-fold.ll

Show First 20 Lines • Show All 1,446 Lines • ▼ Show 20 Lines	;
%u7 = select i1 %u6, i32 %u5, i32 55		%u7 = select i1 %u6, i32 %u5, i32 55
%r = trunc i32 %u7 to i8		%r = trunc i32 %u7 to i8
ret i8 %r		ret i8 %r
}		}

define i8 @PR46271(<2 x i8> %x) {		define i8 @PR46271(<2 x i8> %x) {
; CHECK-LABEL: @PR46271(		; CHECK-LABEL: @PR46271(
; CHECK-NEXT: [[A:%.]] = icmp sgt <2 x i8> [[X:%.]], <i8 -1, i8 -1>		; CHECK-NEXT: [[A:%.]] = icmp sgt <2 x i8> [[X:%.]], <i8 -1, i8 -1>
; CHECK-NEXT: [[B:%.*]] = select <2 x i1> [[A]], <2 x i8> [[X]], <2 x i8> <i8 undef, i8 -1>		; CHECK-NEXT: [[B:%.*]] = select <2 x i1> [[A]], <2 x i8> [[X]], <2 x i8> <i8 poison, i8 -1>
; CHECK-NEXT: [[TMP1:%.*]] = extractelement <2 x i8> [[B]], i32 1		; CHECK-NEXT: [[TMP1:%.*]] = extractelement <2 x i8> [[B]], i32 1
; CHECK-NEXT: [[R:%.*]] = xor i8 [[TMP1]], -1		; CHECK-NEXT: [[R:%.*]] = xor i8 [[TMP1]], -1
; CHECK-NEXT: ret i8 [[R]]		; CHECK-NEXT: ret i8 [[R]]
;		;
%a = icmp sgt <2 x i8> %x, <i8 -1, i8 -1>		%a = icmp sgt <2 x i8> %x, <i8 -1, i8 -1>
%b = select <2 x i1> %a, <2 x i8> %x, <2 x i8> <i8 undef, i8 -1>		%b = select <2 x i1> %a, <2 x i8> %x, <2 x i8> <i8 undef, i8 -1>
%not = xor <2 x i8> %b, <i8 undef, i8 -1>		%not = xor <2 x i8> %b, <i8 undef, i8 -1>
%r = extractelement <2 x i8> %not, i32 1		%r = extractelement <2 x i8> %not, i32 1
ret i8 %r		ret i8 %r
}		}

llvm/test/Transforms/InstCombine/pr2645-0.ll

	; RUN: opt < %s -instcombine -S \| grep "insertelement <4 x float> undef"			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt < %s -instcombine -S \| grep "insertelement <4 x float> poison"

	; Instcombine should be able to prove that none of the			; Instcombine should be able to prove that none of the
	; insertelement's first operand's elements are needed.			; insertelement's first operand's elements are needed.

	define internal void @""(i8*) {			define internal void @""(i8*) {
	; <label>:1			; <label>:1
	bitcast i8* %0 to i32* ; <i32*>:2 [#uses=1]			bitcast i8* %0 to i32* ; <i32*>:2 [#uses=1]
	load i32, i32* %2, align 1 ; <i32>:3 [#uses=1]			load i32, i32* %2, align 1 ; <i32>:3 [#uses=1]
	Show All 24 Lines

llvm/test/Transforms/InstCombine/shuffle_select.ll

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	;
%s = shufflevector <4 x i32> %b, <4 x i32> %v, <4 x i32> <i32 0, i32 undef, i32 2, i32 7>		%s = shufflevector <4 x i32> %b, <4 x i32> %v, <4 x i32> <i32 0, i32 undef, i32 2, i32 7>
ret <4 x i32> %s		ret <4 x i32> %s
}		}

; Constant operand 0 (LHS) could work for some non-commutative binops?		; Constant operand 0 (LHS) could work for some non-commutative binops?

define <4 x i32> @sub(<4 x i32> %v) {		define <4 x i32> @sub(<4 x i32> %v) {
; CHECK-LABEL: @sub(		; CHECK-LABEL: @sub(
; CHECK-NEXT: [[B:%.]] = sub <4 x i32> <i32 undef, i32 undef, i32 undef, i32 14>, [[V:%.]]		; CHECK-NEXT: [[B:%.]] = sub <4 x i32> <i32 poison, i32 poison, i32 poison, i32 14>, [[V:%.]]
; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x i32> [[V]], <4 x i32> [[B]], <4 x i32> <i32 0, i32 1, i32 2, i32 7>		; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x i32> [[V]], <4 x i32> [[B]], <4 x i32> <i32 0, i32 1, i32 2, i32 7>
; CHECK-NEXT: ret <4 x i32> [[S]]		; CHECK-NEXT: ret <4 x i32> [[S]]
;		;
%b = sub <4 x i32> <i32 11, i32 12, i32 13, i32 14>, %v		%b = sub <4 x i32> <i32 11, i32 12, i32 13, i32 14>, %v
%s = shufflevector <4 x i32> %v, <4 x i32> %b, <4 x i32> <i32 0, i32 1, i32 2, i32 7>		%s = shufflevector <4 x i32> %v, <4 x i32> %b, <4 x i32> <i32 0, i32 1, i32 2, i32 7>
ret <4 x i32> %s		ret <4 x i32> %s
}		}

▲ Show 20 Lines • Show All 279 Lines • ▼ Show 20 Lines
;		;
%b = fadd <4 x float> %v, <float 41.0, float 42.0, float 43.0, float 44.0>		%b = fadd <4 x float> %v, <float 41.0, float 42.0, float 43.0, float 44.0>
%s = shufflevector <4 x float> %b, <4 x float> %v, <4 x i32> <i32 0, i32 1, i32 6, i32 7>		%s = shufflevector <4 x float> %b, <4 x float> %v, <4 x i32> <i32 0, i32 1, i32 6, i32 7>
ret <4 x float> %s		ret <4 x float> %s
}		}

define <4 x double> @fsub(<4 x double> %v) {		define <4 x double> @fsub(<4 x double> %v) {
; CHECK-LABEL: @fsub(		; CHECK-LABEL: @fsub(
; CHECK-NEXT: [[B:%.]] = fsub <4 x double> <double undef, double undef, double 4.300000e+01, double 4.400000e+01>, [[V:%.]]		; CHECK-NEXT: [[B:%.]] = fsub <4 x double> <double poison, double poison, double 4.300000e+01, double 4.400000e+01>, [[V:%.]]
; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[V]], <4 x double> [[B]], <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[V]], <4 x double> [[B]], <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x double> [[S]]		; CHECK-NEXT: ret <4 x double> [[S]]
;		;
%b = fsub <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v		%b = fsub <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v
%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
ret <4 x double> %s		ret <4 x double> %s
}		}

; Propagate any FMF.		; Propagate any FMF.

define <4 x float> @fmul(<4 x float> %v) {		define <4 x float> @fmul(<4 x float> %v) {
; CHECK-LABEL: @fmul(		; CHECK-LABEL: @fmul(
; CHECK-NEXT: [[S:%.]] = fmul nnan ninf <4 x float> [[V:%.]], <float 4.100000e+01, float 1.000000e+00, float 1.000000e+00, float 1.000000e+00>		; CHECK-NEXT: [[S:%.]] = fmul nnan ninf <4 x float> [[V:%.]], <float 4.100000e+01, float 1.000000e+00, float 1.000000e+00, float 1.000000e+00>
; CHECK-NEXT: ret <4 x float> [[S]]		; CHECK-NEXT: ret <4 x float> [[S]]
;		;
%b = fmul nnan ninf <4 x float> %v, <float 41.0, float 42.0, float 43.0, float 44.0>		%b = fmul nnan ninf <4 x float> %v, <float 41.0, float 42.0, float 43.0, float 44.0>
%s = shufflevector <4 x float> %b, <4 x float> %v, <4 x i32> <i32 0, i32 5, i32 6, i32 7>		%s = shufflevector <4 x float> %b, <4 x float> %v, <4 x i32> <i32 0, i32 5, i32 6, i32 7>
ret <4 x float> %s		ret <4 x float> %s
}		}

define <4 x double> @fdiv_constant_op0(<4 x double> %v) {		define <4 x double> @fdiv_constant_op0(<4 x double> %v) {
; CHECK-LABEL: @fdiv_constant_op0(		; CHECK-LABEL: @fdiv_constant_op0(
; CHECK-NEXT: [[B:%.]] = fdiv fast <4 x double> <double undef, double undef, double 4.300000e+01, double 4.400000e+01>, [[V:%.]]		; CHECK-NEXT: [[B:%.]] = fdiv fast <4 x double> <double poison, double poison, double 4.300000e+01, double 4.400000e+01>, [[V:%.]]
; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[V]], <4 x double> [[B]], <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[V]], <4 x double> [[B]], <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x double> [[S]]		; CHECK-NEXT: ret <4 x double> [[S]]
;		;
%b = fdiv fast <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v		%b = fdiv fast <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v
%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
ret <4 x double> %s		ret <4 x double> %s
}		}

define <4 x double> @fdiv_constant_op1(<4 x double> %v) {		define <4 x double> @fdiv_constant_op1(<4 x double> %v) {
; CHECK-LABEL: @fdiv_constant_op1(		; CHECK-LABEL: @fdiv_constant_op1(
; CHECK-NEXT: [[S:%.]] = fdiv reassoc <4 x double> [[V:%.]], <double undef, double 1.000000e+00, double 4.300000e+01, double 4.400000e+01>		; CHECK-NEXT: [[S:%.]] = fdiv reassoc <4 x double> [[V:%.]], <double undef, double 1.000000e+00, double 4.300000e+01, double 4.400000e+01>
; CHECK-NEXT: ret <4 x double> [[S]]		; CHECK-NEXT: ret <4 x double> [[S]]
;		;
%b = fdiv reassoc <4 x double> %v, <double 41.0, double 42.0, double 43.0, double 44.0>		%b = fdiv reassoc <4 x double> %v, <double 41.0, double 42.0, double 43.0, double 44.0>
%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		%s = shufflevector <4 x double> %v, <4 x double> %b, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
ret <4 x double> %s		ret <4 x double> %s
}		}

define <4 x double> @frem(<4 x double> %v) {		define <4 x double> @frem(<4 x double> %v) {
; CHECK-LABEL: @frem(		; CHECK-LABEL: @frem(
; CHECK-NEXT: [[B:%.]] = frem <4 x double> <double 4.100000e+01, double 4.200000e+01, double undef, double undef>, [[V:%.]]		; CHECK-NEXT: [[B:%.]] = frem <4 x double> <double 4.100000e+01, double 4.200000e+01, double poison, double poison>, [[V:%.]]
; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[B]], <4 x double> [[V]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[S:%.*]] = shufflevector <4 x double> [[B]], <4 x double> [[V]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x double> [[S]]		; CHECK-NEXT: ret <4 x double> [[S]]
;		;
%b = frem <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v		%b = frem <4 x double> <double 41.0, double 42.0, double 43.0, double 44.0>, %v
%s = shufflevector <4 x double> %b, <4 x double> %v, <4 x i32> <i32 0, i32 1, i32 6, i32 7>		%s = shufflevector <4 x double> %b, <4 x double> %v, <4 x i32> <i32 0, i32 1, i32 6, i32 7>
ret <4 x double> %s		ret <4 x double> %s
}		}

▲ Show 20 Lines • Show All 382 Lines • ▼ Show 20 Lines	;
%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
ret <4 x double> %t3		ret <4 x double> %t3
}		}

; The variable operand must be either the first operand or second operand in both binops.		; The variable operand must be either the first operand or second operand in both binops.

define <4 x double> @frem_frem(<4 x double> %v0) {		define <4 x double> @frem_frem(<4 x double> %v0) {
; CHECK-LABEL: @frem_frem(		; CHECK-LABEL: @frem_frem(
; CHECK-NEXT: [[T1:%.]] = frem <4 x double> <double 1.000000e+00, double 2.000000e+00, double undef, double undef>, [[V0:%.]]		; CHECK-NEXT: [[T1:%.]] = frem <4 x double> <double 1.000000e+00, double 2.000000e+00, double poison, double poison>, [[V0:%.]]
; CHECK-NEXT: [[T2:%.*]] = frem <4 x double> [[V0]], <double undef, double undef, double 7.000000e+00, double 8.000000e+00>		; CHECK-NEXT: [[T2:%.*]] = frem <4 x double> [[V0]], <double poison, double poison, double 7.000000e+00, double 8.000000e+00>
; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x double> [[T1]], <4 x double> [[T2]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x double> [[T1]], <4 x double> [[T2]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x double> [[T3]]		; CHECK-NEXT: ret <4 x double> [[T3]]
;		;
%t1 = frem <4 x double> <double 1.0, double 2.0, double 3.0, double 4.0>, %v0		%t1 = frem <4 x double> <double 1.0, double 2.0, double 3.0, double 4.0>, %v0
%t2 = frem <4 x double> %v0, <double 5.0, double 6.0, double 7.0, double 8.0>		%t2 = frem <4 x double> %v0, <double 5.0, double 6.0, double 7.0, double 8.0>
%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 0, i32 1, i32 6, i32 7>		%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 0, i32 1, i32 6, i32 7>
ret <4 x double> %t3		ret <4 x double> %t3
}		}
▲ Show 20 Lines • Show All 475 Lines • ▼ Show 20 Lines	;
%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>		%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 undef, i32 1, i32 6, i32 7>
ret <4 x double> %t3		ret <4 x double> %t3
}		}

; The variable operand must be either the first operand or second operand in both binops.		; The variable operand must be either the first operand or second operand in both binops.

define <4 x double> @fdiv_2_vars(<4 x double> %v0, <4 x double> %v1) {		define <4 x double> @fdiv_2_vars(<4 x double> %v0, <4 x double> %v1) {
; CHECK-LABEL: @fdiv_2_vars(		; CHECK-LABEL: @fdiv_2_vars(
; CHECK-NEXT: [[T1:%.]] = fdiv <4 x double> <double 1.000000e+00, double 2.000000e+00, double undef, double undef>, [[V0:%.]]		; CHECK-NEXT: [[T1:%.]] = fdiv <4 x double> <double 1.000000e+00, double 2.000000e+00, double poison, double poison>, [[V0:%.]]
; CHECK-NEXT: [[T2:%.]] = fdiv <4 x double> [[V1:%.]], <double undef, double undef, double 7.000000e+00, double 8.000000e+00>		; CHECK-NEXT: [[T2:%.]] = fdiv <4 x double> [[V1:%.]], <double poison, double poison, double 7.000000e+00, double 8.000000e+00>
; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x double> [[T1]], <4 x double> [[T2]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x double> [[T1]], <4 x double> [[T2]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x double> [[T3]]		; CHECK-NEXT: ret <4 x double> [[T3]]
;		;
%t1 = fdiv <4 x double> <double 1.0, double 2.0, double 3.0, double 4.0>, %v0		%t1 = fdiv <4 x double> <double 1.0, double 2.0, double 3.0, double 4.0>, %v0
%t2 = fdiv <4 x double> %v1, <double 5.0, double 6.0, double 7.0, double 8.0>		%t2 = fdiv <4 x double> %v1, <double 5.0, double 6.0, double 7.0, double 8.0>
%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 0, i32 1, i32 6, i32 7>		%t3 = shufflevector <4 x double> %t1, <4 x double> %t2, <4 x i32> <i32 0, i32 1, i32 6, i32 7>
ret <4 x double> %t3		ret <4 x double> %t3
}		}
Show All 37 Lines	;
ret <4 x i32> %t3		ret <4 x i32> %t3
}		}

; Negative test: shift amount (operand 1) must be constant.		; Negative test: shift amount (operand 1) must be constant.

define <4 x i32> @shl_mul_not_constant_shift_amount(<4 x i32> %v0) {		define <4 x i32> @shl_mul_not_constant_shift_amount(<4 x i32> %v0) {
; CHECK-LABEL: @shl_mul_not_constant_shift_amount(		; CHECK-LABEL: @shl_mul_not_constant_shift_amount(
; CHECK-NEXT: [[T1:%.]] = shl <4 x i32> <i32 1, i32 2, i32 3, i32 4>, [[V0:%.]]		; CHECK-NEXT: [[T1:%.]] = shl <4 x i32> <i32 1, i32 2, i32 3, i32 4>, [[V0:%.]]
; CHECK-NEXT: [[T2:%.*]] = mul <4 x i32> [[V0]], <i32 5, i32 6, i32 undef, i32 undef>		; CHECK-NEXT: [[T2:%.*]] = mul <4 x i32> [[V0]], <i32 5, i32 6, i32 poison, i32 poison>
; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x i32> [[T2]], <4 x i32> [[T1]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x i32> [[T2]], <4 x i32> [[T1]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x i32> [[T3]]		; CHECK-NEXT: ret <4 x i32> [[T3]]
;		;
%t1 = shl <4 x i32> <i32 1, i32 2, i32 3, i32 4>, %v0		%t1 = shl <4 x i32> <i32 1, i32 2, i32 3, i32 4>, %v0
%t2 = mul <4 x i32> %v0, <i32 5, i32 6, i32 7, i32 8>		%t2 = mul <4 x i32> %v0, <i32 5, i32 6, i32 7, i32 8>
%t3 = shufflevector <4 x i32> %t1, <4 x i32> %t2, <4 x i32> <i32 4, i32 5, i32 2, i32 3>		%t3 = shufflevector <4 x i32> %t1, <4 x i32> %t2, <4 x i32> <i32 4, i32 5, i32 2, i32 3>
ret <4 x i32> %t3		ret <4 x i32> %t3
}		}
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	;
ret <4 x i8> %t3		ret <4 x i8> %t3
}		}

; Negative test: not all 'or' insts can be converted to 'add'.		; Negative test: not all 'or' insts can be converted to 'add'.

define <4 x i8> @or_add_not_enough_masking(<4 x i8> %v) {		define <4 x i8> @or_add_not_enough_masking(<4 x i8> %v) {
; CHECK-LABEL: @or_add_not_enough_masking(		; CHECK-LABEL: @or_add_not_enough_masking(
; CHECK-NEXT: [[V0:%.]] = lshr <4 x i8> [[V:%.]], <i8 1, i8 1, i8 1, i8 1>		; CHECK-NEXT: [[V0:%.]] = lshr <4 x i8> [[V:%.]], <i8 1, i8 1, i8 1, i8 1>
; CHECK-NEXT: [[T1:%.*]] = or <4 x i8> [[V0]], <i8 undef, i8 undef, i8 -64, i8 -64>		; CHECK-NEXT: [[T1:%.*]] = or <4 x i8> [[V0]], <i8 poison, i8 poison, i8 -64, i8 -64>
; CHECK-NEXT: [[T2:%.*]] = add <4 x i8> [[V0]], <i8 1, i8 2, i8 undef, i8 undef>		; CHECK-NEXT: [[T2:%.*]] = add <4 x i8> [[V0]], <i8 1, i8 2, i8 poison, i8 poison>
; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x i8> [[T2]], <4 x i8> [[T1]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>		; CHECK-NEXT: [[T3:%.*]] = shufflevector <4 x i8> [[T2]], <4 x i8> [[T1]], <4 x i32> <i32 0, i32 1, i32 6, i32 7>
; CHECK-NEXT: ret <4 x i8> [[T3]]		; CHECK-NEXT: ret <4 x i8> [[T3]]
;		;
%v0 = lshr <4 x i8> %v, <i8 1, i8 1, i8 1, i8 1> ; clear not enough top bits		%v0 = lshr <4 x i8> %v, <i8 1, i8 1, i8 1, i8 1> ; clear not enough top bits
%t1 = or <4 x i8> %v0, <i8 192, i8 192, i8 192, i8 192> ; set some top bits		%t1 = or <4 x i8> %v0, <i8 192, i8 192, i8 192, i8 192> ; set some top bits
%t2 = add nsw nuw <4 x i8> %v0, <i8 1, i8 2, i8 3, i8 4> ; this can't be converted to 'or'		%t2 = add nsw nuw <4 x i8> %v0, <i8 1, i8 2, i8 3, i8 4> ; this can't be converted to 'or'
%t3 = shufflevector <4 x i8> %t1, <4 x i8> %t2, <4 x i32> <i32 4, i32 5, i32 2, i32 3>		%t3 = shufflevector <4 x i8> %t1, <4 x i8> %t2, <4 x i32> <i32 4, i32 5, i32 2, i32 3>
ret <4 x i8> %t3		ret <4 x i8> %t3
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/shufflevector-div-rem-inseltpoison.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S -o - \| FileCheck %s			; RUN: opt < %s -instcombine -S -o - \| FileCheck %s

	; This test case was added as a reproducer for a miscompile, where instcombine			; This test case was added as a reproducer for a miscompile, where instcombine
	; introduced an			; introduced an
	; srem <2 x i16> %1, <i16 undef, i16 2>			; srem <2 x i16> %1, <i16 undef, i16 2>
	; instruction, which makes the whole srem undefined (even if we only end up			; instruction, which makes the whole srem undefined (even if we only end up
	; extracting the second element in the vector).			; extracting the second element in the vector).
	define i16 @test_srem_orig(i16 %a, i1 %cmp) {			define i16 @test_srem_orig(i16 %a, i1 %cmp) {
	; CHECK-LABEL: @test_srem_orig(			; CHECK-LABEL: @test_srem_orig(
	; CHECK-NEXT: [[SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[A:%.]], i32 0			; CHECK-NEXT: [[SPLATINSERT:%.]] = insertelement <2 x i16> poison, i16 [[A:%.]], i32 0
	; CHECK-NEXT: [[TMP1:%.*]] = srem <2 x i16> [[SPLATINSERT]], <i16 2, i16 1>			; CHECK-NEXT: [[TMP1:%.*]] = srem <2 x i16> [[SPLATINSERT]], <i16 2, i16 1>
	; CHECK-NEXT: [[SPLAT_OP:%.*]] = shufflevector <2 x i16> [[TMP1]], <2 x i16> undef, <2 x i32> <i32 undef, i32 0>			; CHECK-NEXT: [[SPLAT_OP:%.*]] = shufflevector <2 x i16> [[TMP1]], <2 x i16> undef, <2 x i32> <i32 undef, i32 0>
	; CHECK-NEXT: [[T2:%.]] = select i1 [[CMP:%.]], <2 x i16> <i16 undef, i16 1>, <2 x i16> [[SPLAT_OP]]			; CHECK-NEXT: [[T2:%.]] = select i1 [[CMP:%.]], <2 x i16> <i16 poison, i16 1>, <2 x i16> [[SPLAT_OP]]
	; CHECK-NEXT: [[T3:%.*]] = extractelement <2 x i16> [[T2]], i32 1			; CHECK-NEXT: [[T3:%.*]] = extractelement <2 x i16> [[T2]], i32 1
	; CHECK-NEXT: ret i16 [[T3]]			; CHECK-NEXT: ret i16 [[T3]]
	;			;
	%splatinsert = insertelement <2 x i16> poison, i16 %a, i32 0			%splatinsert = insertelement <2 x i16> poison, i16 %a, i32 0
	%splat = shufflevector <2 x i16> %splatinsert, <2 x i16> undef, <2 x i32> zeroinitializer			%splat = shufflevector <2 x i16> %splatinsert, <2 x i16> undef, <2 x i32> zeroinitializer
	%t1 = select i1 %cmp, <2 x i16> <i16 1, i16 1>, <2 x i16> %splat			%t1 = select i1 %cmp, <2 x i16> <i16 1, i16 1>, <2 x i16> %splat
	%t2 = srem <2 x i16> %t1, <i16 2, i16 2>			%t2 = srem <2 x i16> %t1, <i16 2, i16 2>
	%t3 = extractelement <2 x i16> %t2, i32 1			%t3 = extractelement <2 x i16> %t2, i32 1
	▲ Show 20 Lines • Show All 100 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/shufflevector-div-rem.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S -o - \| FileCheck %s			; RUN: opt < %s -instcombine -S -o - \| FileCheck %s

	; This test case was added as a reproducer for a miscompile, where instcombine			; This test case was added as a reproducer for a miscompile, where instcombine
	; introduced an			; introduced an
	; srem <2 x i16> %1, <i16 undef, i16 2>			; srem <2 x i16> %1, <i16 undef, i16 2>
	; instruction, which makes the whole srem undefined (even if we only end up			; instruction, which makes the whole srem undefined (even if we only end up
	; extracting the second element in the vector).			; extracting the second element in the vector).
	define i16 @test_srem_orig(i16 %a, i1 %cmp) {			define i16 @test_srem_orig(i16 %a, i1 %cmp) {
	; CHECK-LABEL: @test_srem_orig(			; CHECK-LABEL: @test_srem_orig(
	; CHECK-NEXT: [[SPLATINSERT:%.]] = insertelement <2 x i16> undef, i16 [[A:%.]], i32 0			; CHECK-NEXT: [[SPLATINSERT:%.]] = insertelement <2 x i16> undef, i16 [[A:%.]], i32 0
	; CHECK-NEXT: [[TMP1:%.*]] = srem <2 x i16> [[SPLATINSERT]], <i16 2, i16 1>			; CHECK-NEXT: [[TMP1:%.*]] = srem <2 x i16> [[SPLATINSERT]], <i16 2, i16 1>
	; CHECK-NEXT: [[SPLAT_OP:%.*]] = shufflevector <2 x i16> [[TMP1]], <2 x i16> undef, <2 x i32> <i32 undef, i32 0>			; CHECK-NEXT: [[SPLAT_OP:%.*]] = shufflevector <2 x i16> [[TMP1]], <2 x i16> undef, <2 x i32> <i32 undef, i32 0>
	; CHECK-NEXT: [[T2:%.]] = select i1 [[CMP:%.]], <2 x i16> <i16 undef, i16 1>, <2 x i16> [[SPLAT_OP]]			; CHECK-NEXT: [[T2:%.]] = select i1 [[CMP:%.]], <2 x i16> <i16 poison, i16 1>, <2 x i16> [[SPLAT_OP]]
	; CHECK-NEXT: [[T3:%.*]] = extractelement <2 x i16> [[T2]], i32 1			; CHECK-NEXT: [[T3:%.*]] = extractelement <2 x i16> [[T2]], i32 1
	; CHECK-NEXT: ret i16 [[T3]]			; CHECK-NEXT: ret i16 [[T3]]
	;			;
	%splatinsert = insertelement <2 x i16> undef, i16 %a, i32 0			%splatinsert = insertelement <2 x i16> undef, i16 %a, i32 0
	%splat = shufflevector <2 x i16> %splatinsert, <2 x i16> undef, <2 x i32> zeroinitializer			%splat = shufflevector <2 x i16> %splatinsert, <2 x i16> undef, <2 x i32> zeroinitializer
	%t1 = select i1 %cmp, <2 x i16> <i16 1, i16 1>, <2 x i16> %splat			%t1 = select i1 %cmp, <2 x i16> <i16 1, i16 1>, <2 x i16> %splat
	%t2 = srem <2 x i16> %t1, <i16 2, i16 2>			%t2 = srem <2 x i16> %t1, <i16 2, i16 2>
	%t3 = extractelement <2 x i16> %t2, i32 1			%t3 = extractelement <2 x i16> %t2, i32 1
	▲ Show 20 Lines • Show All 100 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/sub-of-negatible.ll

Show First 20 Lines • Show All 825 Lines • ▼ Show 20 Lines	;
%t0 = shl <2 x i4> <i4 -6, i4 5>, %x		%t0 = shl <2 x i4> <i4 -6, i4 5>, %x
%t1 = shufflevector <2 x i4> %t0, <2 x i4> undef, <2 x i32> <i32 0, i32 2>		%t1 = shufflevector <2 x i4> %t0, <2 x i4> undef, <2 x i32> <i32 0, i32 2>
%t2 = sub <2 x i4> %y, %t1		%t2 = sub <2 x i4> %y, %t1
ret <2 x i4> %t2		ret <2 x i4> %t2
}		}
define <2 x i4> @negate_shufflevector_twoinputs(<2 x i4> %x, <2 x i4> %y, <2 x i4> %z) {		define <2 x i4> @negate_shufflevector_twoinputs(<2 x i4> %x, <2 x i4> %y, <2 x i4> %z) {
; CHECK-LABEL: @negate_shufflevector_twoinputs(		; CHECK-LABEL: @negate_shufflevector_twoinputs(
; CHECK-NEXT: [[T0_NEG:%.]] = shl <2 x i4> <i4 6, i4 -5>, [[X:%.]]		; CHECK-NEXT: [[T0_NEG:%.]] = shl <2 x i4> <i4 6, i4 -5>, [[X:%.]]
; CHECK-NEXT: [[T1_NEG:%.]] = add <2 x i4> [[Y:%.]], <i4 undef, i4 1>		; CHECK-NEXT: [[T1_NEG:%.]] = add <2 x i4> [[Y:%.]], <i4 poison, i4 1>
; CHECK-NEXT: [[T2_NEG:%.*]] = shufflevector <2 x i4> [[T0_NEG]], <2 x i4> [[T1_NEG]], <2 x i32> <i32 0, i32 3>		; CHECK-NEXT: [[T2_NEG:%.*]] = shufflevector <2 x i4> [[T0_NEG]], <2 x i4> [[T1_NEG]], <2 x i32> <i32 0, i32 3>
; CHECK-NEXT: [[T3:%.]] = add <2 x i4> [[T2_NEG]], [[Z:%.]]		; CHECK-NEXT: [[T3:%.]] = add <2 x i4> [[T2_NEG]], [[Z:%.]]
; CHECK-NEXT: ret <2 x i4> [[T3]]		; CHECK-NEXT: ret <2 x i4> [[T3]]
;		;
%t0 = shl <2 x i4> <i4 -6, i4 5>, %x		%t0 = shl <2 x i4> <i4 -6, i4 5>, %x
%t1 = xor <2 x i4> %y, <i4 -1, i4 -1>		%t1 = xor <2 x i4> %y, <i4 -1, i4 -1>
%t2 = shufflevector <2 x i4> %t0, <2 x i4> %t1, <2 x i32> <i32 0, i32 3>		%t2 = shufflevector <2 x i4> %t0, <2 x i4> %t1, <2 x i32> <i32 0, i32 3>
%t3 = sub <2 x i4> %z, %t2		%t3 = sub <2 x i4> %z, %t2
▲ Show 20 Lines • Show All 564 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/trunc.ll

Show First 20 Lines • Show All 902 Lines • ▼ Show 20 Lines	;
ret <8 x i16> %conv		ret <8 x i16> %conv
}		}

; Although the mask is the same value, we don't create a shuffle for types that the backend may not be able to handle:		; Although the mask is the same value, we don't create a shuffle for types that the backend may not be able to handle:
; trunc (shuffle X, C, Mask) --> shuffle (trunc X), C', Mask		; trunc (shuffle X, C, Mask) --> shuffle (trunc X), C', Mask

define <4 x i8> @wide_shuf(<4 x i32> %x) {		define <4 x i8> @wide_shuf(<4 x i32> %x) {
; CHECK-LABEL: @wide_shuf(		; CHECK-LABEL: @wide_shuf(
; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x i32> [[X:%.]], <4 x i32> <i32 undef, i32 3634, i32 90, i32 undef>, <4 x i32> <i32 1, i32 5, i32 6, i32 2>		; CHECK-NEXT: [[SHUF:%.]] = shufflevector <4 x i32> [[X:%.]], <4 x i32> <i32 poison, i32 3634, i32 90, i32 poison>, <4 x i32> <i32 1, i32 5, i32 6, i32 2>
; CHECK-NEXT: [[TRUNC:%.*]] = trunc <4 x i32> [[SHUF]] to <4 x i8>		; CHECK-NEXT: [[TRUNC:%.*]] = trunc <4 x i32> [[SHUF]] to <4 x i8>
; CHECK-NEXT: ret <4 x i8> [[TRUNC]]		; CHECK-NEXT: ret <4 x i8> [[TRUNC]]
;		;
%shuf = shufflevector <4 x i32> %x, <4 x i32> <i32 35, i32 3634, i32 90, i32 -1>, <4 x i32> <i32 1, i32 5, i32 6, i32 2>		%shuf = shufflevector <4 x i32> %x, <4 x i32> <i32 35, i32 3634, i32 90, i32 -1>, <4 x i32> <i32 1, i32 5, i32 6, i32 2>
%trunc = trunc <4 x i32> %shuf to <4 x i8>		%trunc = trunc <4 x i32> %shuf to <4 x i8>
ret <4 x i8> %trunc		ret <4 x i8> %trunc
}		}

▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vec_demanded_elts-inseltpoison.ll

Show First 20 Lines • Show All 69 Lines • ▼ Show 20 Lines
;		;
%shuffle.i = shufflevector <2 x float> %y, <2 x float> %y, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		%shuffle.i = shufflevector <2 x float> %y, <2 x float> %y, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
%shuffle9.i = shufflevector <4 x float> %x, <4 x float> %shuffle.i, <4 x i32> <i32 4, i32 5, i32 2, i32 3>		%shuffle9.i = shufflevector <4 x float> %x, <4 x float> %shuffle.i, <4 x i32> <i32 4, i32 5, i32 2, i32 3>
ret <4 x float> %shuffle9.i		ret <4 x float> %shuffle9.i
}		}

define <2 x float> @test_fptrunc(double %f) {		define <2 x float> @test_fptrunc(double %f) {
; CHECK-LABEL: @test_fptrunc(		; CHECK-LABEL: @test_fptrunc(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[F:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[F:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = fptrunc <2 x double> [[TMP1]] to <2 x float>		; CHECK-NEXT: [[TMP2:%.*]] = fptrunc <2 x double> [[TMP1]] to <2 x float>
; CHECK-NEXT: ret <2 x float> [[TMP2]]		; CHECK-NEXT: ret <2 x float> [[TMP2]]
;		;
%t9 = insertelement <4 x double> poison, double %f, i32 0		%t9 = insertelement <4 x double> poison, double %f, i32 0
%t10 = insertelement <4 x double> %t9, double 0.000000e+00, i32 1		%t10 = insertelement <4 x double> %t9, double 0.000000e+00, i32 1
%t11 = insertelement <4 x double> %t10, double 0.000000e+00, i32 2		%t11 = insertelement <4 x double> %t10, double 0.000000e+00, i32 2
%t12 = insertelement <4 x double> %t11, double 0.000000e+00, i32 3		%t12 = insertelement <4 x double> %t11, double 0.000000e+00, i32 3
%t5 = fptrunc <4 x double> %t12 to <4 x float>		%t5 = fptrunc <4 x double> %t12 to <4 x float>
%ret = shufflevector <4 x float> %t5, <4 x float> undef, <2 x i32> <i32 0, i32 1>		%ret = shufflevector <4 x float> %t5, <4 x float> undef, <2 x i32> <i32 0, i32 1>
ret <2 x float> %ret		ret <2 x float> %ret
}		}

define <2 x double> @test_fpext(float %f) {		define <2 x double> @test_fpext(float %f) {
; CHECK-LABEL: @test_fpext(		; CHECK-LABEL: @test_fpext(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float undef, float 0.000000e+00>, float [[F:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float poison, float 0.000000e+00>, float [[F:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = fpext <2 x float> [[TMP1]] to <2 x double>		; CHECK-NEXT: [[TMP2:%.*]] = fpext <2 x float> [[TMP1]] to <2 x double>
; CHECK-NEXT: ret <2 x double> [[TMP2]]		; CHECK-NEXT: ret <2 x double> [[TMP2]]
;		;
%t9 = insertelement <4 x float> poison, float %f, i32 0		%t9 = insertelement <4 x float> poison, float %f, i32 0
%t10 = insertelement <4 x float> %t9, float 0.000000e+00, i32 1		%t10 = insertelement <4 x float> %t9, float 0.000000e+00, i32 1
%t11 = insertelement <4 x float> %t10, float 0.000000e+00, i32 2		%t11 = insertelement <4 x float> %t10, float 0.000000e+00, i32 2
%t12 = insertelement <4 x float> %t11, float 0.000000e+00, i32 3		%t12 = insertelement <4 x float> %t11, float 0.000000e+00, i32 3
%t5 = fpext <4 x float> %t12 to <4 x double>		%t5 = fpext <4 x float> %t12 to <4 x double>
%ret = shufflevector <4 x double> %t5, <4 x double> undef, <2 x i32> <i32 0, i32 1>		%ret = shufflevector <4 x double> %t5, <4 x double> undef, <2 x i32> <i32 0, i32 1>
ret <2 x double> %ret		ret <2 x double> %ret
}		}

define <4 x double> @test_shuffle(<4 x double> %f) {		define <4 x double> @test_shuffle(<4 x double> %f) {
; CHECK-LABEL: @test_shuffle(		; CHECK-LABEL: @test_shuffle(
; CHECK-NEXT: [[RET1:%.]] = insertelement <4 x double> [[F:%.]], double 1.000000e+00, i32 3		; CHECK-NEXT: [[RET1:%.]] = insertelement <4 x double> [[F:%.]], double 1.000000e+00, i32 3
; CHECK-NEXT: ret <4 x double> [[RET1]]		; CHECK-NEXT: ret <4 x double> [[RET1]]
;		;
%ret = shufflevector <4 x double> %f, <4 x double> <double undef, double 1.0, double undef, double undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 5>		%ret = shufflevector <4 x double> %f, <4 x double> <double undef, double 1.0, double undef, double undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 5>
ret <4 x double> %ret		ret <4 x double> %ret
}		}

define <4 x float> @test_select(float %f, float %g) {		define <4 x float> @test_select(float %f, float %g) {
; CHECK-LABEL: @test_select(		; CHECK-LABEL: @test_select(
; CHECK-NEXT: [[A3:%.]] = insertelement <4 x float> <float undef, float undef, float undef, float 3.000000e+00>, float [[F:%.]], i32 0		; CHECK-NEXT: [[A3:%.]] = insertelement <4 x float> <float poison, float poison, float poison, float 3.000000e+00>, float [[F:%.]], i32 0
; CHECK-NEXT: [[RET:%.*]] = shufflevector <4 x float> [[A3]], <4 x float> <float undef, float 4.000000e+00, float 5.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>		; CHECK-NEXT: [[RET:%.*]] = shufflevector <4 x float> [[A3]], <4 x float> <float poison, float 4.000000e+00, float 5.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
; CHECK-NEXT: ret <4 x float> [[RET]]		; CHECK-NEXT: ret <4 x float> [[RET]]
;		;
%a0 = insertelement <4 x float> poison, float %f, i32 0		%a0 = insertelement <4 x float> poison, float %f, i32 0
%a1 = insertelement <4 x float> %a0, float 1.000000e+00, i32 1		%a1 = insertelement <4 x float> %a0, float 1.000000e+00, i32 1
%a2 = insertelement <4 x float> %a1, float 2.000000e+00, i32 2		%a2 = insertelement <4 x float> %a1, float 2.000000e+00, i32 2
%a3 = insertelement <4 x float> %a2, float 3.000000e+00, i32 3		%a3 = insertelement <4 x float> %a2, float 3.000000e+00, i32 3
%b0 = insertelement <4 x float> poison, float %g, i32 0		%b0 = insertelement <4 x float> poison, float %g, i32 0
%b1 = insertelement <4 x float> %b0, float 4.000000e+00, i32 1		%b1 = insertelement <4 x float> %b0, float 4.000000e+00, i32 1
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	;
%shuffle = shufflevector <4 x float> %out123, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>		%shuffle = shufflevector <4 x float> %out123, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>
ret <4 x float> %shuffle		ret <4 x float> %shuffle
}		}

; Test undef replacement in constant vector elements with binops.		; Test undef replacement in constant vector elements with binops.

define <3 x i8> @shuf_add(<3 x i8> %x) {		define <3 x i8> @shuf_add(<3 x i8> %x) {
; CHECK-LABEL: @shuf_add(		; CHECK-LABEL: @shuf_add(
; CHECK-NEXT: [[BO:%.]] = add <3 x i8> [[X:%.]], <i8 undef, i8 2, i8 3>		; CHECK-NEXT: [[BO:%.]] = add <3 x i8> [[X:%.]], <i8 poison, i8 2, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = add nsw <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = add nsw <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_sub(<3 x i8> %x) {		define <3 x i8> @shuf_sub(<3 x i8> %x) {
; CHECK-LABEL: @shuf_sub(		; CHECK-LABEL: @shuf_sub(
; CHECK-NEXT: [[BO:%.]] = sub <3 x i8> <i8 1, i8 undef, i8 3>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = sub <3 x i8> <i8 1, i8 poison, i8 3>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = sub nuw <3 x i8> <i8 1, i8 2, i8 3>, %x		%bo = sub nuw <3 x i8> <i8 1, i8 2, i8 3>, %x
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_mul(<3 x i8> %x) {		define <3 x i8> @shuf_mul(<3 x i8> %x) {
; CHECK-LABEL: @shuf_mul(		; CHECK-LABEL: @shuf_mul(
; CHECK-NEXT: [[BO:%.]] = mul <3 x i8> [[X:%.]], <i8 1, i8 undef, i8 3>		; CHECK-NEXT: [[BO:%.]] = mul <3 x i8> [[X:%.]], <i8 1, i8 poison, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = mul nsw <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = mul nsw <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_and(<3 x i8> %x) {		define <3 x i8> @shuf_and(<3 x i8> %x) {
; CHECK-LABEL: @shuf_and(		; CHECK-LABEL: @shuf_and(
; CHECK-NEXT: [[BO:%.]] = and <3 x i8> [[X:%.]], <i8 1, i8 2, i8 undef>		; CHECK-NEXT: [[BO:%.]] = and <3 x i8> [[X:%.]], <i8 1, i8 2, i8 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = and <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = and <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_or(<3 x i8> %x) {		define <3 x i8> @shuf_or(<3 x i8> %x) {
; CHECK-LABEL: @shuf_or(		; CHECK-LABEL: @shuf_or(
; CHECK-NEXT: [[BO:%.]] = or <3 x i8> [[X:%.]], <i8 1, i8 2, i8 undef>		; CHECK-NEXT: [[BO:%.]] = or <3 x i8> [[X:%.]], <i8 1, i8 2, i8 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = or <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = or <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_xor(<3 x i8> %x) {		define <3 x i8> @shuf_xor(<3 x i8> %x) {
; CHECK-LABEL: @shuf_xor(		; CHECK-LABEL: @shuf_xor(
; CHECK-NEXT: [[BO:%.]] = xor <3 x i8> [[X:%.]], <i8 1, i8 undef, i8 3>		; CHECK-NEXT: [[BO:%.]] = xor <3 x i8> [[X:%.]], <i8 1, i8 poison, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = xor <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = xor <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
;		;
%bo = urem <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = urem <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x float> @shuf_fadd(<3 x float> %x) {		define <3 x float> @shuf_fadd(<3 x float> %x) {
; CHECK-LABEL: @shuf_fadd(		; CHECK-LABEL: @shuf_fadd(
; CHECK-NEXT: [[BO:%.]] = fadd <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fadd <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fadd <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fadd <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fsub(<3 x float> %x) {		define <3 x float> @shuf_fsub(<3 x float> %x) {
; CHECK-LABEL: @shuf_fsub(		; CHECK-LABEL: @shuf_fsub(
; CHECK-NEXT: [[BO:%.]] = fsub fast <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = fsub fast <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fsub fast <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = fsub fast <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fmul(<3 x float> %x) {		define <3 x float> @shuf_fmul(<3 x float> %x) {
; CHECK-LABEL: @shuf_fmul(		; CHECK-LABEL: @shuf_fmul(
; CHECK-NEXT: [[BO:%.]] = fmul reassoc <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fmul reassoc <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fmul reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fmul reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fdiv_const_op0(<3 x float> %x) {		define <3 x float> @shuf_fdiv_const_op0(<3 x float> %x) {
; CHECK-LABEL: @shuf_fdiv_const_op0(		; CHECK-LABEL: @shuf_fdiv_const_op0(
; CHECK-NEXT: [[BO:%.]] = fdiv reassoc ninf <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = fdiv reassoc ninf <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fdiv ninf reassoc <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = fdiv ninf reassoc <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fdiv_const_op1(<3 x float> %x) {		define <3 x float> @shuf_fdiv_const_op1(<3 x float> %x) {
; CHECK-LABEL: @shuf_fdiv_const_op1(		; CHECK-LABEL: @shuf_fdiv_const_op1(
; CHECK-NEXT: [[BO:%.]] = fdiv nnan ninf <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fdiv nnan ninf <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fdiv ninf nnan <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fdiv ninf nnan <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_frem_const_op0(<3 x float> %x) {		define <3 x float> @shuf_frem_const_op0(<3 x float> %x) {
; CHECK-LABEL: @shuf_frem_const_op0(		; CHECK-LABEL: @shuf_frem_const_op0(
; CHECK-NEXT: [[BO:%.]] = frem nnan <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = frem nnan <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = frem nnan <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = frem nnan <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_frem_const_op1(<3 x float> %x) {		define <3 x float> @shuf_frem_const_op1(<3 x float> %x) {
; CHECK-LABEL: @shuf_frem_const_op1(		; CHECK-LABEL: @shuf_frem_const_op1(
; CHECK-NEXT: [[BO:%.]] = frem reassoc ninf <3 x float> [[X:%.]], <float undef, float 2.000000e+00, float 3.000000e+00>		; CHECK-NEXT: [[BO:%.]] = frem reassoc ninf <3 x float> [[X:%.]], <float poison, float 2.000000e+00, float 3.000000e+00>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = frem ninf reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = frem ninf reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

Show All 24 Lines	;
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}


define i32* @gep_splat_base_w_cv_idx(i32* %base) {		define i32* @gep_splat_base_w_cv_idx(i32* %base) {
; CHECK-LABEL: @gep_splat_base_w_cv_idx(		; CHECK-LABEL: @gep_splat_base_w_cv_idx(
; CHECK-NEXT: [[BASEVEC2:%.]] = insertelement <2 x i32> undef, i32* [[BASE:%.*]], i32 1		; CHECK-NEXT: [[BASEVEC2:%.]] = insertelement <2 x i32> undef, i32* [[BASE:%.*]], i32 1
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> [[BASEVEC2]], <2 x i64> <i64 undef, i64 1>		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> [[BASEVEC2]], <2 x i64> <i64 poison, i64 1>
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%basevec1 = insertelement <2 x i32> poison, i32 %base, i32 0		%basevec1 = insertelement <2 x i32> poison, i32 %base, i32 0
%basevec2 = shufflevector <2 x i32> %basevec1, <2 x i32> undef, <2 x i32> zeroinitializer		%basevec2 = shufflevector <2 x i32> %basevec1, <2 x i32> undef, <2 x i32> zeroinitializer
%gep = getelementptr i32, <2 x i32*> %basevec2, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, <2 x i32*> %basevec2, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
Show All 13 Lines	;
ret i32* %ee		ret i32* %ee
}		}


@GLOBAL = internal global i32 zeroinitializer		@GLOBAL = internal global i32 zeroinitializer

define i32* @gep_cvbase_w_s_idx(<2 x i32*> %base, i64 %raw_addr) {		define i32* @gep_cvbase_w_s_idx(<2 x i32*> %base, i64 %raw_addr) {
; CHECK-LABEL: @gep_cvbase_w_s_idx(		; CHECK-LABEL: @gep_cvbase_w_s_idx(
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> <i32* undef, i32* @GLOBAL>, i64 [[RAW_ADDR:%.*]]		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> <i32* poison, i32* @GLOBAL>, i64 [[RAW_ADDR:%.*]]
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, i64 %raw_addr		%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, i64 %raw_addr
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}

define i32* @gep_cvbase_w_cv_idx(<2 x i32*> %base, i64 %raw_addr) {		define i32* @gep_cvbase_w_cv_idx(<2 x i32*> %base, i64 %raw_addr) {
; CHECK-LABEL: @gep_cvbase_w_cv_idx(		; CHECK-LABEL: @gep_cvbase_w_cv_idx(
; CHECK-NEXT: ret i32* getelementptr inbounds (i32, i32* @GLOBAL, i64 1)		; CHECK-NEXT: ret i32* getelementptr inbounds (i32, i32* @GLOBAL, i64 1)
;		;
%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}


define i32* @gep_sbase_w_cv_idx(i32* %base) {		define i32* @gep_sbase_w_cv_idx(i32* %base) {
; CHECK-LABEL: @gep_sbase_w_cv_idx(		; CHECK-LABEL: @gep_sbase_w_cv_idx(
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, i32 [[BASE:%.*]], <2 x i64> <i64 undef, i64 1>		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, i32 [[BASE:%.*]], <2 x i64> <i64 poison, i64 1>
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%gep = getelementptr i32, i32* %base, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, i32* %base, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}

▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines

@global = external global [0 x i32], align 4		@global = external global [0 x i32], align 4

; Make sure we don't get stuck in a loop turning the zeroinitializer into		; Make sure we don't get stuck in a loop turning the zeroinitializer into
; <0, undef, undef, undef> and then changing it back.		; <0, undef, undef, undef> and then changing it back.
define i32* @zero_sized_type_extract(<4 x i64> %arg, i64 %arg1) {		define i32* @zero_sized_type_extract(<4 x i64> %arg, i64 %arg1) {
; CHECK-LABEL: @zero_sized_type_extract(		; CHECK-LABEL: @zero_sized_type_extract(
; CHECK-NEXT: bb:		; CHECK-NEXT: bb:
; CHECK-NEXT: [[T:%.]] = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32]* @global, [0 x i32]* undef, [0 x i32]* undef, [0 x i32]* undef>, <4 x i64> <i64 0, i64 undef, i64 undef, i64 undef>, <4 x i64> [[ARG:%.*]]		; CHECK-NEXT: [[T:%.]] = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32]* @global, [0 x i32]* poison, [0 x i32]* poison, [0 x i32]* poison>, <4 x i64> <i64 0, i64 poison, i64 poison, i64 poison>, <4 x i64> [[ARG:%.*]]
; CHECK-NEXT: [[T2:%.]] = extractelement <4 x i32> [[T]], i64 0		; CHECK-NEXT: [[T2:%.]] = extractelement <4 x i32> [[T]], i64 0
; CHECK-NEXT: ret i32* [[T2]]		; CHECK-NEXT: ret i32* [[T2]]
;		;
bb:		bb:
%t = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32] @global, [0 x i32]* @global, [0 x i32]* @global, [0 x i32]* @global>, <4 x i64> zeroinitializer, <4 x i64> %arg		%t = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32] @global, [0 x i32]* @global, [0 x i32]* @global, [0 x i32]* @global>, <4 x i64> zeroinitializer, <4 x i64> %arg
%t2 = extractelement <4 x i32*> %t, i64 0		%t2 = extractelement <4 x i32*> %t, i64 0
ret i32* %t2		ret i32* %t2
}		}
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	;
%fval = shufflevector <4 x float> %y, <4 x float> %x, <4 x i32> <i32 0, i32 7, i32 6, i32 undef>		%fval = shufflevector <4 x float> %y, <4 x float> %x, <4 x i32> <i32 0, i32 7, i32 6, i32 undef>
%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>		%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>
%r = select <4 x i1> %cond, <4 x float> %tval, <4 x float> %fval		%r = select <4 x i1> %cond, <4 x float> %tval, <4 x float> %fval
ret <4 x float> %r		ret <4 x float> %r
}		}

define <4 x i8> @select_cond_with_undef_true_false_elts(<4 x i8> %x, <4 x i8> %y, <4 x i1> %cmp) {		define <4 x i8> @select_cond_with_undef_true_false_elts(<4 x i8> %x, <4 x i8> %y, <4 x i1> %cmp) {
; CHECK-LABEL: @select_cond_with_undef_true_false_elts(		; CHECK-LABEL: @select_cond_with_undef_true_false_elts(
; CHECK-NEXT: [[TVAL:%.]] = shufflevector <4 x i8> [[Y:%.]], <4 x i8> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>		; CHECK-NEXT: [[TVAL:%.]] = shufflevector <4 x i8> [[Y:%.]], <4 x i8> poison, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>
; CHECK-NEXT: [[COND:%.]] = shufflevector <4 x i1> [[CMP:%.]], <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		; CHECK-NEXT: [[COND:%.]] = shufflevector <4 x i1> [[CMP:%.]], <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
; CHECK-NEXT: [[R:%.]] = select <4 x i1> [[COND]], <4 x i8> [[TVAL]], <4 x i8> [[X:%.]]		; CHECK-NEXT: [[R:%.]] = select <4 x i1> [[COND]], <4 x i8> [[TVAL]], <4 x i8> [[X:%.]]
; CHECK-NEXT: ret <4 x i8> [[R]]		; CHECK-NEXT: ret <4 x i8> [[R]]
;		;
%tval = shufflevector <4 x i8> %x, <4 x i8> %y, <4 x i32> <i32 undef, i32 5, i32 6, i32 7>		%tval = shufflevector <4 x i8> %x, <4 x i8> %y, <4 x i32> <i32 undef, i32 5, i32 6, i32 7>
%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
%r = select <4 x i1> %cond, <4 x i8> %tval, <4 x i8> %x		%r = select <4 x i1> %cond, <4 x i8> %tval, <4 x i8> %x
ret <4 x i8> %r		ret <4 x i8> %r
▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vec_demanded_elts.ll

Show First 20 Lines • Show All 69 Lines • ▼ Show 20 Lines
;		;
%shuffle.i = shufflevector <2 x float> %y, <2 x float> %y, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		%shuffle.i = shufflevector <2 x float> %y, <2 x float> %y, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
%shuffle9.i = shufflevector <4 x float> %x, <4 x float> %shuffle.i, <4 x i32> <i32 4, i32 5, i32 2, i32 3>		%shuffle9.i = shufflevector <4 x float> %x, <4 x float> %shuffle.i, <4 x i32> <i32 4, i32 5, i32 2, i32 3>
ret <4 x float> %shuffle9.i		ret <4 x float> %shuffle9.i
}		}

define <2 x float> @test_fptrunc(double %f) {		define <2 x float> @test_fptrunc(double %f) {
; CHECK-LABEL: @test_fptrunc(		; CHECK-LABEL: @test_fptrunc(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double undef, double 0.000000e+00>, double [[F:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x double> <double poison, double 0.000000e+00>, double [[F:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = fptrunc <2 x double> [[TMP1]] to <2 x float>		; CHECK-NEXT: [[TMP2:%.*]] = fptrunc <2 x double> [[TMP1]] to <2 x float>
; CHECK-NEXT: ret <2 x float> [[TMP2]]		; CHECK-NEXT: ret <2 x float> [[TMP2]]
;		;
%t9 = insertelement <4 x double> undef, double %f, i32 0		%t9 = insertelement <4 x double> undef, double %f, i32 0
%t10 = insertelement <4 x double> %t9, double 0.000000e+00, i32 1		%t10 = insertelement <4 x double> %t9, double 0.000000e+00, i32 1
%t11 = insertelement <4 x double> %t10, double 0.000000e+00, i32 2		%t11 = insertelement <4 x double> %t10, double 0.000000e+00, i32 2
%t12 = insertelement <4 x double> %t11, double 0.000000e+00, i32 3		%t12 = insertelement <4 x double> %t11, double 0.000000e+00, i32 3
%t5 = fptrunc <4 x double> %t12 to <4 x float>		%t5 = fptrunc <4 x double> %t12 to <4 x float>
%ret = shufflevector <4 x float> %t5, <4 x float> undef, <2 x i32> <i32 0, i32 1>		%ret = shufflevector <4 x float> %t5, <4 x float> undef, <2 x i32> <i32 0, i32 1>
ret <2 x float> %ret		ret <2 x float> %ret
}		}

define <2 x double> @test_fpext(float %f) {		define <2 x double> @test_fpext(float %f) {
; CHECK-LABEL: @test_fpext(		; CHECK-LABEL: @test_fpext(
; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float undef, float 0.000000e+00>, float [[F:%.]], i32 0		; CHECK-NEXT: [[TMP1:%.]] = insertelement <2 x float> <float poison, float 0.000000e+00>, float [[F:%.]], i32 0
; CHECK-NEXT: [[TMP2:%.*]] = fpext <2 x float> [[TMP1]] to <2 x double>		; CHECK-NEXT: [[TMP2:%.*]] = fpext <2 x float> [[TMP1]] to <2 x double>
; CHECK-NEXT: ret <2 x double> [[TMP2]]		; CHECK-NEXT: ret <2 x double> [[TMP2]]
;		;
%t9 = insertelement <4 x float> undef, float %f, i32 0		%t9 = insertelement <4 x float> undef, float %f, i32 0
%t10 = insertelement <4 x float> %t9, float 0.000000e+00, i32 1		%t10 = insertelement <4 x float> %t9, float 0.000000e+00, i32 1
%t11 = insertelement <4 x float> %t10, float 0.000000e+00, i32 2		%t11 = insertelement <4 x float> %t10, float 0.000000e+00, i32 2
%t12 = insertelement <4 x float> %t11, float 0.000000e+00, i32 3		%t12 = insertelement <4 x float> %t11, float 0.000000e+00, i32 3
%t5 = fpext <4 x float> %t12 to <4 x double>		%t5 = fpext <4 x float> %t12 to <4 x double>
%ret = shufflevector <4 x double> %t5, <4 x double> undef, <2 x i32> <i32 0, i32 1>		%ret = shufflevector <4 x double> %t5, <4 x double> undef, <2 x i32> <i32 0, i32 1>
ret <2 x double> %ret		ret <2 x double> %ret
}		}

define <4 x double> @test_shuffle(<4 x double> %f) {		define <4 x double> @test_shuffle(<4 x double> %f) {
; CHECK-LABEL: @test_shuffle(		; CHECK-LABEL: @test_shuffle(
; CHECK-NEXT: [[RET1:%.]] = insertelement <4 x double> [[F:%.]], double 1.000000e+00, i32 3		; CHECK-NEXT: [[RET1:%.]] = insertelement <4 x double> [[F:%.]], double 1.000000e+00, i32 3
; CHECK-NEXT: ret <4 x double> [[RET1]]		; CHECK-NEXT: ret <4 x double> [[RET1]]
;		;
%ret = shufflevector <4 x double> %f, <4 x double> <double undef, double 1.0, double undef, double undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 5>		%ret = shufflevector <4 x double> %f, <4 x double> <double undef, double 1.0, double undef, double undef>, <4 x i32> <i32 0, i32 1, i32 2, i32 5>
ret <4 x double> %ret		ret <4 x double> %ret
}		}

define <4 x float> @test_select(float %f, float %g) {		define <4 x float> @test_select(float %f, float %g) {
; CHECK-LABEL: @test_select(		; CHECK-LABEL: @test_select(
; CHECK-NEXT: [[A3:%.]] = insertelement <4 x float> <float undef, float undef, float undef, float 3.000000e+00>, float [[F:%.]], i32 0		; CHECK-NEXT: [[A3:%.]] = insertelement <4 x float> <float poison, float poison, float poison, float 3.000000e+00>, float [[F:%.]], i32 0
; CHECK-NEXT: [[RET:%.*]] = shufflevector <4 x float> [[A3]], <4 x float> <float undef, float 4.000000e+00, float 5.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>		; CHECK-NEXT: [[RET:%.*]] = shufflevector <4 x float> [[A3]], <4 x float> <float poison, float 4.000000e+00, float 5.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
; CHECK-NEXT: ret <4 x float> [[RET]]		; CHECK-NEXT: ret <4 x float> [[RET]]
;		;
%a0 = insertelement <4 x float> undef, float %f, i32 0		%a0 = insertelement <4 x float> undef, float %f, i32 0
%a1 = insertelement <4 x float> %a0, float 1.000000e+00, i32 1		%a1 = insertelement <4 x float> %a0, float 1.000000e+00, i32 1
%a2 = insertelement <4 x float> %a1, float 2.000000e+00, i32 2		%a2 = insertelement <4 x float> %a1, float 2.000000e+00, i32 2
%a3 = insertelement <4 x float> %a2, float 3.000000e+00, i32 3		%a3 = insertelement <4 x float> %a2, float 3.000000e+00, i32 3
%b0 = insertelement <4 x float> undef, float %g, i32 0		%b0 = insertelement <4 x float> undef, float %g, i32 0
%b1 = insertelement <4 x float> %b0, float 4.000000e+00, i32 1		%b1 = insertelement <4 x float> %b0, float 4.000000e+00, i32 1
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	;
%shuffle = shufflevector <4 x float> %out123, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>		%shuffle = shufflevector <4 x float> %out123, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>
ret <4 x float> %shuffle		ret <4 x float> %shuffle
}		}

; Test undef replacement in constant vector elements with binops.		; Test undef replacement in constant vector elements with binops.

define <3 x i8> @shuf_add(<3 x i8> %x) {		define <3 x i8> @shuf_add(<3 x i8> %x) {
; CHECK-LABEL: @shuf_add(		; CHECK-LABEL: @shuf_add(
; CHECK-NEXT: [[BO:%.]] = add <3 x i8> [[X:%.]], <i8 undef, i8 2, i8 3>		; CHECK-NEXT: [[BO:%.]] = add <3 x i8> [[X:%.]], <i8 poison, i8 2, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = add nsw <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = add nsw <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 2>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_sub(<3 x i8> %x) {		define <3 x i8> @shuf_sub(<3 x i8> %x) {
; CHECK-LABEL: @shuf_sub(		; CHECK-LABEL: @shuf_sub(
; CHECK-NEXT: [[BO:%.]] = sub <3 x i8> <i8 1, i8 undef, i8 3>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = sub <3 x i8> <i8 1, i8 poison, i8 3>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = sub nuw <3 x i8> <i8 1, i8 2, i8 3>, %x		%bo = sub nuw <3 x i8> <i8 1, i8 2, i8 3>, %x
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 undef, i32 2>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_mul(<3 x i8> %x) {		define <3 x i8> @shuf_mul(<3 x i8> %x) {
; CHECK-LABEL: @shuf_mul(		; CHECK-LABEL: @shuf_mul(
; CHECK-NEXT: [[BO:%.]] = mul <3 x i8> [[X:%.]], <i8 1, i8 undef, i8 3>		; CHECK-NEXT: [[BO:%.]] = mul <3 x i8> [[X:%.]], <i8 1, i8 poison, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = mul nsw <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = mul nsw <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 0, i32 2, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_and(<3 x i8> %x) {		define <3 x i8> @shuf_and(<3 x i8> %x) {
; CHECK-LABEL: @shuf_and(		; CHECK-LABEL: @shuf_and(
; CHECK-NEXT: [[BO:%.]] = and <3 x i8> [[X:%.]], <i8 1, i8 2, i8 undef>		; CHECK-NEXT: [[BO:%.]] = and <3 x i8> [[X:%.]], <i8 1, i8 2, i8 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = and <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = and <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 1, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_or(<3 x i8> %x) {		define <3 x i8> @shuf_or(<3 x i8> %x) {
; CHECK-LABEL: @shuf_or(		; CHECK-LABEL: @shuf_or(
; CHECK-NEXT: [[BO:%.]] = or <3 x i8> [[X:%.]], <i8 1, i8 2, i8 undef>		; CHECK-NEXT: [[BO:%.]] = or <3 x i8> [[X:%.]], <i8 1, i8 2, i8 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = or <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = or <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 1, i32 undef, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x i8> @shuf_xor(<3 x i8> %x) {		define <3 x i8> @shuf_xor(<3 x i8> %x) {
; CHECK-LABEL: @shuf_xor(		; CHECK-LABEL: @shuf_xor(
; CHECK-NEXT: [[BO:%.]] = xor <3 x i8> [[X:%.]], <i8 1, i8 undef, i8 3>		; CHECK-NEXT: [[BO:%.]] = xor <3 x i8> [[X:%.]], <i8 1, i8 poison, i8 3>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x i8> [[BO]], <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>
; CHECK-NEXT: ret <3 x i8> [[R]]		; CHECK-NEXT: ret <3 x i8> [[R]]
;		;
%bo = xor <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = xor <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 2, i32 undef, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
;		;
%bo = urem <3 x i8> %x, <i8 1, i8 2, i8 3>		%bo = urem <3 x i8> %x, <i8 1, i8 2, i8 3>
%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x i8> %bo, <3 x i8> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x i8> %r		ret <3 x i8> %r
}		}

define <3 x float> @shuf_fadd(<3 x float> %x) {		define <3 x float> @shuf_fadd(<3 x float> %x) {
; CHECK-LABEL: @shuf_fadd(		; CHECK-LABEL: @shuf_fadd(
; CHECK-NEXT: [[BO:%.]] = fadd <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fadd <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fadd <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fadd <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fsub(<3 x float> %x) {		define <3 x float> @shuf_fsub(<3 x float> %x) {
; CHECK-LABEL: @shuf_fsub(		; CHECK-LABEL: @shuf_fsub(
; CHECK-NEXT: [[BO:%.]] = fsub fast <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = fsub fast <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fsub fast <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = fsub fast <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fmul(<3 x float> %x) {		define <3 x float> @shuf_fmul(<3 x float> %x) {
; CHECK-LABEL: @shuf_fmul(		; CHECK-LABEL: @shuf_fmul(
; CHECK-NEXT: [[BO:%.]] = fmul reassoc <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fmul reassoc <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fmul reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fmul reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fdiv_const_op0(<3 x float> %x) {		define <3 x float> @shuf_fdiv_const_op0(<3 x float> %x) {
; CHECK-LABEL: @shuf_fdiv_const_op0(		; CHECK-LABEL: @shuf_fdiv_const_op0(
; CHECK-NEXT: [[BO:%.]] = fdiv reassoc ninf <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = fdiv reassoc ninf <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fdiv ninf reassoc <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = fdiv ninf reassoc <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 0, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_fdiv_const_op1(<3 x float> %x) {		define <3 x float> @shuf_fdiv_const_op1(<3 x float> %x) {
; CHECK-LABEL: @shuf_fdiv_const_op1(		; CHECK-LABEL: @shuf_fdiv_const_op1(
; CHECK-NEXT: [[BO:%.]] = fdiv nnan ninf <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float undef>		; CHECK-NEXT: [[BO:%.]] = fdiv nnan ninf <3 x float> [[X:%.]], <float 1.000000e+00, float 2.000000e+00, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = fdiv ninf nnan <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = fdiv ninf nnan <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 1, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_frem_const_op0(<3 x float> %x) {		define <3 x float> @shuf_frem_const_op0(<3 x float> %x) {
; CHECK-LABEL: @shuf_frem_const_op0(		; CHECK-LABEL: @shuf_frem_const_op0(
; CHECK-NEXT: [[BO:%.]] = frem nnan <3 x float> <float 1.000000e+00, float undef, float 3.000000e+00>, [[X:%.]]		; CHECK-NEXT: [[BO:%.]] = frem nnan <3 x float> <float 1.000000e+00, float poison, float 3.000000e+00>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = frem nnan <3 x float> <float 1.0, float 2.0, float 3.0>, %x		%bo = frem nnan <3 x float> <float 1.0, float 2.0, float 3.0>, %x
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 undef, i32 2, i32 0>
ret <3 x float> %r		ret <3 x float> %r
}		}

define <3 x float> @shuf_frem_const_op1(<3 x float> %x) {		define <3 x float> @shuf_frem_const_op1(<3 x float> %x) {
; CHECK-LABEL: @shuf_frem_const_op1(		; CHECK-LABEL: @shuf_frem_const_op1(
; CHECK-NEXT: [[BO:%.]] = frem reassoc ninf <3 x float> [[X:%.]], <float undef, float 2.000000e+00, float 3.000000e+00>		; CHECK-NEXT: [[BO:%.]] = frem reassoc ninf <3 x float> [[X:%.]], <float poison, float 2.000000e+00, float 3.000000e+00>
; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <3 x float> [[BO]], <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>
; CHECK-NEXT: ret <3 x float> [[R]]		; CHECK-NEXT: ret <3 x float> [[R]]
;		;
%bo = frem ninf reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>		%bo = frem ninf reassoc <3 x float> %x, <float 1.0, float 2.0, float 3.0>
%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>		%r = shufflevector <3 x float> %bo, <3 x float> undef, <3 x i32> <i32 1, i32 undef, i32 2>
ret <3 x float> %r		ret <3 x float> %r
}		}

Show All 24 Lines	;
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}


define i32* @gep_splat_base_w_cv_idx(i32* %base) {		define i32* @gep_splat_base_w_cv_idx(i32* %base) {
; CHECK-LABEL: @gep_splat_base_w_cv_idx(		; CHECK-LABEL: @gep_splat_base_w_cv_idx(
; CHECK-NEXT: [[BASEVEC2:%.]] = insertelement <2 x i32> undef, i32* [[BASE:%.*]], i32 1		; CHECK-NEXT: [[BASEVEC2:%.]] = insertelement <2 x i32> undef, i32* [[BASE:%.*]], i32 1
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> [[BASEVEC2]], <2 x i64> <i64 undef, i64 1>		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> [[BASEVEC2]], <2 x i64> <i64 poison, i64 1>
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%basevec1 = insertelement <2 x i32> undef, i32 %base, i32 0		%basevec1 = insertelement <2 x i32> undef, i32 %base, i32 0
%basevec2 = shufflevector <2 x i32> %basevec1, <2 x i32> undef, <2 x i32> zeroinitializer		%basevec2 = shufflevector <2 x i32> %basevec1, <2 x i32> undef, <2 x i32> zeroinitializer
%gep = getelementptr i32, <2 x i32*> %basevec2, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, <2 x i32*> %basevec2, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
Show All 13 Lines	;
ret i32* %ee		ret i32* %ee
}		}


@GLOBAL = internal global i32 zeroinitializer		@GLOBAL = internal global i32 zeroinitializer

define i32* @gep_cvbase_w_s_idx(<2 x i32*> %base, i64 %raw_addr) {		define i32* @gep_cvbase_w_s_idx(<2 x i32*> %base, i64 %raw_addr) {
; CHECK-LABEL: @gep_cvbase_w_s_idx(		; CHECK-LABEL: @gep_cvbase_w_s_idx(
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> <i32* undef, i32* @GLOBAL>, i64 [[RAW_ADDR:%.*]]		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, <2 x i32> <i32* poison, i32* @GLOBAL>, i64 [[RAW_ADDR:%.*]]
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, i64 %raw_addr		%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, i64 %raw_addr
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}

define i32* @gep_cvbase_w_cv_idx(<2 x i32*> %base, i64 %raw_addr) {		define i32* @gep_cvbase_w_cv_idx(<2 x i32*> %base, i64 %raw_addr) {
; CHECK-LABEL: @gep_cvbase_w_cv_idx(		; CHECK-LABEL: @gep_cvbase_w_cv_idx(
; CHECK-NEXT: ret i32* getelementptr inbounds (i32, i32* @GLOBAL, i64 1)		; CHECK-NEXT: ret i32* getelementptr inbounds (i32, i32* @GLOBAL, i64 1)
;		;
%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, <2 x i32> <i32 @GLOBAL, i32* @GLOBAL>, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}


define i32* @gep_sbase_w_cv_idx(i32* %base) {		define i32* @gep_sbase_w_cv_idx(i32* %base) {
; CHECK-LABEL: @gep_sbase_w_cv_idx(		; CHECK-LABEL: @gep_sbase_w_cv_idx(
; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, i32 [[BASE:%.*]], <2 x i64> <i64 undef, i64 1>		; CHECK-NEXT: [[GEP:%.]] = getelementptr i32, i32 [[BASE:%.*]], <2 x i64> <i64 poison, i64 1>
; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1		; CHECK-NEXT: [[EE:%.]] = extractelement <2 x i32> [[GEP]], i32 1
; CHECK-NEXT: ret i32* [[EE]]		; CHECK-NEXT: ret i32* [[EE]]
;		;
%gep = getelementptr i32, i32* %base, <2 x i64> <i64 0, i64 1>		%gep = getelementptr i32, i32* %base, <2 x i64> <i64 0, i64 1>
%ee = extractelement <2 x i32*> %gep, i32 1		%ee = extractelement <2 x i32*> %gep, i32 1
ret i32* %ee		ret i32* %ee
}		}

▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines

@global = external global [0 x i32], align 4		@global = external global [0 x i32], align 4

; Make sure we don't get stuck in a loop turning the zeroinitializer into		; Make sure we don't get stuck in a loop turning the zeroinitializer into
; <0, undef, undef, undef> and then changing it back.		; <0, undef, undef, undef> and then changing it back.
define i32* @zero_sized_type_extract(<4 x i64> %arg, i64 %arg1) {		define i32* @zero_sized_type_extract(<4 x i64> %arg, i64 %arg1) {
; CHECK-LABEL: @zero_sized_type_extract(		; CHECK-LABEL: @zero_sized_type_extract(
; CHECK-NEXT: bb:		; CHECK-NEXT: bb:
; CHECK-NEXT: [[T:%.]] = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32]* @global, [0 x i32]* undef, [0 x i32]* undef, [0 x i32]* undef>, <4 x i64> <i64 0, i64 undef, i64 undef, i64 undef>, <4 x i64> [[ARG:%.*]]		; CHECK-NEXT: [[T:%.]] = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32]* @global, [0 x i32]* poison, [0 x i32]* poison, [0 x i32]* poison>, <4 x i64> <i64 0, i64 poison, i64 poison, i64 poison>, <4 x i64> [[ARG:%.*]]
; CHECK-NEXT: [[T2:%.]] = extractelement <4 x i32> [[T]], i64 0		; CHECK-NEXT: [[T2:%.]] = extractelement <4 x i32> [[T]], i64 0
; CHECK-NEXT: ret i32* [[T2]]		; CHECK-NEXT: ret i32* [[T2]]
;		;
bb:		bb:
%t = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32] @global, [0 x i32]* @global, [0 x i32]* @global, [0 x i32]* @global>, <4 x i64> zeroinitializer, <4 x i64> %arg		%t = getelementptr inbounds [0 x i32], <4 x [0 x i32]> <[0 x i32] @global, [0 x i32]* @global, [0 x i32]* @global, [0 x i32]* @global>, <4 x i64> zeroinitializer, <4 x i64> %arg
%t2 = extractelement <4 x i32*> %t, i64 0		%t2 = extractelement <4 x i32*> %t, i64 0
ret i32* %t2		ret i32* %t2
}		}
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	;
%fval = shufflevector <4 x float> %y, <4 x float> %x, <4 x i32> <i32 0, i32 7, i32 6, i32 undef>		%fval = shufflevector <4 x float> %y, <4 x float> %x, <4 x i32> <i32 0, i32 7, i32 6, i32 undef>
%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>		%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>
%r = select <4 x i1> %cond, <4 x float> %tval, <4 x float> %fval		%r = select <4 x i1> %cond, <4 x float> %tval, <4 x float> %fval
ret <4 x float> %r		ret <4 x float> %r
}		}

define <4 x i8> @select_cond_with_undef_true_false_elts(<4 x i8> %x, <4 x i8> %y, <4 x i1> %cmp) {		define <4 x i8> @select_cond_with_undef_true_false_elts(<4 x i8> %x, <4 x i8> %y, <4 x i1> %cmp) {
; CHECK-LABEL: @select_cond_with_undef_true_false_elts(		; CHECK-LABEL: @select_cond_with_undef_true_false_elts(
; CHECK-NEXT: [[TVAL:%.]] = shufflevector <4 x i8> [[Y:%.]], <4 x i8> undef, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>		; CHECK-NEXT: [[TVAL:%.]] = shufflevector <4 x i8> [[Y:%.]], <4 x i8> poison, <4 x i32> <i32 undef, i32 1, i32 2, i32 3>
; CHECK-NEXT: [[COND:%.]] = shufflevector <4 x i1> [[CMP:%.]], <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		; CHECK-NEXT: [[COND:%.]] = shufflevector <4 x i1> [[CMP:%.]], <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
; CHECK-NEXT: [[R:%.]] = select <4 x i1> [[COND]], <4 x i8> [[TVAL]], <4 x i8> [[X:%.]]		; CHECK-NEXT: [[R:%.]] = select <4 x i1> [[COND]], <4 x i8> [[TVAL]], <4 x i8> [[X:%.]]
; CHECK-NEXT: ret <4 x i8> [[R]]		; CHECK-NEXT: ret <4 x i8> [[R]]
;		;
%tval = shufflevector <4 x i8> %x, <4 x i8> %y, <4 x i32> <i32 undef, i32 5, i32 6, i32 7>		%tval = shufflevector <4 x i8> %x, <4 x i8> %y, <4 x i32> <i32 undef, i32 5, i32 6, i32 7>
%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>		%cond = shufflevector <4 x i1> %cmp, <4 x i1> undef, <4 x i32> <i32 0, i32 1, i32 0, i32 1>
%r = select <4 x i1> %cond, <4 x i8> %tval, <4 x i8> %x		%r = select <4 x i1> %cond, <4 x i8> %tval, <4 x i8> %x
ret <4 x i8> %r		ret <4 x i8> %r
▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vec_shuffle-inseltpoison.ll

Show First 20 Lines • Show All 79 Lines • ▼ Show 20 Lines
;		;
%r = shufflevector <4 x float> %x, <4 x float> undef, <4 x i32> < i32 0, i32 1, i32 6, i32 7 >		%r = shufflevector <4 x float> %x, <4 x float> undef, <4 x i32> < i32 0, i32 1, i32 6, i32 7 >
ret <4 x float> %r		ret <4 x float> %r
}		}

; This should turn into a single shuffle.		; This should turn into a single shuffle.
define <4 x float> @test8(<4 x float> %x, <4 x float> %y) {		define <4 x float> @test8(<4 x float> %x, <4 x float> %y) {
; CHECK-LABEL: @test8(		; CHECK-LABEL: @test8(
; CHECK-NEXT: [[T134:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> [[Y:%.*]], <4 x i32> <i32 1, i32 undef, i32 3, i32 4>		; CHECK-NEXT: [[T132:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> poison, <4 x i32> <i32 1, i32 undef, i32 3, i32 undef>
		; CHECK-NEXT: [[T134:%.]] = shufflevector <4 x float> [[T132]], <4 x float> [[Y:%.]], <4 x i32> <i32 0, i32 undef, i32 2, i32 4>
; CHECK-NEXT: ret <4 x float> [[T134]]		; CHECK-NEXT: ret <4 x float> [[T134]]
;		;
%t4 = extractelement <4 x float> %x, i32 1		%t4 = extractelement <4 x float> %x, i32 1
%t2 = extractelement <4 x float> %x, i32 3		%t2 = extractelement <4 x float> %x, i32 3
%t1 = extractelement <4 x float> %y, i32 0		%t1 = extractelement <4 x float> %y, i32 0
%t128 = insertelement <4 x float> poison, float %t4, i32 0		%t128 = insertelement <4 x float> poison, float %t4, i32 0
%t130 = insertelement <4 x float> %t128, float undef, i32 1		%t130 = insertelement <4 x float> %t128, float undef, i32 1
%t132 = insertelement <4 x float> %t130, float %t2, i32 2		%t132 = insertelement <4 x float> %t130, float %t2, i32 2
▲ Show 20 Lines • Show All 442 Lines • ▼ Show 20 Lines	;
call void @use(<2 x float> %t1)		call void @use(<2 x float> %t1)
ret <2 x float> %r		ret <2 x float> %r
}		}

; Math before splat allows replacing constant elements with undef lanes.		; Math before splat allows replacing constant elements with undef lanes.

define <4 x i32> @mul_const_splat(<4 x i32> %v) {		define <4 x i32> @mul_const_splat(<4 x i32> %v) {
; CHECK-LABEL: @mul_const_splat(		; CHECK-LABEL: @mul_const_splat(
; CHECK-NEXT: [[TMP1:%.]] = mul <4 x i32> [[V:%.]], <i32 undef, i32 42, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = mul <4 x i32> [[V:%.]], <i32 poison, i32 42, i32 poison, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>		; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>		%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1		%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1
ret <4 x i32> %r		ret <4 x i32> %r
}		}

▲ Show 20 Lines • Show All 251 Lines • ▼ Show 20 Lines	;
ret <2 x i32> %r		ret <2 x i32> %r
}		}

; Test shuffle followed by binop with splat constant for all 18 binop opcodes.		; Test shuffle followed by binop with splat constant for all 18 binop opcodes.
; Test with constant as operand 0 and operand 1 for non-commutative opcodes.		; Test with constant as operand 0 and operand 1 for non-commutative opcodes.

define <2 x i32> @add_splat_constant(<2 x i32> %x) {		define <2 x i32> @add_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @add_splat_constant(		; CHECK-LABEL: @add_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = add <2 x i32> %splat, <i32 42, i32 42>		%r = add <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sub_splat_constant0(<2 x i32> %x) {		define <2 x i32> @sub_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @sub_splat_constant0(		; CHECK-LABEL: @sub_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = sub <2 x i32> <i32 42, i32 undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = sub <2 x i32> <i32 42, i32 poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sub <2 x i32> <i32 42, i32 42>, %splat		%r = sub <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sub_splat_constant1(<2 x i32> %x) {		define <2 x i32> @sub_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @sub_splat_constant1(		; CHECK-LABEL: @sub_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 -42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 -42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sub <2 x i32> %splat, <i32 42, i32 42>		%r = sub <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @mul_splat_constant(<2 x i32> %x) {		define <2 x i32> @mul_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @mul_splat_constant(		; CHECK-LABEL: @mul_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = mul <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = mul <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = mul <2 x i32> %splat, <i32 42, i32 42>		%r = mul <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sdiv <2 x i32> %splat, <i32 42, i32 42>		%r = sdiv <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @and_splat_constant(<2 x i32> %x) {		define <2 x i32> @and_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @and_splat_constant(		; CHECK-LABEL: @and_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = and <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = and <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = and <2 x i32> %splat, <i32 42, i32 42>		%r = and <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

Show All 38 Lines	entry:
%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 0, i16 -1>		%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 0, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}

; We can move the AND across the shuffle, as -1 (AND identity value) is used for undef lanes.		; We can move the AND across the shuffle, as -1 (AND identity value) is used for undef lanes.
define <4 x i16> @and_constant_mask_undef_4(<4 x i16> %add) {		define <4 x i16> @and_constant_mask_undef_4(<4 x i16> %add) {
; CHECK-LABEL: @and_constant_mask_undef_4(		; CHECK-LABEL: @and_constant_mask_undef_4(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 9, i16 20, i16 undef, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 9, i16 20, i16 poison, i16 poison>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>		%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>
%and = and <4 x i16> %shuffle, <i16 9, i16 20, i16 20, i16 -1>		%and = and <4 x i16> %shuffle, <i16 9, i16 20, i16 20, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}

define <4 x i16> @and_constant_mask_not_undef(<4 x i16> %add) {		define <4 x i16> @and_constant_mask_not_undef(<4 x i16> %add) {
; CHECK-LABEL: @and_constant_mask_not_undef(		; CHECK-LABEL: @and_constant_mask_not_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 undef, i16 -1, i16 0, i16 0>		; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 poison, i16 -1, i16 0, i16 0>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>		%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}
Show All 39 Lines	entry:
%or = or <4 x i16> %shuffle, <i16 0, i16 -1, i16 -1, i16 0>		%or = or <4 x i16> %shuffle, <i16 0, i16 -1, i16 -1, i16 0>
ret <4 x i16> %or		ret <4 x i16> %or
}		}

; We can move the OR across the shuffle, as 0 (OR identity value) is used for undef lanes.		; We can move the OR across the shuffle, as 0 (OR identity value) is used for undef lanes.
define <4 x i16> @or_constant_mask_undef_4(<4 x i16> %in) {		define <4 x i16> @or_constant_mask_undef_4(<4 x i16> %in) {
; CHECK-LABEL: @or_constant_mask_undef_4(		; CHECK-LABEL: @or_constant_mask_undef_4(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 undef, i16 99, i16 undef, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 poison, i16 99, i16 poison, i16 poison>
; CHECK-NEXT: [[OR:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>		; CHECK-NEXT: [[OR:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[OR]]		; CHECK-NEXT: ret <4 x i16> [[OR]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>
%or = or <4 x i16> %shuffle, <i16 0, i16 99, i16 99, i16 0>		%or = or <4 x i16> %shuffle, <i16 0, i16 99, i16 99, i16 0>
ret <4 x i16> %or		ret <4 x i16> %or
}		}

define <4 x i16> @or_constant_mask_not_undef(<4 x i16> %in) {		define <4 x i16> @or_constant_mask_not_undef(<4 x i16> %in) {
; CHECK-LABEL: @or_constant_mask_not_undef(		; CHECK-LABEL: @or_constant_mask_not_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 undef, i16 -1, i16 0, i16 0>		; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 poison, i16 -1, i16 0, i16 0>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
%and = or <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>		%and = or <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}
Show All 21 Lines	entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %add		ret <4 x i16> %add
}		}

define <4 x i16> @add_constant_mask_undef_2(<4 x i16> %in) {		define <4 x i16> @add_constant_mask_undef_2(<4 x i16> %in) {
; CHECK-LABEL: @add_constant_mask_undef_2(		; CHECK-LABEL: @add_constant_mask_undef_2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 undef, i16 0, i16 3, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 poison, i16 0, i16 3, i16 poison>
; CHECK-NEXT: [[ADD:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>		; CHECK-NEXT: [[ADD:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[ADD]]		; CHECK-NEXT: ret <4 x i16> [[ADD]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>
%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %add		ret <4 x i16> %add
}		}

define <4 x i16> @sub_constant_mask_undef(<4 x i16> %in) {		define <4 x i16> @sub_constant_mask_undef(<4 x i16> %in) {
; CHECK-LABEL: @sub_constant_mask_undef(		; CHECK-LABEL: @sub_constant_mask_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[SUB:%.]] = shufflevector <4 x i16> [[IN:%.]], <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		; CHECK-NEXT: [[SUB:%.]] = shufflevector <4 x i16> [[IN:%.]], <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[SUB]]		; CHECK-NEXT: ret <4 x i16> [[SUB]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
%sub = sub <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%sub = sub <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %sub		ret <4 x i16> %sub
}		}

define <4 x i16> @sub_constant_mask_undef_2(<4 x i16> %in) {		define <4 x i16> @sub_constant_mask_undef_2(<4 x i16> %in) {
; CHECK-LABEL: @sub_constant_mask_undef_2(		; CHECK-LABEL: @sub_constant_mask_undef_2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 undef, i16 0, i16 -10, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 poison, i16 0, i16 -10, i16 poison>
; CHECK-NEXT: [[SUB:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>		; CHECK-NEXT: [[SUB:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[SUB]]		; CHECK-NEXT: ret <4 x i16> [[SUB]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>
%sub = sub <4 x i16> %shuffle, <i16 0, i16 0, i16 10, i16 99>		%sub = sub <4 x i16> %shuffle, <i16 0, i16 0, i16 10, i16 99>
ret <4 x i16> %sub		ret <4 x i16> %sub
}		}

define <2 x i32> @or_splat_constant(<2 x i32> %x) {		define <2 x i32> @or_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @or_splat_constant(		; CHECK-LABEL: @or_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = or <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = or <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = or <2 x i32> %splat, <i32 42, i32 42>		%r = or <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @xor_splat_constant(<2 x i32> %x) {		define <2 x i32> @xor_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @xor_splat_constant(		; CHECK-LABEL: @xor_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = xor <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = xor <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = xor <2 x i32> %splat, <i32 42, i32 42>		%r = xor <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x float> @fadd_splat_constant(<2 x float> %x) {		define <2 x float> @fadd_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fadd_splat_constant(		; CHECK-LABEL: @fadd_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fadd <2 x float> %splat, <float 42.0, float 42.0>		%r = fadd <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant0(<2 x float> %x) {		define <2 x float> @fsub_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant0(		; CHECK-LABEL: @fsub_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> <float 42.0, float 42.0>, %splat		%r = fsub <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant1(<2 x float> %x) {		define <2 x float> @fsub_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant1(		; CHECK-LABEL: @fsub_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> %splat, <float 42.0, float 42.0>		%r = fsub <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fneg(<2 x float> %x) {		define <2 x float> @fneg(<2 x float> %x) {
; CHECK-LABEL: @fneg(		; CHECK-LABEL: @fneg(
; CHECK-NEXT: [[TMP1:%.]] = fneg <2 x float> [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fneg <2 x float> [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> <float -0.0, float -0.0>, %splat		%r = fsub <2 x float> <float -0.0, float -0.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fmul_splat_constant(<2 x float> %x) {		define <2 x float> @fmul_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fmul_splat_constant(		; CHECK-LABEL: @fmul_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fmul <2 x float> %splat, <float 42.0, float 42.0>		%r = fmul <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant0(		; CHECK-LABEL: @fdiv_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat		%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant1(		; CHECK-LABEL: @fdiv_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>		%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant0(<2 x float> %x) {		define <2 x float> @frem_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant0(		; CHECK-LABEL: @frem_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> <float 42.0, float 42.0>, %splat		%r = frem <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant1(<2 x float> %x) {		define <2 x float> @frem_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant1(		; CHECK-LABEL: @frem_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> %splat, <float 42.0, float 42.0>		%r = frem <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	;
%ext2 = extractelement <2 x float> %x, i32 0		%ext2 = extractelement <2 x float> %x, i32 0
%I = insertelement <4 x float> %widen, float %ext2, i16 0		%I = insertelement <4 x float> %widen, float %ext2, i16 0
store <4 x float> %I, <4 x float>* %p		store <4 x float> %I, <4 x float>* %p
ret <4 x float> %widen		ret <4 x float> %widen
}		}

define <4 x i32> @splat_assoc_add(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add(		; CHECK-LABEL: @splat_assoc_add(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 317426, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 317426, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> zeroinitializer		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> zeroinitializer
%a = add <4 x i32> %y, <i32 317426, i32 317426, i32 317426, i32 317426>		%a = add <4 x i32> %y, <i32 317426, i32 317426, i32 317426, i32 317426>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
Show All 12 Lines	;
%r = add <vscale x 4 x i32> %splatx, %a		%r = add <vscale x 4 x i32> %splatx, %a
ret <vscale x 4 x i32> %r		ret <vscale x 4 x i32> %r
}		}

; Undefs in splat mask are replaced with defined splat index		; Undefs in splat mask are replaced with defined splat index

define <4 x i32> @splat_assoc_add_undef_mask_elts(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add_undef_mask_elts(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add_undef_mask_elts(		; CHECK-LABEL: @splat_assoc_add_undef_mask_elts(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>		%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
}		}

; Undefs in splat mask are replaced with defined splat index		; Undefs in splat mask are replaced with defined splat index

define <4 x i32> @splat_assoc_add_undef_mask_elt_at_splat_index(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add_undef_mask_elt_at_splat_index(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add_undef_mask_elt_at_splat_index(		; CHECK-LABEL: @splat_assoc_add_undef_mask_elt_at_splat_index(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 undef, i32 0, i32 0, i32 0>		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 undef, i32 0, i32 0, i32 0>
%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>		%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	;
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
}		}

; Non-zero splat index; commute operands; FMF intersect		; Non-zero splat index; commute operands; FMF intersect

define <2 x float> @splat_assoc_fmul(<2 x float> %x, <2 x float> %y) {		define <2 x float> @splat_assoc_fmul(<2 x float> %x, <2 x float> %y) {
; CHECK-LABEL: @splat_assoc_fmul(		; CHECK-LABEL: @splat_assoc_fmul(
; CHECK-NEXT: [[TMP1:%.]] = fmul reassoc nsz <2 x float> [[X:%.]], <float undef, float 3.000000e+00>		; CHECK-NEXT: [[TMP1:%.]] = fmul reassoc nsz <2 x float> [[X:%.]], <float poison, float 3.000000e+00>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> <i32 1, i32 1>		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> <i32 1, i32 1>
; CHECK-NEXT: [[R:%.]] = fmul reassoc nsz <2 x float> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = fmul reassoc nsz <2 x float> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splatx = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> <i32 1, i32 1>		%splatx = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> <i32 1, i32 1>
%a = fmul reassoc nsz <2 x float> %y, <float 3.0, float 3.0>		%a = fmul reassoc nsz <2 x float> %y, <float 3.0, float 3.0>
%r = fmul reassoc nsz nnan <2 x float> %a, %splatx		%r = fmul reassoc nsz nnan <2 x float> %a, %splatx
ret <2 x float> %r		ret <2 x float> %r
▲ Show 20 Lines • Show All 184 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vec_shuffle.ll

Show First 20 Lines • Show All 538 Lines • ▼ Show 20 Lines	;
call void @use(<2 x float> %t1)		call void @use(<2 x float> %t1)
ret <2 x float> %r		ret <2 x float> %r
}		}

; Math before splat allows replacing constant elements with undef lanes.		; Math before splat allows replacing constant elements with undef lanes.

define <4 x i32> @mul_const_splat(<4 x i32> %v) {		define <4 x i32> @mul_const_splat(<4 x i32> %v) {
; CHECK-LABEL: @mul_const_splat(		; CHECK-LABEL: @mul_const_splat(
; CHECK-NEXT: [[TMP1:%.]] = mul <4 x i32> [[V:%.]], <i32 undef, i32 42, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = mul <4 x i32> [[V:%.]], <i32 poison, i32 42, i32 poison, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>		; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>		%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1		%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1
ret <4 x i32> %r		ret <4 x i32> %r
}		}

▲ Show 20 Lines • Show All 251 Lines • ▼ Show 20 Lines	;
ret <2 x i32> %r		ret <2 x i32> %r
}		}

; Test shuffle followed by binop with splat constant for all 18 binop opcodes.		; Test shuffle followed by binop with splat constant for all 18 binop opcodes.
; Test with constant as operand 0 and operand 1 for non-commutative opcodes.		; Test with constant as operand 0 and operand 1 for non-commutative opcodes.

define <2 x i32> @add_splat_constant(<2 x i32> %x) {		define <2 x i32> @add_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @add_splat_constant(		; CHECK-LABEL: @add_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = add <2 x i32> %splat, <i32 42, i32 42>		%r = add <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sub_splat_constant0(<2 x i32> %x) {		define <2 x i32> @sub_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @sub_splat_constant0(		; CHECK-LABEL: @sub_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = sub <2 x i32> <i32 42, i32 undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = sub <2 x i32> <i32 42, i32 poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sub <2 x i32> <i32 42, i32 42>, %splat		%r = sub <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sub_splat_constant1(<2 x i32> %x) {		define <2 x i32> @sub_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @sub_splat_constant1(		; CHECK-LABEL: @sub_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 -42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <2 x i32> [[X:%.]], <i32 -42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sub <2 x i32> %splat, <i32 42, i32 42>		%r = sub <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @mul_splat_constant(<2 x i32> %x) {		define <2 x i32> @mul_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @mul_splat_constant(		; CHECK-LABEL: @mul_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = mul <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = mul <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = mul <2 x i32> %splat, <i32 42, i32 42>		%r = mul <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sdiv <2 x i32> %splat, <i32 42, i32 42>		%r = sdiv <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @and_splat_constant(<2 x i32> %x) {		define <2 x i32> @and_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @and_splat_constant(		; CHECK-LABEL: @and_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = and <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = and <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = and <2 x i32> %splat, <i32 42, i32 42>		%r = and <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

Show All 38 Lines	entry:
%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 0, i16 -1>		%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 0, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}

; We can move the AND across the shuffle, as -1 (AND identity value) is used for undef lanes.		; We can move the AND across the shuffle, as -1 (AND identity value) is used for undef lanes.
define <4 x i16> @and_constant_mask_undef_4(<4 x i16> %add) {		define <4 x i16> @and_constant_mask_undef_4(<4 x i16> %add) {
; CHECK-LABEL: @and_constant_mask_undef_4(		; CHECK-LABEL: @and_constant_mask_undef_4(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 9, i16 20, i16 undef, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 9, i16 20, i16 poison, i16 poison>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>		%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 0, i32 1, i32 1, i32 undef>
%and = and <4 x i16> %shuffle, <i16 9, i16 20, i16 20, i16 -1>		%and = and <4 x i16> %shuffle, <i16 9, i16 20, i16 20, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}

define <4 x i16> @and_constant_mask_not_undef(<4 x i16> %add) {		define <4 x i16> @and_constant_mask_not_undef(<4 x i16> %add) {
; CHECK-LABEL: @and_constant_mask_not_undef(		; CHECK-LABEL: @and_constant_mask_not_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 undef, i16 -1, i16 0, i16 0>		; CHECK-NEXT: [[TMP0:%.]] = and <4 x i16> [[ADD:%.]], <i16 poison, i16 -1, i16 0, i16 0>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %add, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>		%and = and <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}
Show All 39 Lines	entry:
%or = or <4 x i16> %shuffle, <i16 0, i16 -1, i16 -1, i16 0>		%or = or <4 x i16> %shuffle, <i16 0, i16 -1, i16 -1, i16 0>
ret <4 x i16> %or		ret <4 x i16> %or
}		}

; We can move the OR across the shuffle, as 0 (OR identity value) is used for undef lanes.		; We can move the OR across the shuffle, as 0 (OR identity value) is used for undef lanes.
define <4 x i16> @or_constant_mask_undef_4(<4 x i16> %in) {		define <4 x i16> @or_constant_mask_undef_4(<4 x i16> %in) {
; CHECK-LABEL: @or_constant_mask_undef_4(		; CHECK-LABEL: @or_constant_mask_undef_4(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 undef, i16 99, i16 undef, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 poison, i16 99, i16 poison, i16 poison>
; CHECK-NEXT: [[OR:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>		; CHECK-NEXT: [[OR:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[OR]]		; CHECK-NEXT: ret <4 x i16> [[OR]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>
%or = or <4 x i16> %shuffle, <i16 0, i16 99, i16 99, i16 0>		%or = or <4 x i16> %shuffle, <i16 0, i16 99, i16 99, i16 0>
ret <4 x i16> %or		ret <4 x i16> %or
}		}

define <4 x i16> @or_constant_mask_not_undef(<4 x i16> %in) {		define <4 x i16> @or_constant_mask_not_undef(<4 x i16> %in) {
; CHECK-LABEL: @or_constant_mask_not_undef(		; CHECK-LABEL: @or_constant_mask_not_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 undef, i16 -1, i16 0, i16 0>		; CHECK-NEXT: [[TMP0:%.]] = or <4 x i16> [[IN:%.]], <i16 poison, i16 -1, i16 0, i16 0>
; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		; CHECK-NEXT: [[AND:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[AND]]		; CHECK-NEXT: ret <4 x i16> [[AND]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 2, i32 3, i32 1, i32 1>
%and = or <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>		%and = or <4 x i16> %shuffle, <i16 0, i16 0, i16 -1, i16 -1>
ret <4 x i16> %and		ret <4 x i16> %and
}		}
Show All 21 Lines	entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %add		ret <4 x i16> %add
}		}

define <4 x i16> @add_constant_mask_undef_2(<4 x i16> %in) {		define <4 x i16> @add_constant_mask_undef_2(<4 x i16> %in) {
; CHECK-LABEL: @add_constant_mask_undef_2(		; CHECK-LABEL: @add_constant_mask_undef_2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 undef, i16 0, i16 3, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 poison, i16 0, i16 3, i16 poison>
; CHECK-NEXT: [[ADD:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>		; CHECK-NEXT: [[ADD:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[ADD]]		; CHECK-NEXT: ret <4 x i16> [[ADD]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 2, i32 1, i32 1>
%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%add = add <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %add		ret <4 x i16> %add
}		}

define <4 x i16> @sub_constant_mask_undef(<4 x i16> %in) {		define <4 x i16> @sub_constant_mask_undef(<4 x i16> %in) {
; CHECK-LABEL: @sub_constant_mask_undef(		; CHECK-LABEL: @sub_constant_mask_undef(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[SUB:%.]] = shufflevector <4 x i16> [[IN:%.]], <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		; CHECK-NEXT: [[SUB:%.]] = shufflevector <4 x i16> [[IN:%.]], <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
; CHECK-NEXT: ret <4 x i16> [[SUB]]		; CHECK-NEXT: ret <4 x i16> [[SUB]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 undef, i32 undef, i32 1, i32 1>
%sub = sub <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>		%sub = sub <4 x i16> %shuffle, <i16 10, i16 3, i16 0, i16 0>
ret <4 x i16> %sub		ret <4 x i16> %sub
}		}

define <4 x i16> @sub_constant_mask_undef_2(<4 x i16> %in) {		define <4 x i16> @sub_constant_mask_undef_2(<4 x i16> %in) {
; CHECK-LABEL: @sub_constant_mask_undef_2(		; CHECK-LABEL: @sub_constant_mask_undef_2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 undef, i16 0, i16 -10, i16 undef>		; CHECK-NEXT: [[TMP0:%.]] = add <4 x i16> [[IN:%.]], <i16 poison, i16 0, i16 -10, i16 poison>
; CHECK-NEXT: [[SUB:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>		; CHECK-NEXT: [[SUB:%.*]] = shufflevector <4 x i16> [[TMP0]], <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>
; CHECK-NEXT: ret <4 x i16> [[SUB]]		; CHECK-NEXT: ret <4 x i16> [[SUB]]
;		;
entry:		entry:
%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>		%shuffle = shufflevector <4 x i16> %in, <4 x i16> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 undef>
%sub = sub <4 x i16> %shuffle, <i16 0, i16 0, i16 10, i16 99>		%sub = sub <4 x i16> %shuffle, <i16 0, i16 0, i16 10, i16 99>
ret <4 x i16> %sub		ret <4 x i16> %sub
}		}

define <2 x i32> @or_splat_constant(<2 x i32> %x) {		define <2 x i32> @or_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @or_splat_constant(		; CHECK-LABEL: @or_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = or <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = or <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = or <2 x i32> %splat, <i32 42, i32 42>		%r = or <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @xor_splat_constant(<2 x i32> %x) {		define <2 x i32> @xor_splat_constant(<2 x i32> %x) {
; CHECK-LABEL: @xor_splat_constant(		; CHECK-LABEL: @xor_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = xor <2 x i32> [[X:%.]], <i32 42, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = xor <2 x i32> [[X:%.]], <i32 42, i32 poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = xor <2 x i32> %splat, <i32 42, i32 42>		%r = xor <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x float> @fadd_splat_constant(<2 x float> %x) {		define <2 x float> @fadd_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fadd_splat_constant(		; CHECK-LABEL: @fadd_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fadd <2 x float> %splat, <float 42.0, float 42.0>		%r = fadd <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant0(<2 x float> %x) {		define <2 x float> @fsub_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant0(		; CHECK-LABEL: @fsub_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> <float 42.0, float 42.0>, %splat		%r = fsub <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant1(<2 x float> %x) {		define <2 x float> @fsub_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant1(		; CHECK-LABEL: @fsub_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> %splat, <float 42.0, float 42.0>		%r = fsub <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fneg(<2 x float> %x) {		define <2 x float> @fneg(<2 x float> %x) {
; CHECK-LABEL: @fneg(		; CHECK-LABEL: @fneg(
; CHECK-NEXT: [[TMP1:%.]] = fneg <2 x float> [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fneg <2 x float> [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> <float -0.0, float -0.0>, %splat		%r = fsub <2 x float> <float -0.0, float -0.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fmul_splat_constant(<2 x float> %x) {		define <2 x float> @fmul_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fmul_splat_constant(		; CHECK-LABEL: @fmul_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fmul <2 x float> %splat, <float 42.0, float 42.0>		%r = fmul <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant0(		; CHECK-LABEL: @fdiv_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat		%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant1(		; CHECK-LABEL: @fdiv_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>		%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant0(<2 x float> %x) {		define <2 x float> @frem_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant0(		; CHECK-LABEL: @frem_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float poison>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> <float 42.0, float 42.0>, %splat		%r = frem <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant1(<2 x float> %x) {		define <2 x float> @frem_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant1(		; CHECK-LABEL: @frem_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float undef>		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float poison>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> %splat, <float 42.0, float 42.0>		%r = frem <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	;
%ext2 = extractelement <2 x float> %x, i32 0		%ext2 = extractelement <2 x float> %x, i32 0
%I = insertelement <4 x float> %widen, float %ext2, i16 0		%I = insertelement <4 x float> %widen, float %ext2, i16 0
store <4 x float> %I, <4 x float>* %p		store <4 x float> %I, <4 x float>* %p
ret <4 x float> %widen		ret <4 x float> %widen
}		}

define <4 x i32> @splat_assoc_add(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add(		; CHECK-LABEL: @splat_assoc_add(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 317426, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 317426, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> zeroinitializer		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> zeroinitializer
%a = add <4 x i32> %y, <i32 317426, i32 317426, i32 317426, i32 317426>		%a = add <4 x i32> %y, <i32 317426, i32 317426, i32 317426, i32 317426>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
Show All 12 Lines	;
%r = add <vscale x 4 x i32> %splatx, %a		%r = add <vscale x 4 x i32> %splatx, %a
ret <vscale x 4 x i32> %r		ret <vscale x 4 x i32> %r
}		}

; Undefs in splat mask are replaced with defined splat index		; Undefs in splat mask are replaced with defined splat index

define <4 x i32> @splat_assoc_add_undef_mask_elts(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add_undef_mask_elts(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add_undef_mask_elts(		; CHECK-LABEL: @splat_assoc_add_undef_mask_elts(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>		%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
}		}

; Undefs in splat mask are replaced with defined splat index		; Undefs in splat mask are replaced with defined splat index

define <4 x i32> @splat_assoc_add_undef_mask_elt_at_splat_index(<4 x i32> %x, <4 x i32> %y) {		define <4 x i32> @splat_assoc_add_undef_mask_elt_at_splat_index(<4 x i32> %x, <4 x i32> %y) {
; CHECK-LABEL: @splat_assoc_add_undef_mask_elt_at_splat_index(		; CHECK-LABEL: @splat_assoc_add_undef_mask_elt_at_splat_index(
; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP1:%.]] = add <4 x i32> [[X:%.]], <i32 42, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = add <4 x i32> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 undef, i32 0, i32 0, i32 0>		%splatx = shufflevector <4 x i32> %x, <4 x i32> undef, <4 x i32> <i32 undef, i32 0, i32 0, i32 0>
%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>		%a = add <4 x i32> %y, <i32 42, i32 42, i32 42, i32 42>
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	;
%r = add <4 x i32> %splatx, %a		%r = add <4 x i32> %splatx, %a
ret <4 x i32> %r		ret <4 x i32> %r
}		}

; Non-zero splat index; commute operands; FMF intersect		; Non-zero splat index; commute operands; FMF intersect

define <2 x float> @splat_assoc_fmul(<2 x float> %x, <2 x float> %y) {		define <2 x float> @splat_assoc_fmul(<2 x float> %x, <2 x float> %y) {
; CHECK-LABEL: @splat_assoc_fmul(		; CHECK-LABEL: @splat_assoc_fmul(
; CHECK-NEXT: [[TMP1:%.]] = fmul reassoc nsz <2 x float> [[X:%.]], <float undef, float 3.000000e+00>		; CHECK-NEXT: [[TMP1:%.]] = fmul reassoc nsz <2 x float> [[X:%.]], <float poison, float 3.000000e+00>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> <i32 1, i32 1>		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> <i32 1, i32 1>
; CHECK-NEXT: [[R:%.]] = fmul reassoc nsz <2 x float> [[TMP2]], [[Y:%.]]		; CHECK-NEXT: [[R:%.]] = fmul reassoc nsz <2 x float> [[TMP2]], [[Y:%.]]
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splatx = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> <i32 1, i32 1>		%splatx = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> <i32 1, i32 1>
%a = fmul reassoc nsz <2 x float> %y, <float 3.0, float 3.0>		%a = fmul reassoc nsz <2 x float> %y, <float 3.0, float 3.0>
%r = fmul reassoc nsz nnan <2 x float> %a, %splatx		%r = fmul reassoc nsz nnan <2 x float> %a, %splatx
ret <2 x float> %r		ret <2 x float> %r
▲ Show 20 Lines • Show All 184 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vector-casts-inseltpoison.ll

	Show First 20 Lines • Show All 316 Lines • ▼ Show 20 Lines
	}			}

	; TODO: Strengthen the backend, so we can have this canonicalization.			; TODO: Strengthen the backend, so we can have this canonicalization.
	; Insert a scalar int into a constant vector and truncate:			; Insert a scalar int into a constant vector and truncate:
	; trunc (inselt C, X, Index) --> inselt C, (trunc X), Index			; trunc (inselt C, X, Index) --> inselt C, (trunc X), Index

	define <3 x i16> @trunc_inselt1(i32 %x) {			define <3 x i16> @trunc_inselt1(i32 %x) {
	; CHECK-LABEL: @trunc_inselt1(			; CHECK-LABEL: @trunc_inselt1(
	; CHECK-NEXT: [[VEC:%.]] = insertelement <3 x i32> <i32 3, i32 undef, i32 65536>, i32 [[X:%.]], i32 1			; CHECK-NEXT: [[VEC:%.]] = insertelement <3 x i32> <i32 3, i32 poison, i32 65536>, i32 [[X:%.]], i32 1
	; CHECK-NEXT: [[TRUNC:%.*]] = trunc <3 x i32> [[VEC]] to <3 x i16>			; CHECK-NEXT: [[TRUNC:%.*]] = trunc <3 x i32> [[VEC]] to <3 x i16>
	; CHECK-NEXT: ret <3 x i16> [[TRUNC]]			; CHECK-NEXT: ret <3 x i16> [[TRUNC]]
	;			;
	%vec = insertelement <3 x i32> <i32 3, i32 -2, i32 65536>, i32 %x, i32 1			%vec = insertelement <3 x i32> <i32 3, i32 -2, i32 65536>, i32 %x, i32 1
	%trunc = trunc <3 x i32> %vec to <3 x i16>			%trunc = trunc <3 x i32> %vec to <3 x i16>
	ret <3 x i16> %trunc			ret <3 x i16> %trunc
	}			}

	▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vector-casts.ll

	Show First 20 Lines • Show All 316 Lines • ▼ Show 20 Lines
	}			}

	; TODO: Strengthen the backend, so we can have this canonicalization.			; TODO: Strengthen the backend, so we can have this canonicalization.
	; Insert a scalar int into a constant vector and truncate:			; Insert a scalar int into a constant vector and truncate:
	; trunc (inselt C, X, Index) --> inselt C, (trunc X), Index			; trunc (inselt C, X, Index) --> inselt C, (trunc X), Index

	define <3 x i16> @trunc_inselt1(i32 %x) {			define <3 x i16> @trunc_inselt1(i32 %x) {
	; CHECK-LABEL: @trunc_inselt1(			; CHECK-LABEL: @trunc_inselt1(
	; CHECK-NEXT: [[VEC:%.]] = insertelement <3 x i32> <i32 3, i32 undef, i32 65536>, i32 [[X:%.]], i32 1			; CHECK-NEXT: [[VEC:%.]] = insertelement <3 x i32> <i32 3, i32 poison, i32 65536>, i32 [[X:%.]], i32 1
	; CHECK-NEXT: [[TRUNC:%.*]] = trunc <3 x i32> [[VEC]] to <3 x i16>			; CHECK-NEXT: [[TRUNC:%.*]] = trunc <3 x i32> [[VEC]] to <3 x i16>
	; CHECK-NEXT: ret <3 x i16> [[TRUNC]]			; CHECK-NEXT: ret <3 x i16> [[TRUNC]]
	;			;
	%vec = insertelement <3 x i32> <i32 3, i32 -2, i32 65536>, i32 %x, i32 1			%vec = insertelement <3 x i32> <i32 3, i32 -2, i32 65536>, i32 %x, i32 1
	%trunc = trunc <3 x i32> %vec to <3 x i16>			%trunc = trunc <3 x i32> %vec to <3 x i16>
	ret <3 x i16> %trunc			ret <3 x i16> %trunc
	}			}

	▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vector_insertelt_shuffle-inseltpoison.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -S \| FileCheck %s		; RUN: opt < %s -instcombine -S \| FileCheck %s

; insertelements should fold to shuffle		; insertelements should fold to shuffle
define <4 x float> @foo(<4 x float> %x) {		define <4 x float> @foo(<4 x float> %x) {
; CHECK-LABEL: @foo(		; CHECK-LABEL: @foo(
; CHECK-NEXT: [[INS2:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>		; CHECK-NEXT: [[INS2:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
; CHECK-NEXT: ret <4 x float> [[INS2]]		; CHECK-NEXT: ret <4 x float> [[INS2]]
;		;
%ins1 = insertelement<4 x float> %x, float 1.0, i32 1		%ins1 = insertelement<4 x float> %x, float 1.0, i32 1
%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2		%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2
ret <4 x float> %ins2		ret <4 x float> %ins2
}		}

; Insert of a constant is canonicalized ahead of insert of a variable.		; Insert of a constant is canonicalized ahead of insert of a variable.
Show All 20 Lines	;
ret <4 x float> %ins2		ret <4 x float> %ins2
}		}

; insertelements should fold to shuffle		; insertelements should fold to shuffle
define <4 x float> @bazz(<4 x float> %x, i32 %a) {		define <4 x float> @bazz(<4 x float> %x, i32 %a) {
; CHECK-LABEL: @bazz(		; CHECK-LABEL: @bazz(
; CHECK-NEXT: [[INS1:%.]] = insertelement <4 x float> [[X:%.]], float 1.000000e+00, i32 3		; CHECK-NEXT: [[INS1:%.]] = insertelement <4 x float> [[X:%.]], float 1.000000e+00, i32 3
; CHECK-NEXT: [[INS2:%.]] = insertelement <4 x float> [[INS1]], float 5.000000e+00, i32 [[A:%.]]		; CHECK-NEXT: [[INS2:%.]] = insertelement <4 x float> [[INS1]], float 5.000000e+00, i32 [[A:%.]]
; CHECK-NEXT: [[INS5:%.*]] = shufflevector <4 x float> [[INS2]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>		; CHECK-NEXT: [[INS5:%.*]] = shufflevector <4 x float> [[INS2]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
; CHECK-NEXT: [[INS6:%.*]] = insertelement <4 x float> [[INS5]], float 7.000000e+00, i32 [[A]]		; CHECK-NEXT: [[INS6:%.*]] = insertelement <4 x float> [[INS5]], float 7.000000e+00, i32 [[A]]
; CHECK-NEXT: ret <4 x float> [[INS6]]		; CHECK-NEXT: ret <4 x float> [[INS6]]
;		;
%ins1 = insertelement<4 x float> %x, float 1.0, i32 3		%ins1 = insertelement<4 x float> %x, float 1.0, i32 3
%ins2 = insertelement<4 x float> %ins1, float 5.0, i32 %a		%ins2 = insertelement<4 x float> %ins1, float 5.0, i32 %a
%ins3 = insertelement<4 x float> %ins2, float 3.0, i32 2		%ins3 = insertelement<4 x float> %ins2, float 3.0, i32 2
%ins4 = insertelement<4 x float> %ins3, float 1.0, i32 1		%ins4 = insertelement<4 x float> %ins3, float 1.0, i32 1
%ins5 = insertelement<4 x float> %ins4, float 2.0, i32 2		%ins5 = insertelement<4 x float> %ins4, float 2.0, i32 2
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/vector_insertelt_shuffle.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	; insertelements should fold to shuffle			; insertelements should fold to shuffle
	define <4 x float> @foo(<4 x float> %x) {			define <4 x float> @foo(<4 x float> %x) {
	; CHECK-LABEL: @foo(			; CHECK-LABEL: @foo(
	; CHECK-NEXT: [[INS2:%.*]] = shufflevector <4 x float> %x, <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>			; CHECK-NEXT: [[INS2:%.]] = shufflevector <4 x float> [[X:%.]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
	; CHECK-NEXT: ret <4 x float> [[INS2]]			; CHECK-NEXT: ret <4 x float> [[INS2]]
	;			;
	%ins1 = insertelement<4 x float> %x, float 1.0, i32 1			%ins1 = insertelement<4 x float> %x, float 1.0, i32 1
	%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2			%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	; Insert of a constant is canonicalized ahead of insert of a variable.			; Insert of a constant is canonicalized ahead of insert of a variable.

	define <4 x float> @bar(<4 x float> %x, float %a) {			define <4 x float> @bar(<4 x float> %x, float %a) {
	; CHECK-LABEL: @bar(			; CHECK-LABEL: @bar(
	; CHECK-NEXT: [[TMP1:%.*]] = insertelement <4 x float> %x, float 2.000000e+00, i32 2			; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x float> [[X:%.]], float 2.000000e+00, i32 2
	; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[TMP1]], float %a, i32 1			; CHECK-NEXT: [[INS2:%.]] = insertelement <4 x float> [[TMP1]], float [[A:%.]], i32 1
	; CHECK-NEXT: ret <4 x float> [[INS2]]			; CHECK-NEXT: ret <4 x float> [[INS2]]
	;			;
	%ins1 = insertelement<4 x float> %x, float %a, i32 1			%ins1 = insertelement<4 x float> %x, float %a, i32 1
	%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2			%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	define <4 x float> @baz(<4 x float> %x, i32 %a) {			define <4 x float> @baz(<4 x float> %x, i32 %a) {
	; CHECK-LABEL: @baz(			; CHECK-LABEL: @baz(
	; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> %x, float 1.000000e+00, i32 1			; CHECK-NEXT: [[INS1:%.]] = insertelement <4 x float> [[X:%.]], float 1.000000e+00, i32 1
	; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float 2.000000e+00, i32 %a			; CHECK-NEXT: [[INS2:%.]] = insertelement <4 x float> [[INS1]], float 2.000000e+00, i32 [[A:%.]]
	; CHECK-NEXT: ret <4 x float> [[INS2]]			; CHECK-NEXT: ret <4 x float> [[INS2]]
	;			;
	%ins1 = insertelement<4 x float> %x, float 1.0, i32 1			%ins1 = insertelement<4 x float> %x, float 1.0, i32 1
	%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 %a			%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 %a
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	; insertelements should fold to shuffle			; insertelements should fold to shuffle
	define <4 x float> @bazz(<4 x float> %x, i32 %a) {			define <4 x float> @bazz(<4 x float> %x, i32 %a) {
	; CHECK-LABEL: @bazz(			; CHECK-LABEL: @bazz(
	; CHECK-NEXT: [[INS1:%.*]] = insertelement <4 x float> %x, float 1.000000e+00, i32 3			; CHECK-NEXT: [[INS1:%.]] = insertelement <4 x float> [[X:%.]], float 1.000000e+00, i32 3
	; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x float> [[INS1]], float 5.000000e+00, i32 %a			; CHECK-NEXT: [[INS2:%.]] = insertelement <4 x float> [[INS1]], float 5.000000e+00, i32 [[A:%.]]
	; CHECK-NEXT: [[INS5:%.*]] = shufflevector <4 x float> [[INS2]], <4 x float> <float undef, float 1.000000e+00, float 2.000000e+00, float undef>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>			; CHECK-NEXT: [[INS5:%.*]] = shufflevector <4 x float> [[INS2]], <4 x float> <float poison, float 1.000000e+00, float 2.000000e+00, float poison>, <4 x i32> <i32 0, i32 5, i32 6, i32 3>
	; CHECK-NEXT: [[INS6:%.*]] = insertelement <4 x float> [[INS5]], float 7.000000e+00, i32 %a			; CHECK-NEXT: [[INS6:%.*]] = insertelement <4 x float> [[INS5]], float 7.000000e+00, i32 [[A]]
	; CHECK-NEXT: ret <4 x float> [[INS6]]			; CHECK-NEXT: ret <4 x float> [[INS6]]
	;			;
	%ins1 = insertelement<4 x float> %x, float 1.0, i32 3			%ins1 = insertelement<4 x float> %x, float 1.0, i32 3
	%ins2 = insertelement<4 x float> %ins1, float 5.0, i32 %a			%ins2 = insertelement<4 x float> %ins1, float 5.0, i32 %a
	%ins3 = insertelement<4 x float> %ins2, float 3.0, i32 2			%ins3 = insertelement<4 x float> %ins2, float 3.0, i32 2
	%ins4 = insertelement<4 x float> %ins3, float 1.0, i32 1			%ins4 = insertelement<4 x float> %ins3, float 1.0, i32 1
	%ins5 = insertelement<4 x float> %ins4, float 2.0, i32 2			%ins5 = insertelement<4 x float> %ins4, float 2.0, i32 2
	%ins6 = insertelement<4 x float> %ins5, float 7.0, i32 %a			%ins6 = insertelement<4 x float> %ins5, float 7.0, i32 %a
	ret <4 x float> %ins6			ret <4 x float> %ins6
	}			}

	; Out of bounds index folds to undef			; Out of bounds index folds to undef
	define <4 x float> @bazzz(<4 x float> %x) {			define <4 x float> @bazzz(<4 x float> %x) {
	; CHECK-LABEL: @bazzz(			; CHECK-LABEL: @bazzz(
	; CHECK-NEXT: ret <4 x float> <float undef, float undef, float 2.000000e+00, float undef>			; CHECK-NEXT: ret <4 x float> <float undef, float undef, float 2.000000e+00, float undef>
	;			;
	%ins1 = insertelement<4 x float> %x, float 1.0, i32 5			%ins1 = insertelement<4 x float> %x, float 1.0, i32 5
	%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2			%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	define <4 x float> @bazzzz(<4 x float> %x) {			define <4 x float> @bazzzz(<4 x float> %x) {
	; CHECK-LABEL: @bazzzz(			; CHECK-LABEL: @bazzzz(
	; CHECK-NEXT: ret <4 x float> <float undef, float undef, float 2.000000e+00, float undef>			; CHECK-NEXT: ret <4 x float> <float undef, float undef, float 2.000000e+00, float undef>
	;			;
	%ins1 = insertelement<4 x float> %x, float 1.0, i32 undef			%ins1 = insertelement<4 x float> %x, float 1.0, i32 undef
	%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2			%ins2 = insertelement<4 x float> %ins1, float 2.0, i32 2
	ret <4 x float> %ins2			ret <4 x float> %ins2
	}			}

	define <4 x float> @bazzzzz() {			define <4 x float> @bazzzzz() {
	; CHECK-LABEL: @bazzzzz(			; CHECK-LABEL: @bazzzzz(
	Show All 16 Lines

llvm/test/Transforms/LoopVectorize/X86/invariant-load-gather.ll

	Show All 38 Lines
	; CHECK-NEXT: [[TMP3:%.]] = icmp ne <16 x i32> [[BROADCAST_SPLAT]], zeroinitializer			; CHECK-NEXT: [[TMP3:%.]] = icmp ne <16 x i32> [[BROADCAST_SPLAT]], zeroinitializer
	; CHECK-NEXT: [[TMP4:%.]] = bitcast i32 [[TMP2]] to <16 x i32>*			; CHECK-NEXT: [[TMP4:%.]] = bitcast i32 [[TMP2]] to <16 x i32>*
	; CHECK-NEXT: store <16 x i32> [[BROADCAST_SPLAT7]], <16 x i32>* [[TMP4]], align 4, !alias.scope !0, !noalias !3			; CHECK-NEXT: store <16 x i32> [[BROADCAST_SPLAT7]], <16 x i32>* [[TMP4]], align 4, !alias.scope !0, !noalias !3
	; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 16			; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 16
	; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
	; CHECK-NEXT: br i1 [[TMP5]], label [[MIDDLE_BLOCK:%.]], label [[VECTOR_BODY]], [[LOOP5:!llvm.loop !.]]			; CHECK-NEXT: br i1 [[TMP5]], label [[MIDDLE_BLOCK:%.]], label [[VECTOR_BODY]], [[LOOP5:!llvm.loop !.]]
	; CHECK: middle.block:			; CHECK: middle.block:
	; CHECK-NEXT: [[WIDE_MASKED_GATHER:%.]] = call <16 x i32> @llvm.masked.gather.v16i32.v16p0i32(<16 x i32> [[BROADCAST_SPLAT]], i32 4, <16 x i1> [[TMP3]], <16 x i32> undef), !alias.scope !3			; CHECK-NEXT: [[WIDE_MASKED_GATHER:%.]] = call <16 x i32> @llvm.masked.gather.v16i32.v16p0i32(<16 x i32> [[BROADCAST_SPLAT]], i32 4, <16 x i1> [[TMP3]], <16 x i32> undef), !alias.scope !3
	; CHECK-NEXT: [[PREDPHI:%.*]] = select <16 x i1> [[TMP3]], <16 x i32> [[WIDE_MASKED_GATHER]], <16 x i32> <i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 1>			; CHECK-NEXT: [[PREDPHI:%.*]] = select <16 x i1> [[TMP3]], <16 x i32> [[WIDE_MASKED_GATHER]], <16 x i32> <i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 1>
	; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i64 [[SMAX]], [[N_VEC]]			; CHECK-NEXT: [[CMP_N:%.*]] = icmp eq i64 [[SMAX]], [[N_VEC]]
	; CHECK-NEXT: [[TMP6:%.*]] = extractelement <16 x i32> [[PREDPHI]], i32 15			; CHECK-NEXT: [[TMP6:%.*]] = extractelement <16 x i32> [[PREDPHI]], i32 15
	; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.]], label [[VEC_EPILOG_ITER_CHECK:%.]]			; CHECK-NEXT: br i1 [[CMP_N]], label [[FOR_END:%.]], label [[VEC_EPILOG_ITER_CHECK:%.]]
	; CHECK: vec.epilog.iter.check:			; CHECK: vec.epilog.iter.check:
	; CHECK-NEXT: [[N_VEC_REMAINING:%.*]] = and i64 [[SMAX]], 8			; CHECK-NEXT: [[N_VEC_REMAINING:%.*]] = and i64 [[SMAX]], 8
	; CHECK-NEXT: [[MIN_EPILOG_ITERS_CHECK_NOT_NOT:%.*]] = icmp eq i64 [[N_VEC_REMAINING]], 0			; CHECK-NEXT: [[MIN_EPILOG_ITERS_CHECK_NOT_NOT:%.*]] = icmp eq i64 [[N_VEC_REMAINING]], 0
	; CHECK-NEXT: br i1 [[MIN_EPILOG_ITERS_CHECK_NOT_NOT]], label [[VEC_EPILOG_SCALAR_PH]], label [[VEC_EPILOG_PH]]			; CHECK-NEXT: br i1 [[MIN_EPILOG_ITERS_CHECK_NOT_NOT]], label [[VEC_EPILOG_SCALAR_PH]], label [[VEC_EPILOG_PH]]
	; CHECK: vec.epilog.ph:			; CHECK: vec.epilog.ph:
	Show All 12 Lines
	; CHECK-NEXT: [[TMP9:%.]] = icmp ne <8 x i32> [[BROADCAST_SPLAT17]], zeroinitializer			; CHECK-NEXT: [[TMP9:%.]] = icmp ne <8 x i32> [[BROADCAST_SPLAT17]], zeroinitializer
	; CHECK-NEXT: [[TMP10:%.]] = bitcast i32 [[TMP8]] to <8 x i32>*			; CHECK-NEXT: [[TMP10:%.]] = bitcast i32 [[TMP8]] to <8 x i32>*
	; CHECK-NEXT: store <8 x i32> [[BROADCAST_SPLAT19]], <8 x i32>* [[TMP10]], align 4			; CHECK-NEXT: store <8 x i32> [[BROADCAST_SPLAT19]], <8 x i32>* [[TMP10]], align 4
	; CHECK-NEXT: [[INDEX_NEXT13]] = add i64 [[INDEX12]], 8			; CHECK-NEXT: [[INDEX_NEXT13]] = add i64 [[INDEX12]], 8
	; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i64 [[INDEX_NEXT13]], [[N_VEC11]]			; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i64 [[INDEX_NEXT13]], [[N_VEC11]]
	; CHECK-NEXT: br i1 [[TMP11]], label [[VEC_EPILOG_MIDDLE_BLOCK:%.]], label [[VEC_EPILOG_VECTOR_BODY]], [[LOOP7:!llvm.loop !.]]			; CHECK-NEXT: br i1 [[TMP11]], label [[VEC_EPILOG_MIDDLE_BLOCK:%.]], label [[VEC_EPILOG_VECTOR_BODY]], [[LOOP7:!llvm.loop !.]]
	; CHECK: vec.epilog.middle.block:			; CHECK: vec.epilog.middle.block:
	; CHECK-NEXT: [[WIDE_MASKED_GATHER20:%.]] = call <8 x i32> @llvm.masked.gather.v8i32.v8p0i32(<8 x i32> [[BROADCAST_SPLAT17]], i32 4, <8 x i1> [[TMP9]], <8 x i32> undef)			; CHECK-NEXT: [[WIDE_MASKED_GATHER20:%.]] = call <8 x i32> @llvm.masked.gather.v8i32.v8p0i32(<8 x i32> [[BROADCAST_SPLAT17]], i32 4, <8 x i1> [[TMP9]], <8 x i32> undef)
	; CHECK-NEXT: [[PREDPHI21:%.*]] = select <8 x i1> [[TMP9]], <8 x i32> [[WIDE_MASKED_GATHER20]], <8 x i32> <i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 undef, i32 1>			; CHECK-NEXT: [[PREDPHI21:%.*]] = select <8 x i1> [[TMP9]], <8 x i32> [[WIDE_MASKED_GATHER20]], <8 x i32> <i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 poison, i32 1>
	; CHECK-NEXT: [[CMP_N14:%.*]] = icmp eq i64 [[SMAX9]], [[N_VEC11]]			; CHECK-NEXT: [[CMP_N14:%.*]] = icmp eq i64 [[SMAX9]], [[N_VEC11]]
	; CHECK-NEXT: [[TMP12:%.*]] = extractelement <8 x i32> [[PREDPHI21]], i32 7			; CHECK-NEXT: [[TMP12:%.*]] = extractelement <8 x i32> [[PREDPHI21]], i32 7
	; CHECK-NEXT: br i1 [[CMP_N14]], label [[FOR_END_LOOPEXIT:%.*]], label [[VEC_EPILOG_SCALAR_PH]]			; CHECK-NEXT: br i1 [[CMP_N14]], label [[FOR_END_LOOPEXIT:%.*]], label [[VEC_EPILOG_SCALAR_PH]]
	; CHECK: vec.epilog.scalar.ph:			; CHECK: vec.epilog.scalar.ph:
	; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ [[N_VEC11]], [[VEC_EPILOG_MIDDLE_BLOCK]] ], [ [[N_VEC]], [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[VECTOR_MEMCHECK]] ], [ 0, [[ITER_CHECK:%.]] ]			; CHECK-NEXT: [[BC_RESUME_VAL:%.]] = phi i64 [ [[N_VEC11]], [[VEC_EPILOG_MIDDLE_BLOCK]] ], [ [[N_VEC]], [[VEC_EPILOG_ITER_CHECK]] ], [ 0, [[VECTOR_MEMCHECK]] ], [ 0, [[ITER_CHECK:%.]] ]
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[I:%.]] = phi i64 [ [[I_NEXT:%.]], [[LATCH:%.*]] ], [ [[BC_RESUME_VAL]], [[VEC_EPILOG_SCALAR_PH]] ]			; CHECK-NEXT: [[I:%.]] = phi i64 [ [[I_NEXT:%.]], [[LATCH:%.*]] ], [ [[BC_RESUME_VAL]], [[VEC_EPILOG_SCALAR_PH]] ]
	▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/induction.ll

	Show First 20 Lines • Show All 421 Lines • ▼ Show 20 Lines
	;			;
	; UNROLL-LABEL: @iv_vector_and_scalar_users(			; UNROLL-LABEL: @iv_vector_and_scalar_users(
	; UNROLL: vector.body:			; UNROLL: vector.body:
	; UNROLL: %index = phi i64 [ 0, %vector.ph ], [ %index.next, %vector.body ]			; UNROLL: %index = phi i64 [ 0, %vector.ph ], [ %index.next, %vector.body ]
	; UNROLL: %vec.ind2 = phi <2 x i32> [ <i32 0, i32 1>, %vector.ph ], [ %vec.ind.next5, %vector.body ]			; UNROLL: %vec.ind2 = phi <2 x i32> [ <i32 0, i32 1>, %vector.ph ], [ %vec.ind.next5, %vector.body ]
	; UNROLL: %[[i1:.+]] = or i64 %index, 1			; UNROLL: %[[i1:.+]] = or i64 %index, 1
	; UNROLL: %[[i2:.+]] = or i64 %index, 2			; UNROLL: %[[i2:.+]] = or i64 %index, 2
	; UNROLL: %[[i3:.+]] = or i64 %index, 3			; UNROLL: %[[i3:.+]] = or i64 %index, 3
	; UNROLL: %[[add:.+]]= add <2 x i32> %[[splat:.+]], <i32 2, i32 undef>			; UNROLL: %[[add:.+]]= add <2 x i32> %[[splat:.+]], <i32 2, i32 poison>
	; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %index, i32 1			; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %index, i32 1
	; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i1]], i32 1			; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i1]], i32 1
	; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i2]], i32 1			; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i2]], i32 1
	; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i3]], i32 1			; UNROLL: getelementptr inbounds %pair.i16, %pair.i16* %p, i64 %[[i3]], i32 1
	; UNROLL: %index.next = add i64 %index, 4			; UNROLL: %index.next = add i64 %index, 4
	; UNROLL: %vec.ind.next5 = add <2 x i32> %vec.ind2, <i32 4, i32 4>			; UNROLL: %vec.ind.next5 = add <2 x i32> %vec.ind2, <i32 4, i32 4>

	%pair.i16 = type { i16, i16 }			%pair.i16 = type { i16, i16 }
	▲ Show 20 Lines • Show All 458 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/reduction-inloop-pred.ll

	Show First 20 Lines • Show All 1,522 Lines • ▼ Show 20 Lines
	}			}

	; Conditional reductions with multi-input phis.			; Conditional reductions with multi-input phis.
	define float @reduction_conditional(float* %A, float* %B, float* %C, float %S) {			define float @reduction_conditional(float* %A, float* %B, float* %C, float %S) {
	; CHECK-LABEL: @reduction_conditional(			; CHECK-LABEL: @reduction_conditional(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[S:%.]], i32 0			; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[S:%.]], i32 0
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x float> [ [[TMP0]], [[VECTOR_PH]] ], [ [[PREDPHI3:%.]], [[VECTOR_BODY]] ]			; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x float> [ [[TMP0]], [[VECTOR_PH]] ], [ [[PREDPHI3:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 [[INDEX]]			; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 [[INDEX]]
	; CHECK-NEXT: [[TMP2:%.]] = bitcast float [[TMP1]] to <4 x float>*			; CHECK-NEXT: [[TMP2:%.]] = bitcast float [[TMP1]] to <4 x float>*
	; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <4 x float>, <4 x float> [[TMP2]], align 4			; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <4 x float>, <4 x float> [[TMP2]], align 4
	; CHECK-NEXT: [[TMP3:%.]] = getelementptr inbounds float, float [[B:%.*]], i64 [[INDEX]]			; CHECK-NEXT: [[TMP3:%.]] = getelementptr inbounds float, float [[B:%.*]], i64 [[INDEX]]
	▲ Show 20 Lines • Show All 271 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/reduction-inloop.ll

	Show First 20 Lines • Show All 794 Lines • ▼ Show 20 Lines
	}			}

	; Conditional reductions with multi-input phis.			; Conditional reductions with multi-input phis.
	define float @reduction_conditional(float* %A, float* %B, float* %C, float %S) {			define float @reduction_conditional(float* %A, float* %B, float* %C, float %S) {
	; CHECK-LABEL: @reduction_conditional(			; CHECK-LABEL: @reduction_conditional(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]			; CHECK-NEXT: br i1 false, label [[SCALAR_PH:%.]], label [[VECTOR_PH:%.]]
	; CHECK: vector.ph:			; CHECK: vector.ph:
	; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x float> <float undef, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[S:%.]], i32 0			; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x float> <float poison, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00>, float [[S:%.]], i32 0
	; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]			; CHECK-NEXT: br label [[VECTOR_BODY:%.*]]
	; CHECK: vector.body:			; CHECK: vector.body:
	; CHECK-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]			; CHECK-NEXT: [[INDEX:%.]] = phi i64 [ 0, [[VECTOR_PH]] ], [ [[INDEX_NEXT:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x float> [ [[TMP0]], [[VECTOR_PH]] ], [ [[PREDPHI3:%.]], [[VECTOR_BODY]] ]			; CHECK-NEXT: [[VEC_PHI:%.]] = phi <4 x float> [ [[TMP0]], [[VECTOR_PH]] ], [ [[PREDPHI3:%.]], [[VECTOR_BODY]] ]
	; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 [[INDEX]]			; CHECK-NEXT: [[TMP1:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 [[INDEX]]
	; CHECK-NEXT: [[TMP2:%.]] = bitcast float [[TMP1]] to <4 x float>*			; CHECK-NEXT: [[TMP2:%.]] = bitcast float [[TMP1]] to <4 x float>*
	; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <4 x float>, <4 x float> [[TMP2]], align 4			; CHECK-NEXT: [[WIDE_LOAD:%.]] = load <4 x float>, <4 x float> [[TMP2]], align 4
	; CHECK-NEXT: [[TMP3:%.]] = getelementptr inbounds float, float [[B:%.*]], i64 [[INDEX]]			; CHECK-NEXT: [[TMP3:%.]] = getelementptr inbounds float, float [[B:%.*]], i64 [[INDEX]]
	▲ Show 20 Lines • Show All 290 Lines • Show Last 20 Lines

llvm/test/Transforms/PhaseOrdering/X86/horiz-math-inseltpoison.ll

Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	;
ret <8 x float> %vecinit30		ret <8 x float> %vecinit30
}		}

define <8 x float> @reverse_hadd_v8f32(<8 x float> %a, <8 x float> %b) #0 {		define <8 x float> @reverse_hadd_v8f32(<8 x float> %a, <8 x float> %b) #0 {
; CHECK-LABEL: @reverse_hadd_v8f32(		; CHECK-LABEL: @reverse_hadd_v8f32(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x float> [[A:%.]], <8 x float> [[B:%.*]], <8 x i32> <i32 0, i32 2, i32 8, i32 10, i32 4, i32 6, i32 12, i32 14>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x float> [[A:%.]], <8 x float> [[B:%.*]], <8 x i32> <i32 0, i32 2, i32 8, i32 10, i32 4, i32 6, i32 12, i32 14>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x float> [[A]], <8 x float> [[B]], <8 x i32> <i32 1, i32 3, i32 9, i32 11, i32 5, i32 7, i32 13, i32 15>		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x float> [[A]], <8 x float> [[B]], <8 x i32> <i32 1, i32 3, i32 9, i32 11, i32 5, i32 7, i32 13, i32 15>
; CHECK-NEXT: [[TMP3:%.*]] = fadd <8 x float> [[TMP1]], [[TMP2]]		; CHECK-NEXT: [[TMP3:%.*]] = fadd <8 x float> [[TMP1]], [[TMP2]]
; CHECK-NEXT: [[SHUFFLE:%.*]] = shufflevector <8 x float> [[TMP3]], <8 x float> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		; CHECK-NEXT: [[SHUFFLE:%.*]] = shufflevector <8 x float> [[TMP3]], <8 x float> poison, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
; CHECK-NEXT: ret <8 x float> [[SHUFFLE]]		; CHECK-NEXT: ret <8 x float> [[SHUFFLE]]
;		;
%vecext = extractelement <8 x float> %a, i32 0		%vecext = extractelement <8 x float> %a, i32 0
%vecext1 = extractelement <8 x float> %a, i32 1		%vecext1 = extractelement <8 x float> %a, i32 1
%add = fadd float %vecext, %vecext1		%add = fadd float %vecext, %vecext1
%vecinit = insertelement <8 x float> poison, float %add, i32 0		%vecinit = insertelement <8 x float> poison, float %add, i32 0
%vecext2 = extractelement <8 x float> %a, i32 2		%vecext2 = extractelement <8 x float> %a, i32 2
%vecext3 = extractelement <8 x float> %a, i32 3		%vecext3 = extractelement <8 x float> %a, i32 3
Show All 31 Lines

llvm/test/Transforms/PhaseOrdering/X86/horiz-math.ll

Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	;
ret <8 x float> %vecinit30		ret <8 x float> %vecinit30
}		}

define <8 x float> @reverse_hadd_v8f32(<8 x float> %a, <8 x float> %b) #0 {		define <8 x float> @reverse_hadd_v8f32(<8 x float> %a, <8 x float> %b) #0 {
; CHECK-LABEL: @reverse_hadd_v8f32(		; CHECK-LABEL: @reverse_hadd_v8f32(
; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x float> [[A:%.]], <8 x float> [[B:%.*]], <8 x i32> <i32 0, i32 2, i32 8, i32 10, i32 4, i32 6, i32 12, i32 14>		; CHECK-NEXT: [[TMP1:%.]] = shufflevector <8 x float> [[A:%.]], <8 x float> [[B:%.*]], <8 x i32> <i32 0, i32 2, i32 8, i32 10, i32 4, i32 6, i32 12, i32 14>
; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x float> [[A]], <8 x float> [[B]], <8 x i32> <i32 1, i32 3, i32 9, i32 11, i32 5, i32 7, i32 13, i32 15>		; CHECK-NEXT: [[TMP2:%.*]] = shufflevector <8 x float> [[A]], <8 x float> [[B]], <8 x i32> <i32 1, i32 3, i32 9, i32 11, i32 5, i32 7, i32 13, i32 15>
; CHECK-NEXT: [[TMP3:%.*]] = fadd <8 x float> [[TMP1]], [[TMP2]]		; CHECK-NEXT: [[TMP3:%.*]] = fadd <8 x float> [[TMP1]], [[TMP2]]
; CHECK-NEXT: [[SHUFFLE:%.*]] = shufflevector <8 x float> [[TMP3]], <8 x float> undef, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>		; CHECK-NEXT: [[SHUFFLE:%.*]] = shufflevector <8 x float> [[TMP3]], <8 x float> poison, <8 x i32> <i32 7, i32 6, i32 5, i32 4, i32 3, i32 2, i32 1, i32 0>
; CHECK-NEXT: ret <8 x float> [[SHUFFLE]]		; CHECK-NEXT: ret <8 x float> [[SHUFFLE]]
;		;
%vecext = extractelement <8 x float> %a, i32 0		%vecext = extractelement <8 x float> %a, i32 0
%vecext1 = extractelement <8 x float> %a, i32 1		%vecext1 = extractelement <8 x float> %a, i32 1
%add = fadd float %vecext, %vecext1		%add = fadd float %vecext, %vecext1
%vecinit = insertelement <8 x float> undef, float %add, i32 0		%vecinit = insertelement <8 x float> undef, float %add, i32 0
%vecext2 = extractelement <8 x float> %a, i32 2		%vecext2 = extractelement <8 x float> %a, i32 2
%vecext3 = extractelement <8 x float> %a, i32 3		%vecext3 = extractelement <8 x float> %a, i32 3
Show All 31 Lines

llvm/test/Transforms/SLPVectorizer/AArch64/getelementptr.ll

	Show All 40 Lines
	; YAML-NEXT: - TreeSize: '3'			; YAML-NEXT: - TreeSize: '3'

	define i32 @getelementptr_4x32(i32* nocapture readonly %g, i32 %n, i32 %x, i32 %y, i32 %z) {			define i32 @getelementptr_4x32(i32* nocapture readonly %g, i32 %n, i32 %x, i32 %y, i32 %z) {
	; CHECK-LABEL: @getelementptr_4x32(			; CHECK-LABEL: @getelementptr_4x32(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[CMP31:%.]] = icmp sgt i32 [[N:%.]], 0			; CHECK-NEXT: [[CMP31:%.]] = icmp sgt i32 [[N:%.]], 0
	; CHECK-NEXT: br i1 [[CMP31]], label [[FOR_BODY_PREHEADER:%.]], label [[FOR_COND_CLEANUP:%.]]			; CHECK-NEXT: br i1 [[CMP31]], label [[FOR_BODY_PREHEADER:%.]], label [[FOR_COND_CLEANUP:%.]]
	; CHECK: for.body.preheader:			; CHECK: for.body.preheader:
	; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>, i32 [[X:%.]], i32 1			; CHECK-NEXT: [[TMP0:%.]] = insertelement <4 x i32> <i32 0, i32 poison, i32 poison, i32 poison>, i32 [[X:%.]], i32 1
	; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x i32> [[TMP0]], i32 [[Y:%.]], i32 2			; CHECK-NEXT: [[TMP1:%.]] = insertelement <4 x i32> [[TMP0]], i32 [[Y:%.]], i32 2
	; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x i32> [[TMP1]], i32 [[Z:%.]], i32 3			; CHECK-NEXT: [[TMP2:%.]] = insertelement <4 x i32> [[TMP1]], i32 [[Z:%.]], i32 3
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.cond.cleanup.loopexit:			; CHECK: for.cond.cleanup.loopexit:
	; CHECK-NEXT: [[TMP3:%.]] = extractelement <2 x i32> [[TMP20:%.]], i32 1			; CHECK-NEXT: [[TMP3:%.]] = extractelement <2 x i32> [[TMP20:%.]], i32 1
	; CHECK-NEXT: br label [[FOR_COND_CLEANUP]]			; CHECK-NEXT: br label [[FOR_COND_CLEANUP]]
	; CHECK: for.cond.cleanup:			; CHECK: for.cond.cleanup:
	; CHECK-NEXT: [[SUM_0_LCSSA:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[TMP3]], [[FOR_COND_CLEANUP_LOOPEXIT:%.*]] ]			; CHECK-NEXT: [[SUM_0_LCSSA:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[TMP3]], [[FOR_COND_CLEANUP_LOOPEXIT:%.*]] ]
	Show All 21 Lines
	; CHECK-NEXT: [[ARRAYIDX10:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP15]]			; CHECK-NEXT: [[ARRAYIDX10:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP15]]
	; CHECK-NEXT: [[T10:%.]] = load i32, i32 [[ARRAYIDX10]], align 4			; CHECK-NEXT: [[T10:%.]] = load i32, i32 [[ARRAYIDX10]], align 4
	; CHECK-NEXT: [[ADD11:%.*]] = add nsw i32 [[ADD6]], [[T10]]			; CHECK-NEXT: [[ADD11:%.*]] = add nsw i32 [[ADD6]], [[T10]]
	; CHECK-NEXT: [[TMP16:%.*]] = extractelement <4 x i32> [[TMP8]], i32 3			; CHECK-NEXT: [[TMP16:%.*]] = extractelement <4 x i32> [[TMP8]], i32 3
	; CHECK-NEXT: [[TMP17:%.*]] = sext i32 [[TMP16]] to i64			; CHECK-NEXT: [[TMP17:%.*]] = sext i32 [[TMP16]] to i64
	; CHECK-NEXT: [[ARRAYIDX15:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP17]]			; CHECK-NEXT: [[ARRAYIDX15:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP17]]
	; CHECK-NEXT: [[T12:%.]] = load i32, i32 [[ARRAYIDX15]], align 4			; CHECK-NEXT: [[T12:%.]] = load i32, i32 [[ARRAYIDX15]], align 4
	; CHECK-NEXT: [[TMP18:%.*]] = insertelement <2 x i32> [[TMP4]], i32 [[ADD11]], i32 1			; CHECK-NEXT: [[TMP18:%.*]] = insertelement <2 x i32> [[TMP4]], i32 [[ADD11]], i32 1
	; CHECK-NEXT: [[TMP19:%.*]] = insertelement <2 x i32> <i32 1, i32 undef>, i32 [[T12]], i32 1			; CHECK-NEXT: [[TMP19:%.*]] = insertelement <2 x i32> <i32 1, i32 poison>, i32 [[T12]], i32 1
	; CHECK-NEXT: [[TMP20]] = add nsw <2 x i32> [[TMP18]], [[TMP19]]			; CHECK-NEXT: [[TMP20]] = add nsw <2 x i32> [[TMP18]], [[TMP19]]
	; CHECK-NEXT: [[TMP21:%.*]] = extractelement <2 x i32> [[TMP20]], i32 0			; CHECK-NEXT: [[TMP21:%.*]] = extractelement <2 x i32> [[TMP20]], i32 0
	; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[TMP21]], [[N]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[TMP21]], [[N]]
	; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_COND_CLEANUP_LOOPEXIT]], label [[FOR_BODY]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_COND_CLEANUP_LOOPEXIT]], label [[FOR_BODY]]
	;			;
	entry:			entry:
	%cmp31 = icmp sgt i32 %n, 0			%cmp31 = icmp sgt i32 %n, 0
	br i1 %cmp31, label %for.body.preheader, label %for.cond.cleanup			br i1 %cmp31, label %for.body.preheader, label %for.cond.cleanup
	▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[ARRAYIDX10:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP12]]			; CHECK-NEXT: [[ARRAYIDX10:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP12]]
	; CHECK-NEXT: [[T10:%.]] = load i32, i32 [[ARRAYIDX10]], align 4			; CHECK-NEXT: [[T10:%.]] = load i32, i32 [[ARRAYIDX10]], align 4
	; CHECK-NEXT: [[ADD11:%.*]] = add nsw i32 [[ADD6]], [[T10]]			; CHECK-NEXT: [[ADD11:%.*]] = add nsw i32 [[ADD6]], [[T10]]
	; CHECK-NEXT: [[TMP13:%.*]] = extractelement <2 x i32> [[TMP10]], i32 1			; CHECK-NEXT: [[TMP13:%.*]] = extractelement <2 x i32> [[TMP10]], i32 1
	; CHECK-NEXT: [[TMP14:%.*]] = sext i32 [[TMP13]] to i64			; CHECK-NEXT: [[TMP14:%.*]] = sext i32 [[TMP13]] to i64
	; CHECK-NEXT: [[ARRAYIDX15:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP14]]			; CHECK-NEXT: [[ARRAYIDX15:%.]] = getelementptr inbounds i32, i32 [[G]], i64 [[TMP14]]
	; CHECK-NEXT: [[T12:%.]] = load i32, i32 [[ARRAYIDX15]], align 4			; CHECK-NEXT: [[T12:%.]] = load i32, i32 [[ARRAYIDX15]], align 4
	; CHECK-NEXT: [[TMP15:%.*]] = insertelement <2 x i32> [[TMP3]], i32 [[ADD11]], i32 1			; CHECK-NEXT: [[TMP15:%.*]] = insertelement <2 x i32> [[TMP3]], i32 [[ADD11]], i32 1
	; CHECK-NEXT: [[TMP16:%.*]] = insertelement <2 x i32> <i32 1, i32 undef>, i32 [[T12]], i32 1			; CHECK-NEXT: [[TMP16:%.*]] = insertelement <2 x i32> <i32 1, i32 poison>, i32 [[T12]], i32 1
	; CHECK-NEXT: [[TMP17]] = add nsw <2 x i32> [[TMP15]], [[TMP16]]			; CHECK-NEXT: [[TMP17]] = add nsw <2 x i32> [[TMP15]], [[TMP16]]
	; CHECK-NEXT: [[TMP18:%.*]] = extractelement <2 x i32> [[TMP17]], i32 0			; CHECK-NEXT: [[TMP18:%.*]] = extractelement <2 x i32> [[TMP17]], i32 0
	; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[TMP18]], [[N]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[TMP18]], [[N]]
	; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_COND_CLEANUP_LOOPEXIT]], label [[FOR_BODY]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_COND_CLEANUP_LOOPEXIT]], label [[FOR_BODY]]
	;			;
	entry:			entry:
	%cmp31 = icmp sgt i32 %n, 0			%cmp31 = icmp sgt i32 %n, 0
	br i1 %cmp31, label %for.body.preheader, label %for.cond.cleanup			br i1 %cmp31, label %for.body.preheader, label %for.cond.cleanup
	▲ Show 20 Lines • Show All 193 Lines • Show Last 20 Lines

llvm/test/Transforms/SLPVectorizer/X86/alternate-cast-inseltpoison.ll

Show First 20 Lines • Show All 191 Lines • ▼ Show 20 Lines	;
%r6 = insertelement <8 x i32> %r5, i32 %ab6, i32 6		%r6 = insertelement <8 x i32> %r5, i32 %ab6, i32 6
%r7 = insertelement <8 x i32> %r6, i32 %ab7, i32 7		%r7 = insertelement <8 x i32> %r6, i32 %ab7, i32 7
ret <8 x i32> %r7		ret <8 x i32> %r7
}		}

define <8 x float> @fneg_fabs(<8 x float> %a) {		define <8 x float> @fneg_fabs(<8 x float> %a) {
; CHECK-LABEL: @fneg_fabs(		; CHECK-LABEL: @fneg_fabs(
; CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x float> [[A:%.]] to <8 x i32>		; CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x float> [[A:%.]] to <8 x i32>
; CHECK-NEXT: [[TMP2:%.*]] = xor <8 x i32> [[TMP1]], <i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP2:%.*]] = xor <8 x i32> [[TMP1]], <i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 poison, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP3:%.*]] = and <8 x i32> [[TMP1]], <i32 undef, i32 undef, i32 undef, i32 undef, i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>		; CHECK-NEXT: [[TMP3:%.*]] = and <8 x i32> [[TMP1]], <i32 poison, i32 poison, i32 poison, i32 poison, i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
; CHECK-NEXT: [[TMP4:%.*]] = shufflevector <8 x i32> [[TMP2]], <8 x i32> [[TMP3]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 12, i32 13, i32 14, i32 15>		; CHECK-NEXT: [[TMP4:%.*]] = shufflevector <8 x i32> [[TMP2]], <8 x i32> [[TMP3]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 12, i32 13, i32 14, i32 15>
; CHECK-NEXT: [[TMP5:%.*]] = bitcast <8 x i32> [[TMP4]] to <8 x float>		; CHECK-NEXT: [[TMP5:%.*]] = bitcast <8 x i32> [[TMP4]] to <8 x float>
; CHECK-NEXT: ret <8 x float> [[TMP5]]		; CHECK-NEXT: ret <8 x float> [[TMP5]]
;		;
%a0 = extractelement <8 x float> %a, i32 0		%a0 = extractelement <8 x float> %a, i32 0
%a1 = extractelement <8 x float> %a, i32 1		%a1 = extractelement <8 x float> %a, i32 1
%a2 = extractelement <8 x float> %a, i32 2		%a2 = extractelement <8 x float> %a, i32 2
%a3 = extractelement <8 x float> %a, i32 3		%a3 = extractelement <8 x float> %a, i32 3
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

llvm/test/Transforms/SLPVectorizer/X86/alternate-cast.ll

Show First 20 Lines • Show All 191 Lines • ▼ Show 20 Lines	;
%r6 = insertelement <8 x i32> %r5, i32 %ab6, i32 6		%r6 = insertelement <8 x i32> %r5, i32 %ab6, i32 6
%r7 = insertelement <8 x i32> %r6, i32 %ab7, i32 7		%r7 = insertelement <8 x i32> %r6, i32 %ab7, i32 7
ret <8 x i32> %r7		ret <8 x i32> %r7
}		}

define <8 x float> @fneg_fabs(<8 x float> %a) {		define <8 x float> @fneg_fabs(<8 x float> %a) {
; CHECK-LABEL: @fneg_fabs(		; CHECK-LABEL: @fneg_fabs(
; CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x float> [[A:%.]] to <8 x i32>		; CHECK-NEXT: [[TMP1:%.]] = bitcast <8 x float> [[A:%.]] to <8 x i32>
; CHECK-NEXT: [[TMP2:%.*]] = xor <8 x i32> [[TMP1]], <i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 undef, i32 undef, i32 undef, i32 undef>		; CHECK-NEXT: [[TMP2:%.*]] = xor <8 x i32> [[TMP1]], <i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 -2147483648, i32 poison, i32 poison, i32 poison, i32 poison>
; CHECK-NEXT: [[TMP3:%.*]] = and <8 x i32> [[TMP1]], <i32 undef, i32 undef, i32 undef, i32 undef, i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>		; CHECK-NEXT: [[TMP3:%.*]] = and <8 x i32> [[TMP1]], <i32 poison, i32 poison, i32 poison, i32 poison, i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
; CHECK-NEXT: [[TMP4:%.*]] = shufflevector <8 x i32> [[TMP2]], <8 x i32> [[TMP3]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 12, i32 13, i32 14, i32 15>		; CHECK-NEXT: [[TMP4:%.*]] = shufflevector <8 x i32> [[TMP2]], <8 x i32> [[TMP3]], <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 12, i32 13, i32 14, i32 15>
; CHECK-NEXT: [[TMP5:%.*]] = bitcast <8 x i32> [[TMP4]] to <8 x float>		; CHECK-NEXT: [[TMP5:%.*]] = bitcast <8 x i32> [[TMP4]] to <8 x float>
; CHECK-NEXT: ret <8 x float> [[TMP5]]		; CHECK-NEXT: ret <8 x float> [[TMP5]]
;		;
%a0 = extractelement <8 x float> %a, i32 0		%a0 = extractelement <8 x float> %a, i32 0
%a1 = extractelement <8 x float> %a, i32 1		%a1 = extractelement <8 x float> %a, i32 1
%a2 = extractelement <8 x float> %a, i32 2		%a2 = extractelement <8 x float> %a, i32 2
%a3 = extractelement <8 x float> %a, i32 3		%a3 = extractelement <8 x float> %a, i32 3
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

llvm/test/Transforms/SLPVectorizer/X86/alternate-int-inseltpoison.ll

	Show First 20 Lines • Show All 419 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[A6:%.*]] = extractelement <8 x i32> [[A]], i32 6			; CHECK-NEXT: [[A6:%.*]] = extractelement <8 x i32> [[A]], i32 6
	; CHECK-NEXT: [[A7:%.*]] = extractelement <8 x i32> [[A]], i32 7			; CHECK-NEXT: [[A7:%.*]] = extractelement <8 x i32> [[A]], i32 7
	; CHECK-NEXT: [[AB1:%.*]] = sdiv i32 [[A1]], 4			; CHECK-NEXT: [[AB1:%.*]] = sdiv i32 [[A1]], 4
	; CHECK-NEXT: [[AB2:%.*]] = sdiv i32 [[A2]], 8			; CHECK-NEXT: [[AB2:%.*]] = sdiv i32 [[A2]], 8
	; CHECK-NEXT: [[AB3:%.*]] = sdiv i32 [[A3]], 16			; CHECK-NEXT: [[AB3:%.*]] = sdiv i32 [[A3]], 16
	; CHECK-NEXT: [[AB5:%.*]] = sdiv i32 [[A5]], 4			; CHECK-NEXT: [[AB5:%.*]] = sdiv i32 [[A5]], 4
	; CHECK-NEXT: [[AB6:%.*]] = sdiv i32 [[A6]], 8			; CHECK-NEXT: [[AB6:%.*]] = sdiv i32 [[A6]], 8
	; CHECK-NEXT: [[AB7:%.*]] = sdiv i32 [[A7]], 16			; CHECK-NEXT: [[AB7:%.*]] = sdiv i32 [[A7]], 16
	; CHECK-NEXT: [[TMP1:%.*]] = insertelement <8 x i32> undef, i32 [[AB1]], i32 1			; CHECK-NEXT: [[TMP1:%.*]] = insertelement <8 x i32> <i32 undef, i32 poison, i32 poison, i32 poison, i32 undef, i32 poison, i32 poison, i32 poison>, i32 [[AB1]], i32 1
	; CHECK-NEXT: [[TMP2:%.*]] = insertelement <8 x i32> [[TMP1]], i32 [[AB2]], i32 2			; CHECK-NEXT: [[TMP2:%.*]] = insertelement <8 x i32> [[TMP1]], i32 [[AB2]], i32 2
	; CHECK-NEXT: [[R4:%.*]] = insertelement <8 x i32> [[TMP2]], i32 [[AB3]], i32 3			; CHECK-NEXT: [[R4:%.*]] = insertelement <8 x i32> [[TMP2]], i32 [[AB3]], i32 3
	; CHECK-NEXT: [[R5:%.*]] = insertelement <8 x i32> [[R4]], i32 [[AB5]], i32 5			; CHECK-NEXT: [[R5:%.*]] = insertelement <8 x i32> [[R4]], i32 [[AB5]], i32 5
	; CHECK-NEXT: [[R6:%.*]] = insertelement <8 x i32> [[R5]], i32 [[AB6]], i32 6			; CHECK-NEXT: [[R6:%.*]] = insertelement <8 x i32> [[R5]], i32 [[AB6]], i32 6
	; CHECK-NEXT: [[R7:%.*]] = insertelement <8 x i32> [[R6]], i32 [[AB7]], i32 7			; CHECK-NEXT: [[R7:%.*]] = insertelement <8 x i32> [[R6]], i32 [[AB7]], i32 7
	; CHECK-NEXT: ret <8 x i32> [[R7]]			; CHECK-NEXT: ret <8 x i32> [[R7]]
	;			;
	%a0 = extractelement <8 x i32> %a, i32 0			%a0 = extractelement <8 x i32> %a, i32 0
	▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

llvm/test/Transforms/SLPVectorizer/X86/blending-shuffle-inseltpoison.ll

	Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
	define <4 x i8> @h_undef(<4 x i8> %x, <4 x i8> %y) {			define <4 x i8> @h_undef(<4 x i8> %x, <4 x i8> %y) {
	; CHECK-LABEL: @h_undef(			; CHECK-LABEL: @h_undef(
	; CHECK-NEXT: [[X3:%.]] = extractelement <4 x i8> [[X:%.]], i32 3			; CHECK-NEXT: [[X3:%.]] = extractelement <4 x i8> [[X:%.]], i32 3
	; CHECK-NEXT: [[Y1:%.]] = extractelement <4 x i8> [[Y:%.]], i32 1			; CHECK-NEXT: [[Y1:%.]] = extractelement <4 x i8> [[Y:%.]], i32 1
	; CHECK-NEXT: [[Y2:%.*]] = extractelement <4 x i8> [[Y]], i32 2			; CHECK-NEXT: [[Y2:%.*]] = extractelement <4 x i8> [[Y]], i32 2
	; CHECK-NEXT: [[X3X3:%.*]] = mul i8 [[X3]], [[X3]]			; CHECK-NEXT: [[X3X3:%.*]] = mul i8 [[X3]], [[X3]]
	; CHECK-NEXT: [[Y1Y1:%.*]] = mul i8 [[Y1]], [[Y1]]			; CHECK-NEXT: [[Y1Y1:%.*]] = mul i8 [[Y1]], [[Y1]]
	; CHECK-NEXT: [[Y2Y2:%.*]] = mul i8 [[Y2]], [[Y2]]			; CHECK-NEXT: [[Y2Y2:%.*]] = mul i8 [[Y2]], [[Y2]]
	; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x i8> undef, i8 [[X3X3]], i32 1			; CHECK-NEXT: [[INS2:%.*]] = insertelement <4 x i8> <i8 undef, i8 poison, i8 poison, i8 poison>, i8 [[X3X3]], i32 1
	; CHECK-NEXT: [[INS3:%.*]] = insertelement <4 x i8> [[INS2]], i8 [[Y1Y1]], i32 2			; CHECK-NEXT: [[INS3:%.*]] = insertelement <4 x i8> [[INS2]], i8 [[Y1Y1]], i32 2
	; CHECK-NEXT: [[INS4:%.*]] = insertelement <4 x i8> [[INS3]], i8 [[Y2Y2]], i32 3			; CHECK-NEXT: [[INS4:%.*]] = insertelement <4 x i8> [[INS3]], i8 [[Y2Y2]], i32 3
	; CHECK-NEXT: ret <4 x i8> [[INS4]]			; CHECK-NEXT: ret <4 x i8> [[INS4]]
	;			;
	%x0 = extractelement <4 x i8> undef, i32 0			%x0 = extractelement <4 x i8> undef, i32 0
	%x3 = extractelement <4 x i8> %x, i32 3			%x3 = extractelement <4 x i8> %x, i32 3
	%y1 = extractelement <4 x i8> %y, i32 1			%y1 = extractelement <4 x i8> %y, i32 1
	%y2 = extractelement <4 x i8> %y, i32 2			%y2 = extractelement <4 x i8> %y, i32 2
	▲ Show 20 Lines • Show All 130 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] use poison as placeholder for undemanded elemsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 313808

clang/test/CodeGen/SystemZ/builtins-systemz-zvector-constrained.c

clang/test/CodeGen/SystemZ/builtins-systemz-zvector.c

clang/test/CodeGen/SystemZ/builtins-systemz-zvector2-constrained.c

clang/test/CodeGen/SystemZ/builtins-systemz-zvector2.c

llvm/lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp

llvm/test/CodeGen/AMDGPU/vector-alloca-bitcast.ll

llvm/test/Transforms/InstCombine/AArch64/tbl1.ll

llvm/test/Transforms/InstCombine/ARM/tbl1.ll

llvm/test/Transforms/InstCombine/X86/clmulqdq.ll

llvm/test/Transforms/InstCombine/X86/x86-addsub-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-addsub.ll

llvm/test/Transforms/InstCombine/X86/x86-avx512-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-avx512.ll

llvm/test/Transforms/InstCombine/X86/x86-insertps.ll

llvm/test/Transforms/InstCombine/X86/x86-masked-memops.ll

llvm/test/Transforms/InstCombine/X86/x86-muldq.ll

llvm/test/Transforms/InstCombine/X86/x86-pack-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-pack.ll

llvm/test/Transforms/InstCombine/X86/x86-pshufb.ll

llvm/test/Transforms/InstCombine/X86/x86-sse-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-sse.ll

llvm/test/Transforms/InstCombine/X86/x86-sse2-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-sse2.ll

llvm/test/Transforms/InstCombine/X86/x86-sse41-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-sse41.ll

llvm/test/Transforms/InstCombine/X86/x86-sse4a.ll

llvm/test/Transforms/InstCombine/X86/x86-vec_demanded_elts-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-vec_demanded_elts.ll

llvm/test/Transforms/InstCombine/X86/x86-vector-shifts-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-vector-shifts.ll

llvm/test/Transforms/InstCombine/X86/x86-vpermil.ll

llvm/test/Transforms/InstCombine/X86/x86-xop-inseltpoison.ll

llvm/test/Transforms/InstCombine/X86/x86-xop.ll

llvm/test/Transforms/InstCombine/bitcast-bigendian.ll

llvm/test/Transforms/InstCombine/bitcast-inseltpoison.ll

llvm/test/Transforms/InstCombine/bitcast-vec-canon-inseltpoison.ll

llvm/test/Transforms/InstCombine/bitcast-vec-canon.ll

llvm/test/Transforms/InstCombine/bitcast.ll

llvm/test/Transforms/InstCombine/broadcast-inseltpoison.ll

llvm/test/Transforms/InstCombine/broadcast.ll

llvm/test/Transforms/InstCombine/cast.ll

llvm/test/Transforms/InstCombine/getelementptr.ll

llvm/test/Transforms/InstCombine/insert-const-shuf.ll

llvm/test/Transforms/InstCombine/insert-extract-shuffle-inseltpoison.ll

llvm/test/Transforms/InstCombine/insert-extract-shuffle.ll

llvm/test/Transforms/InstCombine/masked_intrinsics-inseltpoison.ll

llvm/test/Transforms/InstCombine/masked_intrinsics.ll

llvm/test/Transforms/InstCombine/minmax-fold.ll

llvm/test/Transforms/InstCombine/pr2645-0.ll

llvm/test/Transforms/InstCombine/shuffle_select.ll

llvm/test/Transforms/InstCombine/shufflevector-div-rem-inseltpoison.ll

llvm/test/Transforms/InstCombine/shufflevector-div-rem.ll

llvm/test/Transforms/InstCombine/sub-of-negatible.ll

llvm/test/Transforms/InstCombine/trunc.ll

llvm/test/Transforms/InstCombine/vec_demanded_elts-inseltpoison.ll

llvm/test/Transforms/InstCombine/vec_demanded_elts.ll

llvm/test/Transforms/InstCombine/vec_shuffle-inseltpoison.ll

llvm/test/Transforms/InstCombine/vec_shuffle.ll

llvm/test/Transforms/InstCombine/vector-casts-inseltpoison.ll

llvm/test/Transforms/InstCombine/vector-casts.ll

llvm/test/Transforms/InstCombine/vector_insertelt_shuffle-inseltpoison.ll

llvm/test/Transforms/InstCombine/vector_insertelt_shuffle.ll

llvm/test/Transforms/LoopVectorize/X86/invariant-load-gather.ll

llvm/test/Transforms/LoopVectorize/induction.ll

llvm/test/Transforms/LoopVectorize/reduction-inloop-pred.ll

llvm/test/Transforms/LoopVectorize/reduction-inloop.ll

llvm/test/Transforms/PhaseOrdering/X86/horiz-math-inseltpoison.ll

llvm/test/Transforms/PhaseOrdering/X86/horiz-math.ll

llvm/test/Transforms/SLPVectorizer/AArch64/getelementptr.ll

llvm/test/Transforms/SLPVectorizer/X86/alternate-cast-inseltpoison.ll

llvm/test/Transforms/SLPVectorizer/X86/alternate-cast.ll

llvm/test/Transforms/SLPVectorizer/X86/alternate-int-inseltpoison.ll

llvm/test/Transforms/SLPVectorizer/X86/blending-shuffle-inseltpoison.ll

[InstCombine] use poison as placeholder for undemanded elems
ClosedPublic