This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
2/6
InstructionCombining.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
2
vec_shuffle.ll

Differential D47686

[InstCombine] refine UB-handling in shuffle-binop transform
ClosedPublic

Authored by spatel on Jun 3 2018, 7:40 AM.

Download Raw Diff

Details

Reviewers

efriedma
lebedev.ri
zvi

Commits

rGdcb8d304c319: [InstCombine] refine UB-handling in shuffle-binop transform
rL333962: [InstCombine] refine UB-handling in shuffle-binop transform

Summary

As noted in rL333782, we can be both better for optimization and safer with this transform:
BinOp (shuffle V1, Mask), C --> shuffle (BinOp V1, NewC), Mask

The only potentially unsafe-to-speculate binops are integer div/rem. All other binops are always safe (although I don't see a way to assert that in code here).

For opcodes like shifts that can produce poison, it can't matter here because we know the lanes with undef are dropped by the subsequent shuffle.

Diff Detail

Event Timeline

spatel created this revision.Jun 3 2018, 7:40 AM

Herald added a subscriber: mcrosier. · View Herald TranscriptJun 3 2018, 7:40 AM

lebedev.ri added inline comments.Jun 3 2018, 7:50 AM

lib/Transforms/InstCombine/InstructionCombining.cpp
1441–1442	No special-handling for `fdiv`/`frem`?
1445	We want `int(1)` even for `float`s?
test/Transforms/InstCombine/vec_shuffle.ll
855	This looks like a float `NAN`?

spatel added inline comments.Jun 3 2018, 8:18 AM

lib/Transforms/InstCombine/InstructionCombining.cpp
1441–1442	No - this was clarified recently following the llvm-dev discussion about FP undef: D44216
1445	We're only changing integer opcodes/operands here. I can assert that if it would make the code clearer?
test/Transforms/InstCombine/vec_shuffle.ll
855	Yes. Interesting - I didn't notice that test was different than the rest of the FP cases! So what happens here: This transform puts in an undef element in the constant vector. We canonicalize fsub with constant operand 1 to fadd: // X - C --> X + (-C) IC: Visiting: %1 = fsub <2 x float> %x, <float 4.200000e+01, float undef> IC: Old = %1 = fsub <2 x float> %x, <float 4.200000e+01, float undef> New = <badref> = fadd <2 x float> %x, <float -4.200000e+01, float 0x7FF8000000000000> ...and in that constant conversion, we constant fold the undef vector element to a NaN constant.

This makes sense to me, but i'm not really familiar with shufflevector,
so i'm going to leave the actual LGTM to others.

lib/Transforms/InstCombine/InstructionCombining.cpp
1434–1435	Then i'd suggest // With integer div/rem instructions, it is not safe to use a vector with undef but that will require reflowing the entire comment :S
1445	I think this is precisely the case of things to assert, so ideally yes please.

Patch updated:
Make it clearer via code comment and assert that we are only dealing with integer opcodes/constants (FP binops are always safe to speculate).

In D47686#1120186, @lebedev.ri wrote:

This makes sense to me, but i'm not really familiar with shufflevector,
so i'm going to leave the actual LGTM to others.

No problem - thanks for the suggestions! @efriedma noticed the potential holes in my initial fix, so I'll certainly wait for him to comment.

LGTM

This revision is now accepted and ready to land.Jun 4 2018, 11:08 AM

Closed by commit rL333962: [InstCombine] refine UB-handling in shuffle-binop transform (authored by spatel). · Explain WhyJun 4 2018, 3:31 PM

This revision was automatically updated to reflect the committed changes.

spatel mentioned this in D48401: [InstCombine] fold vector select of binops with constant ops to 1 binop (PR37806).Jun 20 2018, 3:51 PM

spatel mentioned this in rL335283: [InstCombine] fold vector select of binops with constant ops to 1 binop….Jun 21 2018, 1:19 PM

Revision Contents

Path

Size

lib/

Transforms/

InstCombine/

InstructionCombining.cpp

28 lines

test/

Transforms/

InstCombine/

vec_shuffle.ll

38 lines

Diff 149643

lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 1,411 Lines • ▼ Show 20 Lines	for (unsigned I = 0; I < VWidth; ++I) {
if (!CElt \|\| (!isa<UndefValue>(NewCElt) && NewCElt != CElt)) {		if (!CElt \|\| (!isa<UndefValue>(NewCElt) && NewCElt != CElt)) {
MayChange = false;		MayChange = false;
break;		break;
}		}
NewVecC[ShMask[I]] = CElt;		NewVecC[ShMask[I]] = CElt;
}		}
}		}
if (MayChange) {		if (MayChange) {
// It's not safe to use a vector with undef elements because the entire		// With integer div/rem instructions, it is not safe to use a vector with
// instruction can be folded to undef (for example, div/rem divisors).		// undef elements because the entire instruction can be folded to undef.
// Replace undef lanes with the first non-undef element. Vector demanded		// So replace undef elements with '1' because that can never induce
// elements can change those back to undef values if that is safe.		// undefined behavior. All other binop opcodes are always safe to
Constant *SafeDummyConstant = nullptr;		// speculate, and therefore, it is fine to include undef elements for
for (unsigned i = 0; i < VWidth; ++i) {		// unused lanes (and using undefs may help optimization).
if (!isa<UndefValue>(NewVecC[i])) {		BinaryOperator::BinaryOps Opcode = Inst.getOpcode();
SafeDummyConstant = NewVecC[i];		if (Opcode == Instruction::UDiv \|\| Opcode == Instruction::URem \|\|
break;		Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem) {
}		assert(C->getType()->getScalarType()->isIntegerTy() &&
}		"Not expecting FP opcodes/operands/constants here");
assert(SafeDummyConstant && "Undef constant vector was not simplified?");
for (unsigned i = 0; i < VWidth; ++i)		for (unsigned i = 0; i < VWidth; ++i)
if (isa<UndefValue>(NewVecC[i]))		if (isa<UndefValue>(NewVecC[i]))
NewVecC[i] = SafeDummyConstant;		NewVecC[i] = ConstantInt::get(NewVecC[i]->getType(), 1);
		}

		lebedev.riUnsubmitted Done Reply Inline Actions Then i'd suggest // With integer div/rem instructions, it is not safe to use a vector with undef but that will require reflowing the entire comment :S lebedev.ri: Then i'd suggest ``` // With integer div/rem instructions, it is not safe to use a vector with…
// Op(shuffle(V1, Mask), C) -> shuffle(Op(V1, NewC), Mask)		// Op(shuffle(V1, Mask), C) -> shuffle(Op(V1, NewC), Mask)
// Op(C, shuffle(V1, Mask)) -> shuffle(Op(NewC, V1), Mask)		// Op(C, shuffle(V1, Mask)) -> shuffle(Op(NewC, V1), Mask)
Constant *NewC = ConstantVector::get(NewVecC);		Constant *NewC = ConstantVector::get(NewVecC);
Value *NewLHS = isa<Constant>(LHS) ? NewC : V1;		Value *NewLHS = isa<Constant>(LHS) ? NewC : V1;
Value *NewRHS = isa<Constant>(LHS) ? V1 : NewC;		Value *NewRHS = isa<Constant>(LHS) ? V1 : NewC;
return createBinOpShuffle(NewLHS, NewRHS, Mask);		return createBinOpShuffle(NewLHS, NewRHS, Mask);
}		}
		lebedev.riUnsubmitted Not Done Reply Inline Actions No special-handling for `fdiv`/`frem`? lebedev.ri: No special-handling for `fdiv`/`frem`?
		spatelAuthorUnsubmitted Not Done Reply Inline Actions No - this was clarified recently following the llvm-dev discussion about FP undef: D44216 spatel: No - this was clarified recently following the llvm-dev discussion about FP undef: D44216
}		}

return nullptr;		return nullptr;
		lebedev.riUnsubmitted Not Done Reply Inline Actions We want `int(1)` even for `float`s? lebedev.ri: We want `int(1)` even for `float`s?
		spatelAuthorUnsubmitted Not Done Reply Inline Actions We're only changing integer opcodes/operands here. I can assert that if it would make the code clearer? spatel: We're only changing integer opcodes/operands here. I can assert that if it would make the code…
		lebedev.riUnsubmitted Done Reply Inline Actions I think this is precisely the case of things to assert, so ideally yes please. lebedev.ri: I think this is precisely the case of things to assert, so ideally yes please.
}		}

Instruction *InstCombiner::visitGetElementPtrInst(GetElementPtrInst &GEP) {		Instruction *InstCombiner::visitGetElementPtrInst(GetElementPtrInst &GEP) {
SmallVector<Value*, 8> Ops(GEP.op_begin(), GEP.op_end());		SmallVector<Value*, 8> Ops(GEP.op_begin(), GEP.op_end());
Type *GEPType = GEP.getType();		Type *GEPType = GEP.getType();
Type *GEPEltType = GEP.getSourceElementType();		Type *GEPEltType = GEP.getSourceElementType();
if (Value *V = SimplifyGEPInst(GEPEltType, Ops, SQ.getWithInstruction(&GEP)))		if (Value *V = SimplifyGEPInst(GEPEltType, Ops, SQ.getWithInstruction(&GEP)))
return replaceInstUsesWith(GEP, V);		return replaceInstUsesWith(GEP, V);
▲ Show 20 Lines • Show All 1,931 Lines • Show Last 20 Lines

test/Transforms/InstCombine/vec_shuffle.ll

Show First 20 Lines • Show All 446 Lines • ▼ Show 20 Lines	;
%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1		%r = mul <4 x i32> <i32 42, i32 42, i32 42, i32 42>, %t1
ret <4 x i32> %r		ret <4 x i32> %r
}		}

; Take 2 elements of a vector and shift each of those by a different amount		; Take 2 elements of a vector and shift each of those by a different amount

define <4 x i32> @lshr_const_half_splat(<4 x i32> %v) {		define <4 x i32> @lshr_const_half_splat(<4 x i32> %v) {
; CHECK-LABEL: @lshr_const_half_splat(		; CHECK-LABEL: @lshr_const_half_splat(
; CHECK-NEXT: [[TMP1:%.]] = lshr <4 x i32> <i32 8, i32 8, i32 9, i32 8>, [[V:%.]]		; CHECK-NEXT: [[TMP1:%.]] = lshr <4 x i32> <i32 undef, i32 8, i32 9, i32 undef>, [[V:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 2>		; CHECK-NEXT: [[R:%.*]] = shufflevector <4 x i32> [[TMP1]], <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 2>
; CHECK-NEXT: ret <4 x i32> [[R]]		; CHECK-NEXT: ret <4 x i32> [[R]]
;		;
%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 2>		%t1 = shufflevector <4 x i32> %v, <4 x i32> undef, <4 x i32> <i32 1, i32 1, i32 2, i32 2>
%r = lshr <4 x i32> <i32 8, i32 8, i32 9, i32 9>, %t1		%r = lshr <4 x i32> <i32 8, i32 8, i32 9, i32 9>, %t1
ret <4 x i32> %r		ret <4 x i32> %r
}		}

▲ Show 20 Lines • Show All 174 Lines • ▼ Show 20 Lines
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = mul <2 x i32> %splat, <i32 42, i32 42>		%r = mul <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @shl_splat_constant0(<2 x i32> %x) {		define <2 x i32> @shl_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @shl_splat_constant0(		; CHECK-LABEL: @shl_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i32> <i32 5, i32 5>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i32> <i32 5, i32 undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = shl <2 x i32> <i32 5, i32 5>, %splat		%r = shl <2 x i32> <i32 5, i32 5>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @shl_splat_constant1(<2 x i32> %x) {		define <2 x i32> @shl_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @shl_splat_constant1(		; CHECK-LABEL: @shl_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i32> [[X:%.]], <i32 5, i32 5>		; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i32> [[X:%.]], <i32 5, i32 undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = shl <2 x i32> %splat, <i32 5, i32 5>		%r = shl <2 x i32> %splat, <i32 5, i32 5>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @ashr_splat_constant0(<2 x i32> %x) {		define <2 x i32> @ashr_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @ashr_splat_constant0(		; CHECK-LABEL: @ashr_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = lshr <2 x i32> <i32 5, i32 5>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = ashr <2 x i32> <i32 5, i32 undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = ashr <2 x i32> <i32 5, i32 5>, %splat		%r = ashr <2 x i32> <i32 5, i32 5>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @ashr_splat_constant1(<2 x i32> %x) {		define <2 x i32> @ashr_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @ashr_splat_constant1(		; CHECK-LABEL: @ashr_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = ashr <2 x i32> [[X:%.]], <i32 5, i32 5>		; CHECK-NEXT: [[TMP1:%.]] = ashr <2 x i32> [[X:%.]], <i32 5, i32 undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = ashr <2 x i32> %splat, <i32 5, i32 5>		%r = ashr <2 x i32> %splat, <i32 5, i32 5>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @lshr_splat_constant0(<2 x i32> %x) {		define <2 x i32> @lshr_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @lshr_splat_constant0(		; CHECK-LABEL: @lshr_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = lshr <2 x i32> <i32 5, i32 5>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = lshr <2 x i32> <i32 5, i32 undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = lshr <2 x i32> <i32 5, i32 5>, %splat		%r = lshr <2 x i32> <i32 5, i32 5>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @lshr_splat_constant1(<2 x i32> %x) {		define <2 x i32> @lshr_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @lshr_splat_constant1(		; CHECK-LABEL: @lshr_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = lshr <2 x i32> [[X:%.]], <i32 5, i32 5>		; CHECK-NEXT: [[TMP1:%.]] = lshr <2 x i32> [[X:%.]], <i32 5, i32 undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = lshr <2 x i32> %splat, <i32 5, i32 5>		%r = lshr <2 x i32> %splat, <i32 5, i32 5>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @urem_splat_constant0(<2 x i32> %x) {		define <2 x i32> @urem_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @urem_splat_constant0(		; CHECK-LABEL: @urem_splat_constant0(
; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.*]] = urem <2 x i32> <i32 42, i32 42>, [[SPLAT]]		; CHECK-NEXT: [[R:%.*]] = urem <2 x i32> <i32 42, i32 42>, [[SPLAT]]
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = urem <2 x i32> <i32 42, i32 42>, %splat		%r = urem <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @urem_splat_constant1(<2 x i32> %x) {		define <2 x i32> @urem_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @urem_splat_constant1(		; CHECK-LABEL: @urem_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = urem <2 x i32> [[X:%.]], <i32 42, i32 42>		; CHECK-NEXT: [[TMP1:%.]] = urem <2 x i32> [[X:%.]], <i32 42, i32 1>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = urem <2 x i32> %splat, <i32 42, i32 42>		%r = urem <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @srem_splat_constant0(<2 x i32> %x) {		define <2 x i32> @srem_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @srem_splat_constant0(		; CHECK-LABEL: @srem_splat_constant0(
; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.*]] = srem <2 x i32> <i32 42, i32 42>, [[SPLAT]]		; CHECK-NEXT: [[R:%.*]] = srem <2 x i32> <i32 42, i32 42>, [[SPLAT]]
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = srem <2 x i32> <i32 42, i32 42>, %splat		%r = srem <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @srem_splat_constant1(<2 x i32> %x) {		define <2 x i32> @srem_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @srem_splat_constant1(		; CHECK-LABEL: @srem_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = srem <2 x i32> [[X:%.]], <i32 42, i32 42>		; CHECK-NEXT: [[TMP1:%.]] = srem <2 x i32> [[X:%.]], <i32 42, i32 1>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = srem <2 x i32> %splat, <i32 42, i32 42>		%r = srem <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @udiv_splat_constant0(<2 x i32> %x) {		define <2 x i32> @udiv_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @udiv_splat_constant0(		; CHECK-LABEL: @udiv_splat_constant0(
; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.*]] = udiv <2 x i32> <i32 42, i32 42>, [[SPLAT]]		; CHECK-NEXT: [[R:%.*]] = udiv <2 x i32> <i32 42, i32 42>, [[SPLAT]]
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = udiv <2 x i32> <i32 42, i32 42>, %splat		%r = udiv <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @udiv_splat_constant1(<2 x i32> %x) {		define <2 x i32> @udiv_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @udiv_splat_constant1(		; CHECK-LABEL: @udiv_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = udiv <2 x i32> [[X:%.]], <i32 42, i32 42>		; CHECK-NEXT: [[TMP1:%.]] = udiv <2 x i32> [[X:%.]], <i32 42, i32 1>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = udiv <2 x i32> %splat, <i32 42, i32 42>		%r = udiv <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sdiv_splat_constant0(<2 x i32> %x) {		define <2 x i32> @sdiv_splat_constant0(<2 x i32> %x) {
; CHECK-LABEL: @sdiv_splat_constant0(		; CHECK-LABEL: @sdiv_splat_constant0(
; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[SPLAT:%.]] = shufflevector <2 x i32> [[X:%.]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: [[R:%.*]] = sdiv <2 x i32> <i32 42, i32 42>, [[SPLAT]]		; CHECK-NEXT: [[R:%.*]] = sdiv <2 x i32> <i32 42, i32 42>, [[SPLAT]]
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sdiv <2 x i32> <i32 42, i32 42>, %splat		%r = sdiv <2 x i32> <i32 42, i32 42>, %splat
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x i32> @sdiv_splat_constant1(<2 x i32> %x) {		define <2 x i32> @sdiv_splat_constant1(<2 x i32> %x) {
; CHECK-LABEL: @sdiv_splat_constant1(		; CHECK-LABEL: @sdiv_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = sdiv <2 x i32> [[X:%.]], <i32 42, i32 42>		; CHECK-NEXT: [[TMP1:%.]] = sdiv <2 x i32> [[X:%.]], <i32 42, i32 1>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x i32> [[TMP1]], <2 x i32> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x i32> [[R]]		; CHECK-NEXT: ret <2 x i32> [[R]]
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = sdiv <2 x i32> %splat, <i32 42, i32 42>		%r = sdiv <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

Show All 27 Lines
;		;
%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x i32> %x, <2 x i32> undef, <2 x i32> zeroinitializer
%r = xor <2 x i32> %splat, <i32 42, i32 42>		%r = xor <2 x i32> %splat, <i32 42, i32 42>
ret <2 x i32> %r		ret <2 x i32> %r
}		}

define <2 x float> @fadd_splat_constant(<2 x float> %x) {		define <2 x float> @fadd_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fadd_splat_constant(		; CHECK-LABEL: @fadd_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float 4.200000e+01>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float 4.200000e+01, float undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fadd <2 x float> %splat, <float 42.0, float 42.0>		%r = fadd <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant0(<2 x float> %x) {		define <2 x float> @fsub_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant0(		; CHECK-LABEL: @fsub_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float 4.200000e+01>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fsub <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> <float 42.0, float 42.0>, %splat		%r = fsub <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fsub_splat_constant1(<2 x float> %x) {		define <2 x float> @fsub_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fsub_splat_constant1(		; CHECK-LABEL: @fsub_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float -4.200000e+01>		; CHECK-NEXT: [[TMP1:%.]] = fadd <2 x float> [[X:%.]], <float -4.200000e+01, float 0x7FF8000000000000>
		lebedev.riUnsubmitted Not Done Reply Inline Actions This looks like a float `NAN`? lebedev.ri: This looks like a float `NAN`?
		spatelAuthorUnsubmitted Not Done Reply Inline Actions Yes. Interesting - I didn't notice that test was different than the rest of the FP cases! So what happens here: This transform puts in an undef element in the constant vector. We canonicalize fsub with constant operand 1 to fadd: // X - C --> X + (-C) IC: Visiting: %1 = fsub <2 x float> %x, <float 4.200000e+01, float undef> IC: Old = %1 = fsub <2 x float> %x, <float 4.200000e+01, float undef> New = <badref> = fadd <2 x float> %x, <float -4.200000e+01, float 0x7FF8000000000000> ...and in that constant conversion, we constant fold the undef vector element to a NaN constant. spatel: Yes. Interesting - I didn't notice that test was different than the rest of the FP cases! So…
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fsub <2 x float> %splat, <float 42.0, float 42.0>		%r = fsub <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fmul_splat_constant(<2 x float> %x) {		define <2 x float> @fmul_splat_constant(<2 x float> %x) {
; CHECK-LABEL: @fmul_splat_constant(		; CHECK-LABEL: @fmul_splat_constant(
; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float 4.200000e+01>		; CHECK-NEXT: [[TMP1:%.]] = fmul <2 x float> [[X:%.]], <float 4.200000e+01, float undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fmul <2 x float> %splat, <float 42.0, float 42.0>		%r = fmul <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant0(		; CHECK-LABEL: @fdiv_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float 4.200000e+01>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat		%r = fdiv <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {		define <2 x float> @fdiv_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @fdiv_splat_constant1(		; CHECK-LABEL: @fdiv_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float 4.200000e+01>		; CHECK-NEXT: [[TMP1:%.]] = fdiv <2 x float> [[X:%.]], <float 4.200000e+01, float undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>		%r = fdiv <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant0(<2 x float> %x) {		define <2 x float> @frem_splat_constant0(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant0(		; CHECK-LABEL: @frem_splat_constant0(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float 4.200000e+01>, [[X:%.]]		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> <float 4.200000e+01, float undef>, [[X:%.]]
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> <float 42.0, float 42.0>, %splat		%r = frem <2 x float> <float 42.0, float 42.0>, %splat
ret <2 x float> %r		ret <2 x float> %r
}		}

define <2 x float> @frem_splat_constant1(<2 x float> %x) {		define <2 x float> @frem_splat_constant1(<2 x float> %x) {
; CHECK-LABEL: @frem_splat_constant1(		; CHECK-LABEL: @frem_splat_constant1(
; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float 4.200000e+01>		; CHECK-NEXT: [[TMP1:%.]] = frem <2 x float> [[X:%.]], <float 4.200000e+01, float undef>
; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer		; CHECK-NEXT: [[R:%.*]] = shufflevector <2 x float> [[TMP1]], <2 x float> undef, <2 x i32> zeroinitializer
; CHECK-NEXT: ret <2 x float> [[R]]		; CHECK-NEXT: ret <2 x float> [[R]]
;		;
%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer		%splat = shufflevector <2 x float> %x, <2 x float> undef, <2 x i32> zeroinitializer
%r = frem <2 x float> %splat, <float 42.0, float 42.0>		%r = frem <2 x float> %splat, <float 42.0, float 42.0>
ret <2 x float> %r		ret <2 x float> %r
}		}