This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineAddSub.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
rem.ll
-
set-lowbits-mask-canonicalize.ll

Differential D47428

[InstCombine] PR37603: low bit mask canonicalization
ClosedPublic

Authored by lebedev.ri on May 27 2018, 11:14 AM.

Download Raw Diff

Details

Reviewers

spatel
craig.topper
RKSimon

Commits

rGcbf8446359a2: [InstCombine] PR37603: low bit mask canonicalization
rL334127: [InstCombine] PR37603: low bit mask canonicalization

Summary

This is PR37603.

https://godbolt.org/g/VCMNpS
https://rise4fun.com/Alive/idM

When doing bit manipulations, it is quite common to calculate some bit mask,
and apply it to some value via and.

The typical C code looks like:

int mask_signed_add(int nbits) {
    return (1 << nbits) - 1;
}

which is translated into (with -O3)

define dso_local i32 @mask_signed_add(int)(i32) local_unnamed_addr #0 {
  %2 = shl i32 1, %0
  %3 = add nsw i32 %2, -1
  ret i32 %3
}

But there is a second, less readable variant:

int mask_signed_xor(int nbits) {
    return ~(-(1 << nbits));
}

which is translated into (with -O3)

define dso_local i32 @mask_signed_xor(int)(i32) local_unnamed_addr #0 {
  %2 = shl i32 -1, %0
  %3 = xor i32 %2, -1
  ret i32 %3
}

Since we created such a mask, it is quite likely that we will use it in and next.
And then we may get rid of not op by folding into andn.

But now that i have actually looked:
https://godbolt.org/g/VTUDmU
_some_ backend changes will be needed too.
We clearly loose bzhi recognition.

Diff Detail

Repository: rL LLVM

Event Timeline

lebedev.ri created this revision.May 27 2018, 11:14 AM

lebedev.ri edited the summary of this revision. (Show Details)May 27 2018, 11:16 AM

Actually run $ ninja check-llvm -- update test/Transforms/InstCombine/rem.ll

lebedev.ri added a parent revision: D47427: [InstCombine][NFC] PR37603: low bit mask canonicalization.May 27 2018, 12:01 PM

lebedev.ri added inline comments.May 27 2018, 2:56 PM

test/Transforms/InstCombine/set-lowbits-mask-canonicalize.ll
20–21 ↗	(On Diff #148763)	Note: i'm not sure we want to go this way. Maybe we want to canonicalize the other way around? (Another unrelated question could be, do we want to "canonicalize" all `add nuw i32 %val, -1` to `xor i32 %val, -1`? I'm not sure here, because we can't go from `not` to `dec` as per alive, and the latter does not produce lea)

lebedev.ri mentioned this in D47452: [NFC][X86][AArch64] Reorganize/cleanup BZHI test patterns.May 28 2018, 8:11 AM

lebedev.ri mentioned this in D47453: [X86] Emit BZHI when mask is ~(-1 << nbits)).

This is a good IR canonicalization regardless of what happens in the backend because a 'not' is better for bit-tracking analysis and other transforms than an 'add'.

I don't understand the nuw question:
https://rise4fun.com/Alive/ydD

We're missing this in instsimplify?

In D47428#1117675, @spatel wrote:

This is a good IR canonicalization regardless of what happens in the backend because a 'not' is better for bit-tracking analysis and other transforms than an 'add'.

Yep, i thought so too..

I don't understand the nuw question:
https://rise4fun.com/Alive/ydD

We're missing this in instsimplify?

I think we do? https://godbolt.org/g/rT3k7v
But i was only thinking about replacing and nuw with xor there.

In D47428#1117688, @lebedev.ri wrote:

In D47428#1117675, @spatel wrote:

This is a good IR canonicalization regardless of what happens in the backend because a 'not' is better for bit-tracking analysis and other transforms than an 'add'.

Yep, i thought so too..

I don't understand the nuw question:
https://rise4fun.com/Alive/ydD

We're missing this in instsimplify?

I think we do? https://godbolt.org/g/rT3k7v
But i was only thinking about replacing and nuw with xor there.

I'm still not clear on the nuw question:
https://godbolt.org/g/mTHjhc

Are you trying to transform with a constant other than -1? If -1, then we should have simplified the IR to a constant.

Ping.

We didn't resolve the 'nuw' question - am I not seeing the scenario that you asked about?

lib/Transforms/InstCombine/InstCombineAddSub.cpp
1102 ↗	(On Diff #148763)	The comment should match what the code is doing: ~(-1 << NBits)
1103 ↗	(On Diff #148763)	The IR motivation is what I'd prefer to state here, so something like: Because a 'not' is better for bit-tracking analysis and other transforms than an 'add'.
1112 ↗	(On Diff #148763)	Is this always safe to cast rather than dyn_cast? What happens if NBits is a constant expression?

Address review notes.

In D47428#1120820, @spatel wrote:

We didn't resolve the 'nuw' question - am I not seeing the scenario that you asked about?

True.
It was just a passing-by thought.
Here i don't particularly care about that fold.
I was just thinking about profitability of transform to xor %x, -1,
and thought whether we could always do that if the input is add %x, -1.

I can submit that fold as a follow-up.

This comment has been deleted.

lib/Transforms/InstCombine/InstCombineAddSub.cpp
1112 ↗	(On Diff #148763)	New test with constant seems to work. So in this case, the constant folding seems to already happen, but i agree i guess it is better to be proactively safer here.

In D47428#1120867, @lebedev.ri wrote:

I was just thinking about profitability of transform to xor %x, -1,
and thought whether we could always do that if the input is add %x, -1.

I can submit that fold as a follow-up.

Sounds good. This patch LGTM.

This revision is now accepted and ready to land.Jun 4 2018, 7:01 AM

In D47428#1120886, @spatel wrote:

In D47428#1120867, @lebedev.ri wrote:

I was just thinking about profitability of transform to xor %x, -1,
and thought whether we could always do that if the input is add %x, -1.

I can submit that fold as a follow-up.

Sounds good. This patch LGTM.

Thank you for the review!
To not degrade x86 backend, this can't land until after D47453.

spatel added inline comments.Jun 4 2018, 7:14 AM

test/Transforms/InstCombine/set-lowbits-mask-canonicalize.ll
204–211 ↗	(On Diff #149752)	I didn't see this test initially - it's ok to add it here, but it's better to include under test/Transforms/InstSimplify instead (if there's nothing like it already there). But this doesn't exercise the case that I was thinking of. We should always constant fold simple constants like this. It's constant expressions that we (or at least me...multiple times!) forget and then result in fuzzer failures some time later. So the test will need a global variable or some other magic, but now that the code is dyn_cast'ing, it's probably a moot point.

lebedev.ri added a parent revision: D47453: [X86] Emit BZHI when mask is ~(-1 << nbits)).Jun 4 2018, 7:42 AM

Closed by commit rL334127: [InstCombine] PR37603: low bit mask canonicalization (authored by lebedevri). · Explain WhyJun 6 2018, 12:42 PM

This revision was automatically updated to reflect the committed changes.

Diffusion mentioned this in rL334124: [NFC][X86][AArch64] Reorganize/cleanup BZHI test patterns.

Diffusion mentioned this in rL334125: [X86] Emit BZHI when mask is ~(-1 << nbits)).

Diffusion mentioned this in rL334200: [NFC][InstSimplify] Add tests for shl nuw C, %x -> C fold..Jun 7 2018, 7:23 AM

lebedev.ri mentioned this in D47883: [InstSimplify] shl nuw C, %x -> C iff signbit is set on C..Jun 7 2018, 7:40 AM

Diffusion mentioned this in rL334222: [InstSimplify] shl nuw C, %x -> C iff signbit is set on C..Jun 7 2018, 1:08 PM

Diffusion mentioned this in rL334236: [NFC][InstSimplify] Add tests for add nuw %x, -1 -> -1 fold..Jun 7 2018, 2:24 PM

lebedev.ri mentioned this in D47908: [InstSimplify] add nuw %x, -1 -> -1 fold..Jun 7 2018, 2:46 PM

Diffusion mentioned this in rL334298: [InstSimplify] add nuw %x, -1 -> -1 fold..Jun 8 2018, 8:49 AM

lebedev.ri mentioned this in D48012: [AMDGPU] Recognize x & ~(-1 << y) pattern..Jun 11 2018, 4:51 AM

Diffusion mentioned this in rL334817: [AMDGPU] Recognize x & ~(-1 << y) pattern..Jun 15 2018, 3:01 AM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

InstCombine/

InstCombineAddSub.cpp

27 lines

test/

Transforms/

InstCombine/

rem.ll

4 lines

set-lowbits-mask-canonicalize.ll

80 lines

Diff 150186

llvm/trunk/lib/Transforms/InstCombine/InstCombineAddSub.cpp

Show First 20 Lines • Show All 1,090 Lines • ▼ Show 20 Lines	if (MatchRem(MulOpV, RemOpV, C1, Rem2IsSigned) &&
: Builder.CreateURem(X, NewDivisor, "urem");		: Builder.CreateURem(X, NewDivisor, "urem");
}		}
}		}
}		}

return nullptr;		return nullptr;
}		}

		/// Fold
		/// (1 << NBits) - 1
		/// Into:
		/// ~(-(1 << NBits))
		/// Because a 'not' is better for bit-tracking analysis and other transforms
		/// than an 'add'. The new shl is always nsw, and is nuw if old `and` was.
		static Instruction *canonicalizeLowbitMask(BinaryOperator &I,
		InstCombiner::BuilderTy &Builder) {
		Value *NBits;
		if (!match(&I, m_Add(m_OneUse(m_Shl(m_One(), m_Value(NBits))), m_AllOnes())))
		return nullptr;

		Constant *MinusOne = Constant::getAllOnesValue(NBits->getType());
		Value *NotMask = Builder.CreateShl(MinusOne, NBits, "notmask");
		// Be wary of constant folding.
		if (auto *BOp = dyn_cast<BinaryOperator>(NotMask)) {
		// Always NSW. But NUW propagates from `add`.
		BOp->setHasNoSignedWrap();
		BOp->setHasNoUnsignedWrap(I.hasNoUnsignedWrap());
		}

		return BinaryOperator::CreateNot(NotMask, I.getName());
		}

Instruction *InstCombiner::visitAdd(BinaryOperator &I) {		Instruction *InstCombiner::visitAdd(BinaryOperator &I) {
bool Changed = SimplifyAssociativeOrCommutative(I);		bool Changed = SimplifyAssociativeOrCommutative(I);
Value LHS = I.getOperand(0), RHS = I.getOperand(1);		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
if (Value *V =		if (Value *V =
SimplifyAddInst(LHS, RHS, I.hasNoSignedWrap(), I.hasNoUnsignedWrap(),		SimplifyAddInst(LHS, RHS, I.hasNoSignedWrap(), I.hasNoUnsignedWrap(),
SQ.getWithInstruction(&I)))		SQ.getWithInstruction(&I)))
return replaceInstUsesWith(I, V);		return replaceInstUsesWith(I, V);

▲ Show 20 Lines • Show All 235 Lines • ▼ Show 20 Lines	if (!I.hasNoSignedWrap() && willNotOverflowSignedAdd(LHS, RHS, I)) {
Changed = true;		Changed = true;
I.setHasNoSignedWrap(true);		I.setHasNoSignedWrap(true);
}		}
if (!I.hasNoUnsignedWrap() && willNotOverflowUnsignedAdd(LHS, RHS, I)) {		if (!I.hasNoUnsignedWrap() && willNotOverflowUnsignedAdd(LHS, RHS, I)) {
Changed = true;		Changed = true;
I.setHasNoUnsignedWrap(true);		I.setHasNoUnsignedWrap(true);
}		}

		if (Instruction *V = canonicalizeLowbitMask(I, Builder))
		return V;

return Changed ? &I : nullptr;		return Changed ? &I : nullptr;
}		}

Instruction *InstCombiner::visitFAdd(BinaryOperator &I) {		Instruction *InstCombiner::visitFAdd(BinaryOperator &I) {
bool Changed = SimplifyAssociativeOrCommutative(I);		bool Changed = SimplifyAssociativeOrCommutative(I);
Value LHS = I.getOperand(0), RHS = I.getOperand(1);		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
if (Value *V = SimplifyFAddInst(LHS, RHS, I.getFastMathFlags(),		if (Value *V = SimplifyFAddInst(LHS, RHS, I.getFastMathFlags(),
SQ.getWithInstruction(&I)))		SQ.getWithInstruction(&I)))
▲ Show 20 Lines • Show All 471 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/rem.ll

Show First 20 Lines • Show All 308 Lines • ▼ Show 20 Lines	;
%shl = shl i32 1, %y		%shl = shl i32 1, %y
%zext = zext i32 %shl to i64		%zext = zext i32 %shl to i64
%urem = urem i64 %x, %zext		%urem = urem i64 %x, %zext
ret i64 %urem		ret i64 %urem
}		}

define i64 @test15(i32 %x, i32 %y) {		define i64 @test15(i32 %x, i32 %y) {
; CHECK-LABEL: @test15(		; CHECK-LABEL: @test15(
; CHECK-NEXT: [[SHL:%.]] = shl nuw i32 1, [[Y:%.]]		; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[Y:%.]]
; CHECK-NEXT: [[TMP1:%.*]] = add i32 [[SHL]], -1		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[NOTMASK]], -1
; CHECK-NEXT: [[TMP2:%.]] = and i32 [[TMP1]], [[X:%.]]		; CHECK-NEXT: [[TMP2:%.]] = and i32 [[TMP1]], [[X:%.]]
; CHECK-NEXT: [[UREM:%.*]] = zext i32 [[TMP2]] to i64		; CHECK-NEXT: [[UREM:%.*]] = zext i32 [[TMP2]] to i64
; CHECK-NEXT: ret i64 [[UREM]]		; CHECK-NEXT: ret i64 [[UREM]]
;		;
%shl = shl i32 1, %y		%shl = shl i32 1, %y
%zext0 = zext i32 %shl to i64		%zext0 = zext i32 %shl to i64
%zext1 = zext i32 %x to i64		%zext1 = zext i32 %x to i64
%urem = urem i64 %zext1, %zext0		%urem = urem i64 %zext1, %zext0
▲ Show 20 Lines • Show All 328 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/set-lowbits-mask-canonicalize.ll

	Show All 11 Lines
	; ============================================================================ ;			; ============================================================================ ;
	; Most basic positive tests			; Most basic positive tests
	; ============================================================================ ;			; ============================================================================ ;

	; No no-wrap tags on shl			; No no-wrap tags on shl

	define i32 @shl_add(i32 %NBits) {			define i32 @shl_add(i32 %NBits) {
	; CHECK-LABEL: @shl_add(			; CHECK-LABEL: @shl_add(
	; CHECK-NEXT: [[SETBIT:%.]] = shl i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl i32 1, %NBits			%setbit = shl i32 1, %NBits
	%ret = add i32 %setbit, -1			%ret = add i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_add_nsw(i32 %NBits) {			define i32 @shl_add_nsw(i32 %NBits) {
	; CHECK-LABEL: @shl_add_nsw(			; CHECK-LABEL: @shl_add_nsw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl i32 1, %NBits			%setbit = shl i32 1, %NBits
	%ret = add nsw i32 %setbit, -1			%ret = add nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_add_nuw(i32 %NBits) {			define i32 @shl_add_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_add_nuw(			; CHECK-LABEL: @shl_add_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl i32 1, %NBits			%setbit = shl i32 1, %NBits
	%ret = add nuw i32 %setbit, -1			%ret = add nuw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_add_nsw_nuw(i32 %NBits) {			define i32 @shl_add_nsw_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_add_nsw_nuw(			; CHECK-LABEL: @shl_add_nsw_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl i32 1, %NBits			%setbit = shl i32 1, %NBits
	%ret = add nuw nsw i32 %setbit, -1			%ret = add nuw nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	; shl is nsw			; shl is nsw

	define i32 @shl_nsw_add(i32 %NBits) {			define i32 @shl_nsw_add(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_add(			; CHECK-LABEL: @shl_nsw_add(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nsw i32 1, %NBits			%setbit = shl nsw i32 1, %NBits
	%ret = add i32 %setbit, -1			%ret = add i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_add_nsw(i32 %NBits) {			define i32 @shl_nsw_add_nsw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_add_nsw(			; CHECK-LABEL: @shl_nsw_add_nsw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nsw i32 1, %NBits			%setbit = shl nsw i32 1, %NBits
	%ret = add nsw i32 %setbit, -1			%ret = add nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_add_nuw(i32 %NBits) {			define i32 @shl_nsw_add_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_add_nuw(			; CHECK-LABEL: @shl_nsw_add_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nsw i32 1, %NBits			%setbit = shl nsw i32 1, %NBits
	%ret = add nuw i32 %setbit, -1			%ret = add nuw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_add_nsw_nuw(i32 %NBits) {			define i32 @shl_nsw_add_nsw_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_add_nsw_nuw(			; CHECK-LABEL: @shl_nsw_add_nsw_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nsw i32 1, %NBits			%setbit = shl nsw i32 1, %NBits
	%ret = add nuw nsw i32 %setbit, -1			%ret = add nuw nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	; shl is nuw			; shl is nuw

	define i32 @shl_nuw_add(i32 %NBits) {			define i32 @shl_nuw_add(i32 %NBits) {
	; CHECK-LABEL: @shl_nuw_add(			; CHECK-LABEL: @shl_nuw_add(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw i32 1, %NBits			%setbit = shl nuw i32 1, %NBits
	%ret = add i32 %setbit, -1			%ret = add i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nuw_add_nsw(i32 %NBits) {			define i32 @shl_nuw_add_nsw(i32 %NBits) {
	; CHECK-LABEL: @shl_nuw_add_nsw(			; CHECK-LABEL: @shl_nuw_add_nsw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw i32 1, %NBits			%setbit = shl nuw i32 1, %NBits
	%ret = add nsw i32 %setbit, -1			%ret = add nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nuw_add_nuw(i32 %NBits) {			define i32 @shl_nuw_add_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nuw_add_nuw(			; CHECK-LABEL: @shl_nuw_add_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw i32 1, %NBits			%setbit = shl nuw i32 1, %NBits
	%ret = add nuw i32 %setbit, -1			%ret = add nuw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nuw_add_nsw_nuw(i32 %NBits) {			define i32 @shl_nuw_add_nsw_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nuw_add_nsw_nuw(			; CHECK-LABEL: @shl_nuw_add_nsw_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw i32 1, %NBits			%setbit = shl nuw i32 1, %NBits
	%ret = add nuw nsw i32 %setbit, -1			%ret = add nuw nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	; shl is nuw nsw			; shl is nuw nsw

	define i32 @shl_nsw_nuw_add(i32 %NBits) {			define i32 @shl_nsw_nuw_add(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_nuw_add(			; CHECK-LABEL: @shl_nsw_nuw_add(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw nsw i32 1, %NBits			%setbit = shl nuw nsw i32 1, %NBits
	%ret = add i32 %setbit, -1			%ret = add i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_nuw_add_nsw(i32 %NBits) {			define i32 @shl_nsw_nuw_add_nsw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_nuw_add_nsw(			; CHECK-LABEL: @shl_nsw_nuw_add_nsw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw nsw i32 1, %NBits			%setbit = shl nuw nsw i32 1, %NBits
	%ret = add nsw i32 %setbit, -1			%ret = add nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_nuw_add_nuw(i32 %NBits) {			define i32 @shl_nsw_nuw_add_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_nuw_add_nuw(			; CHECK-LABEL: @shl_nsw_nuw_add_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw nsw i32 1, %NBits			%setbit = shl nuw nsw i32 1, %NBits
	%ret = add nuw i32 %setbit, -1			%ret = add nuw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	define i32 @shl_nsw_nuw_add_nsw_nuw(i32 %NBits) {			define i32 @shl_nsw_nuw_add_nsw_nuw(i32 %NBits) {
	; CHECK-LABEL: @shl_nsw_nuw_add_nsw_nuw(			; CHECK-LABEL: @shl_nsw_nuw_add_nsw_nuw(
	; CHECK-NEXT: [[SETBIT:%.]] = shl nuw nsw i32 1, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nuw nsw i32 -1, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add nuw nsw i32 [[SETBIT]], -1			; CHECK-NEXT: [[RET:%.*]] = xor i32 [[NOTMASK]], -1
	; CHECK-NEXT: ret i32 [[RET]]			; CHECK-NEXT: ret i32 [[RET]]
	;			;
	%setbit = shl nuw nsw i32 1, %NBits			%setbit = shl nuw nsw i32 1, %NBits
	%ret = add nuw nsw i32 %setbit, -1			%ret = add nuw nsw i32 %setbit, -1
	ret i32 %ret			ret i32 %ret
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; Vectors			; Vectors
	; ============================================================================ ;			; ============================================================================ ;

	define <2 x i32> @shl_add_vec(<2 x i32> %NBits) {			define <2 x i32> @shl_add_vec(<2 x i32> %NBits) {
	; CHECK-LABEL: @shl_add_vec(			; CHECK-LABEL: @shl_add_vec(
	; CHECK-NEXT: [[SETBIT:%.]] = shl <2 x i32> <i32 1, i32 1>, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw <2 x i32> <i32 -1, i32 -1>, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add <2 x i32> [[SETBIT]], <i32 -1, i32 -1>			; CHECK-NEXT: [[RET:%.*]] = xor <2 x i32> [[NOTMASK]], <i32 -1, i32 -1>
	; CHECK-NEXT: ret <2 x i32> [[RET]]			; CHECK-NEXT: ret <2 x i32> [[RET]]
	;			;
	%setbit = shl <2 x i32> <i32 1, i32 1>, %NBits			%setbit = shl <2 x i32> <i32 1, i32 1>, %NBits
	%ret = add <2 x i32> %setbit, <i32 -1, i32 -1>			%ret = add <2 x i32> %setbit, <i32 -1, i32 -1>
	ret <2 x i32> %ret			ret <2 x i32> %ret
	}			}

	define <3 x i32> @shl_add_vec_undef0(<3 x i32> %NBits) {			define <3 x i32> @shl_add_vec_undef0(<3 x i32> %NBits) {
	; CHECK-LABEL: @shl_add_vec_undef0(			; CHECK-LABEL: @shl_add_vec_undef0(
	; CHECK-NEXT: [[SETBIT:%.]] = shl <3 x i32> <i32 1, i32 undef, i32 1>, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw <3 x i32> <i32 -1, i32 -1, i32 -1>, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add <3 x i32> [[SETBIT]], <i32 -1, i32 -1, i32 -1>			; CHECK-NEXT: [[RET:%.*]] = xor <3 x i32> [[NOTMASK]], <i32 -1, i32 -1, i32 -1>
	; CHECK-NEXT: ret <3 x i32> [[RET]]			; CHECK-NEXT: ret <3 x i32> [[RET]]
	;			;
	%setbit = shl <3 x i32> <i32 1, i32 undef, i32 1>, %NBits			%setbit = shl <3 x i32> <i32 1, i32 undef, i32 1>, %NBits
	%ret = add <3 x i32> %setbit, <i32 -1, i32 -1, i32 -1>			%ret = add <3 x i32> %setbit, <i32 -1, i32 -1, i32 -1>
	ret <3 x i32> %ret			ret <3 x i32> %ret
	}			}

	define <3 x i32> @shl_add_vec_undef1(<3 x i32> %NBits) {			define <3 x i32> @shl_add_vec_undef1(<3 x i32> %NBits) {
	; CHECK-LABEL: @shl_add_vec_undef1(			; CHECK-LABEL: @shl_add_vec_undef1(
	; CHECK-NEXT: [[SETBIT:%.]] = shl <3 x i32> <i32 1, i32 1, i32 1>, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw <3 x i32> <i32 -1, i32 -1, i32 -1>, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add <3 x i32> [[SETBIT]], <i32 -1, i32 undef, i32 -1>			; CHECK-NEXT: [[RET:%.*]] = xor <3 x i32> [[NOTMASK]], <i32 -1, i32 -1, i32 -1>
	; CHECK-NEXT: ret <3 x i32> [[RET]]			; CHECK-NEXT: ret <3 x i32> [[RET]]
	;			;
	%setbit = shl <3 x i32> <i32 1, i32 1, i32 1>, %NBits			%setbit = shl <3 x i32> <i32 1, i32 1, i32 1>, %NBits
	%ret = add <3 x i32> %setbit, <i32 -1, i32 undef, i32 -1>			%ret = add <3 x i32> %setbit, <i32 -1, i32 undef, i32 -1>
	ret <3 x i32> %ret			ret <3 x i32> %ret
	}			}

	define <3 x i32> @shl_add_vec_undef2(<3 x i32> %NBits) {			define <3 x i32> @shl_add_vec_undef2(<3 x i32> %NBits) {
	; CHECK-LABEL: @shl_add_vec_undef2(			; CHECK-LABEL: @shl_add_vec_undef2(
	; CHECK-NEXT: [[SETBIT:%.]] = shl <3 x i32> <i32 1, i32 undef, i32 1>, [[NBITS:%.]]			; CHECK-NEXT: [[NOTMASK:%.]] = shl nsw <3 x i32> <i32 -1, i32 -1, i32 -1>, [[NBITS:%.]]
	; CHECK-NEXT: [[RET:%.*]] = add <3 x i32> [[SETBIT]], <i32 -1, i32 undef, i32 -1>			; CHECK-NEXT: [[RET:%.*]] = xor <3 x i32> [[NOTMASK]], <i32 -1, i32 -1, i32 -1>
	; CHECK-NEXT: ret <3 x i32> [[RET]]			; CHECK-NEXT: ret <3 x i32> [[RET]]
	;			;
	%setbit = shl <3 x i32> <i32 1, i32 undef, i32 1>, %NBits			%setbit = shl <3 x i32> <i32 1, i32 undef, i32 1>, %NBits
	%ret = add <3 x i32> %setbit, <i32 -1, i32 undef, i32 -1>			%ret = add <3 x i32> %setbit, <i32 -1, i32 undef, i32 -1>
	ret <3 x i32> %ret			ret <3 x i32> %ret
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	▲ Show 20 Lines • Show All 68 Lines • Show Last 20 Lines