This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
1/2
InstCombineMulDivRem.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
div.ll
2/5
sdiv-canonicalize.ll

Differential D60395

[InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y)
ClosedPublic

Authored by shchenz on Apr 7 2019, 9:45 PM.

Download Raw Diff

Details

Reviewers

spatel
lebedev.ri
RKSimon

Commits

rG5e13ff1da20b: [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).
rL358050: [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).
rG1383a9168948: [InstCombine] [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).
rL358017: [InstCombine] [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).

Summary

int foo(int i, int j)
{
  int res = -( i / j);
  int res2 = -i / j;
  return res + res2;
}

Currently, we get:

define dso_local i32 @foo(i32, i32) local_unnamed_addr #0 {
  %3 = sdiv i32 %0, %1
  %4 = sub nsw i32 0, %0
  %5 = sdiv i32 %4, %1
  %6 = sub i32 %5, %3
  ret i32 %6
}

With this fold we will get

define dso_local i32 @foo(i32, i32) local_unnamed_addr #0 {
  %3 = sdiv i32 %0, %1
  %4 = sdiv i32 %0, %1
  %5 = sub nsw i32 0, %4
  %6 = sub i32 %5, %3
  ret i32 %6
}

Which will then get folded into

define dso_local i32 @foo(i32, i32) local_unnamed_addr #0 {
  %3 = sdiv i32 %0, %1
  %factor = mul i32 %3, -2
  ret i32 %factor
}

So we end with just one division:
https://godbolt.org/z/gSCKQ1

Diff Detail

Event Timeline

shchenz created this revision.Apr 7 2019, 9:45 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 7 2019, 9:45 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

move exact flag fixup code to a seperated patch https://reviews.llvm.org/D60396

Looks promising.

llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

1050–1051

It's not commutative like that.

----------------------------------------
Optimization: -X / Y  to  -(X / Y)
Precondition: true
  %t0 = sub nsw i8 0, %x
  %r = sdiv i8 %t0, %y
=>
  %n0 = sdiv i8 %x, %y
  %r = sub i8 0, %n0

Done: 1
Optimization is correct!

----------------------------------------
Optimization: X / -Y --> -(X / Y)
Precondition: true
  %t0 = sub nsw i8 0, %y
  %r = sdiv i8 %x, %t0
=>
  %n0 = sdiv i8 %x, %y
  %r = sub i8 0, %n0


ERROR: Domain of definedness of Target is smaller than Source's for i8 %r

Example:
%y i8 = 0xFF (255, -1)
%x i8 = 0x80 (128, -128)
%t0 i8 = 0x01 (1)
%n0 i8 = 0x80 (128, -128)
Source value: 0x80 (128, -128)
Target value: undef

(You could use https://rise4fun.com/Alive, but it appears down for the moment?)

This revision now requires changes to proceed.Apr 7 2019, 11:27 PM

Will fix it later. Thanks for your comments @lebedev.ri

llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1050–1051	hmm, seems can not canonicalize `X/-Y` ---> `-(X/Y)`. I will only focus on `-X / Y` to `-(X / Y)` for this patch. There should be same opportunity for `(X/-Y)` if `Y` is constant. Will let it to be improved in later patch.

address comments.

Now only canonicalize pattern -X/Y ---> -(X/Y).

shchenz updated this revision to Diff 194126.Apr 8 2019, 6:07 AM

lebedev.ri retitled this revision from [InstCombine] canonicalize sdiv with NEG operand to [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).Apr 8 2019, 7:25 AM

lebedev.ri edited the summary of this revision. (Show Details)

lebedev.ri set the repository for this revision to rL LLVM.

lebedev.ri edited the summary of this revision. (Show Details)

Almost there.
Since we never touch the denominator, i think we are always good re vectors.
Does this need rebasing now that the other patch has landed?

llvm/test/Transforms/InstCombine/sdiv-canonicalize.ll

Shouldn't this sub also be nsw?

----------------------------------------
Optimization: nsw preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = sdiv i8 %o0, %y
=>
  %n0 = sdiv i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!

----------------------------------------
Optimization: exact preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = sdiv exact i8 %o0, %y
=>
  %n0 = sdiv exact i8 %x, %y
  %r = sub i8 0, %n0

Done: 1
Optimization is correct!

----------------------------------------
Optimization: both preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = sdiv exact i8 %o0, %y
=>
  %n0 = sdiv exact i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!

There is a test that will show that we propagate exact?

Given that this doesn't fall into endless loop, i guess we don't do reverse transform.
I'm just wondering, this is the direction we should be doing it? @spatel

In D60395#1458421, @lebedev.ri wrote:

Given that this doesn't fall into endless loop, i guess we don't do reverse transform.
I'm just wondering, this is the direction we should be doing it? @spatel

We already have these for mul:

// -X * Y --> -(X * Y)
// X * -Y --> -(X * Y)

...so that provides some precedent for the direction to move the negation.

Is there a sibling fold for srem?

In D60395#1458889, @spatel wrote:
In D60395#1458421, @lebedev.ri wrote:

Given that this doesn't fall into endless loop, i guess we don't do reverse transform.
I'm just wondering, this is the direction we should be doing it? @spatel

We already have these for mul:
// -X * Y --> -(X * Y)
// X * -Y --> -(X * Y)
...so that provides some precedent for the direction to move the negation.

Okay, that confirmed my suspicions.

Is there a sibling fold for srem?

Yep!

----------------------------------------
Optimization: nsw preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = srem i8 %o0, %y
=>
  %n0 = srem i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!

shchenz marked 2 inline comments as done.Apr 8 2019, 7:22 PM

shchenz added inline comments.

llvm/test/Transforms/InstCombine/sdiv-canonicalize.ll
7	I have one question here before I set `sub` with `nsw`. %o0 = sub nsw i8 0, %x %r = sdiv i8 %o0, %y => %n0 = sdiv i8 %x, %y %r = sub i8 0, %n0 Is this a valid transformation? I know we get a affirmative answer in https://rise4fun.com/Alive/9G4. But assume this input `%x` is -128, `%y` is -2, for the source `%o0` is a position value so `%r` is a position value too. But for the target `%n0` is 64, and `%r` is -64? So source `%r` and target `%r` is not equal? Can you please help to point out what's wrong here? Thanks @lebedev.ri @spatel
11	will add one.

lebedev.ri added inline comments.Apr 8 2019, 11:42 PM

llvm/test/Transforms/InstCombine/sdiv-canonicalize.ll
7	`-128` is `SINT_MIN` for `i8`, therefore the `sub nsw 0, -128` is UB since it overflows despite the `nsw` flag. https://godbolt.org/z/sWekHw Therefore any and every answer is correct.

address comments.

In D60395#1459712, @shchenz wrote:

address comments.

Ok, thank you, looks good now.

In D60395#1458897, @lebedev.ri wrote:

----------------------------------------
Optimization: nsw preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = srem i8 %o0, %y
=>
  %n0 = srem i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!

@shchenz if you don't intend to immediately-ish submit a patch for that sibling pattern, could you please file a bug?

This revision is now accepted and ready to land.Apr 9 2019, 7:09 AM

In D60395#1459781, @lebedev.ri wrote:
In D60395#1459712, @shchenz wrote:

address comments.

Ok, thank you, looks good now.
In D60395#1458897, @lebedev.ri wrote:
----------------------------------------
Optimization: nsw preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = srem i8 %o0, %y
=>
  %n0 = srem i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!
@shchenz if you don't intend to immediately-ish submit a patch for that sibling pattern, could you please file a bug?

Sure, https://bugs.llvm.org/show_bug.cgi?id=41443 is filed.

In D60395#1460010, @shchenz wrote:
In D60395#1459781, @lebedev.ri wrote:
In D60395#1459712, @shchenz wrote:

address comments.

Ok, thank you, looks good now.
In D60395#1458897, @lebedev.ri wrote:
----------------------------------------
Optimization: nsw preserved
Precondition: true
  %o0 = sub nsw i8 0, %x
  %r = srem i8 %o0, %y
=>
  %n0 = srem i8 %x, %y
  %r = sub nsw i8 0, %n0

Done: 1
Optimization is correct!
@shchenz if you don't intend to immediately-ish submit a patch for that sibling pattern, could you please file a bug?
Sure, https://bugs.llvm.org/show_bug.cgi?id=41443 is filed.

Thank you.

Closed by commit rL358017: [InstCombine] [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y). (authored by shchenz). · Explain WhyApr 9 2019, 9:35 AM

This revision was automatically updated to reflect the committed changes.

@shchenz how did rL358017 end up committing this diff into completely different place in this function?

lebedev.ri reopened this revision.Apr 9 2019, 11:45 AM

This revision is now accepted and ready to land.Apr 9 2019, 11:45 AM

lebedev.ri requested changes to this revision.Apr 9 2019, 11:45 AM

This revision now requires changes to proceed.Apr 9 2019, 11:45 AM

lebedev.ri mentioned this in D60478: [InstCombine] Fix canonicalization of (-X s/ Y) to -(X s/ Y)..Apr 9 2019, 11:47 AM

In D60395#1460248, @lebedev.ri wrote:

@shchenz how did rL358017 end up committing this diff into completely different place in this function?

Sorry, my bad. My remote dev machine is very slow yesterday night so I did not do a make check-all test before I committed the code. And I didn't reliaze that git merges my code into a wrong place in file lib/Transforms/InstCombine/InstCombineMulDivRem.cpp which was also changed by other patch yesterday.

I will send out another patch later.

rebased.

Sorry for breaking down unit testing. Could you please help to do another review for this patch. Thanks a lot.

LG, please do run at least the check-llvm before committing..

This revision is now accepted and ready to land.Apr 9 2019, 11:22 PM

In D60395#1460782, @lebedev.ri wrote:

LG, please do run at least the check-llvm before committing..

absolutely will. ^-^

Closed by commit rL358050: [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y). (authored by shchenz). · Explain WhyApr 9 2019, 11:51 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

InstCombine/

InstCombineMulDivRem.cpp

6 lines

test/

Transforms/

InstCombine/

div.ll

12 lines

sdiv-canonicalize.ll

18 lines

Diff 194445

llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

Show First 20 Lines • Show All 1,041 Lines • ▼ Show 20 Lines	if (match(Op0, m_OneUse(m_SExt(m_Value(Op0Src)))) &&
ConstantExpr::getTrunc(cast<Constant>(Op1), Op0Src->getType());		ConstantExpr::getTrunc(cast<Constant>(Op1), Op0Src->getType());
Value *NarrowOp = Builder.CreateSDiv(Op0Src, NarrowDivisor);		Value *NarrowOp = Builder.CreateSDiv(Op0Src, NarrowDivisor);
return new SExtInst(NarrowOp, Op0->getType());		return new SExtInst(NarrowOp, Op0->getType());
}		}

// -X / C --> X / -C (if the negation doesn't overflow).		// -X / C --> X / -C (if the negation doesn't overflow).
// TODO: This could be enhanced to handle arbitrary vector constants by		// TODO: This could be enhanced to handle arbitrary vector constants by
// checking if all elements are not the min-signed-val.		// checking if all elements are not the min-signed-val.
if (!Op1C->isMinSignedValue() &&		if (!Op1C->isMinSignedValue() &&
match(Op0, m_NSWSub(m_Zero(), m_Value(X)))) {		match(Op0, m_NSWSub(m_Zero(), m_Value(X)))) {
		lebedev.riUnsubmitted Not Done Reply Inline Actions It's not commutative like that. ---------------------------------------- Optimization: -X / Y to -(X / Y) Precondition: true %t0 = sub nsw i8 0, %x %r = sdiv i8 %t0, %y => %n0 = sdiv i8 %x, %y %r = sub i8 0, %n0 Done: 1 Optimization is correct! ---------------------------------------- Optimization: X / -Y --> -(X / Y) Precondition: true %t0 = sub nsw i8 0, %y %r = sdiv i8 %x, %t0 => %n0 = sdiv i8 %x, %y %r = sub i8 0, %n0 ERROR: Domain of definedness of Target is smaller than Source's for i8 %r Example: %y i8 = 0xFF (255, -1) %x i8 = 0x80 (128, -128) %t0 i8 = 0x01 (1) %n0 i8 = 0x80 (128, -128) Source value: 0x80 (128, -128) Target value: undef (You could use https://rise4fun.com/Alive, but it appears down for the moment?) lebedev.ri: It's not commutative like that. ``` ---------------------------------------- Optimization: -X /…
		shchenzAuthorUnsubmitted Done Reply Inline Actions hmm, seems can not canonicalize `X/-Y` ---> `-(X/Y)`. I will only focus on `-X / Y` to `-(X / Y)` for this patch. There should be same opportunity for `(X/-Y)` if `Y` is constant. Will let it to be improved in later patch. shchenz: hmm, seems can not canonicalize `X/-Y ` ---> `-(X/Y)`. I will only focus on `-X / Y` to `-(X…
Constant NegC = ConstantInt::get(I.getType(), -(Op1C));		Constant NegC = ConstantInt::get(I.getType(), -(Op1C));
Instruction *BO = BinaryOperator::CreateSDiv(X, NegC);		Instruction *BO = BinaryOperator::CreateSDiv(X, NegC);
BO->setIsExact(I.isExact());		BO->setIsExact(I.isExact());
return BO;		return BO;
}		}
}		}

		// -X / Y --> -(X / Y)
		Value *Y;
		if (match(&I, m_SDiv(m_OneUse(m_NSWSub(m_Zero(), m_Value(X))), m_Value(Y))))
		return BinaryOperator::CreateNSWNeg(
		Builder.CreateSDiv(X, Y, I.getName(), I.isExact()));

// If the sign bits of both operands are zero (i.e. we can prove they are		// If the sign bits of both operands are zero (i.e. we can prove they are
// unsigned inputs), turn this into a udiv.		// unsigned inputs), turn this into a udiv.
APInt Mask(APInt::getSignMask(I.getType()->getScalarSizeInBits()));		APInt Mask(APInt::getSignMask(I.getType()->getScalarSizeInBits()));
if (MaskedValueIsZero(Op0, Mask, 0, &I)) {		if (MaskedValueIsZero(Op0, Mask, 0, &I)) {
if (MaskedValueIsZero(Op1, Mask, 0, &I)) {		if (MaskedValueIsZero(Op1, Mask, 0, &I)) {
// X sdiv Y -> X udiv Y, iff X and Y don't have sign bit set		// X sdiv Y -> X udiv Y, iff X and Y don't have sign bit set
auto *BO = BinaryOperator::CreateUDiv(Op0, Op1, I.getName());		auto *BO = BinaryOperator::CreateUDiv(Op0, Op1, I.getName());
BO->setIsExact(I.isExact());		BO->setIsExact(I.isExact());
▲ Show 20 Lines • Show All 342 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/div.ll

	Show First 20 Lines • Show All 516 Lines • ▼ Show 20 Lines
	;			;
	%neg = sub nsw <2 x i8> zeroinitializer, %x			%neg = sub nsw <2 x i8> zeroinitializer, %x
	%d = sdiv <2 x i8> %neg, <i8 -128, i8 undef>			%d = sdiv <2 x i8> %neg, <i8 -128, i8 undef>
	ret <2 x i8> %d			ret <2 x i8> %d
	}			}

	define <2 x i64> @sdiv_negated_dividend_constant_divisor_vec(<2 x i64> %x) {			define <2 x i64> @sdiv_negated_dividend_constant_divisor_vec(<2 x i64> %x) {
	; CHECK-LABEL: @sdiv_negated_dividend_constant_divisor_vec(			; CHECK-LABEL: @sdiv_negated_dividend_constant_divisor_vec(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw <2 x i64> zeroinitializer, [[X:%.]]			; CHECK-NEXT: [[DIV1:%.]] = sdiv <2 x i64> [[X:%.]], <i64 3, i64 4>
	; CHECK-NEXT: [[DIV:%.*]] = sdiv <2 x i64> [[NEG]], <i64 3, i64 4>			; CHECK-NEXT: [[DIV:%.*]] = sub nsw <2 x i64> zeroinitializer, [[DIV1]]
	; CHECK-NEXT: ret <2 x i64> [[DIV]]			; CHECK-NEXT: ret <2 x i64> [[DIV]]
	;			;
	%neg = sub nsw <2 x i64> zeroinitializer, %x			%neg = sub nsw <2 x i64> zeroinitializer, %x
	%div = sdiv <2 x i64> %neg, <i64 3, i64 4>			%div = sdiv <2 x i64> %neg, <i64 3, i64 4>
	ret <2 x i64> %div			ret <2 x i64> %div
	}			}

	define <2 x i64> @sdiv_exact_negated_dividend_constant_divisor_vec(<2 x i64> %x) {			define <2 x i64> @sdiv_exact_negated_dividend_constant_divisor_vec(<2 x i64> %x) {
	; CHECK-LABEL: @sdiv_exact_negated_dividend_constant_divisor_vec(			; CHECK-LABEL: @sdiv_exact_negated_dividend_constant_divisor_vec(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw <2 x i64> zeroinitializer, [[X:%.]]			; CHECK-NEXT: [[DIV1:%.]] = sdiv exact <2 x i64> [[X:%.]], <i64 3, i64 4>
	; CHECK-NEXT: [[DIV:%.*]] = sdiv exact <2 x i64> [[NEG]], <i64 3, i64 4>			; CHECK-NEXT: [[DIV:%.*]] = sub nsw <2 x i64> zeroinitializer, [[DIV1]]
	; CHECK-NEXT: ret <2 x i64> [[DIV]]			; CHECK-NEXT: ret <2 x i64> [[DIV]]
	;			;
	%neg = sub nsw <2 x i64> zeroinitializer, %x			%neg = sub nsw <2 x i64> zeroinitializer, %x
	%div = sdiv exact <2 x i64> %neg, <i64 3, i64 4>			%div = sdiv exact <2 x i64> %neg, <i64 3, i64 4>
	ret <2 x i64> %div			ret <2 x i64> %div
	}			}

	; Can't negate signed min vector element.			; Can't negate signed min vector element.

	define <2 x i8> @sdiv_exact_negated_dividend_constant_divisor_vec_overflow(<2 x i8> %x) {			define <2 x i8> @sdiv_exact_negated_dividend_constant_divisor_vec_overflow(<2 x i8> %x) {
	; CHECK-LABEL: @sdiv_exact_negated_dividend_constant_divisor_vec_overflow(			; CHECK-LABEL: @sdiv_exact_negated_dividend_constant_divisor_vec_overflow(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw <2 x i8> zeroinitializer, [[X:%.]]			; CHECK-NEXT: [[DIV1:%.]] = sdiv exact <2 x i8> [[X:%.]], <i8 -128, i8 42>
	; CHECK-NEXT: [[DIV:%.*]] = sdiv exact <2 x i8> [[NEG]], <i8 -128, i8 42>			; CHECK-NEXT: [[DIV:%.*]] = sub nsw <2 x i8> zeroinitializer, [[DIV1]]
	; CHECK-NEXT: ret <2 x i8> [[DIV]]			; CHECK-NEXT: ret <2 x i8> [[DIV]]
	;			;
	%neg = sub nsw <2 x i8> zeroinitializer, %x			%neg = sub nsw <2 x i8> zeroinitializer, %x
	%div = sdiv exact <2 x i8> %neg, <i8 -128, i8 42>			%div = sdiv exact <2 x i8> %neg, <i8 -128, i8 42>
	ret <2 x i8> %div			ret <2 x i8> %div
	}			}

	define i32 @test35(i32 %A) {			define i32 @test35(i32 %A) {
	▲ Show 20 Lines • Show All 491 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/sdiv-canonicalize.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	define i32 @test_sdiv_canonicalize_op0(i32 %x, i32 %y) {			define i32 @test_sdiv_canonicalize_op0(i32 %x, i32 %y) {
	; CHECK-LABEL: @test_sdiv_canonicalize_op0(			; CHECK-LABEL: @test_sdiv_canonicalize_op0(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]			; CHECK-NEXT: [[SDIV1:%.]] = sdiv i32 [[X:%.]], [[Y:%.*]]
	; CHECK-NEXT: [[SDIV:%.]] = sdiv i32 [[NEG]], [[Y:%.]]			; CHECK-NEXT: [[SDIV:%.*]] = sub nsw i32 0, [[SDIV1]]
				lebedev.riUnsubmitted Not Done Reply Inline Actions Shouldn't this `sub` also be `nsw`? ---------------------------------------- Optimization: nsw preserved Precondition: true %o0 = sub nsw i8 0, %x %r = sdiv i8 %o0, %y => %n0 = sdiv i8 %x, %y %r = sub nsw i8 0, %n0 Done: 1 Optimization is correct! ---------------------------------------- Optimization: exact preserved Precondition: true %o0 = sub nsw i8 0, %x %r = sdiv exact i8 %o0, %y => %n0 = sdiv exact i8 %x, %y %r = sub i8 0, %n0 Done: 1 Optimization is correct! ---------------------------------------- Optimization: both preserved Precondition: true %o0 = sub nsw i8 0, %x %r = sdiv exact i8 %o0, %y => %n0 = sdiv exact i8 %x, %y %r = sub nsw i8 0, %n0 Done: 1 Optimization is correct! lebedev.ri: Shouldn't this `sub` also be `nsw`? ``` ---------------------------------------- Optimization…
				shchenzAuthorUnsubmitted Done Reply Inline Actions I have one question here before I set `sub` with `nsw`. %o0 = sub nsw i8 0, %x %r = sdiv i8 %o0, %y => %n0 = sdiv i8 %x, %y %r = sub i8 0, %n0 Is this a valid transformation? I know we get a affirmative answer in https://rise4fun.com/Alive/9G4. But assume this input `%x` is -128, `%y` is -2, for the source `%o0` is a position value so `%r` is a position value too. But for the target `%n0` is 64, and `%r` is -64? So source `%r` and target `%r` is not equal? Can you please help to point out what's wrong here? Thanks @lebedev.ri @spatel shchenz: I have one question here before I set `sub` with `nsw`. ``` %o0 = sub nsw i8 0, %x %r =…
				lebedev.riUnsubmitted Not Done Reply Inline Actions `-128` is `SINT_MIN` for `i8`, therefore the `sub nsw 0, -128` is UB since it overflows despite the `nsw` flag. https://godbolt.org/z/sWekHw Therefore any and every answer is correct. lebedev.ri: `-128` is `SINT_MIN` for `i8`, therefore the `sub nsw 0, -128` is UB since it overflows despite…
	; CHECK-NEXT: ret i32 [[SDIV]]			; CHECK-NEXT: ret i32 [[SDIV]]
	;			;
	%neg = sub nsw i32 0, %x			%neg = sub nsw i32 0, %x
	%sdiv = sdiv i32 %neg, %y			%sdiv = sdiv i32 %neg, %y
				lebedev.riUnsubmitted Not Done Reply Inline Actions There is a test that will show that we propagate `exact`? lebedev.ri: There is a test that will show that we propagate `exact`?
				shchenzAuthorUnsubmitted Done Reply Inline Actions will add one. shchenz: will add one.
	ret i32 %sdiv			ret i32 %sdiv
	}			}

	define i32 @test_sdiv_canonicalize_op0_exact(i32 %x, i32 %y) {			define i32 @test_sdiv_canonicalize_op0_exact(i32 %x, i32 %y) {
	; CHECK-LABEL: @test_sdiv_canonicalize_op0_exact(			; CHECK-LABEL: @test_sdiv_canonicalize_op0_exact(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]			; CHECK-NEXT: [[SDIV1:%.]] = sdiv exact i32 [[X:%.]], [[Y:%.*]]
	; CHECK-NEXT: [[SDIV:%.]] = sdiv exact i32 [[NEG]], [[Y:%.]]			; CHECK-NEXT: [[SDIV:%.*]] = sub nsw i32 0, [[SDIV1]]
	; CHECK-NEXT: ret i32 [[SDIV]]			; CHECK-NEXT: ret i32 [[SDIV]]
	;			;
	%neg = sub nsw i32 0, %x			%neg = sub nsw i32 0, %x
	%sdiv = sdiv exact i32 %neg, %y			%sdiv = sdiv exact i32 %neg, %y
	ret i32 %sdiv			ret i32 %sdiv
	}			}

				; (X/-Y) is not equal to -(X/Y), don't canonicalize.
	define i32 @test_sdiv_canonicalize_op1(i32 %x, i32 %z) {			define i32 @test_sdiv_canonicalize_op1(i32 %x, i32 %z) {
	; CHECK-LABEL: @test_sdiv_canonicalize_op1(			; CHECK-LABEL: @test_sdiv_canonicalize_op1(
	; CHECK-NEXT: [[Y:%.]] = mul i32 [[Z:%.]], 3			; CHECK-NEXT: [[Y:%.]] = mul i32 [[Z:%.]], 3
	; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]			; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]
	; CHECK-NEXT: [[SDIV:%.*]] = sdiv i32 [[Y]], [[NEG]]			; CHECK-NEXT: [[SDIV:%.*]] = sdiv i32 [[Y]], [[NEG]]
	; CHECK-NEXT: ret i32 [[SDIV]]			; CHECK-NEXT: ret i32 [[SDIV]]
	;			;
	%y = mul i32 %z, 3			%y = mul i32 %z, 3
	Show All 10 Lines
	;			;
	%neg = sub i32 0, %x			%neg = sub i32 0, %x
	%sdiv = sdiv i32 %neg, %y			%sdiv = sdiv i32 %neg, %y
	ret i32 %sdiv			ret i32 %sdiv
	}			}

	define <2 x i32> @test_sdiv_canonicalize_vec(<2 x i32> %x, <2 x i32> %y) {			define <2 x i32> @test_sdiv_canonicalize_vec(<2 x i32> %x, <2 x i32> %y) {
	; CHECK-LABEL: @test_sdiv_canonicalize_vec(			; CHECK-LABEL: @test_sdiv_canonicalize_vec(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw <2 x i32> zeroinitializer, [[X:%.]]			; CHECK-NEXT: [[SDIV1:%.]] = sdiv <2 x i32> [[X:%.]], [[Y:%.*]]
	; CHECK-NEXT: [[SDIV:%.]] = sdiv <2 x i32> [[NEG]], [[Y:%.]]			; CHECK-NEXT: [[SDIV:%.*]] = sub nsw <2 x i32> zeroinitializer, [[SDIV1]]
	; CHECK-NEXT: ret <2 x i32> [[SDIV]]			; CHECK-NEXT: ret <2 x i32> [[SDIV]]
	;			;
	%neg = sub nsw <2 x i32> <i32 0, i32 0>, %x			%neg = sub nsw <2 x i32> <i32 0, i32 0>, %x
	%sdiv = sdiv <2 x i32> %neg, %y			%sdiv = sdiv <2 x i32> %neg, %y
	ret <2 x i32> %sdiv			ret <2 x i32> %sdiv
	}			}

	define i32 @test_sdiv_canonicalize_multiple_uses(i32 %x, i32 %y) {			define i32 @test_sdiv_canonicalize_multiple_uses(i32 %x, i32 %y) {
	; CHECK-LABEL: @test_sdiv_canonicalize_multiple_uses(			; CHECK-LABEL: @test_sdiv_canonicalize_multiple_uses(
	; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]			; CHECK-NEXT: [[NEG:%.]] = sub nsw i32 0, [[X:%.]]
	; CHECK-NEXT: [[SDIV:%.]] = sdiv i32 [[NEG]], [[Y:%.]]			; CHECK-NEXT: [[SDIV:%.]] = sdiv i32 [[NEG]], [[Y:%.]]
	; CHECK-NEXT: [[SDIV2:%.*]] = sdiv i32 [[SDIV]], [[NEG]]			; CHECK-NEXT: [[SDIV2:%.*]] = sdiv i32 [[SDIV]], [[NEG]]
	; CHECK-NEXT: ret i32 [[SDIV2]]			; CHECK-NEXT: ret i32 [[SDIV2]]
	;			;
	%neg = sub nsw i32 0, %x			%neg = sub nsw i32 0, %x
	%sdiv = sdiv i32 %neg, %y			%sdiv = sdiv i32 %neg, %y
	%sdiv2 = sdiv i32 %sdiv, %neg			%sdiv2 = sdiv i32 %sdiv, %neg
	ret i32 %sdiv2			ret i32 %sdiv2
	}			}

	; There is combination: (X/-CE) -> -(X/CE)			; There is combination: -(X/CE) -> (X/-CE).
	; There is another combination: -(X/CE) -> (X/-CE)			; If combines (X/-CE) to -(X/CE), make sure don't combine them endless.
	; Make sure don't combine them endless.

	@X = global i32 5			@X = global i32 5

	define i64 @test_sdiv_canonicalize_constexpr(i64 %L1) {			define i64 @test_sdiv_canonicalize_constexpr(i64 %L1) {
	; currently opt folds (sub nsw i64 0, constexpr) -> (sub i64, 0, constexpr).			; currently opt folds (sub nsw i64 0, constexpr) -> (sub i64, 0, constexpr).
	; sdiv canonicalize requires a nsw sub.			; sdiv canonicalize requires a nsw sub.
	; CHECK-LABEL: @test_sdiv_canonicalize_constexpr(			; CHECK-LABEL: @test_sdiv_canonicalize_constexpr(
	; CHECK-NEXT: [[B4:%.]] = sdiv i64 [[L1:%.]], sub (i64 0, i64 ptrtoint (i32* @X to i64))			; CHECK-NEXT: [[B4:%.]] = sdiv i64 [[L1:%.]], sub (i64 0, i64 ptrtoint (i32* @X to i64))
	; CHECK-NEXT: ret i64 [[B4]]			; CHECK-NEXT: ret i64 [[B4]]
	;			;
	%v1 = ptrtoint i32* @X to i64			%v1 = ptrtoint i32* @X to i64
	%B8 = sub nsw i64 0, %v1			%B8 = sub nsw i64 0, %v1
	%B4 = sdiv i64 %L1, %B8			%B4 = sdiv i64 %L1, %B8
	ret i64 %B4			ret i64 %B4
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y)ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 194445

llvm/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

llvm/test/Transforms/InstCombine/div.ll

llvm/test/Transforms/InstCombine/sdiv-canonicalize.ll

[InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y)
ClosedPublic