Details

Reviewers

majnemer
• rafael
bkramer
jingyue

Commits

rGb62e52e1b52f: Refactored and updated SimplifyUsingDistributiveLaws() to * Find factorization…
rL211261: Refactored and updated SimplifyUsingDistributiveLaws() to

Summary

I tried to fix this, but I am not sure about a few things.

We always convert (X * C1) + (X * C2) to X * (C1 + C2), even if (C1 + C2) overflows, e.g.

%1 = mul nsw i32 %x, 2147483647
%2 = mul nsw i32 %x, 3
%3 = add nsw i32 %1, %2

becomes

%1 = mul i32 %x, -2147483646

Is this what we want? If so, we can blindly copy nsw from the
add instruction to the mul instruction, and I will update this patch accordingly.
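
To make the wraparound concrete, here is a minimal sketch in plain C++ (not LLVM code) that reproduces the folded constant. The helper names `signedAddOverflows32` and `wrappingAdd32` are made up for this illustration; they model i32 addition of the two constants, assuming two's complement wraparound.

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; they are not part of LLVM.

// True if a + b does not fit in a signed 32-bit integer.
bool signedAddOverflows32(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) + static_cast<int64_t>(b);
  return wide > INT32_MAX || wide < INT32_MIN;
}

// a + b reduced modulo 2^32 and reinterpreted as a signed value.
int32_t wrappingAdd32(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) + static_cast<int64_t>(b);
  if (wide > INT32_MAX) wide -= (int64_t{1} << 32);
  if (wide < INT32_MIN) wide += (int64_t{1} << 32);
  return static_cast<int32_t>(wide);
}
```

With C1 = 2147483647 and C2 = 3, `signedAddOverflows32` reports an overflow and `wrappingAdd32` yields -2147483646, the multiplier that appears in the folded mul above.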

In this patch, I have tried not to combine the add into a mul for the above case,
but that is not enough to stop the conversion (it still gets converted by
some other code). If the above conversion is wrong, I can go through the code
and fix it. For now, this patch should take care of not dropping nsw
flags.

I haven't added similar checks in visitSub. None of the C examples
I could come up with created a sub with nsw. If a similar check is
required there, I can update this patch.

Diff Detail

Event Timeline

dinesh.d updated this revision to Diff 9481. May 16 2014, 7:34 AM

dinesh.d retitled this revision from to Fixing inst-combine not to drops nsw when combining adds into mul (PR19263).

dinesh.d updated this object.

dinesh.d edited the test plan for this revision.

dinesh.d added reviewers: bkramer, • rafael, majnemer.

dinesh.d added a subscriber: Unknown Object (MLST).

gentle ping

+ if (ConstantInt *CI1 = dyn_cast<ConstantInt>(LHS))
+ if (ConstantInt *CI2 = dyn_cast<ConstantInt>(RHS)) {
+ APInt ACI1 = CI1->getValue();
+ APInt ACI2 = CI2->getValue();
+ bool IsLHSNegative = ACI1.isNegative();
+ bool IsRHSNegative = ACI2.isNegative();

If both are constants, the result should have been folded before
getting here, no?

Can you rebase on trunk? WillNotOverflowSignedAdd has learned a few
extra tricks.

rebased code on trunk

> + if (ConstantInt *CI1 = dyn_cast<ConstantInt>(LHS))
> + if (ConstantInt *CI2 = dyn_cast<ConstantInt>(RHS)) {
> + APInt ACI1 = CI1->getValue();
> + APInt ACI2 = CI2->getValue();
> + bool IsLHSNegative = ACI1.isNegative();
> + bool IsRHSNegative = ACI2.isNegative();
>
> If both are constants, the result should have been folded before
> getting here, no?

Here the two constants come from different expressions: one from LHS and
the other from RHS.

> Can you rebase on trunk? WillNotOverflowSignedAdd has learned a few
> extra tricks.

rebased.

> Here both constants are from different expression, one from LHS and other
> one is from RHS.

All tests still pass if I delete the extra logic from
WillNotOverflowSignedAdd. Can you add an extra test or split this off
into an independent patch?

Cheers,
Rafael

> All tests still pass if I delete the extra logic from
> WillNotOverflowSignedAdd. Can you add an extra test or split this off
> into an independent patch?

More importantly, it looks like this misoptimizes a few cases. For example,

define i16 @f(i16 %a) {

%b = mul i16 %a, 2
%c = mul i16 %a, 3
%d = add nsw i16 %b, %c
ret i16 %d

}

this would optimize it into

define i16 @f(i16 %a) {

%d = mul nsw i16 %a, 5
ret i16 %d

}

But I think that is invalid. There are inputs where the original add
would not overflow but the new mul will. For example:

a = 0x2aab
b = 0x5556
c = 0x8001 (the computation of c overflows, but that is fine since the mul is not nsw)
d = 0xd557 (no overflow in the add)

Cheers,
Rafael
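
The counterexample above can be checked mechanically. The following is a rough sketch in plain C++ rather than LLVM code; `Result`, `wrap16`, `fitsInI16`, and `evaluate` are names invented here, and the model assumes i16 values wrap modulo 2^16 when no nsw flag is present.

```cpp
#include <cstdint>

// Hypothetical model of the i16 example; none of these names come from LLVM.
struct Result {
  int16_t d;            // value of the original add, wrapped to 16 bits
  bool addOverflows;    // would `add nsw i16 %b, %c` overflow?
  bool mulBy5Overflows; // would `mul nsw i16 %a, 5` overflow?
};

// Reduce a wide value modulo 2^16 and reinterpret it as signed.
int16_t wrap16(int32_t v) {
  int32_t m = static_cast<int32_t>(static_cast<uint32_t>(v) & 0xFFFFu);
  return static_cast<int16_t>(m >= 0x8000 ? m - 0x10000 : m);
}

bool fitsInI16(int32_t v) { return v >= INT16_MIN && v <= INT16_MAX; }

Result evaluate(int16_t a) {
  int16_t b = wrap16(int32_t{a} * 2); // mul without nsw: allowed to wrap
  int16_t c = wrap16(int32_t{a} * 3); // mul without nsw: allowed to wrap
  int32_t wideSum = int32_t{b} + int32_t{c};
  return {wrap16(wideSum), !fitsInI16(wideSum), !fitsInI16(int32_t{a} * 5)};
}
```

For a = 0x2aab the add stays in range (d is 0xd557 when viewed as 16 unsigned bits) while a * 5 does not fit in i16, so attaching nsw to the combined mul would claim more than the original code did.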

Yes, now WillNotOverflowSignedAdd can handle cases where
one of LHS and RHS is a power of 2 and the other has a known-zero bit
in a more significant position than that bit. I will update the test accordingly, or if you
think that should go in as an independent patch, I can surely do that.

For your other comment, I think we cannot have nsw on the addition
of 2 variables if either of them is not known to have nsw.
So I think the nsw on the add is wrong. But I may be wrong.

How about this input
a = 0x2aaa
b = 0x5554
c = 0x7ffe
d = 0xd552

b and c are OK here, as the mul instructions do not guarantee either
(overflow or no overflow), but the add instruction guarantees no signed
overflow, and d is overflowing.

updated patch as per comments

There is an interesting organizational problem:

Given

define i16 @f(i16 %a) {

%b = mul nsw i16 %a, 3
%c = mul nsw i16 %a, 7
%d = add nsw i16 %b, %c
ret i16 %d

}

With your patch we produce

define i16 @f(i16 %a) {

%d = mul i16 %a, 10
ret i16 %d

}

but given

define i16 @f(i16 %a) {

%b = mul nsw i16 %a, 2
%c = mul nsw i16 %a, 7
%d = add nsw i16 %b, %c
ret i16 %d

}

it produces

define i16 @f(i16 %a) {

%d = mul nsw i16 %a, 9
ret i16 %d

}

The problem comes from some cases being transformed in
SimplifyUsingDistributiveLaws before getting to your code.

Maybe what should happen is that as an early cleanup patch
dyn_castFoldableMul should be moved and used by
SimplifyUsingDistributiveLaws which would now handle all relevant
cases.

What do you think?

Yet another thing to be careful about. The proposed patch would have
disabled the combining in

define i16 @f(i16 %a) {

%b = mul i16 %a, 2
%c = mul i16 %a, 32767
%d = add i16 %b, %c
ret i16 %d

}

I added the testcase in r210287.

I agree with the idea that these cases should be handled in SimplifyUsingDistributiveLaws, and
I am working on a generalized approach for that.

I have updated the patch just to put the following changes on record.
I have removed the check for overflow because, after analyzing a few cases, I think we should not
disable the transformation for a pattern like this even if (2 + 32767) overflows:
%b = mul i16 %a, 2
%c = mul i16 %a, 32767
%d = add i16 %b, %c
ret i16 %d

since the original case and the transformed case will produce the same output. When I pointed out a similar
case, I thought that for some value of 'a', '(b+c)' might be positive, but if we add the constants and the sum
becomes negative due to overflow, the result would mismatch.
I realized that, as 'a' is an integer, if a == 0 we get 0 in either case, and for any value of a > 0, b and c will never
be less than 2 and 32767 respectively, so the add result will always overflow.

What do you think? Should we disable the above transform?
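
One way to check the "same output" claim without reasoning about which inputs overflow is to compare both forms for every i16 value. The sketch below is plain C++, not LLVM code; `foldIsExactForAllInputs` is a name invented here, and it assumes that i16 operations without nsw/nuw behave as arithmetic modulo 2^16.

```cpp
#include <cstdint>

// Hypothetical exhaustive check, not part of the patch. Without nsw/nuw,
// i16 arithmetic is arithmetic modulo 2^16, modelled here by masking.
bool foldIsExactForAllInputs() {
  const uint32_t kMask = 0xFFFFu;
  // The folded constant 2 + 32767 wraps to -32767, i.e. 0x8001 as 16 bits.
  const uint32_t kFolded = static_cast<uint16_t>(-32767);
  for (uint32_t a = 0; a <= kMask; ++a) {
    uint32_t b = (a * 2u) & kMask;            // %b = mul i16 %a, 2
    uint32_t c = (a * 32767u) & kMask;        // %c = mul i16 %a, 32767
    uint32_t original = (b + c) & kMask;      // %d = add i16 %b, %c
    uint32_t folded = (a * kFolded) & kMask;  // %d = mul i16 %a, -32767
    if (original != folded) return false;
  }
  return true;
}
```

The function returns true because multiplication distributes over addition modulo 2^16, so the flag-free transform is exact for every input; only the nsw/nuw flags on the result need care.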

I have added a TODO to fix the transform happening in SimplifyUsingDistributiveLaws, which drops nsw.
I will update it soon so that all these patterns get handled in SimplifyUsingDistributiveLaws.

moved dyn_castFoldableMul logic to InstructionCombining.cpp.
Now SimplifyUsingDistributiveLaws handles all relevant cases.

gentle ping

Looks a lot better, thanks.

lib/Transforms/InstCombine/InstructionCombining.cpp
400 ↗	(On Diff #10270)	Please name functions starting with a lowercase letter. This function can be static.
418 ↗	(On Diff #10270)	lowercase, static. It seems a bit strange to return one operand and put the other one in an out value. Also, this function doesn't do the factorization, it just splits the expression. Also, the above function for handling 1 could be merged into this (and with that avoid the extra indentation), no? Something like

    // Try to split Op0 and Op1 into (A op B) and (C op D) with the same
    // operation in both. This handles cases like matching a shift as a mul
    // and B or D being implicitly 1.
    static bool matchForFactorization(const BinaryOperator *Op0,
                                      const BinaryOperator *Op1,
                                      Instruction::BinaryOps &OpCode,
                                      Value *&A, Value *&B,
                                      Value *&C, Value *&D)

Hi Dinesh,

It looks pretty good overall. Thanks for working on this! Please find my comments inlined.

lib/Transforms/InstCombine/InstructionCombining.cpp
445 ↗	(On Diff #10270)	I'd move these definitions closer to their uses, e.g.

    Value *B = nullptr, *D = nullptr;
    Value *A = FactorBinaryOps(Op0, B, LHSOpcode);

452 ↗	(On Diff #10270)	Correct me if I'm wrong. I think the current logic seems unable to convert

    %z = mul nsw i32 %x, %y
    %z3 = mul nsw i32 %z, 3
    %z4 = add nsw i32 %z, %z3

to

    %z = mul nsw i32 %x, %y
    %z4 = mul nsw i32 %z, 4

because if %z is in the format of %x * %y, you will not try the identity factorization (i.e., %z * 1). If it is indeed a missed opportunity, it may not be too difficult to cover. For example, we can try all four combinations:

    LHS (binary op), RHS (binary op)
    LHS (identity), RHS (binary op)
    LHS (binary op), RHS (identity)
    LHS (identity), RHS (identity) // probably already handled somewhere else?

Anyway, we can do this later.

466 ↗	(On Diff #10270)	I find the logic of this IF and the next IF (if (RightDistributesOverLeft)) a bit difficult to follow. There is also some copy and paste. What about extracting the check on InnerCommutative out? Something like:

    try(a, b, c, d);
    if (op' is commutative) {
      try(b, a, c, d);
      try(a, b, d, c);
      try(b, a, c, d);
    }

By doing this, inside each "try", we don't need to think about whether op' is commutative. What do you think?

Update patch as per review comments

Thanks for the review. I have added comments inline.

lib/Transforms/InstCombine/InstructionCombining.cpp
400 ↗	(On Diff #10270)	Updated. I have kept this function so it can also be used to get identity values for other operations, e.g. and, or, etc.
418 ↗	(On Diff #10270)	Updated the function name and added static. I have to keep both functions separate to try to explore the optimization jingyue has mentioned in his comment. I have changed the function parameters and return value. Let me know if it looks OK.
445 ↗	(On Diff #10270)	updated.
452 ↗	(On Diff #10270)	Updated. The new patch takes care of all cases except LHS (identity), RHS (identity), which gets handled before SimplifyUsingDistributiveLaws is called.
466 ↗	(On Diff #10270)	Updated the patch to handle this. We cannot just depend on isCommutative, as there are operations which are not commutative but can still be distributed, e.g. "(X + Y) / Z = X/Z + Y/Z". We do not handle this now, but we might want to in the future.

jingyue added inline comments. Jun 17 2014, 10:31 AM

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	I think the first "try" in my pseudo-code handle this case. Right? While "try" doesn't swap a and b or c and d, it considers both LeftDistributesOverRight and RightDistributesOverLeft. Therefore, it is able to convert "a / b + c / d" to "(a + c) / d".

dinesh.d added inline comments. Jun 17 2014, 1:41 PM

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	Yes, if we consider both LeftDistributesOverRight and RightDistributesOverLeft, it handles all cases. But then I think the current code is handling all cases more efficiently. If left distributes over right and the inner op is commutative, then right will also distribute over left, and vice versa. So checking the following 2 combinations if left distributes over right

    try(a, b, c, d);
    if (op' is commutative) {
      try(a, b, d, c);
    }

and the following 2 combinations if right distributes over left

    try(a, b, c, d);
    if (op' is commutative) {
      try(a, b, d, c);
    }

should be sufficient. Because for an inner commutative operation

    if try(a, b, c, d) fails for LeftDistributesOverRight then try(b, a, d, c) will fail for RightDistributesOverLeft
    if try(a, b, c, d) fails for RightDistributesOverLeft then try(b, a, d, c) will fail for LeftDistributesOverRight

similarly

    if try(b, a, c, d) fails for LeftDistributesOverRight then try(a, b, d, c) will fail for RightDistributesOverLeft
    if try(a, b, c, d) fails for RightDistributesOverLeft then try(a, b, d, c) will fail for LeftDistributesOverRight

Am I missing something?

typo :(

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	Please read last line in previous reply as if try(b, a, c, d) fails for RightDistributesOverLeft then try(a, b, d, c) will fail for LeftDistributesOverRight

Still reading the code, just wanted to post the easy bits first.

lib/Transforms/InstCombine/InstructionCombining.cpp
398 ↗	(On Diff #10499)	Nit: don't duplicate the function name in the comment.
418 ↗	(On Diff #10499)	same
442 ↗	(On Diff #10499)	here too.

Mostly nits. This is LGTM with the issues on line 552 addressed, but please wait for Jingyue Wu to confirm that he is OK with it too.

Thanks!

lib/Transforms/InstCombine/InstructionCombining.cpp
415 ↗	(On Diff #10499)	You have both a default case with a return and a return after the switch. Given that we only handle Mul, it is probably better to write this as an if for now. When we add support for other opcodes it is easy to go back to a switch.
552 ↗	(On Diff #10499)	This should be

    if (Value *V = tryFactorization(Builder, DL, I, RHSOpcode, LHS, getIdentityValue(LHSOpcode, LHS), C, D))

no?

jingyue added a reviewer: jingyue. Jun 17 2014, 10:16 PM

jingyue added inline comments.

lib/Transforms/InstCombine/InstructionCombining.cpp
481 ↗	(On Diff #10499)	Do we still need to try right distribution if left distribution already succeeds?
494 ↗	(On Diff #10499)	A question: any chance to preserve NSW/NUW for TopLevelOpcode?
508 ↗	(On Diff #10499)	I think it's better to write:

    if (BinaryOperator *Op0 = dyn_cast<BinaryOperator>(LHS)) {
      if (isa<OverflowingBinaryOperator>(Op0)) {
        HasNSW &= Op0->hasNoSignedWrap();
      }
    }

515 ↗	(On Diff #10499)	I may be over-concerned, but is this cast dangerous? CreateBinOp returns a ConstantExpr if its operands are both Constant. Note that ConstantInt op ConstantInt is not always foldable, e.g., ptrtoint (a global variable) + 5.
543 ↗	(On Diff #10499)	`(A op' B) op (C)`?
466 ↗	(On Diff #10270)	Good point. I agree try(a, b, c, d); if (op' is commutative) { try(a, b, d, c); } is enough.
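
The matching scheme agreed on here can be pictured with a small string-based model. This is only a sketch under assumptions: `Factored`, `tryOnce`, and `factor` are hypothetical names, operands are plain strings, and the inner operation is assumed to distribute over the outer one from both sides (as mul does over add). It is not the tryFactorization code from the patch.

```cpp
#include <optional>
#include <string>

// Hypothetical, string-based model of factoring (A op' B) op (C op' D).
struct Factored {
  std::string common; // the shared operand
  std::string lhs;    // remaining operand from the left expression
  std::string rhs;    // remaining operand from the right expression
};

// One "try": succeeds only when the operands already line up.
//   A == C: (A op' B) op (A op' D) -> A op' (B op D)
//   B == D: (A op' B) op (C op' B) -> (A op C) op' B
std::optional<Factored> tryOnce(const std::string &a, const std::string &b,
                                const std::string &c, const std::string &d) {
  if (a == c) return Factored{a, b, d};
  if (b == d) return Factored{b, a, c};
  return std::nullopt;
}

// try(a, b, c, d); if op' is commutative, also try(a, b, d, c).
std::optional<Factored> factor(const std::string &a, const std::string &b,
                               const std::string &c, const std::string &d,
                               bool innerCommutative) {
  if (auto f = tryOnce(a, b, c, d)) return f;
  if (innerCommutative) return tryOnce(a, b, d, c);
  return std::nullopt;
}
```

For example, `factor("2", "x", "x", "7", true)` finds the common operand "x" only through the swapped second try, and `factor("z", "1", "z", "3", true)` models the identity case where a bare %z is treated as %z * 1.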

Thanks for review and comments. I have replied them inline.

lib/Transforms/InstCombine/InstructionCombining.cpp
398 ↗	(On Diff #10499)	updated.
415 ↗	(On Diff #10499)	updated.
418 ↗	(On Diff #10499)	updated.
442 ↗	(On Diff #10499)	updated.
481 ↗	(On Diff #10499)	No, we should not. I have updated the code to check whether left distribution already succeeded.
494 ↗	(On Diff #10499)	I will try to do that in the next patches.
508 ↗	(On Diff #10499)	updated.
515 ↗	(On Diff #10499)	We have already checked 'isa<OverflowingBinaryOperator>(SimplifiedInst)', so casting SimplifiedInst to BinaryOperator should be safe.
543 ↗	(On Diff #10499)	updated.
552 ↗	(On Diff #10499)	Updated the code. It was a last-minute copy-paste issue; it should be getIdentityValue(RHSOpcode, LHS). I will be more careful to avoid these mistakes.

Updated patch as per comments. Thanks for review.

Only one nit, otherwise LGTM.

lib/Transforms/InstCombine/InstructionCombining.cpp
515 ↗	(On Diff #10499)	http://llvm.org/docs/doxygen/html/classllvm_1_1OverflowingBinaryOperator.html OverflowingBinaryOperator is not a subclass of BinaryOperator. BinaryOperator is an Instruction, but OverflowingBinaryOperator can be either an Instruction or a ConstantExpr. I agree the naming is a little confusing :)

This revision is now accepted and ready to land. Jun 18 2014, 3:04 PM

Thanks for review. I have updated cast<BinaryOperator>(SimplifiedInst) to use dyn_cast.

Closed by commit rL211261 (authored by dinesh).

Diff 10173

lib/Transforms/InstCombine/InstCombineAddSub.cpp

@@ (865 lines hidden) @@
 }

 // dyn_castFoldableMul - If this value is a multiply that can be folded into
 // other computations (because it has a constant operand), return the
 // non-constant operand of the multiply, and set CST to point to the multiplier.
 // Otherwise, return null.
 //
 static inline Value *dyn_castFoldableMul(Value *V, Constant *&CST) {
-  if (!V->hasOneUse() || !V->getType()->isIntOrIntVectorTy())
+  if (!V->getType()->isIntOrIntVectorTy())
     return nullptr;

-  Instruction *I = dyn_cast<Instruction>(V);
-  if (!I) return nullptr;
+  if (V->hasOneUse()) {
+    if (Instruction *I = dyn_cast<Instruction>(V)) {

       if (I->getOpcode() == Instruction::Mul)
         if ((CST = dyn_cast<Constant>(I->getOperand(1))))
           return I->getOperand(0);
       if (I->getOpcode() == Instruction::Shl)
         if ((CST = dyn_cast<Constant>(I->getOperand(1)))) {
           // The multiplier is really 1 << CST.
           CST = ConstantExpr::getShl(ConstantInt::get(V->getType(), 1), CST);
           return I->getOperand(0);
         }
-  return nullptr;
+    }
+  }
+
+  CST = ConstantInt::get(V->getType(), 1);
+  return V;
 }

 // If one of the operands only has one non-zero bit, and if the other
 // operand has a known-zero bit in a more significant place than it (not
 // including the sign bit) the ripple may go up to and fill the zero, but
 // won't change the sign. For example, (X & ~4) + 1.
 static bool checkRippleForAdd(const APInt &Op0KnownZero,
                               const APInt &Op1KnownZero) {
@@ (171 lines hidden) @@ if (Value *LHSV = dyn_castNegVal(LHS)) {
     return BinaryOperator::CreateSub(RHS, LHSV);
   }

   // A + -B --> A - B
   if (!isa<Constant>(RHS))
     if (Value *V = dyn_castNegVal(RHS))
       return BinaryOperator::CreateSub(LHS, V);


   {
-    Constant *C2;
-    if (Value *X = dyn_castFoldableMul(LHS, C2)) {
-      if (X == RHS) // X*C + X --> X * (C+1)
-        return BinaryOperator::CreateMul(RHS, AddOne(C2));
-
-      // X*C1 + X*C2 --> X * (C1+C2)
-      Constant *C1;
-      if (X == dyn_castFoldableMul(RHS, C1))
-        return BinaryOperator::CreateMul(X, ConstantExpr::getAdd(C1, C2));
-    }
-
-    // X + X*C --> X * (C+1)
-    if (dyn_castFoldableMul(RHS, C2) == LHS)
-      return BinaryOperator::CreateMul(LHS, AddOne(C2));
+    Constant *C1, *C2;
+    // X * C1 + X * C2 --> X * (C1 + C2)
+    if (Value *X = dyn_castFoldableMul(LHS, C1))
+      if (Value *Y = dyn_castFoldableMul(RHS, C2)) {
+        if (X == Y) {
+          if (BinaryOperator *NewInst =
+              BinaryOperator::CreateMul(X, ConstantExpr::getAdd(C1, C2))) {
+
+            bool hasNSW = I.hasNoSignedWrap();
+            if (BinaryOperator *LHSI = dyn_cast<BinaryOperator>(LHS))
+              hasNSW &= LHSI->hasNoSignedWrap();
+            if (BinaryOperator *RHSI = dyn_cast<BinaryOperator>(RHS))
+              hasNSW &= RHSI->hasNoSignedWrap();
+
+            NewInst->setHasNoSignedWrap(hasNSW);
+            // TODO: Check for unsigned wrap
+            return NewInst;
+          }
+        }
+      }
   }

   // A+B --> A|B iff A and B have no bits set in common.
   if (IntegerType *IT = dyn_cast<IntegerType>(I.getType())) {
     APInt LHSKnownOne(IT->getBitWidth(), 0);
     APInt LHSKnownZero(IT->getBitWidth(), 0);
     computeKnownBits(LHS, LHSKnownZero, LHSKnownOne);
     if (LHSKnownZero != 0) {
@@ (465 lines hidden) @@ if (Op1->hasOneUse()) {
     // X - CI*A -> X + A*-CI
     if (match(Op1, m_Mul(m_Value(A), m_Constant(CI))) ||
         match(Op1, m_Mul(m_Constant(CI), m_Value(A)))) {
       Value *NewMul = Builder->CreateMul(A, ConstantExpr::getNeg(CI));
       return BinaryOperator::CreateAdd(Op0, NewMul);
     }
   }

-  Constant *C1;
-  if (Value *X = dyn_castFoldableMul(Op0, C1)) {
-    if (X == Op1) // X*C - X --> X * (C-1)
-      return BinaryOperator::CreateMul(Op1, SubOne(C1));
-
-    Constant *C2; // X*C1 - X*C2 -> X * (C1-C2)
-    if (X == dyn_castFoldableMul(Op1, C2))
+  Constant *C1, *C2;
+  // X * C1 - X * C2 --> X * (C1 - C2)
+  if (Value *X = dyn_castFoldableMul(Op0, C1))
+    if (Value *Y = dyn_castFoldableMul(Op1, C2))
+      if (X == Y)
         return BinaryOperator::CreateMul(X, ConstantExpr::getSub(C1, C2));
-  }

   // Optimize pointer differences into the same array into a size. Consider:
   // &A[10] - &A[0]: we should compile this to "10".
   if (DL) {
     Value *LHSOp, *RHSOp;
     if (match(Op0, m_PtrToInt(m_Value(LHSOp))) &&
         match(Op1, m_PtrToInt(m_Value(RHSOp))))
       if (Value *Res = OptimizePointerDifference(LHSOp, RHSOp, I.getType()))
@@ (56 lines hidden) @@

test/Transforms/InstCombine/add2.ll

@@ (80 lines hidden) @@ define i16 @test9(i16 %a) {
   %b = mul i16 %a, 2
   %c = mul i16 %a, 32767
   %d = add i16 %b, %c
   ret i16 %d
 ; CHECK-LABEL: @test9(
 ; CHECK-NEXT: %d = mul i16 %a, -32767
 ; CHECK-NEXT: ret i16 %d
 }
+
+define i16 @add_nsw_mul_nsw(i16 %x) {
+  %add1 = add nsw i16 %x, %x
+  %add2 = add nsw i16 %add1, %x
+  ret i16 %add2
+; CHECK-LABEL: @add_nsw_mul_nsw(
+; CHECK-NEXT: %add2 = mul nsw i16 %x, 3
+; CHECK-NEXT: ret i16 %add2
+}
+
+define i16 @mul_add_to_mul_1(i16 %x) {
+  %mul1 = mul nsw i16 %x, 8
+  %add2 = add nsw i16 %x, %mul1
+  ret i16 %add2
+; CHECK-LABEL: @mul_add_to_mul_1(
+; CHECK-NEXT: %add2 = mul nsw i16 %x, 9
+; CHECK-NEXT: ret i16 %add2
+}
+
+define i16 @mul_add_to_mul_2(i16 %x) {
+  %mul1 = mul nsw i16 %x, 8
+  %add2 = add nsw i16 %mul1, %x
+  ret i16 %add2
+; CHECK-LABEL: @mul_add_to_mul_2(
+; CHECK-NEXT: %add2 = mul nsw i16 %x, 9
+; CHECK-NEXT: ret i16 %add2
+}
+
+define i16 @mul_add_to_mul_3(i16 %a) {
+  %mul1 = mul i16 %a, 2
+  %mul2 = mul i16 %a, 3
+  %add = add nsw i16 %mul1, %mul2
+  ret i16 %add
+; CHECK-LABEL: @mul_add_to_mul_3(
+; CHECK-NEXT: %add = mul i16 %a, 5
+; CHECK-NEXT: ret i16 %add
+}
+
+define i16 @mul_add_to_mul_4(i16 %a) {
+  %mul1 = mul nsw i16 %a, 2
+  %mul2 = mul nsw i16 %a, 7
+  %add = add nsw i16 %mul1, %mul2
+  ret i16 %add
+; CHECK-LABEL: @mul_add_to_mul_4(
+; CHECK-NEXT: %add = mul nsw i16 %a, 9
+; CHECK-NEXT: ret i16 %add
+}
+
+; TODO: 'add nsw' in the test should get transformed in to mul nsw
+define i16 @mul_add_to_mul_5(i16 %a) {
+  %mul1 = mul nsw i16 %a, 3
+  %mul2 = mul nsw i16 %a, 7
+  %add = add nsw i16 %mul1, %mul2
+  ret i16 %add
+; CHECK-LABEL: @mul_add_to_mul_5(
+; CHECK-NEXT: %add = mul i16 %a, 10
+; CHECK-NEXT: ret i16 %add
+}
\ No newline at end of file

This is an archive of the discontinued LLVM Phabricator instance.

Fixing inst-combine not to drops nsw when combining adds into mul (PR19263)
ClosedPublic
