This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineAddSub.cpp
-
InstructionCombining.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
add2.ll

Differential D3799

Fixing inst-combine not to drops nsw when combining adds into mul (PR19263)
ClosedPublic

Authored by dinesh.d on May 16 2014, 7:34 AM.

Download Raw Diff

Details

Reviewers

majnemer
• rafael
bkramer
jingyue

Commits

rGb62e52e1b52f: Refactored and updated SimplifyUsingDistributiveLaws() to * Find factorization…
rL211261: Refactored and updated SimplifyUsingDistributiveLaws() to

Summary

I tried to fix this but I am not sure about few things.

We always convert (X * C1) + (X * C2) to X *(C1 + C2) even if (C1 + C2)

overflows. e.g.

%1 = mul nsw i32 %x, 2147483647
%2 = mul nsw i32 %x, 3
%3 = add nsw i32 %1, %2

becomes

%1 = mul i32 %x, -2147483646

Is this what we want. In that case, we can blindly copy nsw from
add instruction to mul instruction. If so, I will update this patch.

In this patch, I have tried not to combine add to mul for above case

but that is not enough to stop this conversion (getting converted from
some other code). If above conversion is wrong, I can go through code
and fix this. For now, this patch should take care of not dropping nsw
flags.

I haven't added similar checks in visitSub. Actually, all c examples,

I can come up with, was not creating sub with nsw. If is it required to
add similar check, I can update this patch.

Diff Detail

Repository: rL LLVM

Event Timeline

dinesh.d updated this revision to Diff 9481.May 16 2014, 7:34 AM

dinesh.d retitled this revision from to Fixing inst-combine not to drops nsw when combining adds into mul (PR19263).

dinesh.d updated this object.

dinesh.d edited the test plan for this revision. (Show Details)

dinesh.d added reviewers: bkramer, • rafael, majnemer.

dinesh.d added a subscriber: Unknown Object (MLST).

gentle ping

+ if (ConstantInt *CI1 = dyn_cast<ConstantInt>(LHS))
+ if (ConstantInt *CI2 = dyn_cast<ConstantInt>(RHS)) {
+ APInt ACI1 = CI1->getValue();
+ APInt ACI2 = CI2->getValue();
+ bool IsLHSNegative = ACI1.isNegative();
+ bool IsRHSNegative = ACI2.isNegative();

If both are constants, the result should have been folded before
getting here, no?

Can you rebase on trunk? WillNotOverflowSignedAdd has learned a few
extra tricks.

rebased code on trunk

+ if (ConstantInt *CI1 = dyn_cast<ConstantInt>(LHS))
+ if (ConstantInt *CI2 = dyn_cast<ConstantInt>(RHS)) {
+ APInt ACI1 = CI1->getValue();
+ APInt ACI2 = CI2->getValue();
+ bool IsLHSNegative = ACI1.isNegative();
+ bool IsRHSNegative = ACI2.isNegative();

If both are constants, the result should have been folded before
getting here, no?

Here both constants are from different expression, one from LHS and other
one is from RHS.

Can you rebase on trunk? WillNotOverflowSignedAdd has learned a few
extra tricks.

rebased.

Here both constants are from different expression, one from LHS and other
one is from RHS.

All tests still pass if I delete the extra logic from
WillNotOverflowSignedAdd. Can you add an extra test of split this off
to an independent patch?

Cheers,
Rafael

All tests still pass if I delete the extra logic from
WillNotOverflowSignedAdd. Can you add an extra test of split this off
to an independent patch?

More importantly, it looks like this misoptimizes a few cases. For example,

define i16 @f(i16 %a) {

%b = mul i16 %a, 2
%c = mul i16 %a, 3
%d = add nsw i16 %b, %c
ret i16 %d

}

this would optimize it into

define i16 @f(i16 %a) {

%d = mul nsw i16 %a, 5
ret i16 %d

}

But I think that is invalid. There are inputs where the original add
would not overflow the add but the mul will. For example:

a = 0x2aab
b = 0x5556
c = 0x8001 the computation of c has an overflow, but that is fine
since the mul is not nws
d = 0xd557 no overflow in the add

Cheers,
Rafael

Yes, now WillNotOverflowSignedAdd can handle cases where
one of LHS and RHS is power of 2 and other has known zero
after high bit in first one. I will update test accordingly or if you
think that should go as independent patch I can surely do that.

For you other comment, I think we can not have nsw in addition
of 2 variables even if any one of them is not known to have nsw.
So I think nsw in add is wrong. But I may be wrong.

How about this input
a = 0x2aaa
b = 0x5554
c = 0x7ffe
d = 0xd552

b and c are ok here as mul instructions does not guaranty either
(overflow or no overflow) but add instruction guaranties no sign
overflow and d is overflowing.

updated patch as per comments

There is an interesting organizational problem:

Given

define i16 @f(i16 %a) {

%b = mul nsw i16 %a, 3
%c = mul nsw i16 %a, 7
%d = add nsw i16 %b, %c
ret i16 %d

}

With your patch we produce

define i16 @f(i16 %a) {

%d = mul i16 %a, 10
ret i16 %d

}

but given

define i16 @f(i16 %a) {

%b = mul nsw i16 %a, 2
%c = mul nsw i16 %a, 7
%d = add nsw i16 %b, %c
ret i16 %d

}

it produces

define i16 @f(i16 %a) {

%d = mul nsw i16 %a, 9
ret i16 %d

}

The problem comes from some cases being transformed in
SimplifyUsingDistributiveLaws before getting to your code.

Maybe what should happen is that as an early cleanup patch
dyn_castFoldableMul should be moved and used by
SimplifyUsingDistributiveLaws which would now handle all relevant
cases.

What do you think?

Yet another thing to be careful about. The proposed patch would have
disable the combining in

define i16 @f(i16 %a) {

%b = mul i16 %a, 2
%c = mul i16 %a, 32767
%d = add i16 %b, %c
ret i16 %d

}

I added the testcase in r210287.

I agree with the idea that these cases should get handled in SimplifyUsingDistributiveLaws and
I am working on generalized approach for that.

I have updated patch to just put following changes in record.
I have removed check for overflow because after analyzing few cases, I think we should not
disable transformation to pattern like this even if (2 + 32767)

%b = mul i16 %a, 2
%c = mul i16 %a, 32767
%d = add i16 %b, %c
ret i16 %d

as original case and transformed case will result in same output. When I pointed out case similar to
this, I thought that for some value of 'a', '(b+c)' might be positive but if we add constant which will become
negative due to overflow, result will mismatch.
I realized that as 'a' in integer, if a == 0, we will get 0 in any case and any value of a > 0, b and c will never
be less than 2 and 32767 respectively. so add result will always overflow.

What do you think? Should be disable above transform.

I have added a TODO to fix transform happening in SimplifyUsingDistributiveLaws and dropping nsw.
I will update it soon so that all these pattern gets handle in SimplifyUsingDistributiveLaws.

moved dyn_castFoldableMul logic to InstructionCombining.cpp.
Now SimplifyUsingDistributiveLaws handles all relevant cases.

gentle ping

Looks a lot better, thanks.

lib/Transforms/InstCombine/InstructionCombining.cpp
400 ↗	(On Diff #10270)	Please name functions starting with a lowercase letter. This function can be static.
418 ↗	(On Diff #10270)	lowercase, static. It seems a bit strange to return one operand and put the other one in an out value. Also, this function doesn't do the factorization, it just splits the expression. Also, the above function for handling 1 could be merged into this (and with that avoid the extra indentation), no? Something like Try to split Op0 and Op1 into (A op B) and (C op D) with the same operation in both. This handles cases like matching a shift as a mul and B or D being // implicitly 1. static bool matchForFactorization(const BinaryOperator Op0, const BinaryOperator Op1, Instruction::BinaryOps &OpCode, Value &A, Value &B, Value &C, Value &D)

Hi Dinesh,

It looks pretty good overall. Thanks for working on this! Please find my comments inlined.

lib/Transforms/InstCombine/InstructionCombining.cpp
445 ↗	(On Diff #10270)	I'd move these definitions closer to their uses. e.g., Value B = nullptr, D = nullptr; Value *A = FactorBinaryOps(Op0, B, LHSOpcode);
452 ↗	(On Diff #10270)	Correct me if I'm wrong. I think the current logic seems unable to convert %z = mul nsw i32 %x, %y %z3 = mul nsw i32 %z, 3 %z4 = add nsw i32 %z, %z3 to %z = mul nsw i32 %x, %y %z4 = mul nsw i32 %z, 4 because if %z is in the format of %x * %y, you will not try the identity factorization (i.e., %z * 1). If it is indeed a missed opportunity, it may not be too difficult to cover. For example, we can try all four combinations: LHS (binary op), RHS (binary op) LHS (identity), RHS (binary op) LHS (binary op), RHS (identity) LHS (identity), RHS (identity) // probably already handled somewhere else? Anyway, we can do this later.
466 ↗	(On Diff #10270)	I feel the logic of this IF and next IF (if (RightDistributesOverLeft) a bit difficult to follow. There is also some copy and paste. What about extracting the check on InnerCommutative out? Something like: try(a, b, c, d); if (op' is commutative) { try(b, a, c, d); try(a, b, d, c); try(b, a, c, d); } By doing this, inside each "try", we don't need to think about whether op' is commutative. What do you think?

Update patch as per review comments

Thanks for review. I ahve added comments inline.

lib/Transforms/InstCombine/InstructionCombining.cpp
400 ↗	(On Diff #10270)	updated. I have kept this funtion so it can be used to get identity values for other operations e.g. and, or etc. as well.
418 ↗	(On Diff #10270)	Update function name and added static. I have to keep both function separate to try to explore optimization, jingyue has mentioned in his comment. I have changed function parameter and return value. Let me know if it looks ok.
445 ↗	(On Diff #10270)	updated.
452 ↗	(On Diff #10270)	updated. New patch takes care of all except LHS (identity), RHS (identity). it is getting handled before SimplifyUsingDistributiveLaws called.
466 ↗	(On Diff #10270)	Updated patch to handle this. We can not just depend on isCommutative as there are oparation which are not commutative but still can be distributed e.g. "(X + Y) / Z = X/Z + Y/Z" We do not handle this now but we might want to do it in future.

jingyue added inline comments.Jun 17 2014, 10:31 AM

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	I think the first "try" in my pseudo-code handle this case. Right? While "try" doesn't swap a and b or c and d, it considers both LeftDistributesOverRight and RightDistributesOverLeft. Therefore, it is able to convert "a / b + c / d" to "(a + c) / d".

dinesh.d added inline comments.Jun 17 2014, 1:41 PM

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	Yes, if we considers both LeftDistributesOverRight and RightDistributesOverLeft, it handles all cases. But then I think current code is handling all cases more efficiently. If left distributes over right and inner op is commutative, then right will also distribute over left too and vice versa. So Checking following 2 combinations if left distributes over right try(a, b, c, d); if (op' is commutative) { try(a, b, d, c); } and following 2 combination if right distributes ove left should be sufficient. try(a, b, c, d); if (op' is commutative) { try(a, b, d, c); } Because for inner commutative operation if try(a, b, c, d) fails for LeftDistributesOverRight then try(b, a, d, c) will fail for RightDistributesOverLeft if try(a, b, c, d) fails for RightDistributesOverLeft then try(b, a, d, c) will fail for LeftDistributesOverRight similarly if try(b, a, c, d) fails for LeftDistributesOverRight then try(a, b, d, c) will fail for RightDistributesOverLeft if try(a, b, c, d) fails for RightDistributesOverLeft then try(a, b, d, c) will fail for LeftDistributesOverRight Am I missing something?

typo :(

lib/Transforms/InstCombine/InstructionCombining.cpp
466 ↗	(On Diff #10270)	Please read last line in previous reply as if try(b, a, c, d) fails for RightDistributesOverLeft then try(a, b, d, c) will fail for LeftDistributesOverRight

Still reading the code, just wanted to post the easy bits first.

lib/Transforms/InstCombine/InstructionCombining.cpp
398 ↗	(On Diff #10499)	Nit: don't duplicate the function name in the comment.
418 ↗	(On Diff #10499)	same
442 ↗	(On Diff #10499)	here too.

Mostly nits. This is LGTM with the issues on line 552 addressed, but please wait for Jingyue Wu to confirm that he is OK with it too.

Thanks!

lib/Transforms/InstCombine/InstructionCombining.cpp
415 ↗	(On Diff #10499)	You have both a default case with a return and a return after the switch. Given that we only handle Mul, it is probably better to write this as an if for now. When we add support for other opcodes it is easy to go back to an switch.
552 ↗	(On Diff #10499)	This should be if (Value *V = tryFactorization(Builder, DL, I, RHSOpcode, LHS, getIdentityValue(LHSOpcode, LHS), C, D)) no?

jingyue added a reviewer: jingyue.Jun 17 2014, 10:16 PM

jingyue added inline comments.

lib/Transforms/InstCombine/InstructionCombining.cpp
481 ↗	(On Diff #10499)	Do we still need to try right distribution if left distribution already succeeds?
494 ↗	(On Diff #10499)	A question: any chance to preserve NSW/NUW for TopLevelOpcode?
508 ↗	(On Diff #10499)	I think it's better to write: if (BinaryOperator *Op0 = dyn_cast<BinaryOperator>(LHS)) { if (isa<OverflowingBinaryOperator>(Op0)) { HasNSW &= Op0->hasNoSignedWrap(); }
515 ↗	(On Diff #10499)	I may be over-concerned, but is this cast dangerous? CreateBinOp returns a ConstantExpr if its operands are both Constant. Note that ConstantInt op ConstantInt is not always foldable, e.g., ptrtoint (a global variable) + 5.
543 ↗	(On Diff #10499)	`(A op' B) op (C)`?
466 ↗	(On Diff #10270)	Good point. I agree try(a, b, c, d); if (op' is commutative) { try(a, b, d, c); } is enough.

Thanks for review and comments. I have replied them inline.

lib/Transforms/InstCombine/InstructionCombining.cpp
398 ↗	(On Diff #10499)	updated.
415 ↗	(On Diff #10499)	updated.
418 ↗	(On Diff #10499)	updated.
442 ↗	(On Diff #10499)	updated.
481 ↗	(On Diff #10499)	No. We should not. I have updated code to check if left distribution already succeeds
494 ↗	(On Diff #10499)	I will try to do that in next patches.
508 ↗	(On Diff #10499)	updated.
515 ↗	(On Diff #10499)	We have already check if 'isa<OverflowingBinaryOperator>(SimplifiedInst)' so casting SimplifiedInst to BinaryOperator should be safe.
543 ↗	(On Diff #10499)	updated.
552 ↗	(On Diff #10499)	updated code. last minute copy paste issue. it should be getIdentityValue(RHSOpcode, LHS) I will be more carefull to avoid these mistakes.

Updated patch as per comments. Thanks for review.

Only one nit, otherwise LGTM.

lib/Transforms/InstCombine/InstructionCombining.cpp
515 ↗	(On Diff #10499)	http://llvm.org/docs/doxygen/html/classllvm_1_1OverflowingBinaryOperator.html OverflowingBinaryOperator is not a subclass of BinaryOperator. BinaryOperator is an Instruction, but OverflowingBinaryOperator can be either an Instruction or a ConstantExpr. I agree the naming is a little confusing :)

This revision is now accepted and ready to land.Jun 18 2014, 3:04 PM

Thanks for review. I have updated cast<BinaryOperator>(SimplifiedInst) to use dyn_cast.

Closed by commit rL211261 (authored by dinesh).

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

InstCombine/

InstCombineAddSub.cpp

52 lines

InstructionCombining.cpp

199 lines

test/

Transforms/

InstCombine/

add2.ll

68 lines

Diff 10616

llvm/trunk/lib/Transforms/InstCombine/InstCombineAddSub.cpp

Show First 20 Lines • Show All 859 Lines • ▼ Show 20 Lines	if (Coeff.isTwo() \|\| Coeff.isMinusTwo()) {
NeedNeg = Coeff.isMinusTwo();		NeedNeg = Coeff.isMinusTwo();
return createFAdd(OpndVal, OpndVal);		return createFAdd(OpndVal, OpndVal);
}		}

NeedNeg = false;		NeedNeg = false;
return createFMul(OpndVal, Coeff.getValue(Instr->getType()));		return createFMul(OpndVal, Coeff.getValue(Instr->getType()));
}		}

// dyn_castFoldableMul - If this value is a multiply that can be folded into
// other computations (because it has a constant operand), return the
// non-constant operand of the multiply, and set CST to point to the multiplier.
// Otherwise, return null.
//
static inline Value dyn_castFoldableMul(Value V, Constant *&CST) {
if (!V->hasOneUse() \|\| !V->getType()->isIntOrIntVectorTy())
return nullptr;

Instruction *I = dyn_cast<Instruction>(V);
if (!I) return nullptr;

if (I->getOpcode() == Instruction::Mul)
if ((CST = dyn_cast<Constant>(I->getOperand(1))))
return I->getOperand(0);
if (I->getOpcode() == Instruction::Shl)
if ((CST = dyn_cast<Constant>(I->getOperand(1)))) {
// The multiplier is really 1 << CST.
CST = ConstantExpr::getShl(ConstantInt::get(V->getType(), 1), CST);
return I->getOperand(0);
}
return nullptr;
}

// If one of the operands only has one non-zero bit, and if the other		// If one of the operands only has one non-zero bit, and if the other
// operand has a known-zero bit in a more significant place than it (not		// operand has a known-zero bit in a more significant place than it (not
// including the sign bit) the ripple may go up to and fill the zero, but		// including the sign bit) the ripple may go up to and fill the zero, but
// won't change the sign. For example, (X & ~4) + 1.		// won't change the sign. For example, (X & ~4) + 1.
static bool checkRippleForAdd(const APInt &Op0KnownZero,		static bool checkRippleForAdd(const APInt &Op0KnownZero,
const APInt &Op1KnownZero) {		const APInt &Op1KnownZero) {
APInt Op1MaybeOne = ~Op1KnownZero;		APInt Op1MaybeOne = ~Op1KnownZero;
// Make sure that one of the operand has at most one bit set to 1.		// Make sure that one of the operand has at most one bit set to 1.
▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines	if (Value *LHSV = dyn_castNegVal(LHS)) {
return BinaryOperator::CreateSub(RHS, LHSV);		return BinaryOperator::CreateSub(RHS, LHSV);
}		}

// A + -B --> A - B		// A + -B --> A - B
if (!isa<Constant>(RHS))		if (!isa<Constant>(RHS))
if (Value *V = dyn_castNegVal(RHS))		if (Value *V = dyn_castNegVal(RHS))
return BinaryOperator::CreateSub(LHS, V);		return BinaryOperator::CreateSub(LHS, V);


{
Constant *C2;
if (Value *X = dyn_castFoldableMul(LHS, C2)) {
if (X == RHS) // XC + X --> X (C+1)
return BinaryOperator::CreateMul(RHS, AddOne(C2));

// XC1 + XC2 --> X * (C1+C2)
Constant *C1;
if (X == dyn_castFoldableMul(RHS, C1))
return BinaryOperator::CreateMul(X, ConstantExpr::getAdd(C1, C2));
}

// X + XC --> X (C+1)
if (dyn_castFoldableMul(RHS, C2) == LHS)
return BinaryOperator::CreateMul(LHS, AddOne(C2));
}

// A+B --> A\|B iff A and B have no bits set in common.		// A+B --> A\|B iff A and B have no bits set in common.
if (IntegerType *IT = dyn_cast<IntegerType>(I.getType())) {		if (IntegerType *IT = dyn_cast<IntegerType>(I.getType())) {
APInt LHSKnownOne(IT->getBitWidth(), 0);		APInt LHSKnownOne(IT->getBitWidth(), 0);
APInt LHSKnownZero(IT->getBitWidth(), 0);		APInt LHSKnownZero(IT->getBitWidth(), 0);
computeKnownBits(LHS, LHSKnownZero, LHSKnownOne);		computeKnownBits(LHS, LHSKnownZero, LHSKnownOne);
if (LHSKnownZero != 0) {		if (LHSKnownZero != 0) {
APInt RHSKnownOne(IT->getBitWidth(), 0);		APInt RHSKnownOne(IT->getBitWidth(), 0);
APInt RHSKnownZero(IT->getBitWidth(), 0);		APInt RHSKnownZero(IT->getBitWidth(), 0);
▲ Show 20 Lines • Show All 470 Lines • ▼ Show 20 Lines	if (Op1->hasOneUse()) {
// X - CIA -> X + A-CI		// X - CIA -> X + A-CI
if (match(Op1, m_Mul(m_Value(A), m_Constant(CI))) \|\|		if (match(Op1, m_Mul(m_Value(A), m_Constant(CI))) \|\|
match(Op1, m_Mul(m_Constant(CI), m_Value(A)))) {		match(Op1, m_Mul(m_Constant(CI), m_Value(A)))) {
Value *NewMul = Builder->CreateMul(A, ConstantExpr::getNeg(CI));		Value *NewMul = Builder->CreateMul(A, ConstantExpr::getNeg(CI));
return BinaryOperator::CreateAdd(Op0, NewMul);		return BinaryOperator::CreateAdd(Op0, NewMul);
}		}
}		}

Constant *C1;
if (Value *X = dyn_castFoldableMul(Op0, C1)) {
if (X == Op1) // XC - X --> X (C-1)
return BinaryOperator::CreateMul(Op1, SubOne(C1));

Constant C2; // XC1 - XC2 -> X (C1-C2)
if (X == dyn_castFoldableMul(Op1, C2))
return BinaryOperator::CreateMul(X, ConstantExpr::getSub(C1, C2));
}

// Optimize pointer differences into the same array into a size. Consider:		// Optimize pointer differences into the same array into a size. Consider:
// &A[10] - &A[0]: we should compile this to "10".		// &A[10] - &A[0]: we should compile this to "10".
if (DL) {		if (DL) {
Value LHSOp, RHSOp;		Value LHSOp, RHSOp;
if (match(Op0, m_PtrToInt(m_Value(LHSOp))) &&		if (match(Op0, m_PtrToInt(m_Value(LHSOp))) &&
match(Op1, m_PtrToInt(m_Value(RHSOp))))		match(Op1, m_PtrToInt(m_Value(RHSOp))))
if (Value *Res = OptimizePointerDifference(LHSOp, RHSOp, I.getType()))		if (Value *Res = OptimizePointerDifference(LHSOp, RHSOp, I.getType()))
return ReplaceInstUsesWith(I, Res);		return ReplaceInstUsesWith(I, Res);
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/trunk/lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 389 Lines • ▼ Show 20 Lines	static bool RightDistributesOverLeft(Instruction::BinaryOps LOp,
if (Instruction::isCommutative(ROp))		if (Instruction::isCommutative(ROp))
return LeftDistributesOverRight(ROp, LOp);		return LeftDistributesOverRight(ROp, LOp);
// TODO: It would be nice to handle division, aka "(X + Y)/Z = X/Z + Y/Z",		// TODO: It would be nice to handle division, aka "(X + Y)/Z = X/Z + Y/Z",
// but this requires knowing that the addition does not overflow and other		// but this requires knowing that the addition does not overflow and other
// such subtleties.		// such subtleties.
return false;		return false;
}		}

/// SimplifyUsingDistributiveLaws - This tries to simplify binary operations		/// This function returns identity value for given opcode, which can be used to
/// which some other binary operation distributes over either by factorizing		/// factor patterns like (X * 2) + X ==> (X * 2) + (X * 1) ==> X * (2 + 1).
/// out common terms (eg "(AB)+(AC)" -> "A*(B+C)") or expanding out if this		static Value getIdentityValue(Instruction::BinaryOps OpCode, Value V) {
/// results in simplifications (eg: "A & (B \| C) -> (A&B) \| (A&C)" if this is		if (isa<Constant>(V))
/// a win). Returns the simplified value, or null if it didn't simplify.		return nullptr;
Value *InstCombiner::SimplifyUsingDistributiveLaws(BinaryOperator &I) {
Value LHS = I.getOperand(0), RHS = I.getOperand(1);
BinaryOperator *Op0 = dyn_cast<BinaryOperator>(LHS);
BinaryOperator *Op1 = dyn_cast<BinaryOperator>(RHS);
Instruction::BinaryOps TopLevelOpcode = I.getOpcode(); // op

// Factorization.		if (OpCode == Instruction::Mul)
if (Op0 && Op1 && Op0->getOpcode() == Op1->getOpcode()) {		return ConstantInt::get(V->getType(), 1);
// The instruction has the form "(A op' B) op (C op' D)". Try to factorize
// a common term.		// TODO: We can handle other cases e.g. Instruction::And, Instruction::Or etc.
Value A = Op0->getOperand(0), B = Op0->getOperand(1);
Value C = Op1->getOperand(0), D = Op1->getOperand(1);		return nullptr;
Instruction::BinaryOps InnerOpcode = Op0->getOpcode(); // op'		}

		/// This function factors binary ops which can be combined using distributive
		/// laws. This also factor SHL as MUL e.g. SHL(X, 2) ==> MUL(X, 4).
		Instruction::BinaryOps getBinOpsForFactorization(BinaryOperator *Op,
		Value &LHS, Value &RHS) {
		if (!Op)
		return Instruction::BinaryOpsEnd;

		if (Op->getOpcode() == Instruction::Shl) {
		if (Constant *CST = dyn_cast<Constant>(Op->getOperand(1))) {
		// The multiplier is really 1 << CST.
		RHS = ConstantExpr::getShl(ConstantInt::get(Op->getType(), 1), CST);
		LHS = Op->getOperand(0);
		return Instruction::Mul;
		}
		}

		// TODO: We can add other conversions e.g. shr => div etc.

		LHS = Op->getOperand(0);
		RHS = Op->getOperand(1);
		return Op->getOpcode();
		}

		/// This tries to simplify binary operations by factorizing out common terms
		/// (e. g. "(AB)+(AC)" -> "A*(B+C)").
		static Value tryFactorization(InstCombiner::BuilderTy Builder,
		const DataLayout *DL, BinaryOperator &I,
		Instruction::BinaryOps InnerOpcode, Value *A,
		Value B, Value C, Value *D) {

		// If any of A, B, C, D are null, we can not factor I, return early.
		// Checking A and C should be enough.
		if (!A \|\| !C \|\| !B \|\| !D)
		return nullptr;

		Value *SimplifiedInst = nullptr;
		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
		Instruction::BinaryOps TopLevelOpcode = I.getOpcode();

// Does "X op' Y" always equal "Y op' X"?		// Does "X op' Y" always equal "Y op' X"?
bool InnerCommutative = Instruction::isCommutative(InnerOpcode);		bool InnerCommutative = Instruction::isCommutative(InnerOpcode);

// Does "X op' (Y op Z)" always equal "(X op' Y) op (X op' Z)"?		// Does "X op' (Y op Z)" always equal "(X op' Y) op (X op' Z)"?
if (LeftDistributesOverRight(InnerOpcode, TopLevelOpcode))		if (LeftDistributesOverRight(InnerOpcode, TopLevelOpcode))
// Does the instruction have the form "(A op' B) op (A op' D)" or, in the		// Does the instruction have the form "(A op' B) op (A op' D)" or, in the
// commutative case, "(A op' B) op (C op' A)"?		// commutative case, "(A op' B) op (C op' A)"?
if (A == C \|\| (InnerCommutative && A == D)) {		if (A == C \|\| (InnerCommutative && A == D)) {
if (A != C)		if (A != C)
std::swap(C, D);		std::swap(C, D);
// Consider forming "A op' (B op D)".		// Consider forming "A op' (B op D)".
// If "B op D" simplifies then it can be formed with no cost.		// If "B op D" simplifies then it can be formed with no cost.
Value *V = SimplifyBinOp(TopLevelOpcode, B, D, DL);		Value *V = SimplifyBinOp(TopLevelOpcode, B, D, DL);
// If "B op D" doesn't simplify then only go on if both of the existing		// If "B op D" doesn't simplify then only go on if both of the existing
// operations "A op' B" and "C op' D" will be zapped as no longer used.		// operations "A op' B" and "C op' D" will be zapped as no longer used.
if (!V && Op0->hasOneUse() && Op1->hasOneUse())		if (!V && LHS->hasOneUse() && RHS->hasOneUse())
V = Builder->CreateBinOp(TopLevelOpcode, B, D, Op1->getName());		V = Builder->CreateBinOp(TopLevelOpcode, B, D, RHS->getName());
if (V) {		if (V) {
++NumFactor;		SimplifiedInst = Builder->CreateBinOp(InnerOpcode, A, V);
V = Builder->CreateBinOp(InnerOpcode, A, V);
V->takeName(&I);
return V;
}		}
}		}

// Does "(X op Y) op' Z" always equal "(X op' Z) op (Y op' Z)"?		// Does "(X op Y) op' Z" always equal "(X op' Z) op (Y op' Z)"?
if (RightDistributesOverLeft(TopLevelOpcode, InnerOpcode))		if (!SimplifiedInst && RightDistributesOverLeft(TopLevelOpcode, InnerOpcode))
// Does the instruction have the form "(A op' B) op (C op' B)" or, in the		// Does the instruction have the form "(A op' B) op (C op' B)" or, in the
// commutative case, "(A op' B) op (B op' D)"?		// commutative case, "(A op' B) op (B op' D)"?
if (B == D \|\| (InnerCommutative && B == C)) {		if (B == D \|\| (InnerCommutative && B == C)) {
if (B != D)		if (B != D)
std::swap(C, D);		std::swap(C, D);
// Consider forming "(A op C) op' B".		// Consider forming "(A op C) op' B".
// If "A op C" simplifies then it can be formed with no cost.		// If "A op C" simplifies then it can be formed with no cost.
Value *V = SimplifyBinOp(TopLevelOpcode, A, C, DL);		Value *V = SimplifyBinOp(TopLevelOpcode, A, C, DL);

// If "A op C" doesn't simplify then only go on if both of the existing		// If "A op C" doesn't simplify then only go on if both of the existing
// operations "A op' B" and "C op' D" will be zapped as no longer used.		// operations "A op' B" and "C op' D" will be zapped as no longer used.
if (!V && Op0->hasOneUse() && Op1->hasOneUse())		if (!V && LHS->hasOneUse() && RHS->hasOneUse())
V = Builder->CreateBinOp(TopLevelOpcode, A, C, Op0->getName());		V = Builder->CreateBinOp(TopLevelOpcode, A, C, LHS->getName());
if (V) {		if (V) {
		SimplifiedInst = Builder->CreateBinOp(InnerOpcode, V, B);
		}
		}

		if (SimplifiedInst) {
++NumFactor;		++NumFactor;
V = Builder->CreateBinOp(InnerOpcode, V, B);		SimplifiedInst->takeName(&I);
V->takeName(&I);
return V;		// Check if we can add NSW flag to SimplifiedInst. If so, set NSW flag.
		// TODO: Check for NUW.
		if (BinaryOperator *BO = dyn_cast<BinaryOperator>(SimplifiedInst)) {
		if (isa<OverflowingBinaryOperator>(SimplifiedInst)) {
		bool HasNSW = false;
		if (isa<OverflowingBinaryOperator>(&I))
		HasNSW = I.hasNoSignedWrap();

		if (BinaryOperator *Op0 = dyn_cast<BinaryOperator>(LHS))
		if (isa<OverflowingBinaryOperator>(Op0))
		HasNSW &= Op0->hasNoSignedWrap();

		if (BinaryOperator *Op1 = dyn_cast<BinaryOperator>(RHS))
		if (isa<OverflowingBinaryOperator>(Op1))
		HasNSW &= Op1->hasNoSignedWrap();
		BO->setHasNoSignedWrap(HasNSW);
		}
}		}
}		}
		return SimplifiedInst;
}		}

		/// SimplifyUsingDistributiveLaws - This tries to simplify binary operations
		/// which some other binary operation distributes over either by factorizing
		/// out common terms (eg "(AB)+(AC)" -> "A*(B+C)") or expanding out if this
		/// results in simplifications (eg: "A & (B \| C) -> (A&B) \| (A&C)" if this is
		/// a win). Returns the simplified value, or null if it didn't simplify.
		Value *InstCombiner::SimplifyUsingDistributiveLaws(BinaryOperator &I) {
		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
		BinaryOperator *Op0 = dyn_cast<BinaryOperator>(LHS);
		BinaryOperator *Op1 = dyn_cast<BinaryOperator>(RHS);

		// Factorization.
		Value A = nullptr, B = nullptr, C = nullptr, D = nullptr;
		Instruction::BinaryOps LHSOpcode = getBinOpsForFactorization(Op0, A, B);
		Instruction::BinaryOps RHSOpcode = getBinOpsForFactorization(Op1, C, D);

		// The instruction has the form "(A op' B) op (C op' D)". Try to factorize
		// a common term.
		if (LHSOpcode == RHSOpcode) {
		if (Value *V = tryFactorization(Builder, DL, I, LHSOpcode, A, B, C, D))
		return V;
		}

		// The instruction has the form "(A op' B) op (C)". Try to factorize common
		// term.
		if (Value *V = tryFactorization(Builder, DL, I, LHSOpcode, A, B, RHS,
		getIdentityValue(LHSOpcode, RHS)))
		return V;

		// The instruction has the form "(B) op (C op' D)". Try to factorize common
		// term.
		if (Value *V = tryFactorization(Builder, DL, I, RHSOpcode, LHS,
		getIdentityValue(RHSOpcode, LHS), C, D))
		return V;

// Expansion.		// Expansion.
		Instruction::BinaryOps TopLevelOpcode = I.getOpcode();
if (Op0 && RightDistributesOverLeft(Op0->getOpcode(), TopLevelOpcode)) {		if (Op0 && RightDistributesOverLeft(Op0->getOpcode(), TopLevelOpcode)) {
// The instruction has the form "(A op' B) op C". See if expanding it out		// The instruction has the form "(A op' B) op C". See if expanding it out
// to "(A op C) op' (B op C)" results in simplifications.		// to "(A op C) op' (B op C)" results in simplifications.
Value A = Op0->getOperand(0), B = Op0->getOperand(1), *C = RHS;		Value A = Op0->getOperand(0), B = Op0->getOperand(1), *C = RHS;
Instruction::BinaryOps InnerOpcode = Op0->getOpcode(); // op'		Instruction::BinaryOps InnerOpcode = Op0->getOpcode(); // op'

// Do "A op C" and "B op C" both simplify?		// Do "A op C" and "B op C" both simplify?
if (Value *L = SimplifyBinOp(TopLevelOpcode, A, C, DL))		if (Value *L = SimplifyBinOp(TopLevelOpcode, A, C, DL))
▲ Show 20 Lines • Show All 2,284 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/add2.ll

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	define i16 @test9(i16 %a) {
%b = mul i16 %a, 2		%b = mul i16 %a, 2
%c = mul i16 %a, 32767		%c = mul i16 %a, 32767
%d = add i16 %b, %c		%d = add i16 %b, %c
ret i16 %d		ret i16 %d
; CHECK-LABEL: @test9(		; CHECK-LABEL: @test9(
; CHECK-NEXT: %d = mul i16 %a, -32767		; CHECK-NEXT: %d = mul i16 %a, -32767
; CHECK-NEXT: ret i16 %d		; CHECK-NEXT: ret i16 %d
}		}

		define i16 @add_nsw_mul_nsw(i16 %x) {
		%add1 = add nsw i16 %x, %x
		%add2 = add nsw i16 %add1, %x
		ret i16 %add2
		; CHECK-LABEL: @add_nsw_mul_nsw(
		; CHECK-NEXT: %add2 = mul nsw i16 %x, 3
		; CHECK-NEXT: ret i16 %add2
		}

		define i16 @mul_add_to_mul_1(i16 %x) {
		%mul1 = mul nsw i16 %x, 8
		%add2 = add nsw i16 %x, %mul1
		ret i16 %add2
		; CHECK-LABEL: @mul_add_to_mul_1(
		; CHECK-NEXT: %add2 = mul nsw i16 %x, 9
		; CHECK-NEXT: ret i16 %add2
		}

		define i16 @mul_add_to_mul_2(i16 %x) {
		%mul1 = mul nsw i16 %x, 8
		%add2 = add nsw i16 %mul1, %x
		ret i16 %add2
		; CHECK-LABEL: @mul_add_to_mul_2(
		; CHECK-NEXT: %add2 = mul nsw i16 %x, 9
		; CHECK-NEXT: ret i16 %add2
		}

		define i16 @mul_add_to_mul_3(i16 %a) {
		%mul1 = mul i16 %a, 2
		%mul2 = mul i16 %a, 3
		%add = add nsw i16 %mul1, %mul2
		ret i16 %add
		; CHECK-LABEL: @mul_add_to_mul_3(
		; CHECK-NEXT: %add = mul i16 %a, 5
		; CHECK-NEXT: ret i16 %add
		}

		define i16 @mul_add_to_mul_4(i16 %a) {
		%mul1 = mul nsw i16 %a, 2
		%mul2 = mul nsw i16 %a, 7
		%add = add nsw i16 %mul1, %mul2
		ret i16 %add
		; CHECK-LABEL: @mul_add_to_mul_4(
		; CHECK-NEXT: %add = mul nsw i16 %a, 9
		; CHECK-NEXT: ret i16 %add
		}

		define i16 @mul_add_to_mul_5(i16 %a) {
		%mul1 = mul nsw i16 %a, 3
		%mul2 = mul nsw i16 %a, 7
		%add = add nsw i16 %mul1, %mul2
		ret i16 %add
		; CHECK-LABEL: @mul_add_to_mul_5(
		; CHECK-NEXT: %add = mul nsw i16 %a, 10
		; CHECK-NEXT: ret i16 %add
		}

		define i32 @mul_add_to_mul_6(i32 %x, i32 %y) {
		%mul1 = mul nsw i32 %x, %y
		%mul2 = mul nsw i32 %mul1, 5
		%add = add nsw i32 %mul1, %mul2
		ret i32 %add
		; CHECK-LABEL: @mul_add_to_mul_6(
		; CHECK-NEXT: %mul1 = mul nsw i32 %x, %y
		; CHECK-NEXT: %add = mul nsw i32 %mul1, 6
		; CHECK-NEXT: ret i32 %add
		}

This is an archive of the discontinued LLVM Phabricator instance.

Fixing inst-combine not to drops nsw when combining adds into mul (PR19263)ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 10616

llvm/trunk/lib/Transforms/InstCombine/InstCombineAddSub.cpp

llvm/trunk/lib/Transforms/InstCombine/InstructionCombining.cpp

llvm/trunk/test/Transforms/InstCombine/add2.ll

Fixing inst-combine not to drops nsw when combining adds into mul (PR19263)
ClosedPublic