This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Analysis/
-
Analysis/
2/5
InstructionSimplify.cpp
-
test/Transforms/InstSimplify/
-
Transforms/
-
InstSimplify/
5/6
rem.ll

Differential D142901

[InstSimplify] Simplify UREM and SREM left shifted operands
AbandonedPublic

Authored by MattDevereau on Jan 30 2023, 8:24 AM.

Download Raw Diff

Details

Reviewers

sdesmalen
peterwaller-arm
david-arm
spatel

Summary

If both operands of a rem instruction are left shifts of the same value, this can be simplified to 0 or the first operand

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	130 ms	x64 debian > Flang.Driver::target-cpu-features.f90

Event Timeline

MattDevereau created this revision.Jan 30 2023, 8:24 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 30 2023, 8:24 AM

Herald added subscribers: StephenFan, hiraditya. · View Herald Transcript

MattDevereau requested review of this revision.Jan 30 2023, 8:24 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 30 2023, 8:24 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Here are some alive tests which support the transforms:
https://alive2.llvm.org/ce/z/9KpSpV
https://alive2.llvm.org/ce/z/JFsIPK

negative transformations:
https://alive2.llvm.org/ce/z/9HD6L_
https://alive2.llvm.org/ce/z/PDymNH

Nice improvement @MattDevereau! I just had a few minor comments ...

llvm/lib/Analysis/InstructionSimplify.cpp
1006	Perhaps you can create a temp variable here to reuse the logic, i.e. bool NoWrap = (IsSigned && Q.IIQ.hasNoSignedWrap(Shift)) \|\| (!IsSigned && Q.IIQ.hasNoUnsignedWrap(Shift)); if (S1 >= S2 && NoWrap) return Constant::getNullValue(Shift->getType()); if (NoWrap) return Op0;
llvm/test/Transforms/InstSimplify/rem.ll
502	Could you leave comments on the negative tests explaining why they fail? If it makes it easier you could put all the negative tests together?
539	Is it worth having a negative test for `srem` with `nuw` shifts as well? We shouldn't do any simplification in that case either.

MattDevereau added inline comments.Jan 30 2023, 9:13 AM

llvm/lib/Analysis/InstructionSimplify.cpp
1006	I'm not sure this suggestion is correct, they key thing here being `Shift = cast<OverflowingBinaryOperator>(Op1);` on line 1010 which changes `Shift` to be a cast of Op1 instead of Op0 on which the nsw/nuw flags are checked, which is because the larger shift value needs its flags checking. If we went ahead with this, in this example (https://alive2.llvm.org/ce/z/YiFELt) NoWrap would evaluate to true while the transform is not viable.
llvm/test/Transforms/InstSimplify/rem.ll
502	Sure thing, should be simple enough
539	I personally think the current tests are ok since we have one test for nsw which is testing the presence of nsw, and one which tests for it's absence and other flags aren't particularly relevant. These tests are quite compact though so I suppose there's no harm in adding extra cases.

Harbormaster completed remote builds in B210784: Diff 493321.Jan 30 2023, 9:21 AM

goldstein.w.n added a subscriber: goldstein.w.n.Jan 30 2023, 10:29 AM

goldstein.w.n added inline comments.

llvm/lib/Analysis/InstructionSimplify.cpp

995

Instead of a new helper maybe this should be in simplifyRem which already appears to have:

// (X << Y) % X -> 0
if (Q.IIQ.UseInstrInfo &&
    ((Opcode == Instruction::SRem &&
      match(Op0, m_NSWShl(m_Specific(Op1), m_Value()))) ||
     (Opcode == Instruction::URem &&
      match(Op0, m_NUWShl(m_Specific(Op1), m_Value())))))
  return Constant::getNullValue(Op0->getType());

This is really just a superset of that case, so maybe expanding the existing logic would be simpler?

goldstein.w.n added inline comments.Jan 30 2023, 10:35 AM

llvm/lib/Analysis/InstructionSimplify.cpp
997	Maybe add a `TODO` for the more generalized case of `(rem (mul nsw/nuw X, C1), (mul nsw/nuw X, C2) if C1 % C2 == 0 -> 0` We seem to be missing it: https://godbolt.org/z/fzxb4sxb5

david-arm added inline comments.Jan 31 2023, 1:17 AM

llvm/test/Transforms/InstSimplify/rem.ll
539	Sure, I was just thinking that specifically we don't want to perform the simplification for any of nsw or nuw. They both mark the instruction as not wrapping, but only the signed version applies here. It's just because your logic accepts both nsw and nuw flags as valid, but only if the signedness matches the instruction. I was hoping to defend against someone accidentally rewriting your code in future and losing the signedness checks.

Inlined unnecessary helper function
Added comments to negative tests
Added more incorrect flag tests

MattDevereau marked 3 inline comments as done.Jan 31 2023, 3:38 AM

MattDevereau added inline comments.

llvm/lib/Analysis/InstructionSimplify.cpp
995	I've inlined it into `simplifyRem` which I think looks a lot neater now. Thank you for the feedback.
llvm/test/Transforms/InstSimplify/rem.ll
539	I've added negative tests for urem with nsw and srem with nuw now.

Harbormaster completed remote builds in B210950: Diff 493550.Jan 31 2023, 4:58 AM

goldstein.w.n mentioned this in D143014: Add constant combines for `(urem/srem (mul X, Y), (mul X, Z))`.Jan 31 2023, 1:57 PM

Note that this patch will likely be dropped in favour of https://reviews.llvm.org/D143014

goldstein.w.n mentioned this in D143417: [InstCombine] Add fold for `(rem (mul/shl X, Y), (mul/shl X, Z))` -> `(mul X, (rem Y, Z))`.Feb 6 2023, 9:50 AM

goldstein.w.n mentioned this in D144225: [InstCombine] Add constant combines for `(urem/srem (shl X, Y), (shl X, Z))`.Feb 16 2023, 3:29 PM

MattDevereau abandoned this revision.Mar 14 2023, 3:06 AM

goldstein.w.n mentioned this in rG2cb6b06c8930: [InstCombine] Add constant combines for `(urem/srem (shl X, Y), (shl X, Z))`.Jul 6 2023, 12:47 PM

Revision Contents

Path

Size

llvm/

lib/

Analysis/

InstructionSimplify.cpp

26 lines

test/

Transforms/

InstSimplify/

rem.ll

94 lines

Diff 493321

llvm/lib/Analysis/InstructionSimplify.cpp

Show First 20 Lines • Show All 986 Lines • ▼ Show 20 Lines	static Value simplifyMulInst(Value Op0, Value *Op1, bool IsNSW, bool IsNUW,
return nullptr;		return nullptr;
}		}

Value llvm::simplifyMulInst(Value Op0, Value *Op1, bool IsNSW, bool IsNUW,		Value llvm::simplifyMulInst(Value Op0, Value *Op1, bool IsNSW, bool IsNUW,
const SimplifyQuery &Q) {		const SimplifyQuery &Q) {
return ::simplifyMulInst(Op0, Op1, IsNSW, IsNUW, Q, RecursionLimit);		return ::simplifyMulInst(Op0, Op1, IsNSW, IsNUW, Q, RecursionLimit);
}		}

		Value simplifyRemShifts(Value Op0, Value *Op1, bool IsSigned,
		goldstein.w.nUnsubmitted Not Done Reply Inline Actions Instead of a new helper maybe this should be in `simplifyRem` which already appears to have: // (X << Y) % X -> 0 if (Q.IIQ.UseInstrInfo && ((Opcode == Instruction::SRem && match(Op0, m_NSWShl(m_Specific(Op1), m_Value()))) \|\| (Opcode == Instruction::URem && match(Op0, m_NUWShl(m_Specific(Op1), m_Value()))))) return Constant::getNullValue(Op0->getType()); This is really just a superset of that case, so maybe expanding the existing logic would be simpler? goldstein.w.n: Instead of a new helper maybe this should be in `simplifyRem` which already appears to have…
		MattDevereauAuthorUnsubmitted Done Reply Inline Actions I've inlined it into `simplifyRem` which I think looks a lot neater now. Thank you for the feedback. MattDevereau: I've inlined it into `simplifyRem` which I think looks a lot neater now. Thank you for the…
		const SimplifyQuery &Q) {
		// rem X << S1, X << S2
		goldstein.w.nUnsubmitted Not Done Reply Inline Actions Maybe add a `TODO` for the more generalized case of `(rem (mul nsw/nuw X, C1), (mul nsw/nuw X, C2) if C1 % C2 == 0 -> 0` We seem to be missing it: https://godbolt.org/z/fzxb4sxb5 goldstein.w.n: Maybe add a `TODO` for the more generalized case of `(rem (mul nsw/nuw X, C1), (mul nsw/nuw X…
		// if (S1 >= S2) -> 0; else -> X << S1
		Value *ShiftX;
		ConstantInt S1, S2;
		if (!match(Op0, m_Shl(m_Value(ShiftX), m_ConstantInt(S1))) \|\|
		!match(Op1, m_Shl(m_Deferred(ShiftX), m_ConstantInt(S2))))
		return nullptr;

		auto Shift = cast<OverflowingBinaryOperator>(Op0);
		if (S1 >= S2 && ((IsSigned && Q.IIQ.hasNoSignedWrap(Shift)) \|\|
		david-armUnsubmitted Not Done Reply Inline Actions Perhaps you can create a temp variable here to reuse the logic, i.e. bool NoWrap = (IsSigned && Q.IIQ.hasNoSignedWrap(Shift)) \|\| (!IsSigned && Q.IIQ.hasNoUnsignedWrap(Shift)); if (S1 >= S2 && NoWrap) return Constant::getNullValue(Shift->getType()); if (NoWrap) return Op0; david-arm: Perhaps you can create a temp variable here to reuse the logic, i.e. bool NoWrap = (IsSigned…
		MattDevereauAuthorUnsubmitted Done Reply Inline Actions I'm not sure this suggestion is correct, they key thing here being `Shift = cast<OverflowingBinaryOperator>(Op1);` on line 1010 which changes `Shift` to be a cast of Op1 instead of Op0 on which the nsw/nuw flags are checked, which is because the larger shift value needs its flags checking. If we went ahead with this, in this example (https://alive2.llvm.org/ce/z/YiFELt) NoWrap would evaluate to true while the transform is not viable. MattDevereau: I'm not sure this suggestion is correct, they key thing here being `Shift =…
		(!IsSigned && Q.IIQ.hasNoUnsignedWrap(Shift))))
		return Constant::getNullValue(Shift->getType());

		Shift = cast<OverflowingBinaryOperator>(Op1);
		if ((IsSigned && Q.IIQ.hasNoSignedWrap(Shift)) \|\|
		(!IsSigned && Q.IIQ.hasNoUnsignedWrap(Shift)))
		return Op0;

		return nullptr;
		}

/// Check for common or similar folds of integer division or integer remainder.		/// Check for common or similar folds of integer division or integer remainder.
/// This applies to all 4 opcodes (sdiv/udiv/srem/urem).		/// This applies to all 4 opcodes (sdiv/udiv/srem/urem).
static Value simplifyDivRem(Instruction::BinaryOps Opcode, Value Op0,		static Value simplifyDivRem(Instruction::BinaryOps Opcode, Value Op0,
Value *Op1, const SimplifyQuery &Q,		Value *Op1, const SimplifyQuery &Q,
unsigned MaxRecurse) {		unsigned MaxRecurse) {
bool IsDiv = (Opcode == Instruction::SDiv \|\| Opcode == Instruction::UDiv);		bool IsDiv = (Opcode == Instruction::SDiv \|\| Opcode == Instruction::UDiv);
bool IsSigned = (Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem);		bool IsSigned = (Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem);

▲ Show 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	if ((IsSigned && Q.IIQ.hasNoSignedWrap(Mul)) \|\|
(!IsSigned && match(X, m_UDiv(m_Value(), m_Specific(Op1))))) {		(!IsSigned && match(X, m_UDiv(m_Value(), m_Specific(Op1))))) {
return IsDiv ? X : Constant::getNullValue(Op0->getType());		return IsDiv ? X : Constant::getNullValue(Op0->getType());
}		}
}		}

if (Value *V = simplifyByDomEq(Opcode, Op0, Op1, Q, MaxRecurse))		if (Value *V = simplifyByDomEq(Opcode, Op0, Op1, Q, MaxRecurse))
return V;		return V;

		if (!IsDiv)
		if (Value *V = simplifyRemShifts(Op0, Op1, IsSigned, Q))
		return V;
return nullptr;		return nullptr;
}		}

/// Given a predicate and two operands, return true if the comparison is true.		/// Given a predicate and two operands, return true if the comparison is true.
/// This is a helper for div/rem simplification where we return some other value		/// This is a helper for div/rem simplification where we return some other value
/// when we can prove a relationship between the operands.		/// when we can prove a relationship between the operands.
static bool isICmpTrue(ICmpInst::Predicate Pred, Value LHS, Value RHS,		static bool isICmpTrue(ICmpInst::Predicate Pred, Value LHS, Value RHS,
const SimplifyQuery &Q, unsigned MaxRecurse) {		const SimplifyQuery &Q, unsigned MaxRecurse) {
▲ Show 20 Lines • Show All 5,755 Lines • Show Last 20 Lines

llvm/test/Transforms/InstSimplify/rem.ll

	Show First 20 Lines • Show All 482 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[MOD:%.*]] = urem i8 [[MUL]], [[Y]]			; CHECK-NEXT: [[MOD:%.*]] = urem i8 [[MUL]], [[Y]]
	; CHECK-NEXT: ret i8 [[MOD]]			; CHECK-NEXT: ret i8 [[MOD]]
	;			;
	%d = sdiv i8 %x, %y			%d = sdiv i8 %x, %y
	%mul = mul i8 %y, %d			%mul = mul i8 %y, %d
	%mod = urem i8 %mul, %y			%mod = urem i8 %mul, %y
	ret i8 %mod			ret i8 %mod
	}			}

				define i8 @urem_shl(i8 %x){
				; CHECK-LABEL: @urem_shl(
				; CHECK-NEXT: ret i8 0
				;
				%x1 = shl i8 %x, 1
				%x2 = shl nuw i8 %x, 2
				%1 = urem i8 %x2, %x1
				ret i8 %1
				}

				define i8 @neg_urem_shl(i8 %x){
				david-armUnsubmitted Not Done Reply Inline Actions Could you leave comments on the negative tests explaining why they fail? If it makes it easier you could put all the negative tests together? david-arm: Could you leave comments on the negative tests explaining why they fail? If it makes it easier…
				MattDevereauAuthorUnsubmitted Done Reply Inline Actions Sure thing, should be simple enough MattDevereau: Sure thing, should be simple enough
				; CHECK-LABEL: @neg_urem_shl(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: [[X2:%.*]] = shl i8 [[X]], 2
				; CHECK-NEXT: [[TMP1:%.*]] = urem i8 [[X2]], [[X1]]
				; CHECK-NEXT: ret i8 [[TMP1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl i8 %x, 2
				%1 = urem i8 %x2, %x1
				ret i8 %1
				}

				define i8 @urem_shl_2(i8 %x){
				; CHECK-LABEL: @urem_shl_2(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: ret i8 [[X1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl nuw i8 %x, 2
				%1 = urem i8 %x1, %x2
				ret i8 %1
				}

				define i8 @neg_urem_shl_2(i8 %x){
				; CHECK-LABEL: @neg_urem_shl_2(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: [[X2:%.*]] = shl i8 [[X]], 2
				; CHECK-NEXT: [[TMP1:%.*]] = urem i8 [[X1]], [[X2]]
				; CHECK-NEXT: ret i8 [[TMP1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl i8 %x, 2
				%1 = urem i8 %x1, %x2
				ret i8 %1
				}

				define i8 @srem_shl(i8 %x){
				david-armUnsubmitted Done Reply Inline Actions Is it worth having a negative test for `srem` with `nuw` shifts as well? We shouldn't do any simplification in that case either. david-arm: Is it worth having a negative test for `srem` with `nuw` shifts as well? We shouldn't do any…
				MattDevereauAuthorUnsubmitted Done Reply Inline Actions I personally think the current tests are ok since we have one test for nsw which is testing the presence of nsw, and one which tests for it's absence and other flags aren't particularly relevant. These tests are quite compact though so I suppose there's no harm in adding extra cases. MattDevereau: I personally think the current tests are ok since we have one test for nsw which is testing the…
				david-armUnsubmitted Done Reply Inline Actions Sure, I was just thinking that specifically we don't want to perform the simplification for any of nsw or nuw. They both mark the instruction as not wrapping, but only the signed version applies here. It's just because your logic accepts both nsw and nuw flags as valid, but only if the signedness matches the instruction. I was hoping to defend against someone accidentally rewriting your code in future and losing the signedness checks. david-arm: Sure, I was just thinking that specifically we don't want to perform the simplification for any…
				MattDevereauAuthorUnsubmitted Done Reply Inline Actions I've added negative tests for urem with nsw and srem with nuw now. MattDevereau: I've added negative tests for urem with nsw and srem with nuw now.
				; CHECK-LABEL: @srem_shl(
				; CHECK-NEXT: ret i8 0
				;
				%x1 = shl i8 %x, 1
				%x2 = shl nsw i8 %x, 2
				%1 = srem i8 %x2, %x1
				ret i8 %1
				}

				define i8 @neg_srem_shl(i8 %x){
				; CHECK-LABEL: @neg_srem_shl(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: [[X2:%.*]] = shl i8 [[X]], 2
				; CHECK-NEXT: [[TMP1:%.*]] = srem i8 [[X2]], [[X1]]
				; CHECK-NEXT: ret i8 [[TMP1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl i8 %x, 2
				%1 = srem i8 %x2, %x1
				ret i8 %1
				}

				define i8 @srem_shl_2(i8 %x){
				; CHECK-LABEL: @srem_shl_2(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: [[X2:%.*]] = shl i8 [[X]], 2
				; CHECK-NEXT: [[TMP1:%.*]] = srem i8 [[X1]], [[X2]]
				; CHECK-NEXT: ret i8 [[TMP1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl i8 %x, 2
				%1 = srem i8 %x1, %x2
				ret i8 %1
				}

				define i8 @neg_srem_shl_2(i8 %x){
				; CHECK-LABEL: @neg_srem_shl_2(
				; CHECK-NEXT: [[X1:%.]] = shl i8 [[X:%.]], 1
				; CHECK-NEXT: ret i8 [[X1]]
				;
				%x1 = shl i8 %x, 1
				%x2 = shl nsw i8 %x, 2
				%1 = srem i8 %x1, %x2
				ret i8 %1
				}

This is an archive of the discontinued LLVM Phabricator instance.

[InstSimplify] Simplify UREM and SREM left shifted operandsAbandonedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 493321

llvm/lib/Analysis/InstructionSimplify.cpp

llvm/test/Transforms/InstSimplify/rem.ll

[InstSimplify] Simplify UREM and SREM left shifted operands
AbandonedPublic