Download Raw Diff

Details

Reviewers

lebedev.ri
arsenm
spatel

Commits

rGb92412fb286b: [InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)`

Summary

%not1 = xor i32 %b, -1
%not2 = xor i32 %c, -1
%and1 = and i32 %a, %not1
%and2 = and i32 %and1, %not2

>

%i1 = or i32 %b, %c
%i2 = xor i32 %1, -1
%and2 = and i32 %i2, %a

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rampitec created this revision.Oct 19 2021, 4:22 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptOct 19 2021, 4:22 PM

rampitec requested review of this revision.Oct 19 2021, 4:22 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 19 2021, 4:22 PM

Herald added a subscriber: wdng. · View Herald Transcript

xbolva00 added a subscriber: xbolva00.Oct 19 2021, 4:36 PM

xbolva00 added inline comments.

llvm/test/Transforms/InstCombine/and-xor-or.ll
566	Atleast one vector test please

xbolva00 added a reviewer: spatel.Oct 19 2021, 4:36 PM

Added vector test.

Harbormaster completed remote builds in B129640: Diff 380823.Oct 19 2021, 5:24 PM

Please pre-commit the tests with current output, so we can confirm that we are testing all of the commuted patterns correctly.

This seems like another short-coming of the reassociation pass, but I think it's ok to deal with the minimal case here.
For example (if I'm seeing it correctly), this test still won't change:

define i4 @src(i4 %a, i4 %b, i4 %c, i4 %d) {
  %notb = xor i4 %b, -1
  %notc = xor i4 %c, -1
  %and1 = and i4 %a, %notb
  %and2 = and i4 %and1, %d
  %and3 = and i4 %and2, %notc
  ret i4 %and3
}

https://alive2.llvm.org/ce/z/_T8ZhP

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp
2024–2025	I suspect this is dead code (so it can be removed) - complexity-based canonicalization should guarantee that an `and` is always op0 in this pattern.
llvm/test/Transforms/InstCombine/and-xor-or.ll
531	'not' is considered more complex than an argument, so this is not testing the pattern that you wanted to test. See InstCombiner::getComplexity() for the details. Search for "thwart complexity-based canonicalization" in this test directory for test coverage that works around it.
578	Similar to above test comment - this is probably getting altered before we get to the new code in this patch, so it doesn't test what you expected.

Addressed review comments.
Pre-commited test and rebased.

In D112108#3074952, @spatel wrote:
This seems like another short-coming of the reassociation pass, but I think it's ok to deal with the minimal case here.
For example (if I'm seeing it correctly), this test still won't change:
define i4 @src(i4 %a, i4 %b, i4 %c, i4 %d) {
  %notb = xor i4 %b, -1
  %notc = xor i4 %c, -1
  %and1 = and i4 %a, %notb
  %and2 = and i4 %and1, %d
  %and3 = and i4 %and2, %notc
  ret i4 %and3
}
https://alive2.llvm.org/ce/z/_T8ZhP

Yes, this test does not change.

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp
2024–2025	That seems to be right. Removed.

spatel added inline comments.Oct 20 2021, 11:53 AM

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp
2018–2020	You can fold the one-use check into the 1st match with m_OneUse(). Could also use && for the match clauses instead of nested ifs to make the structure more like the code above here.
llvm/test/Transforms/InstCombine/and-xor-or.ll
556	In this test, we do not want to thwart complexity-based canonicalization - we want `%a` to remain as op1, so you don't need the extra `sdiv`.
571–572	This one is going to be commuted no matter what we do. It's fine to include this for completeness, but let's have it show another variation - add an extra use for `%not2` with something like: call void @use(i32 %not2) Similarly, we should have a negative test for the transform when `%and1` has another use.

Harbormaster completed remote builds in B129789: Diff 381034.Oct 20 2021, 12:16 PM

Addressed review comments.

LGTM - I'd prefer that the NFC test changes get committed first followed by the code patch and its test changes. That way, we'll still have the expected tests/results in place even if the patch gets reverted for some reason.

This revision is now accepted and ready to land.Oct 20 2021, 12:47 PM

Rebased on pre-comitted test update.

This revision was landed with ongoing or failed builds.Oct 20 2021, 1:06 PM

Closed by commit rGb92412fb286b: [InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)` (authored by rampitec). · Explain Why

This revision was automatically updated to reflect the committed changes.

rampitec added a commit: rGb92412fb286b: [InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)`.

Harbormaster completed remote builds in B129814: Diff 381073.Oct 20 2021, 1:30 PM

In D112108#3075918, @rampitec wrote:
In D112108#3074952, @spatel wrote:
This seems like another short-coming of the reassociation pass, but I think it's ok to deal with the minimal case here.
For example (if I'm seeing it correctly), this test still won't change:
define i4 @src(i4 %a, i4 %b, i4 %c, i4 %d) {
  %notb = xor i4 %b, -1
  %notc = xor i4 %c, -1
  %and1 = and i4 %a, %notb
  %and2 = and i4 %and1, %d
  %and3 = and i4 %and2, %notc
  ret i4 %and3
}
https://alive2.llvm.org/ce/z/_T8ZhP
Yes, this test does not change.

Actually it is transformed too. It does not with opt -instcombine but does with opt -reassociate -instcombine or just opt -O3.

In D112108#3076632, @rampitec wrote:
In D112108#3075918, @rampitec wrote:
In D112108#3074952, @spatel wrote:
This seems like another short-coming of the reassociation pass, but I think it's ok to deal with the minimal case here.
For example (if I'm seeing it correctly), this test still won't change:
define i4 @src(i4 %a, i4 %b, i4 %c, i4 %d) {
  %notb = xor i4 %b, -1
  %notc = xor i4 %c, -1
  %and1 = and i4 %a, %notb
  %and2 = and i4 %and1, %d
  %and3 = and i4 %and2, %notc
  ret i4 %and3
}
https://alive2.llvm.org/ce/z/_T8ZhP
Yes, this test does not change.
Actually it is transformed too. It does not with opt -instcombine but does with opt -reassociate -instcombine or just opt -O3.

Thanks, that's good news. If you want to make sure that combination of transforms doesn't break invisibly, you could add a test like that for -O{1,2,3} runs to /test/Transforms/PhaseOrdering..

spatel mentioned this in rG6b560a8e235b: [InstCombine] add tests for DeMorgan with reassociation; NFC.Oct 21 2021, 7:44 AM

spatel mentioned this in rG3888de9507c7: [InstCombine] generalize reassociated Demorgan folds.

I forgot to mention it during the review, but we should always have these kinds of bitwise logic folds apply to the Demorgan'ized sibling form too (and this fold itself is just reassociation + Demorgan):
3888de9507c7

In D112108#3077962, @spatel wrote:

I forgot to mention it during the review, but we should always have these kinds of bitwise logic folds apply to the Demorgan'ized sibling form too (and this fold itself is just reassociation + Demorgan):
3888de9507c7

Thanks Sanjay! I now wander if a next testcase I am looking at needs a pattern as I wanted to add to visitOr, because it also looks like a more complex case of reassociation:

(c & ~(a | b)) | (b & ~(a | c)) --> ~a & (b ^ c)

We currently cannot simplify it.

define i32 @or_not_and(i32 %a, i32 %b, i32 %c) {
  %or1 = or i32 %a, %b
  %not1 = xor i32 %or1, -1
  %and1 = and i32 %not1, %c
  %or2 = or i32 %a, %c
  %not2 = xor i32 %or2, -1
  %and2 = and i32 %not2, %b
  %or3 = or i32 %and1, %and2
  ret i32 %or3
}

In D112108#3076662, @spatel wrote:

In D112108#3076632, @rampitec wrote:

Actually it is transformed too. It does not with opt -instcombine but does with opt -reassociate -instcombine or just opt -O3.

Thanks, that's good news. If you want to make sure that combination of transforms doesn't break invisibly, you could add a test like that for -O{1,2,3} runs to /test/Transforms/PhaseOrdering..

D112258

In D112108#3078420, @rampitec wrote:

(c & ~(a | b)) | (b & ~(a | c)) --> ~a & (b ^ c)

We currently cannot simplify it.

define i32 @or_not_and(i32 %a, i32 %b, i32 %c) {
  %or1 = or i32 %a, %b
  %not1 = xor i32 %or1, -1
  %and1 = and i32 %not1, %c
  %or2 = or i32 %a, %c
  %not2 = xor i32 %or2, -1
  %and2 = and i32 %not2, %b
  %or3 = or i32 %and1, %and2
  ret i32 %or3
}

Hmm...so that's 2 of the patterns in this patch glued together. I don't see an intermediate fold that we can use to reduce it. Either you have to hard-code a big pattern match or add a generalized boolean logic solver (as its own pass).

In D112108#3078966, @spatel wrote:
In D112108#3078420, @rampitec wrote:
(c & ~(a | b)) | (b & ~(a | c)) --> ~a & (b ^ c)
We currently cannot simplify it.
define i32 @or_not_and(i32 %a, i32 %b, i32 %c) {
  %or1 = or i32 %a, %b
  %not1 = xor i32 %or1, -1
  %and1 = and i32 %not1, %c
  %or2 = or i32 %a, %c
  %not2 = xor i32 %or2, -1
  %and2 = and i32 %not2, %b
  %or3 = or i32 %and1, %and2
  ret i32 %or3
}
Hmm...so that's 2 of the patterns in this patch glued together. I don't see an intermediate fold that we can use to reduce it. Either you have to hard-code a big pattern match or add a generalized boolean logic solver (as its own pass).

Having a separate boolean logic solver might be a good idea in a long run. I.e. collect operands unused ouside of a dag solved, build truth table, generate a minimal expression. But that probably needs much more thought than this initial idea.

spatel mentioned this in D112276: [InstCombine] Fold `(c & ~(a | b)) | (b & ~(a | c))` to `~a & (b ^ c)`.Oct 25 2021, 6:58 AM

Diff 381077

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

Show First 20 Lines • Show All 2,006 Lines • ▼ Show 20 Lines	if (Instruction *DeMorgan = matchDeMorgansLaws(I, Builder))

// ((~A) ^ B) & (A \| B) -> (A & B)		// ((~A) ^ B) & (A \| B) -> (A & B)
// ((~A) ^ B) & (B \| A) -> (A & B)		// ((~A) ^ B) & (B \| A) -> (A & B)
// (B ^ (~A)) & (A \| B) -> (A & B)		// (B ^ (~A)) & (A \| B) -> (A & B)
// (B ^ (~A)) & (B \| A) -> (A & B)		// (B ^ (~A)) & (B \| A) -> (A & B)
if (match(Op0, m_c_Xor(m_Not(m_Value(A)), m_Value(B))) &&		if (match(Op0, m_c_Xor(m_Not(m_Value(A)), m_Value(B))) &&
match(Op1, m_c_Or(m_Specific(A), m_Specific(B))))		match(Op1, m_c_Or(m_Specific(A), m_Specific(B))))
return BinaryOperator::CreateAnd(A, B);		return BinaryOperator::CreateAnd(A, B);

		// (A & ~B) & ~C -> A & ~(B \| C)
		// (~B & A) & ~C -> A & ~(B \| C)
		if (match(Op0, m_OneUse(m_c_And(m_Value(A), m_Not(m_Value(B))))) &&
		match(Op1, m_Not(m_Value(C))))
		return BinaryOperator::CreateAnd(
		spatelUnsubmitted Done Reply Inline Actions You can fold the one-use check into the 1st match with m_OneUse(). Could also use && for the match clauses instead of nested ifs to make the structure more like the code above here. spatel: You can fold the one-use check into the 1st match with m_OneUse(). Could also use && for the…
		A, Builder.CreateNot(Builder.CreateOr(B, C)));
}		}

{		{
ICmpInst *LHS = dyn_cast<ICmpInst>(Op0);		ICmpInst *LHS = dyn_cast<ICmpInst>(Op0);
		spatelUnsubmitted Done Reply Inline Actions I suspect this is dead code (so it can be removed) - complexity-based canonicalization should guarantee that an `and` is always op0 in this pattern. spatel: I suspect this is dead code (so it can be removed) - complexity-based canonicalization should…
		rampitecAuthorUnsubmitted Done Reply Inline Actions That seems to be right. Removed. rampitec: That seems to be right. Removed.
ICmpInst *RHS = dyn_cast<ICmpInst>(Op1);		ICmpInst *RHS = dyn_cast<ICmpInst>(Op1);
if (LHS && RHS)		if (LHS && RHS)
if (Value *Res = foldAndOfICmps(LHS, RHS, I))		if (Value *Res = foldAndOfICmps(LHS, RHS, I))
return replaceInstUsesWith(I, Res);		return replaceInstUsesWith(I, Res);

// TODO: Make this recursive; it's a little tricky because an arbitrary		// TODO: Make this recursive; it's a little tricky because an arbitrary
// number of 'and' instructions might have to be created.		// number of 'and' instructions might have to be created.
if (LHS && match(Op1, m_OneUse(m_And(m_Value(X), m_Value(Y))))) {		if (LHS && match(Op1, m_OneUse(m_And(m_Value(X), m_Value(Y))))) {
▲ Show 20 Lines • Show All 1,676 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/and-xor-or.ll

Show First 20 Lines • Show All 516 Lines • ▼ Show 20 Lines	;
ret i64 %use2		ret i64 %use2
}		}

; (a & ~b) & ~c --> a & ~(b \| c)		; (a & ~b) & ~c --> a & ~(b \| c)

define i32 @not_and_and_not(i32 %a0, i32 %b, i32 %c) {		define i32 @not_and_and_not(i32 %a0, i32 %b, i32 %c) {
; CHECK-LABEL: @not_and_and_not(		; CHECK-LABEL: @not_and_and_not(
; CHECK-NEXT: [[A:%.]] = sdiv i32 42, [[A0:%.]]		; CHECK-NEXT: [[A:%.]] = sdiv i32 42, [[A0:%.]]
; CHECK-NEXT: [[NOT1:%.]] = xor i32 [[B:%.]], -1		; CHECK-NEXT: [[TMP1:%.]] = or i32 [[B:%.]], [[C:%.*]]
; CHECK-NEXT: [[NOT2:%.]] = xor i32 [[C:%.]], -1		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], -1
; CHECK-NEXT: [[AND1:%.*]] = and i32 [[A]], [[NOT1]]		; CHECK-NEXT: [[AND2:%.*]] = and i32 [[A]], [[TMP2]]
; CHECK-NEXT: [[AND2:%.*]] = and i32 [[AND1]], [[NOT2]]
; CHECK-NEXT: ret i32 [[AND2]]		; CHECK-NEXT: ret i32 [[AND2]]
;		;
%a = sdiv i32 42, %a0 ; thwart complexity-based canonicalization		%a = sdiv i32 42, %a0 ; thwart complexity-based canonicalization
%not1 = xor i32 %b, -1		%not1 = xor i32 %b, -1
		spatelUnsubmitted Done Reply Inline Actions 'not' is considered more complex than an argument, so this is not testing the pattern that you wanted to test. See InstCombiner::getComplexity() for the details. Search for "thwart complexity-based canonicalization" in this test directory for test coverage that works around it. spatel: 'not' is considered more complex than an argument, so this is not testing the pattern that you…
%not2 = xor i32 %c, -1		%not2 = xor i32 %c, -1
%and1 = and i32 %a, %not1		%and1 = and i32 %a, %not1
%and2 = and i32 %and1, %not2		%and2 = and i32 %and1, %not2
ret i32 %and2		ret i32 %and2
}		}

define <4 x i64> @not_and_and_not_4i64(<4 x i64> %a0, <4 x i64> %b, <4 x i64> %c) {		define <4 x i64> @not_and_and_not_4i64(<4 x i64> %a0, <4 x i64> %b, <4 x i64> %c) {
; CHECK-LABEL: @not_and_and_not_4i64(		; CHECK-LABEL: @not_and_and_not_4i64(
; CHECK-NEXT: [[A:%.]] = sdiv <4 x i64> <i64 42, i64 42, i64 42, i64 42>, [[A0:%.]]		; CHECK-NEXT: [[A:%.]] = sdiv <4 x i64> <i64 42, i64 42, i64 42, i64 42>, [[A0:%.]]
; CHECK-NEXT: [[NOT1:%.]] = xor <4 x i64> [[B:%.]], <i64 -1, i64 -1, i64 -1, i64 -1>		; CHECK-NEXT: [[TMP1:%.]] = or <4 x i64> [[B:%.]], [[C:%.*]]
; CHECK-NEXT: [[NOT2:%.]] = xor <4 x i64> [[C:%.]], <i64 -1, i64 -1, i64 -1, i64 -1>		; CHECK-NEXT: [[TMP2:%.*]] = xor <4 x i64> [[TMP1]], <i64 -1, i64 -1, i64 -1, i64 -1>
; CHECK-NEXT: [[AND1:%.*]] = and <4 x i64> [[A]], [[NOT1]]		; CHECK-NEXT: [[AND2:%.*]] = and <4 x i64> [[A]], [[TMP2]]
; CHECK-NEXT: [[AND2:%.*]] = and <4 x i64> [[AND1]], [[NOT2]]
; CHECK-NEXT: ret <4 x i64> [[AND2]]		; CHECK-NEXT: ret <4 x i64> [[AND2]]
;		;
%a = sdiv <4 x i64> <i64 42, i64 42, i64 42, i64 42>, %a0 ; thwart complexity-based canonicalization		%a = sdiv <4 x i64> <i64 42, i64 42, i64 42, i64 42>, %a0 ; thwart complexity-based canonicalization
%not1 = xor <4 x i64> %b, <i64 -1, i64 -1, i64 -1, i64 -1>		%not1 = xor <4 x i64> %b, <i64 -1, i64 -1, i64 -1, i64 -1>
%not2 = xor <4 x i64> %c, <i64 -1, i64 -1, i64 -1, i64 -1>		%not2 = xor <4 x i64> %c, <i64 -1, i64 -1, i64 -1, i64 -1>
%and1 = and <4 x i64> %a, %not1		%and1 = and <4 x i64> %a, %not1
%and2 = and <4 x i64> %and1, %not2		%and2 = and <4 x i64> %and1, %not2
ret <4 x i64> %and2		ret <4 x i64> %and2
}		}

; (~b & a) & ~c --> a & ~(b \| c)		; (~b & a) & ~c --> a & ~(b \| c)

define i32 @not_and_and_not_commute1(i32 %a, i32 %b, i32 %c) {		define i32 @not_and_and_not_commute1(i32 %a, i32 %b, i32 %c) {
		spatelUnsubmitted Done Reply Inline Actions In this test, we do not want to thwart complexity-based canonicalization - we want `%a` to remain as op1, so you don't need the extra `sdiv`. spatel: In this test, we do not want to thwart complexity-based canonicalization - we want `%a` to…
; CHECK-LABEL: @not_and_and_not_commute1(		; CHECK-LABEL: @not_and_and_not_commute1(
; CHECK-NEXT: [[NOT1:%.]] = xor i32 [[B:%.]], -1		; CHECK-NEXT: [[TMP1:%.]] = or i32 [[B:%.]], [[C:%.*]]
; CHECK-NEXT: [[NOT2:%.]] = xor i32 [[C:%.]], -1		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], -1
; CHECK-NEXT: [[AND1:%.]] = and i32 [[NOT1]], [[A:%.]]		; CHECK-NEXT: [[AND2:%.]] = and i32 [[TMP2]], [[A:%.]]
; CHECK-NEXT: [[AND2:%.*]] = and i32 [[AND1]], [[NOT2]]
; CHECK-NEXT: ret i32 [[AND2]]		; CHECK-NEXT: ret i32 [[AND2]]
;		;
%not1 = xor i32 %b, -1		%not1 = xor i32 %b, -1
%not2 = xor i32 %c, -1		%not2 = xor i32 %c, -1
%and1 = and i32 %not1, %a		%and1 = and i32 %not1, %a
%and2 = and i32 %and1, %not2		%and2 = and i32 %and1, %not2
		xbolva00Unsubmitted Done Reply Inline Actions Atleast one vector test please xbolva00: Atleast one vector test please
ret i32 %and2		ret i32 %and2
}		}

; ~c & (a & ~b) --> a & ~(b \| c)		; ~c & (a & ~b) --> a & ~(b \| c)

declare void @use(i32)		declare void @use(i32)
		spatelUnsubmitted Done Reply Inline Actions This one is going to be commuted no matter what we do. It's fine to include this for completeness, but let's have it show another variation - add an extra use for `%not2` with something like: call void @use(i32 %not2) Similarly, we should have a negative test for the transform when `%and1` has another use. spatel: This one is going to be commuted no matter what we do. It's fine to include this for…
define i32 @not_and_and_not_commute2_extra_not_use(i32 %a0, i32 %b, i32 %c) {		define i32 @not_and_and_not_commute2_extra_not_use(i32 %a0, i32 %b, i32 %c) {
; CHECK-LABEL: @not_and_and_not_commute2_extra_not_use(		; CHECK-LABEL: @not_and_and_not_commute2_extra_not_use(
; CHECK-NEXT: [[A:%.]] = sdiv i32 42, [[A0:%.]]		; CHECK-NEXT: [[A:%.]] = sdiv i32 42, [[A0:%.]]
; CHECK-NEXT: [[NOT1:%.]] = xor i32 [[B:%.]], -1
; CHECK-NEXT: [[NOT2:%.]] = xor i32 [[C:%.]], -1		; CHECK-NEXT: [[NOT2:%.]] = xor i32 [[C:%.]], -1
; CHECK-NEXT: [[AND1:%.*]] = and i32 [[A]], [[NOT1]]		; CHECK-NEXT: [[TMP1:%.]] = or i32 [[B:%.]], [[C]]
; CHECK-NEXT: [[AND2:%.*]] = and i32 [[AND1]], [[NOT2]]		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], -1
		spatelUnsubmitted Done Reply Inline Actions Similar to above test comment - this is probably getting altered before we get to the new code in this patch, so it doesn't test what you expected. spatel: Similar to above test comment - this is probably getting altered before we get to the new code…
		; CHECK-NEXT: [[AND2:%.*]] = and i32 [[A]], [[TMP2]]
; CHECK-NEXT: call void @use(i32 [[NOT2]])		; CHECK-NEXT: call void @use(i32 [[NOT2]])
; CHECK-NEXT: ret i32 [[AND2]]		; CHECK-NEXT: ret i32 [[AND2]]
;		;
%a = sdiv i32 42, %a0 ; thwart complexity-based canonicalization		%a = sdiv i32 42, %a0 ; thwart complexity-based canonicalization
%not1 = xor i32 %b, -1		%not1 = xor i32 %b, -1
%not2 = xor i32 %c, -1		%not2 = xor i32 %c, -1
%and1 = and i32 %a, %not1		%and1 = and i32 %a, %not1
%and2 = and i32 %not2, %and1		%and2 = and i32 %not2, %and1
Show All 22 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)`
ClosedPublic

Details

>

Diff Detail

Event Timeline

Revision Contents

Diff 381077

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

llvm/test/Transforms/InstCombine/and-xor-or.ll

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)`ClosedPublic

Details

>

Diff Detail

Event Timeline

Revision Contents

Diff 381077

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

llvm/test/Transforms/InstCombine/and-xor-or.ll

[InstCombine] Fold `(a & ~b) & ~c` to `a & ~(b | c)`
ClosedPublic