This is an archive of the discontinued LLVM Phabricator instance.


Differential D148414

[InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work for more binops
ClosedPublic

Authored by goldstein.w.n on Apr 14 2023, 11:12 PM.

Details

Reviewers

majnemer
spatel
nikic
craig.topper

Commits

rG54ec8bcaf85e: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->…
rG397a9cc4d875: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->…
rGd3402bc4460a: [InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work…

Summary

This expands the existing logic, which only handled `or`, and
applies it to any binop where 0 is the identity value on the RHS,
e.g. add, or, xor, shl, etc.

Proofs for some of the binops: https://alive2.llvm.org/ce/z/XZo6JD
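
To make the "0 is the identity on the RHS" condition concrete, here is a rough, self-contained C++ sketch that models the i8 case exhaustively. It is not the InstCombine code: the names `Op`, `applyOp`, `selectForm`, `foldedForm` and `formsAgree` are invented for illustration, and it assumes the condition tests bit 0 of `x` and the binop constant is 2.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a few binops whose right-hand identity is 0.
enum class Op { Add, Or, Xor, Shl, LShr };

// Evaluate `y op c` with i8 wrap-around semantics (c is 0 or 2 here).
inline uint8_t applyOp(Op op, uint8_t y, uint8_t c) {
  switch (op) {
  case Op::Add:  return static_cast<uint8_t>(y + c);
  case Op::Or:   return static_cast<uint8_t>(y | c);
  case Op::Xor:  return static_cast<uint8_t>(y ^ c);
  case Op::Shl:  return static_cast<uint8_t>(y << c);
  case Op::LShr: return static_cast<uint8_t>(y >> c);
  }
  return y;
}

// Original form: select ((x & 1) == 0), y, (y op 2)
inline uint8_t selectForm(Op op, uint8_t x, uint8_t y) {
  return (x & 1) == 0 ? y : applyOp(op, y, 2);
}

// Folded form: y op ((x << 1) & 2).
// The mask is 0 exactly when the select would have picked plain y.
inline uint8_t foldedForm(Op op, uint8_t x, uint8_t y) {
  uint8_t mask = static_cast<uint8_t>((x << 1) & 2);
  return applyOp(op, y, mask);
}

// Compare both forms over every pair of i8 inputs.
inline bool formsAgree(Op op) {
  for (int x = 0; x < 256; ++x)
    for (int y = 0; y < 256; ++y)
      if (selectForm(op, static_cast<uint8_t>(x), static_cast<uint8_t>(y)) !=
          foldedForm(op, static_cast<uint8_t>(x), static_cast<uint8_t>(y)))
        return false;
  return true;
}
```

The folded form only works because `y op 0 == y` for each operator in the list, so the "not selected" arm falls out of the masked constant for free.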

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

goldstein.w.n created this revision. · Apr 14 2023, 11:12 PM

Herald added a project: Restricted Project. · Apr 14 2023, 11:12 PM

Herald added subscribers: StephenFan, hiraditya.

goldstein.w.n requested review of this revision. · Apr 14 2023, 11:12 PM

Herald added a project: Restricted Project. · Apr 14 2023, 11:12 PM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B225807: Diff 513849. · Apr 14 2023, 11:12 PM

goldstein.w.n added a child revision: D148415: [InstCombine] Improve cost calculation in foldSelectICmpAndBinOp. · Apr 14 2023, 11:13 PM

goldstein.w.n added a parent revision: D148413: [InstCombine] Remove requirement on `trunc` in `slt/sgt` case in `foldSelectICmpAndOr`.

Rebase

goldstein.w.n removed a parent revision: D148413: [InstCombine] Remove requirement on `trunc` in `slt/sgt` case in `foldSelectICmpAndOr`. · Apr 19 2023, 1:45 PM

goldstein.w.n added a parent revision: D148744: [InstCombine] Refactor foldSelectICmpAndOr to use `decomposeBitTestICmp` instead of bespoke logic.

Harbormaster completed remote builds in B226689: Diff 515080. · Apr 19 2023, 3:05 PM

Why is this legal for all binops? Doesn't this transform require that the neutral element of the binop is zero? So it would work for or, xor or add, but not for mul or and for example?

I think this entire transform should probably be handled as a two step process: First, Cond ? X : BinOp(X, C) should become BinOp(X, Cond ? NeutralC : C) and then this fold should work on Cond ? 0 : C as the root. We actually already do the former canonicalization, but not in the case where C is constant. This will cleanly separate out the actual binop handling. Only disadvantage is that we won't be able to handle the case where the binop has multiple uses anymore, but that's a general issue with composable folds that we can ignore unless there is reason to believe that multi-use is motivating for the current handling.
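
A small sketch of the first step proposed here, modeled on i8 multiplication. The function names are illustrative only, and the neutral constant is passed in as a parameter so the effect of choosing the wrong one (0 instead of 1 for `mul`) can be seen directly.

```cpp
#include <cassert>
#include <cstdint>
#include <initializer_list>

// Direct form: Cond ? X : (X * C)
inline uint8_t directForm(bool cond, uint8_t x, uint8_t c) {
  return cond ? x : static_cast<uint8_t>(x * c);
}

// Canonicalized form: X * (Cond ? NeutralC : C).
// `neutral` is a parameter so that a wrong choice can be tested.
inline uint8_t canonicalForm(bool cond, uint8_t x, uint8_t c, uint8_t neutral) {
  uint8_t rhs = cond ? neutral : c;
  return static_cast<uint8_t>(x * rhs);
}

// Check both forms against each other for every i8 value of X.
inline bool agreesForAllX(uint8_t c, uint8_t neutral) {
  for (int x = 0; x < 256; ++x)
    for (bool cond : {false, true})
      if (directForm(cond, static_cast<uint8_t>(x), c) !=
          canonicalForm(cond, static_cast<uint8_t>(x), c, neutral))
        return false;
  return true;
}
```

With `neutral = 1` the two forms agree for `mul`; with `neutral = 0` they do not, which is the objection raised about applying the zero-based fold to every binop.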

This revision now requires changes to proceed. · Apr 23 2023, 1:15 AM

In D148414#4290296, @nikic wrote:

Why is this legal for all binops? Doesn't this transform require that the neutral element of the binop is zero? So it would work for or, xor or add, but not for mul or and for example?

Yeah you are right. Not sure what I was thinking...

I think this entire transform should probably be handled as a two step process: First, Cond ? X : BinOp(X, C) should become BinOp(X, Cond ? NeutralC : C) and then this fold should work on Cond ? 0 : C as the root.
We actually already do the former canonicalization, but not in the case where C is constant.

Where do we do the canonicalization?

This will cleanly separate out the actual binop handling. Only disadvantage is that we won't be able to handle the case where the binop has multiple uses anymore, but that's a general issue with composable folds that we can ignore unless there is reason to believe that multi-use is motivating for the current handling.

Why won't we be able to handle multi-use anymore? If we do it like this, seems this function can be deleted once we get the canonicalization of Cond ? X : BinOp(X, C) -> BinOp(X, Cond ? NeutralC : C)?

In D148414#4290762, @goldstein.w.n wrote:

I think this entire transform should probably be handled as a two step process: First, Cond ? X : BinOp(X, C) should become BinOp(X, Cond ? NeutralC : C) and then this fold should work on Cond ? 0 : C as the root.
We actually already do the former canonicalization, but not in the case where C is constant.

Where do we do the canonicalization?

In foldSelectIntoOp(), with the current constant restriction at https://github.com/llvm/llvm-project/blob/8d163e5045073a5ac570225cc8e14cc9f6d72f09/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp#L507.

In D148414#4290787, @nikic wrote:

In D148414#4290762, @goldstein.w.n wrote:

I think this entire transform should probably be handled as a two step process: First, Cond ? X : BinOp(X, C) should become BinOp(X, Cond ? NeutralC : C) and then this fold should work on Cond ? 0 : C as the root.
We actually already do the former canonicalization, but not in the case where C is constant.

Where do we do the canonicalization?

In foldSelectIntoOp(), with the current constant restriction at https://github.com/llvm/llvm-project/blob/8d163e5045073a5ac570225cc8e14cc9f6d72f09/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp#L507.

Thanks. So how does the following sound:

Patch foldSelectICmpAnd -> FoldSelectICmpPow2OrZero so we can handle:

define i8 @should_be_doable(i8 %x) {
  %xm1 = sub i8 %x, 1
  %xa = and i8 %xm1, %x
  %cmp = icmp eq i8 %xa, 16
  %s = select i1 %cmp, i8 64, i8 0
  ret i8 %s
}

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

At that point this function should be mostly redundant. There may be some edge cases where NeutralC is non-zero (e.g. mul) which
we will still miss (as we do currently), but I'm going to try to remove this function.
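
The `(ICmp eq Pow2OrZero, C_Pow2)` condition can be illustrated with a small i8 model. This is only a sketch under assumptions: `x & -x` is used here as a stand-in for "a value known to be a power of two or zero" (it is not the IR from the comment above), and the function names are made up for the example.

```cpp
#include <cassert>
#include <cstdint>

// Illustrative stand-in for a value known to be a power of two or zero:
// x & -x isolates the lowest set bit of x (and is 0 when x is 0).
inline uint8_t lowestSetBit(uint8_t x) {
  return static_cast<uint8_t>(x & (0u - x));
}

// Select form: (V == 16) ? 64 : 0
inline uint8_t selectPow2(uint8_t v) { return v == 16 ? 64 : 0; }

// Branch-free form, valid only when V is a power of two or zero:
// V == 16 is then equivalent to (V & 16) != 0, and 16 << 2 == 64.
inline uint8_t foldPow2(uint8_t v) {
  return static_cast<uint8_t>((v & 16) << 2);
}

// Check agreement for every V produced by the stand-in.
inline bool agreeOnPow2OrZero() {
  for (int x = 0; x < 256; ++x) {
    uint8_t v = lowestSetBit(static_cast<uint8_t>(x));
    if (selectPow2(v) != foldPow2(v))
      return false;
  }
  return true;
}
```

The precondition matters: for a value with several bits set, such as 48, the two forms disagree, so the fold is only sound when the compared value is known to have at most one bit set.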

In D148414#4290790, @goldstein.w.n wrote:

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

Ideally we would just remove the limitation entirely -- at the IR level, we would certainly prefer having a select on constants. However, this would likely need a backend undo transform, because that saves materializing the zero constant and likely allows folding the other constant into an immediate operand.

In D148414#4290852, @nikic wrote:

In D148414#4290790, @goldstein.w.n wrote:

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

Ideally we would just remove the limitation entirely -- at the IR level, we would certainly prefer having a select on constants. However, this would likely need a backend undo transform, because that saves materializing the zero constant and likely allows folding the other constant into an immediate operand.

Yeah that makes sense.

I'll get started on a series to do that.

In D148414#4290852, @nikic wrote:

In D148414#4290790, @goldstein.w.n wrote:

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

Ideally we would just remove the limitation entirely -- at the IR level, we would certainly prefer having a select on constants. However, this would likely need a backend undo transform, because that saves materializing the zero constant and likely allows folding the other constant into an immediate operand.

@nikic so the backend has code to undo this already. It's just not enabled for scalars. I enabled it on a branch for x86 here:
https://github.com/goldsteinn/llvm-project/pull/new/enable-select-backend
but there are a lot of regressions. Maybe simpler to just handle the known cases here?

In D148414#4291480, @goldstein.w.n wrote:

In D148414#4290852, @nikic wrote:

In D148414#4290790, @goldstein.w.n wrote:

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

Ideally we would just remove the limitation entirely -- at the IR level, we would certainly prefer having a select on constants. However, this would likely need a backend undo transform, because that saves materializing the zero constant and likely allows folding the other constant into an immediate operand.

@nikic so the backend has code to undo this already. Its just not enabled for scalars. I enabled it on a branch for x86 here:
https://github.com/goldsteinn/llvm-project/pull/new/enable-select-backend
but a lot of regressions. Maybe simpler to just handle the known cases here?

I think your patch handles a few more cases than we are interested in here. Per the code in getSelectFoldableOperands() we don't do this transform for div/rem, so I don't think we need the additional handling for those. More importantly, it looks like your patch will also do the undo transform for the case where the select has the identity as one element and a non-constant as the other. I think the diffs will look better if you limited the scalar case to two constant operands. The constant + non-constant case can probably also be beneficial, but the heuristics for that are less obvious.

In D148414#4291664, @nikic wrote:

In D148414#4291480, @goldstein.w.n wrote:

In D148414#4290852, @nikic wrote:

In D148414#4290790, @goldstein.w.n wrote:

Update foldSelectIntoOp to do constants iff isPow2(abs(TC - FC)) and Cond is either (ICmp eq Pow2OrZero, C_Pow2) or a SignTest

Ideally we would just remove the limitation entirely -- at the IR level, we would certainly prefer having a select on constants. However, this would likely need a backend undo transform, because that saves materializing the zero constant and likely allows folding the other constant into an immediate operand.

@nikic so the backend has code to undo this already. Its just not enabled for scalars. I enabled it on a branch for x86 here:
https://github.com/goldsteinn/llvm-project/pull/new/enable-select-backend
but a lot of regressions. Maybe simpler to just handle the known cases here?

I think your patch handles a few more cases than we are interested in here. Per the code in getSelectFoldableOperands() we don't do this transform for div/rem, so I don't think we need the additional handling for those. More importantly, it looks like your patch will also do the undo transform for the case where the select has the identity as one element and a non-constant as the other. I think the diffs will look better if you limited the scalar case to two constant operands. The constant + non-constant case can probably also be beneficial, but the heuristics for that are less obvious.

Posted a WIP, then saw this. Updating so that we can restrict it to the case where both arms are constant.

Fix to only use some binops

goldstein.w.n edited the summary of this revision. · Jun 27 2023, 2:48 PM

In D148414#4290296, @nikic wrote:

Why is this legal for all binops? Doesn't this transform require that the neutral element of the binop is zero? So it would work for or, xor or add, but not for mul or and for example?

I think this entire transform should probably be handled as a two step process: First, Cond ? X : BinOp(X, C) should become BinOp(X, Cond ? NeutralC : C) and then this fold should work on Cond ? 0 : C as the root. We actually already do the former canonicalization, but not in the case where C is constant. This will cleanly separate out the actual binop handling. Only disadvantage is that we won't be able to handle the case where the binop has multiple uses anymore, but that's a general issue with composable folds that we can ignore unless there is reason to believe that multi-use is motivating for the current handling.

Fixed, have proofs for all the binops I added.

Harbormaster completed remote builds in B241617: Diff 535150. · Jun 27 2023, 5:17 PM

nikic added inline comments. · Jun 29 2023, 7:38 AM

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
Line 722: Possibly make this a check that getBinOpIdentity() with AllowRHSConstant=true is zero instead? Or at least comment on what the criterion for valid binops is...
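
As a rough model of the criterion suggested in this inline comment, the sketch below uses a hand-written identity table for i8. It is not LLVM's `getBinOpIdentity()`: the `BinOp` enum, `binOpIdentity` and `hasZeroRHSIdentity` are hypothetical names, and the table only reflects the usual algebraic identities (an assumption about what the real helper returns).

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical opcode list for this sketch; not LLVM's enum.
enum class BinOp { Add, Or, Xor, Mul, And, Sub, Shl, LShr, AShr, UDiv, SDiv, URem };

// Model of an identity lookup for i8: returns C such that `X op C == X`,
// or std::nullopt when no such constant exists. When allowRHSConstant is
// true, operators that only have a right-hand identity are included.
inline std::optional<uint8_t> binOpIdentity(BinOp op, bool allowRHSConstant) {
  switch (op) {
  case BinOp::Add:
  case BinOp::Or:
  case BinOp::Xor:
    return 0;
  case BinOp::Mul:
    return 1;
  case BinOp::And:
    return 0xFF; // all ones
  default:
    break;
  }
  if (!allowRHSConstant)
    return std::nullopt;
  switch (op) {
  case BinOp::Sub:
  case BinOp::Shl:
  case BinOp::LShr:
  case BinOp::AShr:
    return 0;
  case BinOp::UDiv:
  case BinOp::SDiv:
    return 1;
  default:
    return std::nullopt; // e.g. urem has no identity
  }
}

// The suggested criterion: a right-hand identity exists and it is zero.
inline bool hasZeroRHSIdentity(BinOp op) {
  std::optional<uint8_t> id = binOpIdentity(op, /*allowRHSConstant=*/true);
  return id.has_value() && *id == 0;
}
```

Deriving the set of accepted operators from the identity lookup, rather than from a separate list, keeps the legality condition and the operator set from drifting apart.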

goldstein.w.n marked an inline comment as done. · Jul 9 2023, 7:09 PM

Use getBinOpIdentity instead of just having a table

goldstein.w.n retitled this revision from "[InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work for any binop" to "[InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work for more binops". · Jul 9 2023, 7:13 PM

goldstein.w.n edited the summary of this revision. (Show Details)

Rebase

Harbormaster completed remote builds in B244031: Diff 538489. · Jul 9 2023, 8:13 PM

LGTM

This revision is now accepted and ready to land. · Jul 10 2023, 3:34 AM

This revision was landed with ongoing or failed builds. · Aug 16 2023, 8:43 PM

Closed by commit rGd3402bc4460a: [InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work… (authored by goldstein.w.n).

This revision was automatically updated to reflect the committed changes.

goldstein.w.n added a commit: rGd3402bc4460a: [InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp` to work….

Since this change, our 2 stage 32 bit Armv7 builder has been failing to build the second stage: https://lab.llvm.org/buildbot/#/builders/182/builds/7193/steps/9/logs/stdio

FAILED: lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o 
/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang++ -DGTEST_HAS_RTTI=0 -D_DEBUG -D_FILE_OFFSET_BITS=64 -D_GLIBCXX_ASSERTIONS -D_GNU_SOURCE -D_LARGEFILE_SOURCE -D_LIBCPP_ENABLE_HARDENED_MODE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage2/lib/Target/X86 -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/lib/Target/X86 -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage2/include -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/include -mcpu=cortex-a15 -mfpu=vfpv3 -marm -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wc++98-compat-extra-semi -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -Wctad-maybe-unsupported -fdiagnostics-color -ffunction-sections -fdata-sections -O3 -DNDEBUG -fvisibility=hidden  -fno-exceptions -funwind-tables -fno-rtti -UNDEBUG -std=c++17 -MD -MT lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o -MF lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o.d -o lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o -c /home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/lib/Target/X86/X86ISelLowering.cpp
PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace, preprocessed source, and associated run script.
Stack dump:
0.	Program arguments: /home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang++ -DGTEST_HAS_RTTI=0 -D_DEBUG -D_FILE_OFFSET_BITS=64 -D_GLIBCXX_ASSERTIONS -D_GNU_SOURCE -D_LARGEFILE_SOURCE -D_LIBCPP_ENABLE_HARDENED_MODE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage2/lib/Target/X86 -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/lib/Target/X86 -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage2/include -I/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/include -mcpu=cortex-a15 -mfpu=vfpv3 -marm -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wc++98-compat-extra-semi -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -Wctad-maybe-unsupported -fdiagnostics-color -ffunction-sections -fdata-sections -O3 -DNDEBUG -fvisibility=hidden -fno-exceptions -funwind-tables -fno-rtti -UNDEBUG -std=c++17 -MD -MT lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o -MF lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o.d -o lib/Target/X86/CMakeFiles/LLVMX86CodeGen.dir/X86ISelLowering.cpp.o -c /home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/llvm/llvm/lib/Target/X86/X86ISelLowering.cpp
1.	<eof> parser at end of file
2.	Optimizer
 #0 0x03ae2214 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x3246214)
 #1 0x03adfb88 llvm::sys::RunSignalHandlers() (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x3243b88)
 #2 0x03ae1248 llvm::sys::CleanupOnSignal(unsigned int) (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x3245248)
 #3 0x03a4998c CrashRecoverySignalHandler(int) CrashRecoveryContext.cpp:0:0
 #4 0xf790e530 __default_sa_restorer /build/glibc-9MGTF6/glibc-2.31/signal/../sysdeps/unix/sysv/linux/arm/sigrestorer.S:67:0
 #5 0x033a2ad4 llvm::Constant::isNullValue() const (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x2b06ad4)
 #6 0x0376b91c foldSelectICmpAndBinOp(llvm::ICmpInst const*, llvm::Value*, llvm::Value*, llvm::IRBuilder<llvm::TargetFolder, llvm::IRBuilderCallbackInserter>&) InstCombineSelect.cpp:0:0
 #7 0x0376a104 llvm::InstCombinerImpl::foldSelectInstWithICmp(llvm::SelectInst&, llvm::ICmpInst*) (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x2ece104)
 #8 0x03770248 llvm::InstCombinerImpl::visitSelectInst(llvm::SelectInst&) (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x2ed4248)
 #9 0x036a5f74 llvm::InstCombinerImpl::run() (/home/tcwg-buildbot/worker/clang-armv7-vfpv3-2stage/stage1.install/bin/clang+++0x2e09f74)
<...>

You have many patches in that build and I'm picking out this one just due to foldSelectICmpAndBinOp.

It's not failing on the other 32 bit Arm builders because they are a few hours behind.

I'm not sure if I can cleanly revert this patch but before I look at that I'll get a reproducer for you.

I've confirmed just this commit is enough to hit the issue, and here's the reproducer:

X86ISelLowering-82e2a2.zip (8 MB)

DavidSpickett added a reverting change: rG2121e35ac237: Revert "[InstCombine] Expand `foldSelectICmpAndOr` -> `foldSelectICmpAndBinOp`…. · Aug 17 2023, 3:14 AM

chapuni added a subscriber: chapuni. · Aug 17 2023, 3:36 AM

bjope added a subscriber: bjope. · Aug 17 2023, 11:03 AM

In D148414#4594813, @DavidSpickett wrote:

I've confirmed just this commit is enough to hit the issue, and here's the reproducer:
X86ISelLowering-82e2a2.zip (8 MB)

Thank you for creating the repro and reverting this.
The issue was a missing nullptr check on the identity constant.
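
The shape of that bug can be sketched with a minimal stand-in. Nothing here is LLVM code: `ConstantModel`, `Opcode`, `lookupIdentity` and `identityIsZero` are hypothetical, and the only point is that a lookup which may return a null pointer has to be tested before a member such as `isNullValue()` is called on the result.

```cpp
#include <cassert>
#include <cstdint>

// Minimal stand-in for a constant; not LLVM's Constant class.
struct ConstantModel {
  uint8_t value;
  bool isNullValue() const { return value == 0; }
};

enum class Opcode { Add, Mul, URem };

// Hypothetical lookup that, like an identity query, may return nullptr
// when the operator has no identity constant.
inline const ConstantModel *lookupIdentity(Opcode op) {
  static const ConstantModel zero{0};
  static const ConstantModel one{1};
  switch (op) {
  case Opcode::Add:
    return &zero;
  case Opcode::Mul:
    return &one;
  case Opcode::URem:
    return nullptr;
  }
  return nullptr;
}

// Guarded check: test the pointer before calling a member on it.
// Dropping the `id != nullptr` test would dereference null for URem.
inline bool identityIsZero(Opcode op) {
  const ConstantModel *id = lookupIdentity(op);
  return id != nullptr && id->isNullValue();
}
```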

@nikic I've retested this. Going to repush in a few days unless anyone objects.

goldstein.w.n reopened this revision. · Aug 24 2023, 11:06 AM

This revision is now accepted and ready to land. · Aug 24 2023, 11:06 AM

Add nullptr check when checking identity constant

Can you please also add a test for that case (unless you already committed it)?

Add tests for reproducing prior bug

Thanks, LGTM.

Harbormaster completed remote builds in B254695: Diff 553225. · Aug 24 2023, 2:12 PM

This revision was landed with ongoing or failed builds. · Aug 24 2023, 5:43 PM

Closed by commit rG397a9cc4d875: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->… (authored by goldstein.w.n).

This revision was automatically updated to reflect the committed changes.

goldstein.w.n added a commit: rG397a9cc4d875: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->….

Seems this still miscompiles llvm-rc/ResourceFileWriter.cpp for targeting x86-64. Investigating.

FYI, similar miscompilation in llvm-rc; https://lab.llvm.org/buildbot/#/builders/124/builds/8260

In D148414#4616067, @chapuni wrote:

FYI, similar miscompilation in llvm-rc; https://lab.llvm.org/buildbot/#/builders/124/builds/8260

Investigating. If I don't see an obvious fix shortly, I will revert.

In D148414#4616052, @chapuni wrote:

Seems this still miscompiles llvm-rc/ResourceFileWriter.cpp for targeting x86-64. Investigating.

Are you certain it's this? I just checked and the IR for llvm-rc/ResourceFileWriter.cpp is unchanged with/without this commit.

goldstein.w.n added a reverting change: rG2acf00bd0ac2: Revert "Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->…. · Aug 25 2023, 12:22 AM

Re-reverted to be safe. Will look into this more tomorrow.

uabelho added a subscriber: uabelho. · Aug 25 2023, 2:01 AM

I am using ubuntu-20.04 (amd64) and stage2-clang uses libstdc++ for bootstrapping.

Ubuntu (aarch64) didn't complain. (I don't know why)

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
Line 757: Would it be right if `BinOp` is not commutative? I saw a malformed `lshr` in the miscompilation. `%spec.select = lshr i64 %sub78, %foo` was transformed to `%spec.select = lshr i64 %bar, %sub78`.
llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll
Line 1616: `XOR` (and `%xor`) is odd here, even if this checks that instructions are not transformed. (ditto in the next test)

goldstein.w.n added inline comments. · Aug 25 2023, 10:26 AM

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
Line 757: Yup, this is it. Didn't see that original code had inverted the operands.
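
Why the operand order matters for a non-commutative binop can be seen in a small i8 model of the `lshr` case. This is an illustrative sketch, not the patch: the function names are invented, and in the commuted variant shift amounts of 8 or more are simply treated as producing 0 so that the model stays well defined.

```cpp
#include <cassert>
#include <cstdint>

// Original form: select ((x & 1) == 0), y, (y >> 2)   (logical shift on i8)
inline uint8_t selectLShr(uint8_t x, uint8_t y) {
  return (x & 1) == 0 ? y : static_cast<uint8_t>(y >> 2);
}

// Shift amount derived from the tested bit: 0 or 2.
inline uint8_t shiftAmount(uint8_t x) {
  return static_cast<uint8_t>((x << 1) & 2);
}

// Correct fold: y stays on the left-hand side of the shift.
inline uint8_t foldKeepOrder(uint8_t x, uint8_t y) {
  return static_cast<uint8_t>(y >> shiftAmount(x));
}

// Buggy fold: operands commuted, which is only safe for commutative ops.
// Amounts >= 8 are modeled as yielding 0 (an assumption of this sketch).
inline uint8_t foldCommuted(uint8_t x, uint8_t y) {
  return y >= 8 ? 0 : static_cast<uint8_t>(shiftAmount(x) >> y);
}

// Exhaustively confirm that the order-preserving fold matches the select.
inline bool keepOrderMatchesSelect() {
  for (int x = 0; x < 256; ++x)
    for (int y = 0; y < 256; ++y)
      if (foldKeepOrder(static_cast<uint8_t>(x), static_cast<uint8_t>(y)) !=
          selectLShr(static_cast<uint8_t>(x), static_cast<uint8_t>(y)))
        return false;
  return true;
}
```

For `or`, swapping the operands is harmless, which is presumably why the original `or`-only code could get away with it; for shifts the swapped form computes a different value.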

goldstein.w.n reopened this revision. · Aug 25 2023, 10:40 AM

This revision is now accepted and ready to land. · Aug 25 2023, 10:40 AM

Adding tests for each binop seems like overkill, but I ran them offline and all the transforms verify with alive2:

; $> /home/noah/programs/opensource/llvm-dev/src/alive2/build/alive-tv (-smt-to=200000000)

----------------------------------------
define i8 @select_icmp_eq_and_1_0_add_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %badd = add i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %badd
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_add_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = add i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_add_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %badd = add i8 %y, 2
  %select = select i1 %cmp, i8 %badd, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_add_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = add i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_or_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bor = or i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bor
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_or_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = or i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_or_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bor = or i8 %y, 2
  %select = select i1 %cmp, i8 %bor, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_or_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = or i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_xor_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bxor = xor i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bxor
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_xor_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = xor i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_xor_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bxor = xor i8 %y, 2
  %select = select i1 %cmp, i8 %bxor, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_xor_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = xor i8 %1, %y
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_mul_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bmul = mul i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bmul
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_mul_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %select = shl i8 %y, %and
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_mul_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bmul = mul i8 %y, 2
  %select = select i1 %cmp, i8 %bmul, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_mul_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %select = shl i8 %y, %and
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_and_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %band = and i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %band
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_and_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %band = and i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %band
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_and_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %band = and i8 %y, 2
  %select = select i1 %cmp, i8 %band, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_and_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp.not = icmp eq i8 %and, 0
  %band = and i8 %y, 2
  %select = select i1 %cmp.not, i8 %y, i8 %band
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_sub_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bsub = sub i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bsub
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_sub_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bsub = add i8 %y, 254
  %select = select i1 %cmp, i8 %y, i8 %bsub
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_sub_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bsub = sub i8 %y, 2
  %select = select i1 %cmp, i8 %bsub, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_sub_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp.not = icmp eq i8 %and, 0
  %bsub = add i8 %y, 254
  %select = select i1 %cmp.not, i8 %y, i8 %bsub
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_shl_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bshl = shl i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bshl
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_shl_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = shl i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_shl_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bshl = shl i8 %y, 2
  %select = select i1 %cmp, i8 %bshl, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_shl_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = shl i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_lshr_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %blshr = lshr i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %blshr
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_lshr_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = lshr i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_lshr_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %blshr = lshr i8 %y, 2
  %select = select i1 %cmp, i8 %blshr, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_lshr_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = lshr i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_ashr_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bashr = ashr i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bashr
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_ashr_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = ashr i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_ashr_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bashr = ashr i8 %y, 2
  %select = select i1 %cmp, i8 %bashr, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_ashr_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = shl i8 %x, 1
  %1 = and i8 %and, 2
  %select = ashr i8 %y, %1
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_sdiv_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bsdiv = sdiv i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bsdiv
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_sdiv_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %bsdiv = sdiv i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %bsdiv
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_sdiv_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %bsdiv = sdiv i8 %y, 2
  %select = select i1 %cmp, i8 %bsdiv, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_sdiv_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %cmp.not = icmp eq i8 %and, 0
  %bsdiv = sdiv i8 %y, 2
  %select = select i1 %cmp.not, i8 %y, i8 %bsdiv
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_udiv_fv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp eq i8 %and, 0
  %budiv = udiv i8 %y, 2
  %select = select i1 %cmp, i8 %y, i8 %budiv
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_udiv_fv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %select = lshr i8 %y, %and
  ret i8 %select
}
Transformation seems to be correct!


----------------------------------------
define i8 @select_icmp_eq_and_1_0_udiv_tv(i8 %x, i8 %y) {
%0:
  %and = and i8 %x, 1
  %cmp = icmp ne i8 %and, 0
  %budiv = udiv i8 %y, 2
  %select = select i1 %cmp, i8 %budiv, i8 %y
  ret i8 %select
}
=>
define i8 @select_icmp_eq_and_1_0_udiv_tv(i8 %x, i8 %y) nofree willreturn memory(none) {
%0:
  %and = and i8 %x, 1
  %select = lshr i8 %y, %and
  ret i8 %select
}
Transformation seems to be correct!

Summary:
  22 correct transformations
  0 incorrect transformations
  0 failed-to-prove transformations
  0 Alive2 errors

Don't commute V and Y when creating final binop
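
To illustrate why operand order matters once the fold covers non-commutative binops, here is a minimal sketch on plain 8-bit integers. It is not the InstCombine code: `lshr8`, `selectForm`, `foldedForm`, `commutedForm`, and `matchesSelect` are hypothetical names made up for this illustration. The check only uses `y < 8`, because a shift by the bit width or more is poison in LLVM IR and this model does not represent that.

```cpp
#include <cassert>
#include <cstdint>

// Logical shift right on 8-bit values; callers keep amount < 8.
static uint8_t lshr8(uint8_t value, uint8_t amount) {
  return static_cast<uint8_t>(value >> amount);
}

// Source pattern: select ((x & 1) == 0), y, (lshr y, 1).
static uint8_t selectForm(uint8_t x, uint8_t y) {
  return (x & 1) == 0 ? y : lshr8(y, 1);
}

// Fold with Y kept as the LHS: lshr y, (x & 1).
static uint8_t foldedForm(uint8_t x, uint8_t y) {
  return lshr8(y, static_cast<uint8_t>(x & 1));
}

// Fold with the operands commuted: lshr (x & 1), y.
static uint8_t commutedForm(uint8_t x, uint8_t y) {
  return lshr8(static_cast<uint8_t>(x & 1), y);
}

// True if `form` agrees with selectForm for every x and every y < 8.
static bool matchesSelect(uint8_t (*form)(uint8_t, uint8_t)) {
  for (int xi = 0; xi < 256; ++xi) {
    for (int yi = 0; yi < 8; ++yi) {
      uint8_t x = static_cast<uint8_t>(xi);
      uint8_t y = static_cast<uint8_t>(yi);
      if (form(x, y) != selectForm(x, y))
        return false;
    }
  }
  return true;
}

// Hypothetical driver: the ordered fold matches, the commuted one does not.
static void checkOperandOrder() {
  assert(matchesSelect(foldedForm));
  assert(!matchesSelect(commutedForm));
}
```

For a commutative op such as `or` or `xor` the two orders are indistinguishable, which is presumably why the swapped operands went unnoticed while the fold only handled `or`.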

Harbormaster completed remote builds in B254933: Diff 553539.Aug 25 2023, 12:39 PM

Thanks for the fix.

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
655–660	Could you update expressions in the comment please?

Update comment

goldstein.w.n marked an inline comment as done.Aug 25 2023, 3:24 PM

Harbormaster completed remote builds in B255014: Diff 553646.Aug 25 2023, 6:40 PM

Okay to push this for attempt #3?

LGTM

This revision was landed with ongoing or failed builds.Sep 1 2023, 3:16 PM

Closed by commit rG54ec8bcaf85e: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->… (authored by goldstein.w.n).

This revision was automatically updated to reflect the committed changes.

goldstein.w.n added a commit: rG54ec8bcaf85e: Recommit "[InstCombine] Expand `foldSelectICmpAndOr` ->….

Revision Contents

Path                                                           Size
llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp          40 lines
llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll    102 lines

Diff 538489

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp

Show First 20 Lines • Show All 644 Lines • ▼ Show 20 Lines	if (match(TrueVal, m_LShr(m_Value(X), m_Value(Y))) &&
bool IsExact = Ashr->isExact() && cast<Instruction>(TrueVal)->isExact();		bool IsExact = Ashr->isExact() && cast<Instruction>(TrueVal)->isExact();
return Builder.CreateAShr(X, Y, IC->getName(), IsExact);		return Builder.CreateAShr(X, Y, IC->getName(), IsExact);
}		}

return nullptr;		return nullptr;
}		}

/// We want to turn:		/// We want to turn:
/// (select (icmp eq (and X, C1), 0), Y, (or Y, C2))		/// (select (icmp eq (and X, C1), 0), Y, (BinOp Y, C2))
/// into:		/// into:
/// (or (shl (and X, C1), C3), Y)		/// IF C2 u>= C1
		/// (BinOp (shl (and X, C1), C3), Y)
		/// ELSE
		/// (BinOp (lshr (and X, C1), C3), Y)
/// iff:		/// iff:
		/// 0 on the RHS is the identity value (i.e add, xor, shl, etc...)
		chapuni: Could you update expressions in the comment please?
/// C1 and C2 are both powers of 2		/// C1 and C2 are both powers of 2
/// where:		/// where:
		/// IF C2 u>= C1
/// C3 = Log(C2) - Log(C1)		/// C3 = Log(C2) - Log(C1)
		/// ELSE
		/// C3 = Log(C1) - Log(C2)
///		///
/// This transform handles cases where:		/// This transform handles cases where:
/// 1. The icmp predicate is inverted		/// 1. The icmp predicate is inverted
/// 2. The select operands are reversed		/// 2. The select operands are reversed
/// 3. The magnitude of C2 and C1 are flipped		/// 3. The magnitude of C2 and C1 are flipped
static Value *foldSelectICmpAndOr(const ICmpInst *IC, Value *TrueVal,		static Value *foldSelectICmpAndBinOp(const ICmpInst *IC, Value *TrueVal,
Value *FalseVal,		Value *FalseVal,
InstCombiner::BuilderTy &Builder) {		InstCombiner::BuilderTy &Builder) {
// Only handle integer compares. Also, if this is a vector select, we need a		// Only handle integer compares. Also, if this is a vector select, we need a
// vector compare.		// vector compare.
if (!TrueVal->getType()->isIntOrIntVectorTy() ||		if (!TrueVal->getType()->isIntOrIntVectorTy() ||
TrueVal->getType()->isVectorTy() != IC->getType()->isVectorTy())		TrueVal->getType()->isVectorTy() != IC->getType()->isVectorTy())
return nullptr;		return nullptr;

Value *CmpLHS = IC->getOperand(0);		Value *CmpLHS = IC->getOperand(0);
Value *CmpRHS = IC->getOperand(1);		Value *CmpRHS = IC->getOperand(1);

unsigned C1Log;		unsigned C1Log;
bool NeedAnd = false;		bool NeedAnd = false;
CmpInst::Predicate Pred = IC->getPredicate();		CmpInst::Predicate Pred = IC->getPredicate();
Show All 11 Lines	if (IC->isEquality()) {
if (!decomposeBitTestICmp(CmpLHS, CmpRHS, Pred, CmpLHS, C1) ||		if (!decomposeBitTestICmp(CmpLHS, CmpRHS, Pred, CmpLHS, C1) ||
!C1.isPowerOf2())		!C1.isPowerOf2())
return nullptr;		return nullptr;

C1Log = C1.logBase2();		C1Log = C1.logBase2();
NeedAnd = true;		NeedAnd = true;
}		}

Value *Or, *Y, *V = CmpLHS;		Value *Y, *V = CmpLHS;
		BinaryOperator *BinOp;
const APInt *C2;		const APInt *C2;
bool NeedXor;		bool NeedXor;
if (match(FalseVal, m_Or(m_Specific(TrueVal), m_Power2(C2)))) {		if (match(FalseVal, m_BinOp(m_Specific(TrueVal), m_Power2(C2)))) {
Y = TrueVal;		Y = TrueVal;
Or = FalseVal;		BinOp = cast<BinaryOperator>(FalseVal);
NeedXor = Pred == ICmpInst::ICMP_NE;		NeedXor = Pred == ICmpInst::ICMP_NE;
} else if (match(TrueVal, m_Or(m_Specific(FalseVal), m_Power2(C2)))) {		} else if (match(TrueVal, m_BinOp(m_Specific(FalseVal), m_Power2(C2)))) {
Y = FalseVal;		Y = FalseVal;
Or = TrueVal;		BinOp = cast<BinaryOperator>(TrueVal);
NeedXor = Pred == ICmpInst::ICMP_EQ;		NeedXor = Pred == ICmpInst::ICMP_EQ;
} else {		} else {
return nullptr;		return nullptr;
}		}

		// Check that 0 on RHS is identity value for this binop.
		nikic: Possibly make this a check that getBinOpIdentity() with AllowRHSConstant=true is zero instead? Or at least comment on what the criterion for valid binops is...
		if (!ConstantExpr::getBinOpIdentity(BinOp->getOpcode(), BinOp->getType(),
		/*AllowRHSConstant*/ true)
		->isNullValue())
		return nullptr;

unsigned C2Log = C2->logBase2();		unsigned C2Log = C2->logBase2();

bool NeedShift = C1Log != C2Log;		bool NeedShift = C1Log != C2Log;
bool NeedZExtTrunc = Y->getType()->getScalarSizeInBits() !=		bool NeedZExtTrunc = Y->getType()->getScalarSizeInBits() !=
V->getType()->getScalarSizeInBits();		V->getType()->getScalarSizeInBits();

// Make sure we don't create more instructions than we save.		// Make sure we don't create more instructions than we save.
if ((NeedShift + NeedXor + NeedZExtTrunc + NeedAnd) >		if ((NeedShift + NeedXor + NeedZExtTrunc + NeedAnd) >
(IC->hasOneUse() + Or->hasOneUse()))		(IC->hasOneUse() + BinOp->hasOneUse()))
return nullptr;		return nullptr;

if (NeedAnd) {		if (NeedAnd) {
// Insert the AND instruction on the input to the truncate.		// Insert the AND instruction on the input to the truncate.
APInt C1 = APInt::getOneBitSet(V->getType()->getScalarSizeInBits(), C1Log);		APInt C1 = APInt::getOneBitSet(V->getType()->getScalarSizeInBits(), C1Log);
V = Builder.CreateAnd(V, ConstantInt::get(V->getType(), C1));		V = Builder.CreateAnd(V, ConstantInt::get(V->getType(), C1));
}		}

if (C2Log > C1Log) {		if (C2Log > C1Log) {
V = Builder.CreateZExtOrTrunc(V, Y->getType());		V = Builder.CreateZExtOrTrunc(V, Y->getType());
V = Builder.CreateShl(V, C2Log - C1Log);		V = Builder.CreateShl(V, C2Log - C1Log);
} else if (C1Log > C2Log) {		} else if (C1Log > C2Log) {
V = Builder.CreateLShr(V, C1Log - C2Log);		V = Builder.CreateLShr(V, C1Log - C2Log);
V = Builder.CreateZExtOrTrunc(V, Y->getType());		V = Builder.CreateZExtOrTrunc(V, Y->getType());
} else		} else
V = Builder.CreateZExtOrTrunc(V, Y->getType());		V = Builder.CreateZExtOrTrunc(V, Y->getType());

if (NeedXor)		if (NeedXor)
V = Builder.CreateXor(V, *C2);		V = Builder.CreateXor(V, *C2);

return Builder.CreateOr(V, Y);		return Builder.CreateBinOp(BinOp->getOpcode(), V, Y);
		chapuni: Would it be right if `BinOp` is not commutative? I saw a malformed `lshr` in the miscompilation. `%spec.select = lshr i64 %sub78, %foo` was transformed to `%spec.select = lshr i64 %bar, %sub78`.
		goldstein.w.n (author): Yup, this is it. Didn't see that original code had inverted the operands.
}		}

/// Canonicalize a set or clear of a masked set of constant bits to		/// Canonicalize a set or clear of a masked set of constant bits to
/// select-of-constants form.		/// select-of-constants form.
static Instruction *foldSetClearBits(SelectInst &Sel,		static Instruction *foldSetClearBits(SelectInst &Sel,
InstCombiner::BuilderTy &Builder) {		InstCombiner::BuilderTy &Builder) {
Value *Cond = Sel.getCondition();		Value *Cond = Sel.getCondition();
Value *T = Sel.getTrueValue();		Value *T = Sel.getTrueValue();
▲ Show 20 Lines • Show All 983 Lines • ▼ Show 20 Lines	if (Instruction *V =
return V;		return V;

if (Instruction *V = foldSelectCtlzToCttz(ICI, TrueVal, FalseVal, Builder))		if (Instruction *V = foldSelectCtlzToCttz(ICI, TrueVal, FalseVal, Builder))
return V;		return V;

if (Instruction *V = foldSelectZeroOrOnes(ICI, TrueVal, FalseVal, Builder))		if (Instruction *V = foldSelectZeroOrOnes(ICI, TrueVal, FalseVal, Builder))
return V;		return V;

if (Value *V = foldSelectICmpAndOr(ICI, TrueVal, FalseVal, Builder))		if (Value *V = foldSelectICmpAndBinOp(ICI, TrueVal, FalseVal, Builder))
return replaceInstUsesWith(SI, V);		return replaceInstUsesWith(SI, V);

if (Value *V = foldSelectICmpLshrAshr(ICI, TrueVal, FalseVal, Builder))		if (Value *V = foldSelectICmpLshrAshr(ICI, TrueVal, FalseVal, Builder))
return replaceInstUsesWith(SI, V);		return replaceInstUsesWith(SI, V);

if (Value *V = foldSelectCttzCtlz(ICI, TrueVal, FalseVal, Builder))		if (Value *V = foldSelectCttzCtlz(ICI, TrueVal, FalseVal, Builder))
return replaceInstUsesWith(SI, V);		return replaceInstUsesWith(SI, V);

▲ Show 20 Lines • Show All 1,900 Lines • Show Last 20 Lines
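
The fold described in the doc comment above can be checked exhaustively on 8-bit values. The following is a minimal sketch, not the InstCombine implementation: `BinOp8`, the `op*` helpers, `log2OfPow2`, `movedBit`, and `foldHolds` are hypothetical names, and the and-then-shift order simply follows the order of the `CreateAnd`/`CreateShl`/`CreateLShr` calls in the function (later InstCombine folds may reorder them, as the updated CHECK lines suggest). It assumes C1 and C2 are powers of two and keeps Y on the LHS of the binop.

```cpp
#include <cassert>
#include <cstdint>

using BinOp8 = uint8_t (*)(uint8_t, uint8_t);

static uint8_t opOr(uint8_t a, uint8_t b) { return static_cast<uint8_t>(a | b); }
static uint8_t opXor(uint8_t a, uint8_t b) { return static_cast<uint8_t>(a ^ b); }
static uint8_t opAdd(uint8_t a, uint8_t b) { return static_cast<uint8_t>(a + b); }
static uint8_t opAnd(uint8_t a, uint8_t b) { return static_cast<uint8_t>(a & b); }
static uint8_t opMul(uint8_t a, uint8_t b) { return static_cast<uint8_t>(a * b); }

// Log base 2 of a power of two.
static unsigned log2OfPow2(uint8_t p) {
  unsigned n = 0;
  while (p >>= 1)
    ++n;
  return n;
}

// Isolate bit C1 of x, move it to the position of C2, and optionally flip it.
static uint8_t movedBit(uint8_t x, uint8_t c1, uint8_t c2, bool needXor) {
  unsigned c1Log = log2OfPow2(c1), c2Log = log2OfPow2(c2);
  unsigned v = x & c1;
  if (c2Log > c1Log)
    v <<= (c2Log - c1Log);
  else if (c1Log > c2Log)
    v >>= (c1Log - c2Log);
  if (needXor)
    v ^= c2;
  return static_cast<uint8_t>(v);
}

// Compare the select against op(y, movedBit(...)) for all x and y.
// opOnClearArm: the binop sits on the arm taken when (x & c1) == 0,
// which is the case that needs the extra xor with c2.
static bool foldHolds(BinOp8 op, uint8_t c1, uint8_t c2, bool opOnClearArm) {
  for (int xi = 0; xi < 256; ++xi) {
    for (int yi = 0; yi < 256; ++yi) {
      uint8_t x = static_cast<uint8_t>(xi);
      uint8_t y = static_cast<uint8_t>(yi);
      bool bitClear = (x & c1) == 0;
      uint8_t selected = (bitClear == opOnClearArm) ? op(y, c2) : y;
      uint8_t folded = op(y, movedBit(x, c1, c2, opOnClearArm));
      if (selected != folded)
        return false;
    }
  }
  return true;
}
```

With these definitions, `foldHolds(opXor, 1, 2, false)` mirrors the shape of `select_icmp_eq_and_1_0_xor_2` and `foldHolds(opXor, 32, 8, false)` that of `select_icmp_eq_and_32_0_xor_8`. `opAnd` and `opMul` fail the check because 0 on the RHS is not their identity, which is the condition the `getBinOpIdentity` guard enforces.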

llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll

Show All 30 Lines	;
%cmp = icmp eq <2 x i32> %and, zeroinitializer		%cmp = icmp eq <2 x i32> %and, zeroinitializer
%or = or <2 x i32> %y, <i32 2, i32 2>		%or = or <2 x i32> %y, <i32 2, i32 2>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_eq_and_1_0_xor_2(i32 %x, i32 %y) {		define i32 @select_icmp_eq_and_1_0_xor_2(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_eq_and_1_0_xor_2(		; CHECK-LABEL: @select_icmp_eq_and_1_0_xor_2(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 1		; CHECK-NEXT: [[AND:%.*]] = shl i32 [[X:%.*]], 1
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 2
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 1		%and = and i32 %x, 1
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 38 Lines	;
%cmp = icmp eq <2 x i32> %and, zeroinitializer		%cmp = icmp eq <2 x i32> %and, zeroinitializer
%or = or <2 x i32> %y, <i32 8, i32 8>		%or = or <2 x i32> %y, <i32 8, i32 8>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_eq_and_32_0_xor_8(i32 %x, i32 %y) {		define i32 @select_icmp_eq_and_32_0_xor_8(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_eq_and_32_0_xor_8(		; CHECK-LABEL: @select_icmp_eq_and_32_0_xor_8(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 32		; CHECK-NEXT: [[AND:%.*]] = lshr i32 [[X:%.*]], 2
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 8
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 8		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 32		%and = and i32 %x, 32
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 8		%xor = xor i32 %y, 8
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 39 Lines	;
%or = or <2 x i32> %y, <i32 4096, i32 4096>		%or = or <2 x i32> %y, <i32 4096, i32 4096>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_ne_0_and_4096_xor_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_xor_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_4096(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], [[Y:%.*]]
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 37 Lines	;
%or = or <2 x i32> %y, <i32 4096, i32 4096>		%or = or <2 x i32> %y, <i32 4096, i32 4096>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_eq_and_4096_0_xor_4096(i32 %x, i32 %y) {		define i32 @select_icmp_eq_and_4096_0_xor_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_eq_and_4096_0_xor_4096(		; CHECK-LABEL: @select_icmp_eq_and_4096_0_xor_4096(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[AND]], [[Y:%.*]]
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 39 Lines	;
%or = or <2 x i32> %y, <i32 1, i32 1>		%or = or <2 x i32> %y, <i32 1, i32 1>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_eq_0_and_1_xor_1(i64 %x, i32 %y) {		define i32 @select_icmp_eq_0_and_1_xor_1(i64 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_eq_0_and_1_xor_1(		; CHECK-LABEL: @select_icmp_eq_0_and_1_xor_1(
; CHECK-NEXT: [[TMP1:%.*]] = trunc i64 [[X:%.*]] to i32		; CHECK-NEXT: [[TMP1:%.*]] = trunc i64 [[X:%.*]] to i32
; CHECK-NEXT: [[XOR:%.*]] = and i32 [[TMP1]], 1		; CHECK-NEXT: [[TMP2:%.*]] = and i32 [[TMP1]], 1
; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[XOR]], [[Y:%.*]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP2]], [[Y:%.*]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i64 %x, 1		%and = and i64 %x, 1
%cmp = icmp eq i64 %and, 0		%cmp = icmp eq i64 %and, 0
%xor = xor i32 %y, 1		%xor = xor i32 %y, 1
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 25 Lines	;
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 32		%or = or i32 %y, 32
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_4096_xor_32(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_xor_32(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_32(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_32(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = lshr i32 [[X:%.*]], 7
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 32
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 32		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP2]], 32
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 32		%xor = xor i32 %y, 32
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 40 Lines	;
%cmp = icmp ne <2 x i32> zeroinitializer, %and		%cmp = icmp ne <2 x i32> zeroinitializer, %and
%or = or <2 x i32> %y, <i32 4096, i32 4096>		%or = or <2 x i32> %y, <i32 4096, i32 4096>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_ne_0_and_32_xor_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_32_xor_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_32_xor_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_32_xor_4096(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 32		; CHECK-NEXT: [[AND:%.*]] = shl i32 [[X:%.*]], 7
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 4096
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP2]], 4096
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 32		%and = and i32 %x, 32
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
▲ Show 20 Lines • Show All 142 Lines • ▼ Show 20 Lines	;
%xor = xor i32 %x, 8		%xor = xor i32 %x, 8
%xor.x = select i1 %cmp, i32 %xor, i32 %x		%xor.x = select i1 %cmp, i32 %xor, i32 %x
ret i32 %xor.x		ret i32 %xor.x
}		}

define i64 @select_icmp_x_and_8_eq_0_y_xor_8(i32 %x, i64 %y) {		define i64 @select_icmp_x_and_8_eq_0_y_xor_8(i32 %x, i64 %y) {
; CHECK-LABEL: @select_icmp_x_and_8_eq_0_y_xor_8(		; CHECK-LABEL: @select_icmp_x_and_8_eq_0_y_xor_8(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 8		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 8
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = zext i32 [[AND]] to i64
; CHECK-NEXT: [[XOR:%.*]] = xor i64 [[Y:%.*]], 8		; CHECK-NEXT: [[Y_XOR:%.*]] = xor i64 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[Y_XOR:%.*]] = select i1 [[CMP]], i64 [[Y]], i64 [[XOR]]
; CHECK-NEXT: ret i64 [[Y_XOR]]		; CHECK-NEXT: ret i64 [[Y_XOR]]
;		;
%and = and i32 %x, 8		%and = and i32 %x, 8
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i64 %y, 8		%xor = xor i64 %y, 8
%y.xor = select i1 %cmp, i64 %y, i64 %xor		%y.xor = select i1 %cmp, i64 %y, i64 %xor
ret i64 %y.xor		ret i64 %y.xor
}		}

define i64 @select_icmp_x_and_8_ne_0_y_xor_8(i32 %x, i64 %y) {		define i64 @select_icmp_x_and_8_ne_0_y_xor_8(i32 %x, i64 %y) {
; CHECK-LABEL: @select_icmp_x_and_8_ne_0_y_xor_8(		; CHECK-LABEL: @select_icmp_x_and_8_ne_0_y_xor_8(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 8		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 8
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], 8
; CHECK-NEXT: [[XOR:%.*]] = xor i64 [[Y:%.*]], 8		; CHECK-NEXT: [[TMP2:%.*]] = zext i32 [[TMP1]] to i64
; CHECK-NEXT: [[XOR_Y:%.*]] = select i1 [[CMP]], i64 [[XOR]], i64 [[Y]]		; CHECK-NEXT: [[XOR_Y:%.*]] = xor i64 [[TMP2]], [[Y:%.*]]
; CHECK-NEXT: ret i64 [[XOR_Y]]		; CHECK-NEXT: ret i64 [[XOR_Y]]
;		;
%and = and i32 %x, 8		%and = and i32 %x, 8
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i64 %y, 8		%xor = xor i64 %y, 8
%xor.y = select i1 %cmp, i64 %xor, i64 %y		%xor.y = select i1 %cmp, i64 %xor, i64 %y
ret i64 %xor.y		ret i64 %xor.y
}		}
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	;
%cmp = icmp eq <2 x i32> %and, zeroinitializer		%cmp = icmp eq <2 x i32> %and, zeroinitializer
%or = or <2 x i32> %y, <i32 2, i32 2>		%or = or <2 x i32> %y, <i32 2, i32 2>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @test68_xor(i32 %x, i32 %y) {		define i32 @test68_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @test68_xor(		; CHECK-LABEL: @test68_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 128		; CHECK-NEXT: [[AND:%.*]] = lshr i32 [[X:%.*]], 6
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 2
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 128		%and = and i32 %x, 128
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
Show All 40 Lines	;
%cmp = icmp ne <2 x i32> %and, zeroinitializer		%cmp = icmp ne <2 x i32> %and, zeroinitializer
%or = or <2 x i32> %y, <i32 2, i32 2>		%or = or <2 x i32> %y, <i32 2, i32 2>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @test69_xor(i32 %x, i32 %y) {		define i32 @test69_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @test69_xor(		; CHECK-LABEL: @test69_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 128		; CHECK-NEXT: [[AND:%.*]] = lshr i32 [[X:%.*]], 6
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 2
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2		; CHECK-NEXT: [[TMP2:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP2]], 2
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 128		%and = and i32 %x, 128
%cmp = icmp ne i32 %and, 0		%cmp = icmp ne i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	;
%or = or i32 %y, 2		%or = or i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%res = mul i32 %select, %or ; to bump up use count of the Or		%res = mul i32 %select, %or ; to bump up use count of the Or
ret i32 %res		ret i32 %res
}		}

define i32 @shift_no_xor_multiuse_xor(i32 %x, i32 %y) {		define i32 @shift_no_xor_multiuse_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @shift_no_xor_multiuse_xor(		; CHECK-LABEL: @shift_no_xor_multiuse_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 1
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2		; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[AND:%.*]] = shl i32 [[X:%.*]], 1
		; CHECK-NEXT: [[TMP1:%.*]] = and i32 [[AND]], 2
		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 1		%and = and i32 %x, 1
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%res = mul i32 %select, %xor ; to bump up use count of the Xor		%res = mul i32 %select, %xor ; to bump up use count of the Xor
Show All 31 Lines	;
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%res = mul i32 %select, %or ; to bump up use count of the Or		%res = mul i32 %select, %or ; to bump up use count of the Or
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_no_xor_multiuse_xor(i32 %x, i32 %y) {		define i32 @no_shift_no_xor_multiuse_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @no_shift_no_xor_multiuse_xor(		; CHECK-LABEL: @no_shift_no_xor_multiuse_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[AND]], [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%res = mul i32 %select, %xor ; to bump up use count of the Xor		%res = mul i32 %select, %xor ; to bump up use count of the Xor
Show All 32 Lines	;
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%res = mul i32 %select, %or ; to bump up use count of the Or		%res = mul i32 %select, %or ; to bump up use count of the Or
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_xor(i32 %x, i32 %y) {		define i32 @no_shift_xor_multiuse_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @no_shift_xor_multiuse_xor(		; CHECK-LABEL: @no_shift_xor_multiuse_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], [[Y]]
		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], 4096
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%res = mul i32 %select, %xor ; to bump up use count of the Xor		%res = mul i32 %select, %xor ; to bump up use count of the Xor
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	;
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @shift_no_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_no_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_no_xor_multiuse_cmp_with_xor(		; CHECK-LABEL: @shift_no_xor_multiuse_cmp_with_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 1		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 1
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 2		; CHECK-NEXT: [[TMP1:%.*]] = shl nuw nsw i32 [[AND]], 1
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 1		%and = and i32 %x, 1
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
Show All 38 Lines	;
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_no_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_no_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_no_xor_multiuse_cmp_with_xor(		; CHECK-LABEL: @no_shift_no_xor_multiuse_cmp_with_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[AND]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]
; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
Show All 39 Lines	;
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_xor(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], [[Y:%.*]]
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[TMP1]], 4096
; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP_NOT]], i32 [[W:%.*]], i32 [[Z:%.*]]		; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP_NOT]], i32 [[W:%.*]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
[... 162 lines not shown ...]
ret i32 %res2		ret i32 %res2
}		}

define i32 @no_shift_no_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_no_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_no_xor_multiuse_cmp_xor(		; CHECK-LABEL: @no_shift_no_xor_multiuse_cmp_xor(
; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096		; CHECK-NEXT: [[AND:%.*]] = and i32 [[X:%.*]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096		; CHECK-NEXT: [[XOR:%.*]] = xor i32 [[Y:%.*]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = xor i32 [[AND]], [[Y]]
; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.*]] = select i1 [[CMP]], i32 [[Z:%.*]], i32 [[W:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp eq i32 %and, 0		%cmp = icmp eq i32 %and, 0
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
[... 284 lines not shown ...]
%cmp = icmp eq i8 %xx, 0		%cmp = icmp eq i8 %xx, 0
%z = xor i64 %y, -9223372036854775808		%z = xor i64 %y, -9223372036854775808
%r = select i1 %cmp, i64 %z, i64 %y		%r = select i1 %cmp, i64 %z, i64 %y
ret i64 %r		ret i64 %r
}		}

define i64 @xor_i8_to_i64_shl_save_and_ne(i8 %x, i64 %y) {		define i64 @xor_i8_to_i64_shl_save_and_ne(i8 %x, i64 %y) {
; CHECK-LABEL: @xor_i8_to_i64_shl_save_and_ne(		; CHECK-LABEL: @xor_i8_to_i64_shl_save_and_ne(
; CHECK-NEXT: [[XX:%.*]] = and i8 [[X:%.*]], 1		; CHECK-NEXT: [[TMP1:%.*]] = zext i8 [[X:%.*]] to i64
; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i8 [[XX]], 0		; CHECK-NEXT: [[TMP2:%.*]] = shl i64 [[TMP1]], 63
; CHECK-NEXT: [[Z:%.*]] = xor i64 [[Y:%.*]], -9223372036854775808		; CHECK-NEXT: [[R:%.*]] = xor i64 [[TMP2]], [[Y:%.*]]
; CHECK-NEXT: [[R:%.*]] = select i1 [[CMP_NOT]], i64 [[Y]], i64 [[Z]]
; CHECK-NEXT: ret i64 [[R]]		; CHECK-NEXT: ret i64 [[R]]
;		;
%xx = and i8 %x, 1		%xx = and i8 %x, 1
%cmp = icmp ne i8 %xx, 0		%cmp = icmp ne i8 %xx, 0
%z = xor i64 %y, -9223372036854775808		%z = xor i64 %y, -9223372036854775808
%r = select i1 %cmp, i64 %z, i64 %y		%r = select i1 %cmp, i64 %z, i64 %y
ret i64 %r		ret i64 %r
}		}
		chapuni: `XOR` (and `%xor`) is odd here, even if this checks that instructions are not transformed. (ditto in the next test)

Revision Contents

Diff 538489

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp

llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll
