⚙ D86578 [TargetLowering] Combine known bits for icmp in SimplifySetCC (PR41182)

darcangelo.matthew created this revision.Aug 25 2020, 3:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 25 2020, 3:51 PM

Herald added subscribers: llvm-commits, steven.zhang, hiraditya. · View Herald Transcript

darcangelo.matthew requested review of this revision.Aug 25 2020, 3:51 PM

Removed the Signed argument from getMinValue

Harbormaster completed remote builds in B69517: Diff 287792.Aug 25 2020, 4:04 PM

You need to upload the diff between the master branch and your branch, not the last commit in your branch.

RKSimon added reviewers: lebedev.ri, nikic.Aug 26 2020, 2:03 AM

RKSimon retitled this revision from [TargetLowering] Combine known bits for icmp in SimplifySetCC - https://bugs.llvm.org/ show_bug.cgi?id=41182 to [TargetLowering] Combine known bits for icmp in SimplifySetCC (PR41182).

Removed mistaken Signed argument

What is the motivation for this patch? The added tests should already get folded before reaching the backend (by InstCombine). We should only add folds to DAGCombine if the folded pattern is created by legalization. What is the original context for this change?

In D86578#2239392, @nikic wrote:

What is the motivation for this patch? The added tests should already get folded before reaching the backend (by InstCombine). We should only add folds to DAGCombine if the folded pattern is created by legalization. What is the original context for this change?

This was an unassigned enhancement - https://bugs.llvm.org/show_bug.cgi?id=41182 - that I was interested in and wanted to have a go at. Unfortunately that's all I have for context on my end

RKSimon added a reviewer: craig.topper.Aug 27 2020, 2:06 AM

craig.topper added inline comments.Aug 27 2020, 10:23 PM

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
14059 ↗	(On Diff #288003)	I'm not sure if we can really do this from SelectionDAG. Is there anywhere else that does this in tree?
llvm/test/CodeGen/X86/test-shrink.ll
574 ↗	(On Diff #288003)	Can we pre-commit the IR changes here?

@RKSimon Meant to say this before - Thank you for putting up additional reviewers Simon, I appreciate your help with this

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
14059 ↗	(On Diff #288003)	I'm not sure if we can really do this from SelectionDAG. Is there anywhere else that does this in tree? I found the UnreachableMachineBlockElim pass which may be able to be enhanced for this, but I'm still looking
llvm/test/CodeGen/X86/test-shrink.ll
574 ↗	(On Diff #288003)	Thank you for taking a look at the changes. Can we pre-commit the IR changes here? Sorry, would you be able to elaborate what you mean here? Do you mean separate these out into another pre-commit review?

Herald added a subscriber: danielkiss. · View Herald TranscriptAug 29 2020, 8:37 AM

craig.topper mentioned this in rG81733fd27657: [X86] Pre-commit the test-shrink.ll changes from D86578..Aug 30 2020, 12:25 PM

craig.topper added inline comments.Aug 30 2020, 1:51 PM

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
14059 ↗	(On Diff #288003)	Sorry, I meant is there anywhere else that does something like this while SelectionDAG exists?
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
3998 ↗	(On Diff #288003)	Is this just a formatting change?
llvm/test/CodeGen/X86/test-shrink.ll
574 ↗	(On Diff #288003)	I went ahead and modified the tests in trunk. Can you rebase this patch?

darcangelo.matthew added inline comments.Aug 30 2020, 8:20 PM

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
14059 ↗	(On Diff #288003)	Thanks for elaborating, sorry for the confusion. I don't believe there is another place that does something like this while SelectionDAG exists. It seems like BranchFolding might be a good place for me to handle this? For some additional context, without this change if we have define i32 @simple(i32 %x) nounwind { %1 = and i32 %x, 2147483648 %2 = icmp sle i32 %1, 0 br i1 %2, label %yes, label %no yes: ; will always get here ret i32 1 no: ret i32 2 } we get ## %bb.0: xorl %eax, %eax testb %al, %al jne LBB0_2 ## %bb.1: ## %yes movl $1, %eax retq LBB0_2: ## %no movl $2, %eax retq so jne will never be taken instead of just removing the branch completely
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
3998 ↗	(On Diff #288003)	Yes this is just formatting, I must have messed up the merge when I rebased and missed this. Sorry about that, should be fixed now
llvm/test/CodeGen/X86/test-shrink.ll
574 ↗	(On Diff #288003)	I went ahead and modified the tests in trunk. Can you rebase this patch? Sure can, thanks for updating them. Sorry for the confusion

rebase

Harbormaster completed remote builds in B70060: Diff 288883.Aug 30 2020, 9:01 PM

@darcangelo.matthew are you still looking at this?

llvm/include/llvm/Support/KnownBits.h
145	Please add unit test coverage in llvm\unittests\Support\KnownBitsTest.cpp

Herald added subscribers: dexonsmith, ecnelises, pengfei. · View Herald TranscriptDec 15 2020, 9:51 AM

RKSimon mentioned this in rG6895581fd2c1: [Support] Add KnownBits::getSignedMinValue/getSignedMaxValue helpers..Dec 24 2020, 6:10 AM

RKSimon mentioned this in rG23b41986527a: [Support] Add KnownBits::icmp helpers..Jan 4 2021, 5:08 AM

RKSimon mentioned this in rG37ac4f865fba: [Hexagon] Regenerate zext-v4i1.ll tests.Jan 6 2021, 5:45 AM

@darcangelo.matthew Are you in a position to continue with this, if not I would like to commandeer and complete this please?

In D86578#3243273, @RKSimon wrote:

@darcangelo.matthew Are you in a position to continue with this, if not I would like to commandeer and complete this please?

I’m not able to continue with this. You can go ahead and complete it. Sorry I missed your previous comment as well.

RKSimon commandeered this revision.Jan 17 2022, 4:11 AM

RKSimon edited reviewers, added: darcangelo.matthew; removed: RKSimon.

Updated the old patch to use the KnownBits comparison helpers that have been added since the original patch was created and add all test changes for review.

I've removed the branch removal code for now - which means we have a number of "constant" conditional branches that we currently have no way to fold to a jump, whatever we go for here handling this in the DAG would be rather ugly.

I'm open to suggestions for any individual test changes - some are definite improvements, some just need some tweaking and others probably affect the original regression the test was reduced from.

Herald added subscribers: kerbowa, javed.absar, nhaehnle, jvesely. · View Herald TranscriptJan 17 2022, 9:52 AM

RKSimon added a subscriber: foad.Jan 17 2022, 9:59 AM

RKSimon added inline comments.

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	@foad I guess there's no good way around this if on wave32 targets - unless we duplicate the test with a 16 divergent variant?
llvm/test/CodeGen/X86/avx512-mask-op.ll
623 ↗	(On Diff #400593)	test7 (and test6) don't look great (and the codegen was already redundant) - I wonder if we should at least have them return something?

Harbormaster completed remote builds in B143835: Diff 400593.Jan 17 2022, 10:46 AM

foad added a subscriber: critson.Jan 18 2022, 2:28 AM

foad added inline comments.

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	Let's just replace the "cmp uge 32" with "cmp uge 16", and then your patch won't affect this test at all. (@critson is that OK?) Would you like me to do that in a separate commit?

RKSimon added inline comments.Jan 18 2022, 9:32 AM

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	Not sure if the commit question was to me or @critson - but if you can alter the test and then ping this ticket I will rebase the patch.

critson added inline comments.Jan 18 2022, 9:33 PM

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	@foad your suggested test change seems fine to me

foad mentioned this in rG7af959673e67: [AMDGPU] Tweak some compares in wave32.ll test.Jan 19 2022, 12:25 AM

foad added inline comments.Jan 19 2022, 12:26 AM

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	OK, please rebase on 7af959673e67256c5ba711070aca509dce794350.

RKSimon added inline comments.Jan 19 2022, 4:10 AM

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	Thanks - looks like we have an equivalent issue in wqm.ll

rebase

xbolva00 added a subscriber: xbolva00.Jan 19 2022, 4:21 AM

xbolva00 added inline comments.

llvm/test/CodeGen/X86/fold-rmw-ops.ll
1830 ↗	(On Diff #401168)	Regression?

RKSimon added inline comments.Jan 19 2022, 4:35 AM

llvm/test/CodeGen/X86/fold-rmw-ops.ll
1830 ↗	(On Diff #401168)	I'm really at a loss as to what is best to do for these cases where we've created a constant conditional branch - the middle-end would have nuked most of these cases already (which probably means doing something in CGP is a waste of time as well) and I'm not certain how often they will be generated in backend. We can tweak the tests in some cases I suppose, which is a cheat but probably the most realistic approach. Opinions welcome!

foad mentioned this in rG0bc14a0a989f: [AMDGPU] Tweak some compares in wqm.ll test.Jan 19 2022, 4:51 AM

foad added inline comments.Jan 19 2022, 4:52 AM

llvm/test/CodeGen/AMDGPU/wave32.ll
720 ↗	(On Diff #400593)	Fixed in 0bc14a0a989fe4268b899100aafc07e3d94decbb.

Harbormaster completed remote builds in B144244: Diff 401168.Jan 19 2022, 5:04 AM

rebase

Harbormaster completed remote builds in B144258: Diff 401191.Jan 19 2022, 6:23 AM

RKSimon mentioned this in rGa4f202549208: [X86] Regenerate avx512-mask-op.ll.Jan 23 2022, 3:34 AM

rebase

Herald added subscribers: frasercrmck, luismarques, apazos and 19 others. · View Herald TranscriptJan 23 2022, 3:49 AM

To repeat my previous question, because it's not obvious from tests: Which legalizations does this improve? That is, for input IR that has already gone through the opt -O2 pipeline, which cases does this improve?

Harbormaster completed remote builds in B145090: Diff 402311.Jan 23 2022, 4:29 AM

In D86578#3264329, @nikic wrote:

To repeat my previous question, because it's not obvious from tests: Which legalizations does this improve? That is, for input IR that has already gone through the opt -O2 pipeline, which cases does this improve?

Its quite a long time ago when I first encountered this but IIRC one of the places I hit this was in the sdiv/udiv by constant expansions

dexonsmith removed a subscriber: dexonsmith.Jan 24 2022, 6:45 PM

Herald added a subscriber: • pcwang-thead. · View Herald TranscriptJan 24 2022, 6:45 PM

RKSimon planned changes to this revision.Feb 8 2022, 10:02 AM

This review may be stuck/dead, consider abandoning if no longer relevant.
Removing myself as reviewer in attempt to clean dashboard.

Herald added a project: Restricted Project. · View Herald TranscriptJan 12 2023, 4:57 PM

Herald added a subscriber: StephenFan. · View Herald Transcript

foad added inline comments.Jan 13 2023, 2:19 AM

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
3640–3642 ↗	(On Diff #402311)	Just put `Result = KnownBits::whatever(LHSKnown, RHSKnown)` inside the switch, and sink the `if (Result) return ...` out?

RKSimon mentioned this in D149383: [SelectionDAG][WIP] Add support for evaluating SetCC based on knownbits.Apr 28 2023, 7:16 AM

Abandoning as @goldstein.w.n has taken on something similar with D149383

Herald added a subscriber: luke. · View Herald TranscriptApr 29 2023, 10:12 AM

This is an archive of the discontinued LLVM Phabricator instance.

[TargetLowering] Combine known bits for icmp in SimplifySetCC (PR41182)
AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 287792

llvm/include/llvm/Support/KnownBits.h

This is an archive of the discontinued LLVM Phabricator instance.

[TargetLowering] Combine known bits for icmp in SimplifySetCC (PR41182)AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 287792

llvm/include/llvm/Support/KnownBits.h

[TargetLowering] Combine known bits for icmp in SimplifySetCC (PR41182)
AbandonedPublic