This is an archive of the discontinued LLVM Phabricator instance.

[SimplifyCFG] Try to fold switch with single result value and power-of-2 cases to mask+select
ClosedPublic

Authored by bcl5980 on Mar 25 2022, 8:31 AM.

Download Raw Diff

Details

Reviewers

spatel
lebedev.ri
nikic
RKSimon
craig.topper
nick
majnemer

Commits

rG00871e2f4f9f: [SimplifyCFG] Try to fold switch with single result value and power-of-2 cases…

Summary

When switch with 2^n cases go to one result, check if the 2^n cases can be covered by n bit masks.
If yes we can use "and condition, ~mask" to simplify the switch

case 0 2 4 6 -> and condition, -7
https://alive2.llvm.org/ce/z/jjH_0N

case 0 2 8 10 -> and condition, -11
https://alive2.llvm.org/ce/z/K7E-2V

case 2 4 8 12 -> and (sub condition, 2), -11
https://alive2.llvm.org/ce/z/CrxbYg

Fix one case of https://github.com/llvm/llvm-project/issues/39957

Diff Detail

Event Timeline

bcl5980 created this revision.Mar 25 2022, 8:31 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 25 2022, 8:31 AM

Herald added subscribers: StephenFan, hiraditya. · View Herald Transcript

bcl5980 requested review of this revision.Mar 25 2022, 8:31 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 25 2022, 8:31 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

I have no too much confidence about this change, so this should be a scatch without detail comments and the magic number is also a rough value.
So can someone help to give me suggestion it is worth or not?

https://godbolt.org/z/zPhnrexca
This change should get result similar to MSVC except ztfos4

Harbormaster completed remote builds in B156298: Diff 418231.Mar 25 2022, 9:04 AM

bcl5980 added a reviewer: nick.Mar 25 2022, 9:14 AM

bcl5980 edited the summary of this revision. (Show Details)Mar 27 2022, 9:31 PM

fix the bitmask init value wrong
remove the MaxCasesPerResult limitation

update the correct mask0 case

Harbormaster completed remote builds in B156718: Diff 418825.Mar 29 2022, 11:39 PM

RKSimon added inline comments.Mar 30 2022, 4:01 AM

llvm/test/Transforms/SimplifyCFG/switch-to-select-two-case.ll
70	(style) please can you replace the mask# naming sequence with test names that are more descriptive?

update test name

bcl5980 marked an inline comment as done.Mar 30 2022, 4:38 AM

Harbormaster completed remote builds in B156924: Diff 419101.Mar 30 2022, 4:15 PM

Ping.
Any suggestions for the patch?

@RKSimon Any suggestions for the patch?

Ping.

bcl5980 added a reviewer: majnemer.Apr 12 2022, 2:07 AM

spatel mentioned this in D123614: [SimplifyCFG] cleanup code for converting switch to select (NFC).Apr 12 2022, 8:40 AM

I have not stepped through the logic in "ConvertTwoCaseSwitch" in detail, but this seems to be on the right track.
We need to update function names and code comments for the more general transform now, so we should clean this set of functions up as much as possible before this patch. See if this makes sense:
D123614

In D122485#3445834, @spatel wrote:

I have not stepped through the logic in "ConvertTwoCaseSwitch" in detail, but this seems to be on the right track.
We need to update function names and code comments for the more general transform now, so we should clean this set of functions up as much as possible before this patch. See if this makes sense:
D123614

OK, I will rebase after D123614 land.

spatel mentioned this in rGd9211be13dda: [SimplifyCFG] cleanup code for converting switch to select (NFC).Apr 12 2022, 9:19 AM

rebase code

Update comments

bcl5980 added inline comments.Apr 12 2022, 10:34 AM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5646	@spatel I'm a little worry about here. I remove the early out this version. It will cause compile time increase if we have some very large switch with many cases to the same result but not the pattern we can fold. But I have no detail data to show how much extra compile this change will be involved. Do we have some common compile time tests? Another way to do is enlarge MaxCasesPerResult. The patch first version adjust MaxCasesPerResult to 16. But 16 is also a magic number. Maybe we can add a config for it. How about your suggestions?

spatel added inline comments.Apr 12 2022, 11:16 AM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5646	@nikic has a system that is used to check for compile time changes with benchmarks from the test suite: https://llvm-compile-time-tracker.com/index.php So we can run an experiment there. But I think it is fine to use "16" as a limit for this analysis. You can give it a name and make it a debug option like the many others at the top of this file. For example: static cl::opt<unsigned> MaxSpeculationDepth( "max-speculation-depth", cl::Hidden, cl::init(10), cl::desc("Limit maximum recursion depth when calculating costs of " "speculatively executed instructions"));

Harbormaster completed remote builds in B159282: Diff 422281.Apr 12 2022, 11:39 AM

spatel mentioned this in D123625: [SimplifyCFG] make a debug option for case max when converting switch to select .Apr 12 2022, 12:33 PM

spatel added inline comments.Apr 12 2022, 12:35 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5646	We can reduce this patch by doing that part independently: D123625

bcl5980 added inline comments.Apr 13 2022, 1:11 AM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5646	Thanks for the help. I will rebase after D123625

spatel mentioned this in rGcd0d0d633bc6: [SimplifyCFG] make a debug option for case max when converting switch to select.Apr 13 2022, 4:10 AM

rebase main

Harbormaster completed remote builds in B159427: Diff 422476.Apr 13 2022, 5:47 AM

Let's add the baseline tests as an NFC commit now, so it is easier to see the diffs.

The order of the transforms creates an interesting trade-off, so we need a test like this (and probably even more tests):

define i8 @same_value_two_case(i32 %i) {
entry:
  switch i32 %i, label %default [
  i32 -3, label %end
  i32 5, label %end
  ]

default:
  br label %end

end:
  %t0 = phi i8 [ 42, %default ], [ 3, %entry ], [ 3, %entry ]
  ret i8 %t0
}

This patch creates a difference that survives all the way through codegen - instcombine does not recognize the equivalence between the 2 patterns:
https://alive2.llvm.org/ce/z/mqo87Z

It's not clear to me if there is a universal better form (depends on target?) or even which one is better for IR. To avoid those questions, you can re-order the transforms, so we do not have to answer it in this patch (add a TODO comment though).

In D122485#3448240, @spatel wrote:
Let's add the baseline tests as an NFC commit now, so it is easier to see the diffs.

The order of the transforms creates an interesting trade-off, so we need a test like this (and probably even more tests):
define i8 @same_value_two_case(i32 %i) {
entry:
  switch i32 %i, label %default [
  i32 -3, label %end
  i32 5, label %end
  ]

default:
  br label %end

end:
  %t0 = phi i8 [ 42, %default ], [ 3, %entry ], [ 3, %entry ]
  ret i8 %t0
}

I'm sorry but what's the difference between this test with switch_to_and0 ? For the negative offset?

In D122485#3448273, @bcl5980 wrote:
In D122485#3448240, @spatel wrote:
Let's add the baseline tests as an NFC commit now, so it is easier to see the diffs.

The order of the transforms creates an interesting trade-off, so we need a test like this (and probably even more tests):
define i8 @same_value_two_case(i32 %i) {
entry:
  switch i32 %i, label %default [
  i32 -3, label %end
  i32 5, label %end
  ]

default:
  br label %end

end:
  %t0 = phi i8 [ 42, %default ], [ 3, %entry ], [ 3, %entry ]
  ret i8 %t0
}
I'm sorry but what's the difference between this test with switch_to_and0 ? For the negative offset?

Ah, you're correct - I did not see that test diff. But we might want to include a test with negative offset, so we have coverage for that too.

In D122485#3448283, @spatel wrote:
In D122485#3448273, @bcl5980 wrote:
In D122485#3448240, @spatel wrote:
Let's add the baseline tests as an NFC commit now, so it is easier to see the diffs.

The order of the transforms creates an interesting trade-off, so we need a test like this (and probably even more tests):
define i8 @same_value_two_case(i32 %i) {
entry:
  switch i32 %i, label %default [
  i32 -3, label %end
  i32 5, label %end
  ]

default:
  br label %end

end:
  %t0 = phi i8 [ 42, %default ], [ 3, %entry ], [ 3, %entry ]
  ret i8 %t0
}
I'm sorry but what's the difference between this test with switch_to_and0 ? For the negative offset?
Ah, you're correct - I did not see that test diff. But we might want to include a test with negative offset, so we have coverage for that too.

@spatel I feel so sorry that I check in the baseline test here rGe2d77a160c with a wrong issue number(show be #39957 but I write #54649).
Can I amend the commit message now?

rebase main

In D122485#3448422, @bcl5980 wrote:

@spatel I feel so sorry that I check in the baseline test here rGe2d77a160c with a wrong issue number(show be #39957 but I write #54649).
Can I amend the commit message now?

I do not know if you can amend a commit message after a push (but my git knowledge is not very good).

I don't think it is a big problem, but you could revert and recommit.

Alternatively, you can post a correction on the Phab review thread:
https://reviews.llvm.org/rGe2d77a160c5b8141eca3db1fca6dafd97e78288d
or on github directly?
https://github.com/llvm/llvm-project/commit/e2d77a160c5b8141eca3db1fca6dafd97e78288d

bcl5980 mentioned this in rGfd0641b58c37: [SimplifyCFG] add tests for switch to select; NFC.Apr 13 2022, 7:36 AM

Harbormaster completed remote builds in B159446: Diff 422511.Apr 13 2022, 7:55 AM

In D122485#3448240, @spatel wrote:
Let's add the baseline tests as an NFC commit now, so it is easier to see the diffs.

The order of the transforms creates an interesting trade-off, so we need a test like this (and probably even more tests):
define i8 @same_value_two_case(i32 %i) {
entry:
  switch i32 %i, label %default [
  i32 -3, label %end
  i32 5, label %end
  ]

default:
  br label %end

end:
  %t0 = phi i8 [ 42, %default ], [ 3, %entry ], [ 3, %entry ]
  ret i8 %t0
}
This patch creates a difference that survives all the way through codegen - instcombine does not recognize the equivalence between the 2 patterns:
https://alive2.llvm.org/ce/z/mqo87Z

It's not clear to me if there is a universal better form (depends on target?) or even which one is better for IR. To avoid those questions, you can re-order the transforms, so we do not have to answer it in this patch (add a TODO comment though).

Try two patterns on current backend, this patch's implementation generate better asm on x86 and aarch64: https://godbolt.org/z/nsoWa7Kqr

spatel mentioned this in rGd038135e1913: [SimplifyCFG] add more tests for switch to select transform; NFC.Apr 13 2022, 2:15 PM

In D122485#3448597, @bcl5980 wrote:

Try two patterns on current backend, this patch's implementation generate better asm on x86 and aarch64: https://godbolt.org/z/nsoWa7Kqr

Thanks for checking that. I suspect that a target that has condition-logic instructions (like PowerPC) might be better off with the icmp pattern for some examples, but I agree that we can default to the 'and' pattern here. It is shorter IR for the case with no case minimum offset.
I added some more tests and tried to clean up the existing code a bit more. Please rebase and address minor issues. Then I think this patch will be good to go.

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5684	I think it would be better to put these examples closer to the code that does the transform. I moved the existing example comment with: 0ef46dc0f9f3
5688	Remove the TODO - if we are going to use the new code on the 2-case pattern, then it is really a TODO for instcombine and/or codegen to alter it if needed.
5692	Don't move this comment.
5710	I added: ArrayRef<ConstantInt *> CaseValues = ResultVector[0].second; to make this more readable. You can use that in the new code.
5714	Use complete sentence/punctuation for code comments: // Find minimal value.
5720	Add period at end of sentence.
5725–5726	Add period at end of sentence.

Also, I'm not sure what "mutil" means in the patch title. Could this be "Try to fold switch with single result value and power-of-2 cases to mask+select" ?

rebase and update comments

Harbormaster completed remote builds in B159598: Diff 422724.Apr 13 2022, 8:19 PM

LGTM - I made a few more minor comments.

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
5711–5713	The bit numbering is not clear to me. "0,4" clears bit 2? It might help to show the mask as a bit value (for example -5 is "0b1..1011"). Maybe better to spell out the comparison too: "Cond & 0b1..1011 == 0 ? result : default"
5716	Move this variable declaration/initialization to just above the loop where it is filled in.
5724–5726	Remove unnecessary braces around 1-line loop - { }.
llvm/test/Transforms/SimplifyCFG/switch-to-select-two-case.ll
68–223	It is independent of this patch, but we should put a TODO comment on this test or in the code because we do not produce the optimal code when there is no default: https://alive2.llvm.org/ce/z/7qxumF

This revision is now accepted and ready to land.Apr 14 2022, 7:44 AM

update comments

This revision was landed with ongoing or failed builds.Apr 14 2022, 9:17 AM

Closed by commit rG00871e2f4f9f: [SimplifyCFG] Try to fold switch with single result value and power-of-2 cases… (authored by bcl5980). · Explain Why

This revision was automatically updated to reflect the committed changes.

bcl5980 added a commit: rG00871e2f4f9f: [SimplifyCFG] Try to fold switch with single result value and power-of-2 cases….

@spatel Thanks for the review. For me write comments and commit message is much harder than code because of my poor English. These comments are really helpful .

Harbormaster completed remote builds in B159702: Diff 422889.Apr 14 2022, 10:02 AM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Utils/

SimplifyCFG.cpp

70 lines

test/

Transforms/

SimplifyCFG/

switch-to-select-two-case.ll

155 lines

Diff 422268

llvm/lib/Transforms/Utils/SimplifyCFG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,614 Lines • ▼ Show 20 Lines	UniqueResults.push_back(
std::make_pair(Result, SmallVector<ConstantInt *, 4>(1, CaseVal)));		std::make_pair(Result, SmallVector<ConstantInt *, 4>(1, CaseVal)));
return 1;		return 1;
}		}

// Helper function that initializes a map containing		// Helper function that initializes a map containing
// results for the PHI node of the common destination block for a switch		// results for the PHI node of the common destination block for a switch
// instruction. Returns false if multiple PHI nodes have been found or if		// instruction. Returns false if multiple PHI nodes have been found or if
// there is not a common destination block for the switch.		// there is not a common destination block for the switch.
static bool		static bool initializeUniqueCases(SwitchInst SI, PHINode &PHI,
initializeUniqueCases(SwitchInst SI, PHINode &PHI, BasicBlock *&CommonDest,		BasicBlock *&CommonDest,
SwitchCaseResultVectorTy &UniqueResults,		SwitchCaseResultVectorTy &UniqueResults,
Constant *&DefaultResult, const DataLayout &DL,		Constant *&DefaultResult,
		const DataLayout &DL,
const TargetTransformInfo &TTI,		const TargetTransformInfo &TTI,
uintptr_t MaxUniqueResults, uintptr_t MaxCasesPerResult) {		uintptr_t MaxUniqueResults) {
for (auto &I : SI->cases()) {		for (auto &I : SI->cases()) {
ConstantInt *CaseVal = I.getCaseValue();		ConstantInt *CaseVal = I.getCaseValue();

// Resulting value at phi nodes for this case value.		// Resulting value at phi nodes for this case value.
SwitchCaseResultsTy Results;		SwitchCaseResultsTy Results;
if (!getCaseResults(SI, CaseVal, I.getCaseSuccessor(), &CommonDest, Results,		if (!getCaseResults(SI, CaseVal, I.getCaseSuccessor(), &CommonDest, Results,
DL, TTI))		DL, TTI))
return false;		return false;

// Only one value per case is permitted.		// Only one value per case is permitted.
if (Results.size() > 1)		if (Results.size() > 1)
return false;		return false;

// Add the case->result mapping to UniqueResults.		// Add the case->result mapping to UniqueResults.
const uintptr_t NumCasesForResult =		const uintptr_t NumCasesForResult =
mapCaseToResult(CaseVal, UniqueResults, Results.begin()->second);		mapCaseToResult(CaseVal, UniqueResults, Results.begin()->second);

// Early out if there are too many cases for this result.
bcl5980AuthorUnsubmitted Done Reply Inline Actions @spatel I'm a little worry about here. I remove the early out this version. It will cause compile time increase if we have some very large switch with many cases to the same result but not the pattern we can fold. But I have no detail data to show how much extra compile this change will be involved. Do we have some common compile time tests? Another way to do is enlarge MaxCasesPerResult. The patch first version adjust MaxCasesPerResult to 16. But 16 is also a magic number. Maybe we can add a config for it. How about your suggestions? bcl5980: @spatel I'm a little worry about here. I remove the early out this version. It will cause…
spatelUnsubmitted Not Done Reply Inline Actions @nikic has a system that is used to check for compile time changes with benchmarks from the test suite: https://llvm-compile-time-tracker.com/index.php So we can run an experiment there. But I think it is fine to use "16" as a limit for this analysis. You can give it a name and make it a debug option like the many others at the top of this file. For example: static cl::opt<unsigned> MaxSpeculationDepth( "max-speculation-depth", cl::Hidden, cl::init(10), cl::desc("Limit maximum recursion depth when calculating costs of " "speculatively executed instructions")); spatel: @nikic has a system that is used to check for compile time changes with benchmarks from the…
spatelUnsubmitted Not Done Reply Inline Actions We can reduce this patch by doing that part independently: D123625 spatel: We can reduce this patch by doing that part independently: D123625
bcl5980AuthorUnsubmitted Done Reply Inline Actions Thanks for the help. I will rebase after D123625 bcl5980: Thanks for the help. I will rebase after D123625
if (NumCasesForResult > MaxCasesPerResult)
return false;

// Early out if there are too many unique results.		// Early out if there are too many unique results.
if (UniqueResults.size() > MaxUniqueResults)		if (UniqueResults.size() > MaxUniqueResults)
return false;		return false;

// Check the PHI consistency.		// Check the PHI consistency.
if (!PHI)		if (!PHI)
PHI = Results[0].first;		PHI = Results[0].first;
else if (PHI != Results[0].first)		else if (PHI != Results[0].first)
Show All 21 Lines
// switch (a) {		// switch (a) {
// case 10: %0 = icmp eq i32 %a, 10		// case 10: %0 = icmp eq i32 %a, 10
// return 10; %1 = select i1 %0, i32 10, i32 4		// return 10; %1 = select i1 %0, i32 10, i32 4
// case 20: ----> %2 = icmp eq i32 %a, 20		// case 20: ----> %2 = icmp eq i32 %a, 20
// return 2; %3 = select i1 %2, i32 2, i32 %1		// return 2; %3 = select i1 %2, i32 2, i32 %1
// default:		// default:
// return 4;		// return 4;
// }		// }
// TODO: Handle switches with more than 2 cases that map to the same result.		// TODO: Handle switches with more than 2 cases that map to the same result.
		spatelUnsubmitted Done Reply Inline Actions I think it would be better to put these examples closer to the code that does the transform. I moved the existing example comment with: 0ef46dc0f9f3 spatel: I think it would be better to put these examples closer to the code that does the transform. I…
static Value *foldSwitchToSelect(const SwitchCaseResultVectorTy &ResultVector,		static Value *foldSwitchToSelect(const SwitchCaseResultVectorTy &ResultVector,
Constant DefaultResult, Value Condition,		Constant DefaultResult, Value Condition,
IRBuilder<> &Builder) {		IRBuilder<> &Builder) {
// If we are selecting between only two cases transform into a simple		// If we are selecting between only two cases transform into a simple
		spatelUnsubmitted Done Reply Inline Actions Remove the TODO - if we are going to use the new code on the 2-case pattern, then it is really a TODO for instcombine and/or codegen to alter it if needed. spatel: Remove the TODO - if we are going to use the new code on the 2-case pattern, then it is really…
// select or a two-way select if default is possible.		// select or a two-way select if default is possible.
if (ResultVector.size() == 2 && ResultVector[0].second.size() == 1 &&		if (ResultVector.size() == 2 && ResultVector[0].second.size() == 1 &&
ResultVector[1].second.size() == 1) {		ResultVector[1].second.size() == 1) {
ConstantInt *const FirstCase = ResultVector[0].second[0];		ConstantInt *const FirstCase = ResultVector[0].second[0];
		spatelUnsubmitted Done Reply Inline Actions Don't move this comment. spatel: Don't move this comment.
ConstantInt *const SecondCase = ResultVector[1].second[0];		ConstantInt *const SecondCase = ResultVector[1].second[0];

bool DefaultCanTrigger = DefaultResult;		bool DefaultCanTrigger = DefaultResult;
Value *SelectValue = ResultVector[1].first;		Value *SelectValue = ResultVector[1].first;
if (DefaultCanTrigger) {		if (DefaultCanTrigger) {
Value *const ValueCompare =		Value *const ValueCompare =
Builder.CreateICmpEQ(Condition, SecondCase, "switch.selectcmp");		Builder.CreateICmpEQ(Condition, SecondCase, "switch.selectcmp");
SelectValue = Builder.CreateSelect(ValueCompare, ResultVector[1].first,		SelectValue = Builder.CreateSelect(ValueCompare, ResultVector[1].first,
DefaultResult, "switch.select");		DefaultResult, "switch.select");
}		}
Value *const ValueCompare =		Value *const ValueCompare =
Builder.CreateICmpEQ(Condition, FirstCase, "switch.selectcmp");		Builder.CreateICmpEQ(Condition, FirstCase, "switch.selectcmp");
return Builder.CreateSelect(ValueCompare, ResultVector[0].first,		return Builder.CreateSelect(ValueCompare, ResultVector[0].first,
SelectValue, "switch.select");		SelectValue, "switch.select");
}		}

		if (ResultVector.size() == 1 && DefaultResult) {
		unsigned CaseCount = ResultVector[0].second.size();
		spatelUnsubmitted Done Reply Inline Actions I added: ArrayRef<ConstantInt > CaseValues = ResultVector[0].second; to make this more readable. You can use that in the new code. spatel:* I added: ArrayRef<ConstantInt *> CaseValues = ResultVector[0].second; to make this more…
		if (isPowerOf2_32(CaseCount)) {
		// switch (a) {
		// case 0:
		spatelUnsubmitted Not Done Reply Inline Actions The bit numbering is not clear to me. "0,4" clears bit 2? It might help to show the mask as a bit value (for example -5 is "0b1..1011"). Maybe better to spell out the comparison too: "Cond & 0b1..1011 == 0 ? result : default" spatel: The bit numbering is not clear to me. "0,4" clears bit 2? It might help to show the mask as a…
		// case 2:
		spatelUnsubmitted Done Reply Inline Actions Use complete sentence/punctuation for code comments: // Find minimal value. spatel: Use complete sentence/punctuation for code comments: // Find minimal value.
		// case 4:
		// case 6:
		spatelUnsubmitted Not Done Reply Inline Actions Move this variable declaration/initialization to just above the loop where it is filled in. spatel: Move this variable declaration/initialization to just above the loop where it is filled in.
		// return 1; ----> %0 = and i32 %a, -7
		// default: %1 = icmp eq i32 %0, 0
		// return 2; %2 = select i1 %1, i32 1, i32 2
		// }
		spatelUnsubmitted Done Reply Inline Actions Add period at end of sentence. spatel: Add period at end of sentence.
		ConstantInt *MinCaseVal = ResultVector[0].second[0];
		APInt BitMask = APInt::getZero(MinCaseVal->getBitWidth());
		for (auto Case : ResultVector[0].second) {
		if (Case->getValue().slt(MinCaseVal->getValue()))
		MinCaseVal = Case;
		}
		spatelUnsubmitted Done Reply Inline Actions Add period at end of sentence. spatel: Add period at end of sentence.
		spatelUnsubmitted Not Done Reply Inline Actions Remove unnecessary braces around 1-line loop - { }. spatel: Remove unnecessary braces around 1-line loop - { }.
		for (auto Case : ResultVector[0].second) {
		BitMask \|= (Case->getValue() - MinCaseVal->getValue());
		}

		if (BitMask.countPopulation() == Log2_32(CaseCount)) {
		if (!MinCaseVal->isNullValue())
		Condition = Builder.CreateSub(Condition, MinCaseVal);
		Value *And = Builder.CreateAnd(Condition, ~BitMask, "switch.and");
		Value *Cmp = Builder.CreateICmpEQ(
		And, Constant::getNullValue(And->getType()), "switch.selectcmp");
		return Builder.CreateSelect(Cmp, ResultVector[0].first, DefaultResult);
		}
		}

// Handle the degenerate case where two cases have the same value.		// Handle the degenerate case where two cases have the same value.
if (ResultVector.size() == 1 && ResultVector[0].second.size() == 2 &&		if (ResultVector[0].second.size() == 2) {
DefaultResult) {		Value *Cmp1 = Builder.CreateICmpEQ(Condition, ResultVector[0].second[0],
Value *Cmp1 = Builder.CreateICmpEQ(		"switch.selectcmp.case1");
Condition, ResultVector[0].second[0], "switch.selectcmp.case1");		Value *Cmp2 = Builder.CreateICmpEQ(Condition, ResultVector[0].second[1],
Value *Cmp2 = Builder.CreateICmpEQ(		"switch.selectcmp.case2");
Condition, ResultVector[0].second[1], "switch.selectcmp.case2");
Value *Cmp = Builder.CreateOr(Cmp1, Cmp2, "switch.selectcmp");		Value *Cmp = Builder.CreateOr(Cmp1, Cmp2, "switch.selectcmp");
return Builder.CreateSelect(Cmp, ResultVector[0].first, DefaultResult);		return Builder.CreateSelect(Cmp, ResultVector[0].first, DefaultResult);
}		}
		}

return nullptr;		return nullptr;
}		}

// Helper function to cleanup a switch instruction that has been converted into		// Helper function to cleanup a switch instruction that has been converted into
// a select, fixing up PHI nodes and basic blocks.		// a select, fixing up PHI nodes and basic blocks.
static void removeSwitchAfterSelectFold(SwitchInst SI, PHINode PHI,		static void removeSwitchAfterSelectFold(SwitchInst SI, PHINode PHI,
Value *SelectValue,		Value *SelectValue,
Show All 37 Lines	static bool trySwitchToSelect(SwitchInst *SI, IRBuilder<> &Builder,
const TargetTransformInfo &TTI) {		const TargetTransformInfo &TTI) {
Value *const Cond = SI->getCondition();		Value *const Cond = SI->getCondition();
PHINode *PHI = nullptr;		PHINode *PHI = nullptr;
BasicBlock *CommonDest = nullptr;		BasicBlock *CommonDest = nullptr;
Constant *DefaultResult;		Constant *DefaultResult;
SwitchCaseResultVectorTy UniqueResults;		SwitchCaseResultVectorTy UniqueResults;
// Collect all the cases that will deliver the same value from the switch.		// Collect all the cases that will deliver the same value from the switch.
if (!initializeUniqueCases(SI, PHI, CommonDest, UniqueResults, DefaultResult,		if (!initializeUniqueCases(SI, PHI, CommonDest, UniqueResults, DefaultResult,
DL, TTI, /MaxUniqueResults/ 2,		DL, TTI, /MaxUniqueResults/ 2))
/MaxCasesPerResult/ 2))
return false;		return false;

assert(PHI != nullptr && "PHI for value select not found");		assert(PHI != nullptr && "PHI for value select not found");
Builder.SetInsertPoint(SI);		Builder.SetInsertPoint(SI);
Value *SelectValue =		Value *SelectValue =
foldSwitchToSelect(UniqueResults, DefaultResult, Cond, Builder);		foldSwitchToSelect(UniqueResults, DefaultResult, Cond, Builder);
if (!SelectValue)		if (!SelectValue)
return false;		return false;
▲ Show 20 Lines • Show All 1,363 Lines • Show Last 20 Lines

llvm/test/Transforms/SimplifyCFG/switch-to-select-two-case.ll

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	sw.bb:
br label %return		br label %return

sw.epilog:		sw.epilog:
br label %return		br label %return

return:		return:
%retval.0 = phi i32 [ 4, %sw.epilog ], [ 10, %sw.bb ]		%retval.0 = phi i32 [ 4, %sw.epilog ], [ 10, %sw.bb ]
ret i32 %retval.0		ret i32 %retval.0
}		}

		define i1 @switch_to_and0(i8 %0) {
		RKSimonUnsubmitted Done Reply Inline Actions (style) please can you replace the mask# naming sequence with test names that are more descriptive? RKSimon: (style) please can you replace the mask# naming sequence with test names that are more…
		; CHECK-LABEL: @switch_to_and0(
		; CHECK-NEXT: [[TMP2:%.]] = sub i8 [[TMP0:%.]], 43
		; CHECK-NEXT: [[SWITCH_AND:%.*]] = and i8 [[TMP2]], -3
		; CHECK-NEXT: [[SWITCH_SELECTCMP:%.*]] = icmp eq i8 [[SWITCH_AND]], 0
		; CHECK-NEXT: [[TMP3:%.*]] = select i1 [[SWITCH_SELECTCMP]], i1 true, i1 false
		; CHECK-NEXT: ret i1 [[TMP3]]
		;
		switch i8 %0, label %2 [
		i8 43, label %3
		i8 45, label %3
		]

		2:
		br label %3

		3:
		%4 = phi i1 [ false, %2 ], [ true, %1 ], [ true, %1 ]
		ret i1 %4
		}


		define i1 @switch_to_and1(i32 %i) {
		; CHECK-LABEL: @switch_to_and1(
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[SWITCH_AND:%.]] = and i32 [[I:%.]], -7
		; CHECK-NEXT: [[SWITCH_SELECTCMP:%.*]] = icmp eq i32 [[SWITCH_AND]], 0
		; CHECK-NEXT: [[TMP0:%.*]] = select i1 [[SWITCH_SELECTCMP]], i1 true, i1 false
		; CHECK-NEXT: ret i1 [[TMP0]]
		;
		entry:
		switch i32 %i, label %lor.rhs [
		i32 0, label %lor.end
		i32 2, label %lor.end
		i32 4, label %lor.end
		i32 6, label %lor.end
		]

		lor.rhs:
		br label %lor.end

		lor.end:
		%0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ], [ true, %entry ]
		ret i1 %0
		}

		define i1 @switch_to_and2(i32 %i) {
		; CHECK-LABEL: @switch_to_and2(
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[SWITCH_AND:%.]] = and i32 [[I:%.]], -11
		; CHECK-NEXT: [[SWITCH_SELECTCMP:%.*]] = icmp eq i32 [[SWITCH_AND]], 0
		; CHECK-NEXT: [[TMP0:%.*]] = select i1 [[SWITCH_SELECTCMP]], i1 true, i1 false
		; CHECK-NEXT: ret i1 [[TMP0]]
		;
		entry:
		switch i32 %i, label %lor.rhs [
		i32 0, label %lor.end
		i32 2, label %lor.end
		i32 8, label %lor.end
		i32 10, label %lor.end
		]

		lor.rhs:
		br label %lor.end

		lor.end:
		%0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ], [ true, %entry ]
		ret i1 %0
		}

		define i1 @switch_to_and3(i32 %i) {
		; CHECK-LABEL: @switch_to_and3(
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[TMP0:%.]] = sub i32 [[I:%.]], 2
		; CHECK-NEXT: [[SWITCH_AND:%.*]] = and i32 [[TMP0]], -11
		; CHECK-NEXT: [[SWITCH_SELECTCMP:%.*]] = icmp eq i32 [[SWITCH_AND]], 0
		; CHECK-NEXT: [[TMP1:%.*]] = select i1 [[SWITCH_SELECTCMP]], i1 true, i1 false
		; CHECK-NEXT: ret i1 [[TMP1]]
		;
		entry:
		switch i32 %i, label %lor.rhs [
		i32 2, label %lor.end
		i32 4, label %lor.end
		i32 10, label %lor.end
		i32 12, label %lor.end
		]

		lor.rhs:
		br label %lor.end

		lor.end:
		%0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ], [ true, %entry ]
		ret i1 %0
		}

		define i1 @negative_switch_to_and0(i32 %i) {
		; CHECK-LABEL: @negative_switch_to_and0(
		; CHECK-NEXT: entry:
		; CHECK-NEXT: switch i32 [[I:%.]], label [[LOR_RHS:%.]] [
		; CHECK-NEXT: i32 1, label [[LOR_END:%.*]]
		; CHECK-NEXT: i32 4, label [[LOR_END]]
		; CHECK-NEXT: i32 10, label [[LOR_END]]
		; CHECK-NEXT: i32 12, label [[LOR_END]]
		; CHECK-NEXT: ]
		; CHECK: lor.rhs:
		; CHECK-NEXT: br label [[LOR_END]]
		; CHECK: lor.end:
		; CHECK-NEXT: [[TMP0:%.]] = phi i1 [ true, [[ENTRY:%.]] ], [ false, [[LOR_RHS]] ], [ true, [[ENTRY]] ], [ true, [[ENTRY]] ], [ true, [[ENTRY]] ]
		; CHECK-NEXT: ret i1 [[TMP0]]
		;
		entry:
		switch i32 %i, label %lor.rhs [
		i32 1, label %lor.end
		i32 4, label %lor.end
		i32 10, label %lor.end
		i32 12, label %lor.end
		]

		lor.rhs:
		br label %lor.end

		lor.end:
		%0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ], [ true, %entry ]
		ret i1 %0
		}

		define i1 @negative_switch_to_and1(i32 %i) {
		; CHECK-LABEL: @negative_switch_to_and1(
		; CHECK-NEXT: entry:
		; CHECK-NEXT: switch i32 [[I:%.]], label [[LOR_RHS:%.]] [
		; CHECK-NEXT: i32 0, label [[LOR_END:%.*]]
		; CHECK-NEXT: i32 2, label [[LOR_END]]
		; CHECK-NEXT: i32 4, label [[LOR_END]]
		; CHECK-NEXT: ]
		; CHECK: lor.rhs:
		; CHECK-NEXT: br label [[LOR_END]]
		; CHECK: lor.end:
		; CHECK-NEXT: [[TMP0:%.]] = phi i1 [ true, [[ENTRY:%.]] ], [ false, [[LOR_RHS]] ], [ true, [[ENTRY]] ], [ true, [[ENTRY]] ]
		; CHECK-NEXT: ret i1 [[TMP0]]
		;
		entry:
		switch i32 %i, label %lor.rhs [
		i32 0, label %lor.end
		i32 2, label %lor.end
		i32 4, label %lor.end
		]

		lor.rhs:
		br label %lor.end

		lor.end:
		%0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ]
		ret i1 %0
		}
		spatelUnsubmitted Not Done Reply Inline Actions It is independent of this patch, but we should put a TODO comment on this test or in the code because we do not produce the optimal code when there is no default: https://alive2.llvm.org/ce/z/7qxumF spatel: It is independent of this patch, but we should put a TODO comment on this test or in the code…