This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
-
LazyValueInfo.h
-
lib/
-
Analysis/
3/6
LazyValueInfo.cpp
-
Transforms/Scalar/
-
Scalar/
-
CorrelatedValuePropagation.cpp
-
test/Transforms/CorrelatedValuePropagation/
-
Transforms/
-
CorrelatedValuePropagation/
-
basic.ll
-
icmp.ll
-
range.ll

Differential D69686

[LVI][CVP] Use block value when simplifying icmps
ClosedPublic

Authored by nikic on Oct 31 2019, 2:38 PM.

Download Raw Diff

Details

Reviewers

reames
spatel
apilipenko
fhahn
lebedev.ri

Commits

rGfe79061be2b2: [LVI][CVP] Use block value when simplifying icmps

Summary

Add a flag to getPredicateAt() that allows making use of the block value. This allows us to take into account range information from the current block, rather than only information that is threaded over edges, making the icmp simplification in CVP a lot more useful.

I'm not changing getPredicateAt() to use the block value unconditionally to avoid any impact on the JumpThreading pass, which is somewhat picky about LVI query order.

Most test changes here are just icmps that now get dropped (while previously only a result used in a return was replaced). The two tests in icmp.ll show two representative improvements. Once this lands, I will also clean up CVP tests to drop lots of dummy block splits that had to be added to work around the previous weakness.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nikic created this revision.Oct 31 2019, 2:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 31 2019, 2:38 PM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

nikic marked 2 inline comments as done.Oct 31 2019, 2:45 PM

nikic added inline comments.

llvm/test/Transforms/JumpThreading/bb-unreachable-from-entry.ll
13 ↗	(On Diff #227351)	This change is caused because we now successfully evaluate the icmp to a constant, and jump threading does not clean up after that (simplifycfg will). Previously this used some of the threading logic, which follows the chain to the end.
llvm/test/Transforms/JumpThreading/header-succ.ll
113 ↗	(On Diff #227351)	It took me a while to understand what is going here... what happens is that we have a branch on undef and previously this branch was folded in one direction (to an exit), while now it goes into the other (into an infinite loop). Both are valid. The reason is again that we evaluate the icmp to a constant.

Ooh, finally, nice!
What's going on with JumpThreading tests?

@lebedev.ri I added a couple of comments on the JT tests, probably after you opened the tab ^^ The reason for the diffs is that we now evaluate the condition to a constant, while previously these were phi threaded. It has a bit of an odd effect here, but I would expect that in practice folding a branch completely is generally preferable.

In D69686#1730358, @nikic wrote:

@lebedev.ri I added a couple of comments on the JT tests, probably after you opened the tab ^^

Yes.

The diff seems okay to me but i'm not deeply familiar with LVI.
I wonder who is the current code owner of LVI/CVP.

What correctness testing was done, test-suite?

alex added a subscriber: alex.Nov 1 2019, 2:37 PM

In D69686#1730600, @lebedev.ri wrote:

What correctness testing was done, test-suite?

I've done a test-suite run now, it passes.

Okay, LG, thank you for looking into it.
@spatel / @reames ?

This revision is now accepted and ready to land.Nov 2 2019, 8:55 AM

Interestingly, we seriously regress in NSW deductions:

| statistic                                 |     old |     new | delta |  % change |
| correlated-value-propagation.NumAShrs     |     209 |     438 |   229 | 109.5694% |
| correlated-value-propagation.NumAddNSW    |    4972 |    4237 |  -735 | -14.7828% |
| correlated-value-propagation.NumAddNUW    |    7141 |    7157 |    16 |   0.2241% |
| correlated-value-propagation.NumAddNW     |   12113 |   11394 |  -719 |  -5.9358% |
| correlated-value-propagation.NumAnd       |     442 |     441 |    -1 |  -0.2262% |
| correlated-value-propagation.NumCmps      |    1194 |    1183 |   -11 |  -0.9213% |
| correlated-value-propagation.NumDeadCases |     111 |     115 |     4 |   3.6036% |
| correlated-value-propagation.NumMulNSW    |     278 |     276 |    -2 |  -0.7194% |
| correlated-value-propagation.NumMulNUW    |    1326 |    1324 |    -2 |  -0.1508% |
| correlated-value-propagation.NumMulNW     |    1604 |    1600 |    -4 |  -0.2494% |
| correlated-value-propagation.NumNSW       |    7160 |    6415 |  -745 | -10.4050% |
| correlated-value-propagation.NumNUW       |   13306 |   13315 |     9 |   0.0676% |
| correlated-value-propagation.NumNW        |   20466 |   19730 |  -736 |  -3.5962% |
| correlated-value-propagation.NumOverflows |       7 |       4 |    -3 | -42.8571% |
| correlated-value-propagation.NumPhiCommon |     397 |     401 |     4 |   1.0076% |
| correlated-value-propagation.NumPhis      |   15360 |   15435 |    75 |   0.4883% |
| correlated-value-propagation.NumSDivs     |     207 |     304 |    97 |  46.8599% |
| correlated-value-propagation.NumSExt      |    6278 |    7226 |   948 |  15.1004% |
| correlated-value-propagation.NumSRems     |      28 |      49 |    21 |  75.0000% |
| correlated-value-propagation.NumShlNSW    |    1171 |    1160 |   -11 |  -0.9394% |
| correlated-value-propagation.NumShlNUW    |    2793 |    2783 |   -10 |  -0.3580% |
| correlated-value-propagation.NumShlNW     |    3964 |    3943 |   -21 |  -0.5298% |
| correlated-value-propagation.NumSubNSW    |     739 |     742 |     3 |   0.4060% |
| correlated-value-propagation.NumSubNUW    |    2046 |    2051 |     5 |   0.2444% |
| correlated-value-propagation.NumSubNW     |    2785 |    2793 |     8 |   0.2873% |
| correlated-value-propagation.NumUDivs     |     353 |     375 |    22 |   6.2323% |
| instcount.NumAShrInst                     |   13761 |   13588 |  -173 |  -1.2572% |
| instcount.NumAddInst                      |  277538 |  277585 |    47 |   0.0169% |
| instcount.NumAllocaInst                   |   28040 |   28043 |     3 |   0.0107% |
| instcount.NumAndInst                      |   66082 |   66163 |    81 |   0.1226% |
| instcount.NumBitCastInst                  |  675473 |  675587 |   114 |   0.0169% |
| instcount.NumBrInst                       |  712987 |  712797 |  -190 |  -0.0266% |
| instcount.NumCallInst                     |  528453 |  528461 |     8 |   0.0015% |
| instcount.NumExtractElementInst           |   19418 |   19420 |     2 |   0.0103% |
| instcount.NumFCmpInst                     |   11039 |   11037 |    -2 |  -0.0181% |
| instcount.NumFNegInst                     |    1270 |    1272 |     2 |   0.1575% |
| instcount.NumFSubInst                     |   90911 |   90915 |     4 |   0.0044% |
| instcount.NumGetElementPtrInst            | 1229297 | 1229518 |   221 |   0.0180% |
| instcount.NumICmpInst                     |  487467 |  487413 |   -54 |  -0.0111% |
| instcount.NumInvokeInst                   |   21780 |   21779 |    -1 |  -0.0046% |
| instcount.NumLShrInst                     |   27409 |   27587 |   178 |   0.6494% |
| instcount.NumLoadInst                     |  890831 |  890834 |     3 |   0.0003% |
| instcount.NumMulInst                      |   43841 |   43832 |    -9 |  -0.0205% |
| instcount.NumOrInst                       |  102648 |  102652 |     4 |   0.0039% |
| instcount.NumPHIInst                      |  317805 |  317726 |   -79 |  -0.0249% |
| instcount.NumPtrToIntInst                 |   16114 |   16116 |     2 |   0.0124% |
| instcount.NumRetInst                      |   88782 |   88777 |    -5 |  -0.0056% |
| instcount.NumSDivInst                     |    8732 |    8670 |   -62 |  -0.7100% |
| instcount.NumSExtInst                     |   79306 |   78745 |  -561 |  -0.7074% |
| instcount.NumSRemInst                     |    1679 |    1678 |    -1 |  -0.0596% |
| instcount.NumSelectInst                   |   46181 |   46251 |    70 |   0.1516% |
| instcount.NumShlInst                      |   40640 |   40575 |   -65 |  -0.1599% |
| instcount.NumShuffleVectorInst            |  100326 |  100317 |    -9 |  -0.0090% |
| instcount.NumStoreInst                    |  814080 |  814211 |   131 |   0.0161% |
| instcount.NumSubInst                      |   61973 |   61976 |     3 |   0.0048% |
| instcount.NumTruncInst                    |   62132 |   62166 |    34 |   0.0547% |
| instcount.NumUDivInst                     |    2526 |    2535 |     9 |   0.3563% |
| instcount.NumURemInst                     |    1589 |    1598 |     9 |   0.5664% |
| instcount.NumUnreachableInst              |   13486 |   13467 |   -19 |  -0.1409% |
| instcount.NumXorInst                      |   10632 |   10643 |    11 |   0.1035% |
| instcount.NumZExtInst                     |   68227 |   69001 |   774 |   1.1344% |
| instcount.TotalBlocks                     |  847787 |  847572 |  -215 |  -0.0254% |
| instcount.TotalFuncs                      |   88847 |   88843 |    -4 |  -0.0045% |
| instcount.TotalInsts                      | 7411666 | 7412146 |   480 |   0.0065% |

fhahn added a subscriber: fhahn.Nov 4 2019, 1:55 AM

lebedev.ri added inline comments.Nov 4 2019, 6:31 AM

llvm/lib/Analysis/LazyValueInfo.cpp
1766–1768	Should we be getting both, and combining their knowledge somehow?

Can you give some context on the problem you're trying to solve? This doesn't look quite right, but maybe with some context I can make a suggestion as to how to approach cleanly?

llvm/lib/Analysis/LazyValueInfo.cpp
1762	Wait, no, please don't do this. Please don't bake in assumption about the semantics of the function based on the type of the argument. If this difference exists, we should fix/remove it. I've long thought we needed to have a getValueAtBegin(BB) variant. Would that solve your use case?

(there is no "revoke review" so marking as "changes needed" instead)

This revision now requires changes to proceed.Nov 5 2019, 12:56 AM

@lebedev.ri Weirdly that does not match the results I get:

correlated-value-propagation.NumAShrs | 199 | 425
correlated-value-propagation.NumAddNSW | 1975 | 2056
correlated-value-propagation.NumAddNUW | 4156 | 4173
correlated-value-propagation.NumAddNW | 6131 | 6229
correlated-value-propagation.NumAnd | 194 | 204
correlated-value-propagation.NumCmps | 809 | 933
correlated-value-propagation.NumDeadCases | 110 | 114
correlated-value-propagation.NumMulNSW | 131 | 129
correlated-value-propagation.NumMulNUW | 832 | 830
correlated-value-propagation.NumMulNW | 963 | 959
correlated-value-propagation.NumNSW | 3781 | 3850
correlated-value-propagation.NumNUW | 7737 | 7745
correlated-value-propagation.NumNW | 11518 | 11595
correlated-value-propagation.NumOverflows | 7 | 4
correlated-value-propagation.NumPhiCommon | 393 | 393
correlated-value-propagation.NumPhis | 11375 | 11442
correlated-value-propagation.NumSDivs | 201 | 295
correlated-value-propagation.NumSExt | 3667 | 4602
correlated-value-propagation.NumSRems | 28 | 47
correlated-value-propagation.NumSelects | 25 | 25
correlated-value-propagation.NumShlNSW | 1082 | 1069
correlated-value-propagation.NumShlNUW | 2262 | 2251
correlated-value-propagation.NumShlNW | 3344 | 3320
correlated-value-propagation.NumSubNSW | 593 | 596
correlated-value-propagation.NumSubNUW | 487 | 491
correlated-value-propagation.NumSubNW | 1080 | 1087
correlated-value-propagation.NumUDivs | 189 | 211

I ran this together with a second patch to CVP which actually computes the icmp result for non-local values, which I was planning to submit as a followup. I don't think that part should impact the NSW deductions though.

llvm/lib/Analysis/LazyValueInfo.cpp
1766–1768	getValueInBlock() is strictly more powerful, so that should not be necessary.

In D69686#1732629, @reames wrote:

Can you give some context on the problem you're trying to solve? This doesn't look quite right, but maybe with some context I can make a suggestion as to how to approach cleanly?

I want to determine the result of icmps based on range information within the same basic block (see the two tests for examples). LVI has all the necessary machinery to do that, but right now that getPredicateAt() method which is used to evaluate icmps in JT and CVP does not actually make use of it. getPrediateAt() is (mostly) weaker than manually evaluating the predicate based on the getConstantRange() result, and CVP actually does that for some of the "non-negative" optimizations.

See also D44252, where you suggested (I think, if I understood correctly) to do what I'm trying to do here.

llvm/lib/Analysis/LazyValueInfo.cpp
1762	I agree that this is pretty weird. I can take a look at fixing the behavior of pointers, which I believe are in the wrong here (the integer behavior is already heavily embedded in how CVP works).

nikic mentioned this in D69914: [LVI] Normalize pointer behavior.Nov 6 2019, 10:59 AM

nikic mentioned this in rG15bc4dc9a894: [LVI] Normalize pointer behavior.Nov 8 2019, 9:03 AM

nikic mentioned this in rG885a05f48a5d: Reapply [LVI] Normalize pointer behavior.Nov 8 2019, 11:21 AM

Rebase over changes to pointer handling.

There are some concerning changes to lvi-after-jumpthreading.ll which I probably missed before.

nikic marked an inline comment as done.Nov 8 2019, 12:54 PM

nikic added inline comments.

llvm/test/Analysis/LazyValueAnalysis/lvi-after-jumpthreading.ll

25 ↗

(On Diff #228504)

There are two issues here. The first is that JumpThreading marks LVI as preserved, but only seems to preserve it in the sense that the result isn't incorrect -- but not necessarily the same as rerunning from scratch.

The second is that we now end up computing block values in a different order, and this has an impact on the results. Here is the debug output of running JumpThreading on this function:

Before this patch:

LVI Getting value   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] at ''
  Result = overdefined
LVI Getting edge value i32 0 from 'entry' to 'loop'
  Result = constantrange<0, 1>
LVI Getting edge value   %iv.next = add nsw i32 %iv, 1 from 'backedge' to 'loop'
PUSH:   %iv.next = add nsw i32 %iv, 1 in backedge
PUSH:   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in backedge
PUSH:   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in loop
POP   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in loop = constantrange<-2147483648, 400>
POP   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in backedge = constantrange<0, 400>
POP   %iv.next = add nsw i32 %iv, 1 in backedge = constantrange<1, 401>
  Result = constantrange<1, 400>
LVI Getting value   %iv.next = add nsw i32 %iv, 1 at ''
  Result = overdefined
LVI Getting value   %iv.next = add nsw i32 %iv, 1 at ''
  Result = overdefined

After this patch:

LVI Getting block end value   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] at 'loop'
PUSH:   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in loop
PUSH:   %iv.next = add nsw i32 %iv, 1 in backedge
PUSH:   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in backedge
POP   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in backedge = constantrange<0, -2147483648>
POP   %iv.next = add nsw i32 %iv, 1 in backedge = constantrange<1, -2147483648>
POP   %iv = phi i32 [ 0, %entry ], [ %iv.next, %backedge ] in loop = constantrange<0, 400>
  Result = constantrange<0, 400>
LVI Getting block end value   %iv.next = add nsw i32 %iv, 1 at 'backedge'
  Result = constantrange<1, -2147483648>
LVI Getting block end value   %iv.next = add nsw i32 %iv, 1 at 'backedge'
  Result = constantrange<1, -2147483648>

Previously we ended up computing %iv in loop first, then %iv in backedge, then %iv.next in backedge. Now we compute %iv in backedge first, then %iv.next in backedge`, then %iv in loop. This eventually results in different results being calculated.

This seems pretty concerning regarding the general design of LVI, and I'm not really sure what I should do about this.

nikic mentioned this in D70044: [LVI] Make results independent of query order (WIP).Nov 9 2019, 6:10 AM

Please can you specify the order of these LVI patches, which one should go after which one, where one would start reviewing?

nikic added a parent revision: D70044: [LVI] Make results independent of query order (WIP).Nov 23 2019, 2:48 AM

nikic added a parent revision: D69914: [LVI] Normalize pointer behavior.Dec 12 2019, 11:10 AM

nikic mentioned this in rG21fbd5587cdf: Reapply [LVI] Normalize pointer behavior.Dec 13 2019, 12:14 AM

nikic mentioned this in D71660: [ValueTracking] isKnownNonZero() should take non-null-ness assumptions into consideration (PR43267).Dec 18 2019, 6:47 AM

nikic mentioned this in D72169: [CVP] Simplify cmp of local phi node.Jan 3 2020, 9:49 AM

nikic mentioned this in rG9d9633fb700d: [CVP] Simplify cmp of local phi node.Feb 26 2020, 11:41 AM

No idea who is comfortable/qualified doing LVI reviews, but that isn't me.

nikic mentioned this in D81544: [LVI] Make use of 'assume'-provided data.Jun 10 2020, 3:00 AM

junparser added a subscriber: junparser.Jun 10 2020, 3:35 AM

nikic mentioned this in rG6d88f6efd448: Reapply [LVI] Normalize pointer behavior.Aug 29 2020, 12:17 PM

Add flag to getPredicateOf() that allows using the block value, and enable it in CVP.

Herald added a subscriber: danielkiss. · View Herald TranscriptAug 30 2020, 8:22 AM

Harbormaster completed remote builds in B70040: Diff 288855.Aug 30 2020, 8:23 AM

nikic added a comment.Aug 30 2020, 8:32 AM

This comment was removed by nikic.

nikic removed a parent revision: D70044: [LVI] Make results independent of query order (WIP).Aug 30 2020, 8:41 AM

Oops, I looked at the wrong revision in my previous comment, that was for D69914.

Now looking at the right one: https://llvm-compile-time-tracker.com/compare.php?from=cb392c870d12eb520f84c8b7eb4f57e37483baed&to=d06337c311c36ce4fd0e939bbcef65f6d53c9d47&stat=instructions Compile-time impact is pretty minimal (< 0.1%).

Relevant stats on test-suite:

correlated-value-propagation.NumCmps | 816 | 1004
sccp.NumDeadBlocks | 7420 | 7413
sccp.NumInstRemoved | 18200 | 18168

I think that the addition of range support in SCCP has already covered many of the cases I wanted this for. Still think it makes sense to do this in CVP as well, as we already have the needed information, and LVI and SCCP have somewhat different strengths.

Ping

Add additional test case @test_br_cmp_with_offset. Found in Rust code and not folded by IPSCCP (or anything else in LLVM).

LGTM as long as this is not too expensive in terms of compile-time, thanks!

As mentioned in the comments and description, this catches cases we cannot really catch in (IP)SCCP or other places at the moment. In particular, the way conditional information is handled in IPSCCP does not allow this kind of simplifications without major changes.

llvm/lib/Analysis/LazyValueInfo.cpp
1763	Currently the comment for `getValueInBlock` says it returns the value at the end of the block. IIUC this only refers to the fact that all assumes in the block are used. But if `CxtI` is passed, only assumes dominating `CxtI` are passed, so effectively we get the value at `CxtI`? Might be good to clarify this in the docs.

nikic added inline comments.Sep 27 2020, 9:25 AM

llvm/lib/Analysis/LazyValueInfo.cpp
1763	Right, the comment there is outdated. Prior to D69914 the value was valid either at the start of the block (for integers) or the end of the block (for pointers), now it is always valid at the start (or at the context instruction if given). I've pushed a clarification to those doc comments in https://github.com/llvm/llvm-project/commit/709d03f8af4da4204849a70f01798e7cebba2e32.

nikic mentioned this in rG4f6e11948c37: [CVP] Make srem test more robust (NFC).Sep 27 2020, 9:57 AM

This revision was not accepted when it landed; it landed in state Needs Review.Sep 27 2020, 11:33 AM

Closed by commit rGfe79061be2b2: [LVI][CVP] Use block value when simplifying icmps (authored by nikic). · Explain Why

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rGfe79061be2b2: [LVI][CVP] Use block value when simplifying icmps.

nikic mentioned this in rG01bde7310bb9: [CVP] Remove unnecessary block splits in tests (NFC).Sep 27 2020, 11:57 AM

Hello.

We've ran into an issues with a case like this:
https://godbolt.org/z/KnvGvo

This now manages to prove that the "max" in a saturating the max(min(X, 32767), -32768) is always false. For some cpu's that's fantastic, and give a decent speed increase. But others have a ssat instruction that can do that saturation quickly, that match from a min(max(..)).
Same goes for MVE instructions where there are a lot of saturating instructions we were previously picking (even if we were not doing it optimally before).

Any ideas of a good way of re-proving that the lower bounds don't need to be checked, but in the backend during ISel?

(In this case the _only_ value that can saturate is a -32768*-32768 multiply, but the code remains and the performance change can be quite substantial. )

In D69686#2298687, @dmgreen wrote:

Hello.

We've ran into an issues with a case like this:
https://godbolt.org/z/KnvGvo

This now manages to prove that the "max" in a saturating the max(min(X, 32767), -32768) is always false. For some cpu's that's fantastic, and give a decent speed increase. But others have a ssat instruction that can do that saturation quickly, that match from a min(max(..)).
Same goes for MVE instructions where there are a lot of saturating instructions we were previously picking (even if we were not doing it optimally before).

Any ideas of a good way of re-proving that the lower bounds don't need to be checked, but in the backend during ISel?

(In this case the _only_ value that can saturate is a -32768*-32768 multiply, but the code remains and the performance change can be quite substantial. )

I don't have a good suggestion here. This doesn't seem to be provable from known bits or known sign bits reasoning (...right?) and I don't think there's any range based reasoning available at the SDAG level right now. Maybe that would be the motivation to introduce it...

I'm pretty surprised this managed to survive this long. I guess IPSCCP did not fold this yet because it doesn't handle SPF.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

LazyValueInfo.h

7 lines

lib/

Analysis/

LazyValueInfo.cpp

7 lines

Transforms/

Scalar/

CorrelatedValuePropagation.cpp

13 lines

test/

Transforms/

CorrelatedValuePropagation/

basic.ll

23 lines

icmp.ll

9 lines

range.ll

4 lines

Diff 294561

llvm/include/llvm/Analysis/LazyValueInfo.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	public:
/// Determine whether the specified value comparison with a constant is known		/// Determine whether the specified value comparison with a constant is known
/// to be true or false on the specified CFG edge.		/// to be true or false on the specified CFG edge.
/// Pred is a CmpInst predicate.		/// Pred is a CmpInst predicate.
Tristate getPredicateOnEdge(unsigned Pred, Value V, Constant C,		Tristate getPredicateOnEdge(unsigned Pred, Value V, Constant C,
BasicBlock FromBB, BasicBlock ToBB,		BasicBlock FromBB, BasicBlock ToBB,
Instruction *CxtI = nullptr);		Instruction *CxtI = nullptr);

/// Determine whether the specified value comparison with a constant is known		/// Determine whether the specified value comparison with a constant is known
/// to be true or false at the specified instruction		/// to be true or false at the specified instruction.
/// (from an assume intrinsic). Pred is a CmpInst predicate.		/// \p Pred is a CmpInst predicate. If \p UseBlockValue is true, the block
		/// value is also taken into account.
Tristate getPredicateAt(unsigned Pred, Value V, Constant C,		Tristate getPredicateAt(unsigned Pred, Value V, Constant C,
Instruction *CxtI);		Instruction *CxtI, bool UseBlockValue = false);

/// Determine whether the specified value is known to be a constant at the		/// Determine whether the specified value is known to be a constant at the
/// specified instruction. Return null if not.		/// specified instruction. Return null if not.
Constant getConstant(Value V, Instruction *CxtI);		Constant getConstant(Value V, Instruction *CxtI);

/// Return the ConstantRange constraint that is known to hold for the		/// Return the ConstantRange constraint that is known to hold for the
/// specified value at the specified instruction. This may only be called		/// specified value at the specified instruction. This may only be called
/// on integer-typed Values.		/// on integer-typed Values.
▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

llvm/lib/Analysis/LazyValueInfo.cpp

Show First 20 Lines • Show All 1,738 Lines • ▼ Show 20 Lines	LazyValueInfo::getPredicateOnEdge(unsigned Pred, Value V, Constant C,
ValueLatticeElement Result =		ValueLatticeElement Result =
getImpl(PImpl, AC, M).getValueOnEdge(V, FromBB, ToBB, CxtI);		getImpl(PImpl, AC, M).getValueOnEdge(V, FromBB, ToBB, CxtI);

return getPredicateResult(Pred, C, Result, M->getDataLayout(), TLI);		return getPredicateResult(Pred, C, Result, M->getDataLayout(), TLI);
}		}

LazyValueInfo::Tristate		LazyValueInfo::Tristate
LazyValueInfo::getPredicateAt(unsigned Pred, Value V, Constant C,		LazyValueInfo::getPredicateAt(unsigned Pred, Value V, Constant C,
Instruction *CxtI) {		Instruction *CxtI, bool UseBlockValue) {
// Is or is not NonNull are common predicates being queried. If		// Is or is not NonNull are common predicates being queried. If
// isKnownNonZero can tell us the result of the predicate, we can		// isKnownNonZero can tell us the result of the predicate, we can
// return it quickly. But this is only a fastpath, and falling		// return it quickly. But this is only a fastpath, and falling
// through would still be correct.		// through would still be correct.
Module *M = CxtI->getModule();		Module *M = CxtI->getModule();
const DataLayout &DL = M->getDataLayout();		const DataLayout &DL = M->getDataLayout();
if (V->getType()->isPointerTy() && C->isNullValue() &&		if (V->getType()->isPointerTy() && C->isNullValue() &&
isKnownNonZero(V->stripPointerCastsSameRepresentation(), DL)) {		isKnownNonZero(V->stripPointerCastsSameRepresentation(), DL)) {
if (Pred == ICmpInst::ICMP_EQ)		if (Pred == ICmpInst::ICMP_EQ)
return LazyValueInfo::False;		return LazyValueInfo::False;
else if (Pred == ICmpInst::ICMP_NE)		else if (Pred == ICmpInst::ICMP_NE)
return LazyValueInfo::True;		return LazyValueInfo::True;
}		}
ValueLatticeElement Result = getImpl(PImpl, AC, M).getValueAt(V, CxtI);
		ValueLatticeElement Result = UseBlockValue
		reamesUnsubmitted Not Done Reply Inline Actions Wait, no, please don't do this. Please don't bake in assumption about the semantics of the function based on the type of the argument. If this difference exists, we should fix/remove it. I've long thought we needed to have a getValueAtBegin(BB) variant. Would that solve your use case? reames: Wait, no, please don't do this. Please don't bake in assumption about the semantics of the…
		nikicAuthorUnsubmitted Done Reply Inline Actions I agree that this is pretty weird. I can take a look at fixing the behavior of pointers, which I believe are in the wrong here (the integer behavior is already heavily embedded in how CVP works). nikic: I agree that this is pretty weird. I can take a look at fixing the behavior of pointers, which…
		? getImpl(PImpl, AC, M).getValueInBlock(V, CxtI->getParent(), CxtI)
		fhahnUnsubmitted Not Done Reply Inline Actions Currently the comment for `getValueInBlock` says it returns the value at the end of the block. IIUC this only refers to the fact that all assumes in the block are used. But if `CxtI` is passed, only assumes dominating `CxtI` are passed, so effectively we get the value at `CxtI`? Might be good to clarify this in the docs. fhahn: Currently the comment for `getValueInBlock` says it returns the value at the end of the block.
		nikicAuthorUnsubmitted Done Reply Inline Actions Right, the comment there is outdated. Prior to D69914 the value was valid either at the start of the block (for integers) or the end of the block (for pointers), now it is always valid at the start (or at the context instruction if given). I've pushed a clarification to those doc comments in https://github.com/llvm/llvm-project/commit/709d03f8af4da4204849a70f01798e7cebba2e32. nikic: Right, the comment there is outdated. Prior to D69914 the value was valid either at the start…
		: getImpl(PImpl, AC, M).getValueAt(V, CxtI);
Tristate Ret = getPredicateResult(Pred, C, Result, DL, TLI);		Tristate Ret = getPredicateResult(Pred, C, Result, DL, TLI);
if (Ret != Unknown)		if (Ret != Unknown)
return Ret;		return Ret;

		lebedev.riUnsubmitted Not Done Reply Inline Actions Should we be getting both, and combining their knowledge somehow? lebedev.ri: Should we be getting both, and combining their knowledge somehow?
		nikicAuthorUnsubmitted Done Reply Inline Actions getValueInBlock() is strictly more powerful, so that should not be necessary. nikic: getValueInBlock() is strictly more powerful, so that should not be necessary.
// Note: The following bit of code is somewhat distinct from the rest of LVI;		// Note: The following bit of code is somewhat distinct from the rest of LVI;
// LVI as a whole tries to compute a lattice value which is conservatively		// LVI as a whole tries to compute a lattice value which is conservatively
// correct at a given location. In this case, we have a predicate which we		// correct at a given location. In this case, we have a predicate which we
// weren't able to prove about the merged result, and we're pushing that		// weren't able to prove about the merged result, and we're pushing that
// predicate back along each incoming edge to see if we can prove it		// predicate back along each incoming edge to see if we can prove it
// separately for each input. As a motivating example, consider:		// separately for each input. As a motivating example, consider:
// bb1:		// bb1:
// %v1 = ... ; constantrange<1, 5>		// %v1 = ... ; constantrange<1, 5>
▲ Show 20 Lines • Show All 180 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/CorrelatedValuePropagation.cpp

	Show First 20 Lines • Show All 298 Lines • ▼ Show 20 Lines
	/// conditions, this can sometimes prove conditions instcombine can't by			/// conditions, this can sometimes prove conditions instcombine can't by
	/// exploiting range information.			/// exploiting range information.
	static bool processCmp(CmpInst Cmp, LazyValueInfo LVI) {			static bool processCmp(CmpInst Cmp, LazyValueInfo LVI) {
	Value *Op0 = Cmp->getOperand(0);			Value *Op0 = Cmp->getOperand(0);
	auto *C = dyn_cast<Constant>(Cmp->getOperand(1));			auto *C = dyn_cast<Constant>(Cmp->getOperand(1));
	if (!C)			if (!C)
	return false;			return false;

	// As a policy choice, we choose not to waste compile time on anything where
	// the comparison is testing local values. While LVI can sometimes reason
	// about such cases, it's not its primary purpose. We do make sure to do
	// the block local query for uses from terminator instructions, but that's
	// handled in the code for each terminator. As an exception, we allow phi
	// nodes, for which LVI can thread the condition into predecessors.
	auto *I = dyn_cast<Instruction>(Op0);
	if (I && I->getParent() == Cmp->getParent() && !isa<PHINode>(I))
	return false;

	LazyValueInfo::Tristate Result =			LazyValueInfo::Tristate Result =
	LVI->getPredicateAt(Cmp->getPredicate(), Op0, C, Cmp);			LVI->getPredicateAt(Cmp->getPredicate(), Op0, C, Cmp,
				/UseBlockValue=/true);
	if (Result == LazyValueInfo::Unknown)			if (Result == LazyValueInfo::Unknown)
	return false;			return false;

	++NumCmps;			++NumCmps;
	Constant *TorF = ConstantInt::get(Type::getInt1Ty(Cmp->getContext()), Result);			Constant *TorF = ConstantInt::get(Type::getInt1Ty(Cmp->getContext()), Result);
	Cmp->replaceAllUsesWith(TorF);			Cmp->replaceAllUsesWith(TorF);
	Cmp->eraseFromParent();			Cmp->eraseFromParent();
	return true;			return true;
	▲ Show 20 Lines • Show All 766 Lines • Show Last 20 Lines

llvm/test/Transforms/CorrelatedValuePropagation/basic.ll

Show First 20 Lines • Show All 289 Lines • ▼ Show 20 Lines
exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

declare nonnull i8* @return_nonnull()		declare nonnull i8* @return_nonnull()
define i1 @call_attribute() {		define i1 @call_attribute() {
; CHECK-LABEL: @call_attribute(		; CHECK-LABEL: @call_attribute(
; CHECK-NEXT: [[A:%.]] = call i8 @return_nonnull()		; CHECK-NEXT: [[A:%.]] = call i8 @return_nonnull()
; CHECK-NEXT: [[CMP:%.]] = icmp eq i8 [[A]], null
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
%a = call i8* @return_nonnull()		%a = call i8* @return_nonnull()
%cmp = icmp eq i8* %a, null		%cmp = icmp eq i8* %a, null
br label %exit		br label %exit

exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

define i1 @umin(i32 %a, i32 %b) {		define i1 @umin(i32 %a, i32 %b) {
; CHECK-LABEL: @umin(		; CHECK-LABEL: @umin(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp ult i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp ult i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[CMP2:%.]] = icmp ult i32 [[B:%.]], 20		; CHECK-NEXT: [[CMP2:%.]] = icmp ult i32 [[B:%.]], 20
; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]		; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]
; CHECK: b_guard:		; CHECK: b_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ult i32 [[A]], [[B]]		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ult i32 [[A]], [[B]]
; CHECK-NEXT: [[MIN:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]		; CHECK-NEXT: [[MIN:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[MIN]], 7
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp ult i32 %a, 5		%cmp = icmp ult i32 %a, 5
Show All 20 Lines
; CHECK-NEXT: [[CMP:%.]] = icmp ult i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp ult i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[CMP2:%.]] = icmp ult i32 [[B:%.]], 20		; CHECK-NEXT: [[CMP2:%.]] = icmp ult i32 [[B:%.]], 20
; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]		; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]
; CHECK: b_guard:		; CHECK: b_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp sle i32 [[A]], [[B]]		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp sle i32 [[A]], [[B]]
; CHECK-NEXT: [[MIN:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]		; CHECK-NEXT: [[MIN:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[MIN]], 7
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp ult i32 %a, 5		%cmp = icmp ult i32 %a, 5
Show All 20 Lines
; CHECK-NEXT: [[CMP:%.]] = icmp sgt i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sgt i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[CMP2:%.]] = icmp sgt i32 [[B:%.]], 20		; CHECK-NEXT: [[CMP2:%.]] = icmp sgt i32 [[B:%.]], 20
; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]		; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]
; CHECK: b_guard:		; CHECK: b_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp sge i32 [[A]], [[B]]		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp sge i32 [[A]], [[B]]
; CHECK-NEXT: [[MAX:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]		; CHECK-NEXT: [[MAX:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[MAX]], 7
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sgt i32 %a, 5		%cmp = icmp sgt i32 %a, 5
Show All 20 Lines
; CHECK-NEXT: [[CMP:%.]] = icmp sgt i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sgt i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[CMP2:%.]] = icmp sgt i32 [[B:%.]], 20		; CHECK-NEXT: [[CMP2:%.]] = icmp sgt i32 [[B:%.]], 20
; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]		; CHECK-NEXT: br i1 [[CMP2]], label [[B_GUARD:%.*]], label [[OUT]]
; CHECK: b_guard:		; CHECK: b_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp uge i32 [[A]], [[B]]		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp uge i32 [[A]], [[B]]
; CHECK-NEXT: [[MAX:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]		; CHECK-NEXT: [[MAX:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 [[B]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[MAX]], 7
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sgt i32 %a, 5		%cmp = icmp sgt i32 %a, 5
Show All 18 Lines
; CHECK-LABEL: @clamp_low1(		; CHECK-LABEL: @clamp_low1(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp sge i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sge i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp eq i32 [[A]], 5		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp eq i32 [[A]], 5
; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], -1		; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], -1
; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 5, i32 [[A]]		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 5, i32 [[A]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[SEL]], 4
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sge i32 %a, 5		%cmp = icmp sge i32 %a, 5
Show All 15 Lines
; CHECK-LABEL: @clamp_low2(		; CHECK-LABEL: @clamp_low2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp sge i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sge i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5
; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], -1		; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], -1
; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[SEL]], 4
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sge i32 %a, 5		%cmp = icmp sge i32 %a, 5
Show All 15 Lines
; CHECK-LABEL: @clamp_high1(		; CHECK-LABEL: @clamp_high1(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp eq i32 [[A]], 5		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp eq i32 [[A]], 5
; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 1		; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 1
; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 5, i32 [[A]]		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 5, i32 [[A]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[SEL]], 6
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sle i32 %a, 5		%cmp = icmp sle i32 %a, 5
Show All 15 Lines
; CHECK-LABEL: @clamp_high2(		; CHECK-LABEL: @clamp_high2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5
; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 1		; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 1
; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[SEL]], 6
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sle i32 %a, 5		%cmp = icmp sle i32 %a, 5
Show All 16 Lines
; CHECK-LABEL: @clamp_high3(		; CHECK-LABEL: @clamp_high3(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5		; CHECK-NEXT: [[CMP:%.]] = icmp sle i32 [[A:%.]], 5
; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]		; CHECK-NEXT: br i1 [[CMP]], label [[A_GUARD:%.]], label [[OUT:%.]]
; CHECK: a_guard:		; CHECK: a_guard:
; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5		; CHECK-NEXT: [[SEL_CMP:%.*]] = icmp ne i32 [[A]], 5
; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 100		; CHECK-NEXT: [[ADD:%.*]] = add nsw i32 [[A]], 100
; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[SEL_CMP]], i32 [[A]], i32 5
; CHECK-NEXT: [[RES:%.*]] = icmp eq i32 [[SEL]], 105
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
; CHECK: out:		; CHECK: out:
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
entry:		entry:
%cmp = icmp sle i32 %a, 5		%cmp = icmp sle i32 %a, 5
▲ Show 20 Lines • Show All 210 Lines • ▼ Show 20 Lines
exit:		exit:
ret void		ret void
}		}

define i1 @zext_unknown(i8 %a) {		define i1 @zext_unknown(i8 %a) {
; CHECK-LABEL: @zext_unknown(		; CHECK-LABEL: @zext_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A32:%.]] = zext i8 [[A:%.]] to i32		; CHECK-NEXT: [[A32:%.]] = zext i8 [[A:%.]] to i32
; CHECK-NEXT: [[CMP:%.*]] = icmp sle i32 [[A32]], 256
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%a32 = zext i8 %a to i32		%a32 = zext i8 %a to i32
%cmp = icmp sle i32 %a32, 256		%cmp = icmp sle i32 %a32, 256
br label %exit		br label %exit
exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

define i1 @trunc_unknown(i32 %a) {		define i1 @trunc_unknown(i32 %a) {
; CHECK-LABEL: @trunc_unknown(		; CHECK-LABEL: @trunc_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A8:%.]] = trunc i32 [[A:%.]] to i8		; CHECK-NEXT: [[A8:%.]] = trunc i32 [[A:%.]] to i8
; CHECK-NEXT: [[A32:%.*]] = sext i8 [[A8]] to i32		; CHECK-NEXT: [[A32:%.*]] = sext i8 [[A8]] to i32
; CHECK-NEXT: [[CMP:%.*]] = icmp sle i32 [[A32]], 128
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%a8 = trunc i32 %a to i8		%a8 = trunc i32 %a to i8
%a32 = sext i8 %a8 to i32		%a32 = sext i8 %a8 to i32
%cmp = icmp sle i32 %a32, 128		%cmp = icmp sle i32 %a32, 128
Show All 38 Lines	exit:
ret i1 %cmp		ret i1 %cmp
}		}


define i1 @and_unknown(i32 %a) {		define i1 @and_unknown(i32 %a) {
; CHECK-LABEL: @and_unknown(		; CHECK-LABEL: @and_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[AND:%.]] = and i32 [[A:%.]], 128		; CHECK-NEXT: [[AND:%.]] = and i32 [[A:%.]], 128
; CHECK-NEXT: [[CMP:%.*]] = icmp sle i32 [[AND]], 128
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%and = and i32 %a, 128		%and = and i32 %a, 128
%cmp = icmp sle i32 %and, 128		%cmp = icmp sle i32 %and, 128
br label %exit		br label %exit
exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

define i1 @lshr_unknown(i32 %a) {		define i1 @lshr_unknown(i32 %a) {
; CHECK-LABEL: @lshr_unknown(		; CHECK-LABEL: @lshr_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[AND:%.]] = lshr i32 [[A:%.]], 30		; CHECK-NEXT: [[AND:%.]] = lshr i32 [[A:%.]], 30
; CHECK-NEXT: [[CMP:%.*]] = icmp sle i32 [[AND]], 128
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%and = lshr i32 %a, 30		%and = lshr i32 %a, 30
%cmp = icmp sle i32 %and, 128		%cmp = icmp sle i32 %and, 128
br label %exit		br label %exit
exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

define i1 @urem_unknown(i32 %a) {		define i1 @urem_unknown(i32 %a) {
; CHECK-LABEL: @urem_unknown(		; CHECK-LABEL: @urem_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[UREM:%.]] = urem i32 [[A:%.]], 30		; CHECK-NEXT: [[UREM:%.]] = urem i32 [[A:%.]], 30
; CHECK-NEXT: [[CMP:%.*]] = icmp ult i32 [[UREM]], 30
; CHECK-NEXT: br label [[EXIT:%.*]]		; CHECK-NEXT: br label [[EXIT:%.*]]
; CHECK: exit:		; CHECK: exit:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%urem = urem i32 %a, 30		%urem = urem i32 %a, 30
%cmp = icmp ult i32 %urem, 30		%cmp = icmp ult i32 %urem, 30
br label %exit		br label %exit
exit:		exit:
ret i1 %cmp		ret i1 %cmp
}		}

define i1 @srem_unknown(i32 %a) {		define i1 @srem_unknown(i32 %a) {
; CHECK-LABEL: @srem_unknown(		; CHECK-LABEL: @srem_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[SREM:%.]] = srem i32 [[A:%.]], 30		; CHECK-NEXT: [[SREM:%.]] = srem i32 [[A:%.]], 30
; CHECK-NEXT: [[CMP1:%.*]] = icmp slt i32 [[SREM]], 30
; CHECK-NEXT: [[CMP2:%.*]] = icmp sgt i32 [[SREM]], -30
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%srem = srem i32 %a, 30		%srem = srem i32 %a, 30
%cmp1 = icmp slt i32 %srem, 30		%cmp1 = icmp slt i32 %srem, 30
%cmp2 = icmp sgt i32 %srem, -30		%cmp2 = icmp sgt i32 %srem, -30
br i1 undef, label %exit1, label %exit2		br i1 undef, label %exit1, label %exit2
exit1:		exit1:
ret i1 %cmp1		ret i1 %cmp1
exit2:		exit2:
ret i1 %cmp2		ret i1 %cmp2
}		}

define i1 @sdiv_unknown(i32 %a) {		define i1 @sdiv_unknown(i32 %a) {
; CHECK-LABEL: @sdiv_unknown(		; CHECK-LABEL: @sdiv_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[SREM:%.]] = sdiv i32 [[A:%.]], 123		; CHECK-NEXT: [[SREM:%.]] = sdiv i32 [[A:%.]], 123
; CHECK-NEXT: [[CMP1:%.*]] = icmp slt i32 [[SREM]], 17459217
; CHECK-NEXT: [[CMP2:%.*]] = icmp sgt i32 [[SREM]], -17459217
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
entry:		entry:
%srem = sdiv i32 %a, 123		%srem = sdiv i32 %a, 123
%cmp1 = icmp slt i32 %srem, 17459217		%cmp1 = icmp slt i32 %srem, 17459217
%cmp2 = icmp sgt i32 %srem, -17459217		%cmp2 = icmp sgt i32 %srem, -17459217
br i1 undef, label %exit1, label %exit2		br i1 undef, label %exit1, label %exit2
exit1:		exit1:
ret i1 %cmp1		ret i1 %cmp1
exit2:		exit2:
ret i1 %cmp2		ret i1 %cmp2
}		}

define i1 @uadd_sat_unknown(i32 %a) {		define i1 @uadd_sat_unknown(i32 %a) {
; CHECK-LABEL: @uadd_sat_unknown(		; CHECK-LABEL: @uadd_sat_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.uadd.sat.i32(i32 [[A:%.]], i32 100)		; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.uadd.sat.i32(i32 [[A:%.]], i32 100)
; CHECK-NEXT: [[CMP1:%.*]] = icmp uge i32 [[VAL]], 100
; CHECK-NEXT: [[CMP2:%.*]] = icmp ugt i32 [[VAL]], 100		; CHECK-NEXT: [[CMP2:%.*]] = icmp ugt i32 [[VAL]], 100
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 [[CMP2]]		; CHECK-NEXT: ret i1 [[CMP2]]
;		;
entry:		entry:
%val = call i32 @llvm.uadd.sat.i32(i32 %a, i32 100)		%val = call i32 @llvm.uadd.sat.i32(i32 %a, i32 100)
%cmp1 = icmp uge i32 %val, 100		%cmp1 = icmp uge i32 %val, 100
%cmp2 = icmp ugt i32 %val, 100		%cmp2 = icmp ugt i32 %val, 100
br i1 undef, label %exit1, label %exit2		br i1 undef, label %exit1, label %exit2
exit1:		exit1:
ret i1 %cmp1		ret i1 %cmp1
exit2:		exit2:
ret i1 %cmp2		ret i1 %cmp2
}		}

define i1 @usub_sat_unknown(i32 %a) {		define i1 @usub_sat_unknown(i32 %a) {
; CHECK-LABEL: @usub_sat_unknown(		; CHECK-LABEL: @usub_sat_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.usub.sat.i32(i32 [[A:%.]], i32 100)		; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.usub.sat.i32(i32 [[A:%.]], i32 100)
; CHECK-NEXT: [[CMP1:%.*]] = icmp ule i32 [[VAL]], -101
; CHECK-NEXT: [[CMP2:%.*]] = icmp ult i32 [[VAL]], -101		; CHECK-NEXT: [[CMP2:%.*]] = icmp ult i32 [[VAL]], -101
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 [[CMP2]]		; CHECK-NEXT: ret i1 [[CMP2]]
;		;
entry:		entry:
%val = call i32 @llvm.usub.sat.i32(i32 %a, i32 100)		%val = call i32 @llvm.usub.sat.i32(i32 %a, i32 100)
%cmp1 = icmp ule i32 %val, 4294967195		%cmp1 = icmp ule i32 %val, 4294967195
%cmp2 = icmp ult i32 %val, 4294967195		%cmp2 = icmp ult i32 %val, 4294967195
br i1 undef, label %exit1, label %exit2		br i1 undef, label %exit1, label %exit2
exit1:		exit1:
ret i1 %cmp1		ret i1 %cmp1
exit2:		exit2:
ret i1 %cmp2		ret i1 %cmp2
}		}

define i1 @sadd_sat_unknown(i32 %a) {		define i1 @sadd_sat_unknown(i32 %a) {
; CHECK-LABEL: @sadd_sat_unknown(		; CHECK-LABEL: @sadd_sat_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.sadd.sat.i32(i32 [[A:%.]], i32 100)		; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.sadd.sat.i32(i32 [[A:%.]], i32 100)
; CHECK-NEXT: [[CMP1:%.*]] = icmp sge i32 [[VAL]], -2147483548
; CHECK-NEXT: [[CMP2:%.*]] = icmp sgt i32 [[VAL]], -2147483548		; CHECK-NEXT: [[CMP2:%.*]] = icmp sgt i32 [[VAL]], -2147483548
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 [[CMP2]]		; CHECK-NEXT: ret i1 [[CMP2]]
;		;
entry:		entry:
%val = call i32 @llvm.sadd.sat.i32(i32 %a, i32 100)		%val = call i32 @llvm.sadd.sat.i32(i32 %a, i32 100)
%cmp1 = icmp sge i32 %val, -2147483548		%cmp1 = icmp sge i32 %val, -2147483548
%cmp2 = icmp sgt i32 %val, -2147483548		%cmp2 = icmp sgt i32 %val, -2147483548
br i1 undef, label %exit1, label %exit2		br i1 undef, label %exit1, label %exit2
exit1:		exit1:
ret i1 %cmp1		ret i1 %cmp1
exit2:		exit2:
ret i1 %cmp2		ret i1 %cmp2
}		}

define i1 @ssub_sat_unknown(i32 %a) {		define i1 @ssub_sat_unknown(i32 %a) {
; CHECK-LABEL: @ssub_sat_unknown(		; CHECK-LABEL: @ssub_sat_unknown(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.ssub.sat.i32(i32 [[A:%.]], i32 100)		; CHECK-NEXT: [[VAL:%.]] = call i32 @llvm.ssub.sat.i32(i32 [[A:%.]], i32 100)
; CHECK-NEXT: [[CMP1:%.*]] = icmp sle i32 [[VAL]], 2147483547
; CHECK-NEXT: [[CMP2:%.*]] = icmp slt i32 [[VAL]], 2147483547		; CHECK-NEXT: [[CMP2:%.*]] = icmp slt i32 [[VAL]], 2147483547
; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]		; CHECK-NEXT: br i1 undef, label [[EXIT1:%.]], label [[EXIT2:%.]]
; CHECK: exit1:		; CHECK: exit1:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
; CHECK: exit2:		; CHECK: exit2:
; CHECK-NEXT: ret i1 [[CMP2]]		; CHECK-NEXT: ret i1 [[CMP2]]
;		;
entry:		entry:
Show All 14 Lines

llvm/test/Transforms/CorrelatedValuePropagation/icmp.ll

	Show First 20 Lines • Show All 375 Lines • ▼ Show 20 Lines
	}			}

	define i1 @test12(i32 %x) {			define i1 @test12(i32 %x) {
	; CHECK-LABEL: @test12(			; CHECK-LABEL: @test12(
	; CHECK-NEXT: [[ZEXT:%.]] = zext i32 [[X:%.]] to i64			; CHECK-NEXT: [[ZEXT:%.]] = zext i32 [[X:%.]] to i64
	; CHECK-NEXT: [[MUL:%.*]] = mul nuw nsw i64 [[ZEXT]], 7			; CHECK-NEXT: [[MUL:%.*]] = mul nuw nsw i64 [[ZEXT]], 7
	; CHECK-NEXT: [[SHR:%.*]] = lshr i64 [[MUL]], 32			; CHECK-NEXT: [[SHR:%.*]] = lshr i64 [[MUL]], 32
	; CHECK-NEXT: [[TRUNC:%.*]] = trunc i64 [[SHR]] to i32			; CHECK-NEXT: [[TRUNC:%.*]] = trunc i64 [[SHR]] to i32
	; CHECK-NEXT: [[CMP:%.*]] = icmp ult i32 [[TRUNC]], 7			; CHECK-NEXT: ret i1 true
	; CHECK-NEXT: ret i1 [[CMP]]
	;			;
	%zext = zext i32 %x to i64			%zext = zext i32 %x to i64
	%mul = mul nuw i64 %zext, 7			%mul = mul nuw i64 %zext, 7
	%shr = lshr i64 %mul, 32			%shr = lshr i64 %mul, 32
	%trunc = trunc i64 %shr to i32			%trunc = trunc i64 %shr to i32
	%cmp = icmp ult i32 %trunc, 7			%cmp = icmp ult i32 %trunc, 7
	ret i1 %cmp			ret i1 %cmp
	}			}

	define i1 @test13(i8 %x, i64* %p) {			define i1 @test13(i8 %x, i64* %p) {
	; CHECK-LABEL: @test13(			; CHECK-LABEL: @test13(
	; CHECK-NEXT: [[ZEXT:%.]] = zext i8 [[X:%.]] to i64			; CHECK-NEXT: [[ZEXT:%.]] = zext i8 [[X:%.]] to i64
	; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i64 [[ZEXT]], 128			; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i64 [[ZEXT]], 128
	; CHECK-NEXT: [[CMP:%.*]] = icmp ult i64 [[ADD]], 384
	; CHECK-NEXT: store i64 [[ADD]], i64* [[P:%.*]], align 8			; CHECK-NEXT: store i64 [[ADD]], i64* [[P:%.*]], align 8
	; CHECK-NEXT: ret i1 [[CMP]]			; CHECK-NEXT: ret i1 true
	;			;
	%zext = zext i8 %x to i64			%zext = zext i8 %x to i64
	%add = add nuw nsw i64 %zext, 128			%add = add nuw nsw i64 %zext, 128
	%cmp = icmp ult i64 %add, 384			%cmp = icmp ult i64 %add, 384
	; Without this extra use, InstSimplify could handle this			; Without this extra use, InstSimplify could handle this
	store i64 %add, i64* %p			store i64 %add, i64* %p
	ret i1 %cmp			ret i1 %cmp
	}			}
	▲ Show 20 Lines • Show All 212 Lines • ▼ Show 20 Lines

	define i1 @test_br_cmp_with_offset(i64 %idx) {			define i1 @test_br_cmp_with_offset(i64 %idx) {
	; CHECK-LABEL: @test_br_cmp_with_offset(			; CHECK-LABEL: @test_br_cmp_with_offset(
	; CHECK-NEXT: [[IDX_OFF1:%.]] = add i64 [[IDX:%.]], -5			; CHECK-NEXT: [[IDX_OFF1:%.]] = add i64 [[IDX:%.]], -5
	; CHECK-NEXT: [[CMP1:%.*]] = icmp ult i64 [[IDX_OFF1]], 3			; CHECK-NEXT: [[CMP1:%.*]] = icmp ult i64 [[IDX_OFF1]], 3
	; CHECK-NEXT: br i1 [[CMP1]], label [[IF_TRUE:%.]], label [[IF_FALSE:%.]]			; CHECK-NEXT: br i1 [[CMP1]], label [[IF_TRUE:%.]], label [[IF_FALSE:%.]]
	; CHECK: if.true:			; CHECK: if.true:
	; CHECK-NEXT: [[IDX_OFF2:%.*]] = add nsw i64 [[IDX]], -1			; CHECK-NEXT: [[IDX_OFF2:%.*]] = add nsw i64 [[IDX]], -1
	; CHECK-NEXT: [[CMP2:%.*]] = icmp ult i64 [[IDX_OFF2]], 10			; CHECK-NEXT: ret i1 true
	; CHECK-NEXT: ret i1 [[CMP2]]
	; CHECK: if.false:			; CHECK: if.false:
	; CHECK-NEXT: ret i1 undef			; CHECK-NEXT: ret i1 undef
	;			;
	%idx.off1 = add i64 %idx, -5			%idx.off1 = add i64 %idx, -5
	%cmp1 = icmp ult i64 %idx.off1, 3			%cmp1 = icmp ult i64 %idx.off1, 3
	br i1 %cmp1, label %if.true, label %if.false			br i1 %cmp1, label %if.true, label %if.false

	if.true:			if.true:
	▲ Show 20 Lines • Show All 522 Lines • Show Last 20 Lines

llvm/test/Transforms/CorrelatedValuePropagation/range.ll

Show First 20 Lines • Show All 235 Lines • ▼ Show 20 Lines	sw.default:
%or = or i1 %cmp5, %cmp6		%or = or i1 %cmp5, %cmp6
%or2 = or i1 %cmp7, %cmp8		%or2 = or i1 %cmp7, %cmp8
ret i1 false		ret i1 false
}		}

define i1 @test8(i64* %p) {		define i1 @test8(i64* %p) {
; CHECK-LABEL: @test8(		; CHECK-LABEL: @test8(
; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG0:!range !.]]		; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG0:!range !.]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i64 [[A]], 0
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
%a = load i64, i64* %p, !range !{i64 4, i64 255}		%a = load i64, i64* %p, !range !{i64 4, i64 255}
%res = icmp eq i64 %a, 0		%res = icmp eq i64 %a, 0
ret i1 %res		ret i1 %res
}		}

define i1 @test9(i64* %p) {		define i1 @test9(i64* %p) {
; CHECK-LABEL: @test9(		; CHECK-LABEL: @test9(
; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG1:!range !.]]		; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG1:!range !.]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i64 [[A]], 0
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
%a = load i64, i64* %p, !range !{i64 0, i64 1}		%a = load i64, i64* %p, !range !{i64 0, i64 1}
%res = icmp eq i64 %a, 0		%res = icmp eq i64 %a, 0
ret i1 %res		ret i1 %res
}		}

define i1 @test10(i64* %p) {		define i1 @test10(i64* %p) {
; CHECK-LABEL: @test10(		; CHECK-LABEL: @test10(
; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG2:!range !.]]		; CHECK-NEXT: [[A:%.]] = load i64, i64 [[P:%.]], align 4, [[RNG2:!range !.]]
; CHECK-NEXT: [[RES:%.*]] = icmp eq i64 [[A]], 0
; CHECK-NEXT: ret i1 false		; CHECK-NEXT: ret i1 false
;		;
%a = load i64, i64* %p, !range !{i64 4, i64 8, i64 15, i64 20}		%a = load i64, i64* %p, !range !{i64 4, i64 8, i64 15, i64 20}
%res = icmp eq i64 %a, 0		%res = icmp eq i64 %a, 0
ret i1 %res		ret i1 %res
}		}

@g = external global i32		@g = external global i32

define i1 @test11() {		define i1 @test11() {
; CHECK-LABEL: @test11(		; CHECK-LABEL: @test11(
; CHECK-NEXT: [[POSITIVE:%.]] = load i32, i32 @g, align 4, [[RNG3:!range !.*]]		; CHECK-NEXT: [[POSITIVE:%.]] = load i32, i32 @g, align 4, [[RNG3:!range !.*]]
; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i32 [[POSITIVE]], 1		; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i32 [[POSITIVE]], 1
; CHECK-NEXT: [[TEST:%.*]] = icmp sgt i32 [[ADD]], 0
; CHECK-NEXT: br label [[NEXT:%.*]]		; CHECK-NEXT: br label [[NEXT:%.*]]
; CHECK: next:		; CHECK: next:
; CHECK-NEXT: ret i1 true		; CHECK-NEXT: ret i1 true
;		;
%positive = load i32, i32* @g, !range !{i32 1, i32 2048}		%positive = load i32, i32* @g, !range !{i32 1, i32 2048}
%add = add i32 %positive, 1		%add = add i32 %positive, 1
%test = icmp sgt i32 %add, 0		%test = icmp sgt i32 %add, 0
br label %next		br label %next
▲ Show 20 Lines • Show All 653 Lines • Show Last 20 Lines