This is an archive of the discontinued LLVM Phabricator instance.

Differential D141850

[SCEV] Preserve divisibility and min/max information in applyLoopGuards
ClosedPublic

Authored by alonkom on Jan 16 2023, 7:01 AM.

Details

Reviewers

fhahn
mkazantsev

Commits

rG219ba2fb7b0a: [SCEV] Preserve divisibility and min/max information in applyLoopGuards

Summary

applyLoopGuards doesn't always preserve information when there are multiple assumes.
This patch tries to deal with multiple assumes regarding a SCEV's divisibility and min/max values, and rewrites it into a SCEV that still preserves all of the information.
For example, let the trip count of the loop be TC. Consider the following three assumes:

__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);
__builtin_assume(TC < 100);

Before this patch, depending on the assume processing order applyLoopGuards could create the following SCEV:
max(min((8 * (TC / 8)) , 99), 1)

Looking at this SCEV, it doesn't preserve the divisibility by 8 information.
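One way to see the loss is to evaluate the pre-patch expression with plain unsigned arithmetic. This is only a sketch: `prePatchGuarded` is a hypothetical helper that models the SCEV numerically, and it is not LLVM code.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical numeric model (not an LLVM API) of the pre-patch SCEV
//   max(min(8 * (TC / 8), 99), 1)
// evaluated with unsigned integer arithmetic.
constexpr uint64_t prePatchGuarded(uint64_t TC) {
  uint64_t Multiple = 8 * (TC / 8);                     // from TC % 8 == 0
  uint64_t Clamped = std::min<uint64_t>(Multiple, 99);  // from TC < 100
  return std::max<uint64_t>(Clamped, 1);                // from TC > 0
}
```

In this model an input of 200 evaluates to 99 and an input of 0 evaluates to 1. Neither is a multiple of 8, so a consumer that only inspects the outer min/max cannot conclude divisibility by 8.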

After this patch, depending on the assume processing order applyLoopGuards could create the following SCEV:
max(min((8 * (TC / 8)) , 96), 8)

By aligning up 1 to 8, and aligning down 99 to 96, the new SCEV still preserves all of the original assumes.
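The alignment step can be sketched with ordinary integers. The helper names below (`alignUp`, `alignDown`, `postPatchGuarded`, `preservesAllGuards`) are hypothetical and only model the arithmetic; the patch itself works on SCEV constants via `APInt`.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helpers, not LLVM APIs: round a bound to a multiple of D (D > 0).
constexpr uint64_t alignUp(uint64_t V, uint64_t D) {
  return V % D == 0 ? V : V + D - V % D;  // Expr + Divisor - Expr % Divisor
}
constexpr uint64_t alignDown(uint64_t V, uint64_t D) {
  return V - V % D;                       // Expr - Expr % Divisor
}

// Numeric model of the post-patch SCEV max(min(8 * (TC / 8), 96), 8).
constexpr uint64_t postPatchGuarded(uint64_t TC) {
  uint64_t Multiple = 8 * (TC / 8);
  uint64_t Clamped = std::min(Multiple, alignDown(99, 8));  // bound becomes 96
  return std::max(Clamped, alignUp(1, 8));                  // bound becomes 8
}

// Check every input up to Limit: the result must be a multiple of 8 that
// still lies strictly between 0 and 100.
constexpr bool preservesAllGuards(uint64_t Limit) {
  for (uint64_t TC = 0; TC <= Limit; ++TC) {
    uint64_t R = postPatchGuarded(TC);
    if (R % 8 != 0 || R < 1 || R > 99)
      return false;
  }
  return true;
}
```

Here `alignUp(1, 8)` gives 8 and `alignDown(99, 8)` gives 96, matching the constants in the SCEV above.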

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

alonkom created this revision. Jan 16 2023, 7:01 AM

Herald added a project: Restricted Project. Jan 16 2023, 7:01 AM

Herald added a subscriber: hiraditya.

alonkom requested review of this revision. Jan 16 2023, 7:01 AM

Herald added a project: Restricted Project. Jan 16 2023, 7:01 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B208049: Diff 489545. Jan 16 2023, 8:14 AM

What is the relation to https://reviews.llvm.org/D128701? Seems maybe you should close the other one.

Can you make the commit title more specific, and add more info in the commit message? In particular, I would like to see something like:

after applyLoopGuards, turns <A> into <B>

where A and B are SCEV expressions.

Other information like, what the problem was before, and how you address it, is also helpful. I would suggest looking at the git log of ScalarEvolution.cpp for examples of descriptive commit messages. This will save the reviewers time, and will be helpful when people look at the git log/blame.

alonkom retitled this revision from [SCEV] Preserve information in applyLoopGuards to [SCEV] Preserve divisibility and min/max information in applyLoopGuards. Jan 17 2023, 4:24 AM

alonkom edited the summary of this revision.

Herald added a subscriber: javed.absar. Jan 17 2023, 4:24 AM

In D141850#4057996, @caojoshua wrote:
What is the relation to https://reviews.llvm.org/D128701? Seems maybe you should close the other one.

Can you make the commit title more specific, and add more info in the commit message? In particular, I would like to see something like:
after applyLoopGuards, turns <A> into <B>
where A and B are SCEV expressions.

Other information like, what the problem was before, and how you address it, is also helpful. I would suggest looking at the git log of ScalarEvolution.cpp for examples of descriptive commit messages. This will save the reviewers time, and will be helpful when people look at the git log/blame.

Thanks, updated the commit message.

alonkom updated this revision to Diff 490073. Jan 18 2023, 1:33 AM

alonkom added a reviewer: fhahn.

Herald added a subscriber: StephenFan. Jan 18 2023, 1:33 AM

Harbormaster completed remote builds in B208432: Diff 490073. Jan 18 2023, 2:32 AM

The improved trip multiples from the test results look good. Ordering in applyLoopGuards is an issue. However, I think we can simplify this down a bit. What if we always applied min/max first, before we apply divisibility guards? For example, given:

__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);
__builtin_assume(TC < 100);

apply max: umax(1, TC)
apply min: umin(100, umax(1, TC))
apply divisibility info: 8 * (umin(100, umax(1, TC))) / 8

This makes divisibility info obvious. And traversing the SCEV, we can still see TC > 0 and TC < 100. I believe that if we always apply max/min first, we will never lose this info. This approach seems much simpler and easier to understand.

FYI. I haven't been around that long, and this change is non-trivial enough where I prefer a longstanding developer to give the final approval. I will still be around to give my thoughts.

llvm/lib/Analysis/ScalarEvolution.cpp
15042	SequentialMinMax SCEVs can be applied here as well.
15043	unused var
15047	Constant should always be on the left side. Lets not change that.
15062	Have not thought about it too deeply yet, but I'm concerned that we may need to take a look at when this is legal given Expr's NoWrapFlags. Same for the equivalent getMinusSCEV below
15171	Shouldn't swap the constant. Assume constant is on left.

caojoshua added a reviewer: mkazantsev. Jan 19 2023, 12:32 AM

In D141850#4064477, @caojoshua wrote:
The improved trip multiples from the test results look good. Ordering in applyLoopGuards is an issue. However, I think we can simplify this down a bit. What if we always applied min/max first, before we apply divisibility guards? For example, given:
__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);
__builtin_assume(TC < 100);
apply max: umax(1, TC)

apply min: umin(100, umax(1, TC))

apply divisibility info: 8 * (umin(100, umax(1, TC))) / 8

This makes divisibility info obvious. And traversing the SCEV, we can still see TC > 0 and TC < 100. I believe that if we always apply max/min first, we will never lose this info. This approach seems much simpler and easier to understand.

FYI. I haven't been around that long, and this change is non-trivial enough where I prefer a longstanding developer to give the final approval. I will still be around to give my thoughts.

Thanks for the reply.
This works in this example, but what would happen if we had only these assumes:

__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);

In that case, we would have created the following SCEV:

8 * (umax(1, TC) / 8)

since umax(1, TC) may still be between [1,7], when we divide it by 8, and multiply by 8, we get 0. So this doesn't preserve the fact that TC > 0.
In order to do that we must align up 1 to 8, and then umax(8, TC) / 8 is always > 0.
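The point about the lower bound can be checked numerically. The two functions below are hypothetical models of the rewrites under discussion, written with plain unsigned arithmetic rather than SCEV objects.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical numeric models, not LLVM code.
// 8 * (umax(1, TC) / 8): the bound 1 is not a multiple of 8.
constexpr uint64_t unalignedMax(uint64_t TC) {
  return 8 * (std::max<uint64_t>(1, TC) / 8);
}
// 8 * (umax(8, TC) / 8): the bound has been aligned up to 8.
constexpr uint64_t alignedMax(uint64_t TC) {
  return 8 * (std::max<uint64_t>(8, TC) / 8);
}
```

With these models, `unalignedMax(5)` is 0 while `alignedMax(5)` is 8, so only the aligned form keeps the result positive for every input.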

In D141850#4064581, @alonkom wrote:
In D141850#4064477, @caojoshua wrote:
The improved trip multiples from the test results look good. Ordering in applyLoopGuards is an issue. However, I think we can simplify this down a bit. What if we always applied min/max first, before we apply divisibility guards? For example, given:
__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);
__builtin_assume(TC < 100);
apply max: umax(1, TC)

apply min: umin(100, umax(1, TC))

apply divisibility info: 8 * (umin(100, umax(1, TC))) / 8

This makes divisibility info obvious. And traversing the SCEV, we can still see TC > 0 and TC < 100. I believe that if we always apply max/min first, we will never lose this info. This approach seems much simpler and easier to understand.

FYI. I haven't been around that long, and this change is non-trivial enough where I prefer a longstanding developer to give the final approval. I will still be around to give my thoughts.
Thanks for the reply.
This works in this example, but what would happen if we had only these assumes:
__builtin_assume(TC % 8 == 0);
__builtin_assume(TC > 0);
In that case, we would have created the following SCEV:
8 * (umax(1, TC) / 8)
since umax(1, TC) may still be between [1,7], when we divide it by 8, and multiply by 8, we get 0. So this doesn't preserve the fact that TC > 0.
In order to do that we must align up 1 to 8, and then umax(8, TC) / 8 is always > 0.

umax(1, TC) implies TC <= 1, which holds in this case.

I think what you meant is in the case that, TC > 0, we get: 8 * (umin(1, TC)) / 8. If TC is in [0, 7], the expression evaluates to 0, and TC > 0 is lost.

Your point comes across that we can't just apply guards in a certain order. I have some thoughts, but I'll think about it and write it down later.

After this patch, depending on the assume processing order applyLoopGuards could create the following SCEV:
max(min((8 * (TC / 8)) , 96), 8)

This example looks wrong. I think min/max should be switched. Should be

min(max((8 * (TC / 8)), 96), 8)

Please update description.

In terms of overall approach, I'm not sure. Feels a bit hacky to have custom logic to check that an expressions is a min/max of mul/div. I'll let others chime in here.

llvm/lib/Analysis/ScalarEvolution.cpp
15163	typo: wether. I'm not sure what `the divisor B in \p DividesBy` means. I think this paragraph needs to be more clear. What does it mean to be composed on Min/Max SCEVs?

Some nits, and potentially a bug in formula.

llvm/lib/Analysis/ScalarEvolution.cpp
15042	I'd prefer it to be a separate patch, if it's legal at all. Need to check carefully how poison flows in these formulae.
15045	MinMax can have more than 2 operands. Check that it is exactly 2 of them?
15054	greater or equal
15057	If I'm reading this correctly, this is a guarantee of no-overflow for computations you are doing. Maybe add this comment explicitly?
15062	This computation is inconsistent. If `Rem` is constant `0`, you'll return `Expr`. But if `Rem` is effectively zero, but not a constant (e.g. some complex expression which is always zero), you return `Expr + Divisor`. Bug?
15068	less or equal
15076	I think you can safely drop check for `Rem->isZero()` here as it will be trivially simplified away in `getMinusSCEV`
15084	auto
15165	auto
15179	`if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr))`
15188	auto

This revision now requires changes to proceed. Jan 24 2023, 1:54 AM

In D141850#4072675, @caojoshua wrote:
After this patch, depending on the assume processing order applyLoopGuards could create the following SCEV:
max(min((8 * (TC / 8)) , 96), 8)
This example looks wrong. I think min/max should be switched. Should be
min(max((8 * (TC / 8)), 96), 8)
Please update description.

In terms of overall approach, I'm not sure. Feels a bit hacky to have custom logic to check that an expressions is a min/max of mul/div. I'll let others chime in here.

I think the example is correct.
Even before this patch:
TC > 0 is translated to max (TC, 1)
TC < 99 is translated to min (TC, 98)

llvm/lib/Analysis/ScalarEvolution.cpp
15042	They are not created in this function, so I prefer only handling MinMax at this point.
15042	can you explain what isn't legal here? just checking if this is a min/max of 2 operands, when the 2nd one is constant.
15043	it is used as an output of the function
15057	I will only handle constants for now.
15062	I will only support constants for now
15062	I will only support constants for now
15084	can't use auto since there's a recursive call here
15163	rephrased it a bit.
15165	can't use auto since there's a recursive call here
15188	can't use auto, due to recursive call.

alonkom updated this revision to Diff 493504. Jan 31 2023, 12:28 AM

alonkom marked 5 inline comments as done.

Harbormaster completed remote builds in B210922: Diff 493504. Jan 31 2023, 12:56 AM

mkazantsev added inline comments. Jan 31 2023, 8:46 AM

llvm/lib/Analysis/ScalarEvolution.cpp
15042	I mean "I would prefer adding sequential min/max in a separate patch". Meaning, so far so good here.

mkazantsev added inline comments. Jan 31 2023, 8:50 AM

llvm/lib/Analysis/ScalarEvolution.cpp
15165	`auto HasDivisibiltyInfo = [&](const SCEV *Expr, const SCEV *&DividesBy) -> bool {...` ?

I think the example is correct.
Even before this patch:
TC > 0 is translated to max (TC, 1)
TC < 99 is translated to min (TC, 98)

My mistake. You're correct.

alonkom marked an inline comment as done. Jan 31 2023, 11:45 PM

alonkom added inline comments.

llvm/lib/Analysis/ScalarEvolution.cpp
15165	I'm still getting this build error: use of ‘HasDivisibiltyInfo’ before deduction of ‘auto’
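The build error quoted above is standard C++ behavior: a variable declared with `auto` cannot be named inside its own initializer, so a lambda that calls itself needs a declared type such as `std::function`. A minimal sketch, using a hypothetical `countHalvings` helper rather than the patch's `HasDivisibiltyInfo`:

```cpp
#include <cassert>
#include <functional>

// Hypothetical example, unrelated to SCEV: count how often N can be halved
// before reaching 1. The recursion is the only point of interest.
int countHalvings(int N) {
  assert(N >= 1 && "expects a positive input");
  // Writing `auto Halve = [&](int V) { ... Halve(V / 2) ... };` is rejected,
  // because Halve's type is still being deduced inside its own initializer.
  // Giving the variable an explicit std::function type avoids that.
  std::function<int(int)> Halve = [&](int V) -> int {
    return V <= 1 ? 0 : 1 + Halve(V / 2);
  };
  return Halve(N);
}
```

The trade-off is the usual one for `std::function`: it adds type erasure and an indirect call, which is generally acceptable for a small helper like this.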

mkazantsev added inline comments. Feb 2 2023, 9:59 PM

llvm/lib/Analysis/ScalarEvolution.cpp
15224	How do you account for `RHS` being `SINT_MIN` and similar cases here?

mkazantsev added inline comments. Feb 2 2023, 10:05 PM

llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll
397	Please precommit tests as is, I want to see what exactly this patch changes.

mkazantsev added inline comments. Feb 2 2023, 10:10 PM

llvm/lib/Analysis/ScalarEvolution.cpp
15216	`auto *`

alonkom marked an inline comment as done. Feb 5 2023, 3:58 AM

alonkom added inline comments.

llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll
397	https://reviews.llvm.org/D143337

I think supporting smin/smax is a bug, this code is only written for unsigned values.

llvm/lib/Analysis/ScalarEvolution.cpp
15042	nit: `auto *`
15064	Do you check bit width anywhere? What if it doesn't fit into `unsigned`? Better use APInt unless you have a reason not to.
15100	Init with nullptr, makes it easier to debug.
15105	Do you really support `smin` here? Your code is only correct for unsigned values. It interprets all values as non-negative.

This revision now requires changes to proceed. Feb 5 2023, 11:12 PM

alonkom marked an inline comment as done. Feb 6 2023, 5:08 AM

alonkom added inline comments.

llvm/lib/Analysis/ScalarEvolution.cpp
15224	I wonder how this worked before my patch. if we have assume (N < SINT_MIN) then the following SCEV was generated: smin(N, SINT_MIN - 1) which would overflow.

alonkom updated this revision to Diff 496081. Feb 9 2023, 4:07 AM

alonkom marked 3 inline comments as done.

alonkom added inline comments.

llvm/lib/Analysis/ScalarEvolution.cpp
15105	I check if the operands are non-negative inside GetPreviousSCEV/GetNextSCEVD

Harbormaster completed remote builds in B212773: Diff 496081. Feb 9 2023, 5:43 AM

Mostly looks good, but I think I found a bug in IsMinMaxSCEVWithConstant . Also please rebase on top of your tests.

llvm/lib/Analysis/ScalarEvolution.cpp
15048	If I remember correctly, SCEV operands are always sorted by kind. It means that constants always go first. `RHS` can't be a constant because then `LHS` is also a constant and the whole thing should be folded away. So it should be trivially `false`, and I guess this code part isn't covered by tests if it wasn't caught. Usually matcher methods aren't supposed to change operands if they return `false`. In your case it doesn't matter, but generally better to follow the standard practice and do all checks before modifying the operands.
15105	Ah ok, then it makes sense. Can we assert on that here?
15196	`if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr)) { ...`
15224	Well, if there is a check somewhere that `RHS` is non-negative, then it makes sense. Otherwise it's possibly broken. :)
llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll
397	Rebase patch on top of it?

This revision now requires changes to proceed. Feb 10 2023, 1:04 AM

alonkom updated this revision to Diff 496757. Feb 12 2023, 5:49 AM

alonkom marked 3 inline comments as done.

Harbormaster completed remote builds in B213275: Diff 496757. Feb 12 2023, 6:41 AM

@mkazantsev Let me know if you have any other comments

llvm/lib/Analysis/ScalarEvolution.cpp
15048	Nice catch! It didn't hurt the divisibility info, but some min/max info was lost. I fixed this and added a unit test, since the lit test only checks the divisibility. Done.
15224	Anyway, the change I've made only applies to non-negative values

Ok, thanks, let's give it a try.

llvm/lib/Analysis/ScalarEvolution.cpp
15069	nit: `{}` not needed
15081–15088	This code is copy-paste-ish (see lambda above), maybe factor out?
15095	If you see any problems with compile time from this patch, this is a potential place where things can be limited. Not sure if it's worth it in practice.
15108	nit: `auto*`
15135	Is it possible that the map contains both `LHSUnknown -> RewrittenLHS` and `RewrittenLHS -> SomeOtherLHS`? Should this be a loop? I'm ok if it's a separate patch.

This revision is now accepted and ready to land. Feb 20 2023, 9:52 PM

alonkom updated this revision to Diff 499115. Feb 21 2023, 4:59 AM

alonkom marked 3 inline comments as done.

Harbormaster completed remote builds in B214980: Diff 499115. Feb 21 2023, 6:09 AM

alonkom updated this revision to Diff 499375. Feb 21 2023, 11:07 PM

Harbormaster completed remote builds in B215165: Diff 499375. Feb 22 2023, 12:16 AM

alonkom added inline comments. Feb 23 2023, 12:02 AM

llvm/lib/Analysis/ScalarEvolution.cpp
15095	I don't expect more than 3 assumes per value in the real-world, so I don't believe it'll affect compilation time drastically, but let's see.
15135	The entire function assumes 1 level of nesting

Closed by commit rG219ba2fb7b0a: [SCEV] Preserve divisibility and min/max information in applyLoopGuards (authored by alonkom, committed by komalon1 <alon.kom@mobileye.com>). Feb 23 2023, 1:16 AM

This revision was automatically updated to reflect the committed changes.

komalon1 <alon.kom@mobileye.com> added a commit: rG219ba2fb7b0a: [SCEV] Preserve divisibility and min/max information in applyLoopGuards.

komalon1 <alon.kom@mobileye.com> added a reverting change: rG02e08d06aac9: Revert "[SCEV] Preserve divisibility and min/max information in applyLoopGuards". Feb 23 2023, 4:44 AM

uabelho added a subscriber: uabelho. Feb 23 2023, 11:17 PM

Why was this reverted?

In D141850#4152680, @caojoshua wrote:

Why was this reverted?

Failed on an assertion. Uploading a fix now.
(Also some buildbots failed on time-outs, but I'm not sure this is due to my change)

alonkom updated this revision to Diff 500525. Feb 26 2023, 1:38 AM

Harbormaster completed remote builds in B216021: Diff 500525. Feb 26 2023, 2:25 AM

Revision Contents

Path                                                              Size
llvm/lib/Analysis/ScalarEvolution.cpp                             214 lines
llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll    14 lines
llvm/unittests/Analysis/ScalarEvolutionTest.cpp                   38 lines

Diff 499764

llvm/lib/Analysis/ScalarEvolution.cpp

(15,028 lines not shown. Context: auto MatchRangeCheckIdiom = [this, Predicate, LHS, RHS, &RewriteMap,)
           getConstant(ExactRegion.getUnsignedMin()),
           getUMinExpr(RewrittenLHS, getConstant(ExactRegion.getUnsignedMax())));
       ExprsToRewrite.push_back(LHSUnknown);
       return true;
     };
     if (MatchRangeCheckIdiom())
       return;

+    // Return true if \p Expr is a MinMax SCEV expression with a constant
+    // operand. If so, return in \p SCTy the SCEV type and in \p RHS the
+    // non-constant operand and in \p LHS the constant operand.
+    auto IsMinMaxSCEVWithConstant = [&](const SCEV *Expr, SCEVTypes &SCTy,
+                                        const SCEV *&LHS, const SCEV *&RHS) {
+      if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr)) {
+        if (MinMax->getNumOperands() != 2)
+          return false;
+        SCTy = MinMax->getSCEVType();
+        if (!isa<SCEVConstant>(MinMax->getOperand(0)))
+          return false;
+        LHS = MinMax->getOperand(0);
+        RHS = MinMax->getOperand(1);
+        return true;
+      }
+      return false;
+    };

+    // Checks whether Expr is a non-negative constant, and Divisor is a positive
+    // constant, and returns their APInt in ExprVal and in DivisorVal.
+    auto GetNonNegExprAndPosDivisor = [&](const SCEV *Expr, const SCEV *Divisor,
+                                          APInt &ExprVal, APInt &DivisorVal) {
+      if (!isKnownNonNegative(Expr) || !isKnownPositive(Divisor))
+        return false;
+      auto *ConstExpr = dyn_cast<SCEVConstant>(Expr);
+      auto *ConstDivisor = dyn_cast<SCEVConstant>(Divisor);
+      if (!ConstExpr || !ConstDivisor)
+        return false;
+      ExprVal = ConstExpr->getAPInt();
+      DivisorVal = ConstDivisor->getAPInt();
+      return true;
+    };

+    // Return a new SCEV that modifies \p Expr to the closest number divides by
+    // \p Divisor and greater or equal than Expr.
+    // For now, only handle constant Expr and Divisor.
+    auto GetNextSCEVDividesByDivisor = [&](const SCEV *Expr,
+                                           const SCEV *Divisor) {
+      APInt ExprVal;
+      APInt DivisorVal;
+      if (!GetNonNegExprAndPosDivisor(Expr, Divisor, ExprVal, DivisorVal))
+        return Expr;
+      APInt Rem = ExprVal.urem(DivisorVal);
+      if (!Rem.isZero())
+        // return the SCEV: Expr + Divisor - Expr % Divisor
+        return getConstant(ExprVal + DivisorVal - Rem);
+      return Expr;
+    };

+    // Return a new SCEV that modifies \p Expr to the closest number divides by
+    // \p Divisor and less or equal than Expr.
+    // For now, only handle constant Expr and Divisor.
+    auto GetPreviousSCEVDividesByDivisor = [&](const SCEV *Expr,
+                                               const SCEV *Divisor) {
+      APInt ExprVal;
+      APInt DivisorVal;
+      if (!GetNonNegExprAndPosDivisor(Expr, Divisor, ExprVal, DivisorVal))
+        return Expr;
+      APInt Rem = ExprVal.urem(DivisorVal);
+      // return the SCEV: Expr - Expr % Divisor
+      return getConstant(ExprVal - Rem);
+    };

+    // Apply divisibilty by \p Divisor on MinMaxExpr with constant values,
+    // recursively. This is done by aligning up/down the constant value to the
+    // Divisor.
+    std::function<const SCEV *(const SCEV *, const SCEV *)>
+        ApplyDivisibiltyOnMinMaxExpr = [&](const SCEV *MinMaxExpr,
+                                           const SCEV *Divisor) {
+          const SCEV *MinMaxLHS = nullptr, *MinMaxRHS = nullptr;
+          SCEVTypes SCTy;
+          if (!IsMinMaxSCEVWithConstant(MinMaxExpr, SCTy, MinMaxLHS, MinMaxRHS))
+            return MinMaxExpr;
+          auto IsMin =
+              isa<SCEVSMinExpr>(MinMaxExpr) || isa<SCEVUMinExpr>(MinMaxExpr);
+          assert(isKnownNonNegative(MinMaxLHS) &&
+                 "Expected non-negative operand!");
+          auto *DivisibleExpr =
+              IsMin ? GetPreviousSCEVDividesByDivisor(MinMaxLHS, Divisor)
+                    : GetNextSCEVDividesByDivisor(MinMaxLHS, Divisor);
+          SmallVector<const SCEV *> Ops = {
+              ApplyDivisibiltyOnMinMaxExpr(MinMaxRHS, Divisor), DivisibleExpr};
+          return getMinMaxExpr(SCTy, Ops);
+        };

     // If we have LHS == 0, check if LHS is computing a property of some unknown
     // SCEV %v which we can rewrite %v to express explicitly.
     const SCEVConstant *RHSC = dyn_cast<SCEVConstant>(RHS);
     if (Predicate == CmpInst::ICMP_EQ && RHSC &&
         RHSC->getValue()->isNullValue()) {
       // If LHS is A % B, i.e. A % B == 0, rewrite A to (A /u B) * B to
       // explicitly express that.
       const SCEV *URemLHS = nullptr;
       const SCEV *URemRHS = nullptr;
       if (matchURem(LHS, URemLHS, URemRHS)) {
         if (const SCEVUnknown *LHSUnknown = dyn_cast<SCEVUnknown>(URemLHS)) {
-          const auto *Multiple = getMulExpr(getUDivExpr(URemLHS, URemRHS), URemRHS);
+          auto I = RewriteMap.find(LHSUnknown);
+          const SCEV *RewrittenLHS =
+              I != RewriteMap.end() ? I->second : LHSUnknown;
+          RewrittenLHS = ApplyDivisibiltyOnMinMaxExpr(RewrittenLHS, URemRHS);
+          const auto *Multiple =
+              getMulExpr(getUDivExpr(RewrittenLHS, URemRHS), URemRHS);
           RewriteMap[LHSUnknown] = Multiple;
           ExprsToRewrite.push_back(LHSUnknown);
           return;
         }
       }
     }

     // Do not apply information for constants or if RHS contains an AddRec.
     if (isa<SCEVConstant>(LHS) || containsAddRecurrence(RHS))
       return;

     // If RHS is SCEVUnknown, make sure the information is applied to it.
     if (!isa<SCEVUnknown>(LHS) && isa<SCEVUnknown>(RHS)) {
       std::swap(LHS, RHS);
       Predicate = CmpInst::getSwappedPredicate(Predicate);
     }

     // Check whether LHS has already been rewritten. In that case we want to
     // chain further rewrites onto the already rewritten value.
     auto I = RewriteMap.find(LHS);
     const SCEV *RewrittenLHS = I != RewriteMap.end() ? I->second : LHS;

		// Check for the SCEV expression (A /u B) * B while B is a constant, inside
		// \p Expr. The check is done recursively on \p Expr, which is assumed to
		// be a composition of Min/Max SCEVs. Return whether the SCEV expression (A
		caojoshua (inline comment, done): Typo: "wether". I'm not sure what `the divisor B in \p DividesBy` means. I think this paragraph needs to be more clear. What does it mean to be composed of Min/Max SCEVs?
		alonkom (author, done): Rephrased it a bit.
		// /u B) * B was found, and return the divisor B in \p DividesBy. For
		// example, if Expr = umin (umax ((A /u 8) * 8, 16), 64), return true since
		mkazantsev (inline comment, done): auto
		alonkom (author, done): Can't use auto since there's a recursive call here.
		mkazantsev (inline comment, not done): `auto HasDivisibiltyInfo = [&](const SCEV *Expr, const SCEV *&DividesBy) -> bool {...` ?
		alonkom (author, done): I'm still getting this build error: use of ‘HasDivisibiltyInfo’ before deduction of ‘auto’
		// (A /u 8) * 8 matched the pattern, and return the constant SCEV 8 in \p
		// DividesBy.
		std::function<bool(const SCEV *, const SCEV *&)> HasDivisibiltyInfo =
		[&](const SCEV *Expr, const SCEV *&DividesBy) {
		if (auto *Mul = dyn_cast<SCEVMulExpr>(Expr)) {
		if (Mul->getNumOperands() != 2)
		caojoshua (inline comment, done): Shouldn't swap the constant. Assume constant is on left.
		return false;
		auto *MulLHS = Mul->getOperand(0);
		auto *MulRHS = Mul->getOperand(1);
		if (isa<SCEVConstant>(MulLHS))
		std::swap(MulLHS, MulRHS);
		if (auto *Div = dyn_cast<SCEVUDivExpr>(MulLHS)) {
		if (Div->getOperand(1) == MulRHS) {
		DividesBy = MulRHS;
		mkazantsev (inline comment, done): `if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr))`
		return true;
		}
		}
		}
		if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr)) {
		return HasDivisibiltyInfo(MinMax->getOperand(0), DividesBy) ||
		HasDivisibiltyInfo(MinMax->getOperand(1), DividesBy);
		}
		return false;
		mkazantsev (inline comment, done): auto
		alonkom (author, done): Can't use auto, due to recursive call.
		};
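
The exchange about `auto` above comes down to a C++ rule: a variable declared with `auto` cannot be named inside its own initializer, so a lambda that calls itself through a by-reference capture needs a variable with a declared type such as `std::function`. The following is a minimal standalone sketch of that rule, not code from the patch; `Node`, `containsValue`, and `containsValueNoStdFunction` are hypothetical names, and the second function shows one common C++14 alternative (passing the lambda to itself) that keeps `auto`.

```cpp
#include <functional>

// Hypothetical tiny binary tree, standing in for a nested min/max expression.
struct Node {
  int Value;
  const Node *Left;
  const Node *Right;
};

// Recursion through std::function: the variable has a declared type, so the
// lambda body may refer to Search while Search is being initialized.
// Writing `auto Search = [&](const Node *N) { ... Search(...) ... };` instead
// is rejected, because Search would be used before its type is deduced.
inline bool containsValue(const Node *Root, int Wanted) {
  std::function<bool(const Node *)> Search = [&](const Node *N) -> bool {
    if (!N)
      return false;
    if (N->Value == Wanted)
      return true;
    return Search(N->Left) || Search(N->Right);
  };
  return Search(Root);
}

// Alternative that keeps `auto` (C++14 generic lambda): the lambda receives
// itself as its first argument, so it never names its own variable.
inline bool containsValueNoStdFunction(const Node *Root, int Wanted) {
  auto Search = [&](auto &&Self, const Node *N) -> bool {
    if (!N)
      return false;
    if (N->Value == Wanted)
      return true;
    return Self(Self, N->Left) || Self(Self, N->Right);
  };
  return Search(Search, Root);
}
```

Both functions return the same result; the `std::function` form is the one the patch uses.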

		// Return true if \p Expr is known to be divisible by \p DividesBy.
		std::function<bool(const SCEV *, const SCEV *&)> IsKnownToDivideBy =
		[&](const SCEV *Expr, const SCEV *DividesBy) {
		if (getURemExpr(Expr, DividesBy)->isZero())
		return true;
		if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr)) {
		mkazantsev (inline comment, done): `if (auto *MinMax = dyn_cast<SCEVMinMaxExpr>(Expr)) { ...`
		return IsKnownToDivideBy(MinMax->getOperand(0), DividesBy) &&
		IsKnownToDivideBy(MinMax->getOperand(1), DividesBy);
		}
		return false;
		};

		const SCEV *DividesBy = nullptr;
		if (HasDivisibiltyInfo(RewrittenLHS, DividesBy))
		// Check that the whole expression is divided by DividesBy
		DividesBy =
		IsKnownToDivideBy(RewrittenLHS, DividesBy) ? DividesBy : nullptr;

const SCEV *RewrittenRHS = nullptr;		const SCEV *RewrittenRHS = nullptr;
switch (Predicate) {		switch (Predicate) {
case CmpInst::ICMP_ULT: {		case CmpInst::ICMP_ULT: {
if (RHS->getType()->isPointerTy())		if (RHS->getType()->isPointerTy())
break;		break;
const SCEV *One = getOne(RHS->getType());		const SCEV *One = getOne(RHS->getType());
RewrittenRHS =		auto *ModifiedRHS = getMinusSCEV(getUMaxExpr(RHS, One), One);
getUMinExpr(RewrittenLHS, getMinusSCEV(getUMaxExpr(RHS, One), One));		ModifiedRHS =
		mkazantsev (inline comment, done): `auto *`
		DividesBy ? GetPreviousSCEVDividesByDivisor(ModifiedRHS, DividesBy)
		: ModifiedRHS;
		RewrittenRHS = getUMinExpr(RewrittenLHS, ModifiedRHS);
break;		break;
}		}
case CmpInst::ICMP_SLT:		case CmpInst::ICMP_SLT: {
RewrittenRHS =		auto *ModifiedRHS = getMinusSCEV(RHS, getOne(RHS->getType()));
getSMinExpr(RewrittenLHS, getMinusSCEV(RHS, getOne(RHS->getType())));		ModifiedRHS =
		mkazantsev (inline comment, not done): How do you account for `RHS` being `SINT_MIN` and similar cases here?
		alonkom (author, done): I wonder how this worked before my patch. If we have assume (N < SINT_MIN), then the following SCEV was generated: smin(N, SINT_MIN - 1), which would overflow.
		mkazantsev (inline comment, not done): Well, if there is a check somewhere that `RHS` is non-negative, then it makes sense. Otherwise it's possibly broken. :)
		alonkom (author, done): Anyway, the change I've made only applies to non-negative values.
		DividesBy ? GetPreviousSCEVDividesByDivisor(ModifiedRHS, DividesBy)
		: ModifiedRHS;
		RewrittenRHS = getSMinExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_ULE:		}
RewrittenRHS = getUMinExpr(RewrittenLHS, RHS);		case CmpInst::ICMP_ULE: {
		auto *ModifiedRHS =
		DividesBy ? GetPreviousSCEVDividesByDivisor(RHS, DividesBy) : RHS;
		RewrittenRHS = getUMinExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_SLE:		}
RewrittenRHS = getSMinExpr(RewrittenLHS, RHS);		case CmpInst::ICMP_SLE: {
		auto *ModifiedRHS =
		DividesBy ? GetPreviousSCEVDividesByDivisor(RHS, DividesBy) : RHS;
		RewrittenRHS = getSMinExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_UGT:		}
RewrittenRHS =		case CmpInst::ICMP_UGT: {
getUMaxExpr(RewrittenLHS, getAddExpr(RHS, getOne(RHS->getType())));		auto *ModifiedRHS = getAddExpr(RHS, getOne(RHS->getType()));
		ModifiedRHS = DividesBy
		? GetNextSCEVDividesByDivisor(ModifiedRHS, DividesBy)
		: ModifiedRHS;
		RewrittenRHS = getUMaxExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_SGT:		}
RewrittenRHS =		case CmpInst::ICMP_SGT: {
getSMaxExpr(RewrittenLHS, getAddExpr(RHS, getOne(RHS->getType())));		auto *ModifiedRHS = getAddExpr(RHS, getOne(RHS->getType()));
		ModifiedRHS = DividesBy
		? GetNextSCEVDividesByDivisor(ModifiedRHS, DividesBy)
		: ModifiedRHS;
		RewrittenRHS = getSMaxExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_UGE:		}
RewrittenRHS = getUMaxExpr(RewrittenLHS, RHS);		case CmpInst::ICMP_UGE: {
		auto *ModifiedRHS =
		DividesBy ? GetNextSCEVDividesByDivisor(RHS, DividesBy) : RHS;
		RewrittenRHS = getUMaxExpr(RewrittenLHS, ModifiedRHS);
break;		break;
case CmpInst::ICMP_SGE:		}
RewrittenRHS = getSMaxExpr(RewrittenLHS, RHS);		case CmpInst::ICMP_SGE: {
		auto *ModifiedRHS =
		DividesBy ? GetNextSCEVDividesByDivisor(RHS, DividesBy) : RHS;
		RewrittenRHS = getSMaxExpr(RewrittenLHS, ModifiedRHS);
break;		break;
		}
case CmpInst::ICMP_EQ:		case CmpInst::ICMP_EQ:
if (isa<SCEVConstant>(RHS))		if (isa<SCEVConstant>(RHS))
RewrittenRHS = RHS;		RewrittenRHS = RHS;
break;		break;
case CmpInst::ICMP_NE:		case CmpInst::ICMP_NE:
if (isa<SCEVConstant>(RHS) &&		if (isa<SCEVConstant>(RHS) &&
cast<SCEVConstant>(RHS)->getValue()->isNullValue())		cast<SCEVConstant>(RHS)->getValue()->isNullValue()) {
RewrittenRHS = getUMaxExpr(RewrittenLHS, getOne(RHS->getType()));		auto *ModifiedRHS = getOne(RHS->getType());
		ModifiedRHS = DividesBy
		? GetNextSCEVDividesByDivisor(ModifiedRHS, DividesBy)
		: ModifiedRHS;
		RewrittenRHS = getUMaxExpr(RewrittenLHS, ModifiedRHS);
		}
break;		break;
default:		default:
break;		break;
}		}

if (RewrittenRHS) {		if (RewrittenRHS) {
RewriteMap[LHS] = RewrittenRHS;		RewriteMap[LHS] = RewrittenRHS;
if (LHS == RewrittenLHS)		if (LHS == RewrittenLHS)
(93 lines not shown)
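
The switch above calls `GetPreviousSCEVDividesByDivisor` and `GetNextSCEVDividesByDivisor`, whose definitions do not appear in the portion of the diff shown here. Going by the revision summary (1 is aligned up to 8, 99 is aligned down to 96), their effect on constant bounds can be sketched on plain unsigned integers. This is only an illustration under that assumption: `alignDownToMultiple` and `alignUpToMultiple` are hypothetical names, not LLVM APIs, and overflow is not handled.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for GetPreviousSCEVDividesByDivisor on a constant:
// the largest multiple of Divisor that is <= Bound.
inline uint64_t alignDownToMultiple(uint64_t Bound, uint64_t Divisor) {
  assert(Divisor != 0 && "divisor must be non-zero");
  return Bound - (Bound % Divisor);
}

// Hypothetical stand-in for GetNextSCEVDividesByDivisor on a constant:
// the smallest multiple of Divisor that is >= Bound.
// Assumes Bound + Divisor does not overflow.
inline uint64_t alignUpToMultiple(uint64_t Bound, uint64_t Divisor) {
  assert(Divisor != 0 && "divisor must be non-zero");
  uint64_t Rem = Bound % Divisor;
  return Rem == 0 ? Bound : Bound + (Divisor - Rem);
}
```

With a divisor of 8, `TC < 100` yields the upper bound 99, which aligns down to 96, and `TC > 0` yields the lower bound 1, which aligns up to 8. With a divisor of 4, `%num ugt 5` yields the lower bound 6, which aligns up to 8.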

llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll

(119 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_ugt_5_order_swapped		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_ugt_5_order_swapped
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 2		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp.1 = icmp ugt i32 %num, 5		%cmp.1 = icmp ugt i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
br label %for.body		br label %for.body
(54 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_sgt_5_order_swapped		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_sgt_5_order_swapped
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is 2147483646		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is 2147483646
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 2		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp.1 = icmp sgt i32 %num, 5		%cmp.1 = icmp sgt i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
br label %for.body		br label %for.body
(54 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_uge_5_order_swapped		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_uge_5_order_swapped
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 1		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
%cmp.1 = icmp uge i32 %num, 5		%cmp.1 = icmp uge i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
br label %for.body		br label %for.body
(54 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_sge_5_order_swapped		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_sge_5_order_swapped
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is 2147483646		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is 2147483646
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 1		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
%cmp.1 = icmp sge i32 %num, 5		%cmp.1 = icmp sge i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
br label %for.body		br label %for.body
(39 lines not shown)
for.body:		for.body:
%inc = add nuw nsw i32 %i.010, 1		%inc = add nuw nsw i32 %i.010, 1
%cmp2 = icmp ult i32 %inc, %num		%cmp2 = icmp ult i32 %inc, %num
br i1 %cmp2, label %for.body, label %exit		br i1 %cmp2, label %for.body, label %exit

exit:		exit:
ret void		ret void
}		}

define void @test_trip_multiple_4_upper_lower_bounds(i32 %num) {		define void @test_trip_multiple_4_upper_lower_bounds(i32 %num) {
		mkazantsev (inline comment, done): Please precommit tests as is, I want to see what exactly this patch changes.
		alonkom (author, done): https://reviews.llvm.org/D143337
		mkazantsev (inline comment, done): Rebase patch on top of it?
; CHECK-LABEL: 'test_trip_multiple_4_upper_lower_bounds'		; CHECK-LABEL: 'test_trip_multiple_4_upper_lower_bounds'
; CHECK-NEXT: Classifying expressions for: @test_trip_multiple_4_upper_lower_bounds		; CHECK-NEXT: Classifying expressions for: @test_trip_multiple_4_upper_lower_bounds
; CHECK-NEXT: %u = urem i32 %num, 4		; CHECK-NEXT: %u = urem i32 %num, 4
; CHECK-NEXT: --> (zext i2 (trunc i32 %num to i2) to i32) U: [0,4) S: [0,4)		; CHECK-NEXT: --> (zext i2 (trunc i32 %num to i2) to i32) U: [0,4) S: [0,4)
; CHECK-NEXT: %i.010 = phi i32 [ 0, %entry ], [ %inc, %for.body ]		; CHECK-NEXT: %i.010 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
; CHECK-NEXT: --> {0,+,1}<nuw><nsw><%for.body> U: [0,-2147483648) S: [0,-2147483648) Exits: (-1 + %num) LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {0,+,1}<nuw><nsw><%for.body> U: [0,-2147483648) S: [0,-2147483648) Exits: (-1 + %num) LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 1		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%cmp.1 = icmp uge i32 %num, 5		%cmp.1 = icmp uge i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
%cmp.2 = icmp ult i32 %num, 59000		%cmp.2 = icmp ult i32 %num, 59000
(20 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds_swapped1		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds_swapped1
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 1		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%cmp.1 = icmp uge i32 %num, 5		%cmp.1 = icmp uge i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
%cmp.2 = icmp ult i32 %num, 59000		%cmp.2 = icmp ult i32 %num, 59000
(20 lines not shown)
; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1		; CHECK-NEXT: %inc = add nuw nsw i32 %i.010, 1
; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }		; CHECK-NEXT: --> {1,+,1}<nuw><nsw><%for.body> U: [1,-2147483648) S: [1,-2147483648) Exits: %num LoopDispositions: { %for.body: Computable }
; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds_swapped2		; CHECK-NEXT: Determining loop execution counts for: @test_trip_multiple_4_upper_lower_bounds_swapped2
; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2		; CHECK-NEXT: Loop %for.body: constant max backedge-taken count is -2
; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: symbolic max backedge-taken count is (-1 + %num)
; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)		; CHECK-NEXT: Loop %for.body: Predicated backedge-taken count is (-1 + %num)
; CHECK-NEXT: Predicates:		; CHECK-NEXT: Predicates:
; CHECK: Loop %for.body: Trip multiple is 1		; CHECK: Loop %for.body: Trip multiple is 4
;		;
entry:		entry:
%cmp.1 = icmp uge i32 %num, 5		%cmp.1 = icmp uge i32 %num, 5
tail call void @llvm.assume(i1 %cmp.1)		tail call void @llvm.assume(i1 %cmp.1)
%cmp.2 = icmp ult i32 %num, 59000		%cmp.2 = icmp ult i32 %num, 59000
tail call void @llvm.assume(i1 %cmp.2)		tail call void @llvm.assume(i1 %cmp.2)
%u = urem i32 %num, 4		%u = urem i32 %num, 4
%cmp = icmp eq i32 %u, 0		%cmp = icmp eq i32 %u, 0
(117 lines not shown)
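
The CHECK changes above (a trip multiple of 2 or 1 becoming 4) are consistent with a simplified model in which the value `umax(4 * k, Bound)` is only guaranteed to be a multiple of `gcd(4, Bound)`. The sketch below encodes that model on plain integers. It is an assumption made for illustration, not SCEV's actual trip-multiple computation, and `gcdU64` and `guaranteedMultipleOfUMax` are hypothetical names.

```cpp
#include <cstdint>

// Euclid's algorithm on unsigned 64-bit integers.
inline uint64_t gcdU64(uint64_t A, uint64_t B) {
  while (B != 0) {
    uint64_t T = A % B;
    A = B;
    B = T;
  }
  return A;
}

// Simplified model (an assumption, not SCEV's implementation): the result of
// umax(Multiple * k, LowerBound) may be either operand, so it is only known
// to be a multiple of gcd(Multiple, LowerBound).
inline uint64_t guaranteedMultipleOfUMax(uint64_t Multiple,
                                         uint64_t LowerBound) {
  return gcdU64(Multiple, LowerBound);
}
```

Under this model, `%num ugt 5` gives the lower bound 6 and a guaranteed multiple of gcd(4, 6) = 2, while `%num uge 5` gives the lower bound 5 and gcd(4, 5) = 1, matching the old CHECK lines. Once the bound is aligned up to 8, gcd(4, 8) = 4, matching the new ones.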

llvm/unittests/Analysis/ScalarEvolutionTest.cpp

(1,738 lines not shown)
runWithSE(*M, "foo", [](Function &F, LoopInfo &LI, ScalarEvolution &SE) {		runWithSE(*M, "foo", [](Function &F, LoopInfo &LI, ScalarEvolution &SE) {
auto *ScevIV = SE.getSCEV(getInstructionByName(F, "iv"));		auto *ScevIV = SE.getSCEV(getInstructionByName(F, "iv"));
const Loop *L = cast<SCEVAddRecExpr>(ScevIV)->getLoop();		const Loop *L = cast<SCEVAddRecExpr>(ScevIV)->getLoop();

const SCEV *ITC = SE.getConstantMaxTripCountFromArray(L);		const SCEV *ITC = SE.getConstantMaxTripCountFromArray(L);
EXPECT_TRUE(isa<SCEVCouldNotCompute>(ITC));		EXPECT_TRUE(isa<SCEVCouldNotCompute>(ITC));
});		});
}		}

		TEST_F(ScalarEvolutionsTest, ApplyLoopGuards) {
		LLVMContext C;
		SMDiagnostic Err;
		std::unique_ptr<Module> M = parseAssemblyString(
		"declare void @llvm.assume(i1)\n"
		"define void @test(i32 %num) {\n"
		"entry:\n"
		" %u = urem i32 %num, 4\n"
		" %cmp = icmp eq i32 %u, 0\n"
		" tail call void @llvm.assume(i1 %cmp)\n"
		" %cmp.1 = icmp ugt i32 %num, 0\n"
		" tail call void @llvm.assume(i1 %cmp.1)\n"
		" br label %for.body\n"
		"for.body:\n"
		" %i.010 = phi i32 [ 0, %entry ], [ %inc, %for.body ]\n"
		" %inc = add nuw nsw i32 %i.010, 1\n"
		" %cmp2 = icmp ult i32 %inc, %num\n"
		" br i1 %cmp2, label %for.body, label %exit\n"
		"exit:\n"
		" ret void\n"
		"}\n",
		Err, C);

		ASSERT_TRUE(M && "Could not parse module?");
		ASSERT_TRUE(!verifyModule(*M) && "Must have been well formed!");

		runWithSE(*M, "test", [](Function &F, LoopInfo &LI, ScalarEvolution &SE) {
		auto *TCScev = SE.getSCEV(getArgByName(F, "num"));
		auto ApplyLoopGuardsTC = SE.applyLoopGuards(TCScev, LI.begin());
		// Assert that the new TC is (4 * ((4 umax %num) /u 4))
		APInt Four(32, 4);
		auto *Constant4 = SE.getConstant(Four);
		auto *Max = SE.getUMaxExpr(TCScev, Constant4);
		auto *Mul = SE.getMulExpr(SE.getUDivExpr(Max, Constant4), Constant4);
		ASSERT_TRUE(Mul == ApplyLoopGuardsTC);
		});
		}

} // end namespace llvm		} // end namespace llvm
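
The unit test above expects `applyLoopGuards` to return `(4 * ((4 umax %num) /u 4))`. As a sanity check on that expression, the sketch below evaluates it on concrete 32-bit values and confirms two properties: it equals `%num` whenever both assumes hold (`%num % 4 == 0` and `%num > 0`), and it is always a multiple of 4 that is at least 4. The function names are hypothetical, and this is a brute-force illustration on plain integers rather than anything SCEV itself does.

```cpp
#include <algorithm>
#include <cstdint>

// Evaluates (4 * ((4 umax Num) /u 4)) on a concrete 32-bit value.
inline uint32_t rewrittenTripCount(uint32_t Num) {
  return 4u * (std::max<uint32_t>(Num, 4u) / 4u);
}

// Checks every Num in [0, Limit]. Limit must be below UINT32_MAX so that the
// loop counter does not wrap around.
inline bool rewriteHoldsUpTo(uint32_t Limit) {
  for (uint32_t Num = 0; Num <= Limit; ++Num) {
    uint32_t Rewritten = rewrittenTripCount(Num);
    // The rewrite must not change a value that satisfies both assumes.
    bool SatisfiesAssumes = (Num % 4u == 0) && (Num > 0);
    if (SatisfiesAssumes && Rewritten != Num)
      return false;
    // For any input, the result keeps the divisibility and lower-bound facts.
    if (Rewritten % 4u != 0 || Rewritten < 4u)
      return false;
  }
  return true;
}
```

For example, `rewrittenTripCount(12)` is 12, while inputs that violate the assumes, such as 0 or 7, both map to 4.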

Revision Contents

Diff 499764

llvm/lib/Analysis/ScalarEvolution.cpp

llvm/test/Analysis/ScalarEvolution/trip-multiple-guard-info.ll

llvm/unittests/Analysis/ScalarEvolutionTest.cpp